BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012283
         (467 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255543315|ref|XP_002512720.1| conserved hypothetical protein [Ricinus communis]
 gi|223547731|gb|EEF49223.1| conserved hypothetical protein [Ricinus communis]
          Length = 466

 Score =  737 bits (1903), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/469 (79%), Positives = 412/469 (87%), Gaps = 5/469 (1%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNP--PDNHSLLITTKRLSFQTHAKKK 58
           MA+TSLLPK T PFL+NP SSL STHF KPS   NP  PD+HS        +   HA KK
Sbjct: 1   MASTSLLPKIT-PFLTNP-SSLSSTHFTKPS-FLNPLSPDHHSACARKPTTTTVLHANKK 57

Query: 59  NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
           NPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPD+PNNPYGFLKFP G++ EIASLPLKI
Sbjct: 58  NPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDNPNNPYGFLKFPKGYSVEIASLPLKI 117

Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
           RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRWAN 
Sbjct: 118 RGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVLASPRGKQTYELNKNVRWAND 177

Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
           YD DDDWPEPAEY D+LG++KNRYYDMVLSTKLAGLGHA+FLFMT+ARDRVSYI+PNVNA
Sbjct: 178 YDPDDDWPEPAEYVDMLGILKNRYYDMVLSTKLAGLGHASFLFMTSARDRVSYIHPNVNA 237

Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AGAGLLLSETFT +  NL+E GY+MY QM+DWLGRPFRSVPRH VPPLRVSISR+LKE V
Sbjct: 238 AGAGLLLSETFTPDITNLAEGGYHMYHQMLDWLGRPFRSVPRHTVPPLRVSISRKLKEFV 297

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
             KY+ AGAE+GKYIVIHGIESDSKASMQSRGDTDSLLP+QVWAEIA  +R F+P+FVIP
Sbjct: 298 EAKYRAAGAEKGKYIVIHGIESDSKASMQSRGDTDSLLPLQVWAEIAKDIRAFKPVFVIP 357

Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           HEKERE VE++VGDD SIV ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 
Sbjct: 358 HEKERENVEEIVGDDTSIVMITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFG 417

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
           S  KGRLF+PNAEEKKC+++SS+TGKL D  V  V  AMQI + SLALA
Sbjct: 418 SIEKGRLFIPNAEEKKCSIVSSKTGKLKDIDVGDVKQAMQILDLSLALA 466


>gi|341833960|gb|AEK94315.1| hypothetical protein [Pyrus x bretschneideri]
          Length = 463

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/464 (74%), Positives = 395/464 (85%), Gaps = 5/464 (1%)

Query: 3   TTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWL 62
           TTSLLPKS  PFL+N   S PST F K S  FNP D+      TKR + QT+AKKKNPWL
Sbjct: 4   TTSLLPKSFPPFLTN-APSFPSTLFTKLSS-FNPQDHPH---CTKRPNLQTNAKKKNPWL 58

Query: 63  DPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDV 122
           DPFDDGEDPDM+YGSLFADGKQDED RPP++P+NPYGFLKFPMG++ E+ASL LK+RGDV
Sbjct: 59  DPFDDGEDPDMEYGSLFADGKQDEDDRPPENPDNPYGFLKFPMGYSVEVASLALKVRGDV 118

Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD 182
           RRCCC++SGGVYENLLFFP IQLLKDRYPGV +D +ASARGKQ +ELNKNVRWA+VYD D
Sbjct: 119 RRCCCVVSGGVYENLLFFPVIQLLKDRYPGVQVDAVASARGKQAYELNKNVRWADVYDPD 178

Query: 183 DDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
           D +PEPAEYTD++G++K+RYYDM+LSTKLAGLGH  FLFMTTAR+RVSYIYPNVNAAGAG
Sbjct: 179 DHFPEPAEYTDMVGLLKSRYYDMILSTKLAGLGHGIFLFMTTARERVSYIYPNVNAAGAG 238

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
           L LSETF    MNL+E GYNMY QMVDWLGRP R VPR  +PPL++SIS++LK+ V  KY
Sbjct: 239 LFLSETFRPNGMNLAEGGYNMYHQMVDWLGRPVRDVPRQALPPLKISISKKLKQTVESKY 298

Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKE 362
           K AG ++GKYIVIHG+ESDSKASMQS+GD+DSLLPI+ WA I  G+REFRP+FVIPHEKE
Sbjct: 299 KKAGVDKGKYIVIHGLESDSKASMQSKGDSDSLLPIETWAAIVRGIREFRPVFVIPHEKE 358

Query: 363 REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELK 422
           RE VE++VG+D SIVFITTPGQLAALINDSAGVI TNTAAIQLANAR+KP IALF S  K
Sbjct: 359 RENVEEIVGEDTSIVFITTPGQLAALINDSAGVITTNTAAIQLANARQKPCIALFGSAEK 418

Query: 423 GRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
           G+LFVPNAEEK C ++SS+T  L D   EAV NA+ IFN SLAL
Sbjct: 419 GKLFVPNAEEKNCVIVSSKTKMLKDIDTEAVKNALPIFNVSLAL 462


>gi|357467927|ref|XP_003604248.1| hypothetical protein MTR_4g007080 [Medicago truncatula]
 gi|355505303|gb|AES86445.1| hypothetical protein MTR_4g007080 [Medicago truncatula]
          Length = 472

 Score =  709 bits (1831), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/463 (73%), Positives = 392/463 (84%), Gaps = 13/463 (2%)

Query: 15  LSNPLSSLPSTHFIKPSPLFNPPDNHSLLIT----------TKRLSFQTHAKKKNPWLDP 64
           +S P S   +  F  PS     P N SLL +          T+RL+ + +AKK NPWLDP
Sbjct: 13  ISKPFSHFTTNTFSIPSTY---PTNVSLLTSLDHPSHCYNSTRRLNLRVNAKKNNPWLDP 69

Query: 65  FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
           FDDGEDP+++YGSLF+DGKQDEDPRPPD+P NPYGFLKFP G+  EIA L LK+RGDVRR
Sbjct: 70  FDDGEDPNIEYGSLFSDGKQDEDPRPPDNPKNPYGFLKFPAGYMVEIAPLGLKVRGDVRR 129

Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
           CCC++SGGVYENLLFFP IQL+KDRYPGV +DV+ S RGKQ +ELNKNVRWANV+D DD+
Sbjct: 130 CCCVVSGGVYENLLFFPMIQLIKDRYPGVQVDVVGSERGKQCYELNKNVRWANVFDPDDE 189

Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
           +PEPAEYTD +GV+K+RYYDMVLSTKLAG+GHAA+LFMTTARDRVSY+YPNVN AGAGL 
Sbjct: 190 FPEPAEYTDFVGVLKSRYYDMVLSTKLAGVGHAAYLFMTTARDRVSYVYPNVNGAGAGLF 249

Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
           LSETFT +S NLSE GY+MY QM DWLGRPFRSVPR  VPPL++S+SR+LKEVV EKY  
Sbjct: 250 LSETFTPDSTNLSEGGYHMYHQMEDWLGRPFRSVPRQVVPPLKISLSRKLKEVVEEKYTK 309

Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
           AG ++G+YIVIHGI+SDSKA+MQSRGD DSLLP++VWAEIA+ +REF PLFVIPHEKERE
Sbjct: 310 AGVKKGRYIVIHGIQSDSKATMQSRGDPDSLLPLEVWAEIADAVREFTPLFVIPHEKERE 369

Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
            VE++VG+D+SI+FITTPGQLAALINDSAGVIATNTAAIQLANAREKPS+ALFSS+ KG 
Sbjct: 370 NVEEIVGEDSSIIFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSVALFSSKEKGN 429

Query: 425 LFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
            FVP AEEKKC VISS+TGKLI+  VEAV NA+Q FN SLA A
Sbjct: 430 KFVPRAEEKKCIVISSKTGKLINIDVEAVTNAIQTFNVSLAFA 472


>gi|224115860|ref|XP_002332075.1| predicted protein [Populus trichocarpa]
 gi|222831961|gb|EEE70438.1| predicted protein [Populus trichocarpa]
          Length = 465

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/468 (73%), Positives = 397/468 (84%), Gaps = 4/468 (0%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNP 60
           MA+TSLLPK+ SPFL+NP   LPSTHF      FNP  +H L  T K      +AKKKNP
Sbjct: 1   MASTSLLPKTISPFLTNP-PPLPSTHFTSKPSFFNPSADHHLPCTRKPSILTPYAKKKNP 59

Query: 61  WLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRG 120
           W+D FDDGED DM+YGSLF DGKQDED RP D+PNNPYGFLKFP G+N E+A L LKIRG
Sbjct: 60  WIDIFDDGEDLDMEYGSLFVDGKQDEDTRPVDNPNNPYGFLKFPKGYNVEVAQLGLKIRG 119

Query: 121 DVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYD 180
           DVRRCCC+ISGGVYENLLFFP IQ+LKDRYPG+L+DV+AS RGKQ +ELNKNVRWANVYD
Sbjct: 120 DVRRCCCMISGGVYENLLFFPVIQMLKDRYPGILVDVLASDRGKQCYELNKNVRWANVYD 179

Query: 181 LDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
            D D PEPA YTD++GV+KNRYYD++LSTKLAGLGHA+F+FM++ARD+VSYIYPNVNAAG
Sbjct: 180 PDGD-PEPAIYTDMIGVLKNRYYDLILSTKLAGLGHASFMFMSSARDKVSYIYPNVNAAG 238

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           AGLLL+ETFT +S NLSE GY+MY QM+DWLGRP  +VPR PVPPLRVS+SR+LK+ V  
Sbjct: 239 AGLLLTETFTPDSANLSEGGYHMYHQMLDWLGRPIYNVPRQPVPPLRVSLSRKLKQYVEA 298

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHE 360
           KY+ AGAE+GKYIVIHGIESDSKASMQSRGDTDSLLP++VW +IA+ +  F+P+FVIPHE
Sbjct: 299 KYRAAGAEKGKYIVIHGIESDSKASMQSRGDTDSLLPLEVWDQIADAISGFKPVFVIPHE 358

Query: 361 KEREGVEDVV-GDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSS 419
           KERE VE+++  +D  I+FITTPGQ A LINDSAGVIATNTAAIQLANAREKPSIALF S
Sbjct: 359 KERENVEEIIYNEDIGILFITTPGQ-ATLINDSAGVIATNTAAIQLANAREKPSIALFGS 417

Query: 420 ELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
           E KG++FVPNAEEKKC ++SS+TGKL D  V AV  AMQIF+ SLAL 
Sbjct: 418 EEKGKVFVPNAEEKKCIIVSSKTGKLKDIDVGAVKQAMQIFDMSLALV 465


>gi|363808214|ref|NP_001242232.1| uncharacterized protein LOC100802136 [Glycine max]
 gi|255635425|gb|ACU18065.1| unknown [Glycine max]
          Length = 471

 Score =  694 bits (1792), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/471 (75%), Positives = 399/471 (84%), Gaps = 6/471 (1%)

Query: 1   MATTSLL----PKSTSPFLSNPLSSLPSTHFIKP-SPLFNPPDNHSLLITTKRLSFQTHA 55
           MA T  L    PK+ SPFL NP  S+PS H +     L + P + S       +    +A
Sbjct: 1   MAATYYLLPTSPKTFSPFLRNP-PSIPSGHHVSLLGSLLDYPFHCSSRRGNNSVEANCNA 59

Query: 56  KKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLP 115
           KKKNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPDDP+NPYGFLKFP G++ EIASL 
Sbjct: 60  KKKNPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDDPDNPYGFLKFPSGYSVEIASLA 119

Query: 116 LKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW 175
           LK+RGDV RCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRW
Sbjct: 120 LKVRGDVGRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVVASERGKQTYELNKNVRW 179

Query: 176 ANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 235
           AN YD DD++PEPAEYTD++GV+KNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN
Sbjct: 180 ANAYDPDDEFPEPAEYTDMVGVLKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 239

Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
           VNAAGAGLLLSETF  +S NLS+ GYNMY QMVDWLG+PFR VPR PVPPLRVSIS++LK
Sbjct: 240 VNAAGAGLLLSETFVPDSQNLSDGGYNMYHQMVDWLGKPFREVPRQPVPPLRVSISKKLK 299

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
           EVV  KY+ AGA++GKY+VIHGI+SDSKASMQSRGD DSLLPI+VWAEIA+ +R+  PLF
Sbjct: 300 EVVEAKYEKAGAKKGKYVVIHGIKSDSKASMQSRGDPDSLLPIEVWAEIADVIRDVTPLF 359

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
           VIPHEKERE VE++  +DASIVFITTPGQLAALINDSAGVIATNT A+QLANAREKP IA
Sbjct: 360 VIPHEKERENVEEIFSEDASIVFITTPGQLAALINDSAGVIATNTTAVQLANAREKPCIA 419

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
           LF SE KG  FVP AEEKKC +ISS+TGKLID  VEAV NA+Q FN S AL
Sbjct: 420 LFCSEEKGNKFVPRAEEKKCIIISSKTGKLIDIDVEAVKNAVQTFNLSPAL 470


>gi|356523261|ref|XP_003530259.1| PREDICTED: uncharacterized protein LOC100803710 [Glycine max]
          Length = 467

 Score =  694 bits (1790), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/469 (75%), Positives = 397/469 (84%), Gaps = 6/469 (1%)

Query: 1   MATTSLL---PKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKK 57
           MA T  L   PK+ SP L NPLS +PS H      L  P   H        +    +AKK
Sbjct: 1   MAATYYLLPSPKTFSPLLRNPLS-IPSGHVSLLGSLDYP--FHCSSRRGNSVEANCNAKK 57

Query: 58  KNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
           KNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPDDP+NPYGFLKFP G+  EIASL LK
Sbjct: 58  KNPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDDPDNPYGFLKFPSGYAVEIASLALK 117

Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
           +RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRWAN
Sbjct: 118 VRGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVVASERGKQTYELNKNVRWAN 177

Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
            YD DD++PEPAEYTD++GV+KNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN
Sbjct: 178 AYDPDDEFPEPAEYTDMVGVLKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237

Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
           AAGAGLLLS+TF  +S+NLS+ GYNMY QMVDWLGRPFR VPR PVPPLRVSISR+LKEV
Sbjct: 238 AAGAGLLLSDTFVPDSLNLSDGGYNMYHQMVDWLGRPFREVPRQPVPPLRVSISRKLKEV 297

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
           V  KY+ AGA++GKY+VIHGI+SDSKASMQSRGD DSLLPI+VWAEIA+ +R+  PLFVI
Sbjct: 298 VEAKYEKAGAKKGKYVVIHGIKSDSKASMQSRGDPDSLLPIEVWAEIADAIRDVTPLFVI 357

Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           PHEKERE VE++  +DA +VFITTPGQLAALINDSAGVIATNTAA+QLANAR+KP IALF
Sbjct: 358 PHEKERENVEEIFSEDAFVVFITTPGQLAALINDSAGVIATNTAAVQLANARQKPCIALF 417

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
            SE KG  FVP A+EKKC +ISS+TGKLID  VEAV NA+Q FN S AL
Sbjct: 418 CSEEKGNKFVPQAKEKKCIIISSKTGKLIDIDVEAVKNAVQTFNLSPAL 466


>gi|359483160|ref|XP_002264393.2| PREDICTED: uncharacterized protein LOC100241596 [Vitis vinifera]
 gi|298204672|emb|CBI25170.3| unnamed protein product [Vitis vinifera]
          Length = 466

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/475 (75%), Positives = 404/475 (85%), Gaps = 17/475 (3%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLP-----STHFIKPSPLFNPPDNHSLLITTK---RLSFQ 52
           MAT   LP S SPFL+NP SSLP     S+ F  P PL  PP       TTK   R+   
Sbjct: 1   MATPYFLPNSISPFLTNPSSSLPIHFTKSSFFTSPQPL--PP-------TTKPGCRIQLN 51

Query: 53  THAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIA 112
              KKKNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPD+P+NPYGFLKFPMG++ EIA
Sbjct: 52  ATPKKKNPWLDPFDDGEDPNMEYGSLFADGKQEEDPRPPDNPDNPYGFLKFPMGYSVEIA 111

Query: 113 SLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKN 172
           SL LK+RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV +D++AS RGKQT+ELNKN
Sbjct: 112 SLALKVRGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQVDIVASPRGKQTYELNKN 171

Query: 173 VRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYI 232
           VRWAN YD D D+PEPAEYTD++G++K RYYDM+LSTKLAGLGHAAFLFM+TARDRVSYI
Sbjct: 172 VRWANAYDPDVDFPEPAEYTDMIGILKARYYDMILSTKLAGLGHAAFLFMSTARDRVSYI 231

Query: 233 YPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISR 292
           YPNVNAAGAGLLLS+TFT + MNLSE GYNMY QMV+WLGRP R+VPRH VPPL+VSISR
Sbjct: 232 YPNVNAAGAGLLLSQTFTPDGMNLSEAGYNMYHQMVEWLGRPARNVPRHHVPPLKVSISR 291

Query: 293 RLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR 352
           +LKEVV  KYK AGAE+GKYIVIHGIE DSKASMQS+GDTDSLLPIQ WAEIA+ +R  R
Sbjct: 292 KLKEVVEGKYKAAGAEKGKYIVIHGIECDSKASMQSKGDTDSLLPIQKWAEIADTIRGIR 351

Query: 353 PLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKP 412
           P+FVIPHEK RE VE+ VG+DASIVFITTPGQLAALINDS GVI TNTAA+QLA AREKP
Sbjct: 352 PIFVIPHEKVREEVEEEVGEDASIVFITTPGQLAALINDSLGVITTNTAAVQLATAREKP 411

Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
            IALFSS+ KG LFVPNA+EKKC ++SS+TGKL+D  VEAV  A +IF+ SLALA
Sbjct: 412 CIALFSSQQKGELFVPNAQEKKCLIVSSKTGKLVDINVEAVKKATEIFDVSLALA 466


>gi|388501644|gb|AFK38888.1| unknown [Medicago truncatula]
          Length = 460

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/463 (71%), Positives = 381/463 (82%), Gaps = 25/463 (5%)

Query: 15  LSNPLSSLPSTHFIKPSPLFNPPDNHSLLIT----------TKRLSFQTHAKKKNPWLDP 64
           +S P S   +  F  PS     P N SLL +          T+RL+ + +AKK NPWLDP
Sbjct: 13  ISKPFSHFTTNTFSIPSTY---PTNVSLLTSLDHPSHCYNSTRRLNLRVNAKKNNPWLDP 69

Query: 65  FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
           FDDGEDP+++YGSLF+DGKQDEDPRPPD+P NPYGFLKFP G+  EIA L LK+RG    
Sbjct: 70  FDDGEDPNIEYGSLFSDGKQDEDPRPPDNPKNPYGFLKFPAGYMVEIAPLGLKVRG---- 125

Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
                   VYENLLFFP IQL+KDRYPGV +DV+ S RGKQ +ELNKNVRWANV+D DD+
Sbjct: 126 --------VYENLLFFPMIQLIKDRYPGVQVDVVGSERGKQCYELNKNVRWANVFDPDDE 177

Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
           +PEPAEYTD +GV+K+RYYDMVLSTKLAG+GHAA+LFMTTARDRVSY+YPNVN AGAGL 
Sbjct: 178 FPEPAEYTDFVGVLKSRYYDMVLSTKLAGVGHAAYLFMTTARDRVSYVYPNVNGAGAGLF 237

Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
           LSETFT +S NLSE GY+MY QM DWLGRPFRSVPR  VPPL++S+SR+LKEVV EKY  
Sbjct: 238 LSETFTPDSTNLSEGGYHMYHQMEDWLGRPFRSVPRQVVPPLKISLSRKLKEVVEEKYTK 297

Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
           AG ++G+YIVIHGI+SDSKA+MQSRGD DSLLP++VWAEIA+ +REF PLFVIPHEKERE
Sbjct: 298 AGVKKGRYIVIHGIQSDSKATMQSRGDPDSLLPLEVWAEIADAVREFTPLFVIPHEKERE 357

Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
            VE++VG+D+SI+FITTPGQLAALINDSAGVIATNTAAIQLANAREKPS+ALFSS+ KG 
Sbjct: 358 NVEEIVGEDSSIIFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSVALFSSKEKGN 417

Query: 425 LFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
            FVP AEEKKC VISS+TGKLI+  VEAV NA+QIFN SLA A
Sbjct: 418 KFVPRAEEKKCIVISSKTGKLINIDVEAVTNAIQIFNVSLAFA 460


>gi|297844518|ref|XP_002890140.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335982|gb|EFH66399.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 460

 Score =  674 bits (1739), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/470 (72%), Positives = 390/470 (82%), Gaps = 13/470 (2%)

Query: 1   MATTSLLPKSTSPFLSNP-LSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKN 59
           MA+  LLPK  SPF   P  S+     F+          N    +T++      + KKKN
Sbjct: 1   MASLPLLPKPISPFFKTPPFSTSKQLVFL----------NFQTRLTSRSSDVSVNLKKKN 50

Query: 60  -PWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
            PWLDPFD GEDP+ +YGSLF DGKQDEDPRPPD+P+NPYGFLKFP G+  E+ASLPLKI
Sbjct: 51  NPWLDPFDSGEDPENEYGSLFVDGKQDEDPRPPDNPDNPYGFLKFPKGYTVELASLPLKI 110

Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
           RGDVRRCCC+ISGGVYENLLFFP IQL+KDRYPGV +D++ + RGKQT+ELNKNVRWANV
Sbjct: 111 RGDVRRCCCVISGGVYENLLFFPTIQLIKDRYPGVQVDILTTDRGKQTYELNKNVRWANV 170

Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
           YD DD WPEPAEYTD++G++K RYYDM+LSTKLAGLGHAAFLFMTTARDRVSYIYPNVN+
Sbjct: 171 YDPDDHWPEPAEYTDMIGILKGRYYDMILSTKLAGLGHAAFLFMTTARDRVSYIYPNVNS 230

Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AGAGL+LSETFTAE+ NLSE GY+MY+QM DWLGRPFRSVPR PV PLRVSISR+LK VV
Sbjct: 231 AGAGLMLSETFTAENANLSELGYSMYDQMEDWLGRPFRSVPRTPVLPLRVSISRKLKVVV 290

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
           AEKY+NAGA  GK+IVIHGIESDSKASMQS+GD+DSLL ++ WA+I  G+R F+P+FVIP
Sbjct: 291 AEKYRNAGAATGKFIVIHGIESDSKASMQSKGDSDSLLSLEKWAKIIKGIRGFKPVFVIP 350

Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           HEKERE VED VGDD SIVFITTPGQLAALINDSAGVIATNTAAIQLANAR+KPSI LFS
Sbjct: 351 HEKEREKVEDFVGDDTSIVFITTPGQLAALINDSAGVIATNTAAIQLANARDKPSIGLFS 410

Query: 419 SELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
           SE KG+LFVP AEEK  C +I+S+TGKL D  +  V NAMQIF  SLAL 
Sbjct: 411 SEEKGKLFVPYAEEKSNCVIIASKTGKLADIDIGTVKNAMQIFEGSLALV 460


>gi|18394307|ref|NP_563986.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana]
 gi|6587804|gb|AAF18495.1|AC010924_8 ESTs gb|T43280, gb|AA395195 come from this gene [Arabidopsis
           thaliana]
 gi|16323198|gb|AAL15333.1| At1g15980/T24D18_8 [Arabidopsis thaliana]
 gi|21537186|gb|AAM61527.1| unknown [Arabidopsis thaliana]
 gi|21700921|gb|AAM70584.1| At1g15980/T24D18_8 [Arabidopsis thaliana]
 gi|332191271|gb|AEE29392.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana]
          Length = 461

 Score =  668 bits (1723), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/463 (73%), Positives = 384/463 (82%), Gaps = 11/463 (2%)

Query: 6   LLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKN-PWLDP 64
           LLPK  SPF   P        F    PL     N    +T++      + KKKN PWLDP
Sbjct: 7   LLPKPISPFFKTP-------PFSTSKPLVFL--NFQTRLTSRSSDVSVNLKKKNNPWLDP 57

Query: 65  FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
           FD GEDPD +YGSLFADGKQDEDPRPPD+P+NPYGFLKFP G+  E+ASLPLKIRGDVRR
Sbjct: 58  FDSGEDPDNEYGSLFADGKQDEDPRPPDNPDNPYGFLKFPKGYTVELASLPLKIRGDVRR 117

Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
           CCC+ISGGVYENLLFFP IQL+KDRYPGV +D++ + RGKQT+ELNKNVRWANVYD DD 
Sbjct: 118 CCCVISGGVYENLLFFPTIQLIKDRYPGVQVDILTTERGKQTYELNKNVRWANVYDPDDH 177

Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
           WPEPAEYTD++G++K RYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN+AGAGL+
Sbjct: 178 WPEPAEYTDMIGLLKGRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNSAGAGLM 237

Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
           LSETFTAE+ NLSE GY+MY QM DWLGRPFRSVPR P+ PLRVSISR++KEVVA KY+N
Sbjct: 238 LSETFTAENTNLSELGYSMYTQMEDWLGRPFRSVPRTPLLPLRVSISRKVKEVVAAKYRN 297

Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
           AGA  GK+IVIHGIESDSKASMQS+GD DSLL ++ WA+I  G+R F+P+FVIPHEKERE
Sbjct: 298 AGAVTGKFIVIHGIESDSKASMQSKGDADSLLSLEKWAKIIKGVRGFKPVFVIPHEKERE 357

Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
            VED VGDD SIVFITTPGQLAALINDSAGVIATNTAAIQLANAR+KP I LFSSE KG+
Sbjct: 358 NVEDFVGDDTSIVFITTPGQLAALINDSAGVIATNTAAIQLANARDKPCIGLFSSEEKGK 417

Query: 425 LFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
           LFVP AEEK  C +I+S+TGKL D  +  V NAMQ+F  SLAL
Sbjct: 418 LFVPYAEEKSNCVIIASKTGKLADIDIGTVKNAMQVFEGSLAL 460


>gi|449436635|ref|XP_004136098.1| PREDICTED: uncharacterized protein LOC101205787 [Cucumis sativus]
          Length = 466

 Score =  665 bits (1717), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/469 (74%), Positives = 393/469 (83%), Gaps = 7/469 (1%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLPST--HFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKK 58
           MAT+SL  KS  PFL        S   HF KP   F  P   SL  ++K   FQ    KK
Sbjct: 1   MATSSLPTKSLPPFLPTSSFPPSSPSTHFTKP--FFLSPPLISLSTSSK---FQIPCSKK 55

Query: 59  NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
           NPW DPFDDGEDP+M+YGSL++DGKQ+EDPRPPD+PNNPYGFLKFP G+  E+ASL LK+
Sbjct: 56  NPWFDPFDDGEDPEMEYGSLYSDGKQEEDPRPPDNPNNPYGFLKFPQGYMVEVASLGLKL 115

Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
           RGDVRRCCC+ISGGVYENLLFFP IQLLKDRYPGV IDV+ SARGKQTFELNKNVRW+NV
Sbjct: 116 RGDVRRCCCVISGGVYENLLFFPTIQLLKDRYPGVQIDVVTSARGKQTFELNKNVRWSNV 175

Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
           YD DD +PEPAEYTD++G++K RYYDMVLSTKLAGLGHA FLFMTTARDRVSYIYP+VNA
Sbjct: 176 YDPDDYFPEPAEYTDMVGLLKGRYYDMVLSTKLAGLGHAVFLFMTTARDRVSYIYPDVNA 235

Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AGAGL LSETF  ES NLSE G+NMY QMVDWLGRPFRSVPRHPVPPL+VSI+R+LKEV+
Sbjct: 236 AGAGLFLSETFKPESSNLSEGGFNMYHQMVDWLGRPFRSVPRHPVPPLKVSIARKLKEVI 295

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
           AEKY+ AGAE+GKYIVIHGIESDSKASMQS+GDTDSLLPI+ WAEIA  +R F+P+FVIP
Sbjct: 296 AEKYRKAGAEKGKYIVIHGIESDSKASMQSKGDTDSLLPIRAWAEIAKCIRGFKPVFVIP 355

Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           HEKERE VE+ VG++ASIVFITTPGQLAALINDSAG+IATNTAA+QLA AREKPSIALF 
Sbjct: 356 HEKERESVEEEVGNNASIVFITTPGQLAALINDSAGLIATNTAAVQLAIAREKPSIALFC 415

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
           SE K +LFVPNAE K+C  +SS TG+LID  +  +  A+Q F   LALA
Sbjct: 416 SEEKAKLFVPNAEAKRCITVSSATGRLIDIDMGTIKKAIQSFEVPLALA 464


>gi|357145443|ref|XP_003573644.1| PREDICTED: uncharacterized protein LOC100839724 [Brachypodium
           distachyon]
          Length = 470

 Score =  602 bits (1552), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 278/419 (66%), Positives = 345/419 (82%), Gaps = 7/419 (1%)

Query: 56  KKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLP 115
           KKKNPWLDPFDDG D D  Y  +++ GKQDEDPRPP+DP+NPYGFL+FPMG+NPE+ SL 
Sbjct: 50  KKKNPWLDPFDDGPDEDFDYTGVYSGGKQDEDPRPPEDPSNPYGFLRFPMGYNPELDSLA 109

Query: 116 LKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW 175
            K+RGDVRR CC++SGGVYEN+LFFP +QLLKDRYPGVLIDV+ASARGKQ +E+ KNVR+
Sbjct: 110 SKVRGDVRRACCVVSGGVYENVLFFPVVQLLKDRYPGVLIDVVASARGKQVYEMCKNVRY 169

Query: 176 ANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 235
           ANVYD DDDWPEPAEYT  LGV+KNRYYDM+LSTKLAG GHA FLFM++AR++V Y+YPN
Sbjct: 170 ANVYDPDDDWPEPAEYTHQLGVLKNRYYDMILSTKLAGTGHALFLFMSSAREKVGYVYPN 229

Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
           VN AGAGL L+E F   + NLS+ GYNMY++M++WLGRP + VP+ P+PPLRVSIS++L+
Sbjct: 230 VNGAGAGLFLTEMFKPPTTNLSDGGYNMYQEMLEWLGRPGKGVPQQPIPPLRVSISKKLR 289

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE----F 351
            +V +KY  AG E+GK++V+HGI SDS A+M SRGD D LLP++ WAEIA  +       
Sbjct: 290 GIVEDKYSRAGVEKGKFVVVHGIASDSVANMTSRGDDDCLLPLEQWAEIAKAISSQGNGL 349

Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
           RPLFV+PH+K RE +ED+VG + S +FITTPGQL  LINDSAGV+ATNTAA+QLANAR++
Sbjct: 350 RPLFVMPHQKHREEIEDIVGRETSYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDR 409

Query: 412 PSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFN--ESLALA 467
           P +ALFSS+ K +LF+P AE+KK CTV++S TGKL D  +EAV NA++ F    S ALA
Sbjct: 410 PCVALFSSKEKAKLFLPYAEDKKSCTVVASATGKLADIDIEAVKNAVKEFEPAPSFALA 468


>gi|115475672|ref|NP_001061432.1| Os08g0276100 [Oryza sativa Japonica Group]
 gi|37806059|dbj|BAC99486.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623401|dbj|BAF23346.1| Os08g0276100 [Oryza sativa Japonica Group]
 gi|215737469|dbj|BAG96599.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 277/426 (65%), Positives = 345/426 (80%), Gaps = 13/426 (3%)

Query: 52  QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
           +  AKKKNPWLDPFDDG D +  Y   F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+
Sbjct: 45  RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPEDPANPYGFLRFPAGYNPEL 104

Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
            SL  K+R DVRR CC++SGGVYEN+LFFP +QLL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 105 DSLASKVRRDVRRACCVVSGGVYENVLFFPVVQLLRDRYPGVVVDVVASARGKQVYEMCK 164

Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
           NVR+A+VYD DDDWPEPAEYT  LGV+KNRYYD+++STKLAG+GHA FLFM++ARD+V Y
Sbjct: 165 NVRYADVYDPDDDWPEPAEYTHQLGVLKNRYYDLIISTKLAGIGHALFLFMSSARDKVGY 224

Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
           +YPNVN+AGAGL L+E F   + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 225 VYPNVNSAGAGLFLTEMFRPPTTNLADSGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 284

Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
           ++L+  V +KY  AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA      
Sbjct: 285 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 344

Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
            NGL   +PLFVIPH+K RE VE+ VG D +I+FITTPGQL  LINDS GV+ATNTAA+Q
Sbjct: 345 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 401

Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
           LANAR+KP +ALFSS+ K RLF+P  EEKK CTV++S TGKLID  VEAV  A++ F   
Sbjct: 402 LANARDKPCVALFSSKEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 461

Query: 463 -SLALA 467
            S+ALA
Sbjct: 462 PSVALA 467


>gi|226508728|ref|NP_001142788.1| uncharacterized protein LOC100275158 [Zea mays]
 gi|195609704|gb|ACG26682.1| hypothetical protein [Zea mays]
 gi|238010026|gb|ACR36048.1| unknown [Zea mays]
 gi|413916872|gb|AFW56804.1| hypothetical protein ZEAMMB73_257177 [Zea mays]
          Length = 466

 Score =  587 bits (1513), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 269/409 (65%), Positives = 336/409 (82%), Gaps = 5/409 (1%)

Query: 55  AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
           AKKKNPWLDPFDDG D +  Y  +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 45  AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPENPYGFLRFPQGYNPELDSL 104

Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
             K+RGDVRR CC++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR
Sbjct: 105 ASKVRGDVRRACCVVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVR 164

Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
           +ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 165 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIISTRLAGIGHALFLFMSSARDKVGYVYP 224

Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
           NVN+ GAGL L+E F A + NL++ GY+MY+ M++W+GRP ++VP+ P  PLRVSIS++L
Sbjct: 225 NVNSVGAGLFLNEMFKAPTNNLADGGYHMYKDMLEWIGRPAKNVPQQPTQPLRVSISKKL 284

Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLRE 350
           +  V +KY  AG E+GKY+VIHGI SDS A+M+SRGD D LLP+  WA+IA    +G R 
Sbjct: 285 RAYVEDKYSRAGVEKGKYVVIHGIASDSVANMKSRGDDDCLLPLGHWAQIAKEISSGDRG 344

Query: 351 FRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANARE 410
            +PLFVIPHEK R+ +E+ VGDD SI+FITTPGQL  LINDSAGV+ATNTAA+QLANAR+
Sbjct: 345 LKPLFVIPHEKHRDEIEEEVGDDTSILFITTPGQLTCLINDSAGVVATNTAAVQLANARD 404

Query: 411 KPSIALFSSELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
           KP +ALFSS  K RLF+P  E K  CTVISS TG LID  + AV N ++
Sbjct: 405 KPCVALFSSAEKARLFLPYVEGKSSCTVISSATGNLIDIDLNAVKNVVK 453


>gi|413921876|gb|AFW61808.1| hypothetical protein ZEAMMB73_450954 [Zea mays]
          Length = 471

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 269/409 (65%), Positives = 337/409 (82%), Gaps = 5/409 (1%)

Query: 55  AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
           AKKKNPWLDPFDDG D +  Y  +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 50  AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPANPYGFLRFPQGYNPELDSL 109

Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
             K+RGDVRR CC++SGGVYEN+LF P +Q+LKDRYPGVL+DV+AS RGKQ +E+ KNVR
Sbjct: 110 ASKVRGDVRRACCVVSGGVYENVLFLPVVQMLKDRYPGVLVDVVASPRGKQVYEMCKNVR 169

Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
           +ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 170 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIVSTRLAGIGHALFLFMSSARDKVGYVYP 229

Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
           NVN+ GAGL L+E F A + NL++ GY+MY+ M++W+GRP ++VPR P  PLRVSIS++L
Sbjct: 230 NVNSVGAGLFLNEMFKAPTNNLADGGYHMYKDMLEWIGRPAKNVPRQPTQPLRVSISKKL 289

Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLRE 350
           +  V +KY  AG E+GKY+V+HGI SDS ASM SRGD D LLP++ WA+IA    +G + 
Sbjct: 290 RAYVEDKYSRAGVEKGKYVVVHGIASDSVASMTSRGDDDCLLPLEHWAQIAKEISSGDKG 349

Query: 351 FRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANARE 410
            +PLFVIPH+K RE VE+ VGDD +I+FITTPGQL  LINDSAGV+ATNTAA+QLANAR+
Sbjct: 350 LKPLFVIPHDKHREEVEEEVGDDTNILFITTPGQLTCLINDSAGVVATNTAAVQLANARD 409

Query: 411 KPSIALFSSELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
           KP +ALFSS  K RLF+P  E+K  CTVISS TGKLID  + AV NA++
Sbjct: 410 KPCVALFSSAEKARLFLPYVEDKSSCTVISSMTGKLIDIDINAVKNAVK 458


>gi|326487296|dbj|BAJ89632.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326496056|dbj|BAJ90649.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 469

 Score =  573 bits (1476), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 277/456 (60%), Positives = 361/456 (79%), Gaps = 11/456 (2%)

Query: 12  SPFLSNPLSSLPSTHFIK--PSPLFNPPDNHSLLITTKRLS--FQTHAKKKNPWLDPFDD 67
           +P +  P+++ P++      P+P   P   H  L+  +R++   +  +KK+NPWLDPFDD
Sbjct: 3   TPTMPTPMAARPTSTMATKLPAPSATPRQCH--LLPGRRVAGLARASSKKRNPWLDPFDD 60

Query: 68  GEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCC 127
           G D +  Y  +++ GKQ+EDPRPP+D  NPYGFL+FPMG+ PE+ SL  K+RGDVRR CC
Sbjct: 61  GPDEEFDYTGVYSGGKQEEDPRPPEDAENPYGFLRFPMGYMPELDSLASKVRGDVRRVCC 120

Query: 128 IISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE 187
           ++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR+ANVYD DD+WPE
Sbjct: 121 VVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVRYANVYDPDDEWPE 180

Query: 188 PAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSE 247
           PAEYT  LGVMKNRYYDMVLSTKLAG GHA FLFM++AR++V Y+YPNVN AGAGL L+E
Sbjct: 181 PAEYTHQLGVMKNRYYDMVLSTKLAGTGHALFLFMSSAREKVGYVYPNVNGAGAGLFLTE 240

Query: 248 TFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
            F   + NL++ GYNMY+ M++WLGRP + VP+ P+PPLRVSIS++L+ VV +KY  AG 
Sbjct: 241 MFKPATTNLADGGYNMYQDMLEWLGRPAKGVPQQPIPPLRVSISKKLRAVVEDKYNRAGV 300

Query: 308 EQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKER 363
           E+GKY+VIHGIESDS A+M+SRGD D LLP+++WAEIA    +G    RPLFV+PHE+ R
Sbjct: 301 EKGKYVVIHGIESDSVANMKSRGDDDCLLPLELWAEIAKEISSGGNGLRPLFVMPHERHR 360

Query: 364 EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKG 423
           E +E++VG++ + +FITTPGQL  LINDSAGV+ATNTAA+QLANAR+KP +ALFSS+ K 
Sbjct: 361 EEIEEIVGEETAYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSKEKA 420

Query: 424 RLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQ 458
           RLF+P  EE+K CTV++S TGKL    +EAV  A++
Sbjct: 421 RLFLPYVEERKSCTVVASATGKLAGIDIEAVKKAVK 456


>gi|326525951|dbj|BAJ93152.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 469

 Score =  571 bits (1471), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 276/456 (60%), Positives = 361/456 (79%), Gaps = 11/456 (2%)

Query: 12  SPFLSNPLSSLPSTHFIK--PSPLFNPPDNHSLLITTKRLS--FQTHAKKKNPWLDPFDD 67
           +P +  P+++ P++      P+P   P   H  L+  +R++   +  +KK+NPWLDPFDD
Sbjct: 3   TPTMPTPMAARPTSTMATKLPAPSATPRQCH--LLPGRRVAGLARASSKKRNPWLDPFDD 60

Query: 68  GEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCC 127
           G D +  Y  +++ GKQ+EDPRPP+D  NPYGFL+FPMG+ PE+ SL  K+RGDVRR CC
Sbjct: 61  GPDEEFDYTGVYSGGKQEEDPRPPEDAENPYGFLRFPMGYMPELDSLASKVRGDVRRVCC 120

Query: 128 IISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE 187
           ++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARG+Q +E+ KNVR+ANVYD DD+WPE
Sbjct: 121 VVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGEQVYEMCKNVRYANVYDPDDEWPE 180

Query: 188 PAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSE 247
           PAEYT  LGVMKNRYYDMVLSTKLAG GHA FLFM++AR++V Y+YPNVN AGAGL L+E
Sbjct: 181 PAEYTHQLGVMKNRYYDMVLSTKLAGTGHALFLFMSSAREKVGYVYPNVNGAGAGLFLTE 240

Query: 248 TFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
            F   + NL++ GYNMY+ M++WLGRP + VP+ P+PPLRVSIS++L+ VV +KY  AG 
Sbjct: 241 MFKPATTNLADGGYNMYQDMLEWLGRPAKGVPQQPIPPLRVSISKKLRAVVEDKYNRAGV 300

Query: 308 EQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKER 363
           E+GKY+VIHGIESDS A+M+SRGD D LLP+++WAEIA    +G    RPLFV+PHE+ R
Sbjct: 301 EKGKYVVIHGIESDSVANMKSRGDDDCLLPLELWAEIAKEISSGGNGLRPLFVMPHERHR 360

Query: 364 EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKG 423
           E +E++VG++ + +FITTPGQL  LINDSAGV+ATNTAA+QLANAR+KP +ALFSS+ K 
Sbjct: 361 EEIEEIVGEETAYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSKEKA 420

Query: 424 RLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQ 458
           RLF+P  EE+K CTV++S TGKL    +EAV  A++
Sbjct: 421 RLFLPYVEERKSCTVVASATGKLAGIDIEAVKKAVK 456


>gi|218200819|gb|EEC83246.1| hypothetical protein OsI_28562 [Oryza sativa Indica Group]
          Length = 462

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 270/426 (63%), Positives = 336/426 (78%), Gaps = 15/426 (3%)

Query: 52  QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
           +  AKKKNPWLDPFDDG D +  Y   F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+
Sbjct: 40  RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPEDPANPYGFLRFPAGYNPEL 99

Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
            SL  K+R DVRR CC++SGGVYEN+LFFP +QLL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 100 DSLASKVRRDVRRACCVVSGGVYENVLFFPVVQLLRDRYPGVVVDVVASARGKQVYEMCK 159

Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
           NVR+A+VYD DDDWPEPAE         +RYYD++LST+LAG GH  FLFM++ARD+V Y
Sbjct: 160 NVRYADVYDPDDDWPEPAETPTSSAC--SRYYDLILSTRLAGFGHGLFLFMSSARDKVGY 217

Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
           +YPNVN+AGAGL L+E F   + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 218 VYPNVNSAGAGLFLTEMFRPPTTNLADGGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 277

Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
           ++L+  V +KY  AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA      
Sbjct: 278 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 337

Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
            NGL   +PLFVIPH+K RE VE+ VG D +I+FITTPGQL  LINDS GV+ATNTAA+Q
Sbjct: 338 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 394

Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
           LANAR+KP +ALFSSE K RLF+P  EEKK CTV++S TGKLID  VEAV  A++ F   
Sbjct: 395 LANARDKPCVALFSSEEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 454

Query: 463 -SLALA 467
            S+ALA
Sbjct: 455 PSVALA 460


>gi|222640246|gb|EEE68378.1| hypothetical protein OsJ_26704 [Oryza sativa Japonica Group]
          Length = 426

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 245/426 (57%), Positives = 304/426 (71%), Gaps = 51/426 (11%)

Query: 52  QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
           +  AKKKNPWLDPFDDG D +  Y   F+ GKQ+EDPRPP  P  P      P G     
Sbjct: 40  RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPGGPGEPVRVPAVPGG----- 94

Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
                                            LL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 95  ---------------------------------LLRDRYPGVVVDVVASARGKQVYEMCK 121

Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
           NVR+A+VYD DDDWPEPAEYT  LGV+KNRYYD+++STKLAG+GHA FLFM++ARD+V Y
Sbjct: 122 NVRYADVYDPDDDWPEPAEYTHQLGVLKNRYYDLIISTKLAGIGHALFLFMSSARDKVGY 181

Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
           +YPNVN+AGAGL L+E F   + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 182 VYPNVNSAGAGLFLTEMFRPPTTNLADSGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 241

Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
           ++L+  V +KY  AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA      
Sbjct: 242 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 301

Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
            NGL   +PLFVIPH+K RE VE+ VG D +I+FITTPGQL  LINDS GV+ATNTAA+Q
Sbjct: 302 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 358

Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
           LANAR+KP +ALFSS+ K RLF+P  EEKK CTV++S TGKLID  VEAV  A++ F   
Sbjct: 359 LANARDKPCVALFSSKEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 418

Query: 463 -SLALA 467
            S+ALA
Sbjct: 419 PSVALA 424


>gi|302757169|ref|XP_002962008.1| hypothetical protein SELMODRAFT_64169 [Selaginella moellendorffii]
 gi|300170667|gb|EFJ37268.1| hypothetical protein SELMODRAFT_64169 [Selaginella moellendorffii]
          Length = 399

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 219/400 (54%), Positives = 291/400 (72%), Gaps = 4/400 (1%)

Query: 61  WLDPF-DDGEDPDMQYGSLFADGKQ--DEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
           W DPF  + EDPD+ +G+LF +G +  +++  PP DP +  GF K P G+  EI  L  K
Sbjct: 1   WKDPFLFNEEDPDLVWGNLFTEGGRQPEQEYIPPLDPKSKSGFRKVPKGYTAEIRPLAAK 60

Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
           +R +VR+C C+++GGVYENLLFFP I+LLKDRYPGV IDVIAS RGKQ +E+NKNV  A 
Sbjct: 61  VRDEVRKCLCLVAGGVYENLLFFPVIELLKDRYPGVEIDVIASERGKQVYEMNKNVSRAW 120

Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
           V+D+D  + +PA Y ++L ++KN YYDM++STKLAG+GHA F+++T     + YIYP+VN
Sbjct: 121 VFDIDQQFIKPALYMEMLTMVKNEYYDMIVSTKLAGIGHALFMWLTDCTKNIGYIYPDVN 180

Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
            AGAG  L     A  +NL+E GY+MY ++ + LG+P + +PR   P L V IS RL+ V
Sbjct: 181 GAGAGPFLYAAVDAPRLNLAEGGYHMYHELNEELGKPAKGIPRIYPPQLTVGISSRLRGV 240

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
           + +K+  A   +G++IVIHGIESDS A+M+SRGD+DSLLP+  W+ IA  L    P+FVI
Sbjct: 241 ILKKWTEASLRRGEFIVIHGIESDSAATMKSRGDSDSLLPVSRWSLIAKSLSSKVPVFVI 300

Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           PHEKER  VE+V+G DA I+FITTPGQLAALINDS GVI TNTAA+QLA + +KP++ALF
Sbjct: 301 PHEKERSKVEEVIGPDAHIIFITTPGQLAALINDSFGVITTNTAALQLAISVKKPTVALF 360

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           SS+ K +LFVPN     C  I+S TGKL+D  +E V NA+
Sbjct: 361 SSDEKAKLFVPN-HATSCDCIASDTGKLLDVSMEKVKNAI 399


>gi|302775324|ref|XP_002971079.1| hypothetical protein SELMODRAFT_64160 [Selaginella moellendorffii]
 gi|300161061|gb|EFJ27677.1| hypothetical protein SELMODRAFT_64160 [Selaginella moellendorffii]
          Length = 399

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 217/400 (54%), Positives = 291/400 (72%), Gaps = 4/400 (1%)

Query: 61  WLDPF-DDGEDPDMQYGSLFADGKQ--DEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
           W DPF  + EDPD+ +G+LF +G +  +++  PP DP +  GF K P G+  EI  L  K
Sbjct: 1   WKDPFLFNEEDPDLVWGNLFTEGGRQPEQEYIPPLDPKSKSGFRKVPKGYTAEIRPLAAK 60

Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
           +R +VR+C C+++GGVYENLLFFP I+LLKDRYPGV IDVIAS RGKQ +E+NKNV  A 
Sbjct: 61  VRDEVRKCLCLVAGGVYENLLFFPVIELLKDRYPGVEIDVIASERGKQVYEMNKNVSRAW 120

Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
           V+D+D  + +PA Y ++L ++KN YYDM++STKLAG+GHA F+++T     + YIYP+VN
Sbjct: 121 VFDIDQQFIKPALYMEMLTMVKNEYYDMIVSTKLAGIGHALFMWLTDCTKNIGYIYPDVN 180

Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
            AGAG  L     A  +NL+E GY+MY ++ + LG+P + +PR   P L V IS RL+ V
Sbjct: 181 GAGAGPFLYAAVDAPRLNLAEGGYHMYHELNEELGKPAKGIPRIYPPQLTVGISSRLRGV 240

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
           + +K+  A   +G++IVIHGIESDS A+M+SRGD+DSLLP+  W+ IA  L    P+FVI
Sbjct: 241 ILKKWTEASLRRGEFIVIHGIESDSAATMKSRGDSDSLLPLSKWSLIAKSLSSKVPVFVI 300

Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           PHEKER  V++V+G DA I+FITTPGQ+AALINDS GVI TNTAA+QLA + +KP++ALF
Sbjct: 301 PHEKERSKVQEVIGPDAHIIFITTPGQIAALINDSFGVITTNTAAVQLAISVKKPTVALF 360

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           SS+ K +LFVPN     C  I+S TGKL+D  +E V NA+
Sbjct: 361 SSDEKAKLFVPN-HATSCDCIASDTGKLLDVSMEKVKNAI 399


>gi|168029037|ref|XP_001767033.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681775|gb|EDQ68199.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 473

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 217/421 (51%), Positives = 295/421 (70%), Gaps = 4/421 (0%)

Query: 50  SFQTHAKKKNPWLDPFDDGEDPDMQ-YGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGF 107
           S    AKKK  + DPFD G D +    G L + G Q+ EDPRP  DP++  G+L FP GF
Sbjct: 54  SVVASAKKKGWFDDPFDYGADEEEDTMGELMSQGPQEAEDPRPSRDPDSESGYLDFPAGF 113

Query: 108 NPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTF 167
            PE+ASL   IR DVRRC C++SGGVYENLLFFP IQ+LK+RYPGV IDV+A+ RGKQT+
Sbjct: 114 MPEVASLGTLIRNDVRRCLCMVSGGVYENLLFFPVIQMLKNRYPGVRIDVMATPRGKQTY 173

Query: 168 ELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD 227
           E+NKNVR A V+ +DD +  P ++T+ +G +K  YYD+V+STKLAGLG + F ++ + R+
Sbjct: 174 EMNKNVRMAWVHPVDDQFLRPVDFTETIGKIKGEYYDLVVSTKLAGLGQSIFFWLASVRN 233

Query: 228 RVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLR 287
           +VSY +P+VNAAGA   L     A  + L+E G+NMY +M++ L +  +++P+   PPL 
Sbjct: 234 KVSYTFPDVNAAGATKFLDVAIKAPQLELAESGFNMYAEMIEELSQMGKNLPKTEAPPLE 293

Query: 288 VSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANG 347
           V I R++K  V  KY  AG  +G+++V HGI+SDS ASM SRGD DSLLPI++WAEIA  
Sbjct: 294 VGIGRKVKAYVENKYLEAGLSEGEFLVFHGIKSDSSASMTSRGDKDSLLPIEMWAEIAKS 353

Query: 348 LREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLAN 407
             E + +FVIP+EK R  V++  G++  IVFITTP QL ALIN S GV+ TNTAA+QLA 
Sbjct: 354 TSE-KVVFVIPNEKLRSKVKESCGENTHIVFITTPSQLGALINASTGVVTTNTAALQLAI 412

Query: 408 AREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ-IFNESLAL 466
           A +KP+++LF+S+ K +LF+P      C V++S+TGKL +  V+AV  A+  I  E+L  
Sbjct: 413 ALKKPTVSLFASQEKAKLFIPEYARDSCAVVASKTGKLSNLDVKAVTMALSTIAKEALVA 472

Query: 467 A 467
           A
Sbjct: 473 A 473


>gi|168023336|ref|XP_001764194.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684634|gb|EDQ71035.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 475

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 210/414 (50%), Positives = 290/414 (70%), Gaps = 5/414 (1%)

Query: 58  KNPWLD-PFDDG-EDPDMQYGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
           K  W D PFD G +D +   G L + G Q  EDPRP  DP++  G+L FP GF PE+ASL
Sbjct: 63  KKGWFDDPFDYGADDEEDTMGELMSQGPQGAEDPRPARDPDSESGYLDFPAGFMPEVASL 122

Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
            + IR DVRRC C+ISGGVYENLLFFP IQLLK+RYPGV IDV+A+ RGKQ +E+NKNVR
Sbjct: 123 GILIRNDVRRCLCMISGGVYENLLFFPVIQLLKNRYPGVRIDVMATPRGKQAYEMNKNVR 182

Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
            A V+ +DD +  P ++T+ +G +K  YYD+++STKLAGLG + F ++ + R++VSY YP
Sbjct: 183 KAWVHPVDDQFLRPVDFTETVGKIKGEYYDLLVSTKLAGLGQSIFFWLASVRNKVSYTYP 242

Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
           +VNAAGA   L     A  + L+E G+NMY +M++ L +  ++VP+  VPPL V I  ++
Sbjct: 243 DVNAAGAAKFLDIAIKAPQLELAESGFNMYAEMIEELSQMGKNVPKTEVPPLEVGIGSKV 302

Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPL 354
           K  V  KY+ AG  +G+++V HGIE DS ASM S+GD D LLP+ +WAEIA    + + +
Sbjct: 303 KAYVEAKYREAGVREGEFLVFHGIECDSSASMTSKGDKDCLLPLSMWAEIAKSTSD-KVV 361

Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
           FVIP+EK R  V+++ G++A IVFITTPGQL ALIN S GV+ TNTAA+Q+A A +K ++
Sbjct: 362 FVIPNEKWRRKVKEICGENAHIVFITTPGQLGALINASKGVVTTNTAALQIAIALKKSTV 421

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ-IFNESLALA 467
           ALF+S+ K  LF+P+  +  C +++S+TGKL    ++A   A+  I  E+L  A
Sbjct: 422 ALFASQEKANLFIPDYAKDACAMVASKTGKLCGLDLKAATMAVSTIAKEALVAA 475


>gi|398313921|emb|CCI55387.1| NDH subunit PnsB1 [Marchantia polymorpha]
          Length = 480

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 201/403 (49%), Positives = 274/403 (67%), Gaps = 4/403 (0%)

Query: 58  KNPWLDPFDDGEDPDMQYGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGFNPEIASLPL 116
           KN   DPFD GED D++YG L ++GKQ  E PRP  D N+ YGFL FP   N EIASL L
Sbjct: 69  KNMLDDPFDFGEDEDLEYGDLMSEGKQGVEPPRPRKDKNSKYGFLDFPGYHNLEIASLGL 128

Query: 117 KIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWA 176
            +R DVRRC C ++GGVYENLLFFP I+LLK RYPGV +D++ SARGKQ +E+NK V  A
Sbjct: 129 YVRKDVRRCLCWVAGGVYENLLFFPVIKLLKARYPGVKVDIVTSARGKQVYEMNKYVNKA 188

Query: 177 NVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD-RVSYIYPN 235
             +D+D +W EP E  + +GV++++YYDMVLSTK AG  H   L+M   R  R+SY+ P 
Sbjct: 189 YAFDVDAEWVEPVELNEFIGVLRDQYYDMVLSTKYAGFKHCTTLYMIGGRTRRISYVLPY 248

Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
            +   +   L+ +      NL++ GY+MY++++D+L +P   VP   VP + V + +R++
Sbjct: 249 HSEWVSNSFLTTSLMPPRENLADGGYHMYKELIDYLAQPGNGVPEQAVPMMEVGVPKRVR 308

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
            V   KY  AG E GK++V HG+ESDS ASMQ++GD+DSLLP + WA++         L 
Sbjct: 309 AVATSKYTEAGVEAGKFVVFHGVESDSGASMQTKGDSDSLLPPEFWAKLKASAGT-SVLV 367

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
           VIP+ K+++ V    GDD  +VFITTPGQL A+I DSAGV++TNTAA+Q+A A  KPS+A
Sbjct: 368 VIPNMKDKKKVIAACGDDVHVVFITTPGQLGAIIRDSAGVVSTNTAAVQIAIAFNKPSVA 427

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ 458
           +F S+ K  LFVP    KKC V++S+TGKL D  V+  ++A +
Sbjct: 428 IFGSDEKASLFVPPL-AKKCKVVTSKTGKLADVEVDEAISAWK 469


>gi|449491201|ref|XP_004158827.1| PREDICTED: uncharacterized protein LOC101226812 [Cucumis sativus]
          Length = 260

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 194/265 (73%), Positives = 215/265 (81%), Gaps = 7/265 (2%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLPST--HFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKK 58
           MAT+SL  KS  PFL        S   HF KP   F  P   SL  ++K   FQ    KK
Sbjct: 1   MATSSLPTKSLPPFLPTSSFPPSSPSTHFTKP--FFLSPPLISLSTSSK---FQIPCSKK 55

Query: 59  NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
           NPW DPFDDGEDP+M+YGSL++DGKQ+EDPRPPD+PNNPYGFLKFP G+  E+ASL LKI
Sbjct: 56  NPWFDPFDDGEDPEMEYGSLYSDGKQEEDPRPPDNPNNPYGFLKFPQGYMVEVASLGLKI 115

Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
           RGDVRRCCC+ISGGVYENLLFFP IQLLKDRYPGV IDV+ SARGKQTFELNKNVRW+NV
Sbjct: 116 RGDVRRCCCVISGGVYENLLFFPTIQLLKDRYPGVQIDVVTSARGKQTFELNKNVRWSNV 175

Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
           YD DD +PEPAEYTD++G++K RYYDMVLSTKLAGLGHA FLFMTTARDRVSYIYP+VNA
Sbjct: 176 YDPDDYFPEPAEYTDMVGLLKGRYYDMVLSTKLAGLGHAVFLFMTTARDRVSYIYPDVNA 235

Query: 239 AGAGLLLSETFTAESMNLSERGYNM 263
           AGAGL LSETF  ES NLSE G+NM
Sbjct: 236 AGAGLFLSETFKPESSNLSEGGFNM 260


>gi|413921875|gb|AFW61807.1| hypothetical protein ZEAMMB73_450954 [Zea mays]
          Length = 284

 Score =  345 bits (885), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 166/270 (61%), Positives = 216/270 (80%), Gaps = 5/270 (1%)

Query: 194 ILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAES 253
           +L  ++NRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YPNVN+ GAGL L+E F A +
Sbjct: 2   LLQTLQNRYYDLIVSTRLAGIGHALFLFMSSARDKVGYVYPNVNSVGAGLFLNEMFKAPT 61

Query: 254 MNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYI 313
            NL++ GY+MY+ M++W+GRP ++VPR P  PLRVSIS++L+  V +KY  AG E+GKY+
Sbjct: 62  NNLADGGYHMYKDMLEWIGRPAKNVPRQPTQPLRVSISKKLRAYVEDKYSRAGVEKGKYV 121

Query: 314 VIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKEREGVEDV 369
           V+HGI SDS ASM SRGD D LLP++ WA+IA    +G +  +PLFVIPH+K RE VE+ 
Sbjct: 122 VVHGIASDSVASMTSRGDDDCLLPLEHWAQIAKEISSGDKGLKPLFVIPHDKHREEVEEE 181

Query: 370 VGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPN 429
           VGDD +I+FITTPGQL  LINDSAGV+ATNTAA+QLANAR+KP +ALFSS  K RLF+P 
Sbjct: 182 VGDDTNILFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSAEKARLFLPY 241

Query: 430 AEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
            E+K  CTVISS TGKLID  + AV NA++
Sbjct: 242 VEDKSSCTVISSMTGKLIDIDINAVKNAVK 271


>gi|413916873|gb|AFW56805.1| hypothetical protein ZEAMMB73_257177 [Zea mays]
          Length = 253

 Score =  337 bits (865), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 146/209 (69%), Positives = 182/209 (87%)

Query: 55  AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
           AKKKNPWLDPFDDG D +  Y  +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 45  AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPENPYGFLRFPQGYNPELDSL 104

Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
             K+RGDVRR CC++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR
Sbjct: 105 ASKVRGDVRRACCVVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVR 164

Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
           +ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 165 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIISTRLAGIGHALFLFMSSARDKVGYVYP 224

Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNM 263
           NVN+ GAGL L+E F A + NL++ GY+M
Sbjct: 225 NVNSVGAGLFLNEMFKAPTNNLADGGYHM 253


>gi|449528831|ref|XP_004171406.1| PREDICTED: uncharacterized LOC101205787, partial [Cucumis sativus]
          Length = 206

 Score =  307 bits (787), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 156/204 (76%), Positives = 178/204 (87%)

Query: 264 YEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSK 323
           Y QMVDWLGRPFRSVPRHPVPPL+VSI+R+LKEV+AEKYK AGAE+GKYIVIHGIESDSK
Sbjct: 1   YHQMVDWLGRPFRSVPRHPVPPLKVSIARKLKEVIAEKYKKAGAEKGKYIVIHGIESDSK 60

Query: 324 ASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPG 383
           ASMQS+GDTDSLLPI+ WAEIA  +R F+P+FVIPHEKERE VE+ VG++ASIVFITTPG
Sbjct: 61  ASMQSKGDTDSLLPIRAWAEIAKCIRGFKPVFVIPHEKERESVEEEVGNNASIVFITTPG 120

Query: 384 QLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTG 443
           QLAALINDSAG+IATNTAA+QLA AREKPSIALF SE K +LFVPNAE K+C  +SS TG
Sbjct: 121 QLAALINDSAGLIATNTAAVQLAIAREKPSIALFCSEEKAKLFVPNAEAKRCITVSSATG 180

Query: 444 KLIDTPVEAVLNAMQIFNESLALA 467
           +LID  +  +  A+Q F   LALA
Sbjct: 181 RLIDIDMGTIKKAIQSFEVPLALA 204


>gi|253761918|ref|XP_002489333.1| hypothetical protein SORBIDRAFT_0010s015060 [Sorghum bicolor]
 gi|241946981|gb|EES20126.1| hypothetical protein SORBIDRAFT_0010s015060 [Sorghum bicolor]
          Length = 116

 Score =  123 bits (308), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 53/80 (66%), Positives = 66/80 (82%)

Query: 129 ISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEP 188
           +SGGVYEN+LFFP  Q+LKDRYPGVL DV+ASA+GKQ +E+  NV +ANVYD DDDWPEP
Sbjct: 23  LSGGVYENVLFFPVAQMLKDRYPGVLFDVVASAKGKQVYEMCMNVTYANVYDPDDDWPEP 82

Query: 189 AEYTDILGVMKNRYYDMVLS 208
            EYT +LGV+K    D+V++
Sbjct: 83  TEYTHMLGVLKGDTVDLVVA 102


>gi|434384628|ref|YP_007095239.1| ADP-heptose:LPS heptosyltransferase [Chamaesiphon minutus PCC 6605]
 gi|428015618|gb|AFY91712.1| ADP-heptose:LPS heptosyltransferase [Chamaesiphon minutus PCC 6605]
          Length = 320

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 159/339 (46%), Gaps = 28/339 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP I  LK RYP   IDV+   R    + ++K+V     +D   
Sbjct: 2   RILALVPGGIGDQILFFPTIDNLKGRYPDAEIDVVVEPRSTSAYRVSKSVHKVIPFDF-K 60

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
           D   PA++ ++LG++++  YD+ +S      G    L++T    R+ +     +   + L
Sbjct: 61  DVNGPADWGNLLGIIRDSEYDVAMSLGRR-FGVGLLLWLTGIPVRIGF-----DVGASKL 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            LS      S+NL++    M+  +V  L          P P + +++ +   +    + +
Sbjct: 115 FLSNPV---SLNLNQYAAQMFHDLVRGLNI------TDPCPAVGINVPKADIDWAEAEQQ 165

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
             G ++  Y++IHG E+    ++      D + P++ W +I   L+  +   P+ VI   
Sbjct: 166 RLGIKESGYVLIHGGENKPDRTL----GVDRIYPVKQWQQIVKDLQTRQPNIPVAVIEGP 221

Query: 361 KEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
            + E +E ++    ++  I+ P  G+LAA I  +  ++ TN+  + LA A    +IALF 
Sbjct: 222 DDGEFIEALLTLCPALKVISPPDIGKLAATIAGANLMLCTNSGPMHLAVAVGTYTIALFG 281

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
                +L     E  +   I SRTGKL D P   +L  +
Sbjct: 282 PSDPAKLL---PENGRSIAIKSRTGKLADIPPSDILEKI 317


>gi|282896719|ref|ZP_06304727.1| Glycosyl transferase, family 9 [Raphidiopsis brookii D9]
 gi|281198437|gb|EFA73325.1| Glycosyl transferase, family 9 [Raphidiopsis brookii D9]
          Length = 317

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 159/342 (46%), Gaps = 36/342 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ G + + +LFFP +  L+  YP   IDVI   R K  +++NK+V     +D  D
Sbjct: 2   RVVALVPGSIADQILFFPTLDSLQRSYPDAWIDVIVEPRSKVAYQVNKSVHEVFSFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A+++++LG +++R YD+ ++ +     GL    FL+++    R+ +       +G
Sbjct: 62  R-NSLADWSNLLGTIRDREYDVAITARESWFVGL----FLWLSGIPTRIGF-----EGSG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           A  L        S N S+   ++Y  ++  LG           PPL +++ +   E   +
Sbjct: 112 ANFLTHVI----SPNTSQYISHIYHDLLRPLGI------NTSCPPLAINLVKPDVEWAKQ 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + K  G  +  YI+I+G        + S  D D++ PI  W +I    ++ +   P+ V+
Sbjct: 162 QQKRLGIGETGYILIYG----GYGELSSEPD-DNIYPIDSWQQIIGECQQKQPDLPILVV 216

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             + +   V  +V     I  I  P  G+LAA+I  ++ ++  N + +QLA A +  +IA
Sbjct: 217 KEQGDDYFVPSLVESFPDIKVIPVPDIGKLAAIIGSASLMVTVNGSPLQLAIAIQTYTIA 276

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L SS    +L   N    K   I S TGK  D P   VL  +
Sbjct: 277 LLSSADSNKLLPVN---DKFLAIKSPTGKTADIPPATVLKTI 315


>gi|298489841|ref|YP_003720018.1| family 9 glycosyl transferase ['Nostoc azollae' 0708]
 gi|298231759|gb|ADI62895.1| glycosyl transferase family 9 ['Nostoc azollae' 0708]
          Length = 319

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 161/342 (47%), Gaps = 35/342 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R    + G + + +LFFP +  LK  YP   IDVI  ++ K  + L+K+V     +D  D
Sbjct: 2   RVVAFVPGSISDQILFFPTLDDLKRYYPDAQIDVIVESQSKAAYRLSKSVHEVLTFDYSD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G +++R YD+ ++T      GL    FL++T    R+ Y        G
Sbjct: 62  H-NSLADWGNLVGTIRDREYDVAMTTGESWFIGL----FLWLTGIPTRIGY-----KGKG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           AG L +        N S+    MY  ++  +G       + P P L  ++ +   +   +
Sbjct: 112 AGFLTNTI----RRNTSQYVAAMYHDLLKPVGI------KTPCPELTANVPKPDIQWAQQ 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
           + K  G  +  YI+I+G    S  S Q  G  D+  PI  W +I    +  +P    V+ 
Sbjct: 162 EQKRLGIHETGYILIYG---GSSHSSQIDG-ADTSYPIASWQQIIQDCQHKQPELPIVLI 217

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
            E + E    V+ +  S + IT+P   G+L A+I+ ++ +++     +QLA A +  +IA
Sbjct: 218 KEADDENFVRVLLESCSNLKITSPDDIGKLTAIISGASLMLSIENGPLQLAVAVQTYTIA 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L SS    +L +P ++  K   I S +GK +D P   VL  +
Sbjct: 278 LLSSTDADKL-LPKSD--KFLAIKSHSGKTVDIPPPTVLEKI 316


>gi|282900108|ref|ZP_06308065.1| Glycosyl transferase, family 9 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194990|gb|EFA69930.1| Glycosyl transferase, family 9 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 317

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 159/342 (46%), Gaps = 37/342 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ G + + +LFFP +  L+  YP   IDVI   R K  +++NK V    ++D  D
Sbjct: 2   RVVALVPGSIADQILFFPTLDSLQRSYPDAWIDVIVEPRSKVAYQVNKYVHEVFLFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS---TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A+++++LG +++R YD+ ++   +   GL    FL+++    R+ +       +G
Sbjct: 62  R-NSLADWSNLLGTIRDREYDLAITAGESWFVGL----FLWLSGIPTRIGF-----QGSG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           +  L        S N S+   ++Y  ++  LG           PPL +++ +   E   +
Sbjct: 112 SNFLTHVI----SPNTSQYIPHIYHDLLQPLGI------NTACPPLAINLLKPDVEWAKQ 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + K  G  +  YI+I+G   D  +      D D++ PI  W +I    ++ +   P+ V+
Sbjct: 162 QQKLLGIGETGYILIYGGYGDLSS------DPDNIYPIDSWRQIIGECQQKQADLPILVV 215

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             + +   V  +V     I  I  P  G+LAA+I  ++ +I  N + +QLA A +  +IA
Sbjct: 216 KEQGDDYFVPSLVESFPDIKAIPVPDIGKLAAIIGGASLMITVNGSPLQLAIAIQTYTIA 275

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L SS    +L   N    K   I S TGK  D P   VL  +
Sbjct: 276 LLSSADSTKLLPVN---YKFLAIKSPTGKTADIPPATVLKTI 314


>gi|428780992|ref|YP_007172778.1| ADP-heptose--LPS heptosyltransferase [Dactylococcopsis salina PCC
           8305]
 gi|428695271|gb|AFZ51421.1| ADP-heptose:LPS heptosyltransferase [Dactylococcopsis salina PCC
           8305]
          Length = 320

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 159/341 (46%), Gaps = 32/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I GG+ + +LFFP +  LK+ YP   IDV+   R K  + +  +V    ++D  +
Sbjct: 2   RILTLIPGGIGDQILFFPTLADLKNTYPEAKIDVLVEPRSKSAYRVCSHVDEVLLFDYKN 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSY-IYPNVNAAGA 241
                A+Y ++LG++++R Y++ L+  L       FL ++     RV Y   P+   +  
Sbjct: 62  R-NSLADYLNLLGIIRDREYEVALT--LGRRWTVGFLLWLNGIPLRVGYETSPSFFFSQT 118

Query: 242 GLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
             L +E + A+  +   +G N+                  P PPL V++ +   +   ++
Sbjct: 119 VPLKTEQYAAQMYHDLLQGLNI----------------NTPCPPLSVNVPKSDIQWGEQE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K    + G YIVIHG    S    Q +G  D + P+  W EI    +  +   P+ ++ 
Sbjct: 163 QKRLSIQDGNYIVIHG---GSSQLAQEKG-IDKVYPVSKWQEIIADFQSKQPDLPIVLLQ 218

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             ++ + VE ++  + ++   + P  G+LA +I   + ++ T++A + LA A    +IAL
Sbjct: 219 GPEDEQWVEQLLEGNPNLKITSPPDIGKLAPMIAGGSLMLCTDSAPMHLAIAVGTYTIAL 278

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F      +L  P  E K C  + S TG + D  V +VL  +
Sbjct: 279 FGPTDSKKLLPP--ENKACLGVKSPTGAIADIEVNSVLEKI 317


>gi|428773015|ref|YP_007164803.1| glycosyl transferase family protein [Cyanobacterium stanieri PCC
           7202]
 gi|428687294|gb|AFZ47154.1| glycosyl transferase family 9 [Cyanobacterium stanieri PCC 7202]
          Length = 320

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 82/341 (24%), Positives = 158/341 (46%), Gaps = 32/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I GG+ + +LFFP ++ LKD+YP  +IDVI   R K  + +  +V+   V+D  D
Sbjct: 2   RILALIPGGIGDQILFFPTLKTLKDQYPKAVIDVIVEPRSKNAYRVCPHVKEVLVFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A+Y ++LG++++R Y++ ++           L++    +RV Y  P         
Sbjct: 62  K-NGLADYLNLLGIIRDREYELAVTLG-RNWAVGFLLWLNGIPNRVGYKGP--------- 110

Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                F    + L    Y   MY  +V  L         +P PPL +++ +   +    +
Sbjct: 111 --KSWFINNPVELKTEQYAAYMYHDLVHGLNI------TNPCPPLSINVPKEDIQWAESE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            +    ++  YI+IHG    S A  +++G  D + P+  W  I   ++  +   P+ ++ 
Sbjct: 163 QRRLDIKESGYILIHG---GSSALAKTKG-IDKIYPVAKWQRIVEDVQRKQPNLPIVLLN 218

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
              ++E   +++    ++  I+ P  G+L+A I  +  ++ T++A + L+ A    +IAL
Sbjct: 219 GPDDQEWTAEMLQLCNNLKVISPPDIGKLSAFIAGANLMLCTDSAPMHLSVAVGTYTIAL 278

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F      +L  PN++  +   I S +  + D   E +L  M
Sbjct: 279 FGPTQADKLLPPNSD--RFIGIQSLSKNIADISTEKILEKM 317


>gi|411118898|ref|ZP_11391278.1| ADP-heptose:LPS heptosyltransferase [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410710761|gb|EKQ68268.1| ADP-heptose:LPS heptosyltransferase [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 320

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 154/341 (45%), Gaps = 32/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R    + ++K+VR    +D   
Sbjct: 2   RIVALVPGGIGDQILFFPTLDDLKKTYPESQIDVVVEPRAAAAYRVSKSVRDVIPFDFKG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ ++LGV+++R YD+ LS  L   G   FL   T       I   V  AG G 
Sbjct: 62  R-NSLADFGNLLGVIRDREYDVALS--LGQRGAVGFLLWLTG------IPIRVGFAGGG- 111

Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
              E F    + L +  Y   MY  ++  LG         P P + VS+  +  +    +
Sbjct: 112 --GERFLTNPVPLKKDQYAACMYHDLLKGLGI------DSPCPDIAVSVPAKDLDWADAE 163

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K  G   G Y++IHG      + +      D + P+  W  I     + +   P+ V+ 
Sbjct: 164 RKRLGIGTGGYVIIHG----GSSQLAKEKGIDKIYPVDNWKGIIQDFNQRQPDLPIVVVQ 219

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
              ++E V+ ++  + ++   + P  G+L A+I  ++ ++ T++A + LA A +  ++AL
Sbjct: 220 GPDDQEFVQALLKVNPNLKITSPPDIGKLTAMIAGASLMLCTDSAPMHLAVAVKTYTLAL 279

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F      +L  PN  + +   I S TGK+ D     VL  +
Sbjct: 280 FGPTDPAKLL-PN--DSRFVGIKSMTGKMEDIVPATVLEKV 317


>gi|428206460|ref|YP_007090813.1| glycosyl transferase family protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428008381|gb|AFY86944.1| glycosyl transferase family 9 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 320

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 162/344 (47%), Gaps = 44/344 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD- 182
           R   ++ GG+ + +LFFP +  LK  YP   IDV+A       + + K+V  +NV   D 
Sbjct: 2   RIVALVPGGIGDQILFFPTLDDLKRNYPDAEIDVVAEPGSLGAYRICKSV--SNVLKFDF 59

Query: 183 DDWPEPAEYTDILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGA 241
            D     E+ D++G +++R YD+ + + K   +G A  L+++    RVSY        G 
Sbjct: 60  KDRNSLTEWVDLIGNIRDREYDIAICAGKRPFVGMA--LWLSGVAVRVSY-------QGG 110

Query: 242 GLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           G L    F   S+ L    Y   MY  ++  LG         P P L +++ +   E   
Sbjct: 111 GNL----FLTNSVPLKTEQYVAAMYHDLLKGLGI------STPCPDLTINVPKSDLEWAD 160

Query: 300 EKYKNAGAEQGKYIVIHGIESD---SKASMQSRGDTDSLLPIQVWAEIANGLREFR---P 353
           ++ +  G +   YI+IHG  S    +K S+Q+        P+Q W ++   L++ +   P
Sbjct: 161 KEQQRLGIKDSGYILIHGGSSQLAAAKTSIQT-------YPVQSWRQVIQNLQQRQPDLP 213

Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
           + ++    E E +   + +    + +T P   G+ AA++  +  ++ T++A + LA A +
Sbjct: 214 IAILVQTAEDEQLVRSLQEAFPDLKVTNPPDIGKTAAIVAAANLLLCTDSAPMHLAVAVQ 273

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
             +IALF +    +L  P+    +C  I S TGKL D   + VL
Sbjct: 274 TYTIALFGATDPKKLIPPS---DRCIAIKSPTGKLSDIAPQTVL 314


>gi|443320916|ref|ZP_21049989.1| ADP-heptose:LPS heptosyltransferase [Gloeocapsa sp. PCC 73106]
 gi|442789380|gb|ELR99040.1| ADP-heptose:LPS heptosyltransferase [Gloeocapsa sp. PCC 73106]
          Length = 320

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 157/343 (45%), Gaps = 42/343 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK ++P  LIDVI   R K  + +  +V     +D  D
Sbjct: 2   RILTLVPGGIGDQILFFPTLKDLKTQFPKALIDVIVEPRAKAAYRVCADVNEVLTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A+Y ++LGV+++R YD  L+     LG        L++     RV Y  PN    
Sbjct: 62  R-NSMADYLNLLGVIRDREYDAALT-----LGQRWTVGLLLWLNGIPTRVGYRSPNA--- 112

Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSI-SRRLKE 296
                    F +  + L    Y  +MY  ++  LG       + P PP+ + I ++ L  
Sbjct: 113 --------WFISHPVPLKTEQYAAHMYHDLLQGLGI------KKPCPPVSIDIPAQDLDW 158

Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-LF 355
              EK +      G YI+IHG    S    Q +G  D + P   W  +   ++  +P L 
Sbjct: 159 AQGEKQRLELQTDG-YILIHG---GSSQLAQVKG-IDKIYPAIKWQIVIQDIQRQKPELP 213

Query: 356 VIPHEKEREGV-EDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREK 411
           ++  +   +GV  + +    S + +T+P   G+LAA+I D+  ++ T++A + LA A   
Sbjct: 214 IVLIKGPEDGVWSEQMLKTCSNLKVTSPPDLGKLAAMIKDARLMLCTDSAPMHLAIALLT 273

Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
           P+IALF      +L  P  ++     I S T  + D   E +L
Sbjct: 274 PTIALFGPTSATKLIPPQNDQ--VIGIQSPTKAIADIVPETIL 314


>gi|428297573|ref|YP_007135879.1| glycosyl transferase family protein [Calothrix sp. PCC 6303]
 gi|428234117|gb|AFY99906.1| glycosyl transferase family 9 [Calothrix sp. PCC 6303]
          Length = 319

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 164/339 (48%), Gaps = 29/339 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + LLFFP +  LK     + IDV+   + K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIDDQLLFFPTLDDLKRYKSDIQIDVVVEPKSKAAYRVSKSVNQVIGFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+ ++   + L   A L++T    R+ Y        G+ L
Sbjct: 62  R-NSLADWGNLIGTIRDREYDVAITVGQSWLMGFA-LWLTGIPIRIGY-----KGQGSFL 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
             S  + AE   +     NMY  ++  L      V   P   L +++ +   +   ++ +
Sbjct: 115 TNSVPYKAEQYKV-----NMYHDLLQGL------VIDSPAGELTINVPKTDIDWAEKEQR 163

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI--PHEK 361
             G ++  YI+I+G  S       ++G  + + P+  W +I    ++ +P   I    E 
Sbjct: 164 RLGVKETGYILIYGGSSQVAG---TKGVDEEVYPVTNWKQIVKDFQDKQPDLAILAIQEP 220

Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           + +G  +   +    + IT+P   G+LAA+I+ +  ++ T++A++QLA A +  +I LF 
Sbjct: 221 DNDGFANSFREIFPNIKITSPEDVGKLAAMISGANLILCTSSASMQLAIAVQTYTIGLFG 280

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           S    +L +P +E  K   I S TGKL+D   E V+  +
Sbjct: 281 SADPMKL-MPKSE--KFLPIQSSTGKLVDISPETVMQKV 316


>gi|428226657|ref|YP_007110754.1| glycosyl transferase family protein [Geitlerinema sp. PCC 7407]
 gi|427986558|gb|AFY67702.1| glycosyl transferase family 9 [Geitlerinema sp. PCC 7407]
          Length = 319

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 152/342 (44%), Gaps = 39/342 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +DV+   R K  + +   V    V+D   
Sbjct: 2   RILALVPGGIGDQILFFPTLDSLKQAYPKAKLDVVVEPRAKAAYRVCPFVNEEIVFDFKG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ ++LGV+++R YD VLS   +       L++T    RV Y      +AGA  
Sbjct: 62  S-NSLADWGNLLGVIRDREYDAVLSLGRS-WAVGVLLWLTGVPTRVGY----AGSAGAWC 115

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
           L +       +   +   + Y  ++  LG         P P  R+S+ +R  E    + K
Sbjct: 116 LTNPV----PLKTEQYAASQYHDLLSGLGI------SGPCPDPRISVPKRDIEWAEAEQK 165

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSR-GDTDSLLPIQVWAEIANGLREFR------PLFV 356
             G  +  Y++IH       AS  SR    +   P Q W  I   +R+FR      P+ +
Sbjct: 166 RLGIHESGYVLIH-----PGASALSRLKGIEKTYPAQSWKSI---IRDFRQRQPGLPVVL 217

Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
           +   ++RE VE +  D  + V I TP   G+LAALI  +  ++ T++  + LA A +  +
Sbjct: 218 VQGPEDREIVEAIAAD--ADVKIATPDDIGKLAALIAGANLMLCTDSGPMHLAVAVQTYT 275

Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
           IALF      +L        K   + S T ++ D   + VL 
Sbjct: 276 IALFGPTDSTKLL---PTSSKIIALQSTTNRVADITPDDVLK 314


>gi|186685109|ref|YP_001868305.1| glycosyl transferase family protein [Nostoc punctiforme PCC 73102]
 gi|186467561|gb|ACC83362.1| glycosyl transferase, family 9 [Nostoc punctiforme PCC 73102]
          Length = 319

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 156/340 (45%), Gaps = 31/340 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R K  + ++K+V     +D +D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRNYPDAQIDVVVEPRSKAAYRVSKSVHEVLNFDFND 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+V+  K      +  L++T    R+ Y           +
Sbjct: 62  R-NSLADWGNLVGTIRDREYDVVIVVKQI-WLLSLLLWLTGIPIRIGY------KGNGSV 113

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            L+     ++       Y+   Q ++            PVP L V++ +   E   ++ K
Sbjct: 114 FLTHAVPFKASQYVAAAYHDLLQPLEI---------NSPVPELAVNVPKPDIEWAQKEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
             G  +  YI+IHG        +    + D + P++ W +I  G ++ +   P+ V+   
Sbjct: 165 RLGVHETGYILIHG----GSGQLSQAKELDKIYPVESWHQIIQGFQDKQPDLPVVVVKGI 220

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
            + + V  ++G    I  +T P   G+L A+I  +  +++T++ A+QL+ A +  +IALF
Sbjct: 221 GDEQFVRSLLGSSPDIK-VTAPDDIGKLTAMIAGANLMLSTDSPALQLSVAVQTYTIALF 279

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
                 +L   N    K   I+S TGK  D    AVL  +
Sbjct: 280 GPTDPAKLLPKN---DKFLAIASPTGKTADVSPNAVLEKI 316


>gi|428211909|ref|YP_007085053.1| ADP-heptose--LPS heptosyltransferase [Oscillatoria acuminata PCC
           6304]
 gi|428000290|gb|AFY81133.1| ADP-heptose:LPS heptosyltransferase [Oscillatoria acuminata PCC
           6304]
          Length = 324

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 160/345 (46%), Gaps = 36/345 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R K  + ++K     ++  L  
Sbjct: 2   RILALVPGGIGDQILFFPTLDSLKRVYPNAYIDVVVEPRSKGAYRVSKFFHDKSLQVLPF 61

Query: 184 DWPEP---AEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVN 237
           D+ +    A++ ++LG+M++R YD V+S     + GL     L++T    R+SY     +
Sbjct: 62  DFKDRNGLADWGNLLGIMRDREYDAVISLGQRWVVGL----LLWLTGIPIRISY-----S 112

Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
             G   LL+     ++    +    MY  +++  G         P P L V++ +     
Sbjct: 113 DLGNSWLLTNPVPLKT---EQYAAGMYHDLLEGFGVSM------PCPELSVTLLKEDLRW 163

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF-- 355
             E+ +  G +   YI+IHG  S      Q +G  D + P++ W +I   L+  +P    
Sbjct: 164 SDEEQQRLGVKDSGYILIHGGASQLS---QIKG-LDKIYPVEKWQQIIQDLQTRQPQLPV 219

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKP 412
           V+    E + + D +      V +T+P   G+LAA I  +  ++ T++A + LA A +  
Sbjct: 220 VLIQGPEDKPLCDQLVRFCPGVKVTSPPDVGKLAATIAAANLMLCTDSAPMHLAVAVQTY 279

Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           +IALF     G+L  P     +   I S TG++ D   E VL  +
Sbjct: 280 TIALFGPTDPGKLLPPG---DRYIGIQSSTGRIADISPEKVLEKV 321


>gi|427717097|ref|YP_007065091.1| glycosyl transferase family protein [Calothrix sp. PCC 7507]
 gi|427349533|gb|AFY32257.1| glycosyl transferase family 9 [Calothrix sp. PCC 7507]
          Length = 319

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/343 (24%), Positives = 163/343 (47%), Gaps = 37/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  L+  YP   IDV+   R K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDNLQRYYPNAQIDVVVEPRSKAAYRVSKSVHEVLTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++++++G +++R YD  ++ + + L     L++T    R+ Y        G+  
Sbjct: 62  R-NSLADWSNLVGSIRDREYDTAIALEQSWL-VGLLLWLTGIPTRIGY-----QGKGSVF 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
           L +          S+     Y  ++  LG         P P L V+I +   E    + K
Sbjct: 115 LTNPV----PFKTSQYVAAAYHDLLQPLGI------ESPYPELAVNIPKPDIEWAQNEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR------PLFVI 357
             G  +  Y++I+G    SK S  ++ D D++ P+  W +I   ++EF+      P+ +I
Sbjct: 165 RLGVHETGYVLIYG---GSKQSSPTQ-DQDTIYPVDKWQQI---IQEFQHKQPDLPIVLI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
              ++ E V  ++ D +  V +T+P   G+LAA++  ++ ++ T++  +QL+ A +  +I
Sbjct: 218 KTPEDEEFVRSLL-DSSPNVKVTSPDDIGKLAAIVGGASLMLTTDSVPLQLSVAVQTYTI 276

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      ++     +  K   I S TGK+ D   + VL  +
Sbjct: 277 ALFGPTDPAKVL---PKSDKFLAIKSPTGKVADISPQVVLQKI 316


>gi|428770822|ref|YP_007162612.1| glycosyl transferase family protein [Cyanobacterium aponinum PCC
           10605]
 gi|428685101|gb|AFZ54568.1| glycosyl transferase family 9 [Cyanobacterium aponinum PCC 10605]
          Length = 320

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 157/339 (46%), Gaps = 28/339 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I GG+ + +LFFP +Q LK++YP   IDVI   R K ++ + K+V+   V+D  D
Sbjct: 2   RVLVLIPGGISDQILFFPTLQTLKNKYPQATIDVIVEPRSKNSYRICKHVQEVLVFDFQD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A+Y ++LG++++R Y++V++ +   + +   L++     RV Y       +    
Sbjct: 62  R-NGLADYLNLLGMIRDREYELVITLEQNWIVN-FLLWLDGIPTRVGY------QSSNSW 113

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            L+ T      N  +    MY  ++  L          P P L V++ R   E    + K
Sbjct: 114 FLNCTIPK---NTDQYIPFMYHDLLKALKI------DDPCPDLSVNVPREDIEWAESEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
               +   YI+IHG      + + +    + + P+  W  +   +R  +   P+ ++   
Sbjct: 165 RLKIKDTGYIIIHG----GASILTAYQGINKIYPVPKWQRVIEDIRMKQPDIPIVLLCGP 220

Query: 361 KEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
            + E   ++V     +  ++ P  G+LAA+I  +  ++ T++A +QLA A    +IALF 
Sbjct: 221 DDLEWTTEIVSLCPYVKVVSPPDIGKLAAIIAGANLMLCTDSAPMQLAIAVGVYTIALFG 280

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
                +L  P  +  +   I S + ++ D P + +L  +
Sbjct: 281 PTKTTKLLPPKCD--RVFGIQSLSSQIADIPTDKILEQI 317


>gi|428305070|ref|YP_007141895.1| glycosyl transferase family protein [Crinalium epipsammum PCC 9333]
 gi|428246605|gb|AFZ12385.1| glycosyl transferase family 9 [Crinalium epipsammum PCC 9333]
          Length = 320

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/341 (26%), Positives = 157/341 (46%), Gaps = 32/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP I  LK  YP   IDVI   R    + + K+V     +D  +
Sbjct: 2   RILALVPGGIGDQILFFPTIDDLKQNYPDAQIDVIVEPRSTGAYRVCKSVNEVMRFDFRN 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ ++LG++++R YD  +S      G    L++T    RV Y      ++G   
Sbjct: 62  R-NGMADWGNLLGIIRDREYDFAVSLGRT-WGVNFLLWLTGIATRVGY------SSG--- 110

Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
            L E F +  + L    Y   MY  ++   G    S    P PPL V++ ++  E     
Sbjct: 111 -LKEAFISNPVPLKTEQYTAEMYHDLIK--GIDINS----PCPPLAVNVPKQDIEWAELA 163

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            +  G  +  YIVIHG    S    QS+G  D   P++ W +I   +++ +   P+ VI 
Sbjct: 164 QQRLGINETGYIVIHG---GSSQISQSQG-IDKTYPVKKWQQIVEDIQQRQPNLPVVVIN 219

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
              ++  V  ++G    +  I+ P  G+LAALI  +  ++ T++A + +A A     IAL
Sbjct: 220 GPDDQAFVSSLLGYCPGVKVISPPDIGKLAALIAGANLMLCTDSAPMHIAVAVGTYLIAL 279

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F    + +  +P ++  +   + S T  + D P   VL  +
Sbjct: 280 FGPT-EAKKLLPKSD--RTIGLQSSTRSVADIPPADVLEKI 317


>gi|440681641|ref|YP_007156436.1| glycosyl transferase family 9 [Anabaena cylindrica PCC 7122]
 gi|428678760|gb|AFZ57526.1| glycosyl transferase family 9 [Anabaena cylindrica PCC 7122]
          Length = 314

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 154/343 (44%), Gaps = 42/343 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFF  +  LK  YP   IDVI   R K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFATLDDLKRNYPHAQIDVIVEPRSKAAYRVSKSVHEVLSFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G+M++R YD+ ++ +     GL     L++T    R+ Y        G
Sbjct: 62  R-NSLADWGNLVGMMRDREYDIAITVEPSWFVGL----LLWLTGIPTRIGY-----KGKG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           AG L +       +N S+    MY  ++  LG         P P L V++ +   E   +
Sbjct: 112 AGFLTNSV----PVNTSQYVAAMYHNLLQPLGI------NTPCPDLAVNVPKSDLEWAQQ 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + K  G  +  YI+I G   +          +D+  P+  W +I    +  +   P+ VI
Sbjct: 162 EQKRLGVHETGYILISGGSGNC---------SDTSYPVASWQQIIQDCQNKQPHLPVMVI 212

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
               + + V   + +    + IT+P   G+L A+I  ++ +++  +A +QL+ A +  +I
Sbjct: 213 KEANDEQFVRS-LQESCPDIKITSPDDIGKLTAIIGGASLMLSIESAPLQLSVAVQTYTI 271

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
            L  S   G+L     +  K   I S TGK  D P   VL  +
Sbjct: 272 TLLGSTDPGKLL---PKSDKFLAIKSPTGKTGDIPPAIVLEKI 311


>gi|428319576|ref|YP_007117458.1| glycosyl transferase family 9 [Oscillatoria nigro-viridis PCC 7112]
 gi|428243256|gb|AFZ09042.1| glycosyl transferase family 9 [Oscillatoria nigro-viridis PCC 7112]
          Length = 321

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/345 (25%), Positives = 159/345 (46%), Gaps = 44/345 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R K  + + K+V+    Y+  D
Sbjct: 3   RILALVPGGIGDQILFFPTLDDLKQSYPEAQIDVMVEPRAKGAYRVCKSVKEVLTYNFKD 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ ++LGV+++R Y+ V+S     LG        L++T    RV   Y   N A
Sbjct: 63  R-NAMADWGNLLGVIRDREYEAVIS-----LGQRWTVGLLLWLTGIPQRVG--YSGTNGA 114

Query: 240 ----GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
                A  L +E + A   +   +G+N+                ++P PPL +++ ++  
Sbjct: 115 RFLTAAVPLKTEQYAASMYHDLLQGFNI----------------KNPCPPLAINVPKQDI 158

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR--- 352
           +    + +  G ++  YI+IHG  S    S       D + P   W +I    ++ +   
Sbjct: 159 QWAEAEQQRLGVKESGYILIHGGSSQLAVSK----GIDKIYPTDNWKQIIEDCQQRQPNL 214

Query: 353 PLFVIPHEKEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANARE 410
           P+ V+   ++ + V  +V    ++  I+    G+LAA I  +  +I T++A + LA A +
Sbjct: 215 PVVVVKGPEDADFVTKLVELCPNVKVISPDDVGKLAATIAAANLMICTDSAPMHLAVAVQ 274

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
             +IALF      +L  P ++  +   I S TGK+ D   + VL 
Sbjct: 275 TYTIALFGPTEPAKLL-PKSD--RFLGIKSPTGKMADISPQEVLK 316


>gi|334119028|ref|ZP_08493115.1| glycosyl transferase family 9 [Microcoleus vaginatus FGP-2]
 gi|333458499|gb|EGK87116.1| glycosyl transferase family 9 [Microcoleus vaginatus FGP-2]
          Length = 321

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 90/343 (26%), Positives = 157/343 (45%), Gaps = 36/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDVI   R K  + + K+V+    Y+  D
Sbjct: 3   RILALVPGGIGDQILFFPTLDDLKQSYPEAQIDVIVEPRAKGAYRVCKSVKEVLTYNFKD 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A+  ++LGV+++R Y+ V+S     LG        L++T    RV Y   N    
Sbjct: 63  R-NAMADLGNLLGVIRDREYEAVIS-----LGQRWTVGLLLWLTGIPQRVGYSGTN---- 112

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           GA  L +    A  +   +   +MY  ++            +P PPL +++ ++  +   
Sbjct: 113 GARFLTA----AIPLKTEQYAASMYHDLLQGFNI------TNPCPPLAINVPKQDIQWAE 162

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
            + +  G ++  YI+IHG  S    S       D + P   W +I    ++ +   P+ V
Sbjct: 163 AEQQRLGVKESGYILIHGGSSQLAVSK----GIDKIYPTDNWKQIIEDCQQRQPNLPVVV 218

Query: 357 IPHEKEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
           +   ++ E V  +V    ++  I+    G+LAA I  +  +I T++A + LA A +  +I
Sbjct: 219 VKGPEDAEFVTKLVELCPNVKVISPDDVGKLAATIAAANLMICTDSAPMHLAVAVQTYTI 278

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +L  P ++  +   I S TGK+ D   + VL  +
Sbjct: 279 ALFGPTEPAKLL-PKSD--RFLGIKSPTGKMADISPQEVLKKV 318


>gi|427730985|ref|YP_007077222.1| ADP-heptose--LPS heptosyltransferase [Nostoc sp. PCC 7524]
 gi|427366904|gb|AFY49625.1| ADP-heptose:LPS heptosyltransferase [Nostoc sp. PCC 7524]
          Length = 319

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/339 (23%), Positives = 156/339 (46%), Gaps = 29/339 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +DV+   R K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVTEPRSKAAYRVSKSVNEVLSFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+ ++   + L    FL++T    R+ Y        G+G 
Sbjct: 62  R-NSLADWGNLVGTIRDREYDLAITVGQSWL-VGLFLWLTGIPTRIGY-----QGKGSGF 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
           L            S+    +Y  ++  LG         P P L V++ +   E    + K
Sbjct: 115 LTKSV----PFKPSQYAATVYHDLLQPLGI------DTPTPELGVNVPKPDIEWAQNEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHEK 361
             G  +  Y++I+G        + S    D++ P++ W EI    +  +P    V+    
Sbjct: 165 RLGVNETGYVLIYG----GSGWVSSTKGADAIYPLENWQEIIQDFQHKQPDLPIVVIQGA 220

Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
             EG    + D ++ + +T P   G+LAA+I  ++ +++  +A +QL+ A +  +IAL  
Sbjct: 221 NDEGFVRSLRDLSANIKVTAPEDIGKLAAIIAGASLMLSNESAPLQLSVAVQTYTIALLG 280

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           +    ++  P ++  K   I S +G++ D   + VL  +
Sbjct: 281 ATEPAKVL-PKSD--KFLGIKSPSGRVADISPQQVLQKI 316


>gi|119510034|ref|ZP_01629175.1| Glycosyl transferase, family 9 [Nodularia spumigena CCY9414]
 gi|119465358|gb|EAW46254.1| Glycosyl transferase, family 9 [Nodularia spumigena CCY9414]
          Length = 318

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 159/341 (46%), Gaps = 34/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +DV+   R K  + ++K+V     +D +D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRNYPNAQLDVVVEPRSKAAYRVSKSVNDVLSFDYND 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++++++G +++R YD V++   + L     L++T    RV Y        GAG 
Sbjct: 62  R-NSLADWSNLVGTIRDREYDAVITVGQSWL-MGLLLWLTGIPTRVGY-----QGKGAG- 113

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                F  +S+      Y   +Y  ++    +PF    + P   L V+I +   E    +
Sbjct: 114 -----FLTDSVPFQANKYVAAVYHDLL----QPFGI--KTPCRELAVNILKPDIEWSQNE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPH 359
            K  G  +  YI+IH     + +S  +  D D L P++ W +I    ++ +P    V+  
Sbjct: 163 QKRLGVNETGYILIH-----AGSSQVTPTDVDKLYPVENWQQIIQECQQKQPDLPVVVIQ 217

Query: 360 EKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             E + +   + +    + +T P   G++AA+I  ++ +++T+   + L+ A +  +IAL
Sbjct: 218 GSENDPIRRSLLERTPGIKVTYPTNIGKIAAMIGGASLMLSTDDPLLHLSVAVQTYTIAL 277

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F      +L     +  K   I S TG++ D   + VL+ +
Sbjct: 278 FGPTDPAKLL---PKSDKFLAIKSPTGQMADISPQTVLDKI 315


>gi|428308848|ref|YP_007119825.1| ADP-heptose--LPS heptosyltransferase [Microcoleus sp. PCC 7113]
 gi|428250460|gb|AFZ16419.1| ADP-heptose:LPS heptosyltransferase [Microcoleus sp. PCC 7113]
          Length = 320

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 164/343 (47%), Gaps = 36/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  L+  YP   IDVI   R K  + ++ +V     +D  D
Sbjct: 2   RILALVPGGIGDQVLFFPTLDDLRAYYPTAEIDVIVEPRAKGAYRVSGSVDEVLTFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
                A++ ++LG++++R YD+ LS  + A +G    L+M+    R+ Y       A AG
Sbjct: 62  R-NGLADFGNLLGIIRDREYDVALSLAQRASIG--LLLWMSGIPTRIGY------EANAG 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
               + F  +++ L    Y  ++Y  M+  LG         P P L +S+ +   E    
Sbjct: 113 ----KWFLTDTVPLKTEQYTADIYHDMLQALGI------NTPCPDLTLSVPKPDIEWAER 162

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHE 360
           + +  G  +  YI+   I++ S    Q +G T+ + P++ W +I   +++ +P   I   
Sbjct: 163 EQQRLGIAESGYIL---IDNSSTQLAQEKG-TEQVYPVEKWRQILADIQQKQPNLPIVML 218

Query: 361 KEREGVEDVVGDDASI--VFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
           +  E  +       S   + IT+P   G+LAA+I  +  +I T++A + +A A    +IA
Sbjct: 219 RGSEDAQMSATMLQSFPDLKITSPADIGKLAAMIAGANLIICTDSAPMHVAVAVGTYTIA 278

Query: 416 LFS-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           LF  +E K RL  P +E   C  I S T ++ D   E VL  +
Sbjct: 279 LFGPTEAKKRL--PQSE--NCMSIQSPTKQIADIKPEDVLKQL 317


>gi|428778324|ref|YP_007170111.1| glycosyl transferase family protein [Halothece sp. PCC 7418]
 gi|428692603|gb|AFZ45897.1| glycosyl transferase family 9 [Halothece sp. PCC 7418]
          Length = 320

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 156/343 (45%), Gaps = 36/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK+ YP   IDV+   R K  + +   V    V+D  +
Sbjct: 2   RILTLVPGGIGDQILFFPTLADLKNAYPEAQIDVLVEPRAKSAYRVCSYVHDVLVFDYKN 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A+Y ++LGV+++R Y++ L+     LG        L++     RV Y      ++
Sbjct: 62  R-NSLADYLNLLGVIRDREYEVALT-----LGRRWTVGFLLWLNGIPVRVGY-----ESS 110

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           G+  L S       +   +    MY  ++  L          P PPL+V++ +   +   
Sbjct: 111 GSAFLSSTV----PLKTEQYAAQMYHDLLKGLNI------NTPCPPLQVNVPKSDIQWGK 160

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
           ++ +    + G YIVIHG    S    Q +G  D + P+  W +I   L+  +   P+ +
Sbjct: 161 QEQERLDIKDGNYIVIHG---GSSQLAQEKG-IDKIYPVTKWKDIIADLQAKQPNLPIVL 216

Query: 357 IPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
           +   ++   VE ++  +  +   + P  G+L+A+I   + ++ T++A + LA A    +I
Sbjct: 217 VQGPEDEMWVEQLLQVNPHLKVTSPPDIGKLSAMIAGGSLMLCTDSAPMHLAVAVGTYTI 276

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +L  P  +      I S TG + D  V +VL  +
Sbjct: 277 ALFGPTEPKKLLPP--DNSSYVGIKSPTGAIADIEVNSVLEQI 317


>gi|425463419|ref|ZP_18842758.1| Glycosyl transferase [Microcystis aeruginosa PCC 9809]
 gi|389833249|emb|CCI22392.1| Glycosyl transferase [Microcystis aeruginosa PCC 9809]
          Length = 320

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 159/342 (46%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + LLFFP ++ L+  YP   IDV+   R K  + L   V+   ++D  D
Sbjct: 2   RILTLVPGGISDQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y+ +V    +  +  P  P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYQALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 EGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGERHR--YIEANTGKVADITPETVLKKI 317


>gi|307152521|ref|YP_003887905.1| glycosyl transferase family protein [Cyanothece sp. PCC 7822]
 gi|306982749|gb|ADN14630.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7822]
          Length = 328

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 26/342 (7%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK +YP   +DVI   R K  + +   V    ++D  D
Sbjct: 2   RILALVPGGIGDQILFFPTLEDLKQKYPNSTLDVIVEPRAKSAYRICPYVHEVLLFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A+Y ++LGV+++R YD+ +S           L++     RV Y     N +    
Sbjct: 62  R-NGLADYLNLLGVIRDREYDVAISLP-QRWTIGLLLWLNGIPVRVGY---KTNTSVRKE 116

Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
             +  F   ++ L    Y   MY  ++  L        + P PPL+V++ +   +    +
Sbjct: 117 FRTPIFLTNAVPLKTEQYAAYMYHDLLQGLDI------QTPTPPLKVTLPKDDIDWAEAE 170

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K    +   YI+IHG  S S A    +G  D   P+  W +I + +   +   P+ ++ 
Sbjct: 171 QKRLEIKDTGYIIIHGGSSTSAA---IKG-VDKNYPLPQWQKIVDEILAKQPDLPIVLLQ 226

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             ++ + V+ ++    ++  +T P   G+LAA+I  +  ++ T++  + L+ A    +IA
Sbjct: 227 GPEDPQWVKQLLEGHPNLK-VTKPTDVGKLAAIIAGANLMLCTDSPPMHLSVAVGTYTIA 285

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           LF   +  +L  P+    +   I S++G L D   E +L AM
Sbjct: 286 LFGPTVAEKLLPPDP--TRYIGIQSKSGNLADIKPETILEAM 325


>gi|354568658|ref|ZP_08987821.1| glycosyl transferase family 9 [Fischerella sp. JSC-11]
 gi|353539912|gb|EHC09392.1| glycosyl transferase family 9 [Fischerella sp. JSC-11]
          Length = 319

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 161/341 (47%), Gaps = 33/341 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R K  + ++K+V    ++D  D
Sbjct: 2   RIVALVPGGISDQILFFPTLDDLKRNYPDAQIDVVVEPRSKAAYRVSKSVNDVLMFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G++++R YD+ ++     L     L++T    R+ Y        GA  
Sbjct: 62  R-NSLADWANLVGILRDREYDVAIALT-QSLLMGLLLWLTGIPTRIGY-----KGKGAA- 113

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                F   ++ L +  Y  ++Y  ++  LG       + P P L V++ +   +   ++
Sbjct: 114 -----FLTNTVPLKQEQYAADVYHTLLQELGI------KSPCPQLAVNVPKLDIDWADKE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEK 361
               G ++  YI+IH    +    + +    D   P + W +I    ++ +P   +   +
Sbjct: 163 QLRLGVKETGYILIH----NGVNELPANQGQDRTYPAENWLDIIQDCQQKQPDIPVMVVQ 218

Query: 362 EREGVEDVVGDDASI--VFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
           E    E V     SI  + IT+P   G+LAA+I  +  ++ T++AA+QLA A +  +IAL
Sbjct: 219 EPGQQEFVAQLKQSIPDIKITSPDNMGKLAAMIAGANLLLCTDSAAMQLAVAVQTYTIAL 278

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F S    +L  P +E  K   I S TGK+ D   + +L  +
Sbjct: 279 FGSTDPAKLL-PKSE--KIIAIKSSTGKIADISPKTILEKI 316


>gi|300866017|ref|ZP_07110751.1| glycosyl transferase family protein [Oscillatoria sp. PCC 6506]
 gi|300336008|emb|CBN55909.1| glycosyl transferase family protein [Oscillatoria sp. PCC 6506]
          Length = 319

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 157/343 (45%), Gaps = 37/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +D I   R K  + + K+V+    YD  D
Sbjct: 2   RIVALVPGGIGDQILFFPTLDGLKQLYPNTQVDAIVEPRAKGAYRVCKSVKEVIPYDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ ++LG++++R Y+ V+S     LG        L++T    RV Y      A 
Sbjct: 62  R-NALADWGNLLGIIRDREYEAVIS-----LGQRWSVGLLLWLTGINKRVGY------AG 109

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           G  + L++    ++   +   Y+   Q  D            P PPL +++ ++  +   
Sbjct: 110 GGSIFLTDAIPLKTEQYAAEMYHDLLQGFD---------INTPCPPLAINVPKQDIQWAE 160

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VI 357
            + +  G +   YI+IHG    S+ ++ S+G  D + P+  W +I   L++ +P    V+
Sbjct: 161 AEQQRLGIKDSGYILIHG--GSSQLAL-SKG-IDKIYPVDKWQQIIQDLQQRQPNLPVVV 216

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
               E     D + +    V    P   G+LAA I  +  ++ T++A + LA A +  +I
Sbjct: 217 VKGPEDAAFVDKLVESCPQVLAIAPDDIGKLAAAIAAANLMLCTDSAPMHLAVAVQTYTI 276

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +L  P ++  +   + S TGK+ D   + VL  +
Sbjct: 277 ALFGPTDPAKLL-PKSD--RFIGLKSPTGKMADISPQDVLKKV 316


>gi|422304228|ref|ZP_16391575.1| Glycosyl transferase [Microcystis aeruginosa PCC 9806]
 gi|389790695|emb|CCI13446.1| Glycosyl transferase [Microcystis aeruginosa PCC 9806]
          Length = 320

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 158/342 (46%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLRQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T  + +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRSAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F +E +     GY  + Y  +V    +  +  P  P PPL+++++R   +    
Sbjct: 113 -----WFLSEQVPRPTEGYLADNYHALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       +++ P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETVYPLESWLEIIQGIRHQQPELPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|166364044|ref|YP_001656317.1| glycosyl transferase family protein [Microcystis aeruginosa
           NIES-843]
 gi|166086417|dbj|BAG01125.1| glycosyl transferase [Microcystis aeruginosa NIES-843]
          Length = 320

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   V+   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y+ +V    +  +  P  P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYQALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 EGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGERHR--YIEANTGKVADITPETVLKKI 317


>gi|425470313|ref|ZP_18849183.1| Glycosyl transferase [Microcystis aeruginosa PCC 9701]
 gi|389884129|emb|CCI35564.1| Glycosyl transferase [Microcystis aeruginosa PCC 9701]
          Length = 320

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 156/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y+ +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADHYQALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITAETVLKKI 317


>gi|75908631|ref|YP_322927.1| glycosyl transferase family protein [Anabaena variabilis ATCC
           29413]
 gi|75702356|gb|ABA22032.1| Glycosyl transferase, family 9 [Anabaena variabilis ATCC 29413]
          Length = 319

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 157/340 (46%), Gaps = 31/340 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +DV+   R K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVVEPRSKAAYRVSKSVNDILTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+ ++   + L    FL++T    R+ Y        GA  
Sbjct: 62  R-NSLADWGNLVGTIRDREYDVAIAVGQSWL-VGLFLWLTGIPTRIGY-----QGKGASF 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
           L            S+    +Y  ++    +PF      P P L V++ +   +    + K
Sbjct: 115 LTKTV----PFKPSQYAAAVYHDLL----QPFGIT--TPTPELAVNVPKPDIDWANSEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
             G  +  Y++I+G      + +      DS+ P++ W EI    R+ +   P+ VI   
Sbjct: 165 RLGVSETGYVLIYG----GSSWVSQPQALDSIYPVENWQEIIQDFRQKQPDLPIVVIQGP 220

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
            + + V   + D +  + +T+P   G+L A+I  +  ++ T++A + L+ A +  +IALF
Sbjct: 221 DDEQFVRS-LRDYSLDIKVTSPDNVGKLTAMIAGANLMLTTDSAPLHLSVAVQTYTIALF 279

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
            S    +L     +  K   I S TG++ D   +AVL  +
Sbjct: 280 GSTDPVKLL---PQSDKLLGIKSPTGRVADILPKAVLEKV 316


>gi|113474466|ref|YP_720527.1| glycosyl transferase family protein [Trichodesmium erythraeum
           IMS101]
 gi|110165514|gb|ABG50054.1| glycosyl transferase, family 9 [Trichodesmium erythraeum IMS101]
          Length = 321

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 154/343 (44%), Gaps = 37/343 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  Y    IDVI   R K  +++ K+VR    YD  D
Sbjct: 2   RILALVPGGIGDQILFFPTLDDLKQSYRESQIDVIVEPRSKGAYQVCKSVRDVLTYDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ ++LG+M++R Y+ V+S     LG        L++T    RV Y      A 
Sbjct: 62  A-NSLADWGNLLGIMRDREYEAVIS-----LGQRWTVGLLLWLTGIPKRVGY------AG 109

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
             G+ LS+    ++   +   Y+   Q ++ +   F+ +  +        + + +    A
Sbjct: 110 NGGIFLSDPIPLKTEQYAAHMYHDLLQGMN-INTTFKGISIN-------VLKKDIAWAEA 161

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-----L 354
           E+ +   AE G YI+IHG    S    Q +G  D + P   W EI + L++ +P     L
Sbjct: 162 EQQRLGVAESG-YILIHG---GSSKLAQIKG-IDKIYPTNYWLEIISQLQQKQPNLPVVL 216

Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
              P +            D  IV     G+LAA I  +  ++ T++A + LA A    +I
Sbjct: 217 VKGPEDGAWSSEISQSSRDVKIVIPGDVGKLAAFIAAANLMLCTDSAPMHLAVAVGTYTI 276

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +L  P ++  +   + S TGK+ D   + VL  +
Sbjct: 277 ALFGPTDPKKLL-PKSD--RVIAVKSSTGKMADILPQQVLQKI 316


>gi|434406201|ref|YP_007149086.1| ADP-heptose:LPS heptosyltransferase [Cylindrospermum stagnale PCC
           7417]
 gi|428260456|gb|AFZ26406.1| ADP-heptose:LPS heptosyltransferase [Cylindrospermum stagnale PCC
           7417]
          Length = 320

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 157/339 (46%), Gaps = 34/339 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ G + + +LFF  +  LK  YP   IDV+   R K  ++++K+V     +D  D
Sbjct: 2   RVVALVPGSIGDQILFFATLDDLKRNYPNAQIDVVVEPRSKAAYQVSKSVHEVLTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+ ++   + L     L++T    R+ Y        G G 
Sbjct: 62  R-NSLADWGNLVGTIRDREYDVAIAVGQSWL-IGLLLWLTGIPVRIGY-------TGKG- 111

Query: 244 LLSETFTAESM--NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                F   S+    S+     Y  ++  LG         P P L +++ +   E    +
Sbjct: 112 ---SVFLTRSVPPKPSQYAAAAYHDLLQPLGI------NSPCPELALNVPKPDIEWAQRE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K  G  +  YI+I+G    +   +      D++ PI+ W +I    +  +   P+ VI 
Sbjct: 163 QKRLGVHETGYILING----ASNHLAPADGLDTVYPIENWLQIIQDCQHKQPDLPVVVIK 218

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
              + + V  ++ + +  + +T+P   G+LAA+I  ++ +++T++  + L+ A +  +IA
Sbjct: 219 GSDDEQFVRSLLLESSPDIKVTSPDDIGKLAAIIGGASLMLSTDSVPLHLSVAVQTYTIA 278

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
           LF S     L  PN++  K   I S TGK+ D   + VL
Sbjct: 279 LFGSTDPANLL-PNSD--KFLAIKSPTGKMADISPQTVL 314


>gi|427706482|ref|YP_007048859.1| glycosyl transferase family protein [Nostoc sp. PCC 7107]
 gi|427358987|gb|AFY41709.1| glycosyl transferase family 9 [Nostoc sp. PCC 7107]
          Length = 319

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 163/337 (48%), Gaps = 45/337 (13%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   + K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRCYPNAKIDVVTEPQSKAAYRVSKSVHEVLTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS---TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++++++G +++R YD+ ++   + L GL     L++T    RV +        G
Sbjct: 62  R-NSLADWSNLVGTIRDREYDVAIAFGQSWLVGL----MLWLTGIPMRVGF-----KGKG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           A  L   T T    N S+     Y +++  LG         P+P L V++ +   E    
Sbjct: 112 AAFL---THTV-PFNPSQYVAAAYHELLKPLGL------TTPLPELAVNVPKPDIEWSQI 161

Query: 301 KYKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR------P 353
           + K  G  +  Y++I+ G++S  +A      D D + P++ W +I   ++EF       P
Sbjct: 162 EQKRLGVNETGYVLIYGGVDSALRAK-----DADKIYPVENWQQI---IQEFHLKQPDMP 213

Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
           + VI    + E    ++      + +T+P   G+L A+I  +  +++T +AA+QL+ A +
Sbjct: 214 VVVI-QGADDETFVRLLRASCPQIKVTSPDNAGKLTAMIAGANLMLSTESAALQLSIAAQ 272

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
             +IA+  S   G+L  P +E  K   I S TGK+ D
Sbjct: 273 TYTIAILGSSDPGKLL-PKSE--KFLAIKSPTGKVAD 306


>gi|425455179|ref|ZP_18834904.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9807]
 gi|389803960|emb|CCI17156.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9807]
          Length = 320

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDISISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|428217367|ref|YP_007101832.1| glycosyl transferase family protein [Pseudanabaena sp. PCC 7367]
 gi|427989149|gb|AFY69404.1| glycosyl transferase family 9 [Pseudanabaena sp. PCC 7367]
          Length = 319

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 146/320 (45%), Gaps = 31/320 (9%)

Query: 122 VRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL 181
           + R   ++ GG+ + LLFFP ++ LK  YP   IDV+   R    + +   V   N++  
Sbjct: 1   MMRLLALVPGGIGDQLLFFPTLETLKQTYPQAAIDVVVEPRSVGAYRVCPYVN--NIWKF 58

Query: 182 DDDWPEP-AEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
           D       A++ ++LG +++  YD+VLS           L++T   +R+++       +G
Sbjct: 59  DFKGSNSFADWGNLLGTIRDCEYDVVLSLG-KSFSVGLLLWLTGIPERIAF-------SG 110

Query: 241 AG-LLLSETFTAESMNLSERGYNMYEQMVD-WLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AG  LL++    E    ++    MY  +    LG         P PP++++I +     V
Sbjct: 111 AGSFLLTQPVPLEQ---NQYAAAMYHDLASKGLGI------DQPCPPIKLAIPQGDVAWV 161

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF---RPLF 355
             + K  G E   Y++IHG  S+    +  R   + + P++ W  I NG++      PL 
Sbjct: 162 KSELKRMGIEGKGYVLIHGGASE----LSQRKGINKIYPVESWKTIINGVQAKLYDAPLV 217

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPS 413
           VI    +R+ ++ +      +  I  P  G+LAALI  +  ++ T++A + +  A     
Sbjct: 218 VIGGPDDRKLIKALTELQPQLKVINPPDIGKLAALIMSAQLLLCTDSAPMHIGVATGIKL 277

Query: 414 IALFSSELKGRLFVPNAEEK 433
           +ALF      +L   N + K
Sbjct: 278 VALFGPTEPAKLLPSNDQIK 297


>gi|425442475|ref|ZP_18822718.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9717]
 gi|389716501|emb|CCH99280.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9717]
          Length = 320

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QYG-LADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPELPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|119483240|ref|ZP_01618654.1| Glycosyl transferase, family 9 [Lyngbya sp. PCC 8106]
 gi|119458007|gb|EAW39129.1| Glycosyl transferase, family 9 [Lyngbya sp. PCC 8106]
          Length = 320

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 88/342 (25%), Positives = 157/342 (45%), Gaps = 38/342 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   I V+   R K  + + K+V     YD  D
Sbjct: 2   RILALVPGGIGDQVLFFPTLDDLKQAYPQARISVVVEPRAKGAYRVCKSVSEVLAYDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ + LG+++ R +D+VLS     LG        L++T    RV Y       +
Sbjct: 62  R-NSLADWGNFLGIVREREFDLVLS-----LGKRWTVGLLLWLTGIPQRVGY-----AGS 110

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
             G+ L+++   ++   +   Y+   Q +D L  PF  +  + VP   +S +        
Sbjct: 111 AGGMFLTQSVPLKTEQYAAEMYHDLLQGLD-LSLPFSGLAIN-VPKSDISWAE------- 161

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
            + +  G +   YI+IHG    S    Q  G  D + P++ W ++   +++ +   P+ V
Sbjct: 162 TQQQQLGIKDSGYILIHG---GSSLLAQQLG-IDKIYPVEKWQKVIQDMQQQQPNIPVVV 217

Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
           +   ++ + V  +       V +TTP   G+LAA+I  +  ++ T++A + LA A    +
Sbjct: 218 VKGPEDADWVNQLT-QSCRDVKVTTPGDIGKLAAMIAAANLMVCTDSAPMHLAVAVGTYT 276

Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
           IALF      +L  P  +  +   + S TGK+ D   E +L 
Sbjct: 277 IALFGPTDPEKLL-PKTD--RVIGVKSPTGKIADLSPEDILK 315


>gi|440755455|ref|ZP_20934657.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
           TAIHU98]
 gi|440175661|gb|ELP55030.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 320

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|390439562|ref|ZP_10227953.1| Glycosyl transferase [Microcystis sp. T1-4]
 gi|389837051|emb|CCI32077.1| Glycosyl transferase [Microcystis sp. T1-4]
          Length = 320

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 156/345 (45%), Gaps = 40/345 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPR-----HPVPPLRVSISRRLKEV 297
                 F ++ +     GY         LG  + ++ +      P PPL+++++R   + 
Sbjct: 113 -----WFLSQQVPQPTEGY---------LGDNYHALVKGLKIAAPCPPLKLTLNRDDIDW 158

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
              + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+
Sbjct: 159 AEMEQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPI 214

Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
             I  E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    
Sbjct: 215 VAIQGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTY 274

Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           +++L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 275 TLSLLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|425437611|ref|ZP_18818026.1| Glycosyl transferase [Microcystis aeruginosa PCC 9432]
 gi|389677385|emb|CCH93667.1| Glycosyl transferase [Microcystis aeruginosa PCC 9432]
          Length = 320

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLKSWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|428200505|ref|YP_007079094.1| ADP-heptose--LPS heptosyltransferase [Pleurocapsa sp. PCC 7327]
 gi|427977937|gb|AFY75537.1| ADP-heptose:LPS heptosyltransferase [Pleurocapsa sp. PCC 7327]
          Length = 314

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 157/341 (46%), Gaps = 38/341 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP ++ LK +YP   IDV+   R K  + +   V    V+D  D
Sbjct: 2   RIIALVPGGISEQILFFPTLEDLKKQYPKAAIDVMIEPRAKAAYRVCPYVNEVLVFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
                A+Y ++LG++++R YDM LS ++   +G    L++     R+ Y         A 
Sbjct: 62  R-NGLADYLNLLGIIRDREYDMALSLSQRWTIG--LLLWLNGIPIRIGY------KNNAS 112

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
             LS++   +S    +   + Y  ++  LG       +   P L+V++ +   +    + 
Sbjct: 113 WFLSDSVPLKS---EQYAAHKYHDLLQALG------IQSSCPELKVTLPKEDIDWAEAEQ 163

Query: 303 KNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
           K    E+  YI+++ G E             D+  P+  W +I   L++ +   P+ ++ 
Sbjct: 164 KRLDIEESGYILLYDGFEGG-----------DNTYPVAKWRKIVEDLQQKQPNLPIVLLR 212

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             +    V   + D  ++  ++ P  G+LAA++  +  ++ T++  + L+ A    +IAL
Sbjct: 213 DSENERWVATTIEDRPNLKVVSPPDLGKLAAIVAGANLMLCTDSVPMHLSVAVGTYTIAL 272

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F S    +L  PN+E  +   I S T K+ D   E +L  +
Sbjct: 273 FGSTDATKLLPPNSE--RSIGIQSPTKKVADIEPETILQQI 311


>gi|209524261|ref|ZP_03272811.1| glycosyl transferase family 9 [Arthrospira maxima CS-328]
 gi|376003521|ref|ZP_09781331.1| glycosyl transferase [Arthrospira sp. PCC 8005]
 gi|423066757|ref|ZP_17055547.1| glycosyl transferase family 9 [Arthrospira platensis C1]
 gi|209495352|gb|EDZ95657.1| glycosyl transferase family 9 [Arthrospira maxima CS-328]
 gi|375328178|emb|CCE17084.1| glycosyl transferase [Arthrospira sp. PCC 8005]
 gi|406711782|gb|EKD06981.1| glycosyl transferase family 9 [Arthrospira platensis C1]
          Length = 320

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 158/345 (45%), Gaps = 44/345 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           +   ++ GG+ + +LFFP +  LK  YP   I V+   R +  + + K+V     YD   
Sbjct: 2   KILALVPGGIGDQILFFPTLDDLKQAYPHAQISVVTEPRAQTAYRVCKSVSKVIPYDFKG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ +++G++++R YD+V+S     LG        L++T   +RV Y      A 
Sbjct: 62  R-NSLADWGNLIGIVRDREYDVVIS-----LGQRWTVGLLLWLTGIPNRVGY------AG 109

Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLG--RPFRSVPRHPVPPLRVSISRRLK 295
            +G    E F  +++ L    Y   MY  ++  L    PF  +  + VP   ++ + R +
Sbjct: 110 SSG----EIFLTDAVPLKTEQYAAQMYHDLLQGLDIITPFSGLQIN-VPKSDLAWADREQ 164

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
           E       + G ++  YI+IHG      + M      D + P++ W +I   +++ +P  
Sbjct: 165 E-------SLGVKESGYILIHG----GSSQMAINKGIDKIYPVESWQKIIEDIQQRQPNL 213

Query: 356 --VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
             V+    E + +   +      V +T P   G+LAA+I  +  ++ T++A + LA A  
Sbjct: 214 PVVLVEGPEDQALITQITQRCPQVKLTQPEDIGKLAAMIAAANLMVCTDSAPMHLAIAVG 273

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
             +IALF      +L  P+   ++   + S TGK+ D   + VL 
Sbjct: 274 TYTIALFGPTDPHKLLPPS---ERAIAVKSPTGKMADISPDEVLK 315


>gi|425460630|ref|ZP_18840111.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9808]
 gi|389826674|emb|CCI22674.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9808]
          Length = 320

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K  + L   V+   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|409991714|ref|ZP_11274951.1| glycosyl transferase family protein [Arthrospira platensis str.
           Paraca]
 gi|291571644|dbj|BAI93916.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409937413|gb|EKN78840.1| glycosyl transferase family protein [Arthrospira platensis str.
           Paraca]
          Length = 320

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 158/345 (45%), Gaps = 44/345 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           +   ++ GG+ + +LFFP +  LK  YP   I V+   R +  + + K+V     YD   
Sbjct: 2   KILALVPGGIGDQILFFPTLDDLKQAYPHAQISVVTEPRAQTAYRVCKSVSKVIPYDFKG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A++ +++G++++R YD+V+S     LG        L++T   +RV Y      A 
Sbjct: 62  R-NSLADWGNLIGIVRDREYDVVIS-----LGQRWTVGLLLWLTGIPNRVGY------AG 109

Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLG--RPFRSVPRHPVPPLRVSISRRLK 295
            +G    E F  +++ L    Y   MY  ++  L    PF  +  + VP   ++ + R +
Sbjct: 110 SSG----EIFLTDAVPLKTEQYAAQMYHDLLQGLDIITPFSGLQIN-VPKSDLAWADREQ 164

Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
           E       + G ++  YI+IHG      + M      D + P++ W +I   +++ +P  
Sbjct: 165 E-------SLGVKESGYILIHG----GSSQMAINKGIDKIYPVESWQKIIEDIQQRQPNL 213

Query: 356 --VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
             V+    E + +   +      V +T P   G+LAA+I  +  ++ T++A + LA A  
Sbjct: 214 PVVLVKGPEDQALITQITQRCPQVKLTQPEDIGKLAAMIAAANLMVCTDSAPMHLAIAVG 273

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
             +IALF      +L  P+   ++   + S TGK+ D   + VL 
Sbjct: 274 TYTIALFGPTDPHKLLPPS---ERAIAVKSPTGKMADISPDEVLK 315


>gi|425447077|ref|ZP_18827071.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9443]
 gi|389732456|emb|CCI03614.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
           aeruginosa PCC 9443]
          Length = 320

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K    L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAHRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T  + +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRSAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDISISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     +  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGDLHR--YIEANTGKVADITPETVLKKI 317


>gi|425448716|ref|ZP_18828560.1| Glycosyl transferase [Microcystis aeruginosa PCC 7941]
 gi|389763996|emb|CCI09602.1| Glycosyl transferase [Microcystis aeruginosa PCC 7941]
          Length = 320

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   + K  + L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPKAKAAYRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       ++L P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLKSWLEIIQGIRHQQPDLPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|17232660|ref|NP_489208.1| hypothetical protein alr5168 [Nostoc sp. PCC 7120]
 gi|17134306|dbj|BAB76867.1| alr5168 [Nostoc sp. PCC 7120]
          Length = 319

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 83/342 (24%), Positives = 163/342 (47%), Gaps = 35/342 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   +DV+   R K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVVEPRSKAAYRVSKSVNDILTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G +++R YD+V++   + L    FL++T    R+ Y        G G 
Sbjct: 62  R-NSLADWGNLVGTIRDREYDVVITVGQSWL-VGLFLWLTGIPTRIGY-------QGKG- 111

Query: 244 LLSETFTAESMNL--SERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
               +F  +++    S+    +Y  ++  +G         P P L V++ +   +    +
Sbjct: 112 ---RSFLTKTVPFKPSQYAAAVYHDLLAPIGI------TTPTPELAVNVPKPDIDWANSE 162

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K  G  +  Y++I+G      + +      +++ P + W EI    R+ +   P+ VI 
Sbjct: 163 QKRLGVNETGYVLIYG----GSSWVSQPQALETIYPAENWQEIIQDFRQKQPDLPVVVIQ 218

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
              + + V   + D +  + +T+P   G+L A+I  +  ++ T++A + L+ A +  +IA
Sbjct: 219 GPDDEQFVRS-LRDYSLDIKVTSPDNVGKLTAMIAGANLMLTTDSAPLHLSVAVQTYTIA 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           LF S    +L  PN++  K   I S TG++ D   +AVL  +
Sbjct: 278 LFGSTDPVKLL-PNSD--KLLGIKSSTGRVADILPKAVLEKV 316


>gi|443314576|ref|ZP_21044124.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 6406]
 gi|442785819|gb|ELR95611.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 6406]
          Length = 321

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 160/342 (46%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E LLFFP ++ +K  +P   I ++A       + ++  V+  +V+    
Sbjct: 2   RVLALVPGGISEQLLFFPTLEHIKQAFPKAEIAIVAEPVSGSAYRVSTLVK--DVFPYSF 59

Query: 184 DWP-EPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGA 241
             P  PA++ ++LG++++R ++ VL+ T    LG    L+++    RVSY         +
Sbjct: 60  SKPNSPADWANLLGIIRDREFEAVLTVTPKTSLG--LMLWLSGIPTRVSY-----QTGIS 112

Query: 242 GLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
            +L++ T  A+        Y+     +D  G         P P   +++ ++  + V  +
Sbjct: 113 NVLMTATVPAKPKQYQAFQYHDLLAALDITG---------PCPAATINVPQKDIDWVGRQ 163

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            K  G     Y++ +G   D      ++GD   L P++ WA IA   +  +   PL V+ 
Sbjct: 164 TKEQGIGDQGYVLFYGGPVDG-----TQGD---LYPVESWAAIAQDFQARQPGLPLVVVQ 215

Query: 359 HEKEREGVEDV--VGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             +  + V+ +  +  +  ++     GQ+A+LI  +  +IAT++  +QL  A    ++AL
Sbjct: 216 QPETADLVKALTQIQPNLKVIRPENIGQMASLIAGANLMIATDSYPLQLGIALNVFTVAL 275

Query: 417 FSSELKGRLFVP-NAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F S +  R   P +  E +   +++ +GK+ D   E VL  +
Sbjct: 276 FGSNVPERRLPPVDGTETRFVGLTASSGKVADITPETVLKKI 317


>gi|332709136|ref|ZP_08429103.1| ADP-heptose/LPS heptosyltransferase [Moorea producens 3L]
 gi|332352047|gb|EGJ31620.1| ADP-heptose/LPS heptosyltransferase [Moorea producens 3L]
          Length = 320

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 153/347 (44%), Gaps = 44/347 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDVI   R K  + + ++V     +D  D
Sbjct: 2   RILALVPGGIGDQILFFPTLSQLKQYYPKAEIDVIVEPRAKGAYRVCQSVDEVLTFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYD--MVLSTKLA-GLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ ++LGV+++R YD  M L  + + GL     L+MT    R+ Y       AG
Sbjct: 62  R-NGLADFGNLLGVIRDREYDAAMTLGRRWSVGL----LLWMTGIVRRIGY------QAG 110

Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           +     + F    + L    Y  +MY  ++   G         P   L V + +   E  
Sbjct: 111 S----RDLFFTNPVPLKTEQYAAHMYHDLLLGFGINV------PCQELAVRVPKSDIEWA 160

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
             + +     +  YI+IHG  S    S       D + P++ W +I   +R+ +P   I 
Sbjct: 161 EAQQQQLEIPEIGYILIHGGSSQLALSQ----GIDKIYPVEQWQKIVEDIRQKQPDLPIV 216

Query: 359 HEKEREGVEDVVGDDASI-----VFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
             K   G ED     A I     + +T P   G+LAA+I  +  ++ T++A + LA A  
Sbjct: 217 LLK---GPEDAEWCQAMIEYHRDIIVTAPEDIGKLAAMIAGANLMLCTDSAPMHLAVAVG 273

Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
            P+IALF    + +  +P  E      I S TG++ D   E VL  +
Sbjct: 274 TPTIALFGPT-QAKKLLPAKE--GVYGIQSATGRIADIKPETVLEQI 317


>gi|81299493|ref|YP_399701.1| hypothetical protein Synpcc7942_0682 [Synechococcus elongatus PCC
           7942]
 gi|81168374|gb|ABB56714.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 319

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 141/306 (46%), Gaps = 38/306 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R    + +   V     +D  D
Sbjct: 2   RILALVPGGIGDQILFFPTVADLKKYYPQAQIDVVVEPRAVGAYRVCAEVDQVFPFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST---KLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ ++LG ++ R Y+ VLS     L GL    FL++T    RV Y       AG
Sbjct: 62  R-NSLADWGNLLGSIREREYEAVLSLGQRSLVGL----FLWLTGIPKRVGY-------AG 109

Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR-LKEV 297
            G    + F  +++ L+   Y   +Y  ++   G         P P  +++++R+ L   
Sbjct: 110 RG----KIFLTDAVPLNREQYAGALYHDLLQGFGI------NTPCPNPKLTLARQDLDWA 159

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
            AE+ +   A QG Y+V+HG      +++        + P++ WA +   +RE R   P 
Sbjct: 160 TAEQQRLGLAGQG-YLVLHG----GSSTLAKLKGLQKVYPVEKWAIVLRSIREQRPSLPF 214

Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
            V+    + E V ++   +     +  P  G+LAA+I  +  ++ T++A + LA A    
Sbjct: 215 VVVQGPDDAEFVAELRKSNLDFQVVQPPDIGKLAAIIAGADLMLCTDSAPMHLAVASGTR 274

Query: 413 SIALFS 418
           +IALF 
Sbjct: 275 TIALFG 280


>gi|56750856|ref|YP_171557.1| hypothetical protein syc0847_c [Synechococcus elongatus PCC 6301]
 gi|56685815|dbj|BAD79037.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 319

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 141/306 (46%), Gaps = 38/306 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R    + +   V     +D  D
Sbjct: 2   RILALVPGGIGDQILFFPTVADLKKYYPQAQIDVVVEPRAVGAYRVCAEVDQVFPFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST---KLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ ++LG ++ R Y+ VLS     L GL    FL++T    RV Y       AG
Sbjct: 62  R-NSLADWGNLLGSIREREYEAVLSLGQRSLVGL----FLWLTGIPKRVGY-------AG 109

Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR-LKEV 297
            G    + F  +++ L+   Y   +Y  ++   G         P P  +++++R+ L   
Sbjct: 110 RG----KIFLTDAVPLNREQYAGALYHDLLQGFGI------NTPCPNPKLTLARQDLDWA 159

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
            AE+ +   A QG Y+V+HG      +++        + P++ WA +   +RE R   P 
Sbjct: 160 TAEQQRLGLAGQG-YLVLHG----GSSTLAKLKGLQKVYPVEKWAIVLRSIREQRPSLPF 214

Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
            V+    + E V ++   +     +  P  G+LAA+I  +  ++ T++A + LA A    
Sbjct: 215 VVVQGPDDAEFVAELRKSNLDFQVVQPPDIGKLAAIIAGADLMLCTDSAPMHLAVASGTR 274

Query: 413 SIALFS 418
           +IALF 
Sbjct: 275 TIALFG 280


>gi|37522681|ref|NP_926058.1| heptosyltransferase [Gloeobacter violaceus PCC 7421]
 gi|35213683|dbj|BAC91053.1| gll3112 [Gloeobacter violaceus PCC 7421]
          Length = 328

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 147/338 (43%), Gaps = 34/338 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +  GG+ + +LFFP ++ L++R+    I+V+   R +  + +  +V     +D   
Sbjct: 3   RILALNPGGIGDQVLFFPTLRGLRERFAQSRIEVVVEPRSQGAYRVCPSVNETLTFDFKG 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
           D P  A++ +++G++++R YD VLS   + L  A  L+MT    RV Y            
Sbjct: 63  D-PSLADWMNLIGILRDRRYDAVLSVGSSSL-VALLLWMTGIPKRVGY----------SA 110

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
            ++E F  +S+ L+   Y   MY  ++   G     + R   PP + ++    +    + 
Sbjct: 111 WITERFLTDSVPLNRDQYAAQMYHDLLQGFG-----IQRAFTPP-QAAVYSEDERWATQA 164

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL---REFRPLFVIP 358
             + G    K +++H       + +        L P   WA++   L       P FV+ 
Sbjct: 165 LASLGGR--KPLLLH----PGASKLAELKGIRKLYPAAQWAQVVQKLLVKEADLPFFVVQ 218

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             ++ E V  + G+ A       P   G+L ALI  S  ++  ++A + LA A   P +A
Sbjct: 219 GPEDSELVTAIKGELADQARFVQPPDVGKLTALIERSRLLLCVDSAPMHLAVATGTPLVA 278

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAV 453
           LF   L  +L     E K   ++S    K+   PVE V
Sbjct: 279 LFGPTLPTKLL--PEEPKYVALVSPERDKVERIPVETV 314


>gi|284929535|ref|YP_003422057.1| ADP-heptose:LPS heptosyltransferase [cyanobacterium UCYN-A]
 gi|284809979|gb|ADB95676.1| ADP-heptose:LPS heptosyltransferase [cyanobacterium UCYN-A]
          Length = 311

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/340 (22%), Positives = 154/340 (45%), Gaps = 39/340 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R  C++ GG+ E LLFFP ++ LK++YP VLIDV+     K  + +   V     +D  D
Sbjct: 2   RILCLVPGGINEQLLFFPTLESLKNKYPNVLIDVLVEPLAKSAYRICPYVNEILFFDYQD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A+Y +++GV+++R YD+ ++T    L     L+  +   R+ Y            
Sbjct: 62  R-NVSADYLNLVGVIRDRGYDIAITTG-NKLILELLLWSNSIPWRIGY------KTQTSW 113

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            LS + T +    +   Y  ++ ++    +P         P +++++ +        + +
Sbjct: 114 FLSHSITQKEEQYAAETY--HDLLLKLNIQP-------SCPSIKIAVPKDDISWTETQLQ 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP---LFVIPHE 360
           +   ++  YI I+G E++S              P+  W EI N +++  P   + ++   
Sbjct: 165 SLSVKENGYIAIYGGENNS-------------YPVSSWIEIINSIQKKEPSLSIVLLESN 211

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
            E+   + ++ D+   + I  P   G+L+A+I  +  ++  ++  +QL  A    +IALF
Sbjct: 212 TEQTWTKSIL-DNCKNLKIINPDNLGKLSAVIAGANLLVCIDSVPLQLGIAVGTYTIALF 270

Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
            S+ K    +P   + +C  I S + +L D     +L  +
Sbjct: 271 -SDTKANHKIPTNYD-RCIAIQSPSNQLADISSSTILEKI 308


>gi|218439612|ref|YP_002377941.1| glycosyl transferase family protein [Cyanothece sp. PCC 7424]
 gi|218172340|gb|ACK71073.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7424]
          Length = 320

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 38/344 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK +YP   I VI   R K  + +   V    ++D  D
Sbjct: 2   RILALVPGGIGDQILFFPTLEDLKQKYPTSTIHVIVEPRAKSAYRVCPYVNEVLLFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A+Y ++LGV+++R YD+ +S     LG        L++     RV Y        
Sbjct: 62  R-NGLADYLNLLGVIRDREYDLAVS-----LGRRWTVGLLLWLNGIPVRVGY------KT 109

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
                L+     ++   +   Y+   Q +D          + P PPL+V++ +   +   
Sbjct: 110 NTSWFLTNPVPLKTEQYAAYMYHDLLQGLD---------IQTPCPPLKVTLPKEDIDWAE 160

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
            + K    +   YI+IHG    SK ++      D + P+  W +I + +++ +   P+ +
Sbjct: 161 AEQKRLDIKDTGYIMIHG--GSSKLAIVK--GIDKIYPVSQWQKILDEIQQKQPNIPIVL 216

Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
           +   ++ E V+ ++ +    + +T P   G+LAA+I  +  ++ T++A + L+ +    +
Sbjct: 217 LQGPEDAEWVK-MMMEGYPNLKVTKPSDVGKLAAMIAGANLMLCTDSAPMHLSVSVGTYT 275

Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           IALF      +L  P  ++ +   I S +G + D   E +L AM
Sbjct: 276 IALFGPTQAEKLLPP--DQDRYIGIQSPSGNIADIKPETILQAM 317


>gi|427725266|ref|YP_007072543.1| glycosyl transferase family protein [Leptolyngbya sp. PCC 7376]
 gi|427356986|gb|AFY39709.1| glycosyl transferase family 9 [Leptolyngbya sp. PCC 7376]
          Length = 319

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 159/336 (47%), Gaps = 43/336 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GGV + +LFFP +  LKD+YP  ++DV+   R K  + +   V     +D  D
Sbjct: 2   RILALVPGGVGDQILFFPTLADLKDQYPEAMVDVLVEPRAKAAYRVCPTVHEVLTFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
               PA+Y ++LG +++R Y++ LS     + GL     L++     R+ Y     N   
Sbjct: 62  R-NGPADYLNVLGTIRDREYEIALSLGRRWIVGL----LLWLNGIPTRIGY---KTN--- 110

Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWL--GRPFRSVPRHPVPPLRVSISRRLKE 296
                +  F +    L+E  Y    Y  ++  L   +P  ++P+  VP      S  ++ 
Sbjct: 111 -----TSWFISNPAPLNEEQYAAATYHDLLKALDINKPC-NLPKINVP------SSDIQW 158

Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---P 353
              E+ K  G   G YI+IH     SK S Q++G  + L PI+ W E+ N ++  +   P
Sbjct: 159 AEGEQ-KRLGLNDG-YILIHA--GASKIS-QAKG-IEKLYPIEKWEEVINDIKAKQPNLP 212

Query: 354 LFVIPHEKEREGVEDVVG--DDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
           + ++   ++ + V  +V    D  +      G+ AA+I  +  ++ T++A + LA A   
Sbjct: 213 IALLHGPEDYDWVAQLVEIFPDLRVTVPEDVGKSAAMIAGANLMVCTDSAPMHLAIAVGT 272

Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
            +IALF    + RL  P+++  K   I S TG++ D
Sbjct: 273 YTIALFGPTKRERLLPPSSD--KFIGIQSPTGRIAD 306


>gi|427712861|ref|YP_007061485.1| ADP-heptose--LPS heptosyltransferase [Synechococcus sp. PCC 6312]
 gi|427376990|gb|AFY60942.1| ADP-heptose:LPS heptosyltransferase [Synechococcus sp. PCC 6312]
          Length = 325

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/336 (24%), Positives = 143/336 (42%), Gaps = 41/336 (12%)

Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL- 181
           +R   +I GG+ + +L FP +  L+  YP   IDV+   R +  +E+N  V     +   
Sbjct: 4   QRIVMLIPGGIGDQILIFPTLADLRGHYPQAEIDVVVEPRSQAAYEVNAVVNQVLTFPFR 63

Query: 182 -DDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHA----AFLFMTTARDRVSYIYPNV 236
               W    ++  ++  +++R YD+++S     LG +      L++T    RV Y     
Sbjct: 64  AKKTW---RDWWGLIQQIRSRQYDIIVS-----LGESFAVRVLLWLTGVPKRVGY----A 111

Query: 237 NAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKE 296
           N    GLL         +N ++    MY  ++  LG     + R   PPL  ++ ++  +
Sbjct: 112 NQKTWGLLTHPA----PLNKNQYAAAMYHDLLKGLG-----IDRE-YPPLVATVKQK-DQ 160

Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP--- 353
              E  +     Q  YI+IHG      + M      + + PIQ W E+ + L    P   
Sbjct: 161 AWGESERLRLQVQSPYILIHG----GSSKMARLKGINKIYPIQAWLEVLHTLNAKYPGIS 216

Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREK 411
           + ++    ++E  E +      +     P  G+LAALI  + G+I T++  + L  A   
Sbjct: 217 VLIVQGPDDQEWGEQLTAAIPELKRTQPPSFGKLAALIETATGMICTDSGPMHLGVALNT 276

Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
           P +ALF      +L  P    +    + S TG++ D
Sbjct: 277 PLVALFGPTDPQKLLPPQPHFRP---VKSPTGQIGD 309


>gi|434393455|ref|YP_007128402.1| glycosyl transferase family 9 [Gloeocapsa sp. PCC 7428]
 gi|428265296|gb|AFZ31242.1| glycosyl transferase family 9 [Gloeocapsa sp. PCC 7428]
          Length = 314

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 82/340 (24%), Positives = 151/340 (44%), Gaps = 42/340 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  LK  YP   IDV+   R K+ + + K V     +D  D
Sbjct: 2   RVVALVPGGIGDQILFFPTLDDLKQNYPNAQIDVVVEPRAKEAYRICKAVNNVLAFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G++++R YD+V+S       GL     L++T    R+ Y        G
Sbjct: 62  R-NSAADWVNLVGILRDREYDVVISVGQRWFVGL----LLWLTGIPTRIGY-------KG 109

Query: 241 AGLLLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AG      F   ++ L  + Y+  MY  ++  LG         P P L V++     +  
Sbjct: 110 AG----NMFLTNAVPLKSQQYDAAMYHDLLQGLGI------NSPCPELTVNVPAADIDWA 159

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-LFVI 357
             + K+ G +   Y+++    SDS      +   D   P++ W +I    R+ +P L ++
Sbjct: 160 EAEKKHLGIQSSGYVLL----SDSF----EQSTPDKSYPMESWRQIIQDFRQKQPELPLV 211

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
             +   +     +      + +T+P   G+LAA+I  +  ++      +QLA A +  +I
Sbjct: 212 ALQSSDDMRWTQLAQANPELKLTSPPDVGKLAAMIAGADLLLCIEGVPLQLAVAVQTYAI 271

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
           ALF      +L     +  +   I S +G++ D   + VL
Sbjct: 272 ALFGLTDPAKLL---PKSDRYIPIKSPSGQVADITPQTVL 308


>gi|16332333|ref|NP_443061.1| hypothetical protein slr0606 [Synechocystis sp. PCC 6803]
 gi|383324074|ref|YP_005384928.1| hypothetical protein SYNGTI_3166 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383327243|ref|YP_005388097.1| hypothetical protein SYNPCCP_3165 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383493127|ref|YP_005410804.1| hypothetical protein SYNPCCN_3165 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384438395|ref|YP_005653120.1| hypothetical protein SYNGTS_3167 [Synechocystis sp. PCC 6803]
 gi|451816484|ref|YP_007452936.1| hypothetical protein MYO_132030 [Synechocystis sp. PCC 6803]
 gi|1653963|dbj|BAA18873.1| slr0606 [Synechocystis sp. PCC 6803]
 gi|339275428|dbj|BAK51915.1| hypothetical protein SYNGTS_3167 [Synechocystis sp. PCC 6803]
 gi|359273394|dbj|BAL30913.1| hypothetical protein SYNGTI_3166 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359276564|dbj|BAL34082.1| hypothetical protein SYNPCCN_3165 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359279734|dbj|BAL37251.1| hypothetical protein SYNPCCP_3165 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407957031|dbj|BAM50271.1| hypothetical protein BEST7613_1340 [Synechocystis sp. PCC 6803]
 gi|451782453|gb|AGF53422.1| hypothetical protein MYO_132030 [Synechocystis sp. PCC 6803]
          Length = 317

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 133/314 (42%), Gaps = 28/314 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP I  LK  YP   IDVI   R K  + +   V     +D  D
Sbjct: 2   RILALVPGGISEQILFFPTIATLKAEYPQATIDVIVEPRAKSAYRVCAQVNEVLAFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ ++LGV+++R Y+ VL+           L++     RV Y       + AG 
Sbjct: 62  R-NGLADFLNLLGVIRDREYEAVLTVARQ-WTIELLLWLNGIPQRVGY------QSSAGF 113

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            LS T   +     +    ++  +    G         P PPL++S+ +   E +    K
Sbjct: 114 FLSATVPFKP---DQYVPFLFHDLTQGFGI------NRPCPPLQISLPKADIEWMEATQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP---LFVIPHE 360
                 G Y+V++G       +++     D   PI  W EI   +++ +P   + +IP  
Sbjct: 165 KLDLGSGGYVVLNG------GAIRQPDTGDFPYPIAQWHEIIADIKQKQPGLKIVLIPPA 218

Query: 361 KEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           +    V+ +      +V I     G++AAL+  +  V+ T    + LA A    ++A+  
Sbjct: 219 ENTGWVQAMQDQHPGVVAIRPGDVGKMAALLAGANLVVCTEGIPMHLAIAVGTYTVAILG 278

Query: 419 SELKGRLFVPNAEE 432
                R+  P+ E+
Sbjct: 279 KTPAARIIPPDQEK 292


>gi|443647325|ref|ZP_21129681.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159027766|emb|CAO89636.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443335502|gb|ELS49970.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 320

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 154/342 (45%), Gaps = 34/342 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+   LLFFP ++ L+  YP   IDV+   R K    L   VR   ++D  D
Sbjct: 2   RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAHRLCPQVREVLLFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
            +   A+Y +ILGV+++R Y++ +S T    +     L++    +R+ Y     N +   
Sbjct: 62  QYG-LADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                 F ++ +     GY  + Y  +V  L          P PPL+++++R   +    
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAQM 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + +    +   YI+++   SD   S       +++ P++ W EI  G+R  +   P+  I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETVYPLESWLEIIQGIRHQQPELPIVAI 217

Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             E +   V  +   D ++    T   G++AA+I  +  ++AT +  +QLA A    S++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYSLS 277

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L    L  R+     E  +   I + TGK+ D   E VL  +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317


>gi|170078509|ref|YP_001735147.1| heptosyltransferase family protein [Synechococcus sp. PCC 7002]
 gi|169886178|gb|ACA99891.1| Heptosyltransferase family protein [Synechococcus sp. PCC 7002]
          Length = 319

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 152/340 (44%), Gaps = 37/340 (10%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GGV + +LFFP +  LK+RYP  +IDV+   R K  + +   V     +D  D
Sbjct: 2   RILALVPGGVGDQILFFPTLADLKERYPEAMIDVLVEPRAKAAYRVCPQVHEVLTFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST-KLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
               PA+Y +ILG +++R Y++ LS  K   +G    L++     RV Y           
Sbjct: 62  R-NGPADYLNILGTIRDREYEIALSLGKNWAVGF--LLWLNGIPTRVGY----------- 107

Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
              +  F +    L+E  Y   +Y  ++  LG           P   +++     +    
Sbjct: 108 KTATSWFISNPAPLNENQYAAALYHDLLKGLGI------DTDCPSPSITVPNGDIQWAEG 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           + K     +G YI+IHG  S      Q++G  + L P+  W  I   ++  +   P+ ++
Sbjct: 162 EQKRLDLTEG-YILIHGGASQLS---QAKG-INKLYPVANWQPIIEDIQSKQPNLPIVLL 216

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
               +   V +++    ++  +T P   G+ AA+I  +  ++ T++A + LA A    +I
Sbjct: 217 QGPDDAPWVTELIALYPALK-VTRPEDIGKAAAMIAGANLMVCTDSAPMHLAAAVGTYTI 275

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
           ALF +    +L  P  +  K   I S TG++ D   + VL
Sbjct: 276 ALFGATHAEKLLPP--KNDKLIGIQSPTGEMADIQPKDVL 313


>gi|414079190|ref|YP_007000614.1| glycosyl transferase [Anabaena sp. 90]
 gi|413972469|gb|AFW96557.1| glycosyl transferase family 9 [Anabaena sp. 90]
          Length = 313

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 153/342 (44%), Gaps = 41/342 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ G +   +LFF  +  LK  YP   IDVI   + K  + ++K+V     +D  D
Sbjct: 2   RVVALVPGSIDNQILFFATLDDLKRYYPDAQIDVIVEPQSKAAYRVSKSVHDVLTFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVL---STKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G++++R YD+ +    +   GL     +++T    R+ Y        G
Sbjct: 62  R-NSLADWGNLVGMIRDREYDVAIIVGQSWWVGL----LMWLTGIPTRIGY-----QGQG 111

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
           A  L +        NLSE    MY  ++    +P +     P P L V++ +   E    
Sbjct: 112 AVFLTNPI----PPNLSEYVAKMYHNLL----KPLKI--NTPCPALSVNVPKVDIEWAQA 161

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
           + K  G  +  +I+I+  E          G  D+  P++ W +I    ++ +P    V+ 
Sbjct: 162 EQKRLGVNETGFILINAGE----------GSLDTTYPVENWQQIIAACQQKQPDLPVVVI 211

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
            E   E +   + +    + +T+P   G+L A+I  ++ +++   + +QL+ A E  +I 
Sbjct: 212 KEANNEPLVRSLLEHCHNIKVTSPDDIGKLTAIIAGASLMVSVENSFLQLSIAVETYTIT 271

Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           L  S +     +P +E  K   I+S TGK+ D   + VL  +
Sbjct: 272 LLDS-IDSEKLLPISE--KVLAITSSTGKIADILPQIVLEKI 310


>gi|86606904|ref|YP_475667.1| heptosyltransferase family protein [Synechococcus sp. JA-3-3Ab]
 gi|86555446|gb|ABD00404.1| heptosyltransferase family protein [Synechococcus sp. JA-3-3Ab]
          Length = 324

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 73/310 (23%), Positives = 133/310 (42%), Gaps = 27/310 (8%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  L+ R+P   ++V+   R    +E+  +V     +   +
Sbjct: 4   RILALVPGGIGDQILFFPTLASLRQRFPEAELEVLVEPRAAAAYEVCPSVNRVLTFPFKE 63

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                 + +D+LG ++ R YD VLS   + LG    L++T    RV Y  P     G G 
Sbjct: 64  QL-SLGDISDLLGRIRERQYDAVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 115

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                +  + + L    Y  +MY  ++   G          +P +R+  +  L+    E+
Sbjct: 116 ---RPWLTDPVPLKPAQYAASMYHDLLQGFG----ITATAGLPQVRLKKT-DLEWAEQER 167

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            +  G     Y+++H       + +        + P   W ++  G RE R   PL ++ 
Sbjct: 168 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPPASWVKVIKGFREKRPELPLVLLL 223

Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             ++ +  E  +     +     P  GQLAALI  S  ++ T++A + LA A + P +A+
Sbjct: 224 GPEDEQLAEGFLSQLPDLPVSRPPHLGQLAALIAGSTLLLCTDSAPMHLAVATQTPLVAV 283

Query: 417 FSSELKGRLF 426
           F      RL 
Sbjct: 284 FGPTEPERLL 293


>gi|126659100|ref|ZP_01730240.1| hypothetical protein CY0110_04803 [Cyanothece sp. CCY0110]
 gi|126619628|gb|EAZ90357.1| hypothetical protein CY0110_04803 [Cyanothece sp. CCY0110]
          Length = 312

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 147/341 (43%), Gaps = 40/341 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP ++ LK +YP  +IDV+   R K  + +   V    ++D  D
Sbjct: 2   RILALVPGGINEQILFFPTLEDLKTQYPNAIIDVLVEPRAKSAYRVCPQVHEVLLFDYQD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
                A+Y ++LG++++R YD+ L+ +      +  L++      V Y   P+   +   
Sbjct: 62  R-NGLADYLNLLGIIRDREYDIALTLE-KRWSISLLLWLNGIPVTVGYKTQPSWFISNPV 119

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
              +E +TAE          MY  ++   G   +S      P L++++ +        + 
Sbjct: 120 PQKTEQYTAE----------MYHDLLQ--GIDIKST----CPSLKIALPKEDIAWAEAEQ 163

Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKE 362
           K     +  YIVI+G  ++S              PI  W  I N ++E +P   I   + 
Sbjct: 164 KRLLLNESGYIVIYGGANES-------------YPIPQWNNIINRIQEKQPSLSIVLLQG 210

Query: 363 REGVEDVVG---DDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             G E  +       S + +T P   G+LAA+I  +  +I T++  +QL  A    ++AL
Sbjct: 211 SGGDEAWISPLLSRCSDLKVTKPGDIGKLAAMIAGANLMICTDSPPLQLGVAVGTYTVAL 270

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           F      +   PN  + +   I S T  L D     VL  +
Sbjct: 271 FGKTDAKKRLPPN--DDRFIAIQSSTYNLGDIQASKVLEKI 309


>gi|67923629|ref|ZP_00517099.1| Glycosyl transferase, family 9 [Crocosphaera watsonii WH 8501]
 gi|67854511|gb|EAM49800.1| Glycosyl transferase, family 9 [Crocosphaera watsonii WH 8501]
          Length = 336

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 151/341 (44%), Gaps = 41/341 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP ++ LK +YP  +IDV+   R K  + +   V    ++D +D
Sbjct: 27  RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKVAYRVCPQVNEVLLFDYED 86

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
                A+Y ++LG++++R YD+ LS +         L++      V Y   P    + A 
Sbjct: 87  R-SGLADYLNLLGIIRDREYDIALSLE-KSWSIRLLLWLNGIPLTVGYGNKPTWFISKAI 144

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
              +E + AE          MY  +++  G   +S      PPL++++ +        + 
Sbjct: 145 PQKTEQYVAE----------MYHDLME--GLDIQST----CPPLKIALPKDDISWGETEQ 188

Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
           K    E+  YI+I+G  S+S              P+  W+ I N ++E +P    V+   
Sbjct: 189 KRLLLEESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQG 235

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
              E   + +    S + +T P   G+LAA+I  +  +I T++   QL  A    ++ALF
Sbjct: 236 GGDEAWVNSLLSSCSNLKVTKPGDIGKLAAIIAGANLMICTDSTPFQLGVAVGTYTVALF 295

Query: 418 S-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
             +  K RL    A + +   I S T  L D     VL  +
Sbjct: 296 GKTNPKKRL---PANDDRFIAIQSPTYNLGDIQASKVLEKI 333


>gi|416398771|ref|ZP_11686840.1| ADP-heptose--lipooligosaccharide heptosyltransferase II
           [Crocosphaera watsonii WH 0003]
 gi|357262510|gb|EHJ11630.1| ADP-heptose--lipooligosaccharide heptosyltransferase II
           [Crocosphaera watsonii WH 0003]
          Length = 311

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 152/341 (44%), Gaps = 41/341 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP ++ LK +YP  +IDV+   R K  + +   V    ++D +D
Sbjct: 2   RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKVAYRVCPQVNEVLLFDYED 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
                A+Y ++LG++++R YD+ LS +         L++      V Y   P    + A 
Sbjct: 62  R-SGLADYLNLLGIIRDREYDIALSLE-KSWSIRLLLWLNGIPLTVGYGNKPTWFISKAI 119

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
              +E + AE          MY  +++  G   +S      PPL++++ +        + 
Sbjct: 120 PQKTEQYVAE----------MYHDLME--GLDIQST----CPPLKIALPKDDISWGEAEQ 163

Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
           K    E+  YI+I+G  S+S              P+  W+ I N ++E +P    V+   
Sbjct: 164 KRLLLEESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQG 210

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
              E   + +    S + +T P   G+LAA+I  +  +I T++  +QL  A    ++ALF
Sbjct: 211 GGDEAWVNSLLSSCSNLKVTKPGDIGKLAAIIAGANLMICTDSTPLQLGVAVGTYTVALF 270

Query: 418 S-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
             +  K RL    A + +   I S T  L D     VL  +
Sbjct: 271 GKTNPKKRL---PANDDRFIAIQSPTYNLGDIQASKVLEKI 308


>gi|172035840|ref|YP_001802341.1| hypothetical protein cce_0924 [Cyanothece sp. ATCC 51142]
 gi|354555073|ref|ZP_08974376.1| glycosyl transferase family 9 [Cyanothece sp. ATCC 51472]
 gi|171697294|gb|ACB50275.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353553227|gb|EHC22620.1| glycosyl transferase family 9 [Cyanothece sp. ATCC 51472]
          Length = 311

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 152/340 (44%), Gaps = 39/340 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP ++ LK +YP  +IDV+   R K  + +   V    ++D  D
Sbjct: 2   RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKAAYRVCPQVHEVLLFDYQD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A+Y ++LG++++R YD+ L+ +      +  L++      V Y       + +  
Sbjct: 62  R-NGLADYLNLLGIIRDREYDIALTLE-KRWSISLLLWLNGIPLTVGY------GSQSSW 113

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
            +S     ++   + +   MY  ++  LG       + P P L++++ +        + K
Sbjct: 114 FISNPVPQKTEQYTAQ---MYHDLMQGLGI------QSPCPSLKIALPKDDIPWAEAEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHEK 361
               ++  YI+I+G  S+S              P+  W+ I N ++E +P    V+    
Sbjct: 165 RLLLDESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQGS 211

Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
             E   + +    S + +T P   G+LAA+I  +  +I+T++  +QL  A    ++ LF 
Sbjct: 212 GDEAWINPLLSSCSDLKVTKPGDIGKLAAMIAGANLMISTDSPPLQLGVAVGTYTVGLFG 271

Query: 419 -SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
            ++ K RL     ++ +   I S T  L D     VL  +
Sbjct: 272 KTDAKKRL---PPDDDRFIGIQSSTYNLGDIQASKVLEKI 308


>gi|427735470|ref|YP_007055014.1| ADP-heptose--LPS heptosyltransferase [Rivularia sp. PCC 7116]
 gi|427370511|gb|AFY54467.1| ADP-heptose:LPS heptosyltransferase [Rivularia sp. PCC 7116]
          Length = 313

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 132/302 (43%), Gaps = 34/302 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ E +LFFP +  LK +Y    IDV+     K  ++++++V     +D  D
Sbjct: 2   RIVALVPGGIGEQILFFPTLDNLKQKYDTSRIDVVCEPSSKAAYQVSQSVSEVLAFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G++++R YD+ +++  + +     L++T    R+++        G   
Sbjct: 62  R-NSMADFGNLIGMIRDREYDVAITSSQSWI-TGLLLWLTGINTRIAF-------KGNNS 112

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISR-RLKEVVAEKY 302
           L          N  +    MY  ++  LG           P L ++I +  +    AE+ 
Sbjct: 113 LFFNCLIPR--NEEQYAAYMYNDLLQGLGI------NSTYPELALNIPKPDIDWATAEQL 164

Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
           +N   + G  +V    E           D DS  P   W  +    RE +P    V+  E
Sbjct: 165 RNGVNDTGYILVCSPFE-----------DADSAYPADKWLTVIKDFREKQPDMPVVVASE 213

Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
              +     + +    +  T+P   G+LAA+I  +  ++ TN+A +QLA   +  +IALF
Sbjct: 214 PNNQNFVRTLSEKIPDIKETSPNNVGRLAAIIAGANLMLCTNSAPMQLAVFLQTYTIALF 273

Query: 418 SS 419
            S
Sbjct: 274 GS 275


>gi|443313881|ref|ZP_21043491.1| ADP-heptose:LPS heptosyltransferase [Synechocystis sp. PCC 7509]
 gi|442776294|gb|ELR86577.1| ADP-heptose:LPS heptosyltransferase [Synechocystis sp. PCC 7509]
          Length = 309

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 142/328 (43%), Gaps = 38/328 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ G + E +LFFP +  L+  YP   IDV+   R K  + + K+V     +D  D
Sbjct: 2   RIVALVPGKIGEQILFFPTLDDLQRVYPDAQIDVVVEPRAKAAYRVCKSVTDTIAFDFKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                A++ +++G++++R YD+ +S +   L H   L++T    R+ Y        G G 
Sbjct: 62  R-NSLADWGNLIGLLRDREYDIAISPQQQSLPH-LLLWLTGIPTRIGY-------KGKGS 112

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
           +       + +  S      Y  ++  LG         PVP L +++         ++ K
Sbjct: 113 IFLTDPVPQKLQQSPAA--KYHDLLQGLGI------NSPVPELNINVPSSDITWSEKEQK 164

Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKER 363
             G  +  Y+V +   S          D +S  PI  W +I   L+E +P   +   K+ 
Sbjct: 165 RLGINESGYVVFYDRPS---------PDHNSTYPIDSWRQIVQSLQEKQPNLPVVAIKDP 215

Query: 364 EGVEDVVGDDASIVF-----ITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
           E    V    + + F     +T+P   G+LAA I  +  ++ T++A + LA A +  ++ 
Sbjct: 216 ENELFV---KSMLAFFPDLKVTSPEDIGKLAATIAAANLLVCTDSAPMYLALAVQTYTVV 272

Query: 416 LFSSELKG-RLFVPNAEEKKCTVISSRT 442
           L  + +K  RL    +   K   I+ +T
Sbjct: 273 LGETSIKSDRLMEVKSSTAKIADIAPQT 300


>gi|254423076|ref|ZP_05036794.1| Heptosyltransferase superfamily [Synechococcus sp. PCC 7335]
 gi|196190565|gb|EDX85529.1| Heptosyltransferase superfamily [Synechococcus sp. PCC 7335]
          Length = 323

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 143/344 (41%), Gaps = 41/344 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   I+ GG+ + LLFFPA++ +K  YP   I V+A  +    + ++K V     +    
Sbjct: 2   RILAIVPGGISDQLLFFPALEDIKRVYPNAEIGVVAEPKASPAYRVSKIVDTIIPFSFSA 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVS-------YIYPNV 236
               P+++ ++LG +++R Y++VL+T L+       L+++    RV+       Y Y  V
Sbjct: 62  A-NSPSDWANLLGNVRDREYEVVLTTDLS-WSMGLLLWLSGVPTRVTLEGTSAPYFYTRV 119

Query: 237 NAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKE 296
                          +S   S    + Y  ++  +G         P P L V++      
Sbjct: 120 -------------LPDSAQSSHYQADRYHNLLSAIGI------EGPTPALSVNVPEADLA 160

Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV 356
              E     G + G  ++  G ++ S        +  +  P+  W  +    RE +P   
Sbjct: 161 WAKELRDRLGLKDGYVLMYPGPDTGS-------AEAGARFPVPSWQAVIKDFREKQPQMP 213

Query: 357 IPHEKEREGVEDV----VGDDASIVF-ITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
           I   +    +  +    VGDD  IV      GQ AA+I  +  ++  ++  +QL+ A + 
Sbjct: 214 IVLLQTEGSIPQINALRVGDDKLIVANAENLGQAAAVIAGADLLLTPDSYVMQLSTALKV 273

Query: 412 PSIALFSSELKGRLF-VPNAEEKKCTVISSRTGKLIDTPVEAVL 454
            ++ALF       +      EE +   I+S+T  + D P E VL
Sbjct: 274 FTLALFGKNKPAEMLPAAKGEETRFLGIASKTNLIADIPPETVL 317


>gi|254416721|ref|ZP_05030471.1| Heptosyltransferase superfamily [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196176461|gb|EDX71475.1| Heptosyltransferase superfamily [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 320

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 136/316 (43%), Gaps = 35/316 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + LL FP +  LK  YP   IDVI   R K  + ++ +V     +D  D
Sbjct: 2   RLLALVPGGISDQLLLFPTLDDLKANYPDAEIDVIVEPRAKGAYRVSGSVDEVLPFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKL---AGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ ++LG++++R YD  ++ +     GL     L+M+    RV Y     N  G
Sbjct: 62  R-NGLADFGNLLGIIRDREYDAAIALERRWSVGL----LLWMSGIPIRVGY----ENNQG 112

Query: 241 AGLLLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
                 + F   ++ L    Y   MY  ++  LG         P P L + + +   +  
Sbjct: 113 ------KWFFTNTVPLKTEQYAAYMYHDLLTGLGM------NTPCPELAIRVPKSALDWA 160

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP--LFV 356
             +    G  +  YI+I G        +    D D + P   W ++   + + +P    V
Sbjct: 161 EGEQARLGIPESGYIMIDG----GSNHLDQSQDIDKVYPAHKWQKVIEDIHQKQPDLTLV 216

Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
           + H  E   +   +      + +T+P   G+LAA+I  +  ++ T++A++ LA A    +
Sbjct: 217 MLHGSEDTELVAALTQVCPNLQMTSPPDSGKLAAMIAGANLLLCTDSASMHLAVAVGTYT 276

Query: 414 IALFSSELKGRLFVPN 429
           IALF      +L  P+
Sbjct: 277 IALFGPTEADKLLPPS 292


>gi|434398545|ref|YP_007132549.1| glycosyl transferase family 9 [Stanieria cyanosphaera PCC 7437]
 gi|428269642|gb|AFZ35583.1| glycosyl transferase family 9 [Stanieria cyanosphaera PCC 7437]
          Length = 309

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 148/340 (43%), Gaps = 47/340 (13%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK +YP  +IDV+   R K  + + KNV    ++D  D
Sbjct: 2   RILALVPGGIGDQILFFPTLETLKSKYPNAVIDVLVEPRAKAAYRICKNVDDVLIFDYKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAF----LFMTTARDRVSYIYPNVNAA 239
                A+Y ++LG+++++ YD+ L+T     G   F    L++     RV   Y N NA 
Sbjct: 62  R-SSLADYLNLLGIIRDQEYDVALTT-----GSPWFVELLLWLNGIPVRVG--YKNNNA- 112

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
               LLS +   +S    +   + Y  ++  LG         P PP+++++ +       
Sbjct: 113 ---WLLSSSVPFKS---EQYKAHTYHDLLAGLGI------NVPCPPIQINVPKNDINWTE 160

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VI 357
            +      +   YI+++               +  L P+  W +I   +   +P    V+
Sbjct: 161 NETNRLDIKDSGYILLYS--------------SSELYPLASWQKIVTDICSKQPELPIVL 206

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
             +         V      + I TP   G+LAA +  +  ++ T +  + LA A    +I
Sbjct: 207 LQDANNGKWISAVQQTVPSLKIITPTDLGKLAATVAGANLLLCTVSIPMYLAIATGTYAI 266

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
           AL SS  + +L  P ++      + S TGKL D   + VL
Sbjct: 267 AL-SSSTEEKLLPPKSDSYLS--LQSSTGKLADLKPDQVL 303


>gi|428220786|ref|YP_007104956.1| ADP-heptose--LPS heptosyltransferase [Synechococcus sp. PCC 7502]
 gi|427994126|gb|AFY72821.1| ADP-heptose:LPS heptosyltransferase [Synechococcus sp. PCC 7502]
          Length = 320

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 83/345 (24%), Positives = 159/345 (46%), Gaps = 52/345 (15%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD- 182
           R   ++ GG+ + LL FP +  LK  YP  LIDV+   R +  + ++++V    V++ D 
Sbjct: 2   RVLALVPGGIGDQLLMFPTLDSLKQIYPQGLIDVVVEPRSQGAYRISQSVN--KVWNFDF 59

Query: 183 DDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNA 238
                 A++ +++G ++++ YD V S     LG        L++T   +R+SY       
Sbjct: 60  KGINSLADWGNLIGTIRDQEYDAVFS-----LGQRWSVGFLLWLTGIPNRISY------- 107

Query: 239 AGAGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHP-VPPLRVSISRR-L 294
           AG G    + F   ++ L+   Y   MY  +        + +  +P    ++VSI ++ L
Sbjct: 108 AGQG----DIFLTRAIPLNRDQYAAKMYHDL-------LQGLDLNPEFSGIKVSIPKKDL 156

Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF--- 351
               AE+ +    E G YI+IHG    S A  + +G  D + P   WA++   L+     
Sbjct: 157 DWAEAEQIRLNIKESG-YILIHG---GSSALAKQKG-IDKIYPATSWAKVIESLQTRLVN 211

Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANA 408
            P+ +I   ++ + V+++    +  + +T+P   G+LA +I  +  ++ T++  + +   
Sbjct: 212 LPVVLIAGPEDGDLVKELQEQVSKPLLVTSPPDIGKLAGIIGAANLLLCTDSGPMHIGVG 271

Query: 409 REKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAV 453
                +ALF      +L     +EK+   I S  G+    P+ A+
Sbjct: 272 VGTNLVALFGPTDPEKLL---PQEKRIIAIKSPAGQ----PISAI 309


>gi|359460597|ref|ZP_09249160.1| heptosyltransferase family protein [Acaryochloris sp. CCMEE 5410]
          Length = 344

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 138/352 (39%), Gaps = 32/352 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I G V   LLFFP ++ L   Y  V IDVI   R KQ F L + V     +  ++
Sbjct: 3   RVLALIPGDVGRQLLFFPTLETLHRHYSQVEIDVIVEPRAKQAFRLCRTVDNVLTFQFEN 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G M++R YD  L+      AGL      ++     RV   Y N+    
Sbjct: 63  R-NGAADWGNLIGQMRDRRYDASLTISREWTAGL----LCWLVGIPQRVG--YENLALPA 115

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRH----------PVPPLRVSI 290
                 + F      +++         V    RP+ +   H          P P + + I
Sbjct: 116 WVRFARKRFPTLGGTVADLSGKCLTAAVPPNARPYDAQRYHDLLLGLGIDTPCPDISIQI 175

Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE 350
               +     + +  G     YI++HG       +  S  +     PI+ W  +  G  E
Sbjct: 176 PLDDQNWAKAEQERLGIAGQPYIILHG----GTGAENSDDEPSQFYPIRSWESVIEGYLE 231

Query: 351 FR---PLFVIPHEKEREGVEDVVG--DDASIVFITTPGQLAALINDSAGVIATNTAAIQL 405
            +   PL V+   + +E V  +V       IV     GQLAA I+    +I  + A + L
Sbjct: 232 RQPDTPLVVVQTAENKEWVASLVAVCPQLQIVMPQHVGQLAAFIDAGHLMICADRAPLHL 291

Query: 406 ANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
             A +   +ALF + L+    +P A       + S T K+ D P   VL  +
Sbjct: 292 GVAVKTRLVALFGA-LEPTRHLPTAPH--FAGLKSLTSKVADIPPIQVLEKV 340


>gi|443328193|ref|ZP_21056794.1| ADP-heptose:LPS heptosyltransferase [Xenococcus sp. PCC 7305]
 gi|442792163|gb|ELS01649.1| ADP-heptose:LPS heptosyltransferase [Xenococcus sp. PCC 7305]
          Length = 309

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 78/345 (22%), Positives = 146/345 (42%), Gaps = 50/345 (14%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I GG+   + FFP ++ L+  YP   IDV+   R K+ + + K V    ++D  D
Sbjct: 2   RILALIPGGIDAQINFFPTLETLQSTYPNAKIDVLVEPRAKKAYRVCKYVNDVLLFDYQD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-----TKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
                A+Y ++LG++++R Y+  LS     T  + L           R R  +++     
Sbjct: 62  R-NSMADYLNLLGIIRDREYEAALSFGTPWTVSSLLWLNGIPIRIGDRSRNPWLFSKTIP 120

Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
                L  + + AE  + S  G ++                +   P +++++ +     V
Sbjct: 121 -----LQKDQYAAEMYHSSLAGLDI----------------QANCPQVKINVPQEDIAWV 159

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLF 355
             + K  G   G YI++H               TD   P + W  I   ++  +   P+ 
Sbjct: 160 ESRQKGLGIRDG-YILLHS--------------TDESYPAKNWKSIITDIQAKQSEIPIV 204

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKP 412
           ++  ++  E V  ++     +  ++ P   G+ AA+I  +   ++T++ +I LA A    
Sbjct: 205 LLQDQQNGEWVTQMLESHPQLK-VSAPSDVGKSAAIIAGANLFLSTDSVSISLAVAVGTY 263

Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           +IALF+S    +  +P   EK   + SS TGK+ D     VL  +
Sbjct: 264 TIALFNSTSDSKKMLPPVSEKYVNIPSS-TGKIADIAPNTVLEKL 307


>gi|218245491|ref|YP_002370862.1| glycosyl transferase [Cyanothece sp. PCC 8801]
 gi|257058526|ref|YP_003136414.1| glycosyl transferase family protein [Cyanothece sp. PCC 8802]
 gi|218165969|gb|ACK64706.1| glycosyl transferase family 9 [Cyanothece sp. PCC 8801]
 gi|256588692|gb|ACU99578.1| glycosyl transferase family 9 [Cyanothece sp. PCC 8802]
          Length = 315

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 76/343 (22%), Positives = 151/343 (44%), Gaps = 41/343 (11%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK +YP   IDVI   R K  + +   V     +D  D
Sbjct: 2   RILALVPGGIGDQILFFPTLETLKTQYPKATIDVIVEPRAKAAYRVCPIVHEVLAFDYRD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
                A+Y ++LG++++R YD+ LS     LG        L++     RV Y       +
Sbjct: 62  R-NGLADYLNLLGIIRDREYDIALS-----LGQRWTIEMLLWLNGIPLRVGY------QS 109

Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
                LS     +S   S+    ++  ++  L        ++P PP+++++ +   +   
Sbjct: 110 STAWFLSNPVPLKS---SQYTPEIFHDILTGLRI------QNPCPPVKIALPKEDIDWAE 160

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPH 359
            + K    ++  YI+I+  +S +  +           P+ +W ++   ++  +P   I  
Sbjct: 161 AEQKRLDLKETGYILIYPFQSPNPKNS---------YPVGLWQKVIANIQAKQPNLPIVL 211

Query: 360 EKEREGVEDV--VGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
            ++   VE V  + ++   + +T P   G+ AA+I  +  ++ T+  A+ +A A    +I
Sbjct: 212 LQDANNVEKVALMLENTPNLKVTEPGDVGKFAAMIAAANLMLCTDNEALPVAIAVGTYTI 271

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +   P ++  +   I S +  + D   E +L  M
Sbjct: 272 ALFGPTDPTQSIPPGSD--RYLGIQSTSDNVADIDPETILQKM 312


>gi|158337113|ref|YP_001518288.1| heptosyltransferase family protein [Acaryochloris marina MBIC11017]
 gi|158307354|gb|ABW28971.1| heptosyltransferase family protein [Acaryochloris marina MBIC11017]
          Length = 344

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/352 (23%), Positives = 139/352 (39%), Gaps = 32/352 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   +I G V   LLFFP ++ L   Y  + IDVI   R KQ F L + V     +  ++
Sbjct: 3   RVLALIPGDVGRQLLFFPTLETLHRHYSQIEIDVIVEPRAKQAFRLCRTVDNVLTFQFEN 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
                A++ +++G M++R YD  ++      AGL      ++     RV   Y N+    
Sbjct: 63  R-NGAADWGNLIGQMRDRRYDASITISREWTAGL----LCWLVGIPQRVG--YENLALPA 115

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRH----------PVPPLRVSI 290
                 + F      +++         V    RP+ +   H          P P + + I
Sbjct: 116 WVRFARKRFPTLGGTVADLSGKCLTAAVPPNARPYDAQRYHDLLLGLGIDTPCPDISIQI 175

Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE 350
               +     + +  G     YI++HG       +  S  +     PI+ W  +  G  E
Sbjct: 176 PVDDQNWAKAEEERLGIAGKPYIILHG----GTGAENSDDEPSQFYPIRSWESVIEGYLE 231

Query: 351 FR---PLFVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQL 405
            +   PL V+   + ++ V  +V     +  +T    GQLAA I+    +I  + A + L
Sbjct: 232 RQPDTPLVVVQTPENKDWVASLVAVCPQLQIVTPQHVGQLAAFIDAGHLMICADRAPLHL 291

Query: 406 ANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
             A +   +ALF + L+    +P A       + S T K+ D P   VL  +
Sbjct: 292 GVAVKTRLVALFGA-LEPTRHLPTA--ANFAGLKSLTSKVADIPPIQVLEKV 340


>gi|220906649|ref|YP_002481960.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
 gi|219863260|gb|ACL43599.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7425]
          Length = 322

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/343 (23%), Positives = 154/343 (44%), Gaps = 34/343 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP ++ LK +YP   +DV+   R +  + +  +V     +D   
Sbjct: 2   RVVALVPGGIGDQILFFPTLEDLKRQYPQAEVDVVVEPRARAAYRVCPHVDRVLAFDFKG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAF-LFMTTARDRVSYIYPNVNAAGAG 242
               PA++ ++LGV+++  Y++ LS  L       F L++     RVS+           
Sbjct: 62  R-NGPADWLNLLGVVRDGEYNIGLS--LGSSWTVGFLLWLMGIPVRVSF-----TKGKKS 113

Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
            LL+       +N  +    +Y  ++  LG         P P L+VSI R   E    + 
Sbjct: 114 TLLTNPV---QLNPDQYAAALYHDLLKGLGV------TTPAPDLKVSIPRSDLEWAEGER 164

Query: 303 KNAGAEQ--GKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
           +  G +     YI+IHG      + M      D + P++ W ++   ++  +   P+ ++
Sbjct: 165 QRLGLDTAGAGYILIHG----GSSQMARTKGIDKIYPVEKWQQVIQNIQARQPDLPIAIV 220

Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
              +++E    ++     I  +T+P   G+LAALI  ++ ++ T++A + L  A +   +
Sbjct: 221 QGPEDQEWAAAMLQSCPGIK-VTSPDDIGKLAALIAAASLMLCTDSAPMHLGVAVKTYLV 279

Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
           ALF      +L      + +   I S TG++ D   E +L  +
Sbjct: 280 ALFGPTDPAKLL---PRDDRFIGIKSPTGQIADIAPEKILEKI 319


>gi|22299676|ref|NP_682923.1| glycosyl transferase family protein [Thermosynechococcus elongatus
           BP-1]
 gi|22295860|dbj|BAC09685.1| tlr2133 [Thermosynechococcus elongatus BP-1]
          Length = 320

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 73/343 (21%), Positives = 143/343 (41%), Gaps = 37/343 (10%)

Query: 122 VRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL 181
           ++    ++ GG+ + +LFFP +  LK  +P   +DV+   R    ++L+ +V     +D 
Sbjct: 1   MKNIVALVPGGIGDQILFFPTLDDLKAHFPEAQLDVVVEPRAMAAYQLSGSVHQVIPFDF 60

Query: 182 DDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAA----FLFMTTARDRVSYIYPNVN 237
            D     A++ +++G ++   YD +LS     LG +      L++T    RV +    +N
Sbjct: 61  KDR-NALADWANLIGRLREGEYDAILS-----LGRSKAVRFLLWLTGIPKRVGFA--KLN 112

Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
             G    L++   A  +NL +     Y  ++   G         P P  +  I+   ++ 
Sbjct: 113 PLG---FLTD---AIPVNLEQYSAATYHDLLKAFGI------TSPCPLPKAVITAEDRQW 160

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE---FRPL 354
              + +  G   G Y ++HG      + M      + + P   W E+   L +     P 
Sbjct: 161 ATAEQQRLGLSSG-YRLLHG----GSSQMALSKGINKIYPPTRWVEVIQTLAQEEPHLPF 215

Query: 355 FVIPHEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKP 412
            V+   +++  V  +     D  I +     +LAAL+  +  ++ T++  + +  A   P
Sbjct: 216 IVVCGPEDQAWVRALTQALPDIKISYPPDLRKLAALMQQAQQMLCTDSGPMHIGVAVGTP 275

Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
            +ALF      +L      + +   + S TG + D PV AV+ 
Sbjct: 276 LVALFGPTDPAKLL---PADPRFRAVKSPTGNIGDIPVSAVIE 315


>gi|86609430|ref|YP_478192.1| heptosyltransferase family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557972|gb|ABD02929.1| heptosyltransferase family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 335

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 129/303 (42%), Gaps = 29/303 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFPA+  L+ R+P   ++V+   R    +E+  +V     +   +
Sbjct: 15  RILALVPGGIGDQILFFPALASLRQRFPEAELEVVVEPRAAGAYEVCPSVSRVLTFPFKE 74

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                     +  + + R YD VLS   + LG    L++T    RV Y  P     G G 
Sbjct: 75  QLSLGDLSDLLGRI-RERQYDGVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 126

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                +  + + L    Y   MY  +V   G    + P   +P +R+  +  L+    E+
Sbjct: 127 ---RPWLTDPVPLKPAQYAAQMYHDLVQGFG---IAAPAG-LPQIRLKKA-DLEWAEQER 178

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            +  G     Y+++H       + +        + P   W ++  G RE R   PL ++ 
Sbjct: 179 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPSANWVKVIKGFREQRPELPLVLVV 234

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             ++ +  E  +     +  ++ P   GQLAALI  +  ++  ++A + LA A + P +A
Sbjct: 235 GPEDEQLAEGFLAQLPELP-VSRPVRLGQLAALIGGATLLLCADSAPMHLAVAVQTPLVA 293

Query: 416 LFS 418
           LF 
Sbjct: 294 LFG 296


>gi|86607954|ref|YP_476716.1| heptosyltransferase family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86556496|gb|ABD01453.1| heptosyltransferase family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 337

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 69/303 (22%), Positives = 128/303 (42%), Gaps = 29/303 (9%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG+ + +LFFP +  L+ R+P   ++V+   R    +E+  +V     +   +
Sbjct: 17  RLLALVPGGIGDQILFFPTLASLRQRFPEAELEVVVEPRAAAAYEVCPSVSRVLTFPFKE 76

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
                     +  + + R YD VLS   + LG    L++T    RV Y  P     G G 
Sbjct: 77  QLSLGDLSDLLGRI-RERQYDGVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 128

Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
                +  + + L    Y   MY  +V   G    + P   +P +R+  +  L+    E+
Sbjct: 129 ---RPWLTDPVPLKPAQYAAQMYHDLVQGFG---IAAPAG-LPQIRLKKA-DLEWAEQER 180

Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
            +  G     Y+++H       + +        + P   W ++  G RE R   PL ++ 
Sbjct: 181 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPSANWVKVIKGFREQRPELPLVLVV 236

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
             ++ +  E  +     +  ++ P   GQLAALI  +  ++  ++A + LA A + P +A
Sbjct: 237 GPEDEQLAEGFLAQLPELP-VSRPVRLGQLAALIGGATLLLCADSAPMHLAVAVQTPLVA 295

Query: 416 LFS 418
           LF 
Sbjct: 296 LFG 298


>gi|443474618|ref|ZP_21064590.1| glycosyl transferase family 9 [Pseudanabaena biceps PCC 7429]
 gi|443020604|gb|ELS34544.1| glycosyl transferase family 9 [Pseudanabaena biceps PCC 7429]
          Length = 314

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/318 (21%), Positives = 133/318 (41%), Gaps = 41/318 (12%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           R   ++ GG  + LLFFP +  LK +YP   IDV+   R    + + ++V     +D  D
Sbjct: 3   RILALVPGGTGDQLLFFPTLDTLKQQYPNAEIDVVVEPRAMAAYRVCQSVNRVLKFDFKD 62

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG- 242
                A++ ++LG +++R YD  + T+   L     L+++    R+S+        G G 
Sbjct: 63  R-NSLADFGNLLGTIRDREYDAAIVTQ-PNLSMNVLLWLSGIPKRISF-------KGQGD 113

Query: 243 LLLSETFTAESMNLSE-RGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
            LL++    +    +  + +N+   +         ++ +H  PP++V++ +   +    +
Sbjct: 114 FLLTDIIAMDPQEYTAVQNHNLLMAV---------NIQKH-CPPIKVNLPKNDLDWSTNE 163

Query: 302 YKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
            K  G +Q  +I+++ G  ++               P   WA IA  L+  +P    V  
Sbjct: 164 QKRLGIQQSGFILLNCGAYAN--------------YPAASWATIAKDLQAKQPNLPIVAI 209

Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
                  +   +      + IT+P   G+L A I  +   I     A+QL  A     +A
Sbjct: 210 DSVNNAALLKQLTTLVPNLLITSPTDIGKLTAFIASANLFICAEGDAMQLGVAVGTALVA 269

Query: 416 LFSSELKGRLFVPNAEEK 433
           +  +       +P AE++
Sbjct: 270 ILGANTPAGKSLPIAEKR 287


>gi|427418291|ref|ZP_18908474.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 7375]
 gi|425761004|gb|EKV01857.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 7375]
          Length = 315

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/156 (23%), Positives = 74/156 (47%), Gaps = 10/156 (6%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
               ++ GG+ + L FFP +  +++ YP   I V+     K  + ++K V     ++   
Sbjct: 2   HVLALVPGGISDQLQFFPTLATIQEIYPSAEISVVVEPTSKSAYRVSKAVSEVIPFNYRG 61

Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
               PA++ ++LG++++R +++VLS           L+++    RV Y     ++A A  
Sbjct: 62  Q-NSPADWANLLGIIRDREFELVLSAS-DRWEEGVLLWLSGIPTRVGY-----SSAQAPW 114

Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVP 279
           L + T TA+         + Y+ +++ L  P +S P
Sbjct: 115 LYTNTVTADKEPYQA---SQYQALLEGLAEPSKSAP 147


>gi|452852682|ref|YP_007494366.1| Glycosyl transferase family 9 [Desulfovibrio piezophilus]
 gi|451896336|emb|CCH49215.1| Glycosyl transferase family 9 [Desulfovibrio piezophilus]
          Length = 320

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 72/313 (23%), Positives = 123/313 (39%), Gaps = 27/313 (8%)

Query: 128 IISGGVYENLLF-FPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWP 186
           I + G+  N LF  PA   LK  YPG  + ++  + GK     + NV   +V++   D  
Sbjct: 11  IFNNGMLGNTLFNMPAAAWLKQNYPGCHVGMVVDSVGKTLILHDPNVDTIHVFNKKKD-S 69

Query: 187 EPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLS 246
              + + +L + K RY       K  G+ +     +    DR  +    +  +   L L 
Sbjct: 70  LRQQLSLLLELRKTRYNVSFHLRK--GVRNELLARLAGIPDRAGF---RLKGSPQHLTLK 124

Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
                    L  R   M E M   LGRP +      +   R++I       +    + AG
Sbjct: 125 LEEDTSVHRLESRALLM-EAM---LGRPVK------LDRPRLNICTEANAAMTRFIEQAG 174

Query: 307 AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGV 366
           A  G Y V+H        +  S+   D  LP  V+A+ A  L E  P+F+I    E E V
Sbjct: 175 AVPGNYFVLH-------PTGNSQNGKDWSLP--VYAQAAQRLAEDAPVFIICMPCEEEAV 225

Query: 367 EDVVGDDASIVFIT-TPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRL 425
              +   + I + T +    + LI +++  +  ++    LA A E P + ++  +     
Sbjct: 226 NKAIPSGSHIHYYTGSIANTSELIRNASVFLGNDSGPAHLACAWEVPRLVVYKDDEANYT 285

Query: 426 FVPNAEEKKCTVI 438
                 +  CT++
Sbjct: 286 KWKPIHKTGCTIL 298


>gi|428207215|ref|YP_007091568.1| glycosyl transferase family protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428009136|gb|AFY87699.1| glycosyl transferase family 9 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 803

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 79/350 (22%), Positives = 142/350 (40%), Gaps = 76/350 (21%)

Query: 120 GDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFEL--------NK 171
           GD+R    +    + + ++  P ++ +K+  P   + ++AS  G  T  L        + 
Sbjct: 447 GDLRNILVMRLDNIGDVIMTSPVLRTIKENLPTAKLTLMASPAGALTQPLLPWVDEVLSW 506

Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVS 230
            V W ++  LD D   PA    ++  +  R +D  ++ T  +   + A L        V 
Sbjct: 507 RVLWQDLGRLDFD---PAREWKLIETLNQRQFDAAIILTSFSQSPYPAAL--------VC 555

Query: 231 YIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSI 290
           Y+      AG GL L E+         E G  +    VD L      V R+         
Sbjct: 556 YL------AGIGLRLGES--------KESGQGILTHWVDPLPDEIHQVERN--------- 592

Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKA------------SMQSRGDTDSLLPI 338
             RL E V  K ++      + + +H  ++ + A            + QSR   D     
Sbjct: 593 -LRLIEAVGFKVRD------RRLCLHVPQTITAAIPNSYILLNPWTTCQSRNYDDCR--- 642

Query: 339 QVWAEIANGLREFR--PLFVIPHEKEREGVE---DVVGDDASIVFITTPGQLAALINDSA 393
             +A  A  L +    P+ V    K+R+      D++GD   ++  T+  + AALI ++ 
Sbjct: 643 --FATAARQLSQMTSLPIVVTGTNKDRDRARPLLDILGDAIDLIGATSLAEFAALIANAR 700

Query: 394 GVIATNTAAIQLANAREKPSIALFS-SELKGR---LFVPNAEEKKCTVIS 439
            V+  NT+ + +A+A   PS+ LF+ +EL+ +    + P    ++ TV S
Sbjct: 701 LVLTNNTSTMHIADATNTPSVILFAGTELESQWQPCYAPVKLLRRPTVCS 750


>gi|429106789|ref|ZP_19168658.1| Lipopolysaccharide heptosyltransferase III [Cronobacter malonaticus
           681]
 gi|426293512|emb|CCJ94771.1| Lipopolysaccharide heptosyltransferase III [Cronobacter malonaticus
           681]
          Length = 359

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 69/301 (22%), Positives = 117/301 (38%), Gaps = 47/301 (15%)

Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTD--- 193
           LL  P IQ L+ R+PG  IDV+     +     N ++    +Y +D  W +   Y     
Sbjct: 28  LLTTPLIQALRQRFPGAQIDVLLYEETRDMLSANPDIH--RIYGIDRQWKKQGVYHQLKM 85

Query: 194 ---ILGVMKNRYYDMVLSTKLAGLGHAAFLF-MTTARDRVSYIYPNVNAAGAGLLLSETF 249
              +L  ++ + YD+VL+  LA    +A     T AR RV + +P           ++  
Sbjct: 86  ELSLLFSLRKQKYDLVLN--LADQWRSAICTRFTGARMRVGFAFPKRRHPFWAFCHTQLV 143

Query: 250 TAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQ 309
           + +  +      +  EQ +  L       PR+   P R+S +   ++  A +       Q
Sbjct: 144 STQDHHTQ----HTVEQNLSILSAIGEDYPRN--MPARMSYTG--QDWQACQSLLPSDFQ 195

Query: 310 GKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAE-----IANGLREFRPLFVI---PHEK 361
             YIVI                  S    + W E     + N L       V+   P  K
Sbjct: 196 DDYIVIQ---------------PTSRWFFKCWREGNISAVINALSAAGHNIVLTSGPDAK 240

Query: 362 EREGVEDVVGDDASIVFITTPG-----QLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
           E++ ++ ++   A     +  G     QLAALI+ +   I  ++  + +A A   P +AL
Sbjct: 241 EKQMIDTIIAGCADARLHSLAGQLTLRQLAALIDHAKLFIGVDSVPMHMAAALGTPLVAL 300

Query: 417 F 417
           F
Sbjct: 301 F 301


>gi|373487956|ref|ZP_09578622.1| glycosyl transferase family 9 [Holophaga foetida DSM 6591]
 gi|372007730|gb|EHP08359.1| glycosyl transferase family 9 [Holophaga foetida DSM 6591]
          Length = 317

 Score = 45.4 bits (106), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 69/139 (49%), Gaps = 9/139 (6%)

Query: 330 GDTDSLLPIQVWAEIANGLREF--RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAA 387
           GD+    P + +  +A  LRE     + V+    ERE  E +VG D+ +   TTP +LA 
Sbjct: 183 GDSKKRWPAERFRAVAVHLRECYDAEIIVLWGPGEREIAEGIVGKDSELAPPTTPLELAG 242

Query: 388 LINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGK--L 445
           L++    +I+ ++  + LA     P +ALFS+     +++P       T++    G+  L
Sbjct: 243 LLSSVDLLISGDSGPMHLAAGLGCPLVALFSTADPA-VYLPPGN----TLLVRHKGRIDL 297

Query: 446 IDTPVEAVLNAMQIFNESL 464
              PVE+V+ A++    +L
Sbjct: 298 GGMPVESVIEAVEAMRGAL 316


>gi|89074603|ref|ZP_01161068.1| RfaF protein [Photobacterium sp. SKA34]
 gi|89049541|gb|EAR55101.1| RfaF protein [Photobacterium sp. SKA34]
          Length = 333

 Score = 45.1 bits (105), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 55/102 (53%), Gaps = 2/102 (1%)

Query: 351 FRPLFVIP-HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAR 409
           F  +F++P + K ++  E  +   +S+  + +      LI  S GVI+ +TA I +A+A 
Sbjct: 227 FNCVFILPSYIKNKDVWESQLAPVSSVCCVDSIQDAIYLIKKSKGVISVDTAIIHIASAY 286

Query: 410 EKPSIALFS-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPV 450
             P+I LF  S+L+ +L+ P + +K    ++     +I++ +
Sbjct: 287 NIPTIGLFKESQLECKLWHPQSSKKHVGALTGNVEDIINSRI 328


>gi|404492876|ref|YP_006716982.1| ADP-heptose--lipopolysaccharide heptosyltransferase [Pelobacter
           carbinolicus DSM 2380]
 gi|77544951|gb|ABA88513.1| ADP-heptose--lipopolysaccharide heptosyltransferase [Pelobacter
           carbinolicus DSM 2380]
          Length = 350

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 66/299 (22%), Positives = 119/299 (39%), Gaps = 48/299 (16%)

Query: 140 FPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD-DDWPEPAEYTDILGV- 197
            P +  LK+ +P + ID +         E +  +R   VY L    W      + + GV 
Sbjct: 18  LPVLAYLKEAHPEIKIDWLVEEGFAPLIEGHPMLR--RVYRLGLKRWRREGWRSVLAGVK 75

Query: 198 -----MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYI------YPNVNAAGAGLLLS 246
                ++N  YDMVL  +        F  +  A  R  +       +PN+ A    ++L 
Sbjct: 76  STVHELRNERYDMVLDLQ-GNCKSGLFTLLCGAPRRYGFSFSGVREWPNLLATNRRVVL- 133

Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
              TA   ++S+R   +  +          + P H  P  R ++         EK   + 
Sbjct: 134 ---TAADHHISDRSLAVAREAFPVGNARSAAGPLHVTPQARTAV---------EKQLGSF 181

Query: 307 AEQGKYIVI--HGIESDSKASMQSRGDTDSLLPIQVWAEIANGL---REFRPLFVIPHEK 361
              G  +V+  +G   ++K           L P+  W  +A  L      RP+ +  +E 
Sbjct: 182 NMNGPSLVVLQYGTTWETK-----------LWPLDSWQRLARTLCTEDNLRPVLIWGNEA 230

Query: 362 EREGVEDV--VGDDASIVF-ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           ER+  E +    D  ++++   T  +LAAL+  +  VI  +T  I +A A + P++++F
Sbjct: 231 ERDAAEAIYRATDGQAVIWPRGTLQELAALLERADLVIGGDTGPIHIAAALDTPTVSIF 289


>gi|253700849|ref|YP_003022038.1| glycosyl transferase family protein [Geobacter sp. M21]
 gi|251775699|gb|ACT18280.1| glycosyl transferase family 9 [Geobacter sp. M21]
          Length = 371

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 31/62 (50%)

Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD 182
           RR   I  GG+ + +L  PA+  L+  +PG  IDV+A +R    F +   + W   YD  
Sbjct: 50  RRFLVIRPGGIGDAVLLVPALTALQKAFPGCRIDVLAESRNAAAFLMCPGLNWVYRYDCL 109

Query: 183 DD 184
            D
Sbjct: 110 SD 111


>gi|95928765|ref|ZP_01311511.1| glycosyl transferase, family 9 [Desulfuromonas acetoxidans DSM 684]
 gi|95135110|gb|EAT16763.1| glycosyl transferase, family 9 [Desulfuromonas acetoxidans DSM 684]
          Length = 372

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE--P 188
           GG+ + +L  P +Q  +D + G  I ++A  R  Q F+L + V     YD   DW +   
Sbjct: 48  GGIGDAVLLLPTLQACRDFFEGASIYILAEKRNAQVFDLCEGVAEVWCYDKLGDWKDFFF 107

Query: 189 AEYTDILGVMKNRYYDMVLSTKL 211
            +Y  I+   ++ Y   V+S  L
Sbjct: 108 KDYDLIIDTEQSHYLSAVISKLL 130


>gi|319638761|ref|ZP_07993520.1| 1,5 heptosyltransferase I [Neisseria mucosa C102]
 gi|317400002|gb|EFV80664.1| 1,5 heptosyltransferase I [Neisseria mucosa C102]
          Length = 322

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 50/112 (44%), Gaps = 16/112 (14%)

Query: 312 YIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DV 369
           Y+ +H    DSK           L P++ W  +   L E R   V +P   E E +  + 
Sbjct: 179 YVALHATSRDSK-----------LWPVEHWRALLKKLNEKRRYTVYLPWGNETEKMRAEQ 227

Query: 370 VGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           +  D    F+       Q A L+ ++ G+I  +T  + LANA EKP + +++
Sbjct: 228 IASDLPFAFVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVVGIYT 279


>gi|375145548|ref|YP_005007989.1| glycosyl transferase family protein [Niastella koreensis GR20-10]
 gi|361059594|gb|AEV98585.1| glycosyl transferase family 9 [Niastella koreensis GR20-10]
          Length = 372

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 64/130 (49%), Gaps = 17/130 (13%)

Query: 337 PIQVWAEIANGLREF-RPLFVIPHEKEREGVEDV---VGDDAS-IVFITTPGQLAALIND 391
           P+Q W  +   + +   P+ +    KE+E    +   +GD+A+ +    T G+  AL   
Sbjct: 211 PLQHWITLGQLVHDTGLPILLTGTTKEKELTSAIAAGIGDNATDVAGQFTTGEFIALTAG 270

Query: 392 SAGVIATNTAAIQLANAREKPSIALFSS--------ELKGRLF---VPNAEEKKCTVIS- 439
           +AGV++ NTA I +A A E P + L++         ++ G++F   VP + + K  VI  
Sbjct: 271 AAGVVSVNTATIHIAAACETPVVVLYALTNPQHTPWKVSGQVFPYDVPESLQSKNEVIGY 330

Query: 440 SRTGKLIDTP 449
            R  +++  P
Sbjct: 331 VRDNRMMKQP 340


>gi|298242804|ref|ZP_06966611.1| glycosyl transferase family 9 [Ktedonobacter racemifer DSM 44963]
 gi|297555858|gb|EFH89722.1| glycosyl transferase family 9 [Ktedonobacter racemifer DSM 44963]
          Length = 406

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 59/253 (23%), Positives = 102/253 (40%), Gaps = 37/253 (14%)

Query: 203 YDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYN 262
           YD+ ++ +      A  +++     RV Y  P           +  F + ++   E  ++
Sbjct: 119 YDVAINLRPDFWWGAVLIYLAGIPRRVGYALPP----------ATPFLSHALPFPEHQHS 168

Query: 263 MYEQM------VDWLGRPFRSVPRHPVP-PLRVSISRRLKEVVAEKYKNAG-AEQGKYIV 314
            +  +      +  LG+P    P  P   PLR   + + +  VA+   +AG A +   +V
Sbjct: 169 TFYNLELASAGLQALGQPPLEAPFTPERYPLRFEPTEQERRWVAQTLHDAGIAPETPVVV 228

Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPL--FVI---PHEKE--REGVE 367
           IH             G    L   + WA  AN L+   P   FV+   P E+    E  E
Sbjct: 229 IH----------PGTGGAVKLWRSEAWANCANQLQASYPTLRFVLTGTPKERPLLEEVAE 278

Query: 368 DVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFV 427
            +V    SI   +  GQL AL+  ++ V+  +   + LA  ++ P+I +F      R+F 
Sbjct: 279 GMVTQPLSISDSSV-GQLTALLQRASLVMGVDNGPLHLAVTQDVPTIRIFGPT-DSRIFG 336

Query: 428 PNAEEKKCTVISS 440
           P     +  VI+S
Sbjct: 337 PWGAPARHRVITS 349


>gi|357032050|ref|ZP_09093990.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
           morbifer G707]
 gi|356414277|gb|EHH67924.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
           morbifer G707]
          Length = 302

 Score = 42.7 bits (99), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 82/179 (45%), Gaps = 20/179 (11%)

Query: 286 LRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEI 344
           LR +    L   V +  K AG E  G+Y V+  +   +    Q R       P+  +A++
Sbjct: 133 LRAAGVPSLPRTVPDWLKGAGPEIAGRYAVL--VPGAAPHRPQKR------WPVSRFADV 184

Query: 345 ANGL--REFRPLFVIPHEKEREGVEDV--VGDDASIVFITTPGQLAALINDSAGVIATNT 400
           A  L  R  RP+ +   E++   +  +    D   +   TT  +LAA+++ +  VI  +T
Sbjct: 185 ARSLEARGIRPVIIGGAEEKPLALAIIRTCPDALDLTGKTTLPELAAVLDGAETVIGNDT 244

Query: 401 AAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
             + LA A + P IALFSS    RL  P       T+ + RT  L+  P  A+L A ++
Sbjct: 245 GPMHLAAAMDTPIIALFSSASDPRLTAP------LTLSADRTEILV-VPDLALLPASRV 296


>gi|241760711|ref|ZP_04758803.1| lipopolysaccharide heptosyltransferase I [Neisseria flavescens
           SK114]
 gi|241318892|gb|EER55418.1| lipopolysaccharide heptosyltransferase I [Neisseria flavescens
           SK114]
          Length = 322

 Score = 42.4 bits (98), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 77/184 (41%), Gaps = 33/184 (17%)

Query: 241 AGLLLSETFTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           A L  S+ F   +  N   R   ++ Q+  +      ++P   V  L V  S RLK +  
Sbjct: 123 AALAYSKMFAVPKGKNAVWRNRELFAQVFGY------TMPETQVSGLSVPESGRLKNL-- 174

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP 358
                   +   Y+ +H    DSK           L P++ W  +   L E +   V +P
Sbjct: 175 --------KSPYYVALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLP 215

Query: 359 --HEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
             +E E+   E +  D   AS+       Q A L+ ++ G+I  +T  + LANA EKP +
Sbjct: 216 WGNETEKMRAEQIASDLPFASVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVV 275

Query: 415 ALFS 418
            +++
Sbjct: 276 GIYT 279


>gi|261379628|ref|ZP_05984201.1| lipopolysaccharide heptosyltransferase I [Neisseria subflava
           NJ9703]
 gi|284798112|gb|EFC53459.1| lipopolysaccharide heptosyltransferase I [Neisseria subflava
           NJ9703]
          Length = 322

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 16/112 (14%)

Query: 312 YIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP--HEKEREGVED 368
           Y+ +H    DSK           L P++ W  +   L E +   V +P  +E E+   E 
Sbjct: 179 YVALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLPWGNETEKMRAEQ 227

Query: 369 VVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           +  D   AS+       Q A L+ ++ G+I  +T  + LANA EKP + +++
Sbjct: 228 IASDLPFASVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVVGIYT 279


>gi|349610547|ref|ZP_08889889.1| lipopolysaccharide heptosyltransferase I [Neisseria sp. GT4A_CT1]
 gi|348609763|gb|EGY59489.1| lipopolysaccharide heptosyltransferase I [Neisseria sp. GT4A_CT1]
          Length = 326

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 93/229 (40%), Gaps = 37/229 (16%)

Query: 197 VMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESM 254
           V++   +D VL ++  GL  +A  F   A+  +  +  N    G A L  ++T+   +  
Sbjct: 81  VLRQEAFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLNKNSAREGLAALAYAKTYAVPKGK 137

Query: 255 NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIV 314
           N   R   ++ Q+  +      ++P   V  L V  + RLK +          EQ  Y  
Sbjct: 138 NAVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAA 181

Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGD 372
           +H    DSK           L P++ W  +   L E +   V +P   E E    + + D
Sbjct: 182 LHATSRDSK-----------LWPVESWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIAD 230

Query: 373 DASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
                 +       Q A L+  + G++  +T  + LANA EKP I +++
Sbjct: 231 GLPFAIVCDKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVIGIYT 279


>gi|419797579|ref|ZP_14323048.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca VK64]
 gi|385697661|gb|EIG28072.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca VK64]
          Length = 326

 Score = 41.6 bits (96), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 99/234 (42%), Gaps = 49/234 (20%)

Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
           ++   +D VL ++  GL  +A  F   A+  +  +  N    G A L  ++T+T  +  N
Sbjct: 82  LRQEAFDFVLDSQ--GLIKSA-CFAKIAKSPIYGLDKNSAREGLAALAYAKTYTVPKGKN 138

Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
              R   ++ Q+  +      ++P   V  L V  + RLK +          EQ  Y  +
Sbjct: 139 AVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAAL 182

Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPH----EKEREG-VED- 368
           H    DSK           L P++ W  +   L E +   V +P     EK R G + D 
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAGQIADG 231

Query: 369 ----VVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
               +V D  +++      Q A L+  + G+I  +T  + LANA EKP + +++
Sbjct: 232 LPFAIVCDKMNLL------QAAYLLKHAVGIIGVDTGLLHLANALEKPVVGIYT 279


>gi|225077329|ref|ZP_03720528.1| hypothetical protein NEIFLAOT_02388 [Neisseria flavescens
           NRL30031/H210]
 gi|224951321|gb|EEG32530.1| hypothetical protein NEIFLAOT_02388 [Neisseria flavescens
           NRL30031/H210]
          Length = 322

 Score = 41.6 bits (96), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 76/184 (41%), Gaps = 33/184 (17%)

Query: 241 AGLLLSETFTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
           A L  S+TF   +  N   R   ++ Q+  +      ++P   V  L V  S RLK +  
Sbjct: 123 AALAYSKTFAVPKGKNAVWRNRELFAQVFGY------TMPETQVFGLSVPESGRLKNL-- 174

Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP 358
                   +   Y  +H    DSK           L P++ W  +   L E +   V +P
Sbjct: 175 --------QSPYYAALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLP 215

Query: 359 --HEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
             +E E+   E +  D   AS+       Q A L+  + G+I  +T  + LANA EKP +
Sbjct: 216 WGNETEKMRAEQIASDLPFASVCDKMNLLQAAFLLKKAEGIIGVDTGLLHLANALEKPVV 275

Query: 415 ALFS 418
            +++
Sbjct: 276 GIYT 279


>gi|402835189|ref|ZP_10883768.1| heptosyltransferase [Selenomonas sp. CM52]
 gi|402275980|gb|EJU25110.1| heptosyltransferase [Selenomonas sp. CM52]
          Length = 353

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 65/319 (20%), Positives = 128/319 (40%), Gaps = 35/319 (10%)

Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
           A+ P++   +++R   I    + + +L  PAI+ +K+  PG  I ++      +  E+N 
Sbjct: 3   AAKPIRF-AELKRILVINLAFIGDVILTTPAIRAVKEACPGAAITMLTVPLTAEVAEMNP 61

Query: 172 NVRWANVYDLDDDWPEPAEYTDILGV------MKNRYYDMVLSTKLAGLGHAAFLFMTTA 225
            V    +YD      +  +   +LG+      ++ R +DM +    A  G A   F+   
Sbjct: 62  FVDDVMMYD------KKGKDKGLLGMFSVGRRLRRRNFDMAICMNFATRG-AIVAFLAGI 114

Query: 226 RDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPP 285
           + R+ Y     +A  AG  L+   +A    +     N +  +V  LG   R++     PP
Sbjct: 115 KIRLGY-----DAQHAGFFLTHKASAIRTTVQHETINHFGVLVP-LGFATRNLSLVCEPP 168

Query: 286 LRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA 345
                  R K   A+K +       +YI I       + ++     +  L      A+ A
Sbjct: 169 ------ERAKRSFAQKAEKLALLACEYIAICPFGRHERRNLSEETVSGFLREFFADADAA 222

Query: 346 NGLREFRPLFVIPHEKEREGVEDVVGDDAS-----IVFITTPGQLAALINDSAGVIATNT 400
                 R +++I  + + +G+E +     S     I    T  +LA  +  +  ++  +T
Sbjct: 223 KN----RKVYLIGGKNDAQGLEVIARSTGSNRVQVIAGTLTLQELALFLEKADLLVTVDT 278

Query: 401 AAIQLANAREKPSIALFSS 419
               +A A   P++ +FS+
Sbjct: 279 GPAHIAQAVHCPTLDIFST 297


>gi|410944368|ref|ZP_11376109.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
           frateurii NBRC 101659]
          Length = 302

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 76/171 (44%), Gaps = 20/171 (11%)

Query: 294 LKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL--RE 350
           L  +V E  +  G + QG+Y+V+  +   +    Q R       P+  +AE+A  L  R 
Sbjct: 141 LPRIVPEWLQGHGPKIQGRYVVL--VPGAAPHRPQKR------WPVTRFAEVAQALFQRG 192

Query: 351 FRPLFVIPHEKE--REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANA 408
             P+      +    E +  V      +   T+  +LA +++ +  VI  +T  + LA A
Sbjct: 193 ITPIIAGSAAETLLAEQIRTVCPQAIDLTGKTSLIELAGVLDRAEQVIGNDTGPMHLAAA 252

Query: 409 REKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
            ++P IALFS +   RL  P         ++S    ++D P  A+L A +I
Sbjct: 253 MDRPVIALFSQDSDPRLTAP-------LTLTSGLTTILDVPDLALLPASRI 296


>gi|414342353|ref|YP_006983874.1| lipopolysaccharide core biosynthesis protein [Gluconobacter oxydans
           H24]
 gi|411027688|gb|AFW00943.1| lipopolysaccharide core biosynthesis protein [Gluconobacter oxydans
           H24]
 gi|453331619|dbj|GAC86533.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
           thailandicus NBRC 3255]
          Length = 314

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 78/179 (43%), Gaps = 20/179 (11%)

Query: 286 LRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEI 344
           LR +  + L   V E  +  G + QG+Y+V+  +   +    Q R       P+  +AE+
Sbjct: 145 LRAAGVKPLDRTVPEWLRGHGPKIQGRYVVL--VPGAAPHRPQKR------WPVTRFAEV 196

Query: 345 ANGL--REFRPLFVIPHEKE--REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNT 400
           A  L  R   P+ V    +    E +  +      +   T+  +L  +++ +  VI  +T
Sbjct: 197 AQTLFRRGITPIIVGSATETPLAEQIRSLCPQAVDLTGKTSLIELGGVLDRAEQVIGNDT 256

Query: 401 AAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
             + LA A ++P IALFS +   RL  P A     T I       +D P  A+L A +I
Sbjct: 257 GPMHLAAAMDRPIIALFSEDSDPRLTAPLALTPGRTTI-------LDVPDLALLPASRI 308


>gi|340363420|ref|ZP_08685755.1| lipopolysaccharide heptosyltransferase I [Neisseria macacae ATCC
           33926]
 gi|339885939|gb|EGQ75628.1| lipopolysaccharide heptosyltransferase I [Neisseria macacae ATCC
           33926]
          Length = 326

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 93/229 (40%), Gaps = 37/229 (16%)

Query: 197 VMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESM 254
           V++   +D VL ++  GL  +A  F   A+  +  +  N    G A L  ++T+   +  
Sbjct: 81  VLRQEAFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAALAYAKTYAVPKGK 137

Query: 255 NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIV 314
           N   R   ++ Q+  +      ++P   V  L V  + RLK +          EQ  Y  
Sbjct: 138 NAVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAA 181

Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGD 372
           +H    DSK           L P++ W  +   L E +   V +P   E E    + + D
Sbjct: 182 LHATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIAD 230

Query: 373 DASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
                 +       Q A L+  + G+I  +T  + LANA EKP + +++
Sbjct: 231 GLPFAIVCDKMNLLQAAYLLKHAVGIIGVDTGLLHLANALEKPVVGIYT 279


>gi|300721235|ref|YP_003710505.1| lipopolysaccharide core biosynthesis protein [Xenorhabdus
           nematophila ATCC 19061]
 gi|297627722|emb|CBJ88248.1| lipopolysaccharide core biosynthesis; modification of heptose
           region of core [Xenorhabdus nematophila ATCC 19061]
          Length = 354

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 69/297 (23%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 135 ENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW-ANVYDLDDDWPEPA---- 189
           + LL  P +  LK+ +P   IDV+     K+T  +  N    ++++ +D  W +      
Sbjct: 21  DTLLITPVVNTLKENFPDSEIDVLLY---KETEPMLVNFGLVSHIFSIDRQWKKAGTKAY 77

Query: 190 --EYTDILGVMKNRYYDMVLSTKLAGLGHAAFLF-MTTARDRVSYIYPNVNAAGAGLLLS 246
              +  ++  +KNR Y+++++  LA   ++A +  ++ A  R+++ +P        LL  
Sbjct: 78  IIHHWRLINQLKNRRYNLIIN--LADQWYSALVSKISGAPTRIAFDFPK----RQNLLWK 131

Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
             F+       E   ++ EQ +  L +P      +    + +S S    + V+E     G
Sbjct: 132 SCFSHLVSTEGEESLHVVEQNLSAL-KPLN--LSNTNSDVTMSYSEGDWQKVSELLHTNG 188

Query: 307 AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGV 366
               +YIVIH            R  ++++  +Q     ++G          P EKE   +
Sbjct: 189 I-NNQYIVIHPASRWFFKCWNERKISETINALQ-----SDGHSIVVTSGTEPREKEM--I 240

Query: 367 EDVVG--DDASIVF----ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           E ++   D  +IV     IT P QL ALI ++   I  ++  + +A+A + P IALF
Sbjct: 241 EQILSSCDKENIVSLAGQITLP-QLGALIANARLFIGVDSVPMHMASALKTPCIALF 296


>gi|256828044|ref|YP_003156772.1| glycosyl transferase family protein [Desulfomicrobium baculatum DSM
           4028]
 gi|256577220|gb|ACU88356.1| glycosyl transferase family 9 [Desulfomicrobium baculatum DSM 4028]
          Length = 348

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 49/105 (46%), Gaps = 10/105 (9%)

Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR--WA-NVYDLDDDWPEPAEYTD 193
           LL  P+I+LL +RYP   IDV    +     E N +VR  WA N  +L + W E   Y  
Sbjct: 24  LLTTPSIRLLHERYPDAAIDVFTEKKCTPVLENNPHVRKIWALNKKELPNFWAELKFYAR 83

Query: 194 ILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVN 237
           I        YD+++      L    F+  M+ A+ R+SY  P  N
Sbjct: 84  I----ARENYDLIVD--FQQLPRCRFVTLMSRAQVRLSYPPPWYN 122


>gi|255039329|ref|YP_003089950.1| glycosyl transferase family protein [Dyadobacter fermentans DSM
           18053]
 gi|254952085|gb|ACT96785.1| glycosyl transferase family 9 [Dyadobacter fermentans DSM 18053]
          Length = 356

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 29/127 (22%), Positives = 55/127 (43%), Gaps = 16/127 (12%)

Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE--FRPLF 355
           +A K   AG     ++++H   S++K             P++ W E+   LR     PL 
Sbjct: 177 LAVKLGEAGVGGRPFMMLHPGVSEAKRE----------YPVEKWIEVGKLLRHQFAMPLL 226

Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP----GQLAALINDSAGVIATNTAAIQLANAREK 411
           V     E +   ++     S+          G+  AL++ + GV++ NT  I +A A + 
Sbjct: 227 VTGSASESQLATEIAEGIGSMAIAAAGMFGVGEFIALVDRAVGVVSVNTGTIHIAAAMQT 286

Query: 412 PSIALFS 418
           P++ L++
Sbjct: 287 PAVVLYA 293


>gi|385811238|ref|YP_005847634.1| ADP-heptose--LPS heptosyltransferase [Ignavibacterium album JCM
           16511]
 gi|383803286|gb|AFH50366.1| ADP-heptose:LPS heptosyltransferase [Ignavibacterium album JCM
           16511]
          Length = 353

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 61/296 (20%), Positives = 133/296 (44%), Gaps = 27/296 (9%)

Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARG-KQTFELNKNVRWANVYDLDDDWPEPA 189
            G+ + L+F PA+++L+   P   IDV+    G K  F+ N NV     ++  ++    +
Sbjct: 4   SGIGDALMFTPALKILRQELPNSEIDVLVMFGGAKDIFKNNPNVNNLIHFNFINEGTIRS 63

Query: 190 EYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVNAAGAGLLLSET 248
               +L + K   YD++++   +       + F+  A+ RV+  Y  +N    G L + T
Sbjct: 64  -LRFVLSLRKK--YDVIINVYPSNRKEYNLISFLIGAKKRVAVDYLRMNKQNFGWLNNIT 120

Query: 249 FTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
               +S++  +    M E+++D      + +P    P L + ++          ++    
Sbjct: 121 IKENDSLHNVQTNIRMIEKLID------KKIPEE--PKLELFLTEEDLNYAESFFQEKKI 172

Query: 308 EQGKYIV-IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL-REFRPLFVI---PHEKE 362
            +  +I+ IH   +  K  ++ R + +       ++++AN L + +    +I   P E E
Sbjct: 173 SENDFIIGIHPGCATLKNHIKRRWEPEK------FSQLANLLIKNYSAKILIFGGPEESE 226

Query: 363 -REGVEDVVGDDASIVFIT-TPGQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
            +  V++ +  D +I+  T T  Q +A++      I  ++A + +A+A   P +A+
Sbjct: 227 LKRQVKNNIHSDYTIIIETDTLNQSSAIMKRCDLFITNDSALMHIASALSLPIVAI 282


>gi|220908592|ref|YP_002483903.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
 gi|219865203|gb|ACL45542.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7425]
          Length = 367

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 48/77 (62%), Gaps = 5/77 (6%)

Query: 353 PLFVIPHEKEREGVE---DVVGDDA-SIVFITTPGQLAALINDSAGVIATNTAAIQLANA 408
           P+ V   E +R+  E   +++GD    ++  T   +LAAL++ ++ +++ NT+ + LA+A
Sbjct: 241 PVLVTGVENDRKSCEQLLEILGDRGIDLIGKTNLSELAALVSRASLLLSNNTSTMHLADA 300

Query: 409 REKPSIALFS-SELKGR 424
              PS+ LF+ +EL+G+
Sbjct: 301 TGTPSVILFAGTELEGQ 317


>gi|262037154|ref|ZP_06010646.1| ADP-heptose:LPS heptosyltransferase II [Leptotrichia goodfellowii
           F0264]
 gi|261748842|gb|EEY36189.1| ADP-heptose:LPS heptosyltransferase II [Leptotrichia goodfellowii
           F0264]
          Length = 338

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 129/341 (37%), Gaps = 71/341 (20%)

Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
           +   I +  + + +L  P IQ +KD+YP   ID +  +  K     N N+    +YD  D
Sbjct: 2   KILIIHTAFIGDIVLSTPLIQKIKDKYPDSQIDYMTLSGNKSIISNNPNLHDIIIYDKKD 61

Query: 184 DWPEPAEYTDILGVMKNRYYDM----------VLSTKLAGLGHAAFLFMTTARDRVSYIY 233
                  +  IL  +KN  YD+          +L  K+AG+                   
Sbjct: 62  KNKGIKGFFRILKAVKNNRYDLAIIPHRFIRSILLAKMAGIKKTV--------------- 106

Query: 234 PNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR 293
              + A    LL E    +  ++S+        ++D+ G            PL +  S+ 
Sbjct: 107 -GFDVATGSFLLHE---KKHYDMSKHEVERLLDLIDYKGERV---------PLGIYPSKE 153

Query: 294 LKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP 353
                 +  K  G  +GKY     + + +  S +     + + PI  + E+   L E + 
Sbjct: 154 ------DMDKIDGLLKGKY--YKNLITIAPGSQRP----EKIWPINKYDELIKRLSENKE 201

Query: 354 -LFVIPHEKER----------EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAA 402
            L V+   K            E V D+ G+ + + F       AAL++ S  +I+ ++A 
Sbjct: 202 NLIVVTGGKAEKTLDLKSVSTENVTDLRGEISLLEF-------AALLSKSDIIISNDSAP 254

Query: 403 IQLANAREKP-SIALFSSELKGRLFVPNAEEKKCTVISSRT 442
           I + +A EKP  I +F    K   F P  E  K  VI   T
Sbjct: 255 IHIGSAFEKPFVIGIFGPGKKSLGFFPWTE--KSNVIEDNT 293


>gi|269925269|ref|YP_003321892.1| glycosyl transferase family protein [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269788929|gb|ACZ41070.1| glycosyl transferase family 9 [Thermobaculum terrenum ATCC BAA-798]
          Length = 348

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 67/299 (22%), Positives = 129/299 (43%), Gaps = 52/299 (17%)

Query: 141 PAIQLLKDRYPGVLIDVIASARGKQTFEL----NKNVRWANVY-DLDDDWPEPAEYTDIL 195
           PA++ +K+  P   I ++ S  G     L    ++ + W +++ DL D   +PA   +++
Sbjct: 26  PALRAIKENLPETKITMLLSPGGASAAPLLPWIDEVIAWKSLWQDLGDLPFDPARELELV 85

Query: 196 GVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD-----RVSYIYPNVNAAG-----AGLLL 245
             ++   +D             A +F + ++D      V Y+      AG      G +L
Sbjct: 86  EKLREHAFD------------GAIIFTSFSQDPHVPGYVCYLAGIPLRAGESKEFGGSVL 133

Query: 246 SETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNA 305
           +    +   N+ +   N+  ++++ LG   R         L++ I         EK  + 
Sbjct: 134 THELKSVPDNVHQVERNL--RLIESLGFKVRD------RSLKIFIDEEASLSAHEKLGSM 185

Query: 306 G--AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL-REFRPLFVIPHEKE 362
           G   ++G Y+VIH       AS  +R       P+  +A IA  L R+   + V   ++E
Sbjct: 186 GIRVDEG-YVVIH-----PGASASARR-----YPVDRFAVIAQELTRKGIHVVVTGSDRE 234

Query: 363 REGVEDVVG---DDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           R  VE++     +  ++V +T+  + AA++  +  VI  NT  + L++A   P + LFS
Sbjct: 235 RLLVEEIANAAPEVKTLVGVTSLSEYAAVLKMAKAVICNNTLPMHLSDALRVPVVVLFS 293


>gi|401402142|ref|XP_003881177.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325115589|emb|CBZ51144.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 727

 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 59/138 (42%), Gaps = 18/138 (13%)

Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF 351
           R+L+E   E+  +  A Q +  V+H     S+ASM              W E A   +  
Sbjct: 265 RKLRENEQEETLSGEAFQKRLAVLHRRLVQSRASMN-------------WVEKARQRK-- 309

Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
           R        ++RE  +  V DD S + +T  GQ  A I+ ++ V+++     +LA  R  
Sbjct: 310 RDAQAAREAQQREAGDSDVSDDPSTLQLTRTGQTVAGIHSASSVLSSKGLLRRLARERTG 369

Query: 412 PSIALFSSELKGRLFVPN 429
           P   L  +E  GR  +P 
Sbjct: 370 P---LVFAETAGRARIPQ 384


>gi|255065422|ref|ZP_05317277.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca ATCC
           29256]
 gi|255050247|gb|EET45711.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca ATCC
           29256]
          Length = 326

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 92/228 (40%), Gaps = 37/228 (16%)

Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
           ++   +D VL ++  GL  +A  F   A+  +  +  N    G A L  ++T+   +  N
Sbjct: 82  LRQEVFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAALAYAKTYAVPKGKN 138

Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
              R   ++ Q+  ++      +P   V  L V  + RLK +          EQ  Y  +
Sbjct: 139 AVWRNRELFAQVFGYV------MPETQVFGLTVPEAGRLKNL----------EQPYYAAL 182

Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGDD 373
           H    DSK           L P++ W  +   L E +   V +P   E E    + + D 
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIADG 231

Query: 374 ASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
                +       Q A L+  + G++  +T  + LANA EKP + +++
Sbjct: 232 LPFAIVCDKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVVGIYT 279


>gi|390562754|ref|ZP_10244927.1| Glycosyl transferase family 9 [Nitrolancetus hollandicus Lb]
 gi|390172674|emb|CCF84240.1| Glycosyl transferase family 9 [Nitrolancetus hollandicus Lb]
          Length = 349

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 72/303 (23%), Positives = 126/303 (41%), Gaps = 58/303 (19%)

Query: 141 PAIQLLKDRYPGVLIDVIASARGKQTFEL--------NKNVRWANVYDLDDDWPEPAEYT 192
           PA++ +K   P   + ++AS  G Q  EL           V W ++  LD     PA   
Sbjct: 24  PALRAIKANLPDARLTLMASPAGAQAAELLPWIDDILAWRVLWQDLGHLDF---SPAREW 80

Query: 193 DILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTA 251
           D++  +  R +D  ++ T  +   H A      A   +          G G+L      A
Sbjct: 81  DLVAALHERRFDAAIIFTSFSQTPHPAGFICYLAG--IPLRLGESKETGEGVLTGMLAAA 138

Query: 252 -ESMNLSERGYNMYEQM---VDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
            + ++ +ER   + E     VD      R   R P P  + +++     ++AE+   AGA
Sbjct: 139 PDEIHQAERNLRLIESAGFRVD----DRRLAIRIPEPARQAAMA-----LLAERGLPAGA 189

Query: 308 EQGKYIVI----------HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
               Y+++          + ++  ++AS Q  G T        W  +  G+ + RP    
Sbjct: 190 ---PYLLLNPWASCPARTYHLDRFAEASRQLSGLTG-------WPVVVTGVAKDRP---- 235

Query: 358 PHEKEREG-VEDVVGDDA-SIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
                R G + DV+G  A  ++  T+  +LAAL+  +  V+  +T+A+ LA+A    S+ 
Sbjct: 236 -----RSGPLLDVLGSRAVDLIGATSVPELAALVAGARLVLTNDTSAMHLADATGTLSVV 290

Query: 416 LFS 418
           LFS
Sbjct: 291 LFS 293


>gi|118580516|ref|YP_901766.1| glycosyl transferase family protein [Pelobacter propionicus DSM
           2379]
 gi|118503226|gb|ABK99708.1| glycosyl transferase, family 9 [Pelobacter propionicus DSM 2379]
          Length = 361

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 10/80 (12%)

Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAE 190
           GG+ + +L  PAI  LK ++P   I V+A  R   TF L   V     YD+         
Sbjct: 31  GGIGDAVLLMPAIIALKQKFPQASITVLAERRNAATFALCPEVDRVLHYDMP-------- 82

Query: 191 YTDILGVMKNRYYDMVLSTK 210
            +++L  ++ R YD+V+ ++
Sbjct: 83  -SELLSAIRGR-YDLVIDSE 100


>gi|386816616|ref|ZP_10103834.1| lipopolysaccharide heptosyltransferase I [Thiothrix nivea DSM 5205]
 gi|386421192|gb|EIJ35027.1| lipopolysaccharide heptosyltransferase I [Thiothrix nivea DSM 5205]
          Length = 327

 Score = 38.9 bits (89), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 35/60 (58%)

Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
           P E+ER        D   ++  +  G+LAA++  ++GVI  +T  + +A A +KP++AL+
Sbjct: 224 PRERERAERLAQANDSVQVLPRSHLGELAAIMQGASGVIGMDTGLMHIAAALDKPALALY 283


>gi|59802247|ref|YP_208959.1| RfaC [Neisseria gonorrhoeae FA 1090]
 gi|194099807|ref|YP_002002942.1| RfaC [Neisseria gonorrhoeae NCCP11945]
 gi|240013072|ref|ZP_04719985.1| RfaC [Neisseria gonorrhoeae DGI18]
 gi|240017641|ref|ZP_04724181.1| RfaC [Neisseria gonorrhoeae FA6140]
 gi|240122262|ref|ZP_04735224.1| RfaC [Neisseria gonorrhoeae PID24-1]
 gi|254493450|ref|ZP_05106621.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           1291]
 gi|268595787|ref|ZP_06129954.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           35/02]
 gi|268597977|ref|ZP_06132144.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           FA19]
 gi|268600232|ref|ZP_06134399.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           MS11]
 gi|268602472|ref|ZP_06136639.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID18]
 gi|268681026|ref|ZP_06147888.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID332]
 gi|268685293|ref|ZP_06152155.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           SK-92-679]
 gi|291044959|ref|ZP_06570668.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           DGI2]
 gi|293398030|ref|ZP_06642236.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           F62]
 gi|385336733|ref|YP_005890680.1| RfaC [Neisseria gonorrhoeae TCDC-NG08107]
 gi|606855|gb|AAA93059.1| lipooligosaccharide heptosyltransferase-1 [Neisseria gonorrhoeae]
 gi|59719142|gb|AAW90547.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae FA
           1090]
 gi|193935097|gb|ACF30921.1| RfaC [Neisseria gonorrhoeae NCCP11945]
 gi|226512490|gb|EEH61835.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           1291]
 gi|268549176|gb|EEZ44594.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           35/02]
 gi|268551765|gb|EEZ46784.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           FA19]
 gi|268584363|gb|EEZ49039.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           MS11]
 gi|268586603|gb|EEZ51279.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID18]
 gi|268621310|gb|EEZ53710.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID332]
 gi|268625577|gb|EEZ57977.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           SK-92-679]
 gi|291011853|gb|EFE03849.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           DGI2]
 gi|291611976|gb|EFF41045.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           F62]
 gi|317165276|gb|ADV08817.1| RfaC [Neisseria gonorrhoeae TCDC-NG08107]
          Length = 322

 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
           A + KN    Q  Y  +H    DSK           L P++ W E+   L + +   V +
Sbjct: 168 ASRLKNLA--QPYYAALHATSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214

Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
           P   E E V  + + D      +       Q A L+  + G++  +T  + LANA EKP 
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274

Query: 414 IALFS 418
           + +++
Sbjct: 275 VGIYT 279


>gi|268604739|ref|ZP_06138906.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID1]
 gi|268588870|gb|EEZ53546.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           PID1]
          Length = 322

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
           A + KN    Q  Y  +H    DSK           L P++ W E+   L + +   V +
Sbjct: 168 ASRLKNLA--QPYYAALHATSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214

Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
           P   E E V  + + D      +       Q A L+  + G++  +T  + LANA EKP 
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274

Query: 414 IALFS 418
           + +++
Sbjct: 275 VGIYT 279


>gi|398795642|ref|ZP_10555471.1| ADP-heptose:LPS heptosyltransferase [Pantoea sp. YR343]
 gi|398205821|gb|EJM92600.1| ADP-heptose:LPS heptosyltransferase [Pantoea sp. YR343]
          Length = 368

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 73/321 (22%), Positives = 136/321 (42%), Gaps = 41/321 (12%)

Query: 152 GVLIDVIASARGKQTFELNKNVRWANVYDLDD----DWPEPAEYTDILGVMKNRYYDMVL 207
           G+ IDV+A    +  F  N  +R  N+Y  D+    D    A   D++  +KN  YD+++
Sbjct: 76  GINIDVLAIKANEMIFSGNPFIR--NLYLTDEMDPRDMYNHAIPDDLIITLKNNNYDLMI 133

Query: 208 STKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYN----- 262
            T +   G    L+M      +S   P +     G    +  +   +N++   Y      
Sbjct: 134 DTGIWDTG----LYMPKMIQAIS---PKI---SLGFNKQKWLSHYDVNIAFNHYQSHVKE 183

Query: 263 MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDS 322
           +Y  ++++LG P  S  R+ V   + SIS  +K +      +      K ++I       
Sbjct: 184 IYAAIMNYLGFPDASNQRYEV-YYKQSISDEVKAI------DFFTGDNKKVII------- 229

Query: 323 KASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVED-VVGDDASIVFITT 381
             +M +  D  +L P Q    I+  L +   + ++  + ++  V+D  +G  A +  + +
Sbjct: 230 --NMYASHDDRNLTPQQTKEIISGILLKNNNVDIVVLDYKKSFVKDFFIG--AKVYHVPS 285

Query: 382 PGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSS-ELKGRLFVPNAEEKKCTVISS 440
                A IN S  VI+ +TA + +A   +KP I ++       +L+ P  +       S+
Sbjct: 286 LYHAMAAINLSDLVISPDTAVVHVAALYDKPLIGIYRDIPDNNKLWSPGYKNGHQIFTST 345

Query: 441 RTGKLIDTPVEAVLNAMQIFN 461
                  + V+ VLNA+  FN
Sbjct: 346 FVVGEDASLVDNVLNAIDKFN 366


>gi|298369675|ref|ZP_06980992.1| lipopolysaccharide heptosyltransferase III [Neisseria sp. oral
           taxon 014 str. F0314]
 gi|298282232|gb|EFI23720.1| lipopolysaccharide heptosyltransferase III [Neisseria sp. oral
           taxon 014 str. F0314]
          Length = 368

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 68/313 (21%), Positives = 118/313 (37%), Gaps = 54/313 (17%)

Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEP------AE 190
           LL  P +  LK ++P   +D++  A        N+ +  +N++ +D +W +       A 
Sbjct: 33  LLSTPVVDALKHKFPDCEVDMLVYAATTDVILDNRQI--SNIFTIDREWKKQGIRAQLAH 90

Query: 191 YTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTA----------RDRVSYIYPNVNAAG 240
              +   +K R YD   +  ++    AAFL    A          RD   + Y       
Sbjct: 91  EKALFKRLKARNYDWAFN--ISDQWRAAFLAKLCAHYSAGIACCKRDNFLWRY------- 141

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
                   F  E ++ S   ++M E  ++ L  P   +P      +R+ IS   +E V E
Sbjct: 142 -----CHDFLNEELDTS---HHMVESNLNVL--PPLVLPEEYPAKVRMEISPANRESVLE 191

Query: 301 KYKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPH 359
           K    G     Y++ H G     K     +              + NG      L   P 
Sbjct: 192 KLSAQGWNGEPYVLAHPGARWFFKCWEDGKNAALLQ------LLLLNGKNVV--LTASPD 243

Query: 360 EKEREGVEDVVGD----DASIVFITTPG----QLAALINDSAGVIATNTAAIQLANAREK 411
             E+E V+ ++G     D   V++ +      +LAA I+ +      ++  + +A A +K
Sbjct: 244 AAEQEMVQSILGRLKIPDGVHVWVLSGCLNLRELAAAIDGAELFFGVDSVPMHIAAALDK 303

Query: 412 PSIALFSSELKGR 424
           P +ALF     GR
Sbjct: 304 PQVALFGPSWVGR 316


>gi|296313781|ref|ZP_06863722.1| lipopolysaccharide heptosyltransferase I [Neisseria polysaccharea
           ATCC 43768]
 gi|296839710|gb|EFH23648.1| lipopolysaccharide heptosyltransferase I [Neisseria polysaccharea
           ATCC 43768]
          Length = 322

 Score = 38.5 bits (88), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 93/228 (40%), Gaps = 37/228 (16%)

Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
           ++   +D VL ++  GL  +A  F   A+  +  +  N    G A ++  +T+   +  N
Sbjct: 82  LRQEVFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAAVVYEKTYAVPKGKN 138

Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
              R   ++ Q+  ++      +P   V  L V  + RLK +          EQ  Y V+
Sbjct: 139 AVWRNRELFAQVFGYV------MPETQVFGLTVPEAGRLKNL----------EQPYYAVL 182

Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGDD 373
           H    DSK           L P++ W  +   L + +   V +P   E E    + + D 
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNQKQQCNVYLPWGNEAEKARAEQIADG 231

Query: 374 ASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
                +       Q A L+  + G++  +T  + LANA EKP + +++
Sbjct: 232 LPFAIVCAKINLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVVGIYT 279


>gi|399077997|ref|ZP_10752654.1| ADP-heptose:LPS heptosyltransferase [Caulobacter sp. AP07]
 gi|398034351|gb|EJL27622.1| ADP-heptose:LPS heptosyltransferase [Caulobacter sp. AP07]
          Length = 323

 Score = 38.1 bits (87), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 78/343 (22%), Positives = 132/343 (38%), Gaps = 37/343 (10%)

Query: 121 DVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFE-LNKNVRWANVY 179
           ++++   I  G + + +L   A++ +++ +P   I ++ +      FE L K   + N  
Sbjct: 4   EIKKVLVIKLGALGDFVLALAAMKKIREAHPRAKITLLTT----PPFEALAKLSPYFNTV 59

Query: 180 DLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAA 239
           + D    +  + T +LG ++   YD V   +     +  F  +           P  +  
Sbjct: 60  ETDGRPGDFGDLTAMLGRLRKARYDRVYDLQTNSRTNWYFQALRP-------FPPQWSGI 112

Query: 240 GAGLLLSETFTAE-SMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
            AG  L +   A   M+  ER  +  +Q   W   P  + P    PP    I RR KE  
Sbjct: 113 AAGCALPQRGKARLHMHTLERQADQLKQAGIWPDAP--TEPGSAPPPDLSWILRRTKE-- 168

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI- 357
           A     A A +   +++ G          S    +   P++ +A++A  L + R L ++ 
Sbjct: 169 ARPVAGATASKPYVLLVPG---------GSAHRPEKRWPVESYAQLA-ALLKARGLDIVV 218

Query: 358 ---PHEKE-REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPS 413
              P E      ++  VG    +   T   QLA L   +A V+  +T    L  A   P+
Sbjct: 219 IGGPQESAMARQIQKAVGQARDLTGRTDFAQLAVLGAKAALVVGNDTGPTHLLAAAGAPT 278

Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNA 456
           IALFS      L  P        V   R+  L   PV  V +A
Sbjct: 279 IALFSDASDPELCGPRGH-----VAVIRSPDLKALPVSTVASA 316


>gi|397691483|ref|YP_006528737.1| glycosyl transferase, family 9 [Melioribacter roseus P3M]
 gi|395812975|gb|AFN75724.1| glycosyl transferase, family 9 [Melioribacter roseus P3M]
          Length = 342

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 48/85 (56%), Gaps = 5/85 (5%)

Query: 337 PIQVWAEIANGLREFR--PLFVIPHEKEREG---VEDVVGDDASIVFITTPGQLAALIND 391
           P +  AE+A  L +    PL ++    E E    +++ VGDDA +   TT  Q+AAL++ 
Sbjct: 194 PPEKLAELAKALNKKYDIPLLILWGPSEYEDAIRIKETVGDDAYLAPPTTIEQMAALMSR 253

Query: 392 SAGVIATNTAAIQLANAREKPSIAL 416
            A ++A ++  + ++ A + P+++L
Sbjct: 254 CAFIVANDSGPMHISTAIDTPALSL 278


>gi|268685507|ref|ZP_06152369.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           SK-93-1035]
 gi|268625791|gb|EEZ58191.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
           SK-93-1035]
          Length = 322

 Score = 38.1 bits (87), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
           A + KN    Q  Y  +H    DSK           L P++ W E+   L + +   V +
Sbjct: 168 ASRLKNLA--QPYYAALHASSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214

Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
           P   E E V  + + D      +       Q A L+  + G++  +T  + LANA EKP 
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274

Query: 414 IALFS 418
           + +++
Sbjct: 275 VGIYT 279


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,751,561,257
Number of Sequences: 23463169
Number of extensions: 350606597
Number of successful extensions: 810818
Number of sequences better than 100.0: 160
Number of HSP's better than 100.0 without gapping: 111
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 810446
Number of HSP's gapped (non-prelim): 192
length of query: 467
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 321
effective length of database: 8,933,572,693
effective search space: 2867676834453
effective search space used: 2867676834453
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)