BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781114|ref|YP_003065527.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] (263 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781114|ref|YP_003065527.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] gi|254040791|gb|ACT57587.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] Length = 263 Score = 546 bits (1408), Expect = e-154, Method: Compositional matrix adjust. Identities = 263/263 (100%), Positives = 263/263 (100%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR Sbjct: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF Sbjct: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM Sbjct: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW Sbjct: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 Query: 241 RKDLPPVSTTKFLPSSSSSLIKS 263 RKDLPPVSTTKFLPSSSSSLIKS Sbjct: 241 RKDLPPVSTTKFLPSSSSSLIKS 263 >gi|315122483|ref|YP_004062972.1| glycosyl transferase family protein [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495885|gb|ADR52484.1| glycosyl transferase family protein [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 253 Score = 384 bits (987), Expect = e-105, Method: Compositional matrix adjust. Identities = 184/252 (73%), Positives = 212/252 (84%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + I YVISLPFSHARREKFCHRA+ I+LQFSF DAIYGEN+ IC +IF +Q RQ FKR Sbjct: 2 ISISNYVISLPFSHARREKFCHRASLINLQFSFVDAIYGENDSICKKIFYNQNRQHHFKR 61 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LLSLPEIGCY+SHI+LWK+IA S A+GAIILEDDADFS F QL+ HL+ CDI NILIKF Sbjct: 62 LLSLPEIGCYMSHINLWKKIASSDALGAIILEDDADFSVGFPQLISHLNHCDIGNILIKF 121 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 DALRK+PK+ +LC +PGNF I QPRILSPRTTGYFIGK+AA HLL+VRK+I+RP+DMDM Sbjct: 122 DALRKRPKETDFLCKVPGNFGILQPRILSPRTTGYFIGKQAAAHLLDVRKDIFRPVDMDM 181 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 KHWWEH +PSLVTEPGAVYEA +T+DS+IE SRL +K TFSPLYFY+N YQ NLHY A Sbjct: 182 KHWWEHTVPSLVTEPGAVYEAANTDDSSIEASRLQKKRTFSPLYFYQNMRYQLNLHYQAR 241 Query: 241 RKDLPPVSTTKF 252 +K LP VS F Sbjct: 242 KKTLPYVSPEIF 253 >gi|241765554|ref|ZP_04763514.1| glycosyl transferase family 25 [Acidovorax delafieldii 2AN] gi|241364643|gb|EER59681.1| glycosyl transferase family 25 [Acidovorax delafieldii 2AN] Length = 244 Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 68/249 (27%), Positives = 115/249 (46%), Gaps = 9/249 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 MP+P+ I+L RR + R+ LQ DA++ P + ++S + Q+ Sbjct: 1 MPLPLCYINLARDEERRARIESELRRLCLQGDRIDAVWWAGVPTAQQSALYSEALNRRQY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L E GCY SH+ W+++ S A I+LEDD D F ++ +++ + ++ Sbjct: 61 YKPLVAGEKGCYASHLVAWQKLLDSTAPAMIVLEDDIRLDDRFLEVTDAIAQLEEPWDMV 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R+ K ++ G + RI S T GY + + A LL RK RPID+ Sbjct: 121 KLMG-REHEKVRAHRPLTAGTELVEYRRIPS-MTAGYVVSRSGAAKLLASRKPFGRPIDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL-HY 237 D++ WWE+++ L G V + +D+++ S + P +R + L Sbjct: 179 DLRFWWENDLRIL----GVVPPVLILDDTSLVSSIGQKPPRLGVGAKWRKFRMKLRLTAL 234 Query: 238 NAWRKDLPP 246 NAW + PP Sbjct: 235 NAWHQVRPP 243 >gi|91791576|ref|YP_561227.1| glycosyl transferase family protein [Shewanella denitrificans OS217] gi|91713578|gb|ABE53504.1| glycosyl transferase, family 25 [Shewanella denitrificans OS217] Length = 256 Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 66/254 (25%), Positives = 115/254 (45%), Gaps = 6/254 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M ++VI+L S R E + +++ ++F A+ G+N + +++ ++ Sbjct: 1 MSFKIFVINLDSSVDRIENMQAQCSKLGIEFERVSAVRGKNLSSAEKAAVYNRDVNLAKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L+ EIGCY+SH W++I ++ILEDDA + E + + L+ I Sbjct: 61 DKELNDGEIGCYMSHARCWEQILIQKLDYSLILEDDAILTPEITAYIAKLADSTSEWDYI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K RK PK L + Q L TTG F+ + A LL I RPID+ Sbjct: 121 KLSHGRK-PKNILNAIDLGDGLSLGQCIKLPSTTTGQFVSQTGAKKLLQHAYPIARPIDI 179 Query: 179 DMKHWWEHNIPSLVTEPGAVYEA-IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHY 237 D++ W+E ++ S V P + + + + + R V K F+ + ++ ++ NL Sbjct: 180 DIQFWYEKSLRSFVVRPFPILNGDFGSEINQVTDRRQVDKRQFARI--WQKAKFELNLLL 237 Query: 238 NAWRKDLPPVSTTK 251 + R P S K Sbjct: 238 HRGRLGALPQSLVK 251 >gi|315633563|ref|ZP_07888853.1| LPS biosynthesis protein [Aggregatibacter segnis ATCC 33393] gi|315477605|gb|EFU68347.1| LPS biosynthesis protein [Aggregatibacter segnis ATCC 33393] Length = 225 Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 70/219 (31%), Positives = 105/219 (47%), Gaps = 18/219 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKRL 61 +++ISL + RR +R +++ LQF FFDAIYG++ + F + K+ Sbjct: 8 IFIISL-VNSPRRSIIKNRLSKLGLQFEFFDAIYGKDLSKEQLSQIDFQFYPQNFSAKKP 66 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKF 120 L+L EIGC +SHI L++ I + AIILEDDA S F +++ L+K ++ Sbjct: 67 LTLGEIGCAMSHIMLYEHIVKNNIREAIILEDDAIVSLYFEEIVTAALNKVSSKKEILFL 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLNVRKN 171 D K K ++ LP + + + R SP RTTGY I E A LL+ Sbjct: 127 D--HGKAKVYPFMKRLPERYRLAKYR--SPSKKSKRSIIRTTGYLITYEGAKKLLSYAYP 182 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 I P D NI + EP V+ +D+ +IE Sbjct: 183 IRMPSDFLTGLLQLTNINAYGIEPACVFGDVDSEIDSIE 221 >gi|284105044|ref|ZP_06386173.1| Glycosyl transferase, family 25 [Candidatus Poribacteria sp. WGA-A3] gi|283830167|gb|EFC34427.1| Glycosyl transferase, family 25 [Candidatus Poribacteria sp. WGA-A3] Length = 248 Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 64/244 (26%), Positives = 114/244 (46%), Gaps = 13/244 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRL 61 P YVI+L + R + + + F DA+ G + + + + + + K Sbjct: 3 PSYVINLAENTVRMDNTARQLTAQRIPFERIDAVNGWALSESEISLAYDAEVNRRRGKHP 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L PEIGCY+SH+ W+R+A + G I EDD +++ +L LS+ + + ++K Sbjct: 63 LLRPEIGCYLSHVTAWRRVAEGESAGGFIFEDDFLATEDLPDVLSRLSEDERDWDMVKLF 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 + + P+ + P ++ P + T GY + +EA+ HL+ +RP+D D K Sbjct: 123 SFDQGPRTVTRRQLGP-RHEVVVPFRVPSCTVGYGLTREASRHLVRRATPFFRPVDEDQK 181 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDS-TIEESR--LVRKPTF----SPLYFYRNTCYQWN 234 +WE + + P V + TI E R LV++ + S +Y +R YQ Sbjct: 182 FFWETGLRVALVLPSPVLAGDQQAVTRTIREERRLLVKRGSMQAWHSLIYQFR---YQAL 238 Query: 235 LHYN 238 LH++ Sbjct: 239 LHWH 242 >gi|226192533|gb|ACO37547.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 69/232 (29%), Positives = 109/232 (46%), Gaps = 31/232 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNR 47 IP+Y+I+L EK R A + QF FF+AIYG++NP + R Sbjct: 4 IPIYIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQR 56 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L Sbjct: 57 -YNENKRLNAKGYPLTLSQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDF 114 Query: 108 LS--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 ++ K ++ D L+ K K L + GN I+Q TTGY++ +AA Sbjct: 115 INSDKNTFKFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGTTGYYLTPQAARKF 170 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 L K Y +D+ M ++E+ +P P + E +STI E + +K Sbjct: 171 LTQSKEWYLTVDVTMDRFFENKVPPYAIVPFCL-EDDGEIESTIYEKQKKQK 221 >gi|226192539|gb|ACO37551.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 78.6 bits (192), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 67/231 (29%), Positives = 109/231 (47%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TTGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + + + +STI E + R+ Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEI-ESTIYEKQKKRR 221 >gi|145636691|ref|ZP_01792358.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittHH] gi|229845801|ref|ZP_04465913.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 7P49H1] gi|145270217|gb|EDK10153.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittHH] gi|226192503|gb|ACO37527.1| Lex2B [Haemophilus influenzae] gi|226192506|gb|ACO37529.1| Lex2B [Haemophilus influenzae] gi|226192515|gb|ACO37535.1| Lex2B [Haemophilus influenzae] gi|226192536|gb|ACO37549.1| Lex2B [Haemophilus influenzae] gi|226192542|gb|ACO37553.1| Lex2B [Haemophilus influenzae] gi|229810805|gb|EEP46522.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 7P49H1] Length = 247 Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 66/227 (29%), Positives = 107/227 (47%), Gaps = 31/227 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TTGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 K Y +D+ M ++E+ +P P + + + +STI E + Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEI-ESTINEKQ 217 >gi|20977179|gb|AAM33309.1|AF503507_2 Lex2B [Haemophilus influenzae] Length = 248 Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 63/226 (27%), Positives = 107/226 (47%), Gaps = 22/226 (9%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS--------FFDAIYGENNP---ICNRIFSHQ 52 P+++I+L S R+ + A+ L FS FFDAIYG++NP + R ++ Sbjct: 5 PIFIINLEKSTDRK---AYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQR-YNEN 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS--K 110 KR L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ K Sbjct: 61 KRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFINSDK 119 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 ++ D L+ K K L + GN I+Q TGY++ +AA L K Sbjct: 120 NTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSK 175 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 176 EWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|311766115|emb|CBW29072.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 10810] Length = 247 Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 63/226 (27%), Positives = 107/226 (47%), Gaps = 22/226 (9%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS--------FFDAIYGENNP---ICNRIFSHQ 52 P+++I+L S R+ + A+ L FS FFDAIYG++NP + R ++ Sbjct: 5 PIFIINLEKSTDRK---AYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQR-YNEN 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS--K 110 KR L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ K Sbjct: 61 KRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFINSDK 119 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 ++ D L+ K K L + GN I+Q TGY++ +AA L K Sbjct: 120 NTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSK 175 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 176 EWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192491|gb|ACO37519.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 67/231 (29%), Positives = 108/231 (46%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L++ ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTMGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TTGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + E +STI E + R+ Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCL-EDDGEIESTIYEKQKKRR 221 >gi|507354|gb|AAA60375.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 18/224 (8%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS--------FFDAIYGENNP---ICNRIFSHQ 52 P+++I+L S R+ + A+ L FS FFDAIYG++NP + R ++ Sbjct: 5 PIFIINLEKSTDRK---AYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQR-YNEN 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 KR L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 61 KRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFINSD- 118 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 N F L + K L + GN I+Q TGY++ +AA L K Sbjct: 119 -KNTFEFFWLLPDRLKNKRKLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEW 177 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 178 YLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|148826601|ref|YP_001291354.1| 50S ribosomal protein L31 [Haemophilus influenzae PittEE] gi|148716761|gb|ABQ98971.1| 50S ribosomal protein L31 [Haemophilus influenzae PittEE] gi|226192500|gb|ACO37525.1| Lex2B [Haemophilus influenzae] gi|226192521|gb|ACO37539.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 64/230 (27%), Positives = 106/230 (46%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P V P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYVIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192545|gb|ACO37555.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 64/231 (27%), Positives = 106/231 (45%), Gaps = 30/231 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNR 47 IP+Y+I+L EK R A + QF FF+AIYG++NP + R Sbjct: 4 IPIYIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQR 56 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L Sbjct: 57 -YNENKRLNAKGYPLTLSQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDF 114 Query: 108 LS--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 ++ K ++ D L+ K K L + GN I+Q TGY++ +AA Sbjct: 115 INSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKF 170 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 171 LTQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192554|gb|ACO37561.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 64/231 (27%), Positives = 107/231 (46%), Gaps = 30/231 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNR 47 IP+Y+I+L EK R A + QF FF+AIYG++NP + +R Sbjct: 4 IPIYIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFHR 56 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L Sbjct: 57 -YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDF 114 Query: 108 LS--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 ++ K ++ D L+ K K L + GN I+Q TGY++ +AA Sbjct: 115 INSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKF 170 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 171 LTQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192530|gb|ACO37545.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 106/230 (46%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR + L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKRYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|114565077|ref|YP_752591.1| glycosyl transferase family protein [Shewanella frigidimarina NCIMB 400] gi|114336370|gb|ABI73752.1| glycosyl transferase, family 25 [Shewanella frigidimarina NCIMB 400] Length = 250 Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 92/201 (45%), Gaps = 11/201 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 M V++I+L S R + ++ +++ A+YG++ + +++ Q ++ Sbjct: 1 MKFKVFLINLDNSTERFTFMDEQLKQLGVEYQRISAVYGKDLHDTEIAKVYDPQTNLQKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L+L EIGCY+SH+ W+ I A+ILEDD+ ++ H INN+ Sbjct: 61 DKKLNLGEIGCYLSHVQCWQMIIEQQLDYALILEDDSILDPALMTVIQH-----INNLSA 115 Query: 119 KFDALR----KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 +D ++ +KPK L F + L T G + A LL I R Sbjct: 116 DWDYIKLCHGRKPKGIVKSIVLDDRFSLSTCLKLPASTRGQCVSLAGAQKLLATAYPIAR 175 Query: 175 PIDMDMKHWWEHNIPSLVTEP 195 P+D+D++ W+E + V P Sbjct: 176 PVDIDIQFWFEKQLRCFVVRP 196 >gi|319775306|ref|YP_004137794.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3047] gi|317449897|emb|CBY86109.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3047] Length = 247 Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 66/231 (28%), Positives = 108/231 (46%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + + + +STI E + R+ Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEI-ESTIYEKQKKRR 221 >gi|148827944|ref|YP_001292697.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittGG] gi|148719186|gb|ABR00314.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittGG] gi|226192488|gb|ACO37517.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 65/227 (28%), Positives = 106/227 (46%), Gaps = 31/227 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 K Y +D+ M ++E+ +P P + + + +STI E + Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEI-ESTINEKQ 217 >gi|145628361|ref|ZP_01784162.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 22.1-21] gi|144980136|gb|EDJ89795.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 22.1-21] Length = 232 Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|68249350|ref|YP_248462.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 86-028NP] gi|145632107|ref|ZP_01787842.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 3655] gi|145638450|ref|ZP_01794060.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittII] gi|329122700|ref|ZP_08251278.1| UDP-glucose-lipooligosaccharide glucosyltransferase [Haemophilus aegyptius ATCC 11116] gi|68057549|gb|AAX87802.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 86-028NP] gi|144987014|gb|EDJ93544.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 3655] gi|145272779|gb|EDK12686.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittII] gi|226192485|gb|ACO37515.1| Lex2B [Haemophilus influenzae] gi|309751585|gb|ADO81569.1| beta-(1-4)-glucosyltransferase Lex2B [Haemophilus influenzae R2866] gi|327472574|gb|EGF18004.1| UDP-glucose-lipooligosaccharide glucosyltransferase [Haemophilus aegyptius ATCC 11116] Length = 247 Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|145630497|ref|ZP_01786277.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae R3021] gi|144983887|gb|EDJ91329.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae R3021] Length = 247 Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNTKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|145634835|ref|ZP_01790543.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittAA] gi|260581568|ref|ZP_05849365.1| glycosyltransferase [Haemophilus influenzae NT127] gi|145268001|gb|EDK07997.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittAA] gi|226192494|gb|ACO37521.1| Lex2B [Haemophilus influenzae] gi|226192497|gb|ACO37523.1| Lex2B [Haemophilus influenzae] gi|226192509|gb|ACO37531.1| Lex2B [Haemophilus influenzae] gi|226192512|gb|ACO37533.1| Lex2B [Haemophilus influenzae] gi|226192524|gb|ACO37541.1| Lex2B [Haemophilus influenzae] gi|260095161|gb|EEW79052.1| glycosyltransferase [Haemophilus influenzae NT127] gi|309973750|gb|ADO96951.1| beta-(1-4)-glucosyltransferase Lex2B [Haemophilus influenzae R2846] Length = 247 Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192551|gb|ACO37559.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 65/233 (27%), Positives = 108/233 (46%), Gaps = 36/233 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNPICNRIFSH 51 P+++I+L EK R A + QF FFDAIYG++NP + +F Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNP-NHSLF-- 54 Query: 52 QKRQCQFKRL------LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 +R + KRL L+L ++GCY SH +W++ I+LEDDA F + F ++L Sbjct: 55 -QRYNENKRLNAKGYPLTLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVL 112 Query: 106 PHLS--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAI 163 ++ K ++ D L+ K K L + GN I+Q TGY++ +AA Sbjct: 113 DFINSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAAR 168 Query: 164 HLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 169 KFLTQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|229844622|ref|ZP_04464761.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 6P18H1] gi|229812336|gb|EEP48026.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 6P18H1] Length = 247 Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 66/231 (28%), Positives = 108/231 (46%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P++VI+L EK R A + QF FF+AIYG++NP + R Sbjct: 5 PIFVINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLSQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + + + +STI E + +K Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEI-ESTINEKQKKQK 221 >gi|226192518|gb|ACO37537.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 30/230 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FFDAIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLNLGQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192548|gb|ACO37557.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 67/231 (29%), Positives = 107/231 (46%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P++VI+L EK R A + QF FF+AIYG++NP + R Sbjct: 5 PIFVINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLSQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + E +STI E + +K Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCL-EDDGEIESTIYEKQKKQK 221 >gi|319897747|ref|YP_004135944.1| subname: full=udp-glucose--lipooligosaccharide glucosyltransferase; subname: full=udp-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3031] gi|317433253|emb|CBY81628.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3031] Length = 247 Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 66/231 (28%), Positives = 107/231 (46%), Gaps = 31/231 (13%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS------------FFDAIYGENNP---ICNRI 48 P+++I+L EK R A + QF FF+AIYG++NP + R Sbjct: 5 PIFIINL-------EKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQR- 56 Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 ++ KR L+L ++GCY SH +W++ I+LEDDA F + F ++L + Sbjct: 57 YNENKRLNAKGYPLTLSQLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFI 115 Query: 109 S--KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 116 NSDKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFL 171 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 K Y +D+ M ++E+ +P P + E +STI E + +K Sbjct: 172 TQSKEWYLTVDVTMDRFFENKVPPYAIVPFCL-EDDGEIESTIYEKQKKQK 221 >gi|157373252|ref|YP_001471852.1| glycosyl transferase family protein [Shewanella sediminis HAW-EB3] gi|157315626|gb|ABV34724.1| glycosyl transferase, family 25 [Shewanella sediminis HAW-EB3] Length = 252 Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 60/249 (24%), Positives = 109/249 (43%), Gaps = 14/249 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M VY+I+L S R E+ + ++ +++ A+ G R ++ ++ Sbjct: 1 MKFKVYIINLDSSVDRWEQLSSQCDQMGIEYERVSAVRGSELTEQERAKVYDLPTNLEKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L+ EIGCY+SHI W++I A+ILEDD + + + D ++ Sbjct: 61 DKRLNDGEIGCYLSHIKCWEKIIEEELDFALILEDDVSLRENIKECIAQFFSMDSSD--- 117 Query: 119 KFDAL----RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 +D + RK+ K L +H+ L T F+ + A LL I R Sbjct: 118 -WDYIKLFHRKRLKMKVASLNLDEGVQLHKVAKLPASTCSQFVSQVGAKKLLKHSFPIAR 176 Query: 175 PIDMDMKHWWEHNIPSLVTE-PGAVYEAIDTNDSTIEESR-LVRKPTFSPLYFYRNTCYQ 232 P+D+D+++W+E + V P AV D+ I + R V++ F ++ Y+ Sbjct: 177 PVDVDIQYWFERELRCFVVSPPVAVPAGFDSEIDIIGQHRDYVQRRAFKRIWL--RVIYE 234 Query: 233 WNLHYNAWR 241 N+ N ++ Sbjct: 235 INVLLNTFK 243 >gi|309751471|gb|ADO81455.1| Lipooligosaccharide galactosyltransferase LosA2 [Haemophilus influenzae R2866] Length = 222 Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 62/221 (28%), Positives = 99/221 (44%), Gaps = 18/221 (8%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ +Q ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQQFAVRK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ EIGC +SH++++ + AIILEDDA S EF H+ +N + Sbjct: 65 PLTVGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFE----HIVLDSLNKVPNTM 120 Query: 121 DAL-RKKPKKDSYLCT---LPGNFDIH------QPRILSPRTTGYFIGKEAAIHLLNVRK 170 D L + K +Y C + G +H + + RTT Y I +E A LL + Sbjct: 121 DILFYEHGKAKTYFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAY 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D + + EP V+ ID+ +E+ Sbjct: 181 PILMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQ 221 >gi|329756963|gb|AEC04738.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 68/238 (28%), Positives = 99/238 (41%), Gaps = 54/238 (22%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC---QF-- 58 P++VISL S RR+ R + L F F D I G + S ++++C QF Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGING-------KTLSEEQKKCVDYQFYP 57 Query: 59 -----KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 R L++ EIGC +SH+ ++ + AIILEDDA S EF ++ Sbjct: 58 KTFAANRPLTIGEIGCAMSHLFIYNMMQDEDIAKAIILEDDAIVSQEFEHII-------- 109 Query: 114 NNILIKFDALRKKPKK-----------DSYLC--TLPGNFDI--------HQPRILSPRT 152 D+L+K PK SY C L + + H R ++ R Sbjct: 110 ------LDSLKKVPKSVDILFYEHGKAKSYFCKKKLIEGYRLVRYRSPSKHSKRTIT-RA 162 Query: 153 TGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 T YFI KEAA LL + I P D + + EP V+ +D+ +E Sbjct: 163 TAYFITKEAASKLLKLAYPIRMPADYLTGALQLTGLNAYGVEPPCVFRGVDSEIDAME 220 >gi|145638325|ref|ZP_01793935.1| LosA [Haemophilus influenzae PittII] gi|145272654|gb|EDK12561.1| LosA [Haemophilus influenzae PittII] Length = 222 Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 62/221 (28%), Positives = 99/221 (44%), Gaps = 18/221 (8%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ +Q ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQQFAVRK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ EIGC +SH++++ + AIILEDDA S EF H+ +N + Sbjct: 65 PLTVGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFE----HIVLDSLNKVPNTM 120 Query: 121 DAL-RKKPKKDSYLCT---LPGNFDIH------QPRILSPRTTGYFIGKEAAIHLLNVRK 170 D L + K +Y C + G +H + + RTT Y I +E A LL + Sbjct: 121 DILFYEHGKAKTYFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAY 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D + + EP V+ ID+ +E+ Sbjct: 181 PIRMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQ 221 >gi|119470917|ref|ZP_01613501.1| glycosyl transferase, family 25 [Alteromonadales bacterium TW-7] gi|119445939|gb|EAW27219.1| glycosyl transferase, family 25 [Alteromonadales bacterium TW-7] Length = 265 Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 57/206 (27%), Positives = 91/206 (44%), Gaps = 10/206 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKRL 61 ++VI+L S R + AR++L F DA+ G + +S Q + Q+ Sbjct: 3 ALFVINLARSTQRLANTQTQLARVNLPFERIDAVDGAQLSTLQKYLHYSPQLNKQQYHYP 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS +IGCY+SH W++I AI+LEDD D + ++ + + LIK Sbjct: 63 LSNGQIGCYLSHRKAWQKIVDEKLKYAIVLEDDFYIDDSIHNAIKNIEQLNQPWQLIKLA 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTG---YFIGKEAAIHLLNVRKNIYRPID 177 A + + +Y N + HQ ++ + TG I E A LL RP+D Sbjct: 123 AYENRTRPIAY----QQNLNNHQRLVIHKKLMTGCCATAISYEGAKQLLKATAQFGRPVD 178 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAID 203 D++H WE ++ P V + D Sbjct: 179 CDLQHIWETHVNGFSLMPYPVSQNTD 204 >gi|329756950|gb|AEC04727.1| glucosyltransferase [Pasteurella multocida] gi|329756956|gb|AEC04732.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 63/228 (27%), Positives = 97/228 (42%), Gaps = 32/228 (14%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC------- 56 P++VISL S RR+ R + L F F D I G + S ++++C Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGING-------KTLSEEQKKCVDYHFYP 57 Query: 57 ---QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 R L++ EIGC +SH++++ + AIILEDDA S EF H+ + Sbjct: 58 ETFAANRPLTIGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFE----HIVLDSL 113 Query: 114 NNILIKFDAL-RKKPKKDSYLCT---LPGNFDIH------QPRILSPRTTGYFIGKEAAI 163 N + D L + K +Y C + G +H + + RTT Y I +E A Sbjct: 114 NKVPNTMDILFYEHGKAKTYFCKKNLIEGYRLVHYRTPSKRSKRTITRTTAYLITQEGAN 173 Query: 164 HLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 LL + I P D + + EP V+ ID+ +E+ Sbjct: 174 KLLKLAYPIRMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQ 221 >gi|224999306|gb|ACN78416.1| galactosyltransferase A [Pasteurella multocida] gi|325559791|gb|ADZ31082.1| putative bi-functional 1,4 and 1,6 beta-D-galactosyltransferase [Pasteurella multocida] gi|325559798|gb|ADZ31088.1| putative bi-functional 1,4 and 1,6 beta-D-galactosyltransferase [Pasteurella multocida] gi|329756856|gb|AEC04647.1| galactosyltransferase [Pasteurella multocida] gi|329756863|gb|AEC04653.1| galactosyltransferase [Pasteurella multocida] gi|329756870|gb|AEC04659.1| galactosyltransferase [Pasteurella multocida] gi|329756877|gb|AEC04665.1| galactosyltransferase [Pasteurella multocida] Length = 225 Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 67/223 (30%), Positives = 101/223 (45%), Gaps = 17/223 (7%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 M +P + VISL S RR+ HR + + L F FFDA+YG++ + + + C Sbjct: 1 MKLPKIIVISLKNS-PRRQIISHRLSGLGLDFEFFDAVYGKDLTKEELEKIDYEFFPKYC 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINN 115 K L+L EIGC +SHI +++ I + IILEDDA S F ++ L L K Sbjct: 60 GSKGALTLGEIGCAMSHIKIYEHIVANNLEQVIILEDDAIVSLYFEEIVLAALQKLPNRR 119 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDI--------HQPRILSPRTTGYFIGKEAAIHLLN 167 ++ D K K ++ LP + + H R + RTT Y I E A LL Sbjct: 120 EILFLD--HGKAKVYPFMRNLPERYRLARYRKPSKHSKRFIV-RTTAYLITLEGAKKLLK 176 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 I P D +I + EP V+ +++ + +E Sbjct: 177 HAYPIRMPSDFLTGLLQLTHINAYGIEPSCVFGGVESEINEME 219 >gi|52424501|ref|YP_087638.1| hypothetical protein MS0446 [Mannheimia succiniciproducens MBEL55E] gi|52306553|gb|AAU37053.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 240 Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 67/222 (30%), Positives = 99/222 (44%), Gaps = 22/222 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL--- 61 +++ISL S RR+ R + ++F FFDAIYG++ + S R+ KR Sbjct: 23 IFIISLKNS-PRRDVIAQRFNALGIKFEFFDAIYGKD--LSQEELSKIDREFAVKRFSTK 79 Query: 62 --LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILI 118 L+L EIGC +SHI +++ I + AII EDDA EF +++ LSK +I Sbjct: 80 KPLTLGEIGCALSHIAVYEHILKNNIEQAIIFEDDAIIHHEFKKIVEETLSKVPSRREII 139 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLNVR 169 F+ + K S+ C + R SP RTT Y I A LLN Sbjct: 140 FFEHGKAK----SWFCKRSIHEGYKLVRYRSPSKNSKRCIFRTTSYLITLSGAKKLLNHA 195 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + P D I + EP V+ +D+ + IE+ Sbjct: 196 YPVRMPSDYLTGGLQITQINAYGIEPPCVFCGVDSEINAIED 237 >gi|329756911|gb|AEC04694.1| glucosyltransferase [Pasteurella multocida] gi|329756919|gb|AEC04701.1| glucosyltransferase [Pasteurella multocida] gi|329756926|gb|AEC04707.1| glucosyltransferase [Pasteurella multocida] gi|329756944|gb|AEC04722.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 65/228 (28%), Positives = 97/228 (42%), Gaps = 32/228 (14%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC------- 56 P++VISL S RR+ R + L F F D I G + S ++++C Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGING-------KTLSEEQKKCVDYHFYP 57 Query: 57 ---QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 R L++ EIGC +SH++++ + AIILEDDA S EF H+ + Sbjct: 58 ETFAANRPLTIGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFE----HIVLDSL 113 Query: 114 NNILIKFDAL-RKKPKKDSYLCT---LPGNFDIH--QPRILSPRT----TGYFIGKEAAI 163 N + D L + K +Y C + G +H P S RT T Y I +E A Sbjct: 114 NKVPNTMDILFYEHGKAKTYFCKKNLIEGYRLVHYRTPSKRSKRTITRATAYLITQEGAN 173 Query: 164 HLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 LL + I P D + + EP V+ ID+ +E+ Sbjct: 174 KLLKLAYPIRMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQ 221 >gi|110633728|ref|YP_673936.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110284712|gb|ABG62771.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 241 Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 59/197 (29%), Positives = 85/197 (43%), Gaps = 8/197 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFK 59 M IPV VI+L S R E RAA I + A+ G P R +K Q Sbjct: 1 MDIPVLVINLDRSKERWEGLAARAAAIDIFPERVQAVDGNAIEPSERRELQLRKFQLWHG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R E GCY+SH+ R+ + A+ILEDDA+F +F+ L L++ D+ L+K Sbjct: 61 RRPMGGEYGCYMSHMRALDRVIAAGWPYAVILEDDAEFLPDFAPRLRALTRLDLKPDLVK 120 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPR-ILSP--RTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K +L H R I P + GY + E A L ++ P Sbjct: 121 LYNHRTK----GFLTKAKTEAGDHVGRCIHGPLGSSMGYLVSGEGAAKLRAGALPMFLPY 176 Query: 177 DMDMKHWWEHNIPSLVT 193 D+ ++ W H + +T Sbjct: 177 DIALERGWAHGLRVYIT 193 >gi|153007760|ref|YP_001368975.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC 49188] gi|151559648|gb|ABS13146.1| glycosyl transferase family 25 [Ochrobactrum anthropi ATCC 49188] Length = 255 Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 56/239 (23%), Positives = 108/239 (45%), Gaps = 24/239 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I L + R+ + R+ ++ +A+ G + +R++ + ++ Sbjct: 18 VKAFIIHLERATDRQPQVEELVRRLPVETDVVNAVDGRTLDAQTISRVYRRSAHKPRYPF 77 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS EI C++SH W+ I ++LEDD + + EF+ + + C I +K Sbjct: 78 QLSTNEIACFLSHRKAWQAIVDQELDAGLVLEDDVELTPEFAA--AYSAACQI----LKA 131 Query: 121 DALRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGY-----FIGKEAAIHLLNVRKNIY 173 D+ + P +++S L + RI+ P G +G+EAA LL+ + Sbjct: 132 DSFIRFPFRERESGRVVL----NTETLRIIEPVPVGLGMVAQLVGREAAQRLLSATEVFD 187 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYR 227 RP+D + W + L PG V E ++ STI++SR +++ PLY ++ Sbjct: 188 RPVDTTAQMNWVTGLKPLSVLPGGVKEISVQLGGSTIQKSRSLPDKLKREILRPLYRWK 246 >gi|47500150|gb|AAT28923.1| unknown [Escherichia coli] Length = 260 Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 74/237 (31%), Positives = 113/237 (47%), Gaps = 38/237 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARI-HLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFK 59 + + V+SLP ++ARR KF R + + L+F FFD +YG+N P I I+ +K + + Sbjct: 4 LEIRVLSLPDAYARRTKFQERFSLVSKLKFQFFDGVYGKNIPDEILKSIYDDKKAKLKIN 63 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIG-AIILEDDA----DFSDEFSQLLPHLSKCDIN 114 R +++ EIG SH ++K AY + I+LEDD+ +F D ++LL ++ D Sbjct: 64 RSMTVGEIGATYSHYLIYKD-AYEKKLDYLIVLEDDSFVDENFDDVINRLLVKITPDDDA 122 Query: 115 NILIKFDALRKKP----KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 I I+ L K KKD L F++ ++L ++ YF+G I +K Sbjct: 123 IIFIQKHTLDSKVIFSRKKD----ILKNGFEL--VKMLG--SSQYFVGSYGYIL---TKK 171 Query: 171 NI------YRPIDMDMKHWW----EHNIPSL-VTEPGAVY---EAIDTNDSTIEESR 213 +I Y PI HW+ + I S P VY E I DS I E R Sbjct: 172 SINKIIQNYLPIYCVCDHWFFIKKDSKIESFYCVSPSLVYTNDEDIRLVDSFINEER 228 >gi|293390962|ref|ZP_06635296.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951496|gb|EFE01615.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D7S-1] Length = 230 Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 65/220 (29%), Positives = 102/220 (46%), Gaps = 15/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRL 61 P++VISL S RRE R + LQF FFDA+YG+ + ++ +Q Q + Sbjct: 9 PIFVISLKNS-PRREFIAKRLNGLGLQFEFFDAVYGKALSEEELAKV-DYQYYQDFDNKR 66 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKF 120 L+L EIGC +SHI +++ I + AIILEDDA S F +L + K ++ F Sbjct: 67 LTLGEIGCALSHIQVYEHIKKNNIAEAIILEDDAIVSTHFKAILQAAIEKLPSRYEILFF 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNIY 173 D + K C LP + + + R S + T Y + + LL + Sbjct: 127 DHGKAKSYPLIKKC-LPEGYKLVRYRYPSKNSRRSIMKATAYMVNQAGVEKLLKYAYPLR 185 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYE--AIDTNDSTIEE 211 P D + I + EP V+E A+++ ++IE+ Sbjct: 186 MPADFVTGFIQKTRIHAYGVEPSCVFEGLAVESEINSIED 225 >gi|329756884|gb|AEC04671.1| glucosyltransferase [Pasteurella multocida] Length = 227 Score = 68.6 bits (166), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 59/207 (28%), Positives = 93/207 (44%), Gaps = 13/207 (6%) Query: 15 ARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKRLLSLPEIGCYI 71 RR R + + L++ FFDA+YG++ + F + ++ L+L EIGC + Sbjct: 19 TRRPVISQRLSGLGLEYQFFDAVYGKDLTEEQLKEXDFEFYPKTYNARKALTLGEIGCAM 78 Query: 72 SHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALRKKPKKD 130 SHI++++ I + AIILEDDA S F ++L L K +I FD K K Sbjct: 79 SHINVYEYIVKNNIDQAIILEDDAVVSLYFEEILASVLKKVPQRREIIFFD--HGKAKVY 136 Query: 131 SYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++ +LP + + + R S RTT Y I ++ A LL + P D Sbjct: 137 PFMRSLPERYRLARYRSPSKDSKRTIIRTTAYLITQQGAKKLLKHAYPVRMPSDFLTGSL 196 Query: 184 WEHNIPSLVTEPGAVYEAIDTNDSTIE 210 I + EP V+ + + IE Sbjct: 197 QLTGIHAYGVEPACVFGDVISEIDQIE 223 >gi|307245888|ref|ZP_07527973.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307254861|ref|ZP_07536687.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307259303|ref|ZP_07541032.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|4809236|gb|AAD30156.1|AF143905_1 putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853248|gb|EFM85468.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306862232|gb|EFM94200.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306866670|gb|EFM98529.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 224 Score = 68.6 bits (166), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 65/207 (31%), Positives = 94/207 (45%), Gaps = 14/207 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRI-FSHQKRQCQFKR 60 P+ +ISL S +RR R + L F FFDA YG++ + +I F ++ ++ Sbjct: 6 PILIISLKNS-SRRAIISQRLTSLGLSFEFFDAFYGKDLTDEELQKIDFEFYPKKFDARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIK 119 L+L EIGC +SHI +++ + + AIILEDDA S F ++ L K +I Sbjct: 65 PLTLGEIGCALSHIKVYEYMVENNIEQAIILEDDAIVSLYFESIINEALKKVPSRKEIIF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNI 172 D K K ++ LP + + + RI S RTTGYFI A LL I Sbjct: 125 LD--HGKAKVWPFMRNLPERYRLARYRIPSKNSKRSIIRTTGYFITLTGAKKLLKYAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVY 199 P D I + EP V+ Sbjct: 183 RMPSDFLTGLLQLTGINAYGIEPPCVF 209 >gi|219871212|ref|YP_002475587.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis SH0165] gi|219691416|gb|ACL32639.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis SH0165] Length = 224 Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 65/227 (28%), Positives = 101/227 (44%), Gaps = 32/227 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 ++VISL + RRE R + + L F FFDA+YG++ + Q ++ FK Sbjct: 7 IFVISL-INSPRREFISKRLSDLKLNFEFFDAVYGKD------LSEEQLKEIDFKFYPEK 59 Query: 60 ----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + ++L E+GC +SHI L++ I + AIILEDDA S F +++ +N Sbjct: 60 YAARKAMTLGEVGCAMSHIKLYEHIVENNIDNAIILEDDAIVSLYFKEIILDA----LNK 115 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNF--DIHQPRILSP---------RTTGYFIGKEAAIH 164 I K++ L K + + P N R +SP RTTGY I + A Sbjct: 116 ISPKYEILFLDHGK-AKIYPFPKNLVERYRLARYISPSKNSKRTIIRTTGYIITQAGAKR 174 Query: 165 LLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 LL I P D +I + EP V+ + + +E+ Sbjct: 175 LLEYAYPIRMPSDFLTGSLQMTHIRAYGIEPSCVFGGSHSEINEMED 221 >gi|261867139|ref|YP_003255061.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412471|gb|ACX81842.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D11S-1] Length = 230 Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 65/220 (29%), Positives = 101/220 (45%), Gaps = 15/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRL 61 P++VISL SH RRE R + LQF FF A+YG+ + ++ +Q Q + Sbjct: 9 PIFVISLKNSH-RREFIAKRLNGLGLQFEFFYAVYGKALSEEELTKV-DYQYYQDFDNKR 66 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKF 120 L+L EIGC +SHI +++ I + AIILEDDA S F +L + K ++ F Sbjct: 67 LTLGEIGCALSHIQVYEHIKKNNIAEAIILEDDAIVSTHFKAILQAAIEKLPSRYEILFF 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNIY 173 D + K C LP + + + R S + T Y + LL + Sbjct: 127 DHGKAKSYPLIKKC-LPEGYKLVRYRYPSKNSRRSIMKATAYMVNLAGVEKLLKYAYPLR 185 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYE--AIDTNDSTIEE 211 P D + I + EP V+E A+++ ++IE+ Sbjct: 186 MPADFVTGFIQKTRIHAYGVEPSCVFEGLAVESEINSIED 225 >gi|148976943|ref|ZP_01813598.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrionales bacterium SWAT-3] gi|145963817|gb|EDK29077.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrionales bacterium SWAT-3] Length = 250 Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 61/207 (29%), Positives = 93/207 (44%), Gaps = 24/207 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN---PICNRIFSHQKRQCQFKRL 61 ++VI+L S RRE H+ + L F F AI G ++ P+ R ++ Q + Sbjct: 4 IFVINLESSTERRENISHQLDELALPFELFSAIDGRSSPHHPLIGR-YNDGLSQTYRAKT 62 Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDA-DFSDEFSQLLPHLSKCDINNILIK 119 LS ++GCY SH LW++ + + I I++EDDA F EF L DI I Sbjct: 63 LSAGQLGCYASHYLLWQKCVELNQPI--IVIEDDALIFKKEFLNFLQ-----DIPEIPES 115 Query: 120 FDALR----KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + +R K+ K DSY + IH+ T YF+ A LL K Y Sbjct: 116 IECVRLFSNKRRKYDSYEVFEATSTSIHKFTKGHMSATAYFLRPSGAKKLLEHSKEWYMA 175 Query: 176 IDMDMKHWWEHNI-------PSLVTEP 195 +D+ M +W++ + P L +P Sbjct: 176 VDIYMDRFWQNEVECYGTAAPCLTNDP 202 >gi|254476209|ref|ZP_05089595.1| glycosyl transferase, family 25 [Ruegeria sp. R11] gi|214030452|gb|EEB71287.1| glycosyl transferase, family 25 [Ruegeria sp. R11] Length = 238 Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 62/219 (28%), Positives = 97/219 (44%), Gaps = 22/219 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS-- 63 YVI L S AR + + DA+ G ++ + ++ Q LL Sbjct: 5 YVIHLAASTARTD-LVSALVQTLPGGQVLDAVDG------RKMSAQERAQVSVPHLLDPR 57 Query: 64 -----LP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNI 116 +P EIGC++SH WK IA A + EDD +D F+ L L D + Sbjct: 58 YPFPLMPSEIGCFLSHRKAWKAIAEGEADFGFVAEDDVVTTDGFADALQIALDHADRQS- 116 Query: 117 LIKFD-ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 LI+F A R+KPK + G+ + +P+ + Y +GK+AA L + + RP Sbjct: 117 LIRFPMAPREKPKS---VVATKGDVTLFRPQEIGLTAALYLLGKDAAHSLWSQSERFDRP 173 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDT-NDSTIEESR 213 +D ++ WE + SL P + A + STI++ R Sbjct: 174 VDTWLQMRWETGVDSLTLWPSCIKSAAASHGGSTIQKKR 212 >gi|170724445|ref|YP_001758471.1| glycosyl transferase family protein [Shewanella woodyi ATCC 51908] gi|169809792|gb|ACA84376.1| glycosyl transferase family 25 [Shewanella woodyi ATCC 51908] Length = 229 Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 44/140 (31%), Positives = 68/140 (48%), Gaps = 3/140 (2%) Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ ++L+ EIGCY+SHI W++I A++LEDDA +D + + L+ + Sbjct: 40 KYDKILNDGEIGCYMSHIRCWEQIVEQELDFALVLEDDAILTDNIIKYVEKLAPSSADWD 99 Query: 117 LIKFD-ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 IK + K DS L + + L TTG I A LL I RP Sbjct: 100 YIKLSHGSKVKSAVDSM--DLGDGLTLRKELKLPSTTTGQLISLSGAKKLLAHAYPITRP 157 Query: 176 IDMDMKHWWEHNIPSLVTEP 195 IDMD+++W+E ++ V P Sbjct: 158 IDMDIQYWFEKSLRCFVVAP 177 >gi|258625904|ref|ZP_05720779.1| Putative glycosyltransferase [Vibrio mimicus VM603] gi|258581868|gb|EEW06742.1| Putative glycosyltransferase [Vibrio mimicus VM603] Length = 277 Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 70/249 (28%), Positives = 108/249 (43%), Gaps = 16/249 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF---- 58 + +YVISL S RR + A L F FFDAI G NP + IFS+ + Sbjct: 36 MKIYVISLKNSLDRRASIEEQMANYGLDFEFFDAIDGRINPP-HPIFSNYDYAKRLWFTS 94 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 ++ S E+GCY SH +W++ A IILEDDA FS L + + Sbjct: 95 GKMPSKGELGCYASHYLMWQKCA-ELGESIIILEDDAKVLPCFSDYLNAIRVKTQEYGFL 153 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + + ++ + +L NF+I+ Y + E+A L+ + P+D Sbjct: 154 RLEQAYERSR--LFLKEKTDNFEIYFLTNNFGGARAYALSPESAKKLIENSQCWSMPVDN 211 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQW 233 + + HN+PS + P V E D ++T + + L RKPT YR + Sbjct: 212 YIGSLYLHNMPSFIFHPSIV-ENPDYFETTFQNEKQPSAPLYRKPTRELYSLYRKV--RM 268 Query: 234 NLHYNAWRK 242 L N ++K Sbjct: 269 ALANNVYKK 277 >gi|206563391|ref|YP_002234154.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] gi|198039431|emb|CAR55397.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] Length = 255 Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 59/219 (26%), Positives = 95/219 (43%), Gaps = 15/219 (6%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDA--IYGENNPICNRIFSHQKRQCQFKRLLSLP 65 ISL +H RR A+ ++ FFDA + G + I + R ++ LLS Sbjct: 10 ISLARAHDRRAGMVEEFAKQNINARFFDAFELKGCGHGIPG--YDTGARLRRYGWLLSPG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E+GCY+SH +W+++ S + I+EDD D F L + +++ L + Sbjct: 68 ELGCYLSHRAVWRQLVESGSNACCIMEDDVALLDGFKSATEELYATREHWDMVRLMWLNR 127 Query: 126 KPKKD-SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 + + + ++L + + P L Y I +EAA +L+ I ID+ W Sbjct: 128 REQTEYAHLSSGTKLMWMESPVGLQC----YMITREAAEAMLDYTSRILHAIDIAFDRNW 183 Query: 185 EHNIPSLVTEP------GAVYEAIDTNDSTIEESRLVRK 217 EH VTEP GA +D +D+ RL K Sbjct: 184 EHEQRMFVTEPQFVQDTGAPTTILDRSDARTLVQRLKAK 222 >gi|167855172|ref|ZP_02477943.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis 29755] gi|167853717|gb|EDS24960.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis 29755] Length = 224 Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 63/221 (28%), Positives = 99/221 (44%), Gaps = 20/221 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKRL 61 +++ISL S RRE R + + L F FFDA+YG++ + F + ++ Sbjct: 7 IFIISLKNS-PRREFISKRLSDLKLNFEFFDAVYGKDLSEEKLKEIDFEFYPKHFSARKQ 65 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 ++L E+GC +SHI L++ I + AIILEDDA S F +++ +N I K++ Sbjct: 66 MTLGEVGCAMSHIKLYEYIVENNIDNAIILEDDAIVSLYFKEIISDA----LNKISPKYE 121 Query: 122 ALRKKPKKDSYLCTLPGNF--DIHQPRILSP---------RTTGYFIGKEAAIHLLNVRK 170 L K + + P N R +SP RTTGY I + A LL Sbjct: 122 ILFLDHGK-AKIYPFPKNLVERYRLARYISPSKNSKRTIIRTTGYIITQAGAKRLLEYAY 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D +I + EP V+ + + +E+ Sbjct: 181 PIRMPSDFLTGSLQMTHIRAYGIEPSCVFGGSHSEINEMED 221 >gi|145588437|ref|YP_001155034.1| glycosyl transferase family protein [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145046843|gb|ABP33470.1| glycosyl transferase, family 25 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 239 Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 58/222 (26%), Positives = 95/222 (42%), Gaps = 13/222 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I VISL S +R+EK + HL++ F DA+ G + +K + L Sbjct: 4 IQALVISLAGSDSRQEKVRAELGKTHLEWRFLDAVRGSALVEVPAEYKPEKVKALLGFEL 63 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+GC++SH W+ I II EDD F F + + LS + I+ Sbjct: 64 TPNELGCFLSHKKAWQECV-DKNIPTIIFEDDFYFLPHFEEAIHFLSTQCNDWDAIRLQG 122 Query: 123 LRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 L P++ C L G + + + T Y + AA L+ +IY P+D ++ Sbjct: 123 LSDVPQE----CILENGQMSLVRNVGDAVGATAYLLKPTAAKKLIAASNDIYEPLDHFLE 178 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 H+ +H++ L P V D T +S + +P P+ Sbjct: 179 HYQKHHVEFLAISPYPV-------DITRVQSTIADRPERLPI 213 >gi|54307438|ref|YP_128458.1| putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium profundum SS9] gi|46911858|emb|CAG18656.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium profundum SS9] Length = 253 Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 63/226 (27%), Positives = 105/226 (46%), Gaps = 12/226 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKRLL 62 +Y+I+L S R+E ++ I++ F+FF AI G E +P+ + + Q Q R L Sbjct: 6 IYIINLLSSPERKENIANQLNAINVPFTFFPAIDGRKEAHPLFKK-YDDQCSQLYRGRSL 64 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILIKFD 121 + ++GCY SH LW++ A I+LEDDA S EF + N KF Sbjct: 65 NKGQLGCYASHYLLWEKCA-EKGEPIIVLEDDAIISPIEFKAFCTSATSL---NSQFKFI 120 Query: 122 ALRKKPKK--DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 L K +K S T +IH+ T GYF+ A LL ++ + P+D+ Sbjct: 121 RLHKHQRKRFSSKKATEIDKLEIHRFSKGHMGTIGYFLTPSGAKKLLEHSQHWFLPVDIF 180 Query: 180 MKHWWEHNIPSLVTEPGAVY--EAIDTNDSTIEESRLVRKPTFSPL 223 M +W I S T P + ++ ++N +E + ++ ++ + Sbjct: 181 MDRFWITKIDSYGTIPPCLSHDDSYESNIGYVEYEKRAKRHLYTKI 226 >gi|303251083|ref|ZP_07337269.1| hypothetical protein APP6_0382 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252616|ref|ZP_07534510.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302650093|gb|EFL80263.1| hypothetical protein APP6_0382 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306859938|gb|EFM91957.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 224 Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 65/207 (31%), Positives = 93/207 (44%), Gaps = 14/207 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRI-FSHQKRQCQFKR 60 P+ +ISL S +RR R + L F FFDA YG++ N +I F ++ ++ Sbjct: 6 PILIISLKNS-SRRAIISQRLTSLGLSFEFFDAFYGKDLTNEELQKIDFEFYPKKFDARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIK 119 L+L EIGC +SHI +++ + + AIILEDDA S F ++ L K +I Sbjct: 65 PLTLGEIGCALSHIKVYEYMVENNIEQAIILEDDAIVSLYFESIINEALKKVPSRKEIIF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNI 172 D K K ++ LP + + + I S RTTGYFI A LL I Sbjct: 125 LD--HGKAKVWPFMRNLPERYRLARYCIPSKNSKRSIIRTTGYFITLTGAKKLLKYAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVY 199 P D I + EP V+ Sbjct: 183 RMPSDFLTGLLQLTGINAYGIEPPCVF 209 >gi|251793913|ref|YP_003008645.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter aphrophilus NJ8700] gi|247535312|gb|ACS98558.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter aphrophilus NJ8700] Length = 228 Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 67/221 (30%), Positives = 99/221 (44%), Gaps = 21/221 (9%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL-- 61 P+++ISL S RRE R + LQF FFDA+YG+ + S K F + Sbjct: 9 PIFIISLKNS-PRREFMAKRLNGLGLQFEFFDAVYGKE--LSEEALS--KVDYSFYKTYN 63 Query: 62 ---LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNIL 117 L+L EIGC SHI +++ +A + AIILEDDA S F +++ +SK + L Sbjct: 64 PHPLTLGEIGCATSHIKIYEHMAKNNISSAIILEDDAIVSQFFKEIIKDIISKIHHSYEL 123 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT-------TGYFIGKEAAIHLLNVRK 170 + D K K + L + + R S + T Y I A LLN Sbjct: 124 VFLD--HGKVKSYPFKKKLIEGYRLAHYRSPSKNSKRCIIYATAYLITLSGAQKLLNYAY 181 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D + + + EP ++ +DT DS I++ Sbjct: 182 PIRMPADYLTGLIQKTKVNAYGVEPPCIFRGLDT-DSEIDQ 221 >gi|325579204|ref|ZP_08149160.1| glycosyltransferase involved in LPS biosynthesis [Haemophilus parainfluenzae ATCC 33392] gi|325159439|gb|EGC71573.1| glycosyltransferase involved in LPS biosynthesis [Haemophilus parainfluenzae ATCC 33392] Length = 225 Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 69/235 (29%), Positives = 102/235 (43%), Gaps = 40/235 (17%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQC 56 M IP + VISL S RRE R + + L FSFFDA G+ P + + + + Sbjct: 1 MAIPKIIVISLKHS-TRRENIAKRLSGLGLDFSFFDATDGKKLPASVLESVDYDFYPKHY 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 + L+L EIGC ISHI +++ + + AIILEDDA S F +++ L+K + N+ Sbjct: 60 LSPKPLTLGEIGCAISHIKVYEYMVENNIKSAIILEDDAIVSQHFKEIVEDTLNKINKNH 119 Query: 116 ILIKFD-------------------ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 LI FD A K P K+S C + T Y Sbjct: 120 ELIFFDHGKVKSHFFKKRIVEGYRLAHYKAPSKNSRRCII--------------YATAYL 165 Query: 157 IGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I A LLN I P D + + + EP ++ +++ DS I++ Sbjct: 166 ITLSGAKKLLNYAYPIRLPADYLTGLIQKTRVDTYGIEPPCIFRGLNS-DSEIDK 219 >gi|329756897|gb|AEC04682.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 14/219 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P++VISL S ARRE R + L F F D + G+ N + ++ ++ ++ Sbjct: 6 PIFVISLKNS-ARREIIAQRLNGLKLNFQFIDGVNGKTLSQNELNKVDYTFYPQRFAARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIK 119 L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K N ++ Sbjct: 65 PLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDI---HQPRILSPR----TTGYFIGKEAAIHLLNVRKNI 172 +D K K + TL N+ + +P S R T Y I A LL + I Sbjct: 125 YD--HGKAKSYCWKKTLVENYRLVHYRKPSKTSKRAIMCATAYLITLSGAQKLLQIAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 P D + + EP V++ + +E+ Sbjct: 183 RMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQ 221 >gi|269959343|ref|ZP_06173726.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269835780|gb|EEZ89856.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 240 Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 55/205 (26%), Positives = 88/205 (42%), Gaps = 26/205 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC----QF 58 + V+V+SL S RRE+ + + ++F FFDA+ G + F H ++ C Q Sbjct: 1 MKVFVVSLARSLGRRERIEEQLKQQGIEFEFFDAVDGSIDG-----FLHSEKSCPAITQR 55 Query: 59 KR--LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ L E+ C+ SH LW R IILED+ D + Q+L H Sbjct: 56 RKGYQLKTSEVACFSSHYELWVRCTELDE-PIIILEDNVDPVEGLKQVLAHTLPLTSTYD 114 Query: 117 LIKFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 IK A +K+ P DS++ G + + TT Y + +AA + + Sbjct: 115 YIKLSATQKRAFTPIIPIDDSHML---GGYSAG-----TCGTTAYIVTPKAASKFVKHAQ 166 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEP 195 P+D M+ W HN+ + P Sbjct: 167 EFIEPVDDYMEKPWRHNVQTYSVAP 191 >gi|86133854|ref|ZP_01052436.1| glycosyltransferase family 25 (LPS biosynthesis protein) [Polaribacter sp. MED152] gi|85820717|gb|EAQ41864.1| glycosyltransferase family 25 (LPS biosynthesis protein) [Polaribacter sp. MED152] Length = 253 Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 62/221 (28%), Positives = 104/221 (47%), Gaps = 27/221 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 Y I+L S RR + ++++ + A YG++ + +I +KR+ LP Sbjct: 11 YYINLDKSTERRAFMEEQFNKLNIPITRISATYGKD--LDPKILKEEKRKHNILAHYPLP 68 Query: 66 ---EIGCYISHIHLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSKCDINNILI 118 EIG ++H LW+ ++ P +I+LEDDA +F+++ QLL ++ D +I Sbjct: 69 NDGEIGICLTHFKLWEFLSKQPEDFSIVLEDDALIQNNFTEDLEQLLSQITINDFIDISG 128 Query: 119 K--FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 K F AL K +L P +L G IGK AA L N + Y PI Sbjct: 129 KKGFYALEKNTLLTKFLM----------PPVL---MIGQVIGKNAAQKLTNNLSDYYAPI 175 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAID--TNDSTIEESRLV 215 D+ + ++H +P T V ++ID STI+++ ++ Sbjct: 176 DVMKQDVYKHKVPLFSTNKQYV-KSIDKQMGGSTIQQNNMI 215 >gi|114768964|ref|ZP_01446590.1| glycosyl transferase, family 25 [alpha proteobacterium HTCC2255] gi|114549881|gb|EAU52762.1| glycosyl transferase, family 25 [alpha proteobacterium HTCC2255] Length = 249 Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 57/217 (26%), Positives = 97/217 (44%), Gaps = 17/217 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQC--QF 58 I ++I L ++ RR+ + + DAI G P N I S+ K+ C ++ Sbjct: 8 IKCFIIHLKRANKRRKFVDEIVNNMPIASEIIDAIDGSLLPEKEINSILSN-KKICNPKY 66 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD-----FSDEFSQLLPHLSKCDI 113 ++L EIGC++SH W+RI +I+EDD F+ + L ++ K Sbjct: 67 PFNINLGEIGCFLSHREAWQRIVDQKLEAGLIIEDDVRVNTPIFNKSLNFTLRYIRKYK- 125 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 I+F ++P + + + I P + RT+ + EAA+ LLN K I Sbjct: 126 ---YIQFQV--REPNRRADIIETENKLKILNPLPVHLRTSAQLVSFEAAVELLNKTKKID 180 Query: 174 RPIDMDMKHWWEHNIPS-LVTEPGAVYEAIDTNDSTI 209 RP+D ++ +WE I V + G ++ ST+ Sbjct: 181 RPVDTTLQMFWETKIKCYCVNQSGISDHTLEAGGSTL 217 >gi|221069650|ref|ZP_03545755.1| glycosyl transferase family 25 [Comamonas testosteroni KF-1] gi|220714673|gb|EED70041.1| glycosyl transferase family 25 [Comamonas testosteroni KF-1] Length = 254 Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 60/254 (23%), Positives = 109/254 (42%), Gaps = 12/254 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY-GENNPICNR-IFSHQKRQCQFKR 60 +P+ I+L RRE+ + A++ L S A++ + +P R F + ++ + Sbjct: 4 LPIVFINLSKDAERRERMTAQFAQMGLAASRLPAVWWADLSPAEQRHYFCAPQSHGRYFK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E GCY SH+ W+++ A ++ EDD + Q L + + Sbjct: 64 PLSNGEKGCYASHLRSWQQLLDGDAPAMVVFEDDVRLLPDLPQALAAIEALPADGSWDMI 123 Query: 121 DALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 ++P+K + L G+ + + + GY I + A +L R RP+D+D Sbjct: 124 KLYGREPEKIANQGPLVEGSLQLISYQRVPSFAAGYVISRSGARKMLEARVPFDRPVDVD 183 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK---PTFSPLYFYRNTCYQWNLH 236 ++ W+E+++ VY ++ D T E S + + PT S L R + L+ Sbjct: 184 IRFWFENDLRVY-----GVYPSVIALDDTSEVSSIWAQQAAPT-SRLQKMRKFKMKLALN 237 Query: 237 YNAWRKDLPPVSTT 250 + R P VS Sbjct: 238 WGNLRAAKPQVSAV 251 >gi|109639335|gb|ABG36532.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi] Length = 249 Score = 65.1 bits (157), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 59/209 (28%), Positives = 98/209 (46%), Gaps = 20/209 (9%) Query: 3 IPVYVISLPFSHARR----EKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQC 56 +P+++I+L S R+ ++F H A I F FF+AIYG+ NP + ++ +R Sbjct: 2 LPIFIINLEKSVERKKIIQQQFSHLATEI--PFEFFNAIYGKENPDFYLFKKYNESERIK 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + ++L ++GC+ SH LW++ + + I IILEDDA D F + LS D N Sbjct: 60 RKGNQMNLAQLGCFASHYLLWEKCLELNQPI--IILEDDAILHDNFIEAYDFLS--DKEN 115 Query: 116 ILIKF------DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 I A R + K Y + + TTGY+I AA LLN Sbjct: 116 IFEFLWLSPPAPARRGQQGKKLYQIN-AKELSVFKFYKCWGNTTGYYITPAAAKKLLNYC 174 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 + +D+ ++ +W +++ L P + Sbjct: 175 QEWIYDLDITIERYWANDLACLALIPACL 203 >gi|159184922|ref|NP_354784.2| hypothetical protein Atu1799 [Agrobacterium tumefaciens str. C58] gi|159140201|gb|AAK87569.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 247 Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 6/194 (3%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLL 62 P+Y++S+ + AR EK A+ + L + + G+ +P R F + + + R Sbjct: 8 PIYIVSIARARARLEKMLEGASGLGLDLRPVEGVDGKTISPDDWRDFDRRGFELRNGRHA 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E GCY SHI + + A A+I+EDD F+ +FS+ + + ++ ++K Sbjct: 68 LPGEYGCYASHIKALETFLATDAPVAVIVEDDVAFTPDFSERVKAMVAAMPDDAIVKLTN 127 Query: 123 LRKKPKKDSYLCTLPGNFD--IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 R+K K L F I+ P+ S Y I + AA L + + P D + Sbjct: 128 HRRKGFKARKTSALGDTFGRCIYGPQGSS---ACYIISRRAAERFLKAARRMTLPFDRAL 184 Query: 181 KHWWEHNIPSLVTE 194 + W + +T+ Sbjct: 185 ECGWGYGTEVYITD 198 >gi|299533415|ref|ZP_07046797.1| glycosyl transferase, family 25 [Comamonas testosteroni S44] gi|298718621|gb|EFI59596.1| glycosyl transferase, family 25 [Comamonas testosteroni S44] Length = 254 Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 50/215 (23%), Positives = 97/215 (45%), Gaps = 8/215 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY-GENNPICNR-IFSHQKRQCQFKR 60 +P+ I+L RRE+ + A++ L S A++ G+ + + F + ++ + Sbjct: 4 LPIVFINLSKDAERRERMTAQFAQMGLTASRLPAVWWGDLSEAEQKTFFCAPQSHGRYFK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E GCY SH+ W+++ S A ++ EDD + Q L + + Sbjct: 64 PLSNGEKGCYASHLRSWQQLMDSDAPAMVVFEDDVRLLPDLPQALAAIEALPTDGSWDMI 123 Query: 121 DALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 ++P+K + L G+ + + + GY I + A +L R RP+D+D Sbjct: 124 KLYGREPEKIADQGPLVQGSLQLISYQRVPSFAAGYVISRSGARKMLEARVPFDRPVDVD 183 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 ++ W+E+++ VY ++ + D T E S + Sbjct: 184 IRFWFENDLRVY-----GVYPSVISLDDTSEISSI 213 >gi|239831175|ref|ZP_04679504.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] gi|239823442|gb|EEQ95010.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] Length = 255 Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 51/232 (21%), Positives = 101/232 (43%), Gaps = 10/232 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I L + R+ + ++ ++ +A+ G + +R++ + ++ Sbjct: 18 VKAFIIHLERATDRQPQVEALIRKLPIETDVVNAVDGRTLDAQTISRVYRRSAHKPRYPF 77 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS EI C++SH W+ I ++LEDD + + EF+ + + I+F Sbjct: 78 QLSTNEIACFLSHRKAWQAIVDQGLDAGLVLEDDVELTPEFAAAYSASCQLLKAHSFIRF 137 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 R++ L T I +P + +G+EAA LL+ + RP+D Sbjct: 138 -PFRERESGRVVLTT--EGVRIIEPVPVGLGMVAQLVGREAAQRLLSATEVFDRPVDTTA 194 Query: 181 KHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYR 227 + W + L PG V E ++ STI++SR +++ PLY ++ Sbjct: 195 QMNWVTGLKPLSVLPGGVKEISVQLGGSTIQKSRSLPDKLKREILRPLYRWK 246 >gi|260877887|ref|ZP_05890242.1| glycosyltransferase, family 25 [Vibrio parahaemolyticus AN-5034] gi|193787963|dbj|BAG50472.1| putative Lex2B [Vibrio parahaemolyticus] gi|308089865|gb|EFO39560.1| glycosyltransferase, family 25 [Vibrio parahaemolyticus AN-5034] Length = 240 Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 22/203 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR-- 60 + V+V+SL S RRE+ + + ++F FFDA+ G I + S + R +R Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQGIEFDFFDAVDGS---IDGFLHSEKSRPAITQRRK 57 Query: 61 --LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L E+ C+ SH LW R IILED+ D + Q+L H I Sbjct: 58 GYQLKTSEVACFSSHYELWVRCTELDE-PIIILEDNVDPVEGLKQVLAHTLPLTSTYDYI 116 Query: 119 KFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 K A +K+ P DS++ G + + TT Y + +AA + + Sbjct: 117 KLSATQKRVFTPIIPIDDSHML---GGYSAG-----TCGTTAYIVTPKAASKFVKHAQEF 168 Query: 173 YRPIDMDMKHWWEHNIPSLVTEP 195 P+D M+ W HN+ + P Sbjct: 169 IEPVDDYMEKPWRHNVQTYSVAP 191 >gi|229527261|ref|ZP_04416654.1| beta-1,4-galactosyltransferase [Vibrio cholerae 12129(1)] gi|229335269|gb|EEO00753.1| beta-1,4-galactosyltransferase [Vibrio cholerae 12129(1)] Length = 242 Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 69/241 (28%), Positives = 103/241 (42%), Gaps = 24/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + A L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASYGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ E+GCY SH +W++ + + I IILEDD S + L DI + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCVEINEPI--IILEDDIKISPMLVEYL-----ADIKHE 111 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAAIHLLNVRKNIY 173 + F LR + + L + + LS GY I ++A LLN + Sbjct: 112 VSSFGFLRLQAAYERSDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWC 171 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRN 228 P+D + + H +PS + P V E + ++T + R L RKPT YR Sbjct: 172 MPVDNFIGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSAPLYRKPTREIYSLYRK 230 Query: 229 T 229 Sbjct: 231 V 231 >gi|171318340|ref|ZP_02907499.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] gi|171096482|gb|EDT41380.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] Length = 272 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 68/258 (26%), Positives = 104/258 (40%), Gaps = 31/258 (12%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M ++VISL S RR R + F F DA + C+ + + ++ Sbjct: 1 MTFAIHVISLADS-PRRATIRRAVGRHGVDFDFEDAFDARALDAHACSAMTDTARVIARY 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD----EFSQLLPHLSKCDIN 114 R LS E+GC++SH+ +W++I S AI+LEDDA D F L + Sbjct: 60 GRPLSRGEVGCFMSHVRVWEKIVRS-GRAAIVLEDDAMLDDALFERFRSTPNELLSDHAD 118 Query: 115 NILIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLN 167 +L+ L ++ +YL G I P + + GY+I + A L Sbjct: 119 LVLLGRSKLSRERAAQAYLYEPLKRARRIGGLQIGVPFKQWTSGAVGYWISADGARKALE 178 Query: 168 VRKNIYRPIDMDMKHW-WEHNIPSL-VTE--PGAVYEAIDTNDSTIEESRLVRKPT---- 219 + P+ + W W + L + E P AV+EA +T S IE R P Sbjct: 179 HARG---PVGALLDDWPWHRDHGGLRIAELRPYAVWEAFETMPSAIEAGRAALSPRRHGA 235 Query: 220 ----FSPLYFYRNTCYQW 233 F PL R ++W Sbjct: 236 LELLFKPLRAVR-AVFRW 252 >gi|264676250|ref|YP_003276156.1| glycosyl transferase, family 25 [Comamonas testosteroni CNB-2] gi|262206762|gb|ACY30860.1| glycosyl transferase, family 25 [Comamonas testosteroni CNB-2] Length = 254 Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 48/215 (22%), Positives = 92/215 (42%), Gaps = 8/215 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQFKR 60 +P+ I+L RRE+ + A++ L S A++ + + F + ++ + Sbjct: 4 LPIVFINLSKDAERRERMTAQFAQMGLTASRLPAVWWADLSEAEQKAFFCAPQSHGRYFK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E GCY SH+ W+++ A ++ EDD + Q L + + Sbjct: 64 PLSNGEKGCYASHLRSWQQLLGGDAPAMVVFEDDVRLLPQLPQALAAIEALPADGSWDMI 123 Query: 121 DALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 ++P+K + L G + + + GY I + A +L R RP+D+D Sbjct: 124 KLYGREPEKIANQGPLVAGALQLISYQRVPSFAAGYVISRSGARKMLETRMPFDRPVDVD 183 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 ++ W+E+++ VY ++ D T E S + Sbjct: 184 IRFWFENDLRVY-----GVYPSVIALDDTSEVSSI 213 >gi|149909363|ref|ZP_01898019.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Moritella sp. PE36] gi|149807680|gb|EDM67628.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Moritella sp. PE36] Length = 246 Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 53/204 (25%), Positives = 90/204 (44%), Gaps = 12/204 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLS 63 ++V++L S R+E + + +++ F A+ G E+ ++ Q + L+ Sbjct: 4 IFVVNLEHSVERKEYISAQLDSMPVKYEMFPAVNGHESRLSILDMYDDGLSQAYRSKSLT 63 Query: 64 LPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSD----EFSQLLPHLSKCDINNILI 118 +IGCY SH LW++ + + I I++EDD EF L+P L D + Sbjct: 64 KGQIGCYASHYLLWQKCVELNKPI--IVIEDDIAIDKKRFIEFYDLVPEL---DSKYEFV 118 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + A R++ + + + T+ GN IH+ T GYF+ A L Y +D+ Sbjct: 119 RLSAHRRRKARLTTIETI-GNLSIHRASKGHMSTMGYFLTPNGAKKFLKYSTAWYMAVDI 177 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAI 202 M +W H EP V E I Sbjct: 178 YMDRFWIHGAEPFGLEPPCVSEDI 201 >gi|85713329|ref|ZP_01044347.1| putative glycosyltransferase [Idiomarina baltica OS145] gi|85692849|gb|EAQ30829.1| putative glycosyltransferase [Idiomarina baltica OS145] Length = 240 Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 88/193 (45%), Gaps = 8/193 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + +++I+L S R EK + R+++ F A+ G + F ++ + + Sbjct: 1 MNVKIFLINLDRSTERLEKAAQQLNRLNVPFERVSAVDGSKLTGAEIDAAFDTEQAERRT 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--I 116 L++ EIGCY+SH+ W+RI AIILEDD D+F + + + +N Sbjct: 61 AYNLTIGEIGCYLSHVECWRRIITDKLDFAIILEDDLVLDDQFPHAISAIEQFTHSNEWD 120 Query: 117 LIKF-DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 IK + +K+P + T F + T + + A LL+ R +R Sbjct: 121 YIKLANPFKKRPYQTR---TALDRFTLVDYPKAPTGTVAQAVSFQGAETLLSHRPPFFRA 177 Query: 176 IDMDMKHWWEHNI 188 ID+D++ WE I Sbjct: 178 IDVDLQWEWELGI 190 >gi|262273482|ref|ZP_06051296.1| beta-1,4-galactosyltransferase [Grimontia hollisae CIP 101886] gi|262222460|gb|EEY73771.1| beta-1,4-galactosyltransferase [Grimontia hollisae CIP 101886] Length = 246 Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 63/221 (28%), Positives = 94/221 (42%), Gaps = 26/221 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSH---QKRQCQFKRL 61 + ++SL S RR+K + + L F FFDA+ G+N ++IF H +KR Sbjct: 3 IIIVSLKHSLERRKKIEKTLSALKLPFCFFDAVDGKNEH--HKIFDHYNNKKRVFIKGYP 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP--HLSKCDINNILIK 119 L E+GC+ SH LW+R ++LEDD + D+F + HL I I Sbjct: 61 LKPGELGCFASHYSLWERCV-ELNEPLLVLEDDIELEDDFLNVYKKMHLLAESIPYFRIG 119 Query: 120 --FDALRKKPKK-----DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 DA K K+ D L T P T GY I EAA ++ N Sbjct: 120 RVLDAKYKIFKQIDDCHDLVLYTKPVR-----------STQGYMITPEAAKKFIDAAYNW 168 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 Y P+D M+ W H + + EP + + + E ++ Sbjct: 169 YEPVDDFMEKEWLHGVLNFGIEPPLIRHDFNIHSEIGERAK 209 >gi|187920688|ref|YP_001889720.1| glycosyl transferase family 25 [Burkholderia phytofirmans PsJN] gi|187719126|gb|ACD20349.1| glycosyl transferase family 25 [Burkholderia phytofirmans PsJN] Length = 275 Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 66/258 (25%), Positives = 109/258 (42%), Gaps = 31/258 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 +PV+VISLP S RR + + F DA+ G + N ++ + ++ R Sbjct: 6 VPVHVISLPRS-GRRGAIAALLSGRGVAFRIEDAVDGRLLTHSELNAVYDDTAARRRYGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN----I 116 ++ E+ C++SH +W++I A+ILEDDA F + + H + + + Sbjct: 65 SMTSAEVACFLSHRSVWRKI-VDNGCAAVILEDDAILEPAFFERVLHADESQLAAVAGIV 123 Query: 117 LIKFDALRKKPKKDSYLC------TLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLNVR 169 L+ LR+ +YL G + P + + + GY+I +AA L Sbjct: 124 LLGRSKLRRAASFWTYLNEPLRWRASVGGLRVGVPFKQWTSGSVGYWISAQAARQALAYS 183 Query: 170 KNIYRPIDMDMKHW-W---EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR-KPTFS--- 221 + PI + W W E P AV+E + S+IE+ R R KP S Sbjct: 184 EG---PIGALLDDWPWHRDEGGARVAELRPYAVWEDFERLPSSIEQERKARIKPRASLHV 240 Query: 222 ----PLYFYRNTCYQWNL 235 PL R T +W++ Sbjct: 241 VALWPLRLAR-TAVRWSV 257 >gi|118581167|ref|YP_902417.1| glycosyl transferase family protein [Pelobacter propionicus DSM 2379] gi|118503877|gb|ABL00360.1| glycosyl transferase, family 25 [Pelobacter propionicus DSM 2379] Length = 251 Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 54/214 (25%), Positives = 87/214 (40%), Gaps = 22/214 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +++++L + RR C R + L+ F A+ G + + + R LS Sbjct: 9 LFILNLERAAQRRHIMCQRLCALGLEAEFLTAVDGASLDMDALPRGTEPR-------LSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 E GCY+SH+ W + A+ILEDD FS ++ +I + + FDA+R Sbjct: 62 GEKGCYLSHVAAWNTVVERGLSHAVILEDDVILGSNFSAVV-----AEIIALGMPFDAVR 116 Query: 125 KKPKKDSYL---------CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 K L TL N + P+ + T GY + A L + P Sbjct: 117 LSALKPDLLMPEHGAVPVATLASNARLVLPKKMPSGTQGYLVSYSGAKRLRSKISVPREP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 +D +W+H + V P V E + +STI Sbjct: 177 VDDVFDTYWKHGLCIPVLFP-TVVEEDPSQESTI 209 >gi|269103933|ref|ZP_06156630.1| putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium damselae subsp. damselae CIP 102761] gi|268163831|gb|EEZ42327.1| putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium damselae subsp. damselae CIP 102761] Length = 249 Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 53/200 (26%), Positives = 99/200 (49%), Gaps = 8/200 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKR 60 I ++VI++P S R+E + +++L F F+A+ G +++P+ N ++ Q + Sbjct: 2 IDIFVINIPSSVERKENITQQLVKLNLDFKLFEAVNGHKDSSPLFN-LYDEALSQQYRGK 60 Query: 61 LLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDA-DFSDEFSQLLPHLSKCDINNILI 118 LS ++GCY SH LW++ + + I IILEDDA +S+ F ++ + SK + I Sbjct: 61 SLSKGQLGCYASHYLLWQKCVEINKPI--IILEDDALIYSEPFLDIVENASKLAEHFECI 118 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + ++K + + LP +H+ TGYF+ A L + Y +D+ Sbjct: 119 RLFDNKRKTFRHAKEYPLPHT-AVHKFSKGHMSATGYFLTPTGAKKFLAHSQAWYLAVDI 177 Query: 179 DMKHWWEHNIPSLVTEPGAV 198 M +W +++ T P + Sbjct: 178 YMDRFWINDVNVHGTVPACL 197 >gi|15603008|ref|NP_246080.1| LosA [Pasteurella multocida subsp. multocida str. Pm70] gi|12721490|gb|AAK03227.1| LosA [Pasteurella multocida subsp. multocida str. Pm70] gi|329756891|gb|AEC04677.1| glucosyltransferase [Pasteurella multocida] gi|329756904|gb|AEC04688.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 59/219 (26%), Positives = 97/219 (44%), Gaps = 14/219 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ ++ ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQRFAARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIK 119 L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K N ++ Sbjct: 65 PLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDI---HQPRILSPR----TTGYFIGKEAAIHLLNVRKNI 172 +D K K + TL N+ + +P S R T Y I A LL + I Sbjct: 125 YD--HGKAKSYCWKKTLVENYRLVHYRKPSKTSKRAIMCATAYLITLSGAQKLLQIAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 P D + + EP V++ + +E+ Sbjct: 183 RMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQ 221 >gi|222109426|ref|YP_002551690.1| glycosyl transferase family 25 [Acidovorax ebreus TPSY] gi|221728870|gb|ACM31690.1| glycosyl transferase family 25 [Acidovorax ebreus TPSY] Length = 252 Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 48/200 (24%), Positives = 86/200 (43%), Gaps = 7/200 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKR 60 IP+ I+L R+++ + +R+ L+ A+ + FS + Q+ + Sbjct: 4 IPIVYINLAKDTERQQRMQAQFSRLGLETFRLPAVRWSELSAEAQAHHFSAELNSHQYFK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILI 118 + E GCY SHI W+++ S A + EDD + L +S D +++ Sbjct: 64 PMGSGEKGCYCSHITAWRQLLDSAAAAMAVFEDDVRLLPAMPEALRAISTLPADCWDMVK 123 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 F R+K L G+ ++ R + GY I + A +L R+ RP+D+ Sbjct: 124 LFGREREKIASRRPLV---GSLELITYRRVPSFAAGYVISRSGARKMLESRQPFGRPVDV 180 Query: 179 DMKHWWEHNIPSLVTEPGAV 198 DM+ W+E+ + P V Sbjct: 181 DMRFWFENGMRVFGVHPSVV 200 >gi|262163773|ref|ZP_06031513.1| putative glycosyltransferase WavM [Vibrio mimicus VM223] gi|262027753|gb|EEY46418.1| putative glycosyltransferase WavM [Vibrio mimicus VM223] Length = 242 Score = 62.4 bits (150), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 30/242 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + A L F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMADYGLGFEFFDAIDGRVDPPHPLFAN--YDYAKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIA-YSPAIGAIILEDDADFSDEFS---QLLPHLSKC-- 111 ++ E+GCY SH +W++ A S I +ILEDDA FS +++ + ++C Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCAELSEPI--VILEDDAKILPSFSTYLEVIKYKTQCYG 116 Query: 112 --DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + + K+ D + L NF + LSP ++A LLN Sbjct: 117 FLRLEEADKRSQIFLKEKASDYEISFLSHNFGGARAYALSP---------QSAQKLLNGS 167 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY--EAIDTNDSTIEESR--LVRKPTFSPLYF 225 P+D M + H +PS + P + E +T +S+ L RKPT Sbjct: 168 LRWSMPVDNYMGSLYLHGVPSFLFSPSVIENPEFFETTFQNNWQSKTPLYRKPTREIYSL 227 Query: 226 YR 227 YR Sbjct: 228 YR 229 >gi|260779644|ref|ZP_05888534.1| putative glycosyltransferase WavM [Vibrio coralliilyticus ATCC BAA-450] gi|260604453|gb|EEX30757.1| putative glycosyltransferase WavM [Vibrio coralliilyticus ATCC BAA-450] Length = 241 Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 70/241 (29%), Positives = 104/241 (43%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSH----QKRQCQF 58 + V+VISL S RRE + ++++ F FFDA+ G P +++F + ++ Sbjct: 1 MKVFVISLQRSPERREYIKKQLDQLNIDFEFFDAVDGRAEP-PHKLFENYNYAKRLWLTS 59 Query: 59 KRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCDINNI 116 R+ S E+G Y SH LWK+ I I I+ EDDA F S L K D Sbjct: 60 GRMPSKGELGVYGSHYLLWKKSIELDEPI--IVFEDDASILPSFPSSLKVVQEKID---- 113 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG---YFIGKEAAIHLLNVRKNIY 173 ++ LR + K D + L D ++ G Y I AA L++ + Sbjct: 114 --RYGFLRLESKTDKGVLCLKEEDDNFSISFMTNNYGGLRCYAISPYAAKKLVSNSERWC 171 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVY--EAIDTNDSTIEES--RLVRKPTFSPLYFYRNT 229 P+D + + H++PS + P V + T EES RL RKPT YR Sbjct: 172 MPVDNYVGSLYLHDMPSYILSPAVVENPQEFATTIQLGEESKARLYRKPTRELYSLYRKI 231 Query: 230 C 230 Sbjct: 232 A 232 >gi|39545765|gb|AAR27965.1| putative LPS biosynthesis protein [Aeromonas hydrophila] Length = 221 Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 61/224 (27%), Positives = 93/224 (41%), Gaps = 27/224 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IPV+VISL S RR + A + + F F DA G+ + + F K C Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGIDFEFVDATDGKALSSAELAKVDFELAKEVCGHA 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF---SDEFSQLLPHLSKCDINNI 116 LS EIGC +SHI L++RI ++LEDD S + + H K +I + Sbjct: 62 --LSFGEIGCAMSHIRLYERIVAEGIERCVVLEDDIYLHMHSKAIIEAIVHSCKSEIVFL 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLN 167 K K+ +L +LPG + + R L+P T GY + A LL Sbjct: 120 ------HHGKAKRWPFLSSLPGGYRL--ARYLTPSRTSKRGVLSTAGYVLSLTGAKKLLQ 171 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + P D + + + EP + +D +TI++ Sbjct: 172 CAYPLRMPSDYLTGRLQLNGLTASGVEPSCL--DVDLFTTTIDD 213 >gi|83593566|ref|YP_427318.1| glycosyl transferase family protein [Rhodospirillum rubrum ATCC 11170] gi|83576480|gb|ABC23031.1| Glycosyl transferase, family 25 [Rhodospirillum rubrum ATCC 11170] Length = 255 Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 54/225 (24%), Positives = 96/225 (42%), Gaps = 20/225 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR-QCQ-- 57 M +PV+V++LP ARRE+ AR+ L F D + + + + R +C+ Sbjct: 1 MSLPVFVLNLPRDGARRERMTAELARVGLDGHFIDGV--DAGTLSQADWDRYDRSRCRAI 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 + + E+ CY++H ++++I A+ILEDD +++ +L L L Sbjct: 59 YGVDMLATELACYLAHERIFRKIVDENIDAALILEDDCQIANDLPSVLDALMATPPAARL 118 Query: 118 ---IKFDALRKKPKKDSYLCTLP--------GNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 ++ +R+ P G + +H +L GY + A +L Sbjct: 119 WQVVRLSTMREGKISAQARRLRPLRRLGDDRGLYRVHT-HVLG--LQGYVMTAAGARRML 175 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + + I+ PID + +WE+ I + P V D STI E Sbjct: 176 DYGRKIFMPIDHTLDRYWENRITPFIVHPCPVVHH-DEGHSTIGE 219 >gi|33152738|ref|NP_874091.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi 35000HP] gi|1388151|gb|AAB49623.1| LOS biosynthesis enzyme LBGA [Haemophilus ducreyi] gi|2198846|gb|AAC45592.1| LosA [Haemophilus ducreyi] gi|33148962|gb|AAP96480.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi 35000HP] Length = 224 Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 66/224 (29%), Positives = 98/224 (43%), Gaps = 24/224 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P+++ISL S R+ R + L F+FFDAIYG+ + + ++ ++ Sbjct: 6 PIFIISLKDS-PRKNIIAERLNALGLSFNFFDAIYGKELSKKQLSEINYDFYIQKFNARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-INNILIK 119 L+L EIGC ISHI L++ I + AIILEDDA S F ++ CD + I K Sbjct: 65 RLTLGEIGCAISHIKLYEYIVKNNINEAIILEDDAIVSIYFKDII-----CDALKKISCK 119 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLN 167 ++ L K K + +L + + + +SP RTT Y I A LLN Sbjct: 120 YEILFLDHGKAKIFPFTRSLVERYRL--AKYISPSKKSKRIIIRTTAYIITLNGARKLLN 177 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D I + EP V+ + IE+ Sbjct: 178 YAYPIRMPSDYLTGTLQLTGINAYGIEPPCVFGGPISEIDQIED 221 >gi|153007953|ref|YP_001369168.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC 49188] gi|151559841|gb|ABS13339.1| glycosyl transferase family 25 [Ochrobactrum anthropi ATCC 49188] Length = 255 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 63/256 (24%), Positives = 103/256 (40%), Gaps = 24/256 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRL 61 +PVYVI+L S R ++ A + + +A+ G+ +P F + + Sbjct: 5 VPVYVINLARSRDRWDRLKSNADALSIDLRRVEAVEGKLLSPEELTDFDDAGFRRWHGKT 64 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-------IN 114 + EIGCY SHI + + IA +P A+I+EDD F+ F L +L+K +N Sbjct: 65 VLPAEIGCYFSHIQVLEIIAAAPESFAVIVEDDVIFTPAFQPFLTYLTKVTGWDAVKLVN 124 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 + F +K + S L G + Y + +E A +L + + Sbjct: 125 HRTAAFRPFQKAGDQFSIGRCLHGPLG---------SSAAYVVTREGASKMLKALRPMRL 175 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY------FYRN 228 P D+ ++ W T+ V E D STI + R P+Y +R Sbjct: 176 PYDVALERGWAGAYQVFTTDK-PVVEFSDIAISTIAQGRSAYAKKRLPVYKRITTLLFRA 234 Query: 229 TCYQWNLHYNAWRKDL 244 T Y + Y +K L Sbjct: 235 TDYVRRIAYALDKKGL 250 >gi|110634465|ref|YP_674673.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110285449|gb|ABG63508.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 250 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 45/213 (21%), Positives = 87/213 (40%), Gaps = 6/213 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + +VI L + R + + + + DA+ E + R++ + ++ Sbjct: 1 MKVEAFVIHLARATGRAPQVERLRQNLTIPVTVIDAVDAEQLSEEEIVRVYRPGLHRPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L E+ C++SH W+ I +I+EDD + L C ++ Sbjct: 61 PFPLRRTEVACFLSHRKAWQTIMERGLDAGLIIEDDVELLPGVHHLFDSGLACATTKDVL 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F KK ++ G I +PR++ +G++AA LL + RP+D Sbjct: 121 RFP---KKARERGAALNENGANRIFEPRLVGLGMQAQLVGRDAAGELLAFTREFDRPVDT 177 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 ++ W H + L + P A+ E A +T++ Sbjct: 178 TIQMRWLHGVRVLSSSPVAIREVAAALGGTTVQ 210 >gi|18653277|gb|AAL77351.1|AF444792_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 230 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 68/239 (28%), Positives = 102/239 (42%), Gaps = 24/239 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ E+GCY SH +W++ + + I IILEDD S + L DI + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCVELNEPI--IILEDDIKISPMLVEYL-----ADIKHE 111 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAAIHLLNVRKNIY 173 + F LR + + L + + LS GY I ++A LLN + Sbjct: 112 VSSFGFLRLQAAYERSDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWC 171 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYR 227 P+D + + H +PS + P V E + ++T + R L RKPT YR Sbjct: 172 MPVDNFIGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSTPLYRKPTREIYSLYR 229 >gi|145628223|ref|ZP_01784024.1| LosA [Haemophilus influenzae 22.1-21] gi|144979998|gb|EDJ89657.1| LosA [Haemophilus influenzae 22.1-21] Length = 221 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 63/226 (27%), Positives = 97/226 (42%), Gaps = 29/226 (12%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NN---PICNRIFSHQK 53 P++VISL S RR+ R + L F F D I G+ NN C + F+ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFCPQRFA--- 61 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCD 112 ++ L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K Sbjct: 62 -----RKPLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVS 116 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDI---HQPRILSPR----TTGYFIGKEAAIHL 165 N ++ +D K K + TL N+ + +P S R TT Y I A L Sbjct: 117 KNVEILFYD--HGKAKSYCWKKTLVKNYRLVHYRKPSKTSKRAIMCTTAYLITLSGAQKL 174 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 L + I P D + + EP V++ + +E+ Sbjct: 175 LQIAYPIRMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQ 220 >gi|254448265|ref|ZP_05061727.1| putative Lex2B [gamma proteobacterium HTCC5015] gi|198262132|gb|EDY86415.1| putative Lex2B [gamma proteobacterium HTCC5015] Length = 241 Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 57/218 (26%), Positives = 92/218 (42%), Gaps = 7/218 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + + VISL S RR+ + + + F FFDAI ++P C R +K R Sbjct: 1 MKICVISLKNSQERRQAVKQQMDILGIDFEFFDAI-TPDDPSCRFYRRLDRRKYLINTGR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 S EI CY SH LW S + +I+EDD + +F L C + Sbjct: 60 HPSPTEIACYASHYALWLECIRSKS-PMLIMEDDFTLTPDFKNAL---VACQSLADQYGY 115 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 L+++ + S G F + + + Y I A + + + + P+D+ + Sbjct: 116 IRLQEEYRARSKAILKHGEFTLSRYTRVPHCMMAYSINPSTAKAFVRLSETLSAPVDVML 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 K WEH P +V + ++DSTI+ + RKP Sbjct: 176 KKHWEHKQAIYGLTPYSVIASPHSSDSTIKGRQKHRKP 213 >gi|325473997|gb|EGC77185.1| hypothetical protein HMPREF9353_01535 [Treponema denticola F0402] Length = 210 Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 42/157 (26%), Positives = 80/157 (50%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +V++L + R++ I + + FF A+YG++ ++ + + + + KR L++ Sbjct: 4 FVLNLEHNTERKKYMQDILKGIPIDYEFFPAVYGKSITDIDQFYDSKLAEKRAKRQLNVG 63 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SH ++K++ A+ILEDD F F ++ +S+ ++ N ++ K Sbjct: 64 EIGCALSHKAIYKKMIDENISQALILEDDISFLPNFFEVYTAMSQFNVGNKVVLLGTTAK 123 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 KP K + L N+ ++ T GY IG +AA Sbjct: 124 KPMKKVWKKNLFNNYSMYLVLNSYGGTYGYIIGLDAA 160 >gi|229521046|ref|ZP_04410467.1| beta-1,4-galactosyltransferase [Vibrio cholerae TM 11079-80] gi|229341931|gb|EEO06932.1| beta-1,4-galactosyltransferase [Vibrio cholerae TM 11079-80] Length = 242 Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 24/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ E+GCY SH +W++ + + I IILEDD S + L DI + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCVELNEPI--IILEDDIKISPMLVEYL-----ADIKHE 111 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR---TTGYFIGKEAAIHLLNVRKNIY 173 + F LR + + L + + LS GY I ++A LLN + Sbjct: 112 VSSFGFLRLQAAYERSDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWC 171 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRN 228 P+D + + H +PS + P V E + ++T + R L RKPT YR Sbjct: 172 MPVDNFIGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSAPLYRKPTREIYSLYRK 230 Query: 229 T 229 Sbjct: 231 V 231 >gi|254225693|ref|ZP_04919300.1| putative glycosyltransferase WavM [Vibrio cholerae V51] gi|125621813|gb|EAZ50140.1| putative glycosyltransferase WavM [Vibrio cholerae V51] Length = 242 Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 24/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ E+GCY SH +W++ + + I IILEDD S + L DI + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCVELNEPI--IILEDDIKISPMLVEYL-----ADIKHE 111 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR---TTGYFIGKEAAIHLLNVRKNIY 173 + F LR + + L + + LS GY I ++A LLN + Sbjct: 112 VSSFGFLRLQAAYERSDIFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWC 171 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRN 228 P+D + + H +PS + P V E + ++T + R L RKPT YR Sbjct: 172 MPVDNFIGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSTPLYRKPTREIYSLYRK 230 Query: 229 T 229 Sbjct: 231 V 231 >gi|83951263|ref|ZP_00959996.1| glycosyl transferase, family 25 [Roseovarius nubinhibens ISM] gi|83839162|gb|EAP78458.1| glycosyl transferase, family 25 [Roseovarius nubinhibens ISM] Length = 250 Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 46/165 (27%), Positives = 74/165 (44%), Gaps = 9/165 (5%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNI-LIK 119 L + EIGC++SH +W I A+ILEDDA E F+Q L L + ++ I+ Sbjct: 64 LRMGEIGCFLSHRQIWAEILRRDLDHALILEDDAGLDPEHFAQALT-LGRDHLDRFGYIQ 122 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 F + + + L G + P++ RTT + + +A LL RP+D Sbjct: 123 FQT--RPARGPARLIATQGACQLLLPQMAGLRTTAQLVTRASAEQLLAASDRFDRPVDTF 180 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDT-NDSTIEESRLVRKPTFSPL 223 ++ W+ + + P + E D STI+ + RKP L Sbjct: 181 VQSHWQTGLRPAMILPSGISEIADQLEGSTIQSA---RKPVLEKL 222 >gi|172062730|ref|YP_001810381.1| glycosyl transferase family protein [Burkholderia ambifaria MC40-6] gi|171995247|gb|ACB66165.1| glycosyl transferase family 25 [Burkholderia ambifaria MC40-6] Length = 272 Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 70/258 (27%), Positives = 105/258 (40%), Gaps = 31/258 (12%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M ++VISL S RR R + F F DA + C+ + + ++ Sbjct: 1 MTFAIHVISLADS-PRRATIRRAVGRHGVDFDFEDAFDARALDADACSAMTDTVRVIARY 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLP---HLSKCDIN 114 R LS E+GC++SH+ +W++I S AI+LEDDA D F + L L + Sbjct: 60 GRPLSRGEVGCFMSHVRVWEKIVRS-GRAAIVLEDDAMLDDALFERFLSTPGELLSDHAD 118 Query: 115 NILIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLN 167 +L+ L + +YL G I P + + GY+I A L Sbjct: 119 FVLLGRSKLSRDRAAQAYLYEPLKRARRIGGLQIGVPFKQWTSGAVGYWISVHGARKALE 178 Query: 168 VRKNIYRPIDMDMKHW-WEHNIPSL-VTE--PGAVYEAIDTNDSTIEESRLV----RKPT 219 + P+ + W W + L + E P AV+EA +T S IE R R T Sbjct: 179 HARG---PVGAVLDDWPWHRDHGGLRIAELRPYAVWEAFETMPSAIEAGRAALSSRRHGT 235 Query: 220 ----FSPLYFYRNTCYQW 233 F PL R ++W Sbjct: 236 LELLFKPLRVVR-AVFRW 252 >gi|332290185|ref|YP_004421037.1| Glycosyltransferase family 25 [Gallibacterium anatis UMN179] gi|330433081|gb|AEC18140.1| Glycosyltransferase family 25 [Gallibacterium anatis UMN179] Length = 246 Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 57/222 (25%), Positives = 99/222 (44%), Gaps = 16/222 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL- 61 +P+YVI + + R + + + ++F FF AI + P + +FSH + F+R Sbjct: 2 LPIYVIHIDSATERADSIQQQFDNLKIKFEFFPAINAKKTP-NHPLFSHYNAKKHFQRKG 60 Query: 62 --LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS E+GCY SH WK+ + + I I+LEDD + F + + + I+ Sbjct: 61 RNLSSGELGCYASHYSTWKKCLELNQPI--IVLEDDVTILENFKDIYTNAER-----IIQ 113 Query: 119 KFDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 K+D + + D + F I + GY I +AA LL + P Sbjct: 114 KYDFVWLHKNHRSDDKVIVESIDTFSITKFYRDYFCAQGYLITPKAAKQLLAYCEEWIYP 173 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 +D M ++E+ I + P + + I + +S I + R +K Sbjct: 174 VDDQMGRFYENKIENYAVYPACI-DHIASMESLIGDDRRGKK 214 >gi|258620626|ref|ZP_05715663.1| Putative glycosyltransferase [Vibrio mimicus VM573] gi|258587141|gb|EEW11853.1| Putative glycosyltransferase [Vibrio mimicus VM573] Length = 252 Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 34/107 (31%), Positives = 61/107 (57%), Gaps = 4/107 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + +YV+SL +RR ++ ++++ F F DA+ G P I + + +++ Q +++R Sbjct: 1 MGIYVVSL-LDSSRRINIINQFEKLNINFEFVDAVNGRLLPQAIVDSV-NNKDVQSRYRR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L EIGC +SH +++ IA S AII EDD SD S ++ + Sbjct: 59 KIGLGEIGCALSHQKIYQAIAESSVDYAIIFEDDIQLSDNISAVIEY 105 >gi|148832478|gb|ABR14155.1| LosA [Haemophilus influenzae] Length = 225 Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 69/219 (31%), Positives = 97/219 (44%), Gaps = 16/219 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRI-FSHQKRQCQFKRL 61 + VISL S +RR R + L+F FFDA YG+ P I + + + + + Sbjct: 6 IIVISLKNS-SRRAIISKRLLGLGLEFEFFDATYGKTLPQEILDSVDYEFYPKNYLSPKP 64 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKF 120 L+L EIGC ISHI ++ I + AIILEDDA S +++ L+K N LI Sbjct: 65 LTLGEIGCAISHIRAYEHIVENKIESAIILEDDAIISHFLKEIVQDALNKVKKNYDLIFL 124 Query: 121 DALRKKP---KKDSYLCTLPGNFDIHQPRILSPR----TTGYFIGKEAAIHLLNVRKNIY 173 D + K KK Y ++ P S R TT Y I + A LLN + Sbjct: 125 DHGKAKSYPFKKKLYEGYRLAHY--RAPSKNSKRCIIYTTAYLITQLGASKLLNYAYPVR 182 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEA--IDTNDSTIE 210 P D + I + EP V+ ID+ + IE Sbjct: 183 MPSDYLTGLIQKTKINAYGIEPPCVFRGLSIDSEINQIE 221 >gi|222084495|ref|YP_002543024.1| glycosyltransferase protein [Agrobacterium radiobacter K84] gi|221721943|gb|ACM25099.1| glycosyltransferase protein [Agrobacterium radiobacter K84] Length = 279 Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 56/200 (28%), Positives = 86/200 (43%), Gaps = 18/200 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRL 61 + +YVI+L S R E+ C +AAR L AI G P +RI F Q+ R Sbjct: 25 VCIYVINLDRSRERWERLCGQAARYGLNVIRVAAIDGAKIPEGDRIDFQQQQFVYHNGRK 84 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF--SQLLPHLSKCD------I 113 L E GCY SH+ ++ S AII+EDD + ++ L S C + Sbjct: 85 LLAGEYGCYRSHLLALQQFIDSGDKMAIIMEDDVELNERLIPRALSAMNSVCGARLVKLV 144 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 N+ L+ F L + + D + G +H P+ Y + + AA LL K + Sbjct: 145 NHRLVGFKPLSETTEND-----VVGRC-MHGPQ---GSAACYIVNRSAAKKLLITLKPML 195 Query: 174 RPIDMDMKHWWEHNIPSLVT 193 P D+ ++ W + + T Sbjct: 196 LPYDIALERGWSTGVETFST 215 >gi|66796392|gb|AAY56464.1| probable transglycosylase LosA [Haemophilus influenzae] gi|309972300|gb|ADO95501.1| beta-1,4-galactosyltransferase [Haemophilus influenzae R2846] Length = 225 Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 67/218 (30%), Positives = 98/218 (44%), Gaps = 15/218 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRI-FSHQKRQCQFKRL 61 + VISL S +RR R + L+F FFDA YG+ P I + + + + + Sbjct: 6 IIVISLKNS-SRRAIISKRLLGLGLEFEFFDATYGKTLPQEILDSVDYEFYPKHYLSPKP 64 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKF 120 L+L EIGC ISHI ++ I + AIILEDDA S +++ + L+K N LI Sbjct: 65 LTLGEIGCAISHIRAYEHIVENKIESAIILEDDAIISHFLKEIVQYALNKVKKNYDLIFL 124 Query: 121 DALRKKP---KKDSYLCTLPGNFDIHQPRILSPR----TTGYFIGKEAAIHLLNVRKNIY 173 D + K KK Y ++ P S R T Y I A LLN I Sbjct: 125 DHGKAKSYPFKKKLYEGYRLAHY--RAPSKNSRRCIIYATAYLITLSGAKKLLNYAYPIR 182 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 P D + + + EP V+ +++ DS I++ Sbjct: 183 LPADYLTGLIQKTRVNTYGIEPPCVFRGLNS-DSEIDK 219 >gi|323497081|ref|ZP_08102104.1| glycosyl transferase family protein [Vibrio sinaloensis DSM 21326] gi|323317925|gb|EGA70913.1| glycosyl transferase family protein [Vibrio sinaloensis DSM 21326] Length = 235 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 54/206 (26%), Positives = 89/206 (43%), Gaps = 20/206 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC------QF 58 ++VISL S RR + +QF F +A+ + + +R+C +F Sbjct: 3 IFVISLERSTDRRARVIEALGSKDIQFEFLNAVDASRPE-----YQYSERRCDALTRKRF 57 Query: 59 KRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 L+ EI C+ SH W++ + + I ++LED+ DFS+ F LL +L + Sbjct: 58 GYTLTTGEIACFASHHLAWEKCLELNEPI--MVLEDNCDFSERFFTLLSNLDTPLKSYSH 115 Query: 118 IKFDALRKKPK--KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 IK A R +P K+ L I + + T GY I AA ++ P Sbjct: 116 IKLAATRSQPPIIKEQ----LSQELAIVNYKRRTCGTMGYLITPNAAARFISGASLFIEP 171 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA 201 +D M+ ++H + + P VY A Sbjct: 172 VDNYMEKPYKHGVKTFCFYPDLVYRA 197 >gi|40063395|gb|AAR38206.1| glycosyl transferase, family 25 [uncultured marine bacterium 580] Length = 257 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 51/201 (25%), Positives = 85/201 (42%), Gaps = 11/201 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFKRL 61 +P +VI++P R+ K + + + FF A N P + KR+ F R Sbjct: 8 LPTFVITMPDEINRQNKSKKQLKGVCDNYEFFFANSKPNKFPTT---YVKWKRRFYFGRD 64 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS E GC+ SH + ++I A++LEDD F F + + L KC + L++ Sbjct: 65 LSYGEFGCFNSHKLVLEKIVSKEIPKALVLEDDFIFLPNFEKSISDLLKCSYDWELVR-- 122 Query: 122 ALRKKPKKDSYL----CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L KPK ++ + L N+ + + Y + A LL+ I+ P+D Sbjct: 123 -LLSKPKLNNRMKKNVANLGNNYKLVRIATAPGGAYAYIVTLRGAKKLLSAMNKIWCPVD 181 Query: 178 MDMKHWWEHNIPSLVTEPGAV 198 + M W ++ L P Sbjct: 182 LVMGQPWRTDLEILTVIPSVA 202 >gi|48243736|gb|AAT40841.1| putative Lex2B [Haemophilus influenzae] Length = 175 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 45/154 (29%), Positives = 74/154 (48%), Gaps = 8/154 (5%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS--KCDINNILIKFDAL 123 ++GCY SH +W++ I+LEDDA F + F ++L ++ K ++ D L Sbjct: 2 QLGCYASHYSMWEKCV-ELDYPIIVLEDDAKFKNNFLEVLDFINSDKNTFEFFWLQPDRL 60 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 + K K L + GN I+Q TTGY++ +AA L K Y +D+ M + Sbjct: 61 KNKRK----LISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLTVDVTMDRF 116 Query: 184 WEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 +E+ +P P + E +STI E + R+ Sbjct: 117 FENKVPPYAIVPFCL-EDDGEIESTIYEKQKKRR 149 >gi|165976409|ref|YP_001652002.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|190150305|ref|YP_001968830.1| LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|303253576|ref|ZP_07339714.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307250230|ref|ZP_07532186.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|165876510|gb|ABY69558.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|189915436|gb|ACE61688.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|302647496|gb|EFL77714.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306857759|gb|EFM89859.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 227 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 64/222 (28%), Positives = 96/222 (43%), Gaps = 13/222 (5%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 6 MPIPPIFVISLANS-VRRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 65 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 124 Query: 116 ILIKFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 +I FD + K P K+ Y + RI+ TGY++ E A LLN Sbjct: 125 QIIFFDHGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMI-ATGYYLTLEGAKLLLNHA 183 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + P D I + EP ++ + + IE+ Sbjct: 184 YPLRMPADFLTGAIQMTGIKAYGVEPPCIFISPGSEIDEIEK 225 >gi|307247978|ref|ZP_07530009.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307261465|ref|ZP_07543136.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|307263648|ref|ZP_07545259.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|306855532|gb|EFM87704.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306868860|gb|EFN00666.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306871025|gb|EFN02758.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 222 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 64/222 (28%), Positives = 96/222 (43%), Gaps = 13/222 (5%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 1 MPIPPIFVISLANS-VRRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 60 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 119 Query: 116 ILIKFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 +I FD + K P K+ Y + RI+ TGY++ E A LLN Sbjct: 120 QIIFFDHGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMI-ATGYYLTLEGAKLLLNHA 178 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + P D I + EP ++ + + IE+ Sbjct: 179 YPLRMPADFLTGAIQMTGIKAYGVEPPCIFISPGSEIDEIEK 220 >gi|255264180|ref|ZP_05343522.1| lipooligosaccharide galactosyltransferase I [Thalassiobium sp. R2A62] gi|255106515|gb|EET49189.1| lipooligosaccharide galactosyltransferase I [Thalassiobium sp. R2A62] Length = 255 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 63/218 (28%), Positives = 91/218 (41%), Gaps = 16/218 (7%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRL 61 P++VISLP RR + L F DAI G + H Q R Sbjct: 9 PIFVISLPDRLDRRAAITAQLTAHDLPFEIIDAIDGRSGLTEAQKSAVDHAAVQRHLGRP 68 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS PEI C SH +++R+ GA+ILEDDA + F+ L + ++ +++F Sbjct: 69 LSDPEIACAQSHQSVYQRVITLDLGGAVILEDDATLTTAFATL--YRARAYQRFDILQFS 126 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRIL--SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 R + +L DI R+L S + Y + + AA +L + K I P D Sbjct: 127 YRRARVFPSRWLWPHLTE-DIRAARVLLNSEVASTYSVNQRAARYLSHGPKPIGGPAD-- 183 Query: 180 MKHWWEHNIPSL---VTEPGAVYEAIDTNDSTIEESRL 214 W + L VT P + DT S IE R+ Sbjct: 184 ----WPRDTFGLRHAVTVPQTADQNCDTLSSDIEFGRV 217 >gi|124265921|ref|YP_001019925.1| LPS biosynthesis glycosyltransferase [Methylibium petroleiphilum PM1] gi|124258696|gb|ABM93690.1| glycosyltransferase involved in LPS biosynthesis-like protein [Methylibium petroleiphilum PM1] Length = 253 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 64/242 (26%), Positives = 97/242 (40%), Gaps = 34/242 (14%) Query: 13 SHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYIS 72 +H RRE+F AA ++ FFDA+ P + +F R L E+GCY S Sbjct: 4 AHQRREQFAAGAAHGTARWEFFDAL---RLPAQGLHYDEPLTVRRFGRALKPGEVGCYAS 60 Query: 73 HIHLWKRIAYSPAIGAIILEDDA-------------DFSDEFSQLLPHLSKCDINNILIK 119 H +W++ S A ++ EDD +D +L + IN + K Sbjct: 61 HYEVWRQFLMSAADQLLVFEDDVMVDWPLIEQLCAHRLADHGIDVLRLYTSHPINVRIAK 120 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + L S+L + G Q +LS R G EA LL+ + + P+D Sbjct: 121 YKLL----SDHSHLMQVRGYLYGTQAYVLSRR------GAEA---LLSACRVMTMPVDWA 167 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 M +W +P+ P V E STIE ++ + S L +R + W L A Sbjct: 168 MSRYWVCGMPAFTLFPFPVLE--RHGPSTIEHAQQIGA---SSLASHRVDRFLWRLRDRA 222 Query: 240 WR 241 R Sbjct: 223 AR 224 >gi|262276308|ref|ZP_06054117.1| putative glycosyltransferase WavM [Grimontia hollisae CIP 101886] gi|262220116|gb|EEY71432.1| putative glycosyltransferase WavM [Grimontia hollisae CIP 101886] Length = 247 Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 63/222 (28%), Positives = 95/222 (42%), Gaps = 10/222 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-NNPICNRIFSHQKRQ-CQFKRLL 62 VYVISL S RREK A ++ F+FFDA+ G P + ++KR + Sbjct: 6 VYVISLKRSVERREKIAKEMAEKNIAFTFFDAVDGHLGEPELAADYDYRKRLWLTSGKTP 65 Query: 63 SLPEIGCYISHIHLW-KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 S E+GCY SH LW K I I II EDD + + ++ + I+ Sbjct: 66 SKGELGCYASHYALWLKCIELGEPI--IICEDDIELHENAKSIVEYALMAANEYGFIRMQ 123 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 ++ K + G F +H R Y I +AA L+ R P+D + Sbjct: 124 SVCKGSATTEVI--KDGEFSLHLMRDNYGGLCAYAISPKAASRLIKHRWCF--PVDCFVG 179 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 + H S EP + + I N+STI+ ++ + P + L Sbjct: 180 ANYIHGQYSYQLEPCFMADHIG-NESTIQFEKMEKPPFYRKL 220 >gi|262404971|ref|ZP_06081523.1| putative glycosyltransferase WavM [Vibrio sp. RC586] gi|262348810|gb|EEY97951.1| putative glycosyltransferase WavM [Vibrio sp. RC586] Length = 242 Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 67/244 (27%), Positives = 99/244 (40%), Gaps = 34/244 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRVDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGA--IILEDDADFSDEFSQLLPHLSKCDINN 115 ++ E+GCY SH +W++ +G +ILEDDA F L + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCV---ELGEPIVILEDDAKILPSFINYLNIIRVKTQEY 115 Query: 116 ILIKFDA-------LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + K+ D + L NF + LSP E+A L+ Sbjct: 116 GFLRLEEAYDRSRLFLKEKTNDFKISFLTNNFGGARAYSLSP---------ESAKKLIKG 166 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPL 223 + P+D M + H++PS V P V E D ++T + R L RKPT Sbjct: 167 SQRWSMPVDNYMGSLYLHDMPSFVFYPSVV-ENPDHFETTFQNDRQPSAPLYRKPTRELY 225 Query: 224 YFYR 227 YR Sbjct: 226 SLYR 229 >gi|328471783|gb|EGF42660.1| glycosyl transferase family protein [Vibrio parahaemolyticus 10329] Length = 240 Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 50/204 (24%), Positives = 86/204 (42%), Gaps = 24/204 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR-----IFSHQKRQCQ 57 + V+V+SL S RRE+ + + ++F FFDA+ G + + + + +++ Q Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQGIEFEFFDAVDGSIDGFLHSEKSRPVITQRRKGYQ 60 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 K E+ C+ SH LW R IILED+ D + Q+L + Sbjct: 61 LK----TSEVACFSSHYELWVRCTELDE-PIIILEDNVDPVEGLKQVLAYTLPLTSTYDY 115 Query: 118 IKFDALRKKP------KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 IK A +K+ DS++ G + + TT Y + +AA + + Sbjct: 116 IKLSATQKRAFTPIISIDDSHML---GGYSAG-----TCGTTAYIVTPKAASKFVKHAQE 167 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEP 195 P+D M+ W HN+ + P Sbjct: 168 FIEPVDDYMEKPWRHNVQTYSVAP 191 >gi|91226583|ref|ZP_01261307.1| putative Lex2B [Vibrio alginolyticus 12G01] gi|91189057|gb|EAS75339.1| putative Lex2B [Vibrio alginolyticus 12G01] Length = 240 Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 53/205 (25%), Positives = 86/205 (41%), Gaps = 26/205 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQ--CQFKR 60 + V+V+SL S RRE+ + + + F FFDA+ G +R +K + +R Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQDITFEFFDAVDGS----VDRFLHSEKSRPTITLRR 56 Query: 61 ---LLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L E+ C+ SH LW R + I +ILED+ D + Q+L H N Sbjct: 57 KGYQLKTSEVACFASHYELWVRCVELDEPI--VILEDNVDPVEGLKQVLEHTLPLTSNYD 114 Query: 117 LIKFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 IK A +K+ P D+++ G + + TT Y + +AA + + Sbjct: 115 YIKLSATQKRVFTPIVPIDDNHML---GGYSAG-----TCGTTAYIVTPKAASKFVMHAQ 166 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEP 195 P+D M+ W H + P Sbjct: 167 EFIEPVDDYMEKPWRHGVQVYSVSP 191 >gi|118594621|ref|ZP_01551968.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Methylophilales bacterium HTCC2181] gi|118440399|gb|EAV47026.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Methylophilales bacterium HTCC2181] Length = 260 Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 54/185 (29%), Positives = 80/185 (43%), Gaps = 8/185 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 I V+VISL RRE R I F FFDAI G+ + + KR+ + + Sbjct: 8 IKVFVISLAREKQRREAIQKRLNGICNTFEFFDAIDGKKEQLSQHNDYIGLKRRLFYGKD 67 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS E+GC+ SH L +I A+ILEDDA ++F + L + ++F Sbjct: 68 LSDGELGCFFSHRALINKIVQEKIPFAVILEDDAILLNDFKPTVESLLNTSYSWDFVRF- 126 Query: 122 ALRKKPKKDSYLCTLPGN-FDIHQPRILSPRTTG---YFIGKEAAIHLLNVRKNIYRPID 177 KPK T+ N FD +Q ++ G Y + + A L + ++ P D Sbjct: 127 --LTKPKIQRKTQTIVANLFDNYQLLRIATAPGGAYAYIVSMKGAKKLQRSMEKVWCPND 184 Query: 178 MDMKH 182 M Sbjct: 185 TLMGQ 189 >gi|117620908|ref|YP_858651.1| putative lipooligosaccharide biosynthesis protein LpsA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562315|gb|ABK39263.1| putative lipooligosaccharide biosynthesis protein LpsA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 221 Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 58/224 (25%), Positives = 92/224 (41%), Gaps = 27/224 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IPV+VISL S RR + A + + F F DA G+ N + + K C Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGIDFEFVDATDGKTLSNAELAKVDLALAKEVCGHA 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF---SDEFSQLLPHLSKCDINNI 116 LS EIGC +SHI +++R+ ++LEDD S + + H C + Sbjct: 62 --LSFGEIGCAMSHIRVYERVVAEGIDRCVVLEDDIYLHMHSKAIIETIVH--SCQSEIV 117 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLN 167 + K K+ +L +LPG + + R L+P T GY + A LL Sbjct: 118 FLH----HGKAKRWPFLSSLPGGYRL--ARYLTPSRSSKRGVLSTAGYVLTLAGAKKLLQ 171 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + P D + + + EP + +D +TI++ Sbjct: 172 CAYPLRMPSDYLTGRLQLNGLSASGVEPCCL--DVDLFTTTIDD 213 >gi|149189214|ref|ZP_01867501.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio shilonii AK1] gi|148836968|gb|EDL53918.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio shilonii AK1] Length = 258 Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 69/231 (29%), Positives = 104/231 (45%), Gaps = 28/231 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLL 62 ++VISLP + RRE+ + + QF+FFDA+ G R + R+ R L Sbjct: 14 IFVISLPDATERRERVKRIFSETNYQFTFFDAVNGRKGLPERLERRVNDTYRKVFRSRPL 73 Query: 63 SLPEIGCYISHIHLWKR-IAYSPAIGAIILEDD----ADFSDEFSQLLPHLSKCDINNIL 117 S E GC+ SH+ LW++ + I ++LEDD F + FSQ LP L K Sbjct: 74 SPGEKGCFASHLLLWEKCLELDEPI--VLLEDDFLPTESFENAFSQ-LPLLIKD------ 124 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRIL---SPRTTGYFIGKEAAIHLLNVRKNIYR 174 I++ L K+ K + ++ G + R L S TTGY + + A LL + Sbjct: 125 IEYLRLEKRTSKWFPINSVGG----FERRFLFDNSCGTTGYVLTPQGAQKLLQHSEEWLC 180 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 +D + + H + S P A+ EA S I+ + T PLYF Sbjct: 181 AVDNYISESFRHGMYSFNISPPAI-EAPHDMGSLIQ----CEEKTRVPLYF 226 >gi|46143791|ref|ZP_00134259.2| COG3306: Glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208454|ref|YP_001053679.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae L20] gi|126097246|gb|ABN74074.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 213 Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 62/206 (30%), Positives = 89/206 (43%), Gaps = 13/206 (6%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 6 MPIPPIFVISLANS-VRRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 65 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 124 Query: 116 ILIKFDALRKK------PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 +I FD + K P K+ Y + RI+ TGY++ E A LLN Sbjct: 125 QIIFFDHGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMI-ATGYYLTLEGAKLLLNHA 183 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEP 195 + P D I + EP Sbjct: 184 YPLRMPADFLTGAIQMTGIKAYGVEP 209 >gi|126724376|ref|ZP_01740219.1| Glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] gi|126705540|gb|EBA04630.1| Glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] Length = 247 Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 50/185 (27%), Positives = 80/185 (43%), Gaps = 6/185 (3%) Query: 16 RREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISH 73 R E ++ A ++ F +A+ G + ++++ + K L PEIGCY+SH Sbjct: 7 RMENVSNQMAEQNIAFVRVNAVDGRKFDEAETSKVYDAAANKKYGKYPLVGPEIGCYLSH 66 Query: 74 IHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYL 133 I+ W+ IA A G I EDD +L LS + + ++K A + Sbjct: 67 INAWRTIAEGDAEGGFIFEDDFQSDQYLKTVLTALSNDERDWDMVKLFAFNPAVPLANGR 126 Query: 134 CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDMDMKHWWEHNIP-S 190 P + I P + GY + K AA L+ + +RP+D D K WE + + Sbjct: 127 ELTPDHI-IGIPYRVPTCLIGYGVTKLAAQSLVKRADQARFFRPVDEDQKFVWETGLRVA 185 Query: 191 LVTEP 195 LV P Sbjct: 186 LVARP 190 >gi|167412429|gb|ABZ79878.1| unknown [Campylobacter jejuni] Length = 259 Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 49/183 (26%), Positives = 79/183 (43%), Gaps = 13/183 (7%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKRLLS 63 +I+L + R+E + L + +A+ G+ N I + K KR L Sbjct: 5 IINLKKAKQRKEYISKLCQKYQLDYEIIEAVEGKAISKQEYLN-IVDYDKMLNFHKRELG 63 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 L E+GC +SH +++I A+ILEDDA F + + L + ++ + L+ Sbjct: 64 LGELGCSLSHKKCYEKILQEKLKYAVILEDDAYFDENLLEFLQYFNEFPKDLELLLLGHQ 123 Query: 124 RKKPKKDSYLCTLP---------GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 R+ D + P N+ I + T GYFI K A+ LL+ + IY Sbjct: 124 RQVYSDDGFRIESPYSRRFAKKISNYKIRRLIARGNGTYGYFITKNGALKLLSHLEKIYL 183 Query: 175 PID 177 PID Sbjct: 184 PID 186 >gi|301598260|ref|ZP_07243268.1| LPS glycosyltransferase subfamily protein [Acinetobacter baumannii AB059] Length = 238 Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 58/203 (28%), Positives = 92/203 (45%), Gaps = 18/203 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAAR-------IHLQFSFFDAIYGENNPICNRIFSHQKRQ 55 I + +SL S+ R+EK + I + F FFDAIYG+ P IF + R Sbjct: 4 ILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFQFFDAIYGKKLPNEYLIFLNLSR- 62 Query: 56 CQFKRL----LSLPEIGCYISHIHLWKRIA---YSPAIGAIILEDDADFSDE-FSQLLPH 107 QF L L E+GC++SH+ +W+R+A Y+ II+EDD F E Q L Sbjct: 63 -QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHS 121 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 L + + + + + + Y+ F++ P+ L T Y + +E A +N Sbjct: 122 LLETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDFVN 181 Query: 168 VRKNIYRPIDMDMKHWWEHNIPS 190 + ID D K+ E +I + Sbjct: 182 KQIKKLTYID-DWKYLLEGDIST 203 >gi|301344884|ref|ZP_07225625.1| LPS glycosyltransferase subfamily protein [Acinetobacter baumannii AB056] Length = 271 Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 60/205 (29%), Positives = 90/205 (43%), Gaps = 22/205 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAAR-------IHLQFSFFDAIYGENNPICNRIFSHQKRQ 55 I + +SL S+ R+EK + I + F FFDAIYG+ P IF + R Sbjct: 4 ILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFQFFDAIYGKKLPNEYLIFLNLSR- 62 Query: 56 CQFKRL----LSLPEIGCYISHIHLWKRIA---YSPAIGAIILEDDADFSDEFSQLLPHL 108 QF L L E+GC++SH+ +W+R+A Y+ II+EDD F E Q L Sbjct: 63 -QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQ--QKL 119 Query: 109 SKCDINNILIKFDALRKKPKK---DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 N F +P + Y+ F++ P+ L T Y + +E A Sbjct: 120 HSLLETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDF 179 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPS 190 +N + ID D K+ E +I + Sbjct: 180 VNKQIKKLTYID-DWKYLLEGDIST 203 >gi|325293185|ref|YP_004279049.1| glycosyl transferase, family 25 [Agrobacterium sp. H13-3] gi|325061038|gb|ADY64729.1| Glycosyl transferase, family 25 [Agrobacterium sp. H13-3] Length = 247 Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 50/195 (25%), Positives = 84/195 (43%), Gaps = 8/195 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLL 62 PVY+IS+ + AR EK + A+ + L + + G+ P F+ + + + R Sbjct: 8 PVYIISIARARARLEKMLNGASGLRLDLRPVEGVDGKTIPAAEWTDFNRRGFELRNGRHA 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E GCY SHI + + A A+I+EDD F+ +FS + + N ++K Sbjct: 68 LPGEYGCYASHIKALEIFLATDAPVAVIVEDDVTFTPDFSDRVKAMVAIMPENSIVKLTN 127 Query: 123 LRK---KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 R + +K S L G I+ P+ + Y I + A L + + P D Sbjct: 128 HRHRGFRARKTSALGDKLGRC-IYGPQ---GSSACYIISRGGAEEFLKAARVMTLPFDRA 183 Query: 180 MKHWWEHNIPSLVTE 194 ++ W + VTE Sbjct: 184 LECGWGYGTHVYVTE 198 >gi|260912783|ref|ZP_05919269.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260633161|gb|EEX51326.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 222 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 15/221 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRI-FSHQKRQCQ 57 MP P++VISL S RR+ R ++L F F D I G+ N++ + ++ Sbjct: 4 MP-PIFVISLKGS-PRRDVISQRLNGLNLNFRFIDGINGKELTQDELNKVDYEFYLKRFN 61 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNI 116 ++ L++ E+GC +SH+ +++ I + AIILEDDA S F L+ + K N Sbjct: 62 SRKPLTIGEVGCALSHLSIYEMIVQNKIEKAIILEDDAIVSQVFESLVKDSIRKSPDNTE 121 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDI---HQPRILSPRT----TGYFIGKEAAIHLLNVR 169 +I +D K K + T+ N+ + +P S R T Y I A LL V Sbjct: 122 IIFYD--HGKAKSYCWRKTISENYRLVHYRRPSKNSKRVIICATAYLITLSGAQKLLKVA 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 + P D + + EP V++ + +E Sbjct: 180 YPLRMPADYLTGALQLTKLKAYGVEPPCVFKGTISEIDAME 220 >gi|294678480|ref|YP_003579095.1| family 25 glycosyl transferase [Rhodobacter capsulatus SB 1003] gi|294477300|gb|ADE86688.1| glycosyl transferase, family 25 [Rhodobacter capsulatus SB 1003] Length = 258 Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 60/209 (28%), Positives = 94/209 (44%), Gaps = 34/209 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL- 61 +PV VISL + AR+ + ARI + ++ F+ I G + + + + Sbjct: 8 LPVQVISLARATARQAQVAQEFARIGMGYTIFEGIDGAAHQ--AELLGRTDLAAWHRNMG 65 Query: 62 --LSLPEIGCYISHIHLWKRI-AYSPAIGAIILEDDADFSDEFSQ-LLPHLSKCDINNIL 117 +S +GCY SH+ LW++I A P I +I EDD F+ +F Q L L+ + +IL Sbjct: 66 APISAGHMGCYASHVELWRQIGAEGPEI-VLICEDDVTFTADFPQALAAGLAMAERWDIL 124 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT--------TG---YFIGKEAAIHLL 166 +F +R K + LP Q R L P T TG Y I + A L Sbjct: 125 -RFSCIRAKGR-------LP------QAR-LGPFTLMAYWGPFTGNGCYLIKRAVAARLT 169 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 I R D ++ +++H+I + EP Sbjct: 170 ERFYPIRRAHDHELNRFFDHDIRLMGLEP 198 >gi|332992059|gb|AEF02114.1| glycosyl transferase family protein [Alteromonas sp. SN2] Length = 227 Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 41/176 (23%), Positives = 77/176 (43%), Gaps = 2/176 (1%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 F A +FS AI G ++ + ++ +S + + + + L+L EIGC++SH W Sbjct: 2 FTQSAKHYSAEFSVITAIKGSSDNVMHQGYSKRANKLNYYKKLTLGEIGCFLSHRKAWSF 61 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN 139 I A++LEDD F F QL + K + I + + + K+ + + Sbjct: 62 IVEHKLAFAVVLEDDVMFEHSFQQLDKVIDKINEPWDYINLNEVHE--KRSATTVFMSEG 119 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + + L + I A LL ++ RP+D++++ I +L +P Sbjct: 120 VSVVKYNKLPIGASAQVISYSGAEKLLKYTESFSRPLDVELQWLAVDKIKALGIKP 175 >gi|262380196|ref|ZP_06073351.1| LPS glycosyltransferase subfamily [Acinetobacter radioresistens SH164] gi|262298390|gb|EEY86304.1| LPS glycosyltransferase subfamily [Acinetobacter radioresistens SH164] Length = 271 Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 57/203 (28%), Positives = 93/203 (45%), Gaps = 18/203 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAAR-------IHLQFSFFDAIYGENNPICNRIFSHQKRQ 55 I + +SL S+ R+EK + I + F FFDAIYG+ P IF + R Sbjct: 4 ILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFEFFDAIYGKKLPNEYLIFLNLSR- 62 Query: 56 CQFKRL----LSLPEIGCYISHIHLWKRIA---YSPAIGAIILEDDADF-SDEFSQLLPH 107 QF L L E+GC++SH+ +W+R+A Y+ II+EDD F + Q L Sbjct: 63 -QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYTAYDRIIIIEDDVIFQTKHIQQRLHS 121 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 L + D + + + + + Y+ F++ P+ L T Y + +E A ++ Sbjct: 122 LLQTDPSFAFLGGHSEPSRRRIRGYISNDELYFNMTGPKDLYTATYAYSLTRETAQDFIH 181 Query: 168 VRKNIYRPIDMDMKHWWEHNIPS 190 + ID D K+ E +I + Sbjct: 182 KQIKKLTYID-DWKYLLEGDIST 203 >gi|167626112|ref|YP_001676406.1| glycosyl transferase family protein [Shewanella halifaxensis HAW-EB4] gi|167356134|gb|ABZ78747.1| glycosyl transferase family 25 [Shewanella halifaxensis HAW-EB4] Length = 238 Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 29/212 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS--HQKRQCQFKR 60 + V+VISL S R +K A + FSFFDA ++NP + IFS ++ ++K Sbjct: 1 MKVFVISLKSSIERHKKVAE--ALKEVPFSFFDAENIKDNP-QHSIFSLYDSRKTKRYKG 57 Query: 61 L-LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-----LPH-LSKCDI 113 L++PE+GC+ SHI LWK+ + I+ ++ A F D SQL L H + Sbjct: 58 YELTIPELGCFASHISLWKQCLADNEVFLILEDNLALFGDLESQLDNIESLTHKYGVVKL 117 Query: 114 NNILI-KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 N+ KF + ++ + L G + T+ Y I +AA + LN Sbjct: 118 GNVFERKFVDIEPIDEQYKLISNLKG----------ACGTSAYAITPQAAQNYLNQVDGF 167 Query: 173 YRPIDMDMKHWWEHNI------PSLVTEPGAV 198 + P+D M + W N P LV+ A Sbjct: 168 FEPVDDFMDNEWRTNQTLYSYHPKLVSRSAAA 199 >gi|90414920|ref|ZP_01222884.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] gi|90323976|gb|EAS40572.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] Length = 249 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 63/243 (25%), Positives = 97/243 (39%), Gaps = 27/243 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-----ICNRIFSHQKRQ 55 M +Y+I+L S R+ + + L ++ F AI G +P + + S Q R Sbjct: 1 MTCNIYIINLASSLDRKVNITKQLDNMSLPYTIFSAIDGRKDPHPLFDMYDDKLSQQYRG 60 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDIN 114 + LS ++GCY SH LW A II+EDDA F + H +K Sbjct: 61 ----KSLSKGQLGCYASHFLLWGECVRINA-PIIIIEDDALIDPVPFLDFIKHNTK---- 111 Query: 115 NILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 I K+ +R K C P + +I + TTGY + EAA L+ + Sbjct: 112 -IAAKYKCVRLFDNKRKNFCFSPKEQFDSLNICKFNKGHMSTTGYLLTPEAARKLIMHSQ 170 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTC 230 Y +D+ M +W + + T P + TND + K SP F+ Sbjct: 171 RWYMAVDIYMDRFWVNEVECFGTSPACL-----TNDPIFDSEIGYGKK--SPRSFFSRVK 223 Query: 231 YQW 233 +W Sbjct: 224 REW 226 >gi|146294977|ref|YP_001185401.1| glycosyl transferase family protein [Shewanella putrefaciens CN-32] gi|145566667|gb|ABP77602.1| glycosyl transferase, family 25 [Shewanella putrefaciens CN-32] Length = 236 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 60/220 (27%), Positives = 103/220 (46%), Gaps = 15/220 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKR 60 + ++VISL S RR + + A+ ++F FF+A+ E + +R+ + ++ + Sbjct: 1 MKIFVISLERSIERRTQMIAKFAKAGIEFEFFNAVDASEEGFTLSDRVAPNVTKKRKGYE 60 Query: 61 LLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LI 118 LL+ E+ CY SH LW++ + + I +I+ED A+ +D+F L ++ IN I Sbjct: 61 LLN-SEVACYASHYLLWEKCVEMNQPI--VIVEDHAELTDDFKATLA-IAFQHINEFGYI 116 Query: 119 KFDALRKKPK--KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 K A K K +D + T I + TTGY + +AA + I P+ Sbjct: 117 KLSAPLKDRKFIEDKKIDTFHS---IGHYTKNTCFTTGYIVSPDAAKCFIKASDRIVEPV 173 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D M+ W H + + P Y A STI +R ++ Sbjct: 174 DDFMEKPWLHRVRAYSLMPFICYRA--NLPSTIGSARKIK 211 >gi|194288948|ref|YP_002004855.1| glycosyl transferase [Cupriavidus taiwanensis LMG 19424] gi|193222783|emb|CAQ68786.1| glycosyl transferase [Cupriavidus taiwanensis LMG 19424] Length = 256 Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 36/114 (31%), Positives = 60/114 (52%), Gaps = 6/114 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 I YVI+L + ARR++ + R+ + F F A+ G + + R + Q + Sbjct: 2 IGAYVINLEAAEARRQRIAGQLTRLGVPFQVFPAVNGRALAEDEVARR-YDAQAASASY- 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCD 112 R +S EIGC +SH+ +++++ A A++LEDDA D+ +L L SK D Sbjct: 60 RPMSRGEIGCALSHLGVYRKMLEDGASLALVLEDDALLGDDVPAVLEALASKMD 113 >gi|222087061|ref|YP_002545596.1| glycosyl transferase protein [Agrobacterium radiobacter K84] gi|221724509|gb|ACM27665.1| glycosyl transferase protein [Agrobacterium radiobacter K84] Length = 278 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 42/169 (24%), Positives = 66/169 (39%), Gaps = 9/169 (5%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 +S+ EI C++SH W I ++ EDD + F C I+F Sbjct: 101 MSVGEIACFLSHRKAWAAIVEQGVDAGLVFEDDVEIDASFHAAFAAAQACLTPGAFIRFP 160 Query: 122 ALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + K+ C L G + QP + + ++AAI LL RP+D + Sbjct: 161 F---RMGKEHGECVLTHGQASVIQPGRVGLGMVAQLVSRDAAIRLLEATALFDRPVDTTV 217 Query: 181 KHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRLV----RKPTFSPLY 224 + W + L PG V+E + STI+ + V + PLY Sbjct: 218 QMRWITGLSPLAVLPGGVHEISSQLGGSTIKSRKTVFEKLSREILRPLY 266 >gi|145297203|ref|YP_001140044.1| LPS biosynthesis glycosyltransferase [Aeromonas salmonicida subsp. salmonicida A449] gi|142849975|gb|ABO88296.1| LPS biosynthesis glycosyltransferase [Aeromonas salmonicida subsp. salmonicida A449] gi|224995183|gb|ACN76671.1| WasB [Aeromonas salmonicida] Length = 227 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 33/106 (31%), Positives = 56/106 (52%), Gaps = 5/106 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IP++VISL S ARR + + L+F+FFDA+ G+ + + + F + C Sbjct: 2 IPIFVISLSRSQARRAMLIKQMKHLGLEFTFFDAVDGKALSDAELQHVDFPLARETCGHD 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 LS+ E+GC +SHI L++ + ++LEDD F+ ++ Sbjct: 62 --LSMGEVGCAMSHIRLYEMMVEKNIERCVVLEDDIYLHMHFNAIM 105 >gi|90414922|ref|ZP_01222886.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] gi|90323978|gb|EAS40574.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] Length = 247 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 80/201 (39%), Gaps = 7/201 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLS 63 ++V+SL S RR+ L F F D I G + +P + Q + R + Sbjct: 3 IFVLSLVDSTERRKNVSSLLNEYQLDFEFIDGIDGRKKDPTLLAQYDEQAFIYNYGRKAA 62 Query: 64 LPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+GCY SH+ WK+ + I II EDD Q + + + I+ + Sbjct: 63 MGELGCYASHMLAWKKCVELDEPI--IIFEDDFRLKQGIHQAIDECKEVISDFHFIRLED 120 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 KP+ Y + + + T Y I AA LL NI P+D+ +++ Sbjct: 121 GNIKPQ---YKVKAQNDMTVFNYLKVPQCATCYAISPVAARKLLAKSNNIAFPVDVFIRN 177 Query: 183 WWEHNIPSLVTEPGAVYEAID 203 W H P EP V D Sbjct: 178 VWVHRQPIFSLEPFFVTTGTD 198 >gi|262371064|ref|ZP_06064386.1| glycosyl transferase, family 25 [Acinetobacter johnsonii SH046] gi|262313950|gb|EEY94995.1| glycosyl transferase, family 25 [Acinetobacter johnsonii SH046] Length = 254 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 51/226 (22%), Positives = 92/226 (40%), Gaps = 11/226 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFK 59 M + Y+I+L S R E+ + +++ F F A+ G + + Q Sbjct: 1 MKVVTYLINLDGSQERLERATQQLNQVNWPFERFSAVDGRGKDLTEFANYDDQGANDVLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----ADFSDEFSQLLPHLSK-CDIN 114 R L E+GCY+SH ++ + A ++LEDD DF D+ ++ +L + +++ Sbjct: 61 RRLMNSELGCYLSHYGCAEQFLSTDADYLVVLEDDMKINQDFKDKLDGVIEYLDQHKELD 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI +KK KD + + G + R G ++ A + + + Sbjct: 121 WYLINIATKKKKLAKD--ITNIDG-MSVWHAFYFPIRGLGLVWSRQGAEAFMQAGRKMTM 177 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND--STIEESRLVRKP 218 P+D+ + W N L P V A +D T+ + RK Sbjct: 178 PVDIFFQTWLSKNGKGLGVWPALVKPAGLDSDILGTVAAQGISRKE 223 >gi|283954707|ref|ZP_06372225.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 414] gi|283793899|gb|EFC32650.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 414] Length = 253 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 57/219 (26%), Positives = 95/219 (43%), Gaps = 48/219 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHMQKQIQKLFLKNPNLKNKLEFIFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + + I +ILEDD +FSDE Sbjct: 50 EHLEFKEHFPWWDSWVLGRELSDGEKACFASHYRLWQECVKLNEPI--MILEDDVEFSDE 107 Query: 101 F----SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 F + + LSK + I I + D L L G + + ++ T GY Sbjct: 108 FLNHGEEYIDELSKSEYEYIRILYIF-------DKRLYLLRGGYYLSFEKLAG--TQGYI 158 Query: 157 IGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + A +++ K IY P+D M +++H + +++ +P Sbjct: 159 LKPSGAKKIISKAKFIYMPVDDYMDKFYKHKVLNIIKKP 197 >gi|239831361|ref|ZP_04679690.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] gi|239823628|gb|EEQ95196.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] Length = 194 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 50/191 (26%), Positives = 86/191 (45%), Gaps = 31/191 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK----RQCQF 58 +PVYVI+L S R ++ A + ++ +A+ G ++ S ++ + F Sbjct: 5 VPVYVINLARSRDRWDRLKSNADALSIELRRVEAVEG-------KLLSSEELTDFDEAGF 57 Query: 59 KRL---LSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-- 112 +R +++P EIGCY SHI + IA +P A+I+EDD F+ EF L HL+K Sbjct: 58 RRWHGKIAMPAEIGCYFSHIRALEIIADAPEPFAVIVEDDIVFTPEFKPFLTHLTKAAGW 117 Query: 113 -----INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 +N+ F + ++ + S L G + Y + +E A LL Sbjct: 118 DAVKLVNHRTAAFQSFQRIDSQFSIGRCLHGPLG---------SSAAYVVTREGASKLLK 168 Query: 168 VRKNIYRPIDM 178 + + P D+ Sbjct: 169 ALRPMRLPYDV 179 >gi|229524815|ref|ZP_04414220.1| beta-1,4-galactosyltransferase [Vibrio cholerae bv. albensis VL426] gi|229338396|gb|EEO03413.1| beta-1,4-galactosyltransferase [Vibrio cholerae bv. albensis VL426] Length = 242 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 65/231 (28%), Positives = 99/231 (42%), Gaps = 24/231 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + A L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ E+GCY SH LW++ A ++LEDD ++ F + L + D N Sbjct: 59 SGKMPMRGELGCYASHYLLWQKCVELNA-PIVVLEDDVIINENFPKNLSIIK--DKTN-- 113 Query: 118 IKFDALRKKPKKDSYLCTL-----PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ LR +P+ C L GN+ I T Y I E+A L+ + Sbjct: 114 -EYGFLRLEPEVGK--CRLFSKESKGNYSIAFMDNNWGGTRAYSISPESARKLILGSQKW 170 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPT 219 +D + + H +PS + P V ++ + E R L RKPT Sbjct: 171 SMAVDNYIGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRKPT 221 >gi|296161257|ref|ZP_06844065.1| glycosyl transferase family 25 [Burkholderia sp. Ch1-1] gi|295888415|gb|EFG68225.1| glycosyl transferase family 25 [Burkholderia sp. Ch1-1] Length = 275 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 59/230 (25%), Positives = 98/230 (42%), Gaps = 26/230 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 +PV+VISL S RR+ F DA+ G + + ++ + ++ R Sbjct: 6 VPVHVISLSRS-GRRDAIARFLTDHGAAFRIEDAVDGRSLAQSELDAVYDDAAARRRYGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-----PHLSK-CDIN 114 ++ E+ C++SH +W++I + A++LEDDA F + + P LS DI Sbjct: 65 SMTRAEVACFMSHRSVWRKIVET-GRAAVVLEDDAMLEPAFFEHVLRANEPELSAVADI- 122 Query: 115 NILIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLN 167 +L+ LR+ +Y T G + P + + GY+I AA L Sbjct: 123 -VLLGRSKLRRTASSWTYFNEPLRRVTGVGGLRVGVPFKQWTSGAVGYWISAHAARRALA 181 Query: 168 VRKNIYRPIDMDMKHW-W---EHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + RP+ + W W E + P AV+E D S+IE+ R Sbjct: 182 YSE---RPLGALLDDWPWHRDEGGARVVELRPYAVWEDFDRLPSSIEDER 228 >gi|56698209|ref|YP_168581.1| glycosyl transferase family protein [Ruegeria pomeroyi DSS-3] gi|56679946|gb|AAV96612.1| glycosyl transferase, family 25 [Ruegeria pomeroyi DSS-3] Length = 273 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 40/146 (27%), Positives = 62/146 (42%), Gaps = 7/146 (4%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL-R 124 EIGC++SH W R+ S A A+I+EDD F+ L + I+F Sbjct: 104 EIGCFLSHRAAWARLVESEAEAALIIEDDMALEAGFADALALAQRNVAQMGYIQFQTRPV 163 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 P D G + QP + RT+G + + AA LL + + RP+D ++ W Sbjct: 164 TAPAIDQQ-----GQSVLMQPGLTPLRTSGQLVARWAAERLLALTEPFDRPVDTFLQMHW 218 Query: 185 EHNIPSLVTEPGAVYEAIDT-NDSTI 209 + V P + + + STI Sbjct: 219 HTGLRLGVISPSGLSDRTEVIGGSTI 244 >gi|126740757|ref|ZP_01756442.1| glycosyl transferase, family 25 [Roseobacter sp. SK209-2-6] gi|126718053|gb|EBA14770.1| glycosyl transferase, family 25 [Roseobacter sp. SK209-2-6] Length = 237 Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 38/154 (24%), Positives = 72/154 (46%), Gaps = 4/154 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILIKF 120 LS E+GC++SH W+RI + +I+EDD +++ L +S I+ Sbjct: 61 LSPGEVGCFLSHRRCWQRIVDAGWDYGMIVEDDLSLEPEQWRDTLALISAHADAESFIRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 A R++ ++ + G + P+++ +T +G++AA LL + + RP+D + Sbjct: 121 PAKRREAPRE--VIAEEGQSKLFLPKVIGLQTVAQVVGRKAAKRLLTATEVLDRPVDTFL 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAI-DTNDSTIEESR 213 + W H P V E + STI++ + Sbjct: 179 QMHWIHGQRIQTILPNGVSELTQELGGSTIQKRK 212 >gi|110634463|ref|YP_674671.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110285447|gb|ABG63506.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 258 Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 41/161 (25%), Positives = 72/161 (44%), Gaps = 4/161 (2%) Query: 57 QFKRLLSL--PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 + +RL SL C++SH WKR+ A++LEDD + + + LL H Sbjct: 44 EVQRLGSLLPGATACFLSHRECWKRVVDEALPHAVVLEDDLHLAPDAASLLSHGEWVPTE 103 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 ++K + + + D + G+ +H+ R T GY + ++ A LL + + + Sbjct: 104 ADVVKLETRLCRTRVDKGVAAAIGSRSLHRLRSSHMGTGGYIVTRKGAERLLALSERLEA 163 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 P+D M + H+ SL T + A+ DS I +V Sbjct: 164 PVDHFMFNAGLHSAASLTTF--QMVPAVCVQDSYIGRQSMV 202 >gi|322380695|ref|ZP_08054834.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter suis HS5] gi|321146863|gb|EFX41624.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter suis HS5] Length = 173 Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 48/156 (30%), Positives = 74/156 (47%), Gaps = 17/156 (10%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-- 116 + +SL E+GCY SH LW R + Y+ I +ILEDD + F Q L L + I+ + Sbjct: 6 KYMSLGELGCYASHFCLWYRCLEYNEPI--VILEDDIELEPCFWQSLDFLEE-HIHTLGY 62 Query: 117 --LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 L+ L KKP + + + + G ++ T GY + + A+ + Sbjct: 63 VRLMHLFELVKKPTRFTGVLQIVG-------AVVGNGTQGYCLTPQVAMAFIKASAKWVI 115 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 P+D M + H I +LV EP A+ E N+S IE Sbjct: 116 PVDNLMDCTYLHGISNLVLEPFAIAE--KPNNSNIE 149 >gi|167837452|ref|ZP_02464335.1| glycosyl transferase, family 25 [Burkholderia thailandensis MSMB43] Length = 261 Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 56/224 (25%), Positives = 83/224 (37%), Gaps = 15/224 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I ISL + RR K + A + FFDA + + RQ ++ L Sbjct: 5 ISYVCISLTRAQDRRTKMVEQFANHGINARFFDAFDLKGTVEAIPGYDAAGRQRRYGWQL 64 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+GCY+SH W ++ S ++EDD D F L + +++ Sbjct: 65 SRGEVGCYLSHRAAWLQLVQSGKEAMCVMEDDITLLDGFKAATLELYAARQHWDMVRLMW 124 Query: 123 LRKKPKKDSYLCTLPGNFDI---HQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + ++ S LP + P L Y I + AA +L+ I ID+ Sbjct: 125 INE--RQQSEYARLPSGTRLMWMENPVGLQC----YMITRTAAQRMLDYTAKITHAIDIA 178 Query: 180 MKHWWEHN------IPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 WEH P V + GA D DS RL K Sbjct: 179 FDRNWEHGQRMYVTSPQFVADTGAPTTITDRPDSRTLVQRLKAK 222 >gi|153952416|ref|YP_001397749.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. doylei 269.97] gi|152939862|gb|ABS44603.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. doylei 269.97] Length = 254 Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 64/249 (25%), Positives = 97/249 (38%), Gaps = 57/249 (22%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHMQKQIQKLFEKNPSLKNKLEFVFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + I IILEDD +FSDE Sbjct: 50 EHLEFKDHFPWWGSWVLGRELSDGEKACFASHYKLWQECVKLDEPI--IILEDDVEFSDE 107 Query: 101 FSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTL---------PGNFDIHQPRILSPR 151 F +NN D L K + LC L G + + ++ + Sbjct: 108 F-----------LNNGAEYIDELLKSEYEYIRLCYLFDKRLYFLSEGGYYLSFEKLAGAQ 156 Query: 152 TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 GY + AA+ L KN +D M ++ HN+ ++V +P + S + Sbjct: 157 --GYVLQVSAAMKFLKCAKNWIYAVDDYMDMFYRHNVLNIVKKPLLLKHDCRIESSISQA 214 Query: 212 SRLVRKPTF 220 RL K F Sbjct: 215 GRLFLKAKF 223 >gi|153834850|ref|ZP_01987517.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Vibrio harveyi HY01] gi|148868721|gb|EDL67798.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Vibrio harveyi HY01] Length = 241 Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 52/209 (24%), Positives = 92/209 (44%), Gaps = 13/209 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLL 62 +VISL + RR + A + F FFDAI G P+ ++ ++ +KR + Sbjct: 3 TFVISLESAVERRAHIEAQFAGVEQDFEFFDAIDGRKGEHPLFSK-YNVEKRLRVKGYGM 61 Query: 63 SLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 + E+GC+ SH LW++ + + I +++EDDA + F + ++++ + F Sbjct: 62 TPGELGCFASHYLLWEKCLELNEPI--VVIEDDAQLEECFDDSMKNINELQPYGYVRLFV 119 Query: 122 ALRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 RK+P K SY + D+ + T YF+ AA + + P+D Sbjct: 120 NGRKRPFVKIGSY-----KSHDVVEYLRGPGATRSYFVTPMAAKRFIESAQEWLLPVDDY 174 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 M +W + + PG V D + S Sbjct: 175 MDQFWSNEVACRGIMPGIVKNETDFDSSV 203 >gi|33151689|ref|NP_873042.1| lipooligosaccharide galactosyltransferase II [Haemophilus ducreyi 35000HP] gi|6942295|gb|AAF32397.1|AF224466_4 galactosyltransferase II [Haemophilus ducreyi] gi|33147910|gb|AAP95431.1| lipooligosaccharide galactosyltransferase II [Haemophilus ducreyi 35000HP] Length = 280 Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 59/173 (34%), Positives = 77/173 (44%), Gaps = 22/173 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + ARR+ A+ ++ F FFDA ENN + I H K LS Sbjct: 9 YVISLKTADARRQHIIQEFAKHNIPFQFFDACSIENNLYMD-IEKHLPMLLNSK--LSNS 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN-------ILI 118 E GC +SH LWK+ I EDD SDE ++ + S +NN ILI Sbjct: 66 EKGCLMSHFLLWKKCVLDDIPYMTIFEDDIILSDESNEFISDYSW--VNNRFYEQKEILI 123 Query: 119 KFD-----ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 KF+ +RK K + Y F+ R T Y I KEAA +L+ Sbjct: 124 KFETFLMPVIRKFTKINEY---KERAFNQLISRHFG--TASYLISKEAAKYLI 171 >gi|262192416|ref|ZP_06050568.1| putative glycosyltransferase WavM [Vibrio cholerae CT 5369-93] gi|262031680|gb|EEY50266.1| putative glycosyltransferase WavM [Vibrio cholerae CT 5369-93] Length = 242 Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 64/243 (26%), Positives = 96/243 (39%), Gaps = 32/243 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGA--IILEDDADFSDEFSQLLPHLSKCDINN 115 ++ E+GCY SH +W++ +G +ILEDDA F L + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCV---ELGEPIVILEDDAKILPSFINYLNIIRVKTQEY 115 Query: 116 ILIKFDA-------LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + K+ D + L NF + LSP E+A L+ Sbjct: 116 GFLRLEEAYDRSRLFLKEKTNDFEISFLTNNFGGARAYSLSP---------ESAKKLIKG 166 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLY 224 + P+D M + H +PS V P V + + + E + L RKPT Sbjct: 167 SQRWSMPVDNYMGSLYLHGMPSFVFHPSVVDNPANFDTTFQHEVQRPTPLYRKPTRELYS 226 Query: 225 FYR 227 YR Sbjct: 227 LYR 229 >gi|156972987|ref|YP_001443894.1| glycosyl transferase [Vibrio harveyi ATCC BAA-1116] gi|156524581|gb|ABU69667.1| hypothetical protein VIBHAR_00665 [Vibrio harveyi ATCC BAA-1116] Length = 241 Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 52/209 (24%), Positives = 92/209 (44%), Gaps = 13/209 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLL 62 +VISL + RR + A + F FFDAI G P+ ++ ++ +KR + Sbjct: 3 TFVISLESAVERRAHIEAQFAGVEQDFEFFDAIDGRKGEHPLFSK-YNVEKRLRVKGYGM 61 Query: 63 SLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 + E+GC+ SH LW++ + + I +++EDDA + F + ++++ + F Sbjct: 62 TPGELGCFASHYLLWEKCLELNEPI--VVIEDDAQLEECFDDSMKNINELQPYGYVRLFV 119 Query: 122 ALRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 RK+P K SY + D+ + T YF+ AA + + P+D Sbjct: 120 NGRKRPFVKIGSY-----KSHDVVEYLRGPGATRSYFVTPMAAKKFIESAQEWLLPVDDY 174 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 M +W + + PG V D + S Sbjct: 175 MDQFWSNEVACRGIMPGIVKNETDFDSSV 203 >gi|304312996|ref|YP_003812594.1| Putative galactosyl transferase [gamma proteobacterium HdN1] gi|301798729|emb|CBL46962.1| Putative galactosyl transferase [gamma proteobacterium HdN1] Length = 283 Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 40/153 (26%), Positives = 73/153 (47%), Gaps = 6/153 (3%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIIL-EDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+ E+GCY+SH L + AY + I L EDD LL ++ + L + Sbjct: 84 LTATEVGCYLSHYRLIQE-AYDTGLSHICLFEDDVVAEHGLGDLLREIAALEDTFHLTRL 142 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 +L+ + +K + L + + +P + T GY + + +L+ NIY PID Sbjct: 143 MSLKIRKRKLAR--PLSHGYSVVRPLRGALGTQGYVVNRTGMKEILDFGANIYMPIDKLY 200 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 ++ +N+ EP A+YE + +++E++R Sbjct: 201 DSFFLYNLNCYSVEPHAIYET--SRKTSVEKTR 231 >gi|193078329|gb|ABS90266.2| hypothetical protein A1S_3841 [Acinetobacter baumannii ATCC 17978] Length = 271 Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 58/205 (28%), Positives = 91/205 (44%), Gaps = 22/205 (10%) Query: 3 IPVYVISLPFSHARREKF--CHRAAR-----IHLQFSFFDAIYGENNPICNRIFSHQKRQ 55 I + +SL S+ R+EK H++ + I + F FF+AIYG+ P N + Sbjct: 4 ILILCVSLKGSNERQEKIKQQHQSLQSVIHDIQIDFEFFEAIYGKKLP--NEYLTFLNLS 61 Query: 56 CQFKRL----LSLPEIGCYISHIHLWKRIA---YSPAIGAIILEDDADFSDEFSQLLPHL 108 QF L L E+GC++SH+ +W+R+A Y+ II+EDD F E Q L Sbjct: 62 RQFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQ--QKL 119 Query: 109 SKCDINNILIKFDALRKKPKK---DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 N F +P + Y+ F++ P+ L T Y + +E A Sbjct: 120 HSLLETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDF 179 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPS 190 +N + ID D K+ E +I + Sbjct: 180 VNKQIKKLTYID-DWKYLLEGDIST 203 >gi|187929477|ref|YP_001899964.1| glycosyl transferase family 25 [Ralstonia pickettii 12J] gi|309781748|ref|ZP_07676481.1| LPS glycosyltransferase subfamily [Ralstonia sp. 5_7_47FAA] gi|187726367|gb|ACD27532.1| glycosyl transferase family 25 [Ralstonia pickettii 12J] gi|308919389|gb|EFP65053.1| LPS glycosyltransferase subfamily [Ralstonia sp. 5_7_47FAA] Length = 260 Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 54/235 (22%), Positives = 94/235 (40%), Gaps = 41/235 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 +P + I+L RRE + + L F +YG+ P R + H + Q R Sbjct: 5 VPTFFINLDHDAGRREALERQLDALGLPHQRFPGVYGKTLPADELARHYDHARATSQ-SR 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ E+GC +SH+ +++ + A+ILEDDA + +L Sbjct: 64 ELTVGEVGCALSHLGVYRAMIEQNLPYALILEDDAKLGPDVPAVL--------------- 108 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT---------------GYFIGKEAAIHL 165 DAL + D + TL + D + R P T GYF+ + AA + Sbjct: 109 DALAQSVSPDEPVVTLLTHIDRYYKRSARPLTADHRTVKLANYQWLAHGYFVTRAAAKRM 168 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND----STIEESRLVR 216 + + +Y P+ + +W++ +V V I D S +E R ++ Sbjct: 169 V---EQLY-PVWLAADYWYKFEREGIVQMRAVVPYVIGVQDFDSGSNLEADRAIK 219 >gi|332852212|ref|ZP_08434017.1| LPS glycosyltransferase [Acinetobacter baumannii 6013150] gi|332868180|ref|ZP_08438059.1| LPS glycosyltransferase [Acinetobacter baumannii 6013113] gi|332729342|gb|EGJ60682.1| LPS glycosyltransferase [Acinetobacter baumannii 6013150] gi|332733524|gb|EGJ64692.1| LPS glycosyltransferase [Acinetobacter baumannii 6013113] Length = 271 Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 58/205 (28%), Positives = 91/205 (44%), Gaps = 22/205 (10%) Query: 3 IPVYVISLPFSHARREKF--CHRAAR-----IHLQFSFFDAIYGENNPICNRIFSHQKRQ 55 I + +SL S+ R+EK H++ + I + F FF+AIYG+ P N + Sbjct: 4 ILILCVSLKGSNERQEKIKQQHQSLQSVIHDIQIDFEFFEAIYGKKLP--NEYLTFLNLS 61 Query: 56 CQFKRL----LSLPEIGCYISHIHLWKRIA---YSPAIGAIILEDDADFSDEFSQLLPHL 108 QF L L E+GC++SH+ +W+R+A Y+ II+EDD F E Q L Sbjct: 62 RQFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQ--QKL 119 Query: 109 SKCDINNILIKFDALRKKPKK---DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 N F +P + Y+ F++ P+ L T Y + +E A Sbjct: 120 HSLLETNPAFAFLGGHSEPSRRRIRGYVSKDELYFNMTGPKDLYTATYAYSLTRETAQDF 179 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPS 190 +N + ID D K+ E +I + Sbjct: 180 VNKQIKKLTYID-DWKYLLEGDIST 203 >gi|301018954|ref|ZP_07183177.1| LPS glycosyltransferase [Escherichia coli MS 69-1] gi|3821849|gb|AAC69683.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli] gi|300399448|gb|EFJ82986.1| LPS glycosyltransferase [Escherichia coli MS 69-1] Length = 257 Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 64/244 (26%), Positives = 104/244 (42%), Gaps = 33/244 (13%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 M +P+Y++SL RR K R+++ F FFDAI +N I +++ R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEIIDKM-----RLSG 55 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-----LPHLSKCD 112 ++ EI C +SH ++K + A+ILEDD +++F + LP K Sbjct: 56 VGAEMTDGEIACTLSHQLIYKDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLPEKDKLK 115 Query: 113 INN--ILIKFDALRKKP--------KKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEA 161 NN +L L P K CT NF+ ++ R RT Y + K+ Sbjct: 116 HNNLYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIR----RTCSYLMNKDM 171 Query: 162 AIHLLNVRKN--IYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 A LL + K+ YR + H +HN I + ++ ++ +S +E RL+ Sbjct: 172 AQKLLKLTKDYGTYRADSWKLMH--QHNIIKEFYLDEIILHPILNEFNSHLESERLLTSE 229 Query: 219 TFSP 222 P Sbjct: 230 KKQP 233 >gi|153829559|ref|ZP_01982226.1| putative glycosyltransferase WavM [Vibrio cholerae 623-39] gi|148874955|gb|EDL73090.1| putative glycosyltransferase WavM [Vibrio cholerae 623-39] Length = 242 Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 46/242 (19%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + A L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASYGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL------------ 105 ++ E+GCY SH LW++ A ++LEDD ++ F Q L Sbjct: 59 SGKMPMRGELGCYASHYLLWQKCVELNA-PIVVLEDDVIINENFPQYLSIIKDKTNEYGF 117 Query: 106 ----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEA 161 P + KC + K+ K++ + + N+ T Y I + Sbjct: 118 LRLEPEVGKCRL---------FSKESKENYSIAFIDNNWG---------GTRAYSISPVS 159 Query: 162 AIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRK 217 A L+ + +D + + H +PS + P V ++ + E R L RK Sbjct: 160 ARKLILGSQKWSMAVDNYIGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRK 219 Query: 218 PT 219 PT Sbjct: 220 PT 221 >gi|213157910|ref|YP_002320708.1| LPS glycosyltransferase subfamily [Acinetobacter baumannii AB0057] gi|213057070|gb|ACJ41972.1| LPS glycosyltransferase subfamily [Acinetobacter baumannii AB0057] Length = 261 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 15/174 (8%) Query: 27 IHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL----LSLPEIGCYISHIHLWKRIA- 81 I + F FFDAIYG+ P IF + R QF L L E+GC++SH+ +W+R+A Sbjct: 25 IQIDFQFFDAIYGKKLPNEYLIFLNLSR--QFAGLCDHELGPSELGCWLSHMIIWQRLAQ 82 Query: 82 --YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKK---DSYLCTL 136 Y+ II+EDD F E Q L N F +P + Y+ Sbjct: 83 GDYAAYDRIIIIEDDVIFQTEHIQ--QKLHSLLETNPAFAFLGGHSEPSRRRIRGYVSND 140 Query: 137 PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPS 190 F++ P+ L T Y + +E A +N + ID D K+ E +I + Sbjct: 141 ELYFNMTGPKDLYTATYAYSLTRETAQDFVNKQIKKLTYID-DWKYLLEGDIST 193 >gi|18653286|gb|AAL77358.1|AF444794_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 242 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 63/231 (27%), Positives = 98/231 (42%), Gaps = 24/231 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ E+GCY SH LW++ A ++LEDD ++ F Q L + D N Sbjct: 59 SGKMPMRGELGCYASHYLLWQKCVELNA-PIVVLEDDVIINENFPQYLSIMK--DKTN-- 113 Query: 118 IKFDALRKKPKKDSYLCTL-----PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ LR +P+ C+L N+ I T Y I ++A L+ + Sbjct: 114 -EYGFLRLEPEVGK--CSLFSKESKENYSIAFMDNNWGGTRAYSISPDSARKLILGSQKW 170 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPT 219 +D + + H +PS + P V ++ + E R L RKPT Sbjct: 171 SMAVDNYIGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRKPT 221 >gi|315452598|ref|YP_004072868.1| Beta-1,4-galactosyltransferase [Helicobacter felis ATCC 49179] gi|315131650|emb|CBY82278.1| Beta-1,4-galactosyltransferase [Helicobacter felis ATCC 49179] Length = 263 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 15/170 (8%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 +SL E+GC+ SH LW+R +A I ILEDD F + L + I + + + Sbjct: 94 MSLGELGCFASHYALWQRCVALQEPI--CILEDDIALEPHFVENLDY-----IESYIPRL 146 Query: 121 DALRKKPKKDSYLCTLP--GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +R D L T P G F+I +P T GY I +AA L + P+D Sbjct: 147 HWVRLMHLFDYPLETTPILGVFEI-KPFTWGSGTQGYIITPKAASKFLKASQKWVMPVDC 205 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRN 228 M++ + H + + V +P + E ++ I R V P +PL F++ Sbjct: 206 VMENTYLHGVKNYVIKPFVIRE--NSTTGNITRDRNVTCP--APLRFFKK 251 >gi|191167804|ref|ZP_03029610.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli B7A] gi|309797494|ref|ZP_07691885.1| LPS glycosyltransferase [Escherichia coli MS 145-7] gi|190902147|gb|EDV61890.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli B7A] gi|308118930|gb|EFO56192.1| LPS glycosyltransferase [Escherichia coli MS 145-7] Length = 257 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 59/252 (23%), Positives = 100/252 (39%), Gaps = 49/252 (19%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 M +P+Y++SL RR K R+++ F FFDAI +N I +++ R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEIIDKM-----RLSG 55 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ EI C +SH ++K + A+ILEDD +++F + L + Sbjct: 56 VGAEMTDGEIACTLSHQLIYKDMIDKNIEWAVILEDDVIVNEKFKKFLQY---------- 105 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------------------------RTT 153 F+ K K + L L G +H +L RT Sbjct: 106 --FNLSEKDKLKHNNLYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTC 163 Query: 154 GYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAIDTNDSTIE 210 Y + K+ A LL + K+ YR + H +HN I + ++ ++ +S +E Sbjct: 164 SYLMNKDMAQKLLKLTKDYGTYRADSWKLMH--QHNIIKEFYLDEIILHPILNEFNSHLE 221 Query: 211 ESRLVRKPTFSP 222 RL+ P Sbjct: 222 SERLLTSEKKQP 233 >gi|225025752|ref|ZP_03714944.1| hypothetical protein EIKCOROL_02656 [Eikenella corrodens ATCC 23834] gi|224941533|gb|EEG22742.1| hypothetical protein EIKCOROL_02656 [Eikenella corrodens ATCC 23834] Length = 264 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 61/242 (25%), Positives = 98/242 (40%), Gaps = 42/242 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI------YGENNPICNRIFSHQKRQC 56 IP Y+ISL RRE AR+ ++ +F DA+ + + +R+ H+K + Sbjct: 2 IPAYIISLANQQDRREHMRQECARVGIEATFIDAVDMRQASQSDIECLSSRLL-HKKTKK 60 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 Q R L+ E+GC +SH ++ I A++LEDDA+F P L++ + + Sbjct: 61 Q--RWLTRGELGCALSHHQVYAHIVRQQHPYALVLEDDAEF---IRNPQPLLNEGYLKAL 115 Query: 117 LIK--FDALRKKPKKDSYLCTLPGNFDIHQPRI-----------------------LSPR 151 + FD L Y+ TLP + RI Sbjct: 116 SAQYPFDILIL-----GYVKTLPHQLPYYYRRIPIKHRAKMNTDGNTIYFGTPWEQYGCG 170 Query: 152 TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 T Y I +E A L ++ D + H + L + P V EA++ DSTI + Sbjct: 171 TVAYIISREGAEKLCRATQSPCATADDWLYFEQHHGLRILHSRPAFVLEALEKFDSTIRQ 230 Query: 212 SR 213 + Sbjct: 231 EK 232 >gi|168188181|gb|ACA14476.1| WahB [Aeromonas hydrophila] Length = 221 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 53/205 (25%), Positives = 82/205 (40%), Gaps = 19/205 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IPV+VISL S RR + A + + F F DA G+ + + K C + Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGINFEFVDATDGKALSSAELAKVDLELAKEVCGHE 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS EIGC +SHI +++RI + LEDD ++ + + + I+ Sbjct: 62 --LSFGEIGCAMSHIRVYERIVAEGIDRCVALEDDIYLHMHSKAIIETIVRSCHSEIVFL 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSP---------RTTGYFIGKEAAIHLLNVRK 170 K K+ L +LPG + + R L+P T GY + A LL Sbjct: 120 HHG---KAKRWPILSSLPGGYRL--ARYLAPSRTSRRGILSTAGYVLTLAGAKKLLQCAY 174 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEP 195 + P D + + + EP Sbjct: 175 PLRMPSDYLTGRLQLNGLAASGVEP 199 >gi|163739820|ref|ZP_02147227.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis BS107] gi|161386854|gb|EDQ11216.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis BS107] Length = 237 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 69/155 (44%), Gaps = 3/155 (1%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD-ADFSDEFSQLLPHLSKCDINNILIKF 120 LS E+GC++SH W+ IA +I+EDD A + +L + + +I+ Sbjct: 61 LSGGEVGCFLSHRKCWQLIANGIDDYGLIVEDDMATDPGIWRDVLALIDSHAGPDSMIRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 A K+ + S + G + PR + +T +GK AA+ LL K + RP+D + Sbjct: 121 PA--KQRETASTVIAAHGAAQLFLPRRIGLQTVAQVVGKTAAVRLLTATKVLDRPVDTFL 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 + W H+ P V E + +SR V Sbjct: 179 QMHWVHDQTIHTVLPNGVSEETAALGGSTIQSRPV 213 >gi|262374910|ref|ZP_06068144.1| glycosyl transferase, family 25 [Acinetobacter lwoffii SH145] gi|262309923|gb|EEY91052.1| glycosyl transferase, family 25 [Acinetobacter lwoffii SH145] Length = 253 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 61/249 (24%), Positives = 100/249 (40%), Gaps = 13/249 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFK 59 M + Y+I+L S R + + +F F A G + I ++ + Sbjct: 1 MKVVTYLINLDGSDERLAAATQQLNSVSWRFERFAAYDGRGKALSEFINYNDDQTNKVLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----ADFSDEFSQLLPHLSK-CDIN 114 R L E+GCY+SH ++ + A ++LEDD +DF + ++ +L + D+N Sbjct: 61 RSLMNSELGCYLSHYGCAEKFLATDADYLVVLEDDLKINSDFKIKIDSIIEYLHQHQDLN 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI A +KK KD T G+ + R G ++ A + V K I Sbjct: 121 WYLINIAAKKKKLAKD---ITQIGDISLWHAFYFPIRGLGLIWSRQGAQSFVEVGKTINM 177 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND--STIEESRLVRKP--TFSPLYFYRNTC 230 P+D+ + W N L P V A +D T+ + RK S Y ++ Sbjct: 178 PVDIFFQSWLSQNGKGLGIWPPLVKPAGVDSDILGTVATQGISRKALENRSSSYGFKKQK 237 Query: 231 YQWNLHYNA 239 W Y+A Sbjct: 238 RMWRDRYHA 246 >gi|32265822|ref|NP_859854.1| hypothetical protein HH0323 [Helicobacter hepaticus ATCC 51449] gi|32261871|gb|AAP76920.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449] Length = 239 Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 50/185 (27%), Positives = 82/185 (44%), Gaps = 18/185 (9%) Query: 17 REKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQFK---RLLSLPEIGCYI 71 +++F + + QF FFDAI G+ + FS + + +S E C+ Sbjct: 2 QQQFAQMSEQQRAQFEIIFFDAIDGQKGE--HLAFSQYSKGASIAFRGKEMSDGERACFA 59 Query: 72 SHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKD 130 SH LW++ + + I I+LEDD + F L +SK + + ++ L K K Sbjct: 60 SHYSLWQKCVDLNEPI--IVLEDDVMILEHFWTELTRISKSEY--VYVRLTFLEDKVKA- 114 Query: 131 SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPS 190 LP F I R+ T GY++ A + K YRP+D M ++ H+IP Sbjct: 115 ---FVLPNEFYITFSRVTG--TQGYYLTPIGAQGFIASAKTWYRPVDDYMDMFYIHHIPI 169 Query: 191 LVTEP 195 + +P Sbjct: 170 ICIKP 174 >gi|237654429|ref|YP_002890743.1| glycosyl transferase family 25 [Thauera sp. MZ1T] gi|237625676|gb|ACR02366.1| glycosyl transferase family 25 [Thauera sp. MZ1T] Length = 265 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/113 (25%), Positives = 57/113 (50%), Gaps = 2/113 (1%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M I +++I+LP + RR++ A + L + +A+ G + R + + ++ Sbjct: 1 MSIGIFIINLPEAVERRQRVSGHLAALGLDATVIEAVRGSTLSVAERASVADDPRSVGRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC 111 R+L+ E+GC +SH+ ++++ S +ILEDDA + + LL C Sbjct: 61 GRVLTPGELGCAMSHVRAYEQLLCSGHQFGLILEDDAVLLPDVANLLVSAENC 113 >gi|113460783|ref|YP_718850.1| Lob1 protein [Haemophilus somnus 129PT] gi|112822826|gb|ABI24915.1| Lob1 protein [Haemophilus somnus 129PT] Length = 297 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 47/205 (22%), Positives = 88/205 (42%), Gaps = 14/205 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-------FSFFDAIYGE---NNPICNRIFSHQKR 54 ++VI+L + R+ H+ + Q + FF + G N+P+ + ++ +KR Sbjct: 52 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAK-YNQKKR 110 Query: 55 QCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + ++L ++GCY SH LW++ + I I+LEDDA F + + Sbjct: 111 YYRKGNEITLGQLGCYASHYLLWEKCVQLQQPI--IVLEDDAILQPNFLAVYQFCFSAEN 168 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 K + TLP + + Q TTGY++ +AA L+ + Sbjct: 169 QFEFFWLTHSNSSKIKTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDFSQEWI 228 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAV 198 +D+ M ++E+++ L P V Sbjct: 229 YNVDIFMDRFYENHVALLGVNPPCV 253 >gi|317050484|ref|YP_004111600.1| glycosyl transferase family 25 [Desulfurispirillum indicum S5] gi|316945568|gb|ADU65044.1| glycosyl transferase family 25 [Desulfurispirillum indicum S5] Length = 248 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 56/205 (27%), Positives = 83/205 (40%), Gaps = 15/205 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKR 60 +PV+VISL S RR+ + F F DAI G + +P+ +R F K + R Sbjct: 5 LPVFVISLVSSEKRRKSSTELLLSQGISFEFIDAIDGRKDRHPLLDR-FRPDKFLVRHGR 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + E GCY SH W++ I+ EDD D + L I+ Sbjct: 64 PSAPGEAGCYASHFLAWQKCV-ELNCPIIVFEDDFAVRDHIYDIFSFLPDLMSYYPFIRL 122 Query: 121 D----ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + L KK K+ G++ + + + R T Y I AA + K P+ Sbjct: 123 EDNDPVLHKKIKQF-------GDYTLVRFLRIPQRATCYAISPFAAAAFIKASKEFVYPV 175 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEA 201 D+ ++H H IP P VY A Sbjct: 176 DVFVRHQNIHKIPIYGLLPYPVYPA 200 >gi|28896984|ref|NP_796589.1| putative Lex2B [Vibrio parahaemolyticus RIMD 2210633] gi|153839479|ref|ZP_01992146.1| putative Lex2B [Vibrio parahaemolyticus AQ3810] gi|260362359|ref|ZP_05775317.1| glycosyltransferase family 25 [Vibrio parahaemolyticus K5030] gi|260897637|ref|ZP_05906133.1| glycosyltransferase family 25 [Vibrio parahaemolyticus Peru-466] gi|260899568|ref|ZP_05907963.1| glycosyltransferase family 25 [Vibrio parahaemolyticus AQ4037] gi|28805192|dbj|BAC58473.1| putative Lex2B [Vibrio parahaemolyticus RIMD 2210633] gi|149747008|gb|EDM57996.1| putative Lex2B [Vibrio parahaemolyticus AQ3810] gi|308087546|gb|EFO37241.1| glycosyltransferase family 25 [Vibrio parahaemolyticus Peru-466] gi|308108818|gb|EFO46358.1| glycosyltransferase family 25 [Vibrio parahaemolyticus AQ4037] gi|308115137|gb|EFO52677.1| glycosyltransferase family 25 [Vibrio parahaemolyticus K5030] Length = 239 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 83/184 (45%), Gaps = 7/184 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + +YVI+L S RRE+ R + F FFDA + +P ++ +K + Sbjct: 1 MKIYVINLKESVERREQV--RKVLSSVDFEFFDAENIKKDPEHFIYTLYDEKKTRKYKGY 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++PE+GC+ SHI LWK+ S +ILED+ + E + L ++ L+K Sbjct: 59 TLTIPELGCWASHISLWKKCV-SDHTPFLILEDNIELFGELVEQLKNIENLTKKYGLVKL 117 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + +K + + + + + T+ Y I + A L+ + P+D M Sbjct: 118 GNIFE--RKHIEIVKIDEKYRLVSNLKGACGTSAYAITPKVAAAYLSQINGFFEPVDDFM 175 Query: 181 KHWW 184 + W Sbjct: 176 DNEW 179 >gi|332345593|gb|AEE58927.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 257 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 58/252 (23%), Positives = 101/252 (40%), Gaps = 49/252 (19%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 M +P+Y++SL RR K R+++ F FFDAI +N I +++ R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEIIDKM-----RLSG 55 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ EI C +SH +++ + A+ILEDD +++F + L + Sbjct: 56 VGAEMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQY---------- 105 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------------------------RTT 153 F+ K K + L L G +H +L RT Sbjct: 106 --FNLSEKDKLKHNNLYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTC 163 Query: 154 GYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAIDTNDSTIE 210 Y + K+ A +LL + K+ YR + H +HN I + ++ ++ +S +E Sbjct: 164 SYLMNKDMAQNLLKLTKDYGTYRADSWKLMH--QHNIIKEFYLDEIILHPILNEFNSHLE 221 Query: 211 ESRLVRKPTFSP 222 RL+ P Sbjct: 222 SERLLTSEKKQP 233 >gi|331685288|ref|ZP_08385874.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli H299] gi|331077659|gb|EGI48871.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli H299] Length = 257 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 58/252 (23%), Positives = 101/252 (40%), Gaps = 49/252 (19%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 M +P+Y++SL RR K R+++ F FFDAI +N I +++ R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFYRLNINFDFFDAIDAKDPQNKEIIDKM-----RLSG 55 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ EI C +SH +++ + A+ILEDD +++F + L + Sbjct: 56 VGAEMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQY---------- 105 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------------------------RTT 153 F+ K K + L L G +H +L RT Sbjct: 106 --FNLSEKDKLKHNNLYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTC 163 Query: 154 GYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAIDTNDSTIE 210 Y + K+ A +LL + K+ YR + H +HN I + ++ ++ +S +E Sbjct: 164 SYLMNKDMAQNLLKLTKDYGTYRADSWKLMH--QHNIIKEFYLDEIILHPILNEFNSHLE 221 Query: 211 ESRLVRKPTFSP 222 RL+ P Sbjct: 222 SERLLTSGKKQP 233 >gi|152978207|ref|YP_001343836.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150839930|gb|ABR73901.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 265 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 55/213 (25%), Positives = 90/213 (42%), Gaps = 24/213 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI-------HLQFSFFDAIYGE---NNPICNRIFSHQ 52 I + +I+L S RR+ + + HL + FF + G N+P+ + ++ + Sbjct: 17 INILIINLKKSTQRRQFMQKQFDELQKCFPDFHLNYQFFTGVNGNEEANHPLFKK-YNSK 75 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 KR + + +SL ++GC+ SH L ++ IILEDDA D F +L + Sbjct: 76 KRLARKGQDMSLSQLGCFASHYLLLEKCVQLNE-PVIILEDDAILLDGFYDVLKYAP--- 131 Query: 113 INNILIKFDALRKKPK------KDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHL 165 + F+ R + K + L +P N I Q TTGYF+ AA Sbjct: 132 --TVANYFEFFRLSNRSGGNNVKSAPLFQIPNTNLIISQVYKGWANTTGYFVTPRAAKKF 189 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 L K +D+ M +WE+ + P V Sbjct: 190 LTQMKEWIYNVDITMDRYWENKVHFCALLPNVV 222 >gi|241663595|ref|YP_002981955.1| glycosyl transferase family 25 [Ralstonia pickettii 12D] gi|240865622|gb|ACS63283.1| glycosyl transferase family 25 [Ralstonia pickettii 12D] Length = 260 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 53/235 (22%), Positives = 94/235 (40%), Gaps = 41/235 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 +P + I+L RRE + + L F +YG+ P R + H + Q R Sbjct: 5 VPTFFINLDHDAGRREALERQLDALGLPHRRFPGVYGKTLPADELARHYDHARATGQ-SR 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ E+GC +SH+ +++ + A+ILEDDA + +L Sbjct: 64 ELTVGEVGCALSHLGVYRAMIEQDLPYALILEDDAKLGPDVPAVL--------------- 108 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT---------------GYFIGKEAAIHL 165 DAL + D + TL + D + R P T GYF+ + AA + Sbjct: 109 DALARSVSPDEPVVTLLTHIDRYYKRSARPLTANHRTVKLANYQWLAHGYFVTRAAAKRM 168 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND----STIEESRLVR 216 + + +Y P+ + +W++ +V V I + S +E R ++ Sbjct: 169 V---EQLY-PVWLAADYWYKFEREGIVQMRAVVPYVIGVQNFDSGSNLEADRAIK 219 >gi|57238023|ref|YP_179272.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni RM1221] gi|12802847|gb|AAK08089.1|AF334378_2 unknown [Campylobacter jejuni] gi|57166827|gb|AAW35606.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni RM1221] gi|315058580|gb|ADT72909.1| Beta-1,4-galactosyltransferase [Campylobacter jejuni subsp. jejuni S3] Length = 254 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 64/242 (26%), Positives = 102/242 (42%), Gaps = 43/242 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHMKKQIQKLFEKNPSLKNKLEFIFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + I IILEDD +FSDE Sbjct: 50 EHLEFKDHFPWWGSWVLGRELSDGEKACFASHYKLWQECVKLDEPI--IILEDDVEFSDE 107 Query: 101 FSQLLPHLSKCDINNILI-KFDALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIG 158 F + + I+ +L K++ +R D L L G + + ++ T GY + Sbjct: 108 F----LIMGQEYIDELLKSKYEYIRLCYLFDKRLYFLSEGGYYLSFEKLAG--TQGYVLQ 161 Query: 159 KEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 AA+ L KN +D M +++HN+ ++V +P + S + RL K Sbjct: 162 VSAAMKFLKYAKNWIYAVDDYMDMFYKHNVLNIVKKPLLLKHDCRIESSISQARRLFLKA 221 Query: 219 TF 220 F Sbjct: 222 KF 223 >gi|18653266|gb|AAL77342.1|AF443846_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 207 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 56/210 (26%), Positives = 85/210 (40%), Gaps = 28/210 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQ-CQ 57 + +YVISL S RR + L+F FFDAI G +P N + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFAN--YDYTKRLWLT 58 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGA--IILEDDADFSDEFSQLLPHLSKCDINN 115 ++ E+GCY SH +W++ +G +ILEDDA F L + Sbjct: 59 SGKMPMRGELGCYASHYLMWQKCV---ELGEPIVILEDDAKILPSFINYLNIIRVKTQEY 115 Query: 116 ILIKFDA-------LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + K+ D + L NF + LSP E+A L+ Sbjct: 116 GFLRLEEAYDRSRLFLKEKTNDFEISFLTNNFGGARAYSLSP---------ESAKKLIKG 166 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 + P+D M + H++PS V P V Sbjct: 167 SQRWSMPVDNYMGSLYLHDMPSFVFYPSVV 196 >gi|82779115|ref|YP_405464.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae Sd197] gi|293417086|ref|ZP_06659713.1| beta1,4-galactosyltransferase WaaX [Escherichia coli B185] gi|309784398|ref|ZP_07679037.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae 1617] gi|81243263|gb|ABB63973.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae Sd197] gi|291431117|gb|EFF04110.1| beta1,4-galactosyltransferase WaaX [Escherichia coli B185] gi|308927905|gb|EFP73373.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae 1617] Length = 257 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 57/251 (22%), Positives = 98/251 (39%), Gaps = 47/251 (18%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 M +P+Y++SL RR K R+++ F FFDAI +N I +++ R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFQRLNINFDFFDAIDAKDPQNKEIIDKM-----RLSG 55 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 ++ EI C +SH +++ + A+ILEDD +++F + L + Sbjct: 56 VGAEMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQY---------- 105 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------------------------RTT 153 F+ K K + L L G +H +L RT Sbjct: 106 --FNLSEKDKLKHNNLYLLGGQKGLHDYPVLGQSLFSKVKISTCTFRRVNFNKNKIRRTC 163 Query: 154 GYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 Y + K+ A LL + K+ YR + H H I + ++ ++ +S +E Sbjct: 164 SYLMNKDMAQKLLKLTKDYGTYRADSWKLMH-QHHIIKEFYLDEIILHPILNEFNSHLES 222 Query: 212 SRLVRKPTFSP 222 RL+ P Sbjct: 223 ERLLTSEKKQP 233 >gi|121613507|ref|YP_001000825.1| LPS glycosyltransferase subfamily protein [Campylobacter jejuni subsp. jejuni 81-176] gi|167005737|ref|ZP_02271495.1| LPS glycosyltransferase subfamily protein [Campylobacter jejuni subsp. jejuni 81-176] gi|12837556|gb|AAK08966.1|AF334961_2 unknown [Campylobacter jejuni] gi|40218030|gb|AAR82939.1| unknown [Campylobacter jejuni subsp. jejuni 81-176] gi|87249290|gb|EAQ72251.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 81-176] Length = 251 Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 65/248 (26%), Positives = 104/248 (41%), Gaps = 52/248 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + I IILEDD +FSDE Sbjct: 50 EHLEFKDHFPWWGSWVLGRELSDGEKACFASHYKLWQECVKLDEPI--IILEDDVEFSDE 107 Query: 101 F----SQLLPHLSKCDINNILIKFDALRKKPK-KDSYLCTLPGNFDIHQPRILSPRTTGY 155 F ++ + L K I + + K K D++L TL T GY Sbjct: 108 FLNNGAEYIDELLKSKYEYIRLCYLTQGKMLKLNDNFLFTLDS----------IGGTQGY 157 Query: 156 FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 I +A+ ++ K +P+D M ++ + ++ P + + D N STIEE + Sbjct: 158 LIKPISALKFIDNLKFWIKPVDNVMDMYYYSKVFTITYTP-LLLKTTDIN-STIEERKKD 215 Query: 216 RKPTFSPL 223 ++P F + Sbjct: 216 KRPIFKRI 223 >gi|62290485|ref|YP_222278.1| glycosyl transferase family protein [Brucella abortus bv. 1 str. 9-941] gi|82700408|ref|YP_414982.1| glycosyl transferase family protein [Brucella melitensis biovar Abortus 2308] gi|189024710|ref|YP_001935478.1| glycosyl transferase, family 25 [Brucella abortus S19] gi|225853077|ref|YP_002733310.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis ATCC 23457] gi|254689784|ref|ZP_05153038.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 6 str. 870] gi|254694273|ref|ZP_05156101.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 3 str. Tulya] gi|254697928|ref|ZP_05159756.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 2 str. 86/8/59] gi|254730818|ref|ZP_05189396.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 4 str. 292] gi|256045220|ref|ZP_05448118.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis bv. 1 str. Rev.1] gi|256114172|ref|ZP_05454926.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis bv. 3 str. Ether] gi|256258037|ref|ZP_05463573.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 9 str. C68] gi|256263440|ref|ZP_05465972.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|260547014|ref|ZP_05822753.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260565183|ref|ZP_05835667.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|261214582|ref|ZP_05928863.1| glycosyl transferase [Brucella abortus bv. 3 str. Tulya] gi|265995491|ref|ZP_06108048.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|297248867|ref|ZP_06932585.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|62196617|gb|AAX74917.1| glycosyl transferase, family 25 [Brucella abortus bv. 1 str. 9-941] gi|82616509|emb|CAJ11576.1| Glycosyl transferase, family 25 [Brucella melitensis biovar Abortus 2308] gi|189020282|gb|ACD73004.1| Glycosyl transferase, family 25 [Brucella abortus S19] gi|225641442|gb|ACO01356.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis ATCC 23457] gi|260096064|gb|EEW79941.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260151251|gb|EEW86345.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260916189|gb|EEX83050.1| glycosyl transferase [Brucella abortus bv. 3 str. Tulya] gi|262766604|gb|EEZ12393.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|263093448|gb|EEZ17498.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|297176036|gb|EFH35383.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|326409620|gb|ADZ66685.1| Glycosyl transferase, family 25 [Brucella melitensis M28] gi|326539323|gb|ADZ87538.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis M5-90] Length = 260 Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKVSAPVD 173 >gi|17986701|ref|NP_539335.1| lipooligosaccharide biosynthesis protein lic2B [Brucella melitensis bv. 1 str. 16M] gi|237815992|ref|ZP_04594989.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus str. 2308 A] gi|260755315|ref|ZP_05867663.1| glycosyl transferase [Brucella abortus bv. 6 str. 870] gi|260758538|ref|ZP_05870886.1| glycosyl transferase [Brucella abortus bv. 4 str. 292] gi|260762361|ref|ZP_05874704.1| glycosyl transferase [Brucella abortus bv. 2 str. 86/8/59] gi|260884334|ref|ZP_05895948.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|265991653|ref|ZP_06104210.1| glycosyl transferase [Brucella melitensis bv. 1 str. Rev.1] gi|17982323|gb|AAL51599.1| lipooligosaccharide biosynthesis protein lic2b [Brucella melitensis bv. 1 str. 16M] gi|237789290|gb|EEP63501.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus str. 2308 A] gi|260668856|gb|EEX55796.1| glycosyl transferase [Brucella abortus bv. 4 str. 292] gi|260672793|gb|EEX59614.1| glycosyl transferase [Brucella abortus bv. 2 str. 86/8/59] gi|260675423|gb|EEX62244.1| glycosyl transferase [Brucella abortus bv. 6 str. 870] gi|260873862|gb|EEX80931.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|263002437|gb|EEZ15012.1| glycosyl transferase [Brucella melitensis bv. 1 str. Rev.1] Length = 268 Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 12 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 69 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 70 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 129 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + + P+D Sbjct: 130 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKVSAPVD 181 >gi|226328064|ref|ZP_03803582.1| hypothetical protein PROPEN_01955 [Proteus penneri ATCC 35198] gi|225203768|gb|EEG86122.1| hypothetical protein PROPEN_01955 [Proteus penneri ATCC 35198] Length = 245 Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 63/227 (27%), Positives = 102/227 (44%), Gaps = 31/227 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-RQCQFKR- 60 + +VI+L +R+ ++ I + + F +AI G RI S +Q + Sbjct: 1 MNTFVINLKDDLLKRDFMRNQLDNIGISYQFIEAIDG-------RIMSDNTIKQLAYDYP 53 Query: 61 --LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQL--LPHLSKCDINN 115 L+ EIGC +SH+ ++K++ A+ILEDDA S+ +S+L + ++ K N Sbjct: 54 DCYLTKGEIGCTLSHMAIYKKMIEDNIEIALILEDDALLPSNIYSKLIDIKNIDKSSKPN 113 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + I K +SY+ N +I+ +S T GY I K AA LL+ P Sbjct: 114 VYI-------LTKTESYIKNKQLNNNIYYAYSVSG-TYGYVINKAAAKSLLSK----LTP 161 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTND-----STIEESRLVRK 217 I + W L+ + ID ND S+I + RL+RK Sbjct: 162 IKYEADMWSTFKRQGLINLYCHIPHLIDNNDKDSSQSSIHQERLIRK 208 >gi|147899177|ref|NP_001088623.1| procollagen galactosyltransferase 1-B precursor [Xenopus laevis] gi|82179978|sp|Q5U483|G251B_XENLA RecName: Full=Procollagen galactosyltransferase 1-B; AltName: Full=Glycosyltransferase 25 family member 1-B; AltName: Full=Hydroxylysine galactosyltransferase 1-B; Flags: Precursor gi|55153756|gb|AAH85226.1| Glt25d1a protein [Xenopus laevis] Length = 611 Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 49/174 (28%), Positives = 76/174 (43%), Gaps = 14/174 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 V++I+L RRE+ + + F DA+YG+ N K +K Sbjct: 332 VFLINLKHRQDRRERMKRTLYELQIDFKLVDAVYGKMLNQSNVTEMGIKMLPGYKDPYHG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK I+ + +LEDD F F + L L D+ + Sbjct: 392 RPLTRGEMGCFLSHYNIWKEISERNLEVSAVLEDDLRFEIFFKRRLQTLLH-DLEIAKLD 450 Query: 120 FDAL---RKKPKKDSYLCTLPG--NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +D + RK+ + D +PG N + S T GY I A LLN Sbjct: 451 WDLIYLGRKRMQVDEPEEPVPGVRNLVVSD---YSYWTLGYLISLRGARKLLNA 501 >gi|306844604|ref|ZP_07477191.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. BO1] gi|306275048|gb|EFM56811.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. BO1] Length = 261 Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 51/219 (23%), Positives = 95/219 (43%), Gaps = 6/219 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + A++ F A+ G E P + + KR + L+ Sbjct: 4 YLINLDRSSDRLEHMTSQFAKLGADFVRIPAVDGREMPPEALKAVTAAKRP--WAAPLTP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 EIGC++SH + IA + A+++EDD F+D+ +LL H + ++K + Sbjct: 62 TEIGCFLSHRRCLEAIARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIETQG 121 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID---MDMK 181 KK + + + + + GY + ++AA ++ + + PID + Sbjct: 122 KKVLIGKPIACASTRYSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLFNAP 181 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 + + + P +A T+ E S+L +KP F Sbjct: 182 YGVFNQLSVYQCTPALCKQAGLTSTLQSERSQLYQKPPF 220 >gi|322379338|ref|ZP_08053714.1| Beta-1,4-galactosyltransferase [Helicobacter suis HS1] gi|321148251|gb|EFX42775.1| Beta-1,4-galactosyltransferase [Helicobacter suis HS1] Length = 269 Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 48/157 (30%), Positives = 70/157 (44%), Gaps = 17/157 (10%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW R + Y+ I +ILEDD + F Q L L + + Sbjct: 100 KYMSLGELGCYASHFCLWYRCLEYNEPI--VILEDDIELEPCFWQSLDFLEEHIYTLGYV 157 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + L KKP + F I P + T GY + + A+ + Sbjct: 158 RLMHLDLFELVKKPTR------FTAVFQIVGP-VAGNGTQGYCLTPQVAMAFIKASAKWV 210 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 P+D M + H I +LV EP A+ E N+S IE Sbjct: 211 IPVDNLMDCTYLHGISNLVLEPFAIAE--KPNNSNIE 245 >gi|163744085|ref|ZP_02151451.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis 2.10] gi|161382632|gb|EDQ07035.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis 2.10] Length = 237 Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 3/155 (1%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD-ADFSDEFSQLLPHLSKCDINNILIKF 120 LS E+GC++SH W+ IA +I+EDD A + +L + + +I+ Sbjct: 61 LSGGEVGCFLSHRKCWQLIANGIDDYGLIVEDDMATDPGIWRDVLALIDSHAGPDSMIRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 A K+ + S + G + PR + +T +GK AA LL K + RP+D + Sbjct: 121 PA--KQRETASTVIAAHGAAQLFLPRRIGLQTVAQVVGKTAAARLLTATKVLDRPVDTFL 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 + W H+ P V E + +SR V Sbjct: 179 QMHWVHDQTIHTVLPNGVSEETAALGGSTIQSRPV 213 >gi|2078280|gb|AAC62824.1| Lob1 [Histophilus somni] Length = 287 Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 88/205 (42%), Gaps = 14/205 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-------FSFFDAIYGE---NNPICNRIFSHQKR 54 ++VI+L + R+ H+ + Q + FF + G N+P+ + ++ +KR Sbjct: 42 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAK-YNQKKR 100 Query: 55 QCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + ++L ++GCY SH LW++ + I I+LEDDA F + + Sbjct: 101 YQRKGNEITLGQLGCYASHYLLWEKCVQLQQPI--IVLEDDAILQPNFLAVYQFCFSAEN 158 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + TLP + + Q TTGY++ +AA L+ + Sbjct: 159 QFQFFWLTHSNSSKIRTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDSSQEWI 218 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAV 198 +D+ M ++E+++ L P V Sbjct: 219 YNVDIFMDRFYENHVALLGVNPPCV 243 >gi|57168030|ref|ZP_00367169.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter coli RM2228] gi|57020404|gb|EAL57073.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter coli RM2228] Length = 253 Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 58/219 (26%), Positives = 93/219 (42%), Gaps = 48/219 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + ++I+L S R+E + ++ L+F FF A+ +N K Sbjct: 1 MKAFIINLERSLDRKEYMQRQNQKLFEKNPSLKNKLEFIFFKAVDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ I I IILEDD +FSDE Sbjct: 50 EYLEFKQHFPWWASWVLGRELSDGEKACFASHYKLWQECIKLDKPI--IILEDDVEFSDE 107 Query: 101 F----SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 F + + LSK + ++F L D L N+ + ++ T GY Sbjct: 108 FLNNGEEYIEELSKSEYE--YVRFCYLF-----DKKFYPLNENYILSFEKLAG--TQGYV 158 Query: 157 IGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + AA ++ K IY P+D M +++H + ++V +P Sbjct: 159 LKPSAANKFISKAKFIYTPVDDYMDMFYKHKVLNIVKKP 197 >gi|170719164|ref|YP_001784309.1| glycosyl transferase [Haemophilus somnus 2336] gi|168827293|gb|ACA32664.1| glycosyl transferase family 25 [Haemophilus somnus 2336] Length = 293 Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 88/205 (42%), Gaps = 14/205 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-------FSFFDAIYGE---NNPICNRIFSHQKR 54 ++VI+L + R+ H+ + Q + FF + G N+P+ + ++ +KR Sbjct: 48 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAK-YNQKKR 106 Query: 55 QCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + ++L ++GCY SH LW++ + I I+LEDDA F + + Sbjct: 107 YQRKGNEITLGQLGCYASHYLLWEKCVQLQQPI--IVLEDDAILQPNFLAVYQFCFSAEN 164 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + TLP + + Q TTGY++ +AA L+ + Sbjct: 165 QFEFFWLTHSNSSKIRTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDSSQEWI 224 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAV 198 +D+ M ++E+++ L P V Sbjct: 225 YNVDIFMDRFYENHVALLGVNPPCV 249 >gi|148559939|ref|YP_001259470.1| glycosyl transferase family protein [Brucella ovis ATCC 25840] gi|254702321|ref|ZP_05164149.1| glycosyl transferase family protein [Brucella suis bv. 5 str. 513] gi|254710635|ref|ZP_05172446.1| glycosyl transferase family protein [Brucella pinnipedialis B2/94] gi|256032128|ref|ZP_05445742.1| glycosyl transferase family protein [Brucella pinnipedialis M292/94/1] gi|256061650|ref|ZP_05451789.1| glycosyl transferase family protein [Brucella neotomae 5K33] gi|256160324|ref|ZP_05458018.1| glycosyl transferase family protein [Brucella ceti M490/95/1] gi|256255529|ref|ZP_05461065.1| glycosyl transferase family protein [Brucella ceti B1/94] gi|256370018|ref|YP_003107529.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260169263|ref|ZP_05756074.1| glycosyl transferase, family 25 [Brucella sp. F5/99] gi|261222739|ref|ZP_05937020.1| glycosyl transferase [Brucella ceti B1/94] gi|261318207|ref|ZP_05957404.1| glycosyl transferase [Brucella pinnipedialis B2/94] gi|261325660|ref|ZP_05964857.1| glycosyl transferase [Brucella neotomae 5K33] gi|261752886|ref|ZP_05996595.1| glycosyl transferase [Brucella suis bv. 5 str. 513] gi|261758776|ref|ZP_06002485.1| glycosyl transferase [Brucella sp. F5/99] gi|265989240|ref|ZP_06101797.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] gi|265998701|ref|ZP_06111258.1| glycosyl transferase [Brucella ceti M490/95/1] gi|148371196|gb|ABQ61175.1| glycosyl transferase, family 25 [Brucella ovis ATCC 25840] gi|256000181|gb|ACU48580.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260921323|gb|EEX87976.1| glycosyl transferase [Brucella ceti B1/94] gi|261297430|gb|EEY00927.1| glycosyl transferase [Brucella pinnipedialis B2/94] gi|261301640|gb|EEY05137.1| glycosyl transferase [Brucella neotomae 5K33] gi|261738760|gb|EEY26756.1| glycosyl transferase [Brucella sp. F5/99] gi|261742639|gb|EEY30565.1| glycosyl transferase [Brucella suis bv. 5 str. 513] gi|262553325|gb|EEZ09159.1| glycosyl transferase [Brucella ceti M490/95/1] gi|264661437|gb|EEZ31698.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] Length = 260 Score = 52.0 bits (123), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 173 >gi|254719630|ref|ZP_05181441.1| glycosyl transferase family protein [Brucella sp. 83/13] gi|265984640|ref|ZP_06097375.1| glycosyl transferase [Brucella sp. 83/13] gi|306837735|ref|ZP_07470603.1| glycosyl transferase family protein [Brucella sp. NF 2653] gi|264663232|gb|EEZ33493.1| glycosyl transferase [Brucella sp. 83/13] gi|306407191|gb|EFM63402.1| glycosyl transferase family protein [Brucella sp. NF 2653] Length = 260 Score = 52.0 bits (123), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 173 >gi|16273584|ref|NP_439839.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae Rd KW20] gi|260581038|ref|ZP_05848860.1| conserved hypothetical protein [Haemophilus influenzae RdAW] gi|7387855|sp|P71398|LSG4_HAEIN RecName: Full=Lsg locus putative protein 4 gi|1574550|gb|AAC23343.1| lipopolysaccharide biosynthesis protein, putative [Haemophilus influenzae Rd KW20] gi|260092278|gb|EEW76219.1| conserved hypothetical protein [Haemophilus influenzae RdAW] Length = 257 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 50/174 (28%), Positives = 79/174 (45%), Gaps = 18/174 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRELFFSQKNTEDFQVFSAINTMQKDWDELA-AIFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 EIGC +SH+ ++++I I A++ EDDA F +F Q L L K + IL+ Sbjct: 64 GEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 119 ---------KFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 FD P S+LC G+ + P + T GY I K AA Sbjct: 124 GQSKINDFNDFDLEINYPTTFSFLCKKTGDVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|254714819|ref|ZP_05176630.1| glycosyl transferase family protein [Brucella ceti M644/93/1] gi|254717877|ref|ZP_05179688.1| glycosyl transferase family protein [Brucella ceti M13/05/1] Length = 260 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 122 HEVLISTPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 173 >gi|23502468|ref|NP_698595.1| glycosyl transferase family protein [Brucella suis 1330] gi|254704849|ref|ZP_05166677.1| glycosyl transferase family protein [Brucella suis bv. 3 str. 686] gi|260565910|ref|ZP_05836380.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|23348460|gb|AAN30510.1| glycosyl transferase, family 25 [Brucella suis 1330] gi|260155428|gb|EEW90508.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] Length = 260 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 173 >gi|254452105|ref|ZP_05065542.1| LPS glycosyltransferase subfamily [Octadecabacter antarcticus 238] gi|198266511|gb|EDY90781.1| LPS glycosyltransferase subfamily [Octadecabacter antarcticus 238] Length = 169 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 35/108 (32%), Positives = 48/108 (44%), Gaps = 2/108 (1%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 K L EIGCY+SHI W RIA A G + EDD D L L+ + ++ Sbjct: 60 KAPLVASEIGCYLSHIAAWTRIANGDAAGGFVFEDDFLADDTSGATLADLNVAQSDWDMV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 K + + PK + T G + P + GY + K+AA LL Sbjct: 120 KLFSFDQSPK--AVFETTHGTKRLVVPYRVPTCLIGYGMTKQAAQKLL 165 >gi|261219724|ref|ZP_05934005.1| glycosyl transferase [Brucella ceti M13/05/1] gi|261322614|ref|ZP_05961811.1| glycosyl transferase [Brucella ceti M644/93/1] gi|260924813|gb|EEX91381.1| glycosyl transferase [Brucella ceti M13/05/1] gi|261295304|gb|EEX98800.1| glycosyl transferase [Brucella ceti M644/93/1] Length = 269 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 13 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 70 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 71 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 130 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 131 HEVLISTPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 182 >gi|261755545|ref|ZP_05999254.1| glycosyl transferase [Brucella suis bv. 3 str. 686] gi|261745298|gb|EEY33224.1| glycosyl transferase [Brucella suis bv. 3 str. 686] Length = 268 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 12 YLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLELASFTQISKE--WPAPLSP 69 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 70 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 129 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 130 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 181 >gi|306844605|ref|ZP_07477192.1| glycosyl transferase family protein [Brucella sp. BO1] gi|306275049|gb|EFM56812.1| glycosyl transferase family protein [Brucella sp. BO1] Length = 269 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 13 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 70 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 71 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 130 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 131 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 182 >gi|225628038|ref|ZP_03786074.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella ceti str. Cudo] gi|294852916|ref|ZP_06793589.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|225617201|gb|EEH14247.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella ceti str. Cudo] gi|294821505|gb|EFG38504.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 269 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 13 YLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSP 70 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 71 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 130 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 131 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 182 >gi|86356685|ref|YP_468577.1| putative glycosyltransferase protein [Rhizobium etli CFN 42] gi|86280787|gb|ABC89850.1| putative glycosyltransferase protein [Rhizobium etli CFN 42] Length = 298 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 58/257 (22%), Positives = 103/257 (40%), Gaps = 11/257 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L + RR + A L + A+ G + + F + + R Sbjct: 21 VNTYLINLDRAPLRRFRMERLLAGFGLAYERVAAVDGAAISLPHPGFDEKSYLSRHGRRP 80 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGCY+SH+ +R S A A+ILEDD DF +F++LL + + +++ Sbjct: 81 NPFEIGCYLSHVECARRFLASNAEFALILEDDLDFDGDFAELLEAALRHQAHWDILRLST 140 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + K T + I R + Y I ++AA + + P D+ Sbjct: 141 VNSGRKHRVEPLTASRSLAIALTR--EKGSGAYLINRQAAGWIAGALLPMRLPYDLAFDL 198 Query: 183 WWEHNIPSLVTEPGAVYEAID---TNDSTIEESRLVRKPTFSPLYF-----YRNTCYQWN 234 ++ + + +P V + D + + RL R+ +S L F R ++ Sbjct: 199 EFDDGLRACFVDPLPVGQRADPCSQIQAGLSAYRLGRRRPWSVLPFRAAAEIRRFVARFG 258 Query: 235 LHYNAWRKDLPPVSTTK 251 AWR L P S + Sbjct: 259 -RLAAWRLTLSPASMGR 274 >gi|323144164|ref|ZP_08078800.1| LPS glycosyltransferase [Succinatimonas hippei YIT 12066] gi|322416072|gb|EFY06770.1| LPS glycosyltransferase [Succinatimonas hippei YIT 12066] Length = 265 Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 48/204 (23%), Positives = 81/204 (39%), Gaps = 33/204 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFS-FFDAIYGENNPICNRIFSHQKRQ------ 55 +P++++SL RR+K + + + +F A+ G + K Q Sbjct: 2 LPIFLVSLEHHQNRRDKVFAGLQKYGINDAIWFPAVNGNKEDLTKFNIDRSKHQRFWHNI 61 Query: 56 ---CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD-----ADFSDEFSQLLPH 107 C + R+ + E GC +SH+HL+++I A+I+EDD AD + Q+ Sbjct: 62 ISCCSYSRVYTNGEYGCALSHLHLYQKIVDENISAALIIEDDVVPRNADVTAFLEQIKDV 121 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------------RTTG 154 K + + +D + +S+ I RI P RT+G Sbjct: 122 QEKTGFEILFLDYD-----DRLNSFSKATISFGSIQVKRIGIPNWDWLFNRRKNVYRTSG 176 Query: 155 YFIGKEAAIHLLNVRKNIYRPIDM 178 Y I KE A LL + P D+ Sbjct: 177 YIITKEGAKKLLAKGYPVRMPADI 200 >gi|325268042|ref|ZP_08134688.1| lipooligosaccharide biosynthesis protein [Kingella denitrificans ATCC 33394] gi|324980427|gb|EGC16093.1| lipooligosaccharide biosynthesis protein [Kingella denitrificans ATCC 33394] Length = 263 Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 57/230 (24%), Positives = 97/230 (42%), Gaps = 26/230 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI------YGENNPICNRIFSHQKRQC 56 IP Y+ISLP RR+ A+ + DA+ + +C+ + H+K + Sbjct: 2 IPAYIISLPSETRRRQHVQAECAKYGIPAEMVDAVDMRHIEEAKLRELCS-LPLHKKPKK 60 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL------SK 110 Q R L+ E+GC +SH +++ + A+ILEDDA F + + ++ + Sbjct: 61 Q--RYLTKGELGCALSHHQIYQTMLARQQEFALILEDDATFIQDPTPVVQAAFLQRIKQQ 118 Query: 111 CDINNILIKFDAL---------RKKP-KKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGK 159 CD + +L+ + R+ P KK + T +F P Y I + Sbjct: 119 CDFDVLLLGYVKTVARYLPYYYRRIPIKKRVTMSTPQQDFVFGTPWEQFGCGAVAYVITQ 178 Query: 160 EAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 + A LL++ K P D + + + L P V E ++ DSTI Sbjct: 179 KGAEKLLDITKKPCVPADDWLYFEQQCGLKVLHCRPTLVLEDLENLDSTI 228 >gi|225025759|ref|ZP_03714951.1| hypothetical protein EIKCOROL_02663 [Eikenella corrodens ATCC 23834] gi|224941540|gb|EEG22749.1| hypothetical protein EIKCOROL_02663 [Eikenella corrodens ATCC 23834] Length = 270 Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 64/203 (31%), Positives = 94/203 (46%), Gaps = 34/203 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKRL-- 61 Y+ISL SH RR+ + + FSFFDAI N R+ F + Sbjct: 5 YIISLATSHTRRQHIFSQFNSKSIPFSFFDAISSSEQLNSATTRLLP------LFSEVDY 58 Query: 62 LSLPEIGCYISHIHLWKRIAYS--PAIGAIILEDDADFSDEFSQLLPHLSKCD----INN 115 LSL E GC++SHI LW+ P IG I EDD ++ ++ L + S IN Sbjct: 59 LSLGEKGCFMSHISLWQHCLDQNLPYIG--IFEDDIWLGEQANKFLTNDSWLQEFFPINE 116 Query: 116 -ILIKFDALRKKPKKDSYLCTLPGN-FDIHQP-RILSPR--TTGYFIGKEAAIHLLNVRK 170 +I+ + + + + S +P N + HQ ++ SP T GY I + A LL+ + Sbjct: 117 PFIIRLETVYEPCQIKS----MPRNSYHNHQIYKLCSPHHGTGGYIISQAAIKWLLSYLR 172 Query: 171 NI----YRPIDMDMKHWWEHNIP 189 N + PID+ M +EH IP Sbjct: 173 NTHLDDFTPIDVLM---FEHLIP 192 >gi|265984639|ref|ZP_06097374.1| glycosyl transferase [Brucella sp. 83/13] gi|306837736|ref|ZP_07470604.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. NF 2653] gi|264663231|gb|EEZ33492.1| glycosyl transferase [Brucella sp. 83/13] gi|306407192|gb|EFM63403.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. NF 2653] Length = 261 Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 50/219 (22%), Positives = 95/219 (43%), Gaps = 6/219 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + A++ F A+ G E P + + +R + L+ Sbjct: 4 YLINLDRSSDRLEHMTSQFAKLGADFVKIPAVDGREMPPEALKAVTATERP--WAAPLTP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 EIGC++SH + IA + A+++EDD F+D+ +LL H + ++K + Sbjct: 62 TEIGCFLSHRRCLEAIARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIETQG 121 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID---MDMK 181 KK + + + + + GY + ++AA ++ + + PID + Sbjct: 122 KKVLIGKPIACASTRYSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLFNAP 181 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 + + + P +A T+ E S+L +KP F Sbjct: 182 YGVFNQLSVYQCTPALCKQAGLTSTLQSERSQLYQKPPF 220 >gi|113866928|ref|YP_725417.1| glycosyltransferase involved in LPS biosynthesis [Ralstonia eutropha H16] gi|113525704|emb|CAJ92049.1| glycosyltransferase involved in LPS biosynthesis [Ralstonia eutropha H16] Length = 257 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 33/119 (27%), Positives = 60/119 (50%), Gaps = 16/119 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK--- 59 I YVI+L + RRE+ + A + + F F A+ G R+ + Q+ ++ Sbjct: 2 IGAYVINLDSARERRERISGQLAGLGIPFQVFRAVDG-------RVLAEQEVASKYDAAA 54 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCD 112 R +S E+GC +SH+ +++++ A A++LEDDA + + +L L S+ D Sbjct: 55 AAARYRPMSRGEVGCGLSHLGVYRKMLDDGAAFALVLEDDASPGPDLATVLADLESRLD 113 >gi|85704750|ref|ZP_01035851.1| glycosyl transferase, family 25 [Roseovarius sp. 217] gi|85670568|gb|EAQ25428.1| glycosyl transferase, family 25 [Roseovarius sp. 217] Length = 241 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/169 (23%), Positives = 71/169 (42%), Gaps = 22/169 (13%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD-----FSDEFSQLLPHLSKCDINNI 116 L ++GC++SH +W I +I EDDA F+ + + D+ I Sbjct: 64 LKTSQLGCFLSHRQIWAEIVRRDLDYGLIFEDDATIDPVVFARALALARAQID--DLGYI 121 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + ALR + L G + + R+T +G++AA HLL++ + RP+ Sbjct: 122 QFQTQALRGPAR----LIDTNGGCALSLSLVSGLRSTAQMVGRDAAAHLLSLTETFDRPV 177 Query: 177 DMDMKHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPLY 224 D ++ W + P +Y + N T+E RKP + ++ Sbjct: 178 DCLIQSHWHTRL-----RPAGIYPSGVSRNAETVE-----RKPPLTQIW 216 >gi|157073889|ref|NP_001096660.1| procollagen galactosyltransferase 1-A precursor [Xenopus laevis] gi|160385807|sp|A0JPH3|G251A_XENLA RecName: Full=Procollagen galactosyltransferase 1-A; AltName: Full=Glycosyltransferase 25 family member 1-A; AltName: Full=Hydroxylysine galactosyltransferase 1-A; Flags: Precursor gi|117558235|gb|AAI27423.1| Glt25d1b protein [Xenopus laevis] Length = 611 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 74/174 (42%), Gaps = 14/174 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 V++I+L RRE+ + + + DA+YG+ K +K Sbjct: 332 VFLINLKHRQDRRERMKRTLYELQIDYKLVDAVYGKTLNQTQVSELGIKMLPDYKDPYHG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK I+ + + EDD F F + L L D+ + Sbjct: 392 RPLTRGEMGCFLSHYNIWKEISERNLAVSAVFEDDLRFEIYFKRRLQTLLH-DLETAKLD 450 Query: 120 FDAL---RKKPKKDSYLCTLPG--NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +D + RK+ + D +PG N + S T GY I A LLN Sbjct: 451 WDLIYLGRKRMQVDEPEEPVPGVRNLVVSD---YSYWTLGYLISLRGAKKLLNA 501 >gi|317011021|gb|ADU84768.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SouthAfrica7] Length = 283 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 58/204 (28%), Positives = 90/204 (44%), Gaps = 24/204 (11%) Query: 27 IHLQFSFFDAI-YGENN---PICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IA 81 +HL F D + + EN P + F H + C + + E+GCY SH LW++ I Sbjct: 60 LHLSFMIEDLLAFCENKKHPPCAFKNFFHAIKHC--GKRMGFGELGCYASHYSLWQKCIE 117 Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLP 137 + AI ILEDD + F + L + IN + L+ + K K + Sbjct: 118 LNEAI--CILEDDITLKEHFKESLE-FCRQHINELGYIRLMHLEENVAKQKTPVKGVSQI 174 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEP 195 NF T GY + +AA LL K PID M +H+W H + + V E Sbjct: 175 LNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEE 227 Query: 196 GAVY-EAIDTNDSTIEESRLVRKP 218 A++ + ++T +S E+ R + P Sbjct: 228 FAIFCDGMNTQNSNTEKQRPKKLP 251 >gi|260582795|ref|ZP_05850581.1| glycosyltransferase [Haemophilus influenzae NT127] gi|260094121|gb|EEW78023.1| glycosyltransferase [Haemophilus influenzae NT127] Length = 257 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 51/175 (29%), Positives = 76/175 (43%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + N + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRELFFSQKNIEDFQIFSAINTMQKDCNELAT-IFNIEQFKAHYGRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------L 108 EIGC +SH+ ++++I I A++ EDDA F F Q L L Sbjct: 64 GEIGCTLSHLSVYQKIIEDNEISEDSYALVCEDDALFHPNFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|329756961|gb|AEC04736.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 51/185 (27%), Positives = 79/185 (42%), Gaps = 23/185 (12%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MP+ YVISL + RR+ + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRQHVMNEFSKHHVPFQFFDAV----SPSLQLDFLIQRLVPNLNG 56 Query: 61 L-LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH------LSKCDI 113 L+ E GC ISH+ LW + I EDD E + L L CD Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGREAKKFLAEDDWLFSLFNCD- 115 Query: 114 NNILIKFDALRKKPKKDSYLC-TLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLN 167 + +I+ + ++ +C LP H L+ + T GY I AA +LL+ Sbjct: 116 DVFIIRLETFLQET-----MCRALPHPISYHGRDFLALKEEHLGTAGYIISLGAAKYLLS 170 Query: 168 VRKNI 172 + +N+ Sbjct: 171 ILRNM 175 >gi|255323434|ref|ZP_05364565.1| LPS glycosyltransferase [Campylobacter showae RM3277] gi|255299471|gb|EET78757.1| LPS glycosyltransferase [Campylobacter showae RM3277] Length = 251 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 62/238 (26%), Positives = 93/238 (39%), Gaps = 29/238 (12%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M V+VISL ARREK R + + +F +A G + R Sbjct: 1 MKKLVFVISLKSDEARREKLKERF-KNYGEFKLVEATDGRTMSAKEYYGYALPSLEAYDR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA-DFSDEFSQLLPHLSKCDINNILI- 118 LLS EIGC +SH+ ++ S A A+ILEDD D + +K D + LI Sbjct: 60 LLSPSEIGCSLSHVCAYEEFLKSDAKFALILEDDVIGDEDGVKKAFETAAKMDEGSALIC 119 Query: 119 --------KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 +F A KK + D +L + I+ R Y + + AA +L K Sbjct: 120 GAQDGLEGRFSAFGKKLEDDFWLVSKRSYGTIY-------RAAAYVLDRRAAEKILQTHK 172 Query: 171 NIYRPID----MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR-KPTFSPL 223 D + +++ + + P D DS I+ R+ R + SPL Sbjct: 173 RALCVADFWRILLLQNGLKMYFSDIFAHP------TDLTDSNIQAERVQRTQVKVSPL 224 >gi|308800464|ref|XP_003075013.1| Lysyl hydrolase/glycosyltransferase family 25 (ISS) [Ostreococcus tauri] gi|116061566|emb|CAL52284.1| Lysyl hydrolase/glycosyltransferase family 25 (ISS) [Ostreococcus tauri] Length = 464 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 5/66 (7%) Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ-LLPHL 108 SH RQ +RL GCY+SH+ LW+R+ + A AI+LEDDA S +F++ L+ L Sbjct: 314 SHAMRQSLHERL----RFGCYLSHVSLWRRVIRAEAPLAIVLEDDAVVSSKFARALVSRL 369 Query: 109 SKCDIN 114 K I+ Sbjct: 370 DKLPID 375 >gi|89076364|ref|ZP_01162697.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium sp. SKA34] gi|89047935|gb|EAR53526.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium sp. SKA34] Length = 242 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 71/250 (28%), Positives = 109/250 (43%), Gaps = 19/250 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFK-R 60 + ++VI+LP S RR + I + F FF AI G P K + F+ R Sbjct: 1 MKIFVINLPSSSDRRPFISKKMEDIGIDFEFFSAIDGRKGLPQDLANCPDDKHRIYFRSR 60 Query: 61 LLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS E GCY SH LW++ I I +ILEDD + F Q+ H +++N + Sbjct: 61 PLSSGEKGCYASHYRLWQKCIELQQPI--VILEDDCIPTSFFKQI--HTKLPELHNEGYE 116 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + L+ + + + L + I + T GY I A L+ K P+D Sbjct: 117 YLRLQGQQGQFKVIEQL-SDLQIVLWKENLIGTVGYSISPAGAKKLIAKSKKWRCPVDNF 175 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE------ESRLVRKPTFSPLY-FYRNTCYQ 232 + + N+ +P A+ E D STI+ E L+ K +F +Y YRN + Sbjct: 176 IGESYRTNLLCTGIQPYAI-EHDDNQTSTIQPTSAKSEVSLIYK-SFREIYRSYRNI--R 231 Query: 233 WNLHYNAWRK 242 LHY+ +K Sbjct: 232 MTLHYHLTKK 241 >gi|91205319|ref|YP_537674.1| glycosyltransferase [Rickettsia bellii RML369-C] gi|157826924|ref|YP_001495988.1| glycosyltransferase [Rickettsia bellii OSU 85-389] gi|91068863|gb|ABE04585.1| Glycosyltransferase [Rickettsia bellii RML369-C] gi|157802228|gb|ABV78951.1| Glycosyltransferase [Rickettsia bellii OSU 85-389] Length = 290 Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 64/242 (26%), Positives = 98/242 (40%), Gaps = 41/242 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--------------- 47 I ++VI+L S R + I+L + F A+ G I N+ Sbjct: 52 IKIFVINLDHSTNRYNNINKQFIDINLSYERFSAVDGYELSIANQKGEKFTGLDLKNNSS 111 Query: 48 ---------IF--SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA- 95 IF S R L+ E GCY SH +W ++ A+ILEDDA Sbjct: 112 LLSLDNNYTIFCPSENINYYALNRTLTAGEFGCYCSHREIWLKMVEENIPYALILEDDAI 171 Query: 96 ---DFSDEFSQLLPHL-SKCDINNILIKFDALRKKPKKDSY-LCTLPGNFDIHQPRILSP 150 +F +FS ++ L S D+ + + + P++ Y + P I+ R Sbjct: 172 LKENFYKKFSTIIKSLPSDWDLVYLFLMY-----LPRQKFYKIYNNPYLKKINTQRYFYT 226 Query: 151 RTTGYFIGKEAAIHLLNVRKNIYRPIDMDM-KHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 +TGY I +AA LL K+ PID + + + I + +T P + D DSTI Sbjct: 227 -STGYLINLKAATKLLKYSKSFSEPIDDSIAQEVADSRIQAYITTPFLI--NTDFKDSTI 283 Query: 210 EE 211 E Sbjct: 284 IE 285 >gi|226954444|ref|ZP_03824908.1| glycosyltransferase LpsA [Acinetobacter sp. ATCC 27244] gi|226834793|gb|EEH67176.1| glycosyltransferase LpsA [Acinetobacter sp. ATCC 27244] Length = 270 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 71/174 (40%), Gaps = 4/174 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL ++ RR+ + + F FFDA+ P ++ K L+ Sbjct: 4 YVISLTYADQRRDHILSIFQKQAIDFCFFDAV----TPATMEQIANVLGLDITKTDLAKS 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E+ C +SH LW++ I EDD +E + L +IK +A Sbjct: 60 EVACLLSHAALWQKAIDENLSHIAIFEDDIHLGEEATIFLTKSDWIPEGCSIIKLEAFYP 119 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K DS +LP + + + GY + ++AA LLN+ + I +D Sbjct: 120 KVGVDSSFRSLPSKRKLSLLKTVHMGCGGYILSQQAARDLLNLLATYEQLIPVD 173 >gi|307340800|gb|ADN43861.1| WcnL [Escherichia coli] Length = 248 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 34/95 (35%), Positives = 46/95 (48%), Gaps = 4/95 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC--QFKR 60 I +Y I+L +ARREK A ++ L +AI G N + + + + R Sbjct: 2 IKIYTINLFTENARREKILSLAKKLELPIKIENAIIGSN--LSKEYLDNLTNESIDKIGR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA 95 LS E+GCY+SHI K S AIILEDD Sbjct: 60 NLSPGEVGCYLSHIKCLKNFLSSGDEFAIILEDDV 94 >gi|163757900|ref|ZP_02164989.1| glycosyl transferase, family 25 [Hoeflea phototrophica DFL-43] gi|162285402|gb|EDQ35684.1| glycosyl transferase, family 25 [Hoeflea phototrophica DFL-43] Length = 252 Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 45/213 (21%), Positives = 89/213 (41%), Gaps = 7/213 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQC-QFK 59 I ++I L + ARR + H + DA+ G + + ++S + + Sbjct: 7 IEAFIIHLARATARRPQVDHLLKACPVPARVLDAVDGRAMSEAEIDAVYSRKSMHAPHYP 66 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPHLSKCDINNILI 118 ++ E+GC++SH W+ I +++EDD + +D F+ L + + ++ Sbjct: 67 FEMTAGEVGCFLSHRKAWQAIVACGRDAGLVIEDDVEIDADVFATALEFAREHVSAHGIV 126 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F ++ + L + +P I+ R + + AA LL RPID Sbjct: 127 QFQV--RRIAEPGPLIASTDAVVLARPMIIPLRASCTLYSRSAAERLLAQTGRFDRPIDG 184 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 ++ W + L+ P V + A D +TI+ Sbjct: 185 HVQLHWVTGLRPLIAVPSGVRDKAADIGGTTIQ 217 >gi|126735046|ref|ZP_01750792.1| hypothetical protein RCCS2_14254 [Roseobacter sp. CCS2] gi|126715601|gb|EBA12466.1| hypothetical protein RCCS2_14254 [Roseobacter sp. CCS2] Length = 261 Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 44/157 (28%), Positives = 77/157 (49%), Gaps = 12/157 (7%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 ++GCY SHI +W+ S + A+ILEDD F D+F Q L + + ++F+ +R Sbjct: 74 KMGCYASHIAVWEAFLASDSDVALILEDDVVFHDDFLQSLDLALEASDHWDTVRFNCIRA 133 Query: 126 K-PKKDSYLCTLPGNFDIHQPRILSPRT--TGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 K P + Y+ G++ ++ + P T Y I ++ A +L R D ++ Sbjct: 134 KLPVRQGYV----GDYSLNA--YVGPFTGNAAYLIKRDVAQRVLPNLWPQTRAFDHELNR 187 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--LVRK 217 ++ H+ EP + + D N+S+I S LV+K Sbjct: 188 FFLHDFRQCGLEPFSTH-VDDGNESSITGSNFALVKK 223 >gi|307637495|gb|ADN79945.1| Beta-1,4-galactosyl transferase [Helicobacter pylori 908] gi|325996084|gb|ADZ51489.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997680|gb|ADZ49888.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2017] Length = 281 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 79/183 (43%), Gaps = 18/183 (9%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L K IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIVLKERFKESLEFCDK-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 207 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQ 232 ID M ++ H + + V E A+ + ++ +S E+ R R P+ + + ++ Q Sbjct: 208 IDCVMDRYYWHGVKNYVLEEFAIACDGMNAQNSNTEKQRPKRLPSSIRIGRFLHKGAIKQ 267 Query: 233 WNL 235 WN+ Sbjct: 268 WNI 270 >gi|315638778|ref|ZP_07893951.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] gi|315481187|gb|EFU71818.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] Length = 215 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 48/196 (24%), Positives = 87/196 (44%), Gaps = 21/196 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + +++I+L S R+E + +++ L+F FF+A+ G + F Sbjct: 1 MKIFIINLLRSTHRKESMKEQISKLFSENPSLKDRLEFIFFEAVDGGGGEEFAK-FHPAW 59 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + F R LSL E GC+ SH LW++ G ++EDD +F+ F+ ++ Sbjct: 60 AKYLFGRELSLGEKGCFASHYKLWQK-CLELNEGIFVIEDDVEFTQAFNA-------NNL 111 Query: 114 NNILIK-FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 IL ++ +R K++ L NF I + T GY++ AA + + + Sbjct: 112 EKILKSPYEYVRIYHIKEAKFYDLELNFKITFSNVAG--TQGYYLKPSAARKFIKKCQFL 169 Query: 173 YRPIDMDMKHWWEHNI 188 +P+D M + NI Sbjct: 170 IKPVDDSMDYSEVLNI 185 >gi|300021717|ref|YP_003754328.1| glycosyl transferase family 25 [Hyphomicrobium denitrificans ATCC 51888] gi|299523538|gb|ADJ22007.1| glycosyl transferase family 25 [Hyphomicrobium denitrificans ATCC 51888] Length = 244 Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 45/191 (23%), Positives = 86/191 (45%), Gaps = 25/191 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V+VI+L + ++ + + +I F +A+ GE P ++ + F +SL Sbjct: 3 VFVINLASAT---DRLAYISGQIGGPFERIEAVRGEAVP--------ERLKANFSGTVSL 51 Query: 65 --PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 EIGCY SH+ + I A++LEDDA + +F ++ + C + ++ +D Sbjct: 52 LPGEIGCYASHLIAAENIVARGLPYAVVLEDDAILASDFHEV---VETC-VKHLPAGWDI 107 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILS-----PRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + K C L + R L P+TT ++ ++ L + RP+D Sbjct: 108 VALSDVKSLPHCRLS---QLSGERWLVRYAHFPKTTTAYVLSQSGCRKLLASRPRTRPVD 164 Query: 178 MDMKHWWEHNI 188 +D+++ WE + Sbjct: 165 VDIRYGWEMGL 175 >gi|161619545|ref|YP_001593432.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtE [Brucella canis ATCC 23365] gi|161336356|gb|ABX62661.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtE [Brucella canis ATCC 23365] Length = 260 Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 48/178 (26%), Positives = 77/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ F+ ++ + LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLELASFTQISKE--WPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R + GY + +EAA LL + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQMGGYVVSREAARKLLPLMEKASAPVD 173 >gi|194272156|ref|NP_001123548.1| procollagen galactosyltransferase 2 [Danio rerio] gi|159570814|emb|CAP19485.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 613 Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 79/173 (45%), Gaps = 10/173 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RRE+ + A + L+ + DA+ G+ + +K Sbjct: 332 IFLINLKRRFDRRERMLNTMAVLGLEATLVDAVDGKTLNTSQLQALGIEMMPGYKDPYSG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+L+ EIGC++SH WK++ ++LEDD F F + L + K D+ + Sbjct: 392 RVLTRGEIGCFLSHHFTWKQVLERGLRHVLVLEDDVRFEPRFKRRLQTIMK-DVEKTQLN 450 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 +D + RK+ + ++ G ++ + S T GY + ++ A LL + Sbjct: 451 WDLIYVGRKRMQVAQPEVSVEGVNNLVEAD-YSYWTLGYALSQQGAKKLLAAQ 502 >gi|294651461|ref|ZP_06728774.1| glycosyltransferase LpsA [Acinetobacter haemolyticus ATCC 19194] gi|292822611|gb|EFF81501.1| glycosyltransferase LpsA [Acinetobacter haemolyticus ATCC 19194] Length = 270 Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 43/174 (24%), Positives = 71/174 (40%), Gaps = 4/174 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL ++ RR+ + + F FFDA+ P ++ K L+ Sbjct: 4 YVISLTYADQRRDHILSIFQKQAIDFCFFDAV----TPATMEQIANALGLDITKTDLAKS 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E+ C +SH LW++ I EDD +E + L +IK +A Sbjct: 60 EVACLLSHAALWQKAIDENLSHIAIFEDDIHLGEEATIFLTKSDWIPEGCSIIKLEAFYP 119 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K +S +LP + + + GY + ++AA LLN+ + I +D Sbjct: 120 KVGVESSFRSLPSKRKLSLLKTVHMGCGGYILSQQAARDLLNLLATYEQLIPVD 173 >gi|148935|gb|AAA24981.1| ORF 4 [Haemophilus influenzae] Length = 257 Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSDEFSQLLPH--------L 108 EIGC +SH+ ++++I I A++ EDDA DF + LL L Sbjct: 64 GEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHLDFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + +INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSNINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|15611808|ref|NP_223459.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155297|gb|AAD06313.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 292 Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 52/184 (28%), Positives = 81/184 (44%), Gaps = 20/184 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 108 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEHFKESL-EFCRQHINELGYIR 164 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 165 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 218 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ID M +H+W H + + V E A+ + I+ +S E+ R R P + + +++ Sbjct: 219 IDCVMDRHYW-HGVKNYVLEEFAIACDGINAQNSNTEKQRPKRLPLSIRIGRFLHKSAIQ 277 Query: 232 QWNL 235 QWN+ Sbjct: 278 QWNI 281 >gi|300717598|ref|YP_003742401.1| LPS biosynthesis glycosyltransferase [Erwinia billingiae Eb661] gi|299063434|emb|CAX60554.1| Putative LPS biosynthesis glycosyltransferase [Erwinia billingiae Eb661] Length = 256 Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 4/107 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 PI ++VI+L S RRE R ++L F +A+ G N + + + + Sbjct: 4 PIRIFVINLKSSVTRRESLQRRLDELNLTAEFIEAVNGHN--LSESELAEHTLPVNYAYM 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 E+GC +SH ++K++ A+ILEDD D +L + Sbjct: 62 KG--EVGCALSHQLIYKKMVAGNIATALILEDDVYLPDNLPVILSQI 106 >gi|110006041|gb|ABG48523.1| LpsB [Haemophilus influenzae] gi|110006043|gb|ABG48524.1| LpsB [Haemophilus influenzae] Length = 256 Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + T GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|57241911|ref|ZP_00369851.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] gi|57017103|gb|EAL53884.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] Length = 243 Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 51/196 (26%), Positives = 88/196 (44%), Gaps = 21/196 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + +++I+L S R+E + +++ HL+F FF+A+ G+ + Sbjct: 1 MKIFIINLLRSTHRKESIRKQISKLFSENPSLKDHLEFIFFEAVDGDGGEEFAKFHPAWA 60 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCD 112 + R LS E GC+ SH LW++ G ++EDD +F+ F ++ L + K Sbjct: 61 KYLN-GRKLSPGEKGCFASHYKLWQK-CLELNEGIFVIEDDVEFTQAFNAKNLEKILKSP 118 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 + I + +R+K SY L +F+I + R GYFI AA + + + Sbjct: 119 YEYVRIHY--IREKS---SY--ALNNHFNITFSNVAGTR--GYFIKPSAAKKFIKHTRIL 169 Query: 173 YRPIDMDMKHWWEHNI 188 PID M + NI Sbjct: 170 INPIDDSMDYSTVSNI 185 >gi|301169482|emb|CBW29083.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 10810] Length = 278 Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + T GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK--SVKC----DRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|145640853|ref|ZP_01796435.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae R3021] gi|145274367|gb|EDK14231.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 22.4-21] Length = 286 Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 39 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 97 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 98 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 156 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + T GY + + A +LL + KN +P+ Sbjct: 157 M-----FFKQPK------SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 203 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 204 DVAVDSLVFEDFLHFKDYKIVQL--SPG 229 >gi|110006045|gb|ABG48525.1| LpsB [Haemophilus influenzae] gi|110006047|gb|ABG48526.1| LpsB [Haemophilus influenzae] Length = 256 Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + T GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|145633631|ref|ZP_01789358.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 3655] gi|145635446|ref|ZP_01791147.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittAA] gi|229845232|ref|ZP_04465365.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 6P18H1] gi|144985508|gb|EDJ92324.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 3655] gi|145267320|gb|EDK07323.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittAA] gi|229811827|gb|EEP47523.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 6P18H1] Length = 257 Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 49/174 (28%), Positives = 81/174 (46%), Gaps = 18/174 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 EIGC +SH+ ++++I I A++ EDDA F +F Q L L K + IL+ Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 119 ---KFDALRKK------PKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 K ++ P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINSFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|164604787|gb|ABY61954.1| lipopolysaccharide glycosyltransferase [Haemophilus influenzae] gi|309973739|gb|ADO96940.1| Lipooligosaccharide glucosyltransferase LpsA [Haemophilus influenzae R2846] Length = 278 Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + T GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK--SVKC----DRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|210135004|ref|YP_002301443.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] gi|210132972|gb|ACJ07963.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] Length = 278 Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 58/207 (28%), Positives = 90/207 (43%), Gaps = 23/207 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPTCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIII 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 D F + L K IN + L+ + K K + NF T Sbjct: 132 KDRFKESLEFCDK-HINELGYIRLMHLEENVAKQKTPVKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++T +S E Sbjct: 185 GYVLTPKAAQKLLKYSTKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTE 243 Query: 211 ESRLVRKPTFSPL--YFYRNTCYQWNL 235 + R + P + ++ QWN+ Sbjct: 244 KQRPKKLPLSIRIGRSLHKGAIKQWNI 270 >gi|217032900|ref|ZP_03438377.1| hypothetical protein HPB128_152g17 [Helicobacter pylori B128] gi|298736508|ref|YP_003729034.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|216945394|gb|EEC24062.1| hypothetical protein HPB128_152g17 [Helicobacter pylori B128] gi|298355698|emb|CBI66570.1| glycosyl transferase, family 25 [Helicobacter pylori B8] Length = 281 Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 56/207 (27%), Positives = 91/207 (43%), Gaps = 23/207 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F H + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFHAIKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIVL 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + F + L + IN + L+ + K K + NF T Sbjct: 132 KERFKESL-EFCRQHINELGYIRLMHLEENVAKQKTPVKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++ +S E Sbjct: 185 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTE 243 Query: 211 ESRLVRKPTFSPL--YFYRNTCYQWNL 235 + R + P + + +++ QWN+ Sbjct: 244 KQRPKKLPLSIRIGRFLHKSAIQQWNI 270 >gi|110006039|gb|ABG48522.1| LpsB [Haemophilus influenzae] Length = 256 Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + T GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|317178835|dbj|BAJ56623.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F30] Length = 278 Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 57/204 (27%), Positives = 92/204 (45%), Gaps = 17/204 (8%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIV 131 Query: 98 SDEFSQLLPHLSKCDINNI-LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 + F + L + IN + I+ L + K L + I T GY Sbjct: 132 KERFKESLEFCYQ-HINELGYIRLMHLEENVAKQKTLIKGVSQILNFKDGI---GTQGYV 187 Query: 157 IGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESR 213 + +AA LL K PID M +H+W H + + V E A+ + ++T +S E+ R Sbjct: 188 LAPKAAQKLLKYSTKKWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEKQR 246 Query: 214 LVRKPTFSPL--YFYRNTCYQWNL 235 + P + +++T QWN+ Sbjct: 247 PKKLPLSIRIGRSLHKSTVKQWNV 270 >gi|68250301|ref|YP_249413.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 86-028NP] gi|229847318|ref|ZP_04467420.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 7P49H1] gi|68058500|gb|AAX88753.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 86-028NP] gi|229809743|gb|EEP45467.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 7P49H1] Length = 257 Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 49/175 (28%), Positives = 78/175 (44%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------L 108 EIGC +SH+ ++++I I A++ EDDA F +F Q L L Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|327277409|ref|XP_003223457.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2-like [Anolis carolinensis] Length = 631 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 54/214 (25%), Positives = 92/214 (42%), Gaps = 31/214 (14%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+L+ EIGC++SH ++WK + +++EDD F +F + L L DI + Sbjct: 409 RVLTRGEIGCFLSHYYIWKEVVDRELEKTLVIEDDVRFEHQFKKKLTKLMD-DIERAQLD 467 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR------- 169 ++ + RK+ + + +P ++ +P S T GY I + A L+ + Sbjct: 468 WELIYIGRKRMQVERPEKAVPNVVNLVEPD-YSYWTLGYAISLQGAQKLIEAQPFSKMLP 526 Query: 170 -----KNIYR--PIDMDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESR 213 +Y P+ M+++ ++ + EP VY DT STI ++ Sbjct: 527 VDEFLPVMYNKHPVAKYMEYYEPRDLKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWDNE 586 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYNAWRKD-LPP 246 V T R T Q +H +A KD LPP Sbjct: 587 TVA--TDWDRTLSRKTRQQGQIHTDAQNKDALPP 618 >gi|163843853|ref|YP_001628257.1| lipooligosaccharide biosynthesis protein lic2B [Brucella suis ATCC 23445] gi|163674576|gb|ABY38687.1| Lipooligosaccharide biosynthesis protein lic2B [Brucella suis ATCC 23445] Length = 260 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 48/178 (26%), Positives = 77/178 (43%), Gaps = 14/178 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L S R E + R+ QF +A+ G +P+ + S + ++ LS Sbjct: 4 YLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPL--ELASFIQISKEWPAPLSP 61 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-- 122 EIGC++SH ++IA A + EDD S S+ ++K DA Sbjct: 62 AEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFWASDHWIPKQADIVKIDAYG 121 Query: 123 ---LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L P K+ G + I + R +T GY + +EAA LL + + P+D Sbjct: 122 HEVLISNPVKNE------GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 173 >gi|170038076|ref|XP_001846879.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167881499|gb|EDS44882.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 496 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 29/104 (27%), Positives = 52/104 (50%), Gaps = 5/104 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRI---FSHQKRQCQFK 59 +Y+I+L RR K + + L+ +F A+ G+ N+ + I F K Sbjct: 211 IYMINLERRPERRLKMVNNFDALGLEVEYFPAVDGKKLNDDVLREIGVEFLPGYTDPYHK 270 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 R +++ EIGC++SH ++W+R+ ++LEDD F F + Sbjct: 271 RPMTMGEIGCFLSHYYIWERMVELGQEEVLVLEDDIRFEPYFKR 314 >gi|309972907|gb|ADO96108.1| Lipopolysaccharide biosynthesis protein LsgD [Haemophilus influenzae R2846] Length = 257 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 49/175 (28%), Positives = 76/175 (43%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRELFFSQKNTEDFQIFSAINTMQKDWDELA-AIFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------L 108 EIGC +SH+ ++++I I ++ EDDA F +F Q L L Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|226225975|ref|YP_002760081.1| putative glycosyltransferase [Gemmatimonas aurantiaca T-27] gi|226089166|dbj|BAH37611.1| putative glycosyltransferase [Gemmatimonas aurantiaca T-27] Length = 251 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 5/100 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGEN----NPICNRIFSHQKRQCQ 57 IPVY +SL + RR + +I +Q F F DA+ + +++ + Q Sbjct: 27 IPVYCVSLRGATERRSLIQSQVEKIGIQHFQFVDAVQASDLTHEKAASENLYAPEICQHF 86 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 R L L EI C +SH +++ IA S A ++I+EDDA F Sbjct: 87 HGRDLRLTEIACSLSHARIYRMIANSTAPWSLIIEDDALF 126 >gi|301170452|emb|CBW30059.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 10810] Length = 257 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 49/175 (28%), Positives = 78/175 (44%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSDEFSQLLPH--------L 108 EIGC +SH+ ++++I I A++ EDDA DF + LL L Sbjct: 64 GEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHLDFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|148652793|ref|YP_001279886.1| glycosyl transferase family protein [Psychrobacter sp. PRwf-1] gi|148571877|gb|ABQ93936.1| glycosyl transferase, family 25 [Psychrobacter sp. PRwf-1] Length = 255 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. Identities = 47/180 (26%), Positives = 79/180 (43%), Gaps = 14/180 (7%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VISL + RR+ A+ + + FFDA+ P ++ + L+ E Sbjct: 5 VISLSSATDRRQHIESEFAKHQVDYEFFDAL----TPDVATPYAKKLGLDVSSANLTPGE 60 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 + C +SH+ LW++I A I EDD ++ S LL + +IK +A KK Sbjct: 61 VACMMSHVALWEKIINEDLEYATIFEDDIYLGEDSSALLNRTDWIKPDWHIIKIEAFAKK 120 Query: 127 PKKDSYLC-----TLPGNFDIHQPRILSPRTTGYFIGKEAA-IHLLNVRKNIYRPIDMDM 180 +YL L + Q + + T GY + K+ A I+L ++++ P+D M Sbjct: 121 ----TYLANKTHEVLANKRHVTQLKGKNLGTAGYILSKQGAKIYLDYLKRHALLPLDRVM 176 >gi|145637636|ref|ZP_01793291.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittHH] gi|145269159|gb|EDK09107.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittHH] Length = 257 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 51/176 (28%), Positives = 76/176 (43%), Gaps = 22/176 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 Y+ISL RR+ F + + F F AI +N IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQ--KNTEDFQIFSAINTMQKNWDELAAIFNIEQFKAHYFRNVT 62 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------ 107 EIGC +SH+ ++++I I A++ EDDA F +F Q L Sbjct: 63 KGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAEIIL 122 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 L + INN D P S+LC GN + P + T GY I K AA Sbjct: 123 LGQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|308063408|gb|ADO05295.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 278 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 53/185 (28%), Positives = 83/185 (44%), Gaps = 22/185 (11%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LIK 119 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + I+ Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCYQ-HINELGYIR 153 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDM 178 L + K L + I T GY + +AA LL K PID Sbjct: 154 LMHLEENVAKQKTLIKGVSQILNFKDGI---GTQGYVLAPKAAQKLLKYSAKEWVMPIDC 210 Query: 179 DM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL------YFYRNTC 230 M +H+W H + + V E A+ + ++T +S E+ +KP PL + +++ Sbjct: 211 VMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEK----QKPKKLPLSIRIGRFLHKSAL 265 Query: 231 YQWNL 235 QWN+ Sbjct: 266 KQWNV 270 >gi|110006051|gb|ABG48528.1| LpsA [Haemophilus influenzae] gi|110006053|gb|ABG48529.1| LpsA [Haemophilus influenzae] gi|110006063|gb|ABG48534.1| LpsA [Haemophilus influenzae] gi|110006065|gb|ABG48535.1| LpsA [Haemophilus influenzae] gi|110006067|gb|ABG48536.1| LpsA [Haemophilus influenzae] Length = 256 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-VALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+P+ S C + +I+ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPQ--SVKC----DRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|84515249|ref|ZP_01002611.1| hypothetical protein SKA53_01286 [Loktanella vestfoldensis SKA53] gi|84510532|gb|EAQ06987.1| hypothetical protein SKA53_01286 [Loktanella vestfoldensis SKA53] Length = 256 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 46/198 (23%), Positives = 86/198 (43%), Gaps = 12/198 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + +++I+L +R + + + L ++ F AI G + + R+ + + + Sbjct: 9 LGIWLINLDRDTSRLARMQQQLDAMGLDYTRFGAIDGRARADELAPRVNAAAYAR-NMGQ 67 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 +S IGCY SH+ +W ++A + A+ILEDD F D+F L ++F Sbjct: 68 AVSPGHIGCYASHVAVWDQVAQATQPVALILEDDVVFHDDFLDSLDAALAGAAAWDTLRF 127 Query: 121 DALRKK-PKKDSYLCTLPGNFDIHQPRILSPRTTG--YFIGKEAAIHLLNVRKNIYRPID 177 + +R K P YL G + ++ + P T Y + ++ A L R D Sbjct: 128 NCIRAKWPVCQGYL----GRYRLNA--YIGPFTGNATYLVKRDVAGRLAPAFWPQTRAHD 181 Query: 178 MDMKHWWEHNIPSLVTEP 195 +M ++ H+ EP Sbjct: 182 HEMNRFFHHDFRQFGLEP 199 >gi|110006061|gb|ABG48533.1| LpsA [Haemophilus influenzae] Length = 256 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-VALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+P+ S C + +I+ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPQ--SVKC----DRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKILQL--SPG 199 >gi|296190930|ref|XP_002743398.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Callithrix jacchus] Length = 595 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 11/144 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + Q Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F + L L + D+ ++ Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRRRLERLME-DVEAEKLR 440 Query: 120 FDAL---RKK--PKKDSYLCTLPG 138 +D + RK+ P+K++ + LPG Sbjct: 441 WDLIYLGRKQVNPEKEAAVEGLPG 464 >gi|332184819|gb|AEE27073.1| putative beta1,4-galactosyltransferase [Francisella cf. novicida 3523] Length = 249 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 28/97 (28%), Positives = 51/97 (52%), Gaps = 3/97 (3%) Query: 3 IPVYVISLPFSHARREKFC-HRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFK 59 + V+VI+ R+E H + ++SFF+A+ G + + R + + Q Sbjct: 2 MNVFVINRKKDIGRKEYIIQHFKEKGIERYSFFEAVEGRDLDSNFIKRCNGYLSKTLQRA 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD 96 + L++ EIGC +SH+ ++K+I GA+I EDD + Sbjct: 62 KGLNVGEIGCALSHLEIYKKIVNEGLAGALIFEDDVE 98 >gi|251792806|ref|YP_003007532.1| hypothetical protein NT05HA_1079 [Aggregatibacter aphrophilus NJ8700] gi|247534199|gb|ACS97445.1| Lsg locus putative protein 4 [Aggregatibacter aphrophilus NJ8700] Length = 256 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 82/183 (44%), Gaps = 18/183 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + FS + + E + + +F+ K + + R ++ Sbjct: 4 YLISLDKDVRRRELFFAQPDTADFTVFSAINTMQKEWDELA-EVFNSTKFEQHYGRNVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 EIGC +SH+ ++++I + A++ EDDA F+ S L KCD + +LI Sbjct: 63 GEIGCTLSHLGVYRQIVEDRNVTENEYALVCEDDALFNANLSPKTTALLTEKCDADIVLI 122 Query: 119 ------KFDALRKK---PKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLNV 168 +FD + + P S+L G+ + P + T Y I K AA L V Sbjct: 123 GQSKIAEFDDVELEINYPTTFSFLRKTFGDVTVAYPYKSYFAGTVAYLIKKSAARVFLTV 182 Query: 169 RKN 171 +N Sbjct: 183 LEN 185 >gi|224437264|ref|ZP_03658236.1| hypothetical protein HcinC1_04787 [Helicobacter cinaedi CCUG 18818] Length = 267 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 51/211 (24%), Positives = 83/211 (39%), Gaps = 38/211 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC--NRIFSHQKRQCQFKR 60 + +++I+LP + R A + QF D+ E I N I + K FK+ Sbjct: 1 MKLFIINLP-------QATERKALMSKQFENLDSKLKEKYEIIFFNAINAQAKEHLNFKQ 53 Query: 61 L------------LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 LS E C+ SH LW++ S I+LEDD + F Q L + Sbjct: 54 YSKIKSILFRGKELSDGERACFASHYTLWQKCVESNE-PIIVLEDDVEILPHFWQELERI 112 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS----PRTTGYFIGKEAAIH 164 + + + YL T +D+ + + T GY++ AA Sbjct: 113 EQSAFAYVRL------------MYLITKAKFYDLQKGFHFTFNNVAGTQGYYLTPTAAKA 160 Query: 165 LLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + K+ YRP+D M ++ H IP + +P Sbjct: 161 FIESAKSWYRPVDDYMDMFYIHKIPIVCVKP 191 >gi|148827955|ref|YP_001292708.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae PittGG] gi|148719197|gb|ABR00325.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae PittGG] Length = 237 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 54/203 (26%), Positives = 87/203 (42%), Gaps = 32/203 (15%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 30 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 88 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 89 EIGCALSHIVLWD-VALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANG- 146 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 K+P+ ++ + +I+ + GY + + A +LL + KN +P+ Sbjct: 147 ----KMFFKQPQ------SVKCDRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 194 Query: 177 DM--------DMKHWWEHNIPSL 191 D+ D H+ ++ I L Sbjct: 195 DVAVDSLVFEDFLHFKDYKILQL 217 >gi|149039147|gb|EDL93367.1| rCG45647, isoform CRA_a [Rattus norvegicus] Length = 596 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-------RQCQ 57 V+VISL RR + + + DA+ G + + I H + Sbjct: 323 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRT--LNSSILKHLGVDLLPGYQDPY 380 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + D+ Sbjct: 381 SGRTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLME-DVLTQK 439 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+++ + LPG Sbjct: 440 LSWDLIYLGRKQVNPEEEVAVEGLPG 465 >gi|296190932|ref|XP_002743399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2 [Callithrix jacchus] Length = 548 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 11/144 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + Q Sbjct: 275 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 334 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F + L L + D+ ++ Sbjct: 335 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRRRLERLME-DVEAEKLR 393 Query: 120 FDAL---RKK--PKKDSYLCTLPG 138 +D + RK+ P+K++ + LPG Sbjct: 394 WDLIYLGRKQVNPEKEAAVEGLPG 417 >gi|325268033|ref|ZP_08134679.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] gi|324980418|gb|EGC16084.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] Length = 277 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 60/231 (25%), Positives = 94/231 (40%), Gaps = 26/231 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YV+S+ + RR+ + + F FFDA+ E + I H L+ Sbjct: 4 YVVSIRTAAQRRQHVAAEFNKHQIAFHFFDAVTPET--LAESIAEHCPNLAD--AFLTGG 59 Query: 66 EIGCYISHIHLWKRIAYS--PAIGAIILEDDADFSDEFSQLLPHLSKCDI----NNILIK 119 E GC++SH+ LW + P IG I EDD F S+ L D + +I+ Sbjct: 60 EKGCFMSHVCLWAKCVADDLPYIG--IFEDDVIFGQNSSRFLNDTKWLDERFQNQSFIIR 117 Query: 120 FDA-LRKKPKKDSYLCTLPGNF-DIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 + L+ P S P N I + + T Y I ++ AI LLN + + Sbjct: 118 METFLKANPVALSKSGVRPFNGRKILRLQSFGFGTAAYLISQQTAITLLNWIREVAPEKL 177 Query: 174 RPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVR 216 PID ++ + I P+L + + A + ST+E+ RL R Sbjct: 178 EPIDNMLFNAASEIPEIQMYQISPALCIQELQLNRADSSLSSTLEDGRLAR 228 >gi|58865502|ref|NP_001011962.1| glycosyltransferase 25 family member 3 [Rattus norvegicus] gi|55249709|gb|AAH85782.1| Cerebral endothelial cell adhesion molecule [Rattus norvegicus] Length = 517 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-------RQCQ 57 V+VISL RR + + + DA+ G + + I H + Sbjct: 244 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRT--LNSSILKHLGVDLLPGYQDPY 301 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + D+ Sbjct: 302 SGRTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLME-DVLTQK 360 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+++ + LPG Sbjct: 361 LSWDLIYLGRKQVNPEEEVAVEGLPG 386 >gi|126729290|ref|ZP_01745104.1| Glycosyl transferase, family 25 [Sagittula stellata E-37] gi|126710280|gb|EBA09332.1| Glycosyl transferase, family 25 [Sagittula stellata E-37] Length = 265 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 42/184 (22%), Positives = 84/184 (45%), Gaps = 9/184 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR-IFSHQKRQCQFKRLLS 63 YVI+L S RR+ + AR +F +A+ G +R ++S +F R L Sbjct: 3 TYVINLERSVDRRQVLTEKLARDSWPHTFVNAVDGRALSAQDRALYSPLLAFLRFGRGLK 62 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI----LIK 119 E+ CY+SH+ ++ + ++LEDD D++ ++ L++ + ++ Sbjct: 63 AGELACYLSHLKCLRQFLATDEPVCLVLEDDVDWTPGTGAVVQELAQVLGQHGEPWEVVN 122 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-PRTTGYFI-GKEAAIHLLNVRKNIYRPID 177 + K+P + + + + + RI P TT + +E A H L+ + I+ P+D Sbjct: 123 LGHVSKRPALAAPIARIAEDNAV--SRIFDFPLTTHALLWNREGARHFLHRSRRIFDPVD 180 Query: 178 MDMK 181 + + Sbjct: 181 VAAR 184 >gi|154174436|ref|YP_001407702.1| putative glycosyltransferase involved in LPS biosynthesis [Campylobacter curvus 525.92] gi|112803649|gb|EAU00993.1| putative glycosyltransferase involved in LPS biosynthesis [Campylobacter curvus 525.92] Length = 260 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 62/239 (25%), Positives = 100/239 (41%), Gaps = 35/239 (14%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 PV++ISL RRE R + + +F+ DA+ G + ++LS Sbjct: 3 PVFLISLAKDAGRREALKQRF-KSYDKFTLIDAVDGRQMNAREYFGYVLPSFRAYGKILS 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-------EFSQLLPH--LSKCDIN 114 E+GC +SH+ ++ S A A++LEDD D E ++ +P L C Sbjct: 62 PSEVGCSLSHVKAYEAFLASEAKFALVLEDDVIGDDVGVAAAFELAEQMPQNSLLICGCQ 121 Query: 115 NILI-KFDALRKKPKKDSYLCTLPGNFDIHQPRILSP----RTTGYFIGKEAAIHLLNVR 169 + L +F A K+ D T P N ++ S R Y + +++A +LL + Sbjct: 122 DGLAGRFSAFGKRVNLD----TNPENRELWLVSRYSHSSIYRAGAYVLSRQSAQNLLRIH 177 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEA------IDTNDSTIEESRLVR--KPTF 220 K D+ WE+ L + +Y + ID + S IE R+ R KP F Sbjct: 178 KRALCTTDV-----WEY---LLRQDDMTMYFSDIFAHPIDLSGSNIEAERIDRGYKPNF 228 >gi|160395571|sp|Q5U309|GT253_RAT RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule Length = 572 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-------RQCQ 57 V+VISL RR + + + DA+ G + + I H + Sbjct: 299 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRT--LNSSILKHLGVDLLPGYQDPY 356 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + D+ Sbjct: 357 SGRTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLME-DVLTQK 415 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+++ + LPG Sbjct: 416 LSWDLIYLGRKQVNPEEEVAVEGLPG 441 >gi|291326788|ref|ZP_06125861.2| putative beta1,4-galactosyltransferase [Providencia rettgeri DSM 1131] gi|291312941|gb|EFE53394.1| putative beta1,4-galactosyltransferase [Providencia rettgeri DSM 1131] Length = 232 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 60/232 (25%), Positives = 104/232 (44%), Gaps = 42/232 (18%) Query: 26 RIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSP 84 + L++ F +A+ G+N + +H C +L+ EIGC +SH+ ++K++A Sbjct: 9 KTKLKYEFINAVNGKNLSDTELKKATHDYPNC----MLTKGEIGCALSHLSIYKKMANEN 64 Query: 85 AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK-----KPKKDSYLCTLPGN 139 A++LEDDA +LPH + I+ I I FD +RK K DSY+ N Sbjct: 65 IEQALVLEDDA--------ILPHNIEDIISQIKI-FDKIRKPNIFLLSKIDSYIRNQNLN 115 Query: 140 ---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM-------DMKHWWEHNIP 189 F ++Q + + Y I +AA +++ ++ I DM + + + H IP Sbjct: 116 DNIFKVYQ----AIGSHAYVINLKAAKNIIKIQSPIKYESDMWRYFRYFNCANIYGH-IP 170 Query: 190 SLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 +LV +S++EE R P YR+ + +Y +R Sbjct: 171 TLVISDDE-----SKLNSSLEEG---RAPLLKARERYRSNLKKMFKNYQYYR 214 >gi|301608466|ref|XP_002933810.1| PREDICTED: procollagen galactosyltransferase 2-like [Xenopus (Silurana) tropicalis] Length = 616 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 23/188 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR + +Q FDA+ G+ + + Q + + + Sbjct: 334 IFMINLKRRQDRRVRMLRSLYEQEIQVKIFDAVDGKA------LNASQLKAMKIEVIPGY 387 Query: 60 ------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 R L+ EIGC++SH ++WK + ++++EDD F F L L DI Sbjct: 388 HDPYSGRTLTSGEIGCFLSHYYIWKEVVDRGLEKSLVIEDDVRFEPLFKHKLMKLMN-DI 446 Query: 114 NNILIKFDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 +++D + RK+ + + +P ++ + S T GY I ++ A L+ Sbjct: 447 EEAEVEWDLIYIGRKRMQVERPEKAVPSVMNLVEAD-YSYWTLGYAISRQGAEKLIAAEP 505 Query: 171 -NIYRPID 177 N P+D Sbjct: 506 FNKMLPVD 513 >gi|325293427|ref|YP_004279291.1| glycosyltransferase 25 family member 1 [Agrobacterium sp. H13-3] gi|325061280|gb|ADY64971.1| Glycosyltransferase 25 family member 1 [Agrobacterium sp. H13-3] Length = 270 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 34/101 (33%), Positives = 46/101 (45%), Gaps = 4/101 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + V++I++ + R E R I L+ + G N PI FS R Sbjct: 11 LAVFLINMDSATKRLEDMNARLDAIGLKAERVPGVNGRELNYPIPE--FSELSYMLMHGR 68 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 S PEIGCY+SH+ R S A A+ILEDD F +F Sbjct: 69 RTSPPEIGCYLSHVACANRFMGSDADIALILEDDVVFEKDF 109 >gi|157885917|emb|CAP09301.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 598 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 5/106 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 VY+I+L RR++ + + + DA+ G+ N R + + Sbjct: 332 VYLINLKRREDRRDRMLRSLEVLGIDVTLTDAVDGKAMNSTQLRTLGIEMLPGFKDPYSD 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 R+L+ EIGC++SH ++WK++ ++LEDD F F L Sbjct: 392 RVLTKGEIGCFLSHYNIWKKVVELQQQQVLVLEDDVRFETNFKSRL 437 >gi|326664713|ref|XP_686329.4| PREDICTED: procollagen galactosyltransferase 2 [Danio rerio] Length = 584 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 5/106 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 VY+I+L RR++ + + + DA+ G+ N R + + Sbjct: 318 VYLINLKRREDRRDRMLRSLEVLGIDVTLTDAVDGKAMNSTQLRTLGIEMLPGFKDPYSD 377 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 R+L+ EIGC++SH ++WK++ ++LEDD F F L Sbjct: 378 RVLTKGEIGCFLSHYNIWKKVVELQQQQVLVLEDDVRFETNFKSRL 423 >gi|261839380|gb|ACX99145.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 52] Length = 278 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 51/184 (27%), Positives = 82/184 (44%), Gaps = 20/184 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCCQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ID M +H+W H + + V E A+ + ++T +S E+ R + P + + +++ Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEKQRPKKLPLSIRIGRFLHKSAVK 266 Query: 232 QWNL 235 QWN+ Sbjct: 267 QWNV 270 >gi|90418037|ref|ZP_01225949.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337709|gb|EAS51360.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 286 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/131 (23%), Positives = 58/131 (44%), Gaps = 3/131 (2%) Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 S+ + ++R ++ PE+ ++SH LW+R A S G I+LEDD F F + + + + Sbjct: 48 SYARLSTSWERPMTRPELAAFLSHRRLWERAAASED-GLIVLEDDTVFGTHFPRAVANAA 106 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + + L+ F+ + ++ T G + YF+ + A LL Sbjct: 107 ASNFD--LVNFETVGRRKFFRKRGGTTVGGVRFTELVREKSGAGAYFVSRNGARALLAKA 164 Query: 170 KNIYRPIDMDM 180 + P+D M Sbjct: 165 ETQAAPVDAFM 175 >gi|148826592|ref|YP_001291345.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittEE] gi|148716752|gb|ABQ98962.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittEE] Length = 278 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSS-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|145629208|ref|ZP_01785007.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.1-21] gi|145639146|ref|ZP_01794753.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittII] gi|144978711|gb|EDJ88434.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.1-21] gi|145271708|gb|EDK11618.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittII] gi|309750727|gb|ADO80711.1| Lipopolysaccharide biosynthesis protein LsgD [Haemophilus influenzae R2866] Length = 257 Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust. Identities = 48/175 (27%), Positives = 76/175 (43%), Gaps = 20/175 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + N + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWNELA-AIFNIEQFKAHYGRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSDEFSQLLPH--------L 108 EIGC +SH+ ++++I I ++ EDDA DF + LL + Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHPDFQKNLTALLAEKLESEIILV 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + INN D P S+LC GN + P + T GY I K AA Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|319776043|ref|YP_004138531.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3047] gi|319896851|ref|YP_004135046.1| udp-glcnac--lipooligosaccharide n-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3031] gi|317432355|emb|CBY80710.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3031] gi|317450634|emb|CBY86854.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3047] Length = 257 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 51/176 (28%), Positives = 75/176 (42%), Gaps = 22/176 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 Y+ISL RR+ F + + F F AI +N IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQ--KNTEDFQIFSAINTMQKNWDELAAIFNIEQFKAYYFRNVT 62 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------ 107 EIGC +SH+ ++++I I A++ EDDA F F Q L Sbjct: 63 KGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHQNFQQNLTALLSEKLEAEIIL 122 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 L + INN D P S+LC GN + P + T GY I K AA Sbjct: 123 LGQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|281349465|gb|EFB25049.1| hypothetical protein PANDA_003209 [Ailuropoda melanoleuca] Length = 569 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 35/142 (24%), Positives = 61/142 (42%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ I R+ Q Sbjct: 296 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSIMRRLGVDLLPGYQDPYSG 355 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++LEDD F F +L+ + + Sbjct: 356 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVLEDDVRFESNFRGRLERLMEEVEAEKLPW 415 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 416 DLIYLGRKQVNPEEEAAVEGLP 437 >gi|40225472|gb|AAH11811.2| CERCAM protein [Homo sapiens] Length = 275 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 2 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 60 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 61 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 118 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 119 LSWDLIYLGRKQVNPEKETAVEGLPG 144 >gi|301758784|ref|XP_002915272.1| PREDICTED: glycosyltransferase 25 family member 3-like [Ailuropoda melanoleuca] Length = 590 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 35/142 (24%), Positives = 61/142 (42%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ I R+ Q Sbjct: 317 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSIMRRLGVDLLPGYQDPYSG 376 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++LEDD F F +L+ + + Sbjct: 377 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVLEDDVRFESNFRGRLERLMEEVEAEKLPW 436 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 437 DLIYLGRKQVNPEEEAAVEGLP 458 >gi|145631596|ref|ZP_01787362.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] gi|144982796|gb|EDJ90322.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] Length = 270 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 51/176 (28%), Positives = 75/176 (42%), Gaps = 22/176 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 Y+ISL RRE F + + F F AI +N IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRELFF--SQKNTEDFQIFSAINTMQKNWDELAAIFNIEQFKAYYFRNVT 62 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSDEFSQLLPH-------- 107 EIGC +SH+ ++++I I ++ EDDA DF + LL Sbjct: 63 KGEIGCTLSHLSVYQKIVEDNDIAENSYTLVCEDDALFHLDFQQNLTALLSEKLEAEIIL 122 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 L + INN D P S+LC GN + P + T GY I K AA Sbjct: 123 LGQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|329123202|ref|ZP_08251771.1| glycosyltransferase [Haemophilus aegyptius ATCC 11116] gi|327471547|gb|EGF16991.1| glycosyltransferase [Haemophilus aegyptius ATCC 11116] Length = 267 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 51/176 (28%), Positives = 75/176 (42%), Gaps = 22/176 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 Y+ISL RR+ F + + F F AI +N IF+ ++ + + R ++ Sbjct: 15 YLISLDKDIQRRKLFFSQ--KNTEDFQIFSAINTMQKNWDELAAIFNIEQFKAYYFRNVT 72 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------ 107 EIGC +SH+ ++++I I A++ EDDA F F Q L Sbjct: 73 KGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHQNFQQNLTALLSEKLEAEIIL 132 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 L + INN D P S+LC GN + P + T GY I K AA Sbjct: 133 LGQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAA 187 >gi|110006057|gb|ABG48531.1| LpsA [Haemophilus influenzae] Length = 256 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 58/208 (27%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +I+ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|261823574|ref|YP_003261680.1| glycosyl transferase family 25 [Pectobacterium wasabiae WPP163] gi|261607587|gb|ACX90073.1| glycosyl transferase family 25 [Pectobacterium wasabiae WPP163] Length = 249 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 58/232 (25%), Positives = 91/232 (39%), Gaps = 45/232 (19%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 + +V++L ++ K + L + A+ G NN + +++ Sbjct: 1 MKTFVVNLEKDKEKKVKIERECRKAELNYEIISAVDGRKINNNELKSKVHPVSLN----- 55 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS EIGC +SH ++K+I A+ILEDD D S D+NN L Sbjct: 56 -YLSKGEIGCALSHQKVYKKIIDDDIEYALILEDDVDISQ------------DVNNFLKD 102 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRIL------------SPRTTGYFIGKEAAIHLLN 167 F +++ K K D +L P +I S GY I +AA ++ Sbjct: 103 FLSVKNKTKGDVFLL-YPSGLRFFNRKIEISNNYFFYEVYNSSCAHGYIISNKAAKKIMR 161 Query: 168 VRKNIYRPIDMDMKHW-WEHNIP-----SLVTEPGAVYEAIDTNDSTIEESR 213 + PI + W W + I +L E Y+ ID + STIE R Sbjct: 162 IN----TPIILVADAWLWFYQISLLKVYALNKELVRAYD-IDKSLSTIETER 208 >gi|145632118|ref|ZP_01787853.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 3655] gi|110006049|gb|ABG48527.1| LpsA [Haemophilus influenzae] gi|144987025|gb|EDJ93555.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 3655] Length = 256 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 51/182 (28%), Positives = 80/182 (43%), Gaps = 24/182 (13%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNITFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEVDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM 178 D+ Sbjct: 174 DV 175 >gi|261837965|gb|ACX97731.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 289 Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust. Identities = 56/207 (27%), Positives = 94/207 (45%), Gaps = 24/207 (11%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFS 98 +N P + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 88 KNPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVK 143 Query: 99 DEFSQLLPHLSKCDINNI-LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR----TT 153 + F + L + IN + I+ L +K K F +IL+ + T Sbjct: 144 ERFKESLEFCYQ-HINELGYIRLMHLEEKVAKQK-------TFIKGVSQILNFKDGIGTQ 195 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++ +S E Sbjct: 196 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTE 254 Query: 211 ESRLVRKPTFSPL--YFYRNTCYQWNL 235 + R + P + +++T QW++ Sbjct: 255 KQRPKKLPLSIRIGRSLHKSTVKQWSV 281 >gi|118602876|ref|YP_904091.1| glycosyl transferase family protein [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] gi|118567815|gb|ABL02620.1| glycosyl transferase, family 25 [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] Length = 262 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 7/108 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ-FKRL 61 + ++VI+L S RR + + L F F DAI + + K+ CQ ++R Sbjct: 1 MNIFVINLENSVERRNFQQEQLFALKLDFEFLDAISTTDVST-----TIHKQYCQDWQRP 55 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 L E+ CY SH W ++ S A+ILEDDA S +LL +LS Sbjct: 56 LKATEVACYFSHRCAWNKVIQSNQ-PALILEDDALLSKCVPELLANLS 102 >gi|254719629|ref|ZP_05181440.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. 83/13] Length = 244 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 37/162 (22%), Positives = 72/162 (44%), Gaps = 3/162 (1%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ EIGC++SH + IA + A+++EDD F+D+ +LL H + ++K + Sbjct: 42 LTPTEIGCFLSHRRCLEAIARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIE 101 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID---M 178 KK + + + + + GY + ++AA ++ + + PID Sbjct: 102 TQGKKVLIGKPIACASTRYSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLF 161 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 + + + + P +A T+ E S+L +KP F Sbjct: 162 NAPYGVFNQLSVYQCTPALCKQAGLTSTLQSERSQLYQKPPF 203 >gi|15645445|ref|NP_207619.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] gi|2313960|gb|AAD07876.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] Length = 273 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K I Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYI 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +DA K + I + T GY I + A L + Sbjct: 152 RLMHLLYDASVKSEPLSHKNHEIQERVGIIKAYSEGVGTQGYVITPKIAKVFLKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|37526907|ref|NP_930251.1| hypothetical protein plu3019 [Photorhabdus luminescens subsp. laumondii TTO1] gi|36786340|emb|CAE15393.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii TTO1] Length = 251 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 43/192 (22%), Positives = 82/192 (42%), Gaps = 28/192 (14%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +++I+L R+ +A+ + L A+ G+ ++ + + + + Sbjct: 1 MKIFIINLESDVERKYSMLQQASSLRLDVEIIKAVNGKQ---LSKDEVMKLSRDFYNNGM 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN------I 116 +L E+GC +SH+ +++RI A+I+EDDA+ + S +L L K + N I Sbjct: 58 TLGELGCSLSHLLVYQRIVDENIPLALIMEDDAEINKNISDVLSALDKFNTKNPNKPNII 117 Query: 117 LIK-----FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 L+ D +K YL + I + T GY I AA LL + Sbjct: 118 LLNKTNEYIDTFKKNITGQYYLVNV----------IEAACTYGYVINNYAAQCLL----D 163 Query: 172 IYRPIDMDMKHW 183 +P+ ++ W Sbjct: 164 FLQPVWLEADKW 175 >gi|262371063|ref|ZP_06064385.1| glycosyl transferase [Acinetobacter johnsonii SH046] gi|262313949|gb|EEY94994.1| glycosyl transferase [Acinetobacter johnsonii SH046] Length = 256 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 44/177 (24%), Positives = 77/177 (43%), Gaps = 14/177 (7%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VISL + ARRE + ++ F FFDA+ P R + + + L+ E Sbjct: 5 VISLKTATARREHIVQEFGKQNIGFEFFDAL----TPDLARPLAEKMQLNVEDEYLTGGE 60 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 + C++SH+ +W+++ I EDD + + S + +IK +A +K Sbjct: 61 LACFMSHVSIWQKMVDEQLPYVAIFEDDVFLGERAFDIFNSSSWIQEDWHIIKTEAFSEK 120 Query: 127 P--KKDSYLCTLPGNFDIHQPRILSPR---TTGYFIGKEAAIHLLN-VRKNIYRPID 177 K+ Y N Q L+ + T GY + + A+ LN ++K+ P+D Sbjct: 121 VLLAKEVYRI----NDTKRQITRLTGKNLGTAGYILSLKGAMAYLNYIKKSSLIPLD 173 >gi|312066988|ref|XP_003136531.1| hypothetical protein LOAG_00943 [Loa loa] gi|307768311|gb|EFO27545.1| hypothetical protein LOAG_00943 [Loa loa] Length = 544 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 43/169 (25%), Positives = 77/169 (45%), Gaps = 8/169 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRI-FSHQKRQCQFKR 60 +YVI+L R+ K + ++++++A G ++ P+ I F +KR Sbjct: 295 IYVINLERRKTRKVKMMELLKLMGFEYTWWEATDGHHLDSEPLYREIKFLPGYEDPFYKR 354 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + E+GC++SH +W+ + II EDD F + LL L + DI++ I++ Sbjct: 355 PMKAGEVGCFLSHYRIWQEVDKKKLDRVIIFEDDLRFVVNSTDLLKELIE-DIDSSRIEW 413 Query: 121 DA--LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 D L +K + + +PG+ + S T GY + A LL Sbjct: 414 DLVYLGRKRLEGANENWVPGHRHLSTVD-YSYWTLGYMLSLNGARKLLG 461 >gi|68249147|ref|YP_248259.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 86-028NP] gi|68057346|gb|AAX87599.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 86-028NP] Length = 312 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 51/187 (27%), Positives = 73/187 (39%), Gaps = 18/187 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 25 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSISIL 84 Query: 57 ---QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LS 109 + R+L+ E GC ISH LW + I EDD + L + Sbjct: 85 HNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKT 144 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + D N+I I +P K +P NFDI + T GY I + AA ++ Sbjct: 145 RFDFNDIFIIRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHWG--TAGYIISQGAAKYV 202 Query: 166 LNVRKNI 172 + KNI Sbjct: 203 IEYLKNI 209 >gi|2497668|sp|Q57125|Y765_HAEIN RecName: Full=Putative glycosyltransferase HI_0765 gi|1573773|gb|AAC22423.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] Length = 282 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 94 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 152 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 153 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 199 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 200 DVAVDSLVFEDFLHFKDYKIVQL--SPG 225 >gi|254429889|ref|ZP_05043596.1| LPS glycosyltransferase subfamily, putative [Alcanivorax sp. DG881] gi|196196058|gb|EDX91017.1| LPS glycosyltransferase subfamily, putative [Alcanivorax sp. DG881] Length = 271 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 3/106 (2%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC--QFKRLL 62 V VI++ RR + + I L+F F D + + C ++K +R L Sbjct: 22 VTVIAMKHEIERRNTIENNLSAIGLRFRFIDGLNYDETTTCKIKLGYRKEMSVKTIQREL 81 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + PEIGC++SH W + A +ILE DA E +++ L Sbjct: 82 TAPEIGCFVSHREAWIQ-ASRKNRPTLILESDALLDSESVEIIERL 126 >gi|145638540|ref|ZP_01794149.1| diadenosine tetraphosphatase [Haemophilus influenzae PittII] gi|145272135|gb|EDK12043.1| diadenosine tetraphosphatase [Haemophilus influenzae PittII] Length = 308 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 72/183 (39%), Gaps = 14/183 (7%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKR---QCQ 57 I VIS+ + RR+ + L FSFF+A Y N N+ + + Sbjct: 25 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSNSILHNIE 84 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDI 113 R+L+ E GC ISH LW + I EDD + L ++ D Sbjct: 85 ESRILTKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDF 144 Query: 114 NNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 N+I I +P K +P NFDI + T GY I + AA +++ Sbjct: 145 NDIFIIRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHWG--TAGYIISQGAAKYVIEYL 202 Query: 170 KNI 172 KNI Sbjct: 203 KNI 205 >gi|148652794|ref|YP_001279887.1| glycosyl transferase family protein [Psychrobacter sp. PRwf-1] gi|148571878|gb|ABQ93937.1| glycosyl transferase, family 25 [Psychrobacter sp. PRwf-1] Length = 254 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 44/212 (20%), Positives = 75/212 (35%), Gaps = 25/212 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFK 59 M + Y+I+L S R E+ + + + F A G + + + ++ + Sbjct: 1 MKVLTYLINLDGSDKRLERSTEQLQQQNWDFERISAYDGRGKALSEFVDYDDEQTRSNLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L EIGCY+SH ++ + A I+LEDD S F+ ++ Sbjct: 61 RSLLNSEIGCYLSHYQCAQKFLETDADMLIVLEDDMKMSSAFAPVVKE-----------A 109 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------------PRTTGYFIGKEAAIHLL 166 L P+ D YL + I + R G ++ A + Sbjct: 110 VSYLYTHPELDWYLVNIAAKKKKLAKDITTINEHTLWHAYYFPIRGLGLIWSRKGAEEFV 169 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 K I P+D+ + W N L P + Sbjct: 170 KHGKPITMPVDIFFQRWLSQNGKGLGVWPAVI 201 >gi|33152027|ref|NP_873380.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Haemophilus ducreyi 35000HP] gi|33148249|gb|AAP95769.1| possible lipooligosaccharide N-acetylglucosamine glycosyltransferase [Haemophilus ducreyi 35000HP] Length = 264 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 10/117 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE+F +A Q F + + E + + + Q Q +++R ++ Sbjct: 5 YLISLAKDQQRREQFFTQADTADFQVFDAVNTMQQEWDSLAVKYDITQFAQ-RYQRSVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSDEFSQLLPHLSKCDI 113 EIGC +SH+ +++ I P IG ++ EDDA D +QLL DI Sbjct: 64 GEIGCTLSHLGVYQWIVADPQIGDNDYCLVCEDDALFNQDVQSSLTQLLSQNVHADI 120 >gi|30995400|ref|NP_438924.2| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] Length = 278 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|119608194|gb|EAW87788.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_b [Homo sapiens] Length = 539 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 266 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 324 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 325 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 382 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 383 LSWDLIYLGRKQVNPEKETAVEGLPG 408 >gi|193788560|ref|NP_057258.3| glycosyltransferase 25 family member 3 precursor [Homo sapiens] gi|74744901|sp|Q5T4B2|GT253_HUMAN RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|55958827|emb|CAI13494.1| cerebral endothelial cell adhesion molecule [Homo sapiens] gi|119608193|gb|EAW87787.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_a [Homo sapiens] Length = 595 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 380 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 381 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 438 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 439 LSWDLIYLGRKQVNPEKETAVEGLPG 464 >gi|22760015|dbj|BAC11036.1| unnamed protein product [Homo sapiens] gi|22760023|dbj|BAC11040.1| unnamed protein product [Homo sapiens] gi|55958829|emb|CAI13496.1| cerebral endothelial cell adhesion molecule [Homo sapiens] gi|111185706|gb|AAI19699.1| Cerebral endothelial cell adhesion molecule [Homo sapiens] gi|119608195|gb|EAW87789.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_c [Homo sapiens] gi|127802779|gb|AAH98432.2| Cerebral endothelial cell adhesion molecule [Homo sapiens] Length = 517 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 302 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 303 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 360 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 361 LSWDLIYLGRKQVNPEKETAVEGLPG 386 >gi|301611908|ref|XP_002935453.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1-B-like [Xenopus (Silurana) tropicalis] Length = 610 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 42/174 (24%), Positives = 76/174 (43%), Gaps = 14/174 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQ---KRQCQFK 59 +++I+L RRE+ + + + DA+YG+ N +++ + Sbjct: 331 IFLINLKHRQDRRERMKRTLYELQIDYKLVDAVYGKTLNQTQVDKMGIKMLPGYKDPYHG 390 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK I+ + + EDD F F + L L D+ + Sbjct: 391 RPLTRGEMGCFLSHYNIWKEISERSLEASAVFEDDLRFEIFFKRRLQTLLH-DLEVAKLD 449 Query: 120 FDAL---RKKPKKDSYLCTLPG--NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +D + RK+ + + +PG N + S T GY I A LL+ Sbjct: 450 WDLIYLGRKRMQVEEPEEPVPGVRNLVVSD---YSYWTLGYLISLRGARKLLDA 500 >gi|114626942|ref|XP_001157210.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Pan troglodytes] Length = 595 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 11/144 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + Q Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEKLS 440 Query: 120 FDAL---RKK--PKKDSYLCTLPG 138 +D + RK+ P+K++ + LPG Sbjct: 441 WDLIYLGRKQVNPEKETAVEGLPG 464 >gi|119608196|gb|EAW87790.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_d [Homo sapiens] Length = 534 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 261 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 319 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 320 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 377 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 378 LSWDLIYLGRKQVNPEKETAVEGLPG 403 >gi|151337713|gb|ABS01288.1| glycosyltransferase [Haemophilus influenzae] Length = 278 Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|319775527|ref|YP_004138015.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae F3047] gi|329122430|ref|ZP_08251017.1| lipooligosaccharide biosynthesis protein lex-1 [Haemophilus aegyptius ATCC 11116] gi|317450118|emb|CBY86332.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae F3047] gi|327473712|gb|EGF19131.1| lipooligosaccharide biosynthesis protein lex-1 [Haemophilus aegyptius ATCC 11116] Length = 290 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 52/187 (27%), Positives = 73/187 (39%), Gaps = 18/187 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSNSIL 62 Query: 57 ---QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LS 109 + R+L+ E GC ISH LW + I EDD + L + Sbjct: 63 HNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKT 122 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + D N+I I +P K +P NFDI + T GY I + AA +L Sbjct: 123 RFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYIISQGAAKYL 180 Query: 166 LNVRKNI 172 + KNI Sbjct: 181 IEYLKNI 187 >gi|109947473|ref|YP_664701.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] gi|109714694|emb|CAJ99702.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] Length = 281 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 82/184 (44%), Gaps = 20/184 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLP----HLSKCDINNI 116 + E+GCY SH LW++ + + A+ ILEDD D F + L H+++ + Sbjct: 97 MGFGELGCYASHYSLWQKCVELNEAV--CILEDDITLKDNFKESLKFCYQHINELGYIRL 154 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 + + + KK + + D T GY + +AA LL K P Sbjct: 155 MHLEENVAKKKTSIKGVSQILNFKD-------GIGTQGYVLAPKAAQKLLKYSAKKWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ID M +H+W H + + V E A++ + ++ +S E+ R + P + +++ Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIFCDEMNAQNSNTEKQRPKKLPLSIRIGRSLHKSAIK 266 Query: 232 QWNL 235 QWN+ Sbjct: 267 QWNI 270 >gi|269965925|ref|ZP_06180018.1| conserved hypothetical protein [Vibrio alginolyticus 40B] gi|269829478|gb|EEZ83719.1| conserved hypothetical protein [Vibrio alginolyticus 40B] Length = 241 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 46/204 (22%), Positives = 87/204 (42%), Gaps = 12/204 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR----QCQF 58 + V+VI++ S RR + + + ++ F F +AI ++ N ++S + + +F Sbjct: 1 MKVFVINMKRSVDRRVRMENALNKANIPFEFIEAI---DSSAPNFLYSERSNDDLTRKRF 57 Query: 59 KRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 L EI C+ SH W++ + + I ++LED+ D EF + H Sbjct: 58 GYRLVENEIACFSSHHLAWEKCLELNEPI--LVLEDNCDLLPEFFDVFSHFDTLAEQYDF 115 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 +K A KPK + ++ +I + + GY + AA + P+D Sbjct: 116 LKLAA--TKPKSFKVISSVNNKLNIVRFNKRTCGIMGYILTPHAASKFIKNATQFIEPVD 173 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEA 201 M+ ++H + + V P V A Sbjct: 174 DYMEKPYKHGVSTYVLRPDIVKRA 197 >gi|111185604|gb|AAI19700.1| Cerebral endothelial cell adhesion molecule [Homo sapiens] Length = 517 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 302 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 303 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 360 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 361 LSWDLIYLGRKQVNPEKETAVEGLPG 386 >gi|260579856|ref|ZP_05847686.1| conserved hypothetical protein [Haemophilus influenzae RdAW] gi|260093140|gb|EEW77073.1| conserved hypothetical protein [Haemophilus influenzae RdAW] Length = 282 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 94 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 152 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 153 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 199 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 200 DVAVDSLVFEDFLHFKDYKIVQL--SPG 225 >gi|83318248|gb|AAI08699.1| CERCAM protein [Homo sapiens] Length = 558 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 380 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 381 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 438 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 439 LSWDLIYLGRKQVNPEKETAVEGLPG 464 >gi|254474380|ref|ZP_05087766.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] gi|214028623|gb|EEB69458.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] Length = 283 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 32/116 (27%), Positives = 58/116 (50%), Gaps = 4/116 (3%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L +I + SH+ WKRIA S G ++LEDD FSD+F ++ L + +++ D Sbjct: 62 LPASDIEIFRSHMDCWKRIAASGQPG-LVLEDDLLFSDDFGRIASEL-MANAPKGIVRLD 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 A+ + + T G + ++Q + L+P Y + + A ++ + I R +D Sbjct: 120 AVSTPMLRRAGTATDCG-YQLNQIKTLAPSAAAYVVDPDTAANMAATAR-IERTVD 173 >gi|170581569|ref|XP_001895737.1| LPS glycosyltransferase family protein [Brugia malayi] gi|158597202|gb|EDP35413.1| LPS glycosyltransferase family protein [Brugia malayi] Length = 429 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 5/123 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRI-FSHQKRQCQFKR 60 +YVI+L + R+ K + +++++ A G + P+ + F +KR Sbjct: 292 IYVINLERRNERKAKMMELLKLMGFEYTWWKATDGHHLDLEPLYKEVKFLPGYEDPYYKR 351 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + E+GC++SH +W+ + II EDD F + LL L + DI++I I++ Sbjct: 352 PMKAGEVGCFLSHYRIWQEVDEKKLDRVIIFEDDLRFVVNSTILLKELIE-DIDSIEIEW 410 Query: 121 DAL 123 D + Sbjct: 411 DLI 413 >gi|317014215|gb|ADU81651.1| family 25 glycosyl transferase [Helicobacter pylori Gambia94/24] Length = 281 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 56/207 (27%), Positives = 89/207 (42%), Gaps = 23/207 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F H + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFHAIKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDITL 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + F + L + IN + L+ + K K + NF T Sbjct: 132 KECFKESL-EFCRQHINELGYIRLMHLEENVAKQKTPVKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++ +S E Sbjct: 185 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTE 243 Query: 211 ESRLVRKPTFSPL--YFYRNTCYQWNL 235 + R + P + ++ QWN+ Sbjct: 244 KQRPKKLPLSIRIGRSLHKGAIKQWNI 270 >gi|261315764|ref|ZP_05954961.1| glycosyl transferase [Brucella pinnipedialis M163/99/10] gi|261304790|gb|EEY08287.1| glycosyl transferase [Brucella pinnipedialis M163/99/10] Length = 243 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 43/158 (27%), Positives = 69/158 (43%), Gaps = 14/158 (8%) Query: 26 RIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSP 84 R+ QF +A+ G +P+ F+ ++ + LS EIGC++SH ++IA Sbjct: 7 RLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSPAEIGCFLSHRKCLEKIAAGE 64 Query: 85 AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-----LRKKPKKDSYLCTLPGN 139 A + EDD S S+ L ++K DA L P K+ G Sbjct: 65 DAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYGHEVLISNPVKNE------GP 118 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + I + R +T GY + +EAA LL + + P+D Sbjct: 119 YSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 156 >gi|52424960|ref|YP_088097.1| hypothetical protein MS0905 [Mannheimia succiniciproducens MBEL55E] gi|52307012|gb|AAU37512.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 261 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 32/111 (28%), Positives = 56/111 (50%), Gaps = 8/111 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRLLS 63 +++SL RRE F + R F F+AI + + +F K + R ++ Sbjct: 8 FLVSLEKDIQRRELFFSQ--RNTQDFEVFNAINTMTQDLTSLGNLFDIIKFAQYYGRNVT 65 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSK 110 EIGC +SH+ ++++IA I A++ EDDA F++ F Q++ + K Sbjct: 66 KGEIGCTLSHLAIYQKIADDETINERDYALVCEDDALFAENFQQVIQEIVK 116 >gi|110006081|gb|ABG48543.1| LpsA [Haemophilus influenzae] Length = 256 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|254708263|ref|ZP_05170091.1| glycosyl transferase family protein [Brucella pinnipedialis M163/99/10] Length = 245 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 43/158 (27%), Positives = 69/158 (43%), Gaps = 14/158 (8%) Query: 26 RIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSP 84 R+ QF +A+ G +P+ F+ ++ + LS EIGC++SH ++IA Sbjct: 9 RLGAQFERVEAVNGRAMSPLELASFTQISKE--WPAPLSPAEIGCFLSHRKCLEKIAAGE 66 Query: 85 AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA-----LRKKPKKDSYLCTLPGN 139 A + EDD S S+ L ++K DA L P K+ G Sbjct: 67 DAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYGHEVLISNPVKNE------GP 120 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + I + R +T GY + +EAA LL + + P+D Sbjct: 121 YSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVD 158 >gi|209696428|ref|YP_002264359.1| putative glycosyltransferase [Aliivibrio salmonicida LFI1238] gi|208010382|emb|CAQ80725.1| putative glycosyltransferase [Aliivibrio salmonicida LFI1238] Length = 231 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 4/96 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICN--RIFSHQKRQCQFK 59 I V V+SL S RR R + + F F+DA I + ++F ++ + + Sbjct: 2 IKVIVLSLKDS-PRRAIITERLKKRGITDFEFYDAFDARAMEISDLEKLFDVKRFRDTYG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA 95 R + EIGC +SH+ +WKRI+ S ++LEDDA Sbjct: 61 REPARGEIGCTLSHLGIWKRISESDCENWMVLEDDA 96 >gi|332970427|gb|EGK09419.1| glycosyltransferase [Kingella kingae ATCC 23330] Length = 263 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 59/228 (25%), Positives = 93/228 (40%), Gaps = 22/228 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI---YGENNPICNRIFSHQKRQCQFK 59 I Y+ISL RR +A R L +FFDA+ I + + ++ + + Sbjct: 2 ISTYIISLASETQRRAHMKAQAERYQLNAAFFDAVDMRQATQTDIEHLSVLPKHKKPKKQ 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPH-----LSKCDI 113 R LS E+GC +SH +++ + A ILEDDA F + LLP ++ D Sbjct: 62 RWLSKGELGCALSHHQIYQEMINKQLDYAFILEDDARFLQSPKALLLPENLRKIAAQYDF 121 Query: 114 NNILIKFDAL---------RKKPKKDSYLCTLP-GNFDIHQP-RILSPRTTGYFIGKEAA 162 + +++ + R+ P K LP P Y I K+ A Sbjct: 122 DILILGYVKTLEHQLPYYHRRIPIKKRATLQLPEQTIQFGTPWEQYGCGAVAYVITKKGA 181 Query: 163 IHLLNVRKNIYRPIDMDMKHWWEH-NIPSLVTEPGAVYEAIDTNDSTI 209 LLN+ + P D D ++ +H + L P V E ++ STI Sbjct: 182 EKLLNITQKPCVPAD-DWLYFEQHCGVKVLHARPTFVLEDLEQLISTI 228 >gi|194388556|dbj|BAG60246.1| unnamed protein product [Homo sapiens] Length = 548 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 275 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 333 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 334 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 391 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 392 LSWDLIYLGRKQVNPEKETAVEGLPG 417 >gi|315454038|ref|YP_004074308.1| glycosyltransferase [Helicobacter felis ATCC 49179] gi|315133090|emb|CBY83718.1| Jhp0562-like glycosyltransferase [Helicobacter felis ATCC 49179] Length = 290 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 66/254 (25%), Positives = 99/254 (38%), Gaps = 52/254 (20%) Query: 1 MPIPVYVISLPFSHARREKFCH--------RAARIHLQFSFFDAI----------YGENN 42 M +PVY ISL H RR K C + + L+F FDAI G ++ Sbjct: 1 MVLPVYFISLK-DHPRR-KHCQSIIDNPPAQEGDLKLEFHLFDAIDKHSCYFKKDLGYSS 58 Query: 43 PICNRIFSHQKRQCQF-----KRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDAD 96 I N S ++ ++ R L L E+GCY SH LW R I + I +ILEDD Sbjct: 59 LIENSYSSEWLKESEWFRGAHGRELLLEELGCYASHYMLWLRCIELNCPI--VILEDDVV 116 Query: 97 FSDEFSQLLPHL--SKCDINNILIKFDALRKK-------------PKKDSYLCTLPGNFD 141 F + L H S D + F ++ K P L +F Sbjct: 117 LKPHFYESLVHCMQSPFDFVRLFATFWEIKTKKSVIAHTGGTQVLPTDSRIEVILKEHFY 176 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV--- 198 + + + Y++ +AA ++ + P+D + H IP+L P +V Sbjct: 177 LSALEVFA--AAAYYLTPKAAKAFVSASLHFTEPVDNLLTLVHTHRIPNLTYIPLSVGFS 234 Query: 199 ----YEAIDTNDST 208 AI ++D T Sbjct: 235 KHHSISAISSDDPT 248 >gi|332832911|ref|XP_003312339.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2 [Pan troglodytes] Length = 548 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 11/144 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + Q Sbjct: 275 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 334 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ + Sbjct: 335 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEKLS 393 Query: 120 FDAL---RKK--PKKDSYLCTLPG 138 +D + RK+ P+K++ + LPG Sbjct: 394 WDLIYLGRKQVNPEKETAVEGLPG 417 >gi|290989828|ref|XP_002677539.1| predicted protein [Naegleria gruberi] gi|284091147|gb|EFC44795.1| predicted protein [Naegleria gruberi] Length = 665 Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust. Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 17/119 (14%) Query: 5 VYVISLPFSHARREKFCHRA---ARIHLQFSFFDAIYGENNPICNR-IFSH--------- 51 +YVI+L RR+K R I ++FF+ ++GE + ++ IF+H Sbjct: 387 IYVINLKIREDRRQKSIRRLNTLGGIFSNYTFFEPVFGE--ALTDQDIFNHVSITTYKAI 444 Query: 52 -QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 + R ++ L + +GCY++H ++WK + +I EDD + + + +++ + S Sbjct: 445 LEGRSLDYQ-LSTKGGVGCYLTHTNIWKDMIEKGYERILIFEDDFEITSPYDEIMTYFS 502 >gi|309751574|gb|ADO81558.1| Lipooligosaccharide glucosyltransferase LpsA [Haemophilus influenzae R2866] Length = 278 Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 90 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 148 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 149 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 195 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 196 DVAVDSLVFEDFLHFKDYKIVQL--SPG 221 >gi|145636679|ref|ZP_01792346.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittHH] gi|145638436|ref|ZP_01794046.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittII] gi|145270205|gb|EDK10141.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittHH] gi|145272765|gb|EDK12672.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittII] Length = 282 Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 94 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 152 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 153 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 199 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 200 DVAVDSLVFEDFLHFKDYKIVQL--SPG 225 >gi|309751470|gb|ADO81454.1| Probable lipooligosaccharide biosynthesis protein [Haemophilus influenzae R2866] Length = 280 Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 19/183 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MP+ YVISL + RR+ + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRQHVMNEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNG 56 Query: 61 L-LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 L+ E GC ISH+ LW + I EDD + L S+ + ++ Sbjct: 57 TSLTDGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDD 116 Query: 116 ILIKFDALRKKPKKDSYLC-TLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLNVR 169 I I +R + +C LP L+ + T GY I AA +LL + Sbjct: 117 IFI----IRLETFLQETICEELPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIF 172 Query: 170 KNI 172 KN+ Sbjct: 173 KNM 175 >gi|317131936|ref|YP_004091250.1| glycosyl transferase family 25 [Ethanoligenens harbinense YUAN-3] gi|315469915|gb|ADU26519.1| glycosyl transferase family 25 [Ethanoligenens harbinense YUAN-3] Length = 294 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 44/157 (28%), Positives = 67/157 (42%), Gaps = 19/157 (12%) Query: 31 FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 F+ D + E NP+ S + +K ++ EI +SHI LWK+IA +I Sbjct: 102 FTLADQLQVEPNPLIQIDESSKA----YKIKMTPQEIAIALSHIELWKKIASDNIPYTLI 157 Query: 91 LEDDADFSDEFSQLLPHLSKCDINN---------ILIKFDALRKKPKKDSYLCTLPGNFD 141 LEDD F F+ L IN + + F + KPK + L Sbjct: 158 LEDDVYFERGFANNLDAAWTDIINKSSQPFTFDILFLSFQEVGIKPKTRIHEDGL----- 212 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +H+P + +GY + K A LLN+ Y P+D+ Sbjct: 213 VHKPNCGIWQASGYVLSKVGAQKLLNMLP-AYGPVDL 248 >gi|68249361|ref|YP_248473.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 86-028NP] gi|68057560|gb|AAX87813.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 86-028NP] Length = 282 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNIALDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 94 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 152 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK ++ + +++ + GY + + A +LL + KN +P+ Sbjct: 153 M-----FFKQPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 199 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 200 DVAVDSLVFEDFLHFKDYKIVQL--SPG 225 >gi|309750044|gb|ADO80028.1| Lipooligosaccharide biosynthesis protein Lic2A [Haemophilus influenzae R2866] Length = 278 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 49/179 (27%), Positives = 72/179 (40%), Gaps = 14/179 (7%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 I VIS+ + RR+ + L FSFF+A ++ N I + + R+ Sbjct: 3 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSNSILHN----IEESRI 58 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINNIL 117 L+ E GC ISH LW + I EDD + L ++ D N+I Sbjct: 59 LTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIF 118 Query: 118 IKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 I +P K +P NFDI + T GY I + AA +++ KNI Sbjct: 119 IIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYIISQGAAKYVIEYLKNI 175 >gi|85059128|ref|YP_454830.1| hypothetical protein SG1150 [Sodalis glossinidius str. 'morsitans'] gi|84779648|dbj|BAE74425.1| conserved hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 248 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 80/182 (43%), Gaps = 34/182 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +++I+L S RR+ ++ R +L + F +A+ G + I H + R Sbjct: 1 MKIFIINLKESFRRRQCIEYQCRRYNLDYEFTEAVNGHSLS-SEEIVQHTRTINYATRP- 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 EIGC +SHI++++ I A+ILEDDA + E +P LI+ A Sbjct: 59 --GEIGCALSHIYIYRLICERGLEQALILEDDAKITPE---AIP----------LIQQIA 103 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 LR P + P + T ++ K IH+L+ +++R ID M H Sbjct: 104 LRNDPNR---------------PLLTLLTTCNQYLKK--PIHVLDKTHSLHRVIDAAMTH 146 Query: 183 WW 184 + Sbjct: 147 GY 148 >gi|317009185|gb|ADU79765.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori India7] Length = 281 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 51/184 (27%), Positives = 82/184 (44%), Gaps = 20/184 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCYQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPIKGVSQILNFKD------GIGTQGYVLAPKAAQKLLEYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ID M +H+W H + + V E A+ + ++ +S E+ R + P + + +++T Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAISCDGMNAQNSNTEKQRPKKLPLSIRIGRFLHKSTIK 266 Query: 232 QWNL 235 QWN+ Sbjct: 267 QWNI 270 >gi|217034149|ref|ZP_03439569.1| hypothetical protein HP9810_868g42 [Helicobacter pylori 98-10] gi|216943433|gb|EEC22889.1| hypothetical protein HP9810_868g42 [Helicobacter pylori 98-10] Length = 300 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 51/184 (27%), Positives = 82/184 (44%), Gaps = 20/184 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCYQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPIKGVSQILNFKD------GIGTQGYVLAPKAAQKLLEYSAKKWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ID M +H+W H + + V E A+ + ++T +S E+ R + P + + +++ Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEKQRPKKLPLSIRIGRFLHKSAVK 266 Query: 232 QWNL 235 QWN+ Sbjct: 267 QWNV 270 >gi|110006069|gb|ABG48537.1| LpsA [Haemophilus influenzae] gi|110006071|gb|ABG48538.1| LpsA [Haemophilus influenzae] gi|110006073|gb|ABG48539.1| LpsA [Haemophilus influenzae] gi|110006075|gb|ABG48540.1| LpsA [Haemophilus influenzae] gi|110006079|gb|ABG48542.1| LpsA [Haemophilus influenzae] Length = 256 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|315634104|ref|ZP_07889393.1| glycosyltransferase [Aggregatibacter segnis ATCC 33393] gi|315477354|gb|EFU68097.1| glycosyltransferase [Aggregatibacter segnis ATCC 33393] Length = 256 Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust. Identities = 49/180 (27%), Positives = 80/180 (44%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + + FS + + E + + +F+ K + + R ++ Sbjct: 4 YLISLDKDVQRRELFFSQPNTADFEVFSAINTMQKEWDALA-EVFNPTKFEQHYGRNVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 EIGC +SH+ +++RI I A+I EDDA F+ + S+ L C + ILI Sbjct: 63 GEIGCTLSHLEVYRRIVADERIAETDYALICEDDALFNADLSEKTTALLTQHCSADIILI 122 Query: 119 ------KFDALRKK---PKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLNV 168 F+ + + P S+L + + P + T Y I K AA L+V Sbjct: 123 GQSKIATFNDVELEINYPTTFSFLRQKVADVTVAYPYKSYFAGTVAYLIKKSAARRFLDV 182 >gi|7959265|dbj|BAA96026.1| KIAA1502 protein [Homo sapiens] Length = 560 Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 64/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 321 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYS- 379 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 380 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 437 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 438 LSWDLIYLGRKQVNPEKETAVEGLPG 463 >gi|119385481|ref|YP_916537.1| glycosyl transferase family protein [Paracoccus denitrificans PD1222] gi|119375248|gb|ABL70841.1| glycosyl transferase, family 25 [Paracoccus denitrificans PD1222] Length = 262 Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust. Identities = 59/246 (23%), Positives = 91/246 (36%), Gaps = 9/246 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 +PVY+I+L S R + + F A G I + R Sbjct: 11 VPVYLINLDGSDERLRSATRQLDEAGIPFERVPAFDGRALRIEEFPDYDPAGAMAYMGRP 70 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS----KCDINNIL 117 L EIGCY+SH+ +R S A ++ EDD F++ L LS + D + L Sbjct: 71 LRGGEIGCYLSHLDCARRFLDSGAEYGVVFEDDMQLKPGFAKGLRILSDWLDRHDRDWDL 130 Query: 118 IKFDALRKKPKKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I A + K + G + D+ + TTG ++ A ++ + I + Sbjct: 131 INIGAGQHKIFTPVMGFEVAGRHHDLTRAHYFPMTTTGLIWSRQGAETFVSTHRRITAGV 190 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS-PLYFYRNTCYQWNL 235 D +HW + L P V +S IE+ + R + PLY Q Sbjct: 191 DNHFRHWLTRSDRGLAVWPPLV--TTTGVESQIEDGKGKRSASGRHPLYGLIKQRRQLIN 248 Query: 236 HYNAWR 241 AWR Sbjct: 249 KLIAWR 254 >gi|110006077|gb|ABG48541.1| LpsA [Haemophilus influenzae] Length = 256 Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 34/208 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNIALDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLP--------HLSKCDINNI 116 EIGC +SHI LW +A + I I EDD + +LL H+ K + N Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDIHLGENAKELLEIDYISDDIHVLKLEANGK 126 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + K+PK S C + +++ + GY + + A +LL + KN +P+ Sbjct: 127 M-----FFKQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPL 173 Query: 177 DM--------DMKHWWEHNIPSLVTEPG 196 D+ D H+ ++ I L PG Sbjct: 174 DVAVDSLVFEDFLHFKDYKIVQL--SPG 199 >gi|329756902|gb|AEC04686.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 75/183 (40%), Gaps = 19/183 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MP+ YVISL + RR + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRRHVMNEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNG 56 Query: 61 L-LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 L+ E GC ISH+ LW + I EDD + L S+ + ++ Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARAFLAEDEWLFSRFNCDD 116 Query: 116 ILIKFDALRKKPKKDSYLC-TLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLNVR 169 I I +R + +C LP L+ + T GY I AA +LL + Sbjct: 117 IFI----IRLETFLQETICEALPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIF 172 Query: 170 KNI 172 KN+ Sbjct: 173 KNM 175 >gi|260797405|ref|XP_002593693.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae] gi|229278921|gb|EEN49704.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae] Length = 384 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 29/124 (23%), Positives = 57/124 (45%), Gaps = 6/124 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSH---QKRQCQFK 59 ++VI+L RR++ H I L F +A+ G N + ++ + + Sbjct: 262 IFVINLKRRPERRKRMVHTLKEIGLDFKLMEAVDGLTLNASVLKKMGVTVLPEYKDPWAD 321 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH +W+ I ++ EDD F F + + + +I I + Sbjct: 322 RSMTMGEIGCFLSHYKIWEEIVEKNLDWVLVFEDDIRFEPFFKRRMYKMLN-EIEEIRLD 380 Query: 120 FDAL 123 +D + Sbjct: 381 WDLM 384 >gi|317014039|gb|ADU81475.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Gambia94/24] Length = 332 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 77/330 (23%), Positives = 117/330 (35%), Gaps = 82/330 (24%) Query: 3 IPVYVISLPFSHAR--REKFC---HRAARIHLQFSFFDAI---YGENNPICNRIFSHQK- 53 I VY+ISL S R EKF + + F FDAI + + + +++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKFVLESNEKFKGRCVFQIFDAISPKHQDFEKLLQELYNAQSL 61 Query: 54 -----RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-- 106 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 62 LQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVTLESHFMQALEDC 120 Query: 107 -----------------HLSKCDI----------------NNILIKFDA----------- 122 H +K + N IL KF Sbjct: 121 LKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNLS 180 Query: 123 -------LRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAAIHLLNV--RK 170 + KK +K Y F + + S T GY++ + A + R Sbjct: 181 THKVIHYILKKIQKSYYATHEKETFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATERF 240 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTFS 221 I P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 241 KIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPQKSTFK 300 Query: 222 PLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 301 NLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|188527351|ref|YP_001910038.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143591|gb|ACD48008.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 278 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 51/181 (28%), Positives = 81/181 (44%), Gaps = 14/181 (7%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LIK 119 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + I+ Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKEYFKESLEFCYQ-HINELGYIR 153 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDM 178 L + K L + I T GY + +AA LL K PID Sbjct: 154 LMHLEENVAKQKTLIKGVSQILNFKDGI---GTQGYVLAPKAAQKLLKYSTKEWVMPIDC 210 Query: 179 DM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWN 234 M +H+W H + + V E A+ + ++T +S E+ R + P + +++ QWN Sbjct: 211 VMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEKQRPKKLPLSIRIGRSLHKSALKQWN 269 Query: 235 L 235 + Sbjct: 270 V 270 >gi|313143720|ref|ZP_07805913.1| lipooligosaccharide 5G8 epitope biosynthesis-protein [Helicobacter cinaedi CCUG 18818] gi|313128751|gb|EFR46368.1| lipooligosaccharide 5G8 epitope biosynthesis-protein [Helicobacter cinaedi CCUG 18818] Length = 250 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 43/168 (25%), Positives = 70/168 (41%), Gaps = 18/168 (10%) Query: 33 FFDAIYGENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIIL 91 FF+AI + N + +S K + LS E C+ SH LW++ S I+L Sbjct: 20 FFNAINAQAKEHLNFKQYSKIKSILFRGKELSDGERACFASHYTLWQKCVESNE-PIIVL 78 Query: 92 EDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-- 149 EDD + F Q L + + + + YL T +D+ + + Sbjct: 79 EDDVEILPHFWQELERIEQSAFAYVRL------------MYLITKAKFYDLQKGFHFTFN 126 Query: 150 --PRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 T GY++ AA + K+ YRP+D M ++ H IP + +P Sbjct: 127 NVAGTQGYYLTPTAAKAFIESAKSWYRPVDDYMDMFYIHKIPIVCVKP 174 >gi|297270127|ref|XP_001111820.2| PREDICTED: glycosyltransferase 25 family member 3-like [Macaca mulatta] Length = 714 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 11/144 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + Q Sbjct: 441 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 500 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ + Sbjct: 501 RTLTKGEVGCFLSHYSIWEEVVARGLSQVLVFEDDVRFESNFRGRLERLME-DVEAEKLP 559 Query: 120 FDAL---RKK--PKKDSYLCTLPG 138 +D + RK+ P+K++ + LPG Sbjct: 560 WDLIYLGRKQVNPEKEAAVEGLPG 583 >gi|15611832|ref|NP_223483.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155330|gb|AAD06344.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 273 Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust. Identities = 43/156 (27%), Positives = 69/156 (44%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + + I + T GY I + A L + Sbjct: 152 RLMHLLYDASVKSEPLNHKNQEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|145634824|ref|ZP_01790532.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittAA] gi|110006031|gb|ABG48518.1| LpsD [Haemophilus influenzae] gi|145267990|gb|EDK07986.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittAA] Length = 256 Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust. Identities = 52/180 (28%), Positives = 77/180 (42%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI----YGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F+FFDAI E N F K Sbjct: 9 YVISLTTEQKRRKNITEEFGKQNIPFAFFDAITPDLIEETAKKFNITFDRSS-----KAT 63 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ + ++K + Sbjct: 64 LCDGEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLE 122 Query: 122 ALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVR-KNIYRPID 177 A K K++ C N +I + + T GY I + A +LL N+R K +Y ID Sbjct: 123 AHGKIIYGKREQIKC----NRNISRLKFKHTGTAGYSITAKGARYLLNNIRNKQLYLAID 178 >gi|160395573|sp|Q29NU5|GLT25_DROPS RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor Length = 626 Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust. Identities = 45/177 (25%), Positives = 80/177 (45%), Gaps = 20/177 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF------ 58 +++I+L RR+K + I LQ F A+ G+ +R+ Q+ +F Sbjct: 344 IFMINLERRPERRQKMENLFEEIGLQVEHFPAVDGKELN-ADRV---QEMGIRFLPGYED 399 Query: 59 ---KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF---SQLLPHLSKCD 112 R +++ EIGC++SH +W R+ +ILEDD F F + + + ++ Sbjct: 400 PYHHRAMTMGEIGCFLSHYRIWVRMVQLELKEVLILEDDIRFDPYFRANAVRVLNQARSV 459 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + LI F RK+ K++S + +H S T GY + + A+ LL + Sbjct: 460 VEYDLIYFG--RKRLKEESEPWVADADSLVHAG--YSYWTLGYVLSLQGALKLLAAK 512 >gi|110006033|gb|ABG48519.1| LpsD [Haemophilus influenzae] gi|110006035|gb|ABG48520.1| LpsD [Haemophilus influenzae] gi|110006037|gb|ABG48521.1| LpsD [Haemophilus influenzae] Length = 256 Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust. Identities = 52/180 (28%), Positives = 77/180 (42%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI----YGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F+FFDAI E N F K Sbjct: 9 YVISLTTEQKRRKNITEEFGKQNIPFAFFDAITPDLIEETAKKFNITFDRSS-----KAT 63 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ + ++K + Sbjct: 64 LCDGEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLE 122 Query: 122 ALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVR-KNIYRPID 177 A K K++ C N +I + + T GY I + A +LL N+R K +Y ID Sbjct: 123 AHGKIIYGKREQIKC----NRNISRLKFKHTGTAGYSITAKGARYLLNNIRNKQLYLAID 178 >gi|308187262|ref|YP_003931393.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Pantoea vagans C9-1] gi|308057772|gb|ADO09944.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Pantoea vagans C9-1] Length = 258 Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust. Identities = 40/168 (23%), Positives = 68/168 (40%), Gaps = 9/168 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++I+L S RR + L + F A+ G + + + R + Sbjct: 1 MKTFIINLASSTGRRATISAQCDAAGLDYEFISAVNG--YALTDAEIAQHTRTVNYA--F 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL--PHLSKCDINNILIKF 120 EIGC +SHI +++++ A+ILEDDA F+ + +L P + N L+ Sbjct: 57 KPGEIGCALSHIAIYRKMKDEKIPQALILEDDALFTGQLGAVLSSPAMQLSPDNPTLVLL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + K T + P + Y I EAAI LL + Sbjct: 117 SRVNRYVDKAIAAVTETSHL---YPVYSATTAHAYVINLEAAIRLLKL 161 >gi|148827119|ref|YP_001291872.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittGG] gi|148718361|gb|ABQ99488.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittGG] Length = 191 Score = 45.4 bits (106), Expect = 0.009, Method: Compositional matrix adjust. Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 20/170 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRELFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYGRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPH------------L 108 EIGC +SH+ ++++I I ++ EDDA F +F Q L L Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFI 157 + INN D P S+LC GN + P + T GY I Sbjct: 124 GQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLI 172 >gi|254466303|ref|ZP_05079714.1| glycosyl transferase, family 25 [Rhodobacterales bacterium Y4I] gi|206687211|gb|EDZ47693.1| glycosyl transferase, family 25 [Rhodobacterales bacterium Y4I] Length = 273 Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 53/123 (43%), Gaps = 17/123 (13%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS----KCDINNI-LIKFD 121 + C +SH +W+R S A A++ EDD S E L LS C + NI + D Sbjct: 59 MACTLSHARVWERFLDSGADAALVFEDDVFISAELRLWLEDLSWWPEGCGLVNIEFWRSD 118 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILS--PRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 +LR D + D +LS P + GY + ++AA LL RP DM Sbjct: 119 SLRVLLGTDGT-----QHLDRELAPMLSRNPGSAGYLVTRDAAEQLLAA-----RPFDMS 168 Query: 180 MKH 182 + Sbjct: 169 IDQ 171 >gi|145628932|ref|ZP_01784732.1| dimethyladenosine transferase [Haemophilus influenzae 22.1-21] gi|144979402|gb|EDJ89088.1| dimethyladenosine transferase [Haemophilus influenzae 22.1-21] Length = 223 Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust. Identities = 51/187 (27%), Positives = 73/187 (39%), Gaps = 18/187 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 24 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSNSIL 83 Query: 57 ---QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LS 109 + R+L+ E GC ISH LW + I EDD + L + Sbjct: 84 HNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKT 143 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + D N+I I +P K +P NFDI + T GY I + AA ++ Sbjct: 144 RFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKST--HWGTAGYIISQGAAKYV 201 Query: 166 LNVRKNI 172 + KNI Sbjct: 202 IEYLKNI 208 >gi|156120717|ref|NP_001095505.1| glycosyltransferase 25 family member 3 precursor [Bos taurus] gi|160395522|sp|A7MB73|GT253_BOVIN RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|154425666|gb|AAI51374.1| CERCAM protein [Bos taurus] Length = 595 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 33/142 (23%), Positives = 59/142 (41%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + + Q Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSVMRTLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVTRGLAQVVVFEDDVRFESNFKGRLEQLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 442 DLIYLGRKQVNPEEEAVVEGLP 463 >gi|145637408|ref|ZP_01793067.1| diadenosine tetraphosphatase [Haemophilus influenzae PittHH] gi|145269354|gb|EDK09298.1| diadenosine tetraphosphatase [Haemophilus influenzae PittHH] Length = 307 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 72/183 (39%), Gaps = 14/183 (7%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKR---QCQ 57 I VIS+ + RR+ + L FSFF+A Y N N+ + + Sbjct: 24 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSNSILHNIE 83 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDI 113 R+L+ E GC ISH LW + I EDD + L ++ D Sbjct: 84 KSRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDF 143 Query: 114 NNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 N+I I +P K +P NFDI + T GY I + AA +++ Sbjct: 144 NDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYIISQGAAKYVIEYL 201 Query: 170 KNI 172 KNI Sbjct: 202 KNI 204 >gi|194225890|ref|XP_001499943.2| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Equus caballus] Length = 584 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 34/142 (23%), Positives = 59/142 (41%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ I + Q Sbjct: 311 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSILRSLGVDLLPGYQDPYSG 370 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 371 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVFEDDVRFESNFRGRLEQLMEEVEAEKLPW 430 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 431 DLIYLGRKQVNPEEEAVVEGLP 452 >gi|308062125|gb|ADO04013.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Cuz20] Length = 278 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 53/188 (28%), Positives = 82/188 (43%), Gaps = 28/188 (14%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCYQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTSVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLEYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKPTFSPL------YFYR 227 ID M +H+W H + + V E A+ + ++T +S E+ +KP PL ++ Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEK----QKPKKLPLSIRIGRSLHK 262 Query: 228 NTCYQWNL 235 + QWN+ Sbjct: 263 SALKQWNV 270 >gi|269139614|ref|YP_003296315.1| putative beta 1,4-galactosyltransferase [Edwardsiella tarda EIB202] gi|267985275|gb|ACY85104.1| putative beta 1,4-galactosyltransferase [Edwardsiella tarda EIB202] gi|304559490|gb|ADM42154.1| Beta-1,4-galactosyltransferase [Edwardsiella tarda FL6-60] Length = 251 Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust. Identities = 55/235 (23%), Positives = 97/235 (41%), Gaps = 41/235 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + V++I+L ++ + +R+++ +A+ G + +R+ + Q Sbjct: 1 MKVFIINLKRDLQKKHEIISECSRLNINHEIVEAVSGIDLSAAEVDRLID---KDAQI-- 55 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+ EIGC +SH+ ++++I A+ILEDDA D D+N ++ Sbjct: 56 YLTKGEIGCSLSHLRIYQKIISEALPYALILEDDAILHD------------DLNEVIQAI 103 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPR-----------TTGYFIGKEAAIHLLNVR 169 + + K +YL G +Q LS + T GY + + A L + Sbjct: 104 ENVIDKESSHAYLLYKTGCVYGNQRIKLSEKYSLYESNVPTCTHGYVVTNKTARLLTAIN 163 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAV-YEAIDTND-----STIEESRLVRKP 218 + D ++E NI P ++ ID+ D STIEE RL R P Sbjct: 164 TPVRFEADAWRNFYFEKNI-----RPYSLNINLIDSRDQSKENSTIEEERLSRTP 213 >gi|296482048|gb|DAA24163.1| glycosyltransferase 25 family member 3 precursor [Bos taurus] Length = 531 Score = 45.1 bits (105), Expect = 0.011, Method: Compositional matrix adjust. Identities = 33/142 (23%), Positives = 59/142 (41%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + + Q Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSVMRTLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVTRGLAQVVVFEDDVRFESNFKGRLEQLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 442 DLIYLGRKQVNPEEEAVVEGLP 463 >gi|329756895|gb|AEC04680.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust. Identities = 51/188 (27%), Positives = 77/188 (40%), Gaps = 29/188 (15%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQF 58 MP+ YVISL + RR+ + ++ + F FFDAI + + + + + Sbjct: 1 MPMTNYVISLLSAKERRQHVINEFSKHQVPFQFFDAISPSSQLDSLIQELIPNLNGAS-- 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH------LSKCD 112 L+ E GC ISH+ LW + + I EDD + + L L CD Sbjct: 59 ---LTGGEKGCLISHLALWHKCIQDNSPYVTIFEDDILLGRDARKFLGEDEWLFSLFNCD 115 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGN--------FDIHQPRILSPRTTGYFIGKEAAIH 164 +I I +R + LC P N F + + L T GY I AA + Sbjct: 116 --DIFI----IRLETFLQPTLCQTPPNPISYCGRDFLVLKDEHLG--TAGYIISLGAAKY 167 Query: 165 LLNVRKNI 172 LL + KN+ Sbjct: 168 LLEIFKNM 175 >gi|157326139|gb|ABV44302.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust. Identities = 41/122 (33%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R A F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCA-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK +ILEDDA F Q Sbjct: 57 DAQSMLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDAMLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|255090032|ref|XP_002506937.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] gi|226522211|gb|ACO68195.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] Length = 290 Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust. Identities = 40/130 (30%), Positives = 56/130 (43%), Gaps = 26/130 (20%) Query: 3 IPVYVISLPFSHARREKFCH---RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + V+VI+L + ARR + H +A H F D + N + +KR Sbjct: 106 LNVFVITLQRTPARRAQLIHNLTQAGVAHKIFFAVDGLLPLNGDDILKYAGLRKRS---- 161 Query: 60 RLLSL---------PEI----------GCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 RL SL PE+ GCYISH+ LW+ + S A+ILEDD + Sbjct: 162 RLTSLDVLRAHSPSPELDLMIHERLRFGCYISHVRLWEHLVNSELPFAVILEDDVIILER 221 Query: 101 FSQLLPHLSK 110 F L HL + Sbjct: 222 FDYSLRHLIR 231 >gi|25989453|gb|AAL82722.1| putative beta1,4-galactosyltransferase [Edwardsiella tarda] Length = 251 Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust. Identities = 55/235 (23%), Positives = 97/235 (41%), Gaps = 41/235 (17%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + V++I+L ++ + +R+++ +A+ G + +R+ + Q Sbjct: 1 MKVFIINLKRDLQKKHEIISECSRLNINHEIVEAVSGIDLSAAEVDRLID---KDAQI-- 55 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+ EIGC +SH+ ++++I A+ILEDDA D D+N ++ Sbjct: 56 YLTKGEIGCSLSHLRIYQKIISEALPYALILEDDAILHD------------DLNEVIQAI 103 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPR-----------TTGYFIGKEAAIHLLNVR 169 + + K +YL G +Q LS + T GY + + A L + Sbjct: 104 ENVIDKESGHAYLLYKTGCVYGNQRIKLSEKYSLYESNVPTCTHGYVVTNKTARLLTAIN 163 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAV-YEAIDTND-----STIEESRLVRKP 218 + D ++E NI P ++ ID+ D STIEE RL R P Sbjct: 164 TPVRFEADAWRNFYFEKNI-----RPYSLNINLIDSRDQSKENSTIEEERLSRTP 213 >gi|86139388|ref|ZP_01057957.1| glycosyl transferase, family 25 [Roseobacter sp. MED193] gi|85823891|gb|EAQ44097.1| glycosyl transferase, family 25 [Roseobacter sp. MED193] Length = 241 Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust. Identities = 49/217 (22%), Positives = 86/217 (39%), Gaps = 13/217 (5%) Query: 2 PIPVYVISLPFSHARR---EKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK--RQC 56 P+ ++I +P S AR+ E+ C + FDA+ G + + H Sbjct: 3 PMRSFIIHMPDSTARQANAERLCDDLP----SGTLFDAVNGRDPAQIGAVKYHGGDLHSP 58 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINN 115 + L EIG + SH +W+++ A+I EDD F + L L + + Sbjct: 59 NYPFALRPAEIGVFQSHRRIWQKMVDEGIDLALITEDDLQIDPVLFPKALELLQRHATPD 118 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 I+ K+ + + + + PR++ + +G+ AA LL K I RP Sbjct: 119 HYIRLPV--KQRETPAKVLEDKDGLQLILPRVIGLQCICQCVGRHAAARLLQATKEIDRP 176 Query: 176 IDMDMK-HWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 +D ++ HW + G A STI++ Sbjct: 177 VDTFLQMHWITQQPVHALLGTGNQEVAAQIGGSTIQQ 213 >gi|157326123|gb|ABV44294.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust. Identities = 40/122 (32%), Positives = 50/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK S +ILEDDA F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKSNQ-PVVILEDDAMLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|47213906|emb|CAF95848.1| unnamed protein product [Tetraodon nigroviridis] Length = 601 Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust. Identities = 31/124 (25%), Positives = 57/124 (45%), Gaps = 6/124 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR K A + L F+ DA+ G+ + ++K Sbjct: 320 IFLINLKRRLDRRTKMLKTFAALGLHFTLTDAVDGKALNTSQLQALGIEMLPRYKDPYSG 379 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+L+ EIGC++SH +W ++ ++LEDD F F + L + DI+ + Sbjct: 380 RVLTRGEIGCFLSHHSIWTQVLERGLEKVLVLEDDVRFEPRFKRRLQAIMD-DIDRAQLD 438 Query: 120 FDAL 123 +D + Sbjct: 439 WDLI 442 >gi|5764665|gb|AAD51367.1|AF177203_1 cerebral cell adhesion molecule [Homo sapiens] Length = 517 Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust. Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 15/146 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ N + + +Q Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDAGWLNSSAIRNLGVDLLPGYQDPYS- 302 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ Sbjct: 303 -GRTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEK 360 Query: 118 IKFDAL---RKK--PKKDSYLCTLPG 138 + +D + RK+ P+K++ + LPG Sbjct: 361 LSWDLIYLGRKQVNPEKETAVEGLPG 386 >gi|308182961|ref|YP_003927088.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] gi|308065146|gb|ADO07038.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] Length = 278 Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 25/208 (12%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIII 131 Query: 98 SDEFSQLLPHLSKCDINNI-LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 + F + L + IN + I+ L + K L + I T GY Sbjct: 132 KERFKESLEFCYQ-HINELGYIRLMHLEENVAKQKTLIKGVSQILNFKDGI---GTQGYV 187 Query: 157 IGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESR 213 + +AA LL K PID M +H+W H + + V E A+ + ++ +S E+ Sbjct: 188 LAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTEK-- 244 Query: 214 LVRKPTFSPL------YFYRNTCYQWNL 235 +KP PL ++ QWN+ Sbjct: 245 --QKPKKLPLSIRIGRSLHKGALKQWNV 270 >gi|229844633|ref|ZP_04464772.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 6P18H1] gi|229812347|gb|EEP48037.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 6P18H1] Length = 257 Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust. Identities = 48/178 (26%), Positives = 71/178 (39%), Gaps = 11/178 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDA----IYGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F FFDA I E N IF K Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNIIFDRSS-----KAT 63 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ D + ++K + Sbjct: 64 LCDAEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NVDYIDSDTDILKLE 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 L K + + +I + GY I + A +LLN KN + +D Sbjct: 123 TLYGKIMLGK-MYQIKCERNIFHLKFKHTGMGGYSITAKGAKYLLNKIKNTRLDLAID 179 >gi|305380|gb|AAA65534.1| lipopolysaccharide core [Haemophilus influenzae] Length = 294 Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust. Identities = 51/191 (26%), Positives = 74/191 (38%), Gaps = 22/191 (11%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + +L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 -------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-- 107 + R+L+ E GC ISH LW + I EDD + L Sbjct: 63 NSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNE 122 Query: 108 --LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEA 161 ++ D N+I I +P K +P NFDI + T GY I + A Sbjct: 123 WLKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYIISQGA 180 Query: 162 AIHLLNVRKNI 172 A +++ KNI Sbjct: 181 AKYVIEYLKNI 191 >gi|157136453|ref|XP_001656834.1| hypothetical protein AaeL_AAEL003481 [Aedes aegypti] gi|122095142|sp|Q17FB8|GLT25_AEDAE RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor gi|108881003|gb|EAT45228.1| conserved hypothetical protein [Aedes aegypti] Length = 607 Score = 44.7 bits (104), Expect = 0.015, Method: Compositional matrix adjust. Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 19/116 (16%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQ---CQF--- 58 +Y+I+L RR K + + L FF A+ G R S K + +F Sbjct: 325 IYMINLERRPERRNKMFNNFDELGLDVEFFPAVDG-------RQLSDDKLRDIGVKFLPG 377 Query: 59 ------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 KR +++ EIGC++SH ++W+++ ++LEDD F F + + + Sbjct: 378 YADPYHKRPMTMGEIGCFLSHYYIWEKMVAMNQEEVLVLEDDIRFEPYFKRRVAQV 433 >gi|321463619|gb|EFX74634.1| hypothetical protein DAPPUDRAFT_199801 [Daphnia pulex] Length = 623 Score = 44.7 bits (104), Expect = 0.015, Method: Compositional matrix adjust. Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 6/110 (5%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH +W+ I +II EDD F F + L L +++ + + Sbjct: 384 RAMTMGEIGCFLSHYAIWQEIVDRQLSSSIIFEDDIRFESNFVKKLADLVN-EVDRLQVD 442 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 +D + RK+ K ++ G ++ S T Y + K A LL Sbjct: 443 WDLIYLGRKRLKHENETWVEGGQLLVNVE--YSYWTLSYILSKRGAEKLL 490 >gi|308184403|ref|YP_003928536.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] gi|308060323|gb|ADO02219.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] Length = 332 Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust. Identities = 76/310 (24%), Positives = 105/310 (33%), Gaps = 92/310 (29%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSMLKSDWYHSYVGAGLTLPELGCYLSHYFLWKECV-KLNQPVVILEDDVALESHFMQ 115 Query: 104 LLP-------------------HLSKCDI----------------NNILIKFDA------ 122 L H +K + N IL KF Sbjct: 116 ALEDCLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFSPFDYPFKNNPILEKFKKFFDVSR 175 Query: 123 ------------LRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAAIHLLN 167 + KK +K Y F + + S T GY++ + A + Sbjct: 176 FLNLSTHKVIHYILKKIQKSYYAAHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIE 235 Query: 168 V--RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------R 216 R I P+DM M + H+I + P V + + DSTI++ + + Sbjct: 236 ATERFKIIEPVDMFMDNSAYHDIANFTCVPCPVSLSEHSLDSTIQKPQKKSLKSYPSPPK 295 Query: 217 KPTFSPLYFY 226 K TF L+FY Sbjct: 296 KSTFRELFFY 305 >gi|254779201|ref|YP_003057306.1| putative glycosyltransferase, family 25 [Helicobacter pylori B38] gi|254001112|emb|CAX29067.1| Putative glycosyltransferase, family 25 [Helicobacter pylori B38] Length = 273 Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust. Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K I Sbjct: 94 KFMSLGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYI 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +D+ K + I + T GY I + A + Sbjct: 152 RLMHLLYDSSVKSEPLSHKNHEIQERVGIIKAYSKGVGTQGYVITPKIAKVFKKHSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+DM M + H + +LV +P + A D STI Sbjct: 212 VPVDMIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|156537912|ref|XP_001608141.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 617 Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust. Identities = 31/115 (26%), Positives = 55/115 (47%), Gaps = 8/115 (6%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 KR +++ E+GC++SH +W RI ++ILEDD F F Q +L L + + Sbjct: 382 KRPMTMGEVGCFLSHYIVWNRIVEDGDKISLILEDDVKFEPYFRQKIKLILNELERFKKD 441 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 L+ RK+ ++D+ + +H S T GY + + A L++ + Sbjct: 442 WDLVYLG--RKQMQRDTESWVEGSRYLVHAG--YSYWTVGYMLSAKGAKKLIDAK 492 >gi|317181932|dbj|BAJ59716.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F57] Length = 332 Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYFLWKECVKTDQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|238919223|ref|YP_002932738.1| glycosyltransferase [Edwardsiella ictaluri 93-146] gi|19113667|gb|AAL25629.2| putative glycosyltransferase [Edwardsiella ictaluri 93-146] gi|238868792|gb|ACR68503.1| putative glycosyltransferase [Edwardsiella ictaluri 93-146] Length = 261 Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust. Identities = 53/234 (22%), Positives = 94/234 (40%), Gaps = 29/234 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK------RQC 56 + ++ISL ARR R H F FFDA+ + + + S K + Sbjct: 1 MKTFIISLKDEVARRNSISDRLH--HHDFEFFDAV--DLRVASDDVLSAVKCKNINYKHP 56 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP------HLSK 110 + ++ EIGC +SH+ L+K+I A ++EDDA + ++L + Sbjct: 57 AIREHMTKGEIGCALSHMQLYKKIVDDNLAFARVIEDDAVILNHDEKILNDFILALEMKN 116 Query: 111 CDINNILIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAI 163 D + +L+ + L+ YL G + + P R + T GY + + A Sbjct: 117 IDWDIMLLGYSKLKACDSFGFYLKEPIKNIVKSGAYSLGIPFRNWTCGTVGYLVSQSGAK 176 Query: 164 HLLNVRKNIYRPIDMDMKHWW----EHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 ++N + + W +N+ L P V E ++ +S+IE R Sbjct: 177 KMIN--NAAFGKVCTVADDWLFFEKNYNLKILHIRPLIVLEDFESYESSIETDR 228 >gi|227498916|ref|ZP_03929055.1| conserved hypothetical protein [Acidaminococcus sp. D21] gi|226904367|gb|EEH90285.1| conserved hypothetical protein [Acidaminococcus sp. D21] Length = 202 Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust. Identities = 33/124 (26%), Positives = 56/124 (45%), Gaps = 2/124 (1%) Query: 44 ICNRIFSHQKRQCQFKR-LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 + N + Q ++C+ LL++ E+GC +SH+ + K + A I EDD S+EF Sbjct: 1 MGNELNETQLKECRGDNGLLTIGEVGCVLSHLKVCKDFLSTNAPYWTIFEDDIHLSNEFL 60 Query: 103 QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 P + + I LR+ K + L GN + + GY I ++AA Sbjct: 61 DAYPRIESFMNSQIEPSVLLLRRNNGKGKVVYPLGGNHHVLH-MLAGTMACGYIINRKAA 119 Query: 163 IHLL 166 +L+ Sbjct: 120 ENLV 123 >gi|125381152|gb|ABN41495.1| putative glycosyltransferase [Campylobacter jejuni] Length = 266 Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 10 MKFFIINLKIAEDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 70 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 129 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 130 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 189 Query: 171 NIYRPID 177 I P D Sbjct: 190 KIIFPYD 196 >gi|145637013|ref|ZP_01792677.1| LosA [Haemophilus influenzae PittHH] gi|145269871|gb|EDK09810.1| LosA [Haemophilus influenzae PittHH] Length = 148 Score = 44.3 bits (103), Expect = 0.019, Method: Compositional matrix adjust. Identities = 41/151 (27%), Positives = 63/151 (41%), Gaps = 14/151 (9%) Query: 71 ISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL-RKKPKK 129 +SH++++ + AIILEDDA S EF H+ +N + D L + K Sbjct: 1 MSHLYIYNMMQEQNIDKAIILEDDAIVSQEFE----HIVLDSLNKVPNTMDILFYEHGKA 56 Query: 130 DSYLCT---LPGNFDIH------QPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 +Y C + G +H + + RTT Y I +E A LL + I P D Sbjct: 57 KTYFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAYPIRMPADYLT 116 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + + EP V+ ID+ IE+ Sbjct: 117 GALQLTGLNAYGVEPPCVFRGIDSEIDAIEQ 147 >gi|116201385|ref|XP_001226504.1| hypothetical protein CHGG_08577 [Chaetomium globosum CBS 148.51] gi|88177095|gb|EAQ84563.1| hypothetical protein CHGG_08577 [Chaetomium globosum CBS 148.51] Length = 493 Score = 43.9 bits (102), Expect = 0.021, Method: Compositional matrix adjust. Identities = 22/93 (23%), Positives = 43/93 (46%), Gaps = 2/93 (2%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ I+LP RR+ A L ++ D + G++ + ++ Q + Sbjct: 47 IFAINLPSRTDRRDALALAGALSELDITWVDGVLGKD--VLDKTLPGDPNSRQGNGGFTT 104 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 G + +H+ + +RI + A+ILEDDAD+ Sbjct: 105 GNKGSWRAHMDVLQRIVHENVTSALILEDDADW 137 >gi|317180390|dbj|BAJ58176.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 184 Score = 43.9 bits (102), Expect = 0.021, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|149201031|ref|ZP_01878006.1| glycosyl transferase, family 25 [Roseovarius sp. TM1035] gi|149145364|gb|EDM33390.1| glycosyl transferase, family 25 [Roseovarius sp. TM1035] Length = 240 Score = 43.9 bits (102), Expect = 0.022, Method: Compositional matrix adjust. Identities = 31/148 (20%), Positives = 67/148 (45%), Gaps = 3/148 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LIKF 120 L+ PEI ++SH +W I +I ++DA + L++ I+++ I F Sbjct: 64 LTTPEIARFLSHRQIWAEIVRQGLDYGLIFDEDAALDPQIFARARDLARDHIDDLGYIAF 123 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + + + G + P + +PR+ +G++AA HLL++ + RP++ + Sbjct: 124 QP--QAVRGPARVIDTNGGCVLCLPVVNAPRSPVQMVGQDAAAHLLHLTEIFDRPVEFLI 181 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + W ++ + P V + + T Sbjct: 182 QSHWHTHLRTGAVYPSGVSRIMQGSGHT 209 >gi|325133389|gb|EGC56053.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis M13399] Length = 279 Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust. Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 2/100 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSGV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 E C++SH+ LWK+ + EDD + Q L Sbjct: 62 EKACFMSHVVLWKQALDEGVPYIAVFEDDVLLGEGVEQFL 101 >gi|253989141|ref|YP_003040497.1| hypothetical protein PAU_01661 [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253780591|emb|CAQ83753.1| conserved hypothetical protein [Photorhabdus asymbiotica] Length = 261 Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust. Identities = 56/222 (25%), Positives = 90/222 (40%), Gaps = 19/222 (8%) Query: 6 YVISLPFSHA-RREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR-LLS 63 +VISL ++ RR + ++ + F FFDAI I N + F LS Sbjct: 4 FVISLSQNNEKRRNHIVEQFSKKSIPFEFFDAIDKTKIHIANDL------GVSFDNPNLS 57 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 L E GC +SH+ LWK++ A I EDD S E L + + + +IK + Sbjct: 58 LGEKGCILSHVMLWKKVIDENLPMATIFEDDIYLSKEAENYLKNYDWINPDWHVIKIERA 117 Query: 124 RKKPKKD-SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----------VRKNI 172 +K K S + T + I + R GY I + A +L + I Sbjct: 118 DEKVKTAISPVKTFNKHEGIFKLRGEHLGAGGYIITNKGAKYLFEKITSSPFKDPIDYEI 177 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 + D ++ IP+L + + + D S++E R+ Sbjct: 178 FNNFIYDKNYFICQFIPALCMQDYTINKCHDKFPSSLENERI 219 >gi|71149096|gb|AAZ29047.1| Lgt2 [Moraxella catarrhalis] gi|326570929|gb|EGE20953.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC7] gi|326575875|gb|EGE25798.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis CO72] Length = 254 Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust. Identities = 50/197 (25%), Positives = 82/197 (41%), Gaps = 39/197 (19%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGENNPICNRIFSHQKRQ 55 I +VIS+ + RRE + + F FFDA+ Y + I I ++Q+ Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAVTPTDISKYAQKLSI--PIINNQR-- 57 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 L+ E C++SH+ LW+++ I EDD ++ + C I N Sbjct: 58 ------LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGND--------AACFIKN 103 Query: 116 --ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------RTTGYFIGKEAAIHLLN 167 + +FD ++ + + + L + + H R L+P T GY I + A LL Sbjct: 104 DWLYFEFDIIKLETQHE--LVHIGKSIHHHGNRTLNPLKSTHVGTAGYIISQSGAKRLLE 161 Query: 168 VRKNI----YRPIDMDM 180 K+I Y ID M Sbjct: 162 FIKSIDEYEYYAIDHVM 178 >gi|157326159|gb|ABV44312.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK S +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGASLTLPELGCYLSHYLLWKECVKSNQ-PVVILEDDVTLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|74318870|gb|ABA02572.1| Lgt2A [Moraxella catarrhalis] gi|326563616|gb|EGE13868.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 103P14B1] gi|326566382|gb|EGE16532.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC1] gi|326573223|gb|EGE23191.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 101P30B1] Length = 254 Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust. Identities = 50/197 (25%), Positives = 82/197 (41%), Gaps = 39/197 (19%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGENNPICNRIFSHQKRQ 55 I +VIS+ + RRE + + F FFDA+ Y + I I ++Q+ Sbjct: 2 IQNFVISIKTATKRREHIMCEFGKQGIAFEFFDAVTPTDISKYAQKLSI--PIINNQR-- 57 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 L+ E C++SH+ LW+++ I EDD ++ + C I N Sbjct: 58 ------LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGND--------AACFIKN 103 Query: 116 --ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------RTTGYFIGKEAAIHLLN 167 + +FD ++ + + + L + + H R L+P T GY I + A LL Sbjct: 104 DWLYFEFDIIKLETQHE--LVHIGKSIHHHGNRTLNPLKSTHVGTAGYIISQSGAKRLLE 161 Query: 168 VRKNI----YRPIDMDM 180 K+I Y ID M Sbjct: 162 FIKSIDEYEYYAIDHVM 178 >gi|145641046|ref|ZP_01796627.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] gi|145274207|gb|EDK14072.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.4-21] Length = 221 Score = 43.9 bits (102), Expect = 0.025, Method: Compositional matrix adjust. Identities = 37/132 (28%), Positives = 58/132 (43%), Gaps = 18/132 (13%) Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIG----AIILEDDA----DFSD 99 IF+ ++ + + R ++ EIGC +SH+ ++++I I ++ EDDA DF Sbjct: 11 IFNIEQFKAHYGRNVTKGEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHPDFQK 70 Query: 100 EFSQLLPH--------LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSP 150 + LL + + INN D P S+LC GN + P + Sbjct: 71 NLTALLAEKLESEIILVGQSKINN-FNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFA 129 Query: 151 RTTGYFIGKEAA 162 T GY I K AA Sbjct: 130 GTVGYLIKKSAA 141 >gi|307245938|ref|ZP_07528021.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853157|gb|EFM85379.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 256 Score = 43.9 bits (102), Expect = 0.026, Method: Compositional matrix adjust. Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTADFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD F+ F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLN 116 >gi|205355890|ref|ZP_03222659.1| hypothetical protein Cj8421_1178 [Campylobacter jejuni subsp. jejuni CG8421] gi|205346324|gb|EDZ32958.1| hypothetical protein Cj8421_1178 [Campylobacter jejuni subsp. jejuni CG8421] Length = 257 Score = 43.9 bits (102), Expect = 0.026, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|108563215|ref|YP_627531.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPAG1] gi|107836988|gb|ABF84857.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPAG1] Length = 294 Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust. Identities = 47/160 (29%), Positives = 71/160 (44%), Gaps = 18/160 (11%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LWK+ I + AI ILEDD + F + L + IN + Sbjct: 108 MGFGELGCYASHYSLWKKCIELNEAI--CILEDDIIVKERFKESL-EFCRQHINELGYIR 164 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 165 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 218 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESR 213 ID M +H+W H + + V E A+ + +++ +S E+ R Sbjct: 219 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNSQNSNTEKQR 257 >gi|32033957|ref|ZP_00134213.1| COG3306: Glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208502|ref|YP_001053727.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae L20] gi|307257074|ref|ZP_07538849.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|126097294|gb|ABN74122.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae serovar 5b str. L20] gi|306864445|gb|EFM96353.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 256 Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust. Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTADFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD F+ F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLN 116 >gi|303251039|ref|ZP_07337225.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252666|ref|ZP_07534558.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307261511|ref|ZP_07543180.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|302650049|gb|EFL80219.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306859842|gb|EFM91863.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306868794|gb|EFN00602.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 256 Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust. Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTADFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD F+ F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLN 116 >gi|108563025|ref|YP_627341.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] gi|107836798|gb|ABF84667.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] Length = 332 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 50/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK + +ILEDDA F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDAMLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|297380003|gb|ADI34890.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori v225d] Length = 285 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 54/206 (26%), Positives = 91/206 (44%), Gaps = 22/206 (10%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFS 98 +++P + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 77 KHSPCALKNFFYALKHC--GKRMGFGELGCYASHYLLWQKCIELNEAI--CILEDDIIVK 132 Query: 99 DEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG 154 + F + L + IN + L+ + K K + NF T G Sbjct: 133 ERFKESLEFCYQ-HINELGYIRLMHLEENVAKQKTPIKGVSQILNFKD------GIGTQG 185 Query: 155 YFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEE 211 Y + +AA LL K PID M +H+W H + + V E A+ + ++T +S E+ Sbjct: 186 YVLAPKAAQKLLEYSTKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEK 244 Query: 212 SRLVRKPTFSPL--YFYRNTCYQWNL 235 R + P + +++ QWN+ Sbjct: 245 QRPKKLPLSIRIGRSLHKSALKQWNV 270 >gi|254361948|ref|ZP_04978079.1| glycosyltransferase LpsA [Mannheimia haemolytica PHL213] gi|1346467|sp|Q05770|LPSA_PASHA RecName: Full=Lipooligosaccharide biosynthesis protein lpsA gi|561691|gb|AAA80283.1| LpsA [Mannheimia haemolytica] gi|153093495|gb|EDN74475.1| glycosyltransferase LpsA [Mannheimia haemolytica PHL213] Length = 263 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 48/181 (26%), Positives = 76/181 (41%), Gaps = 22/181 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ + ++ F FFDAI + + F+ L+ Sbjct: 4 YVISLTSAQERRKHIEAEFGKQNIPFQFFDAITPDLIKEKAKAFNIDISNTN----LTKG 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLL--------PHLSKCDINNI 116 EI C +SHI LW +A + I I EDD + +LL H+ K + Sbjct: 60 EIACALSHIALW-HLAKQQNLDYICIFEDDIYLGNNAFELLKTNYIPENTHIVKLET--- 115 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + FD + + K + Y+ N + + T GY + + A L+N+ K + PI Sbjct: 116 -LPFDRINRFNKTEKYIL----NRRLFKLNSRHVGTAGYILTNKGAEFLINILKTLNIPI 170 Query: 177 D 177 D Sbjct: 171 D 171 >gi|201067868|ref|ZP_03217759.1| hypothetical protein CJBH_L15 [Campylobacter jejuni subsp. jejuni BH-01-0142] gi|200004562|gb|EDZ05035.1| hypothetical protein CJBH_L15 [Campylobacter jejuni subsp. jejuni BH-01-0142] Length = 252 Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|157415408|ref|YP_001482664.1| hypothetical protein C8J_1088 [Campylobacter jejuni subsp. jejuni 81116] gi|13123730|gb|AAK12951.1|AF343914_4 unknown [Campylobacter jejuni] gi|108514948|gb|ABF93270.1| hypothetical protein [Campylobacter jejuni] gi|157386372|gb|ABV52687.1| hypothetical protein C8J_1088 [Campylobacter jejuni subsp. jejuni 81116] gi|167412365|gb|ABZ79825.1| unknown [Campylobacter jejuni] gi|167412381|gb|ABZ79839.1| unknown [Campylobacter jejuni] gi|307748049|gb|ADN91319.1| Putative uncharacterized protein [Campylobacter jejuni subsp. jejuni M1] gi|315932286|gb|EFV11229.1| glycosyltransferase [Campylobacter jejuni subsp. jejuni 327] Length = 257 Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|165976453|ref|YP_001652046.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876554|gb|ABY69602.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 256 Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust. Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTADFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD F+ F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLN 116 >gi|62868792|gb|AAY17580.1| putative glycosyltransferase [Campylobacter jejuni] gi|108514881|gb|ABF93232.1| hypothetical protein [Campylobacter jejuni] Length = 257 Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|307166662|gb|EFN60659.1| Glycosyltransferase 25 family member [Camponotus floridanus] Length = 198 Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust. Identities = 52/187 (27%), Positives = 82/187 (43%), Gaps = 28/187 (14%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 KR +++ EIGC++SH +W+++ ++LEDD F F Q +L LS I Sbjct: 10 KRPMTMGEIGCFLSHYLIWQKVLEHGYKNVMVLEDDVRFEPFFRQKVNYVLEELSALGIE 69 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN------- 167 LI + +K K +S + F +H S T GY + + A L+ Sbjct: 70 WDLI-YVGRKKLVKSESPVEG--SKFLLHAA--YSYWTLGYILSENGARKLIGAMPLGKL 124 Query: 168 VRKNIYRPI--DMDMKHWWEHNIP-----SLVTEPGAVYEAIDTND----STIEESRLVR 216 V + Y PI D K W P +L T P +Y T + S E+S+L++ Sbjct: 125 VPVDEYLPILSDTHPKEQWAAQFPIRDLITLSTNPLLIYPTHYTGEDGYISDTEDSKLMQ 184 Query: 217 KPTFSPL 223 T +P+ Sbjct: 185 N-TVTPI 190 >gi|46411176|ref|NP_997181.1| glycosyltransferase 25 family member 3 precursor [Mus musculus] gi|160395523|sp|A3KGW5|GT253_MOUSE RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|126540790|emb|CAM46219.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus] gi|148676479|gb|EDL08426.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus] gi|187953029|gb|AAI38848.1| Cerebral endothelial cell adhesion molecule [Mus musculus] Length = 592 Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust. Identities = 33/145 (22%), Positives = 59/145 (40%), Gaps = 13/145 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-------RQCQ 57 V+VISL RR + + + DA+ G + + I H + Sbjct: 319 VFVISLARRPQRRARMLSSLWEMEISAQVVDAVDGRT--LNSSILKHLGVDLLPGYQDPY 376 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK----CDI 113 L+ E+GC++SH +W+ + ++ EDD F D F + L L + + Sbjct: 377 SGHTLTKGEVGCFLSHYSIWEEVVARGLARVVVFEDDVRFKDNFRRRLERLMEDVLIQKL 436 Query: 114 NNILIKFDALRKKPKKDSYLCTLPG 138 + LI + P+++ + LPG Sbjct: 437 SWDLIYLGRKQVNPEEEVAVEGLPG 461 >gi|229845811|ref|ZP_04465923.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae 7P49H1] gi|229810815|gb|EEP46532.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae 7P49H1] Length = 277 Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust. Identities = 51/180 (28%), Positives = 77/180 (42%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI----YGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F+FFDAI E + N F K Sbjct: 30 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPDLIEETSKKFNITFDRSS-----KAT 84 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ + ++K + Sbjct: 85 LCDAEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLE 143 Query: 122 ALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVR-KNIYRPID 177 A K K++ C N +I + + GY I + A +LL N+R K +Y ID Sbjct: 144 AHGKIIYGKREQIKC----NRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAID 199 >gi|261838139|gb|ACX97905.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 436 Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 47/122 (38%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|219871215|ref|YP_002475590.1| lipooligosaccharide biosynthesis protein lpsA [Haemophilus parasuis SH0165] gi|219691419|gb|ACL32642.1| lipooligosaccharide biosynthesis protein lpsA [Haemophilus parasuis SH0165] Length = 263 Score = 43.5 bits (101), Expect = 0.030, Method: Compositional matrix adjust. Identities = 44/177 (24%), Positives = 71/177 (40%), Gaps = 8/177 (4%) Query: 6 YVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +VISL +S+ +R H + F FFDAI N +F + L+ Sbjct: 4 FVISLKYSNDKRRTHMKNEFSSHNIPFEFFDAITPNENAQYANLFKINLDNTK----LTQ 59 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LIKFDAL 123 EI C SH+ LWK++ + I EDD S +L + + +IK + Sbjct: 60 GEISCLFSHLTLWKQMIDNNLDRIAIFEDDIYLSSSAKDMLDSIESISTTDFDVIKLEKS 119 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDM 178 ++ + N +H + + GY I AA ++ +NI PID+ Sbjct: 120 LERVLASKFYKIRMRNSSLHILKDSHLGSAGYVITNSAARKIITYIQNENIIAPIDI 176 >gi|195998972|ref|XP_002109354.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens] gi|190587478|gb|EDV27520.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens] Length = 546 Score = 43.5 bits (101), Expect = 0.030, Method: Compositional matrix adjust. Identities = 45/172 (26%), Positives = 72/172 (41%), Gaps = 15/172 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ------- 57 VYVI+L RR+ I L FF+AI G + + Q Sbjct: 305 VYVINLLRRKERRQYMKALLDEICLDAEFFEAIDGRQ--LNEEKIAKLGIQILPGYLDPF 362 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 +R + EIGC++SH LWK++ +ILEDD F + L + + ++ ++ Sbjct: 363 HQRPMKYGEIGCFLSHYQLWKKVIQFNYSSILILEDDVKVKMGFHRQLSQVMQ-EVQSLS 421 Query: 118 IKFDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + +D + RKK ++ + N+ I S T GY + A LL Sbjct: 422 LPWDIIYIGRKKLLSEAEIPVKGANYLIWPD--YSYWTVGYILSASGAKKLL 471 >gi|157326145|gb|ABV44305.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 43.5 bits (101), Expect = 0.030, Method: Compositional matrix adjust. Identities = 78/337 (23%), Positives = 117/337 (34%), Gaps = 96/337 (28%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVTLESHFMQ 115 Query: 104 LLPHLSKCDINNILI----------KFDALRKK--------PKKDS-YLCTLPGNFDIHQ 144 L K + + + KF L K+ P K++ L FD+ Sbjct: 116 ALEDCLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNTPILEKFKKFFDV-- 173 Query: 145 PRILSPRT---------------------------------------TGYFIGKEAAIHL 165 PR L+ T GY++ + A Sbjct: 174 PRFLNLSTHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTF 233 Query: 166 LNV--RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV-------- 215 + R I P+DM M + H++ +L P V + + DSTI++ + Sbjct: 234 IEATERFKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLP 293 Query: 216 -RKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 +K TF L++Y + + + K P+ T K Sbjct: 294 PQKSTFKNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|15645424|ref|NP_207598.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] gi|2313935|gb|AAD07853.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] Length = 284 Score = 43.5 bits (101), Expect = 0.030, Method: Compositional matrix adjust. Identities = 48/165 (29%), Positives = 72/165 (43%), Gaps = 18/165 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD D F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIIKDRFKESL-EFCRHHINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKP 218 ID M +H+W H + + V E A+ + ++ +S E+ R + P Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTEKQRPKKLP 251 >gi|62289315|ref|YP_221108.1| glycosyl transferase family protein [Brucella abortus bv. 1 str. 9-941] gi|62195447|gb|AAX73747.1| glycosyl transferase, family 25 [Brucella abortus bv. 1 str. 9-941] Length = 239 Score = 43.5 bits (101), Expect = 0.031, Method: Compositional matrix adjust. Identities = 48/205 (23%), Positives = 78/205 (38%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----AD 96 N RI+ + ++ LS EI C++SH W+ I I+EDD + Sbjct: 43 NKDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 97 FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + + HL I F R+ ++ F RI+ P G Sbjct: 103 FMGAYRAAVDHLEPGGF--IRFTFRDDREHGRE---------VFRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 F+ +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQFVSYDAAQKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|17987885|ref|NP_540519.1| glycosyltransferase [Brucella melitensis bv. 1 str. 16M] gi|82699245|ref|YP_413819.1| glycosyl transferase family protein [Brucella melitensis biovar Abortus 2308] gi|189023568|ref|YP_001934336.1| glycosyl transferase, family 25 [Brucella abortus S19] gi|225851867|ref|YP_002732100.1| glycosyltransferase 25 family protein [Brucella melitensis ATCC 23457] gi|237814806|ref|ZP_04593804.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus str. 2308 A] gi|254688634|ref|ZP_05151888.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 6 str. 870] gi|254693115|ref|ZP_05154943.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 3 str. Tulya] gi|254696757|ref|ZP_05158585.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 2 str. 86/8/59] gi|254729666|ref|ZP_05188244.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 4 str. 292] gi|256044048|ref|ZP_05446959.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis bv. 1 str. Rev.1] gi|256112844|ref|ZP_05453765.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis bv. 3 str. Ether] gi|256256881|ref|ZP_05462417.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 9 str. C68] gi|256264619|ref|ZP_05467151.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|260545931|ref|ZP_05821672.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260563407|ref|ZP_05833893.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260754106|ref|ZP_05866454.1| glycosyl transferase family 25 [Brucella abortus bv. 6 str. 870] gi|260757326|ref|ZP_05869674.1| glycosyl transferase family 25 [Brucella abortus bv. 4 str. 292] gi|260761150|ref|ZP_05873493.1| glycosyl transferase family 25 [Brucella abortus bv. 2 str. 86/8/59] gi|260883131|ref|ZP_05894745.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|261213352|ref|ZP_05927633.1| glycosyl transferase family 25 [Brucella abortus bv. 3 str. Tulya] gi|265990459|ref|ZP_06103016.1| glycosyl transferase family 25 [Brucella melitensis bv. 1 str. Rev.1] gi|265994287|ref|ZP_06106844.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|297247731|ref|ZP_06931449.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|17983618|gb|AAL52783.1| glycosyltransferase [Brucella melitensis bv. 1 str. 16M] gi|82615346|emb|CAJ10307.1| Glycosyl transferase, family 25 [Brucella melitensis biovar Abortus 2308] gi|189019140|gb|ACD71862.1| Glycosyl transferase, family 25 [Brucella abortus S19] gi|225640232|gb|ACO00146.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis ATCC 23457] gi|237789643|gb|EEP63853.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus str. 2308 A] gi|260097338|gb|EEW81213.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260153423|gb|EEW88515.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260667644|gb|EEX54584.1| glycosyl transferase family 25 [Brucella abortus bv. 4 str. 292] gi|260671582|gb|EEX58403.1| glycosyl transferase family 25 [Brucella abortus bv. 2 str. 86/8/59] gi|260674214|gb|EEX61035.1| glycosyl transferase family 25 [Brucella abortus bv. 6 str. 870] gi|260872659|gb|EEX79728.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|260914959|gb|EEX81820.1| glycosyl transferase family 25 [Brucella abortus bv. 3 str. Tulya] gi|262765400|gb|EEZ11189.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|263001243|gb|EEZ13818.1| glycosyl transferase family 25 [Brucella melitensis bv. 1 str. Rev.1] gi|263094986|gb|EEZ18694.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|297174900|gb|EFH34247.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|326408363|gb|ADZ65428.1| Glycosyl transferase, family 25 [Brucella melitensis M28] gi|326538078|gb|ADZ86293.1| glycosyltransferase 25 family member 2 precursor [Brucella melitensis M5-90] Length = 239 Score = 43.5 bits (101), Expect = 0.031, Method: Compositional matrix adjust. Identities = 48/205 (23%), Positives = 78/205 (38%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----AD 96 N RI+ + ++ LS EI C++SH W+ I I+EDD + Sbjct: 43 NKDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 97 FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + + HL I F R+ ++ F RI+ P G Sbjct: 103 FMGAYRAAVDHLEPGGF--IRFTFRDDREHGRE---------VFRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 F+ +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQFVSYDAAQKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|260182073|gb|ACX35567.1| beta-1,4 galactosyltransferase [Pasteurella multocida] Length = 283 Score = 43.5 bits (101), Expect = 0.032, Method: Compositional matrix adjust. Identities = 49/190 (25%), Positives = 81/190 (42%), Gaps = 30/190 (15%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----KR 60 YVISL + RR+ + ++ + F FFDAI +P+ N++ S QF Sbjct: 7 YVISLASAVERRQHISEQFSQYDIPFQFFDAI--SPSPLLNQLVS------QFFPSLADS 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINNI 116 L+ E GC+ISH+ LW + ++ EDD + L S+ + N++ Sbjct: 59 SLTDGEKGCFISHLSLWHKCVEKNLPYIVVFEDDILLGKNADKFLIEDEWFFSRFNTNDV 118 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT-----GYFIGKEAAIHLLNVRKN 171 I +R + C + +L+ ++T GY I AA LL++ N Sbjct: 119 FI----VRLETFLQKVYCQPSHIKSYYNRELLTLKSTHFGTAGYIISLGAAKFLLSLFNN 174 Query: 172 IY----RPID 177 ++ PID Sbjct: 175 MHIEEVAPID 184 >gi|328789321|ref|XP_397154.3| PREDICTED: glycosyltransferase 25 family member-like [Apis mellifera] Length = 567 Score = 43.5 bits (101), Expect = 0.032, Method: Compositional matrix adjust. Identities = 39/170 (22%), Positives = 77/170 (45%), Gaps = 11/170 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRI---FSHQKRQCQFK 59 +Y+I+L RR + + ++ DA+ G N I ++ + Sbjct: 322 IYMINLLRRPERRNRMHKLFKELGIRVETHDAVDGRALNQSILEKMGIKIMPEYTDPYHD 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH ++W ++ + IILEDD F F Q L ++ ++ ++ ++ Sbjct: 382 RPMTMGEIGCFLSHYNIWNKVIENDFKSVIILEDDVRFEPFFCQKLNYIL-TELKDLHLE 440 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 +D + RKK +++ + +H S T GY + A L+ Sbjct: 441 WDLVYLGRKKLAENAEPWIDESKYLVHAA--YSYWTLGYILSASGARKLI 488 >gi|317180389|dbj|BAJ58175.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 332 Score = 43.5 bits (101), Expect = 0.033, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|254779221|ref|YP_003057326.1| putative glycosyl transferase, family 25; putative LPS biosynthesis protein [Helicobacter pylori B38] gi|254001132|emb|CAX29090.1| Putative glycosyl transferase, family 25; putative LPS biosynthesis protein [Helicobacter pylori B38] Length = 284 Score = 43.5 bits (101), Expect = 0.033, Method: Compositional matrix adjust. Identities = 49/165 (29%), Positives = 73/165 (44%), Gaps = 18/165 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L K IN + Sbjct: 97 MGFGELGCYASHYLLWQKCIELNEAI--CILEDDIIVKERFKESLEFCDK-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKP 218 ID M +H+W H + + V E A+ + ++T +S E+ R + P Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDEMNTQNSNTEKQRPKKLP 251 >gi|217032517|ref|ZP_03438008.1| hypothetical protein HPB128_180g16 [Helicobacter pylori B128] gi|298736530|ref|YP_003729056.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|216945795|gb|EEC24418.1| hypothetical protein HPB128_180g16 [Helicobacter pylori B128] gi|298355720|emb|CBI66592.1| glycosyl transferase, family 25 [Helicobacter pylori B8] Length = 273 Score = 43.5 bits (101), Expect = 0.033, Method: Compositional matrix adjust. Identities = 43/156 (27%), Positives = 66/156 (42%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +D K + I + T GY I + A L + Sbjct: 152 RLMHLLYDTSVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|315586530|gb|ADU40911.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 35A] Length = 280 Score = 43.5 bits (101), Expect = 0.033, Method: Compositional matrix adjust. Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 21/188 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCTLKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIV 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + F + L + IN + L+ + K K + NF T Sbjct: 132 KEHFKESLEFCYQ-HINELGYIRLMHLEENVAKQKTPIKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++T +S E Sbjct: 185 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTE 243 Query: 211 ESRLVRKP 218 + R + P Sbjct: 244 KQRPKKLP 251 >gi|315586698|gb|ADU41079.1| glycosyltransferase [Helicobacter pylori 35A] Length = 332 Score = 43.1 bits (100), Expect = 0.035, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DVQSMLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKTDQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|71065765|ref|YP_264492.1| glycosyl transferase family protein [Psychrobacter arcticus 273-4] gi|71038750|gb|AAZ19058.1| probable glycosyl transferase family 25, LPS biosynthesis [Psychrobacter arcticus 273-4] Length = 255 Score = 43.1 bits (100), Expect = 0.035, Method: Compositional matrix adjust. Identities = 44/181 (24%), Positives = 76/181 (41%), Gaps = 20/181 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGENNPICNRIFSHQKRQCQF 58 +VISL + RR+ + ++ F FFDA+ Y + PI + Sbjct: 4 HVISLTTATDRRQHITQEFTKQNISFEFFDALIPDLAHTYAQTLPIDLENIA-------- 55 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L+ E+ C++SH +W+++ I EDD ++ LL S +I Sbjct: 56 ---LTGGELACFMSHASVWQQMIDQNIPYLAIFEDDVYLGEDAEALLTTTSWIKPEWHII 112 Query: 119 KFDALRKKPKKDSYLCTLPGNFD-IHQPRILSPRTTGYFIG-KEAAIHLLNVRKNIYRPI 176 K +A +K S + + I Q + + T GY + + A ++L + KN RP+ Sbjct: 113 KIEAFSEKVFLSSNSSKIISDRRCIAQLKGRNLGTAGYILSLRGAQVYLDYISKNKLRPL 172 Query: 177 D 177 D Sbjct: 173 D 173 >gi|317180568|dbj|BAJ58354.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 284 Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust. Identities = 52/183 (28%), Positives = 81/183 (44%), Gaps = 21/183 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIV 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + F + L + IN + L+ + K K + NF T Sbjct: 132 KERFKESLEFCCQ-HINELGYIRLMHLEENVAKQKTPIKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + ++T +S E Sbjct: 185 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTE 243 Query: 211 ESR 213 + R Sbjct: 244 KQR 246 >gi|210135026|ref|YP_002301465.1| beta-1,4-galactosyltransferase [Helicobacter pylori P12] gi|210132994|gb|ACJ07985.1| beta-1,4-galactosyltransferase [Helicobacter pylori P12] Length = 273 Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust. Identities = 43/156 (27%), Positives = 66/156 (42%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +DA K + I + T GY I + A + Sbjct: 152 RLMHLLYDASVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|332673390|gb|AEE70207.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 83] Length = 280 Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust. Identities = 48/165 (29%), Positives = 73/165 (44%), Gaps = 18/165 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCCQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPIKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSAKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKP 218 ID M +H+W H + + V E A+ + ++T +S E+ R + P Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAIACDGMNTQNSNTEKQRPKKLP 251 >gi|317177365|dbj|BAJ55154.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 278 Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust. Identities = 52/203 (25%), Positives = 87/203 (42%), Gaps = 15/203 (7%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 E +P C + F + + C + + E+GCY SH LW++ + ILEDD Sbjct: 76 EKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCLELNEV-ICILEDDIIVK 132 Query: 99 DEFSQLLPHLSKCDINNI-LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFI 157 + F + L + IN + I+ L + K L + I T GY + Sbjct: 133 ERFKESLEFCYQ-HINELGYIRLMHLEENVAKQKTLIKGVSQILNFKDGI---GTQGYVL 188 Query: 158 GKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRL 214 +AA LL K PID M +H+W H + + V E A+ + ++ +S E+ R Sbjct: 189 APKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAIACDGMNAQNSNTEKQRP 247 Query: 215 VRKPTFSPL--YFYRNTCYQWNL 235 + P + +++ QWN+ Sbjct: 248 KKLPLSIRIGRSLHKSAVKQWNV 270 >gi|328721259|ref|XP_001944685.2| PREDICTED: glycosyltransferase 25 family member-like [Acyrthosiphon pisum] Length = 223 Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust. Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 8/115 (6%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDI 113 +KR ++ EIGC++SH +W + Y ++LEDDA F F +L L + + Sbjct: 30 YKRPITKGEIGCFMSHYRIWAKTTYEGLDEVLVLEDDARFEPYFRFKLQMVLDELRRLKV 89 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + L+ + +K D+ L + + +P S T GY + A LLN Sbjct: 90 SWDLV---YIGRKSLNDNNESRLENSKLLVRPGY-SYWTLGYLLSGRGAKKLLNA 140 >gi|307250276|ref|ZP_07532230.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306857717|gb|EFM89819.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 256 Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A + FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTVDFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD + F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLLAANFQQNLTALLNEKLN 116 >gi|157326149|gb|ABV44307.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYFLWKECVKTNQ-PVVILEDDVTLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|329756889|gb|AEC04675.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 43.1 bits (100), Expect = 0.038, Method: Compositional matrix adjust. Identities = 49/183 (26%), Positives = 75/183 (40%), Gaps = 19/183 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MP+ YVISL + RR + ++ H+ F FDA+ +P + F Q+ Sbjct: 1 MPMTNYVISLSSARERRRHVMNEFSKHHVPFQIFDAV----SPSSHLDFLIQRLVPNLNG 56 Query: 61 L-LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 L+ E GC ISH+ LW + I EDD + L S+ + ++ Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARAFLAEDEWLFSRFNCDD 116 Query: 116 ILIKFDALRKKPKKDSYLC-TLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLNVR 169 I I +R + +C LP L+ + T GY I AA +LL + Sbjct: 117 IFI----IRLETFLQETICEALPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIF 172 Query: 170 KNI 172 KN+ Sbjct: 173 KNM 175 >gi|148244950|ref|YP_001219644.1| glycosyl transferase family protein [Candidatus Vesicomyosocius okutanii HA] gi|146326777|dbj|BAF61920.1| glycosyltransferase, family 25 [Candidatus Vesicomyosocius okutanii HA] Length = 258 Score = 43.1 bits (100), Expect = 0.040, Method: Compositional matrix adjust. Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 9/110 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDA--IYGENNPICNRIFSHQKRQCQFKR 60 + ++VI+L + RR+ + +R+ L + +A I NN ++QK ++R Sbjct: 1 MDIFVINLSSAIKRRQFQEQQLSRLKLDYQITNATTISDINNK------TYQKHHYDWQR 54 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 + E+ CY SH LW +I + A+ILEDD S +L L K Sbjct: 55 PMQKTEVACYFSHQTLWSKIVKKNRM-ALILEDDVLLSKHTPSILTKLEK 103 >gi|317177550|dbj|BAJ55339.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 332 Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKTDQ-PIVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|308184590|ref|YP_003928723.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SJM180] gi|308060510|gb|ADO02406.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SJM180] Length = 284 Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust. Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 21/188 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFYALKHCG--KRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIVL 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + F + L K IN + L+ + K K + NF T Sbjct: 132 KECFKESLEFCDK-HINELGYIRLMHLEENVAKQKTPVKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIE 210 GY + +AA LL K PID M +H+W H + + V E A+ + +++ +S E Sbjct: 185 GYVLAPKAAQKLLKYSTKEWVMPIDCVMDRHYW-HGVKNYVLEEFAITCDGMNSQNSNTE 243 Query: 211 ESRLVRKP 218 + R + P Sbjct: 244 KQRPKKLP 251 >gi|238920443|ref|YP_002933958.1| beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] gi|238870013|gb|ACR69724.1| putative beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] Length = 246 Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust. Identities = 55/227 (24%), Positives = 97/227 (42%), Gaps = 29/227 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + V++I+L ++ + R+ ++F DA+ G + + + R Sbjct: 1 MQVFIINLKHDIQKKNAIQRQCKRLRIKFEIIDAVCGIDLTDSDLDSFIDADARS----- 55 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP----HLSKCD-INN 115 LS E+GC +SH L++RI A+ILEDDA ++ + ++ +SK D I Sbjct: 56 YLSRGEMGCALSHNSLYQRIIAEQLPFALILEDDAILHNDTAHVIALIEKRISKEDNIAL 115 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT-GYFIGKEAAIHLLNVRKNIYR 174 +L K + + K L TL ++ ++ +P T GY I +AA LL + Sbjct: 116 LLYKTEYIYKNK-----LITLSDDYIFYESN--TPTLTHGYVITNKAARTLLTLNT---- 164 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAI-----DTNDSTIEESRLVR 216 P+ ++ W V + I D STIE+ R ++ Sbjct: 165 PVRVEADAWRYFYFTRFVRAYSLNTDLILSHDVDKQTSTIEKERQLK 211 >gi|57790492|gb|AAW56186.1| Cj81-127 [Campylobacter jejuni subsp. jejuni 81-176] Length = 180 Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust. Identities = 54/202 (26%), Positives = 82/202 (40%), Gaps = 50/202 (24%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + I IILEDD +FSDE Sbjct: 50 EHLEFKDHFPWWGSWVLGRELSDGEKACFASHYKLWQECVKLDEPI--IILEDDVEFSDE 107 Query: 101 F----SQLLPHLSKCDINNILIKFDALRKKPK-KDSYLCTLPGNFDIHQPRILSPRTTGY 155 F ++ + L K I + + K K D++L TL T GY Sbjct: 108 FLNNGAEYIDELLKSKYEYIRLCYLTQGKMLKLNDNFLFTLDS----------IGGTQGY 157 Query: 156 FIGKEAAIHLLNVRKNIYRPID 177 I +A+ ++ K +P+D Sbjct: 158 LIKPISALKFIDNLKFWIKPVD 179 >gi|317012636|gb|ADU83244.1| beta-1,4-galactosyltransferase [Helicobacter pylori Lithuania75] Length = 273 Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust. Identities = 43/159 (27%), Positives = 67/159 (42%), Gaps = 16/159 (10%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 119 KFDALRKKPK--------KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 + L P K+ + G ++ + T GY I + A + Sbjct: 152 RLMHLLYDPNVKSEPLSHKNHEIQERVGIIKVYSEGV---GTQGYVITPKIAKVFKKCSR 208 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 209 KWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|108563237|ref|YP_627553.1| beta-1,4-galactosyltransferase [Helicobacter pylori HPAG1] gi|107837010|gb|ABF84879.1| beta-1,4-galactosyltransferase [Helicobacter pylori HPAG1] Length = 273 Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust. Identities = 44/156 (28%), Positives = 65/156 (41%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + +SL E+GCY SH LW++ I + AI ILEDD + F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEGFKEGLDFLEKHIQELGYV 151 Query: 119 KFDALRKKPKKDSYLCT-----LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + L P S + + I + T GY I + A L + Sbjct: 152 RLMHLLYDPNVKSEPLSHKNQEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|238920436|ref|YP_002933951.1| beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] gi|238870005|gb|ACR69716.1| putative beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] Length = 247 Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust. Identities = 55/237 (23%), Positives = 98/237 (41%), Gaps = 45/237 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V++I+L ++ + + ++++ +A+ G + I + Q L Sbjct: 1 MKVFIINLKRDLQKKHEIINECDQLNISHEIIEAVSGIDLS-ATEIDHLIDKDAQI--YL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC +SH+ ++++I A+ILEDDA D+ S+++ I N++ Sbjct: 58 TKGEIGCSLSHLRIYQKIISEDLPYALILEDDAILHDDLSEVIHA-----IENVI----- 107 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPR-----------TTGYFIGKEAAIHLLNVRKN 171 K K +YL ++ LS + T GY + + A + Sbjct: 108 --DKEKSHAYLLYKTNCVYSNKEIKLSEKYSLYESNAPTYTHGYVVTNKTA----RLITE 161 Query: 172 IYRPIDMDMKHW----WEHNIPS------LVTEPGAVYEAIDTNDSTIEESRLVRKP 218 I P+ ++ W +E NI S L+T E +STIEE RL R P Sbjct: 162 INTPVRLEADAWRNFYFEKNIRSYSLNIDLITSRDQSKE-----NSTIEEERLSRSP 213 >gi|6630626|dbj|BAA88524.1| beta-1,4-galactosyltransferase [Helicobacter pylori] Length = 273 Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust. Identities = 44/160 (27%), Positives = 69/160 (43%), Gaps = 18/160 (11%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIH--QPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-- 174 + L P S P N H Q R+ + + +G + + + K + Sbjct: 152 RLMHLLYDPNVKSE----PLNHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKHS 207 Query: 175 -----PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 208 RKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|317009163|gb|ADU79743.1| beta-1,4-galactosyltransferase [Helicobacter pylori India7] Length = 273 Score = 43.1 bits (100), Expect = 0.043, Method: Compositional matrix adjust. Identities = 44/159 (27%), Positives = 67/159 (42%), Gaps = 12/159 (7%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + + +SL E+GCY SH LW++ I + I ILEDD + F + L L K +I + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKENFKEGLDFLEK-NIQEL 148 Query: 117 L------IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 + +DA K + I + T GY I + A L + Sbjct: 149 GYARLMHLLYDANVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKCSR 208 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 209 KWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|217032011|ref|ZP_03437512.1| hypothetical protein HPB128_187g38 [Helicobacter pylori B128] gi|216946321|gb|EEC24927.1| hypothetical protein HPB128_187g38 [Helicobacter pylori B128] Length = 332 Score = 43.1 bits (100), Expect = 0.044, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK S +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGTGLTLPELGCYLSHYLLWKECVKSNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|91078804|ref|XP_970300.1| PREDICTED: similar to Glycosyltransferase 25 family member [Tribolium castaneum] gi|270003725|gb|EFA00173.1| hypothetical protein TcasGA2_TC002995 [Tribolium castaneum] Length = 559 Score = 43.1 bits (100), Expect = 0.044, Method: Compositional matrix adjust. Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 5/85 (5%) Query: 20 FCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF-KRLLSLPEIGCYISHIHL 76 FC + LQ + DA+ G N I +I + + KR ++L EIGC++SH ++ Sbjct: 335 FCFD--ELGLQVTIVDAVDGRALNASILQQISPLPEYADPYHKRPMTLGEIGCFLSHYNI 392 Query: 77 WKRIAYSPAIGAIILEDDADFSDEF 101 WK I + ++LEDD F F Sbjct: 393 WKDIVRNGYETTLVLEDDIRFESFF 417 >gi|301169267|emb|CBW28864.1| lipooligosaccharide biosynthesis protein lex-1 (ec 2.-.-.-) [Haemophilus influenzae 10810] Length = 298 Score = 42.7 bits (99), Expect = 0.044, Method: Compositional matrix adjust. Identities = 51/195 (26%), Positives = 74/195 (37%), Gaps = 26/195 (13%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + +L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 -----------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + R+L+ E GC ISH LW + I EDD + L Sbjct: 63 INQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFL 122 Query: 106 PH----LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFI 157 ++ D N+I I +P K +P NFDI + T GY I Sbjct: 123 NQNEWLKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYII 180 Query: 158 GKEAAIHLLNVRKNI 172 + AA +++ KNI Sbjct: 181 SQGAAKYVIEYLKNI 195 >gi|13474441|ref|NP_106009.1| glycosyl transferase [Mesorhizobium loti MAFF303099] gi|14025194|dbj|BAB51795.1| glycosyl transferase [Mesorhizobium loti MAFF303099] Length = 265 Score = 42.7 bits (99), Expect = 0.044, Method: Compositional matrix adjust. Identities = 40/171 (23%), Positives = 61/171 (35%), Gaps = 3/171 (1%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VI+L S R ARI + F + G+ +P + + R LS E Sbjct: 5 VINLDRSPERLAHMSKEFARIGIGFERVAGVDGKEHP---DLMLQPQHAIHATRPLSGSE 61 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 I C SH W IA A + EDD FS + LL S + ++K + + Sbjct: 62 IACLHSHRACWTIIAQDEAPYGAVFEDDMVFSSKAGALLGDTSWVPADADVVKLETFFSR 121 Query: 127 PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + F + + R P GY + + A L + +D Sbjct: 122 TVIQRRRISAGHGFSMFRLRKGHPGAGGYLLSRRTARDFLEATAQVNIAVD 172 >gi|253989060|ref|YP_003040416.1| beta1,4-galactosyltransferase waax [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253780510|emb|CAQ83672.1| putative beta1,4-galactosyltransferase waax [Photorhabdus asymbiotica] Length = 251 Score = 42.7 bits (99), Expect = 0.045, Method: Compositional matrix adjust. Identities = 58/273 (21%), Positives = 111/273 (40%), Gaps = 59/273 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN------NPICNRIFSHQKRQC 56 + ++VI+LP R+ ++A R+ L F +A+ G++ N + H Sbjct: 1 MKIFVINLPKDKERKASIQYQADRLGLNVEFIEAVNGKDLSEDEINILSKDFHQHG---- 56 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ +GC +SHI +++++ A+ILEDDA ++ + Sbjct: 57 -----MTHGVLGCSLSHIKIYEKMIKDNIDIALILEDDALLNENIIESYN---------- 101 Query: 117 LIKFDALRKKPKKDSYLCTLPGNF-DIHQPR----------ILSPRTTGYFIGKEAAIHL 165 LI+ + K K + YL ++ + D + + I + T GY + +AA +L Sbjct: 102 LIESYNYKNKDKPNVYLLSVVNEYIDTFKTKLSTKYNLVNVIDADYTYGYMLNIKAANNL 161 Query: 166 LNVRKNIYRPIDMDMKHW---WEHN-------IPSLV-TEPGAVYEAIDTNDSTIEESRL 214 L N P+ ++ W EH IP ++ P + ++++ S E R+ Sbjct: 162 L----NFLTPVWIEADKWRFMREHGAIKLKAIIPHVIDVTPLSAVSTLESDRSITLEKRI 217 Query: 215 VRKPTFSPLYFYRNT-----CYQWNLHYNAWRK 242 F+ Y RN + W + +W K Sbjct: 218 ---AFFNEQYQNRNLYVKLRAFLWRIFVRSWVK 247 >gi|208434740|ref|YP_002266406.1| beta-1,4-galactosyl transferase [Helicobacter pylori G27] gi|208432669|gb|ACI27540.1| beta-1,4-galactosyl transferase [Helicobacter pylori G27] Length = 273 Score = 42.7 bits (99), Expect = 0.045, Method: Compositional matrix adjust. Identities = 44/160 (27%), Positives = 69/160 (43%), Gaps = 18/160 (11%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEAI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIH--QPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-- 174 + L P S P N H Q R+ + + +G + + + K + Sbjct: 152 RLMHLLYDPNVKSE----PLNHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKHS 207 Query: 175 -----PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 208 RKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|167855175|ref|ZP_02477946.1| probable glycosyl transferase family 25, LPS biosynthesis [Haemophilus parasuis 29755] gi|167853720|gb|EDS24963.1| probable glycosyl transferase family 25, LPS biosynthesis [Haemophilus parasuis 29755] Length = 263 Score = 42.7 bits (99), Expect = 0.045, Method: Compositional matrix adjust. Identities = 44/177 (24%), Positives = 70/177 (39%), Gaps = 8/177 (4%) Query: 6 YVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +VISL +S+ +R H + F FFDAI N +F + L+ Sbjct: 4 FVISLKYSNDKRRTHMKNEFSSHNIPFEFFDAITPNENAQYANLFKINLDNTE----LTQ 59 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-LIKFDAL 123 EI C SH+ LWK++ + I EDD S +L + + +IK + Sbjct: 60 GEISCLFSHLTLWKQMIDNNLDRIAIFEDDIYLSSSAKDMLDSIESISTTDFDVIKLEKS 119 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDM 178 ++ + N +H + + GY I AA + +NI PID+ Sbjct: 120 LERVLASKFYKIRMRNSSLHILKDSHLGSAGYVITNSAARKTITYIQNENIIAPIDI 176 >gi|148560689|ref|YP_001258608.1| hypothetical protein BOV_0614 [Brucella ovis ATCC 25840] gi|148371946|gb|ABQ61925.1| putative membrane protein [Brucella ovis ATCC 25840] Length = 759 Score = 42.7 bits (99), Expect = 0.046, Method: Compositional matrix adjust. Identities = 53/196 (27%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 114 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + A +ILEDD F++ S + DI L Sbjct: 115 NGREMLPGEYGCYRSHLKALESFLSDGAPYGLILEDDVVFTETTSARIH-----DIIKSL 169 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 170 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRVIHGPQ---GSAAAYLVSREGARKLLSALS 226 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 227 TMELPWDVAMERFWYH 242 >gi|157326155|gb|ABV44310.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.7 bits (99), Expect = 0.048, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVTLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|298736312|ref|YP_003728838.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|298355502|emb|CBI66374.1| jhp0562-like glycosyltransferase, family 25 [Helicobacter pylori B8] Length = 332 Score = 42.7 bits (99), Expect = 0.050, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK S +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKSNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|23501502|ref|NP_697629.1| hypothetical protein BR0615 [Brucella suis 1330] gi|161618590|ref|YP_001592477.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella canis ATCC 23365] gi|254703954|ref|ZP_05165782.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis bv. 3 str. 686] gi|260566797|ref|ZP_05837267.1| LPSA protein [Brucella suis bv. 4 str. 40] gi|261754606|ref|ZP_05998315.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|23347409|gb|AAN29544.1| membrane protein, putative [Brucella suis 1330] gi|161335401|gb|ABX61706.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella canis ATCC 23365] gi|260156315|gb|EEW91395.1| LPSA protein [Brucella suis bv. 4 str. 40] gi|261744359|gb|EEY32285.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] Length = 703 Score = 42.7 bits (99), Expect = 0.051, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGNF---DIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDTIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|254688898|ref|ZP_05152152.1| LPSA protein [Brucella abortus bv. 6 str. 870] gi|260754383|ref|ZP_05866731.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260674491|gb|EEX61312.1| predicted protein [Brucella abortus bv. 6 str. 870] Length = 703 Score = 42.7 bits (99), Expect = 0.052, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 78/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ + Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSSAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|48812|emb|CAA40221.1| unnamed protein product [Haemophilus influenzae] Length = 298 Score = 42.7 bits (99), Expect = 0.052, Method: Compositional matrix adjust. Identities = 51/195 (26%), Positives = 74/195 (37%), Gaps = 26/195 (13%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + +L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 -----------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + R+L+ E GC ISH LW + I EDD + L Sbjct: 63 INQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFL 122 Query: 106 PH----LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFI 157 ++ D N+I I +P K +P NFDI + T GY I Sbjct: 123 NQNEWLKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWG--TAGYII 180 Query: 158 GKEAAIHLLNVRKNI 172 + AA +++ KNI Sbjct: 181 SQGAAKYVIEYLKNI 195 >gi|317011045|gb|ADU84792.1| pbeta-1,4-galactosyltransferase [Helicobacter pylori SouthAfrica7] Length = 273 Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust. Identities = 46/180 (25%), Positives = 71/180 (39%), Gaps = 10/180 (5%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + I ILEDD + F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIDLNEPI--CILEDDITLKENFKEGLDFLEKHIQELGYV 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +D K D + I + T GY I + A L + Sbjct: 152 RLMHLLYDPNVKSEPLDHQNQEIQEYIGIIKAYSEGVGTQGYVITPKIAKVFLKYSQRWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQW 233 P+D M + H + +LV + A+ A D STI P + + C +W Sbjct: 212 VPVDTIMDATFIHGVKNLVLQSFAI--ADDEQISTIARKEEPYSPKIALMRKLHFKCLKW 269 >gi|189023836|ref|YP_001934604.1| LPSA protein [Brucella abortus S19] gi|225627122|ref|ZP_03785160.1| Glycosyltransferase 25 family member 3 [Brucella ceti str. Cudo] gi|237815074|ref|ZP_04594072.1| Glycosyltransferase 25 family member 3 [Brucella abortus str. 2308 A] gi|265999554|ref|ZP_05466880.2| LPSA protein [Brucella melitensis bv. 2 str. 63/9] gi|297247992|ref|ZP_06931710.1| glycosyl transferase, family 25 [Brucella abortus bv. 5 str. B3196] gi|189019408|gb|ACD72130.1| LPSA protein [Brucella abortus S19] gi|225617957|gb|EEH15001.1| Glycosyltransferase 25 family member 3 [Brucella ceti str. Cudo] gi|237789911|gb|EEP64121.1| Glycosyltransferase 25 family member 3 [Brucella abortus str. 2308 A] gi|263094622|gb|EEZ18420.1| LPSA protein [Brucella melitensis bv. 2 str. 63/9] gi|297175161|gb|EFH34508.1| glycosyl transferase, family 25 [Brucella abortus bv. 5 str. B3196] Length = 759 Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 114 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 115 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 169 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 170 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 226 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 227 TMELPWDVAMERFWHH 242 >gi|261838140|gb|ACX97906.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 332 Score = 42.7 bits (99), Expect = 0.054, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 47/122 (38%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|308061950|gb|ADO03838.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Cuz20] Length = 332 Score = 42.7 bits (99), Expect = 0.054, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ICVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTDQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|294851984|ref|ZP_06792657.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|294820573|gb|EFG37572.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 759 Score = 42.7 bits (99), Expect = 0.055, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 114 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 115 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 169 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 170 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 226 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 227 TMELPWDVAMERFWHH 242 >gi|317179019|dbj|BAJ56807.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F30] Length = 332 Score = 42.7 bits (99), Expect = 0.056, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQNMLKSDWFHSYVGTGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|317012433|gb|ADU83041.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Lithuania75] Length = 183 Score = 42.7 bits (99), Expect = 0.056, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKLVQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|306842115|ref|ZP_07474785.1| LPSA protein [Brucella sp. BO2] gi|306287788|gb|EFM59216.1| LPSA protein [Brucella sp. BO2] Length = 726 Score = 42.7 bits (99), Expect = 0.057, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 24 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 81 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 82 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 136 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 137 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 193 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 194 TMELPWDVAMERFWHH 209 >gi|261839554|gb|ACX99319.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 52] Length = 332 Score = 42.4 bits (98), Expect = 0.059, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 47/122 (38%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE GCY+SH LWK + +ILEDD F Q Sbjct: 57 AAQSMLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|260182079|gb|ACX35572.1| beta-1,4 galactosyltransferase [Pasteurella multocida] Length = 280 Score = 42.4 bits (98), Expect = 0.059, Method: Compositional matrix adjust. Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 13/94 (13%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----KR 60 YVISL + RR+ + ++ + F FFDAI +P+ N++ S QF Sbjct: 7 YVISLASAVERRQHISEQFSQYDIPFQFFDAI--SPSPLLNQLVS------QFFPSLADS 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD 94 L+ E GC+ISH+ LW + ++ EDD Sbjct: 59 SLTDGEKGCFISHLSLWHKCVEKNLPYIVVFEDD 92 >gi|126306516|ref|XP_001375578.1| PREDICTED: similar to C1orf17 [Monodelphis domestica] Length = 680 Score = 42.4 bits (98), Expect = 0.060, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 54/112 (48%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+++ + Sbjct: 458 RPLTRGEIGCFLSHYSIWKEVIDRELEKTLVIEDDVRFEHQFKKKLVKLMD-DIDHVQLD 516 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 517 WELIYIGRKRMQVQEPEKAVPNVMNLVEAD-YSYWTLGYVISLEGAQKLVGA 567 >gi|325578527|ref|ZP_08148627.1| glycosyltransferase [Haemophilus parainfluenzae ATCC 33392] gi|325159763|gb|EGC71893.1| glycosyltransferase [Haemophilus parainfluenzae ATCC 33392] Length = 256 Score = 42.4 bits (98), Expect = 0.061, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 82/174 (47%), Gaps = 18/174 (10%) Query: 6 YVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + + FS + + E + + +F+ K + + R ++ Sbjct: 4 YLISLDKDVQRRELFFAQPDTADFVVFSAINTMQKEWDELA-EVFNPTKFEQHYGRNVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFS-QLLPHLS-KCDINNILI 118 EIGC +SH+ ++++I + A++ EDDA F+ S + + L+ KCD + +LI Sbjct: 63 GEIGCTLSHLAVYRQIVEDQNVTENDYALVCEDDALFNANLSPKTIALLTEKCDADIVLI 122 Query: 119 ------KFDALRKK---PKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 +F+ + + P S+L G+ + P + T Y I K AA Sbjct: 123 GQSKIAEFNNVELEINYPTTFSFLRQTIGDVTVAYPYKSYFAGTVAYLIKKSAA 176 >gi|162951747|gb|ABY21735.1| LD07116p [Drosophila melanogaster] Length = 639 Score = 42.4 bits (98), Expect = 0.062, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 14/174 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR------QCQF 58 +++I+L RREK I ++ F A+ G+ R+ R Sbjct: 362 IFMINLKRRPERREKMERLFDEIGIEAEHFPAVDGKELS-TERLLEMGVRFLPGYEDPYH 420 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ---LLPHLSKCDINN 115 R +++ EIGC++SH ++W + +ILEDD F F Q + + ++ Sbjct: 421 HRAMTMGEIGCFLSHYNIWVMMVRKQLKEVLILEDDIRFEPYFRQNAVRILNQARNAAQY 480 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 LI F RK+ K++S + +H S T GY I + A+ LL + Sbjct: 481 DLIYFG--RKRLKEESEPAVENADNLVHAG--YSYWTLGYVISLQGALKLLAAK 530 >gi|304398828|ref|ZP_07380698.1| glycosyl transferase family 25 [Pantoea sp. aB] gi|304353532|gb|EFM17909.1| glycosyl transferase family 25 [Pantoea sp. aB] Length = 257 Score = 42.4 bits (98), Expect = 0.062, Method: Compositional matrix adjust. Identities = 40/168 (23%), Positives = 71/168 (42%), Gaps = 9/168 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++I+L S RR + + + F A+ G+ Q FK Sbjct: 1 MKAFIINLDSSTKRRATIAAQCQNAGISYEFIQAVNGKQLSNEEIALHTQPVNYAFKA-- 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL--PHLSKCDINNILIKF 120 EIGC +SHI++++++ A+ILEDDA ++ +L P L L+ Sbjct: 59 --GEIGCALSHINIYRKMVDEKIPQALILEDDALLTEALPPVLASPALQLSAAKPTLVLL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ K K + T+ + I+ P + + Y + EAA LL + Sbjct: 117 SSVNKYVNKP--IATVTESARIY-PVYSATTSHAYVVNFEAAKRLLTL 161 >gi|209884650|ref|YP_002288507.1| glycosyl transferase [Oligotropha carboxidovorans OM5] gi|209872846|gb|ACI92642.1| glycosyl transferase [Oligotropha carboxidovorans OM5] Length = 256 Score = 42.4 bits (98), Expect = 0.063, Method: Compositional matrix adjust. Identities = 39/178 (21%), Positives = 67/178 (37%), Gaps = 2/178 (1%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M IP Y+I+L R + R + + F+ DA+ G+ Sbjct: 1 MTIPPYLINLARDPERLAEMQARFRALGIAFTRVDAVDGKAMTAAELDTVRAPHPAWLP- 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L E+ C +SH W+ IA A + EDD FS L + ++ ++K Sbjct: 60 -LKPNEVACALSHRKCWQMIADGAASHGCVFEDDMLFSPRLPAFLSDTAWIPVDADIVKI 118 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + + D + F + + R R Y + + A LL + + I P+D+ Sbjct: 119 EESFNRVWIDVPPREVGQGFRLGRVRSTHYRAGAYIVSRTGAKRLLAMTERISLPLDL 176 >gi|157326151|gb|ABV44308.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.4 bits (98), Expect = 0.063, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDDA F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDAMLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|308063592|gb|ADO05479.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 332 Score = 42.4 bits (98), Expect = 0.063, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IYVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|188527533|ref|YP_001910220.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143773|gb|ACD48190.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 332 Score = 42.4 bits (98), Expect = 0.064, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ICVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR---------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q K L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSMLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|110006055|gb|ABG48530.1| nonfunctional LpsA [Haemophilus influenzae] gi|110006059|gb|ABG48532.1| nonfunctional LpsA [Haemophilus influenzae] Length = 237 Score = 42.4 bits (98), Expect = 0.064, Method: Compositional matrix adjust. Identities = 55/200 (27%), Positives = 85/200 (42%), Gaps = 37/200 (18%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 EIGC +SHI LW +A + I I EDD H+ K + N + Sbjct: 68 EIGCALSHIVLWD-LALENNLNYINIFEDDI-----------HVLKLEANGKM-----FF 110 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM------ 178 K+PK S C + +++ + GY + + A +LL + KN +P+D+ Sbjct: 111 KQPK--SVKC----DRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKN--KPLDVAVDSLV 162 Query: 179 --DMKHWWEHNIPSLVTEPG 196 D H+ ++ I L PG Sbjct: 163 FEDFLHFKDYKIVQL--SPG 180 >gi|332673368|gb|AEE70185.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 83] Length = 273 Score = 42.4 bits (98), Expect = 0.067, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 75/182 (41%), Gaps = 18/182 (9%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I + I ILEDD + F + L H+ + D Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKENFKEGLDFLEKHIQELDYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I + A + Sbjct: 152 RLMYLLYDANVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPLYF 225 P+D M + H + +LV +P + A D STI L+RK F L + Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYLKY 269 Query: 226 YR 227 ++ Sbjct: 270 WQ 271 >gi|254718764|ref|ZP_05180575.1| LPSA protein [Brucella sp. 83/13] gi|265983742|ref|ZP_06096477.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306840196|ref|ZP_07472976.1| LPSA protein [Brucella sp. NF 2653] gi|264662334|gb|EEZ32595.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306404727|gb|EFM61026.1| LPSA protein [Brucella sp. NF 2653] Length = 703 Score = 42.4 bits (98), Expect = 0.067, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|260912785|ref|ZP_05919271.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Pasteurella dagmatis ATCC 43325] gi|260633163|gb|EEX51328.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Pasteurella dagmatis ATCC 43325] Length = 284 Score = 42.4 bits (98), Expect = 0.069, Method: Compositional matrix adjust. Identities = 50/185 (27%), Positives = 79/185 (42%), Gaps = 20/185 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 YVISL + RR+ + ++ F FFDA+ E N + ++ + R C L+ Sbjct: 10 YVISLSSATDRRKHILEEFYQHNIPFLFFDALSPSLEFNQLVEQLIPNLSR-CN----LT 64 Query: 64 LPEIGCYISHIHLWKRIAYS--PAIGAIILEDDADFSDEFSQLLP-----HLSKCDINNI 116 E GC +SH+ LW + P IG + EDD + + L H +N Sbjct: 65 NGEKGCLMSHLSLWHKCVEENLPYIG--VFEDDILLGKDAEEFLASDEWIHSRFQHGDNF 122 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI---- 172 +I+ + +K D+ + + D + R T GY I AA + LN+ N+ Sbjct: 123 IIRLETFLQKVLCDATNISPYLDRDFLKLRSTHFGTAGYIISFSAAKYFLNIFANMKEDE 182 Query: 173 YRPID 177 PID Sbjct: 183 IEPID 187 >gi|256113154|ref|ZP_05454031.1| LPSA protein [Brucella melitensis bv. 3 str. Ether] gi|265994567|ref|ZP_06107124.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|262765680|gb|EEZ11469.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] Length = 703 Score = 42.4 bits (98), Expect = 0.071, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|46487347|gb|AAS99068.1| Tgh001 [Campylobacter jejuni] gi|58585452|gb|AAW79073.1| unknown [Campylobacter jejuni] Length = 257 Score = 42.4 bits (98), Expect = 0.071, Method: Compositional matrix adjust. Identities = 43/187 (22%), Positives = 80/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 ++ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYKQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|124008087|ref|ZP_01692786.1| LPS glycosyltransferase subfamily, putative [Microscilla marina ATCC 23134] gi|123986501|gb|EAY26307.1| LPS glycosyltransferase subfamily, putative [Microscilla marina ATCC 23134] Length = 269 Score = 42.4 bits (98), Expect = 0.072, Method: Compositional matrix adjust. Identities = 48/195 (24%), Positives = 86/195 (44%), Gaps = 23/195 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG------ENNPICNRIFSHQKRQC 56 + +V+SL + R A L + DA+ G E CN ++ Sbjct: 1 MKTFVVSLKRAKNRYAYIRQHVAGRKLDYKIIDAVDGSLLTEKEIEENCN-----MEQVD 55 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCDINN 115 +++ LS IGC +SH++ ++ + A I+EDD + + LL + ++ + Sbjct: 56 KYRWWLSNGAIGCALSHLNAYEEFLATGDKAAFIIEDDVLLPETITDLLLEIENEIKPSE 115 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT----GYFIGKEAAIHLLNVRKN 171 I++ + A K + N ++ P + P+ T Y IG+ AA LN+++N Sbjct: 116 IILLYYASFAPAKFSTIGERKLSNSGLYYP--IDPKQTITAAAYVIGRTAA---LNLKEN 170 Query: 172 IYRPIDMDMKHWWEH 186 I RPI++ WEH Sbjct: 171 I-RPIEVTADS-WEH 183 >gi|306845221|ref|ZP_07477797.1| LPSA protein [Brucella sp. BO1] gi|306274380|gb|EFM56187.1| LPSA protein [Brucella sp. BO1] Length = 703 Score = 42.4 bits (98), Expect = 0.072, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|300796728|ref|NP_001178231.1| procollagen galactosyltransferase 2 [Bos taurus] gi|297484385|ref|XP_002694258.1| PREDICTED: glycosyltransferase 25 domain containing 2 [Bos taurus] gi|296478943|gb|DAA21058.1| glycosyltransferase 25 domain containing 2 [Bos taurus] Length = 626 Score = 42.4 bits (98), Expect = 0.072, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 54/112 (48%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + + Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMD-DIDRVQLD 462 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + ++P ++ + S T GY I E A L+ Sbjct: 463 WELIYIGRKRMQVKEPEKSVPNVVNLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|62289577|ref|YP_221370.1| hypothetical protein BruAb1_0634 [Brucella abortus bv. 1 str. 9-941] gi|82699507|ref|YP_414081.1| glycosyl transferase family O-antigen polymerase [Brucella melitensis biovar Abortus 2308] gi|225852140|ref|YP_002732373.1| glycosyltransferase 25 family protein [Brucella melitensis ATCC 23457] gi|254693379|ref|ZP_05155207.1| LPSA protein [Brucella abortus bv. 3 str. Tulya] gi|254697030|ref|ZP_05158858.1| LPSA protein [Brucella abortus bv. 2 str. 86/8/59] gi|254707667|ref|ZP_05169495.1| LPSA protein [Brucella pinnipedialis M163/99/10] gi|254709749|ref|ZP_05171560.1| LPSA protein [Brucella pinnipedialis B2/94] gi|254729929|ref|ZP_05188507.1| LPSA protein [Brucella abortus bv. 4 str. 292] gi|256031239|ref|ZP_05444853.1| LPSA protein [Brucella pinnipedialis M292/94/1] gi|256060748|ref|ZP_05450911.1| LPSA protein [Brucella neotomae 5K33] gi|256159346|ref|ZP_05457132.1| LPSA protein [Brucella ceti M490/95/1] gi|256254647|ref|ZP_05460183.1| LPSA protein [Brucella ceti B1/94] gi|256257146|ref|ZP_05462682.1| LPSA protein [Brucella abortus bv. 9 str. C68] gi|256369054|ref|YP_003106562.1| hypothetical protein BMI_I614 [Brucella microti CCM 4915] gi|260168375|ref|ZP_05755186.1| LPSA protein [Brucella sp. F5/99] gi|260545662|ref|ZP_05821403.1| LpsA protein [Brucella abortus NCTC 8038] gi|260757602|ref|ZP_05869950.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260761428|ref|ZP_05873771.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260883408|ref|ZP_05895022.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|261213629|ref|ZP_05927910.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|261221822|ref|ZP_05936103.1| conserved hypothetical protein [Brucella ceti B1/94] gi|261315164|ref|ZP_05954361.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261317280|ref|ZP_05956477.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261324737|ref|ZP_05963934.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261757835|ref|ZP_06001544.1| LPSA protein [Brucella sp. F5/99] gi|265988318|ref|ZP_06100875.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|265997786|ref|ZP_06110343.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|62195709|gb|AAX74009.1| hypothetical membrane protein [Brucella abortus bv. 1 str. 9-941] gi|82615608|emb|CAJ10595.1| Glycosyl transferase, family 25:O-antigen polymerase [Brucella melitensis biovar Abortus 2308] gi|225640505|gb|ACO00419.1| Glycosyltransferase 25 family member 3 precursor [Brucella melitensis ATCC 23457] gi|255999214|gb|ACU47613.1| hypothetical protein BMI_I614 [Brucella microti CCM 4915] gi|260097069|gb|EEW80944.1| LpsA protein [Brucella abortus NCTC 8038] gi|260667920|gb|EEX54860.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260671860|gb|EEX58681.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260872936|gb|EEX80005.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|260915236|gb|EEX82097.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|260920406|gb|EEX87059.1| conserved hypothetical protein [Brucella ceti B1/94] gi|261296503|gb|EEX99999.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261300717|gb|EEY04214.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261304190|gb|EEY07687.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261737819|gb|EEY25815.1| LPSA protein [Brucella sp. F5/99] gi|262552254|gb|EEZ08244.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|264660515|gb|EEZ30776.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|326408640|gb|ADZ65705.1| LPSA protein [Brucella melitensis M28] gi|326538362|gb|ADZ86577.1| glycosyltransferase 25 family member 3 precursor [Brucella melitensis M5-90] Length = 703 Score = 42.4 bits (98), Expect = 0.073, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|254713752|ref|ZP_05175563.1| LPSA protein [Brucella ceti M644/93/1] gi|254717190|ref|ZP_05179001.1| LPSA protein [Brucella ceti M13/05/1] gi|261219006|ref|ZP_05933287.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261321494|ref|ZP_05960691.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260924095|gb|EEX90663.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261294184|gb|EEX97680.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 703 Score = 42.4 bits (98), Expect = 0.073, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|254701409|ref|ZP_05163237.1| LPSA protein [Brucella suis bv. 5 str. 513] gi|261751948|ref|ZP_05995657.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261741701|gb|EEY29627.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] Length = 703 Score = 42.4 bits (98), Expect = 0.074, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|24581946|ref|NP_723087.1| CG31915 [Drosophila melanogaster] gi|74864910|sp|Q8IPK4|GLT25_DROME RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor gi|22945672|gb|AAN10543.1| CG31915 [Drosophila melanogaster] Length = 612 Score = 42.4 bits (98), Expect = 0.074, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 14/174 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR------QCQF 58 +++I+L RREK I ++ F A+ G+ R+ R Sbjct: 335 IFMINLKRRPERREKMERLFDEIGIEAEHFPAVDGKELS-TERLLEMGVRFLPGYEDPYH 393 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ---LLPHLSKCDINN 115 R +++ EIGC++SH ++W + +ILEDD F F Q + + ++ Sbjct: 394 HRAMTMGEIGCFLSHYNIWVMMVRKQLKEVLILEDDIRFEPYFRQNAVRILNQARNAAQY 453 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 LI F RK+ K++S + +H S T GY I + A+ LL + Sbjct: 454 DLIYFG--RKRLKEESEPAVENADNLVHAG--YSYWTLGYVISLQGALKLLAAK 503 >gi|157326129|gb|ABV44297.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.4 bits (98), Expect = 0.075, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 50/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + Q L+LPE+GCY+SH LWK S +ILEDD F Q Sbjct: 57 NAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECIKSNQ-PVVILEDDIVLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|163842890|ref|YP_001627294.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis ATCC 23445] gi|163673613|gb|ABY37724.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis ATCC 23445] Length = 703 Score = 42.4 bits (98), Expect = 0.075, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY SH+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIKSL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|317014235|gb|ADU81671.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Gambia94/24] Length = 273 Score = 42.0 bits (97), Expect = 0.079, Method: Compositional matrix adjust. Identities = 44/158 (27%), Positives = 67/158 (42%), Gaps = 14/158 (8%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------C 111 + +SL E+GCY SH LW++ I I ILEDD ++F + L L K Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELDEPI--CILEDDITLKEDFKEGLDFLEKHIHELGYA 151 Query: 112 DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 + ++L +DA K + + I + T GY I + A L + Sbjct: 152 RLMHLL--YDASVKSEPLNHKNQEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRK 209 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 210 WVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|157326147|gb|ABV44306.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.0 bits (97), Expect = 0.079, Method: Compositional matrix adjust. Identities = 40/133 (30%), Positives = 51/133 (38%), Gaps = 25/133 (18%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVALESHFMQ 115 Query: 104 LLPHLSKCDINNI 116 L K N + Sbjct: 116 ALEDCLKSPFNFV 128 >gi|148825759|ref|YP_001290512.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittEE] gi|148715919|gb|ABQ98129.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittEE] Length = 257 Score = 42.0 bits (97), Expect = 0.080, Method: Compositional matrix adjust. Identities = 47/174 (27%), Positives = 80/174 (45%), Gaps = 18/174 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RR+ F + Q FS + + + + + + IF+ ++ + + R ++ Sbjct: 5 YLISLDKDIQRRKLFFSQKNTEDFQIFSAINTMQKDWDELAS-IFNIEQFKAHYFRNVTK 63 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 EIGC +SH+ ++++I I A++ EDDA F +F Q L L K + IL+ Sbjct: 64 GEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAEIILL 123 Query: 119 ---KFDALRKKPKKDSYLCTL------PGNFDIHQP-RILSPRTTGYFIGKEAA 162 K ++ + +Y T GN + P + T GY I K AA Sbjct: 124 GQSKINSFNDTDLEINYPTTFLFYAKKQGNVNYAFPYKSYFAGTVGYLIKKSAA 177 >gi|317180590|dbj|BAJ58376.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F32] Length = 273 Score = 42.0 bits (97), Expect = 0.081, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 75/182 (41%), Gaps = 18/182 (9%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + +SL E+GCY SH LW++ I + I ILEDD ++F + L L K + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYV 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +DA K + + I + T GY I + A + Sbjct: 152 RLMHLLYDANVKSEPLNHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKAFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPLYF 225 P+D M + H + +LV +P + A D STI L+RK F L + Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYLKY 269 Query: 226 YR 227 ++ Sbjct: 270 WQ 271 >gi|188527328|ref|YP_001910015.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Shi470] gi|188143568|gb|ACD47985.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Shi470] Length = 273 Score = 42.0 bits (97), Expect = 0.082, Method: Compositional matrix adjust. Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 24/187 (12%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----C 111 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L L K Sbjct: 92 YAKFMSLGELGCYASHYSLWQKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 112 DINNILIKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YVRLMHLLYDASVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTF 220 + P+D M + H + +LV +P + A D STI L+RK F Sbjct: 207 SQKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHF 264 Query: 221 SPLYFYR 227 L +++ Sbjct: 265 KYLKYWQ 271 >gi|156390789|ref|XP_001635452.1| predicted protein [Nematostella vectensis] gi|156222546|gb|EDO43389.1| predicted protein [Nematostella vectensis] Length = 589 Score = 42.0 bits (97), Expect = 0.082, Method: Compositional matrix adjust. Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 1/63 (1%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 KR L++ EIGC++SH +WK + +ILEDD F +F + L + D N + Sbjct: 391 KRPLTMGEIGCFLSHYKIWKEMIEKGLERVLILEDDVRFEPDFRRKLLAMI-ADANQLES 449 Query: 119 KFD 121 K++ Sbjct: 450 KYN 452 >gi|329756909|gb|AEC04692.1| hypothetical protein [Pasteurella multocida] gi|329756917|gb|AEC04699.1| hypothetical protein [Pasteurella multocida] gi|329756924|gb|AEC04705.1| hypothetical protein [Pasteurella multocida] gi|329756931|gb|AEC04711.1| hypothetical protein [Pasteurella multocida] gi|329756937|gb|AEC04716.1| hypothetical protein [Pasteurella multocida] gi|329756942|gb|AEC04720.1| hypothetical protein [Pasteurella multocida] gi|329756948|gb|AEC04725.1| hypothetical protein [Pasteurella multocida] gi|329756954|gb|AEC04730.1| hypothetical protein [Pasteurella multocida] Length = 282 Score = 42.0 bits (97), Expect = 0.085, Method: Compositional matrix adjust. Identities = 49/185 (26%), Positives = 82/185 (44%), Gaps = 20/185 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL---- 61 YVISL + RR+ ++ ++ ++ F FFDAI +P+ +++ F RL Sbjct: 6 YVISLSSAVERRQHIRNQFSQKNIPFQFFDAISP--SPLLDQLVLQ-----FFPRLADSS 58 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINNI- 116 L+ E C++SH+ LW + ++ EDD + + L S+ D I Sbjct: 59 LTGGEKACFMSHLSLWHKCVEENLPYIVVFEDDIVLGKDADKFLIGDEWLFSRFDPEEIF 118 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-- 174 +I+ + +K +S + D + T GY I + AA LL++ KNI Sbjct: 119 IIRLETFLQKVVCESTHIAPYTHRDFLSLKSAHFGTAGYVISQGAAKFLLDIFKNISNEH 178 Query: 175 --PID 177 PID Sbjct: 179 IAPID 183 >gi|307637306|gb|ADN79756.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 908] gi|325995897|gb|ADZ51302.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997492|gb|ADZ49700.1| putative glycosyltransferase [Helicobacter pylori 2017] Length = 332 Score = 42.0 bits (97), Expect = 0.085, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 NAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDVTLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|15611629|ref|NP_223280.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155103|gb|AAD06135.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 332 Score = 42.0 bits (97), Expect = 0.085, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 NAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVTLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|157326135|gb|ABV44300.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.0 bits (97), Expect = 0.087, Method: Compositional matrix adjust. Identities = 40/126 (31%), Positives = 50/126 (39%), Gaps = 33/126 (26%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKHEDFEKLV---- 52 Query: 53 KRQCQFKRLL-------------SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD 99 + C + LL +LPE+GCY+SH LWK +ILEDDA Sbjct: 53 QELCDAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDAMLES 111 Query: 100 EFSQLL 105 F Q L Sbjct: 112 HFMQAL 117 >gi|157326131|gb|ABV44298.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.0 bits (97), Expect = 0.088, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 NAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVTLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|251793915|ref|YP_003008647.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter aphrophilus NJ8700] gi|247535314|gb|ACS98560.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter aphrophilus NJ8700] Length = 282 Score = 42.0 bits (97), Expect = 0.090, Method: Compositional matrix adjust. Identities = 48/190 (25%), Positives = 74/190 (38%), Gaps = 22/190 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR---QCQFKRLL 62 +VIS+P + RR + + + F FFDA P R+ H +R L Sbjct: 9 FVISIPTADKRRNHIIQQFGQKKIPFEFFDAF----TP-SERLNDHLQRYLPNVAATPRL 63 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS----KCDINNILI 118 ++ E GC +SH LWK+ + EDD + Q L + + I + Sbjct: 64 TMGEKGCLMSHFMLWKKCVDDGLDYITLFEDDILLGENAEQFLAEDEWLKVRFNFQEIFV 123 Query: 119 -KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR---TTGYFIGKEAAIHLLNVRKNI-- 172 + + K + LP F + IL + T GY I AA +L+ V + Sbjct: 124 LRLETFLMPVKIEKQQGILP--FQQREIDILKSKHFGTAGYVISHGAAKYLIEVFEKFSS 181 Query: 173 --YRPIDMDM 180 +PID M Sbjct: 182 EEVKPIDEIM 191 >gi|110006029|gb|ABG48517.1| LpsC [Haemophilus influenzae] Length = 256 Score = 42.0 bits (97), Expect = 0.093, Method: Compositional matrix adjust. Identities = 52/180 (28%), Positives = 75/180 (41%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDA----IYGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F FFDA I E N IF K Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNIIFDRSS-----KAT 63 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ + ++K + Sbjct: 64 LCDAEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLE 122 Query: 122 ALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVR-KNIYRPID 177 A K K + C N +I + + GY I + A +LL N+R K +Y ID Sbjct: 123 AHGKIIYGKCEQIKC----NRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAID 178 >gi|221115475|ref|XP_002156385.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 589 Score = 42.0 bits (97), Expect = 0.093, Method: Compositional matrix adjust. Identities = 28/109 (25%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN----RIFSHQK-RQCQFK 59 ++VI+L R +K + +F F+A+ G IF + + + Sbjct: 325 IFVINLIRRQDRFKKMSFLLKELGFKFRHFEAVDGRELSAKKVKEMGIFPLEGFKDPYLE 384 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R ++L EIGC++SH +W + ++LEDD F F++ L L Sbjct: 385 RPMTLGEIGCFLSHWRIWNEVIDRSLDMVLVLEDDVRFESGFNKKLHQL 433 >gi|157326125|gb|ABV44295.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 42.0 bits (97), Expect = 0.095, Method: Compositional matrix adjust. Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 3/45 (6%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK + I +ILEDDA F Q L Sbjct: 75 LTLPELGCYLSHYLLWKECVKLDQPI--VILEDDATLESNFMQAL 117 >gi|322515189|ref|ZP_08068188.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Actinobacillus ureae ATCC 25976] gi|322118799|gb|EFX91000.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Actinobacillus ureae ATCC 25976] Length = 256 Score = 42.0 bits (97), Expect = 0.097, Method: Compositional matrix adjust. Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 10/120 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRIFSHQKRQCQFKRLL 62 Y+ISL ARRE F + F FDAI E + + + K + +++R Sbjct: 4 YLISLAKDSARRELFFSQPDTAD--FQVFDAINTMSLEWQELAQK-YDLTKFEQRYQRKA 60 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGA----IILEDDADFSDEFSQLLPHLSKCDINNILI 118 + E+GC SH+ ++++I +I ++ EDDA F+ F L L + D+ +I Sbjct: 61 TKGEVGCTFSHLGVYQQIINDESITEHEYCLVCEDDALFNQGFQYTLDKLVQQDLRADII 120 >gi|225626847|ref|ZP_03784886.1| Glycosyltransferase 25 family member 2 precursor [Brucella ceti str. Cudo] gi|256030977|ref|ZP_05444591.1| glycosyl transferase family protein [Brucella pinnipedialis M292/94/1] gi|260168111|ref|ZP_05754922.1| glycosyl transferase, family 25 [Brucella sp. F5/99] gi|261757563|ref|ZP_06001272.1| glycosyl transferase [Brucella sp. F5/99] gi|265988042|ref|ZP_06100599.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] gi|225618504|gb|EEH15547.1| Glycosyltransferase 25 family member 2 precursor [Brucella ceti str. Cudo] gi|261737547|gb|EEY25543.1| glycosyl transferase [Brucella sp. F5/99] gi|264660239|gb|EEZ30500.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] Length = 239 Score = 42.0 bits (97), Expect = 0.097, Method: Compositional matrix adjust. Identities = 47/205 (22%), Positives = 77/205 (37%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----AD 96 N RI+ + ++ LS EI C++SH W+ I I+EDD + Sbjct: 43 NKDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 97 FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + + HL I F R+ ++ F RI+ P G Sbjct: 103 FMGAYRAAVDHLEPGGF--IRFTFRDDREHGRE---------VFRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 + +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQLVSYDAAQKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRARAY 236 >gi|301156387|emb|CBW15858.1| unnamed protein product [Haemophilus parainfluenzae T3T1] Length = 256 Score = 41.6 bits (96), Expect = 0.100, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 81/174 (46%), Gaps = 18/174 (10%) Query: 6 YVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + + FS + + E + +F+ K + + R ++ Sbjct: 4 YLISLDKDVQRRELFFAQPDTADFVVFSAINTMQKEWEELA-EVFNPTKFEQHYGRNVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFS-QLLPHLS-KCDINNILI 118 EIGC +SH+ ++++I + A++ EDDA F+ S + + L+ KCD + +LI Sbjct: 63 GEIGCTLSHLAVYRQIVEDQNVTENDYALVCEDDALFNANLSPKTIALLTEKCDADIVLI 122 Query: 119 ------KFDALRKK---PKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 +F+ + + P S+L G+ + P + T Y I K AA Sbjct: 123 GQSKIAEFNDVELEINYPTTFSFLRQTIGDVTVAYPYKSYFAGTVAYLIKKSAA 176 >gi|114704987|ref|ZP_01437895.1| glycosyl transferase, family 25 [Fulvimarina pelagi HTCC2506] gi|114539772|gb|EAU42892.1| glycosyl transferase, family 25 [Fulvimarina pelagi HTCC2506] Length = 261 Score = 41.6 bits (96), Expect = 0.100, Method: Compositional matrix adjust. Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 5/103 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +P Y I+L + RR+ +A R+ L+ S F A+ E+ + F+ ++R + Sbjct: 2 VPAYYINLDRARERRDFMERQAERLGLRLSRFSALSAED--VDETRFTALA--GLWERPM 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + E+ +SH LW+++A A A I EDDA S + L Sbjct: 58 TRVEVAVLLSHASLWQKVADDDAPLA-IFEDDAVLSPRLPEFL 99 >gi|297380029|gb|ADI34916.1| Hypothetical protein HPV225_0845 [Helicobacter pylori v225d] Length = 273 Score = 41.6 bits (96), Expect = 0.10, Method: Compositional matrix adjust. Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 24/187 (12%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YARLMHLLYDASVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTF 220 + P+D M + H + +LV +P + A D STI L+RK F Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHF 264 Query: 221 SPLYFYR 227 L +++ Sbjct: 265 KYLKYWQ 271 >gi|109947390|ref|YP_664618.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] gi|109714611|emb|CAJ99619.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] Length = 349 Score = 41.6 bits (96), Expect = 0.10, Method: Compositional matrix adjust. Identities = 22/44 (50%), Positives = 25/44 (56%), Gaps = 1/44 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK A IILEDD F Q L Sbjct: 82 LTLPELGCYLSHYLLWKECARLNQ-PVIILEDDVALESNFIQAL 124 >gi|50119115|ref|YP_048282.1| putative beta1,4-galactosyltransferase [Pectobacterium atrosepticum SCRI1043] gi|49609641|emb|CAG73074.1| putative beta1,4-galactosyltransferase [Pectobacterium atrosepticum SCRI1043] Length = 248 Score = 41.6 bits (96), Expect = 0.10, Method: Compositional matrix adjust. Identities = 50/170 (29%), Positives = 68/170 (40%), Gaps = 36/170 (21%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS EIGC +SH ++KRI A+ILEDD + S DI L +F Sbjct: 57 LSKGEIGCVLSHQRIYKRILDDDIDYALILEDDVELSQ------------DIKVFLKEFL 104 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRI------------LSPRTTGYFIGKEAAIHLLNVR 169 +++ K K D +L P RI S GY I +AA L+ + Sbjct: 105 SVKDKNKGDVFLLY-PSGLRFLNRRINVSHDYFFYEAYNSSCAHGYIISNKAAKKLIRIN 163 Query: 170 KNIYRPIDMDMKHW-WEHNIPSLVT-----EPGAVYEAIDTNDSTIEESR 213 PI + W W + I L E Y+ +D + STIE R Sbjct: 164 T----PIILVADAWLWFYQISLLKVYVLNKELVRAYD-VDKSLSTIETER 208 >gi|157326121|gb|ABV44293.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLVQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 NAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLDQPVVILEDDITLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|308182985|ref|YP_003927112.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori PeCan4] gi|308065170|gb|ADO07062.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori PeCan4] Length = 273 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 44/175 (25%), Positives = 77/175 (44%), Gaps = 16/175 (9%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YARLMHLLYDASVRSEPLSHKNHEIQEHVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 + P+D M + H + +LV +P + A D STI + P + + Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEELYSPKIALM 259 >gi|157326153|gb|ABV44309.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 1/44 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK +ILEDDA F Q L Sbjct: 75 LTLPELGCYLSHYLLWKECV-KLNQPVVILEDDAMLESNFMQAL 117 >gi|332025628|gb|EGI65790.1| Glycosyltransferase 25 family member [Acromyrmex echinatior] Length = 197 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 20/71 (28%), Positives = 42/71 (59%), Gaps = 1/71 (1%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 KR +++ EIGC++SH +W+++ ++LEDD F F Q + ++ +++N+ + Sbjct: 10 KRPMTMGEIGCFLSHYIVWQKVLEHGYKSVMVLEDDVRFEPFFRQKVDYVL-AELSNLKL 68 Query: 119 KFDALRKKPKK 129 ++D + KK Sbjct: 69 EWDLIYMGRKK 79 >gi|207092597|ref|ZP_03240384.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPKX_438_AG0C1] Length = 232 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 50/167 (29%), Positives = 72/167 (43%), Gaps = 20/167 (11%) Query: 40 ENNPICN-RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADF 97 E +P C + F + + C + + E+GCY SH LW++ I + AI ILEDD Sbjct: 76 EKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIELNEAI--CILEDDIII 131 Query: 98 SDEFSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 D F + L K IN + L+ + K K + NF T Sbjct: 132 EDRFKESLEFCDK-HINELGYIRLMHLEENVAKQKTPVKGVSQILNFKD------GIGTQ 184 Query: 154 GYFIGKEAAIHLLNVR-KNIYRPIDMDM-KHWWEHNIPSLVTEPGAV 198 GY + +AA LL K PID M +H+W H + + V E A+ Sbjct: 185 GYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYW-HGVKNYVLEEFAI 230 >gi|308063385|gb|ADO05272.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Sat464] Length = 273 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 24/187 (12%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YARLMHLLYDASVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTF 220 + P+D M + H + +LV +P + A D STI L+RK F Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHF 264 Query: 221 SPLYFYR 227 L +++ Sbjct: 265 KYLKYWQ 271 >gi|190150353|ref|YP_001968878.1| glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|303253661|ref|ZP_07339799.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307248072|ref|ZP_07530101.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307254912|ref|ZP_07536734.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307259353|ref|ZP_07541080.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|307263696|ref|ZP_07545304.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189915484|gb|ACE61736.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|302647581|gb|EFL77799.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306855470|gb|EFM87644.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306862153|gb|EFM94125.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306866589|gb|EFM98450.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306870950|gb|EFN02686.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 256 Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 6/114 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ISL RRE F +A FS + + E + + R F K + ++ R ++ Sbjct: 4 FLISLDKDAQRRELFFSQADTADFTVFSAINTMNVEQSELEQR-FDFAKFEQRYGRKVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDIN 114 EIGC +SH+ +++ I +I A++ EDD + F Q L L +N Sbjct: 63 GEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLLAANFQQNLTALLNEKLN 116 >gi|255090100|ref|XP_002506971.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] gi|226522245|gb|ACO68229.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] Length = 272 Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust. Identities = 54/205 (26%), Positives = 88/205 (42%), Gaps = 57/205 (27%) Query: 3 IPVYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRI----FSHQKRQCQ 57 + V+VISL S RR + C A +I + + F A+ G ++ + ++ +K++ Sbjct: 35 VHVHVISLARSFERR-RACVEALQIQNISYEIFSAVDGLDDGAFHEREIARYAGKKKRAA 93 Query: 58 FKRL-----------------LSLPE-----------IGCYISHIHLWKRIAYSPAIGAI 89 R SLPE GC +SH+ LW R+ + + Sbjct: 94 LHRTSMMRRDELESARSHYDTASLPEEVRNDLHERLRFGCSMSHVRLWMRLLSTNLPFFV 153 Query: 90 ILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKK----DSYLCTL-PGNF---D 141 ILEDDA +++P+ + + DALR+ PK ++ C + PG + + Sbjct: 154 ILEDDA-------RVVPNFQRRIV-------DALRQLPKDWDLFYAFACDIRPGEYLRGN 199 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLL 166 I Q R T GY I ++ A HLL Sbjct: 200 IRQFRG-GACTLGYAISRKGAEHLL 223 >gi|157326137|gb|ABV44301.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDVTLESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|159185030|ref|NP_355025.2| hypothetical protein Atu2058 [Agrobacterium tumefaciens str. C58] gi|159140304|gb|AAK87810.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 206 Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust. Identities = 20/42 (47%), Positives = 25/42 (59%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 R S PEIGCY+SH+ + A A+ILEDD F D+F Sbjct: 4 RRTSPPEIGCYLSHVACANKFMTGDADIALILEDDVVFEDDF 45 >gi|39936414|ref|NP_948690.1| glycosyl transferase [Rhodopseudomonas palustris CGA009] gi|39650269|emb|CAE28792.1| possible glycosyl transferase [Rhodopseudomonas palustris CGA009] Length = 268 Score = 41.2 bits (95), Expect = 0.13, Method: Compositional matrix adjust. Identities = 46/190 (24%), Positives = 79/190 (41%), Gaps = 10/190 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + + V+SLP + ARR + L + +FDA P N + + + F R L Sbjct: 5 VKIVVLSLPTATARRSQMSAMLGNTSLDWQYFDAHASLMCP--NLHYDGAELRRHFGRNL 62 Query: 63 SLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 S PEI SH + + + + A ++LEDD F +F L + C N + + Sbjct: 63 SPPEIAVCSSHAAILQEFVDRNQADYILVLEDDVIFDTDFP-LDQFCAFCASNGL----E 117 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILS--PRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 +R K + L +D R + Y + + AA L+ ++I +D+ Sbjct: 118 YVRLFGKHYAKAVHLGFFYDRSIVRFTTSPAGAQAYLMSRMAARRFLDSYQSIRATVDLA 177 Query: 180 MKHWWEHNIP 189 M +W +P Sbjct: 178 MDSFWITQLP 187 >gi|254476656|ref|ZP_05090042.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] gi|214030899|gb|EEB71734.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] Length = 256 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 32/119 (26%), Positives = 55/119 (46%), Gaps = 8/119 (6%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ E+GC++SH WK++ S + +ILEDD +F + + ++ L+K D Sbjct: 41 LTPGELGCFLSHRTAWKQVVASQEV-TVILEDDILLCSQFGETIDAVAGRLGGKDLVKLD 99 Query: 122 ALRKKPKKDSYLCTLPGN-FDIHQPRILS--PRTTGYFIGKEAAIHLLNVRKNIYRPID 177 L ++ + LP N R+L P + Y + EAA LL+ + +D Sbjct: 100 WLDMPIRRGA----LPANDSGAELSRLLQPVPSSAAYVLTPEAAEVLLHRSQAFCDTVD 154 >gi|306842412|ref|ZP_07475065.1| glycosyl transferase family protein [Brucella sp. BO2] gi|306287461|gb|EFM58936.1| glycosyl transferase family protein [Brucella sp. BO2] Length = 239 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 N RI+ + ++ LS EI C++SH W+ I I+EDD ++ Sbjct: 43 GNDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 101 FSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + +N++ I+F R + + F RI+ P G Sbjct: 103 FMGAY----RAAVNHLEPGGFIRF-TFRDDREHGREV------FRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 + +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQLVSYDAARKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|261493664|ref|ZP_05990183.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261310664|gb|EEY11848.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 263 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 47/181 (25%), Positives = 75/181 (41%), Gaps = 22/181 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ + ++ F FFDAI + + F+ L+ Sbjct: 4 YVISLTSAQERRKHIEAEFGKQNIPFQFFDAITPDLIKEKAKAFNIDISNTN----LTKG 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAI-ILEDDADFSDEFSQLL--------PHLSKCDINNI 116 EI C +SHI LW +A + I I EDD + +LL H+ K + Sbjct: 60 EIACALSHIALW-HLAKQQNLDYICIFEDDIYLDNNAFELLKTNYIPENTHIVKLET--- 115 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + FD + + K + Y+ N + + GY + + A L+N+ K + PI Sbjct: 116 -LPFDRINRFNKTEKYIL----NRRLFKLNSRHVGMAGYILTNKGAEFLINILKTLNIPI 170 Query: 177 D 177 D Sbjct: 171 D 171 >gi|254718514|ref|ZP_05180325.1| glycosyl transferase family protein [Brucella sp. 83/13] gi|265983483|ref|ZP_06096218.1| glycosyl transferase family 25 [Brucella sp. 83/13] gi|306838322|ref|ZP_07471168.1| glycosyl transferase family protein [Brucella sp. NF 2653] gi|264662075|gb|EEZ32336.1| glycosyl transferase family 25 [Brucella sp. 83/13] gi|306406613|gb|EFM62846.1| glycosyl transferase family protein [Brucella sp. NF 2653] Length = 239 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 N RI+ + ++ LS EI C++SH W+ I I+EDD ++ Sbjct: 43 GNDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 101 FSQLLPHLSKCDINNI----LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + +N++ I+F R + + F RI+ P G Sbjct: 103 FMGAY----RAAVNHLEPGGFIRF-TFRDDREHGREV------FRNEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 + +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQLVSYDAARKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|157326141|gb|ABV44303.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 49/122 (40%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK + +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQ-PVVILEDDIVLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|99078489|ref|YP_611747.1| glycosyl transferase family protein [Ruegeria sp. TM1040] gi|99035627|gb|ABF62485.1| glycosyl transferase family 25 [Ruegeria sp. TM1040] Length = 279 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 43/174 (24%), Positives = 70/174 (40%), Gaps = 10/174 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ-FK 59 M + Y+ISL R K H A + L+ + DA + + +QC+ F Sbjct: 1 MKLATYMISLD-----RAKGRHAAMQATLRDAGLDAEFVSAVDLAETSEEELLQQCKSFG 55 Query: 60 --RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 + + + C +SH +W++ S A A+I EDD S E Q LS L Sbjct: 56 PWGVFAHGNMACTLSHAKVWEQFLASDADVALIFEDDVFISAELRQWTEDLSWWPQGCEL 115 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILS--PRTTGYFIGKEAAIHLLNVR 169 + + R + K + + +LS P + GY + ++ A LL R Sbjct: 116 VNLEFWRSETLKVMVGTSASQHLGRQVAPMLSRNPGSAGYLVSRKGAEALLASR 169 >gi|23501228|ref|NP_697355.1| glycosyl transferase family protein [Brucella suis 1330] gi|148558876|ref|YP_001258359.1| glycosyl transferase family protein [Brucella ovis ATCC 25840] gi|161618304|ref|YP_001592191.1| glycosyl transferase family protein [Brucella canis ATCC 23365] gi|163842608|ref|YP_001627012.1| glycosyl transferase family protein [Brucella suis ATCC 23445] gi|254701140|ref|ZP_05162968.1| glycosyl transferase family protein [Brucella suis bv. 5 str. 513] gi|254703688|ref|ZP_05165516.1| glycosyl transferase family protein [Brucella suis bv. 3 str. 686] gi|254707939|ref|ZP_05169767.1| glycosyl transferase family protein [Brucella pinnipedialis M163/99/10] gi|254709483|ref|ZP_05171294.1| glycosyl transferase family protein [Brucella pinnipedialis B2/94] gi|254713099|ref|ZP_05174910.1| glycosyl transferase family protein [Brucella ceti M644/93/1] gi|254716548|ref|ZP_05178359.1| glycosyl transferase family protein [Brucella ceti M13/05/1] gi|256060468|ref|ZP_05450639.1| glycosyl transferase family protein [Brucella neotomae 5K33] gi|256159022|ref|ZP_05456859.1| glycosyl transferase family protein [Brucella ceti M490/95/1] gi|256254380|ref|ZP_05459916.1| glycosyl transferase family protein [Brucella ceti B1/94] gi|256368780|ref|YP_003106286.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260567060|ref|ZP_05837530.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|261218343|ref|ZP_05932624.1| glycosyl transferase family 25 [Brucella ceti M13/05/1] gi|261221542|ref|ZP_05935823.1| glycosyl transferase [Brucella ceti B1/94] gi|261315430|ref|ZP_05954627.1| glycosyl transferase family 25 [Brucella pinnipedialis M163/99/10] gi|261317005|ref|ZP_05956202.1| glycosyl transferase family 25 [Brucella pinnipedialis B2/94] gi|261320808|ref|ZP_05960005.1| glycosyl transferase family 25 [Brucella ceti M644/93/1] gi|261324459|ref|ZP_05963656.1| glycosyl transferase [Brucella neotomae 5K33] gi|261751674|ref|ZP_05995383.1| glycosyl transferase family 25 protein [Brucella suis bv. 5 str. 513] gi|261754328|ref|ZP_05998037.1| glycosyl transferase family 25 protein [Brucella suis bv. 3 str. 686] gi|265997505|ref|ZP_06110062.1| glycosyl transferase [Brucella ceti M490/95/1] gi|294851709|ref|ZP_06792382.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|23347110|gb|AAN29270.1| glycosyl transferase, family 25 [Brucella suis 1330] gi|148370133|gb|ABQ60112.1| glycosyl transferase, family 25 [Brucella ovis ATCC 25840] gi|161335115|gb|ABX61420.1| glycosyl transferase family 25 [Brucella canis ATCC 23365] gi|163673331|gb|ABY37442.1| glycosyl transferase family 25 [Brucella suis ATCC 23445] gi|255998938|gb|ACU47337.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260156578|gb|EEW91658.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|260920126|gb|EEX86779.1| glycosyl transferase [Brucella ceti B1/94] gi|260923432|gb|EEX90000.1| glycosyl transferase family 25 [Brucella ceti M13/05/1] gi|261293498|gb|EEX96994.1| glycosyl transferase family 25 [Brucella ceti M644/93/1] gi|261296228|gb|EEX99724.1| glycosyl transferase family 25 [Brucella pinnipedialis B2/94] gi|261300439|gb|EEY03936.1| glycosyl transferase [Brucella neotomae 5K33] gi|261304456|gb|EEY07953.1| glycosyl transferase family 25 [Brucella pinnipedialis M163/99/10] gi|261741427|gb|EEY29353.1| glycosyl transferase family 25 protein [Brucella suis bv. 5 str. 513] gi|261744081|gb|EEY32007.1| glycosyl transferase family 25 protein [Brucella suis bv. 3 str. 686] gi|262551973|gb|EEZ07963.1| glycosyl transferase [Brucella ceti M490/95/1] gi|294820298|gb|EFG37297.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 239 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 47/205 (22%), Positives = 77/205 (37%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD----AD 96 N RI+ + ++ LS EI C++SH W+ I I+EDD + Sbjct: 43 NKDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEN 102 Query: 97 FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + + HL I F R+ ++ F RI+ P G Sbjct: 103 FMGAYRAAVDHLEPGGF--IRFTFRDDREHGRE---------VFRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 + +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQLVSYDAAQKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|293392083|ref|ZP_06636417.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D7S-1] gi|290952617|gb|EFE02736.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D7S-1] Length = 258 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 10/126 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQF 58 M + Y++SL RRE F + FS F AI ++ + +F+ K + + Sbjct: 1 MIMNKYLVSLDKDVQRRELFFAQPGTA--DFSVFPAINTMQKDWDALSAVFNPVKFEQHY 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCD 112 R ++ EIGC +SH+ ++++I + A++ EDDA F+ + L L C Sbjct: 59 GRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCT 118 Query: 113 INNILI 118 + ILI Sbjct: 119 ADIILI 124 >gi|116208356|ref|XP_001229987.1| hypothetical protein CHGG_03471 [Chaetomium globosum CBS 148.51] gi|88184068|gb|EAQ91536.1| hypothetical protein CHGG_03471 [Chaetomium globosum CBS 148.51] Length = 380 Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust. Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 6/93 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++V+ LP RR+ +AA ++ F D + G + P ++ K Q KRL Sbjct: 39 IFVVGLPSRTDRRDGMVLQAALSNMDIEFIDGVAGADVP--DKAIPMNKGQ---KRLRDA 93 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 IG + +H++ + + + A+ILEDD D+ Sbjct: 94 -SIGSWRAHMNAIREVVHRNLSSALILEDDVDW 125 >gi|118094236|ref|XP_422290.2| PREDICTED: similar to C1orf17 [Gallus gallus] Length = 627 Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust. Identities = 53/227 (23%), Positives = 94/227 (41%), Gaps = 30/227 (13%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH ++WK + +++EDD F +F + L L DI + Sbjct: 405 RPLTRGEIGCFLSHYYIWKEVVNRGLEKTLVIEDDVRFEHQFKRKLMKLMD-DIEQAQLD 463 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY---------FIGKEAAIHLLN 167 ++ + RK+ + +P ++ + S T GY IG E +L Sbjct: 464 WELIYIGRKRMQVQQPEKAVPNVMNLVEAD-YSYWTLGYAISFQGAQKLIGAEPFSKMLP 522 Query: 168 VRKNI-----YRPIDMDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESR 213 V + + P+ M+++ ++ + EP VY DT STI ++ Sbjct: 523 VDEFLPVMYNKHPVAKYMEYYESRDLKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWDNE 582 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 V + + +++ Q +H A KD P ++ PSS L Sbjct: 583 TVST-DWDRTHSWKSR-QQGQIHSEAQNKDALPPQSSLNAPSSRDEL 627 >gi|261400608|ref|ZP_05986733.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] gi|33413657|gb|AAN08511.1| LgtB [Neisseria lactamica] gi|269209691|gb|EEZ76146.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] Length = 279 Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust. Identities = 28/100 (28%), Positives = 42/100 (42%), Gaps = 2/100 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 E C++SH+ LWK+ + EDD + + L Sbjct: 62 EKACFMSHVVLWKQALDEGLPYIAVFEDDVLLGKDAEKFL 101 >gi|157326127|gb|ABV44296.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 330 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESKRRLNTEKLILESNEKFKGRCV-----FQIFDAISPKHEDFEKLVQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDVTLESHFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|217034127|ref|ZP_03439547.1| hypothetical protein HP9810_868g20 [Helicobacter pylori 98-10] gi|216943411|gb|EEC22867.1| hypothetical protein HP9810_868g20 [Helicobacter pylori 98-10] Length = 273 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 43/161 (26%), Positives = 73/161 (45%), Gaps = 16/161 (9%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YARLMHLLYDANVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 + P+D M + H + +LV +P + A D STI Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|308062146|gb|ADO04034.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Cuz20] Length = 273 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 43/161 (26%), Positives = 72/161 (44%), Gaps = 16/161 (9%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----C 111 + + +SL E+GCY SH LW++ I + + ILEDD ++F + L L K Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPV--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 112 DINNILIKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YVRLMHLLYDASVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 + P+D M + H + +LV +P + A D STI Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|292618105|ref|XP_684212.3| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Danio rerio] Length = 591 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 20/70 (28%), Positives = 37/70 (52%), Gaps = 1/70 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + A+I EDD F F + + L + ++ + + Sbjct: 384 RTLTKGEVGCFLSHYYIWKEMVDMQLDKALIFEDDVRFQANFKRRMMRLME-EVEQVELD 442 Query: 120 FDALRKKPKK 129 +D + KK Sbjct: 443 WDIIYLGRKK 452 >gi|315586508|gb|ADU40889.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 35A] Length = 273 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 48/185 (25%), Positives = 81/185 (43%), Gaps = 24/185 (12%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 ++ + +DA +R +P K+ + G + + T GY I + A + Sbjct: 152 RLMYLLYDANVRSEPLSHKNHEIQERVGIIKAYSHGV---GTQGYVITPKIAKVFKKCSR 208 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSP 222 P+D M + H + +LV +P + A D STI L+RK F Sbjct: 209 KWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKY 266 Query: 223 LYFYR 227 L +++ Sbjct: 267 LKYWQ 271 >gi|261868375|ref|YP_003256297.1| hypothetical protein D11S_1715 [Aggregatibacter actinomycetemcomitans D11S-1] gi|261413707|gb|ACX83078.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D11S-1] Length = 258 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 10/126 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQF 58 M + Y++SL RRE F + FS F AI ++ + +F+ K + + Sbjct: 1 MIMNKYLVSLDKDVQRRELFFAQPGTAD--FSVFPAINTMQKDWDALSAVFNPVKFEQHY 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCD 112 R ++ EIGC +SH+ ++++I + A++ EDDA F+ + L L C Sbjct: 59 GRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCT 118 Query: 113 INNILI 118 + ILI Sbjct: 119 ADIILI 124 >gi|261393257|emb|CAX50883.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis 8013] Length = 280 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH+ LWK+ + EDD + + L Sbjct: 62 EKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAE 103 >gi|18129284|emb|CAD20562.1| putative glycosyltransferase [Brucella abortus] Length = 703 Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust. Identities = 51/196 (26%), Positives = 77/196 (39%), Gaps = 20/196 (10%) Query: 1 MPIPVYVISLPFSHARREKFCH--RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + A Q DA+ G RI R Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTA--TQRIGIDDARFDAI 58 Query: 59 KRLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 LP E GCY +H+ + + +ILEDD F++ S + DI L Sbjct: 59 NGREMLPGEYGCYRNHLKALESFLSDGSPYGLILEDDVVFTETTSARIH-----DIIESL 113 Query: 118 IKFDALR----KKPKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 FD ++ + P S L T G+ IH P+ Y + +E A LL+ Sbjct: 114 PDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQ---GSAAAYLVSREGARKLLSALS 170 Query: 171 NIYRPIDMDMKHWWEH 186 + P D+ M+ +W H Sbjct: 171 TMELPWDVAMERFWHH 186 >gi|317182139|dbj|BAJ59923.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F57] Length = 275 Score = 41.2 bits (95), Expect = 0.17, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 75/184 (40%), Gaps = 18/184 (9%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + I ILEDD + F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKENFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 ++ + +DA K + I + T GY I + A + Sbjct: 150 YARLMHLLYDASVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRK 209 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPL 223 P+D M + H + +LV +P + A D STI L+RK F L Sbjct: 210 WVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYL 267 Query: 224 YFYR 227 +++ Sbjct: 268 KYWQ 271 >gi|207092487|ref|ZP_03240274.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori HPKX_438_AG0C1] Length = 194 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 3/52 (5%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 + +SL E+GCY SH LW++ I + AI ILEDD ++F + L L K Sbjct: 94 KFMSLGELGCYASHYSLWQKCIELNEAI--CILEDDITLKEDFKEGLDFLEK 143 >gi|45768794|gb|AAH68118.1| Glycosyltransferase 25 domain containing 2 [Mus musculus] Length = 625 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L + DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLME-DIDKAQLD 461 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 462 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYAISLEGAQKLVGA 512 >gi|21069194|gb|AAM33868.1|AF470661_2 LgtE [Neisseria meningitidis] Length = 280 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH+ LWK+ + EDD + + L Sbjct: 62 EKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAE 103 >gi|108514913|gb|ABF93251.1| hypothetical protein [Campylobacter jejuni] Length = 257 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 43/187 (22%), Positives = 79/187 (42%), Gaps = 12/187 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL IGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGGIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALRKKPKKDSYLC----TLPGNFDI----HQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 R+ D + +L ++ I H R++ T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPID 177 I P D Sbjct: 181 KIIFPYD 187 >gi|307637518|gb|ADN79968.1| lipo oligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 908] Length = 273 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 14/169 (8%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I I ILEDD ++F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELDEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I + A L + Sbjct: 152 RLMHLLYDASVKSEPLIHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSQKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 P+D M + H + +LV +P + A D STI ++ ++SP Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIAR----KEESYSP 254 >gi|148707510|gb|EDL39457.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Mus musculus] Length = 625 Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L + DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLME-DIDKAQLD 461 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 462 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYAISLEGAQKLVGA 512 >gi|228008340|ref|NP_808424.3| procollagen galactosyltransferase 2 precursor [Mus musculus] gi|160395572|sp|Q6NVG7|GT252_MOUSE RecName: Full=Procollagen galactosyltransferase 2; AltName: Full=Glycosyltransferase 25 family member 2; AltName: Full=Hydroxylysine galactosyltransferase 2; Flags: Precursor Length = 625 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L + DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLME-DIDKAQLD 461 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 462 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYAISLEGAQKLVGA 512 >gi|254360553|ref|ZP_04976702.1| possible glycosyltransferase [Mannheimia haemolytica PHL213] gi|153091093|gb|EDN73098.1| possible glycosyltransferase [Mannheimia haemolytica PHL213] Length = 261 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 12/118 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLLS 63 Y+ISL RRE F + F F AI ++ N F + ++ R + Sbjct: 4 YLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYGRKTT 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQ----LLPHLSKCDI 113 E+GC +SH+ ++++IA I ++ EDDA F+D+F + LL K DI Sbjct: 62 KGEVGCTLSHLAVYQKIAEDSTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKVDI 119 >gi|26343025|dbj|BAC35169.1| unnamed protein product [Mus musculus] Length = 625 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L + DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLME-DIDKAQLD 461 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 462 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYAISLEGAQKLVGA 512 >gi|157326143|gb|ABV44304.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 25/122 (20%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENN---PICNRIF 49 I VY+ISL S R EKF R F FDAI ++ + ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEYFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 Q L+LPE+GCY+SH LWK +ILEDD F Q Sbjct: 57 DAQSLLQSDWYHSYVGTGLTLPELGCYLSHYLLWKECV-KLNQPVVILEDDVALESNFMQ 115 Query: 104 LL 105 L Sbjct: 116 AL 117 >gi|73968112|ref|XP_851283.1| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Canis familiaris] Length = 595 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 32/142 (22%), Positives = 59/142 (41%), Gaps = 9/142 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V+VISL RRE+ + + DA+ G N+ + + Q Sbjct: 322 VFVISLARRPDRRERMLSSLWEMEVSGRVVDAVDGRTLNSSLMRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F +L+ + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARRLARILVFEDDVRFESNFRGRLERLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLP 137 LI + P++++ + LP Sbjct: 442 DLIYLGRKQVNPEEEAAVEGLP 463 >gi|260914134|ref|ZP_05920607.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260631767|gb|EEX49945.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 259 Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust. Identities = 48/187 (25%), Positives = 83/187 (44%), Gaps = 23/187 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLLS 63 ++ISL RRE F + F + AI N +F+ ++ Q ++ R ++ Sbjct: 4 FLISLEKDEYRRELFFQQPDT--QDFEVYSAINTMNETWEKLTALFNVEQFQQRYGRAVT 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEF----SQLLPHLSKCDI-- 113 EIGC +SH+ ++++I AI ++ EDDA F+ F + LL + D+ Sbjct: 62 KGEIGCTLSHLGVYQKIVDDQAISEDEYCLVCEDDALFNQHFLNNVTALLQQERQVDLIL 121 Query: 114 --NNILIKFDALRKK---PKKDSYLCTLPGNFDI---HQPRILSPRTTGYFIGKEAA-IH 164 + ++ FD + P S+LC N + + + T Y I K AA Sbjct: 122 VGQSKILHFDDNELEINYPVTFSWLCKKIANTEFCYAYPYKDYFAGTVAYLIKKSAARSF 181 Query: 165 LLNVRKN 171 L+ V++N Sbjct: 182 LIQVQQN 188 >gi|257465064|ref|ZP_05629435.1| putative glycosyltransferase [Actinobacillus minor 202] gi|257450724|gb|EEV24767.1| putative glycosyltransferase [Actinobacillus minor 202] Length = 256 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 6/108 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + + N +F + + + R ++ Sbjct: 4 YLISLEKDVQRRELFFSQPDTQDFQIFSAINTMSLSSEELEN-MFDVTQFEQHYHRAVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL 108 EIGC +SH+++++ I +I A++ EDD FS F++ L L Sbjct: 63 GEIGCTMSHLNVYQWIVDDESIAEEDYALVCEDDVLFSANFNENLTAL 110 >gi|76156078|gb|AAX27314.2| SJCHGC08517 protein [Schistosoma japonicum] Length = 170 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 7/105 (6%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL-- 123 EIGC++SH ++W + + +ILEDD F+ F + L + K N+ I +D L Sbjct: 1 EIGCFLSHYNIWNEMITNGYNRILILEDDLRFAPAFIRNLNKIIKEADENV-INWDLLYI 59 Query: 124 -RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 RK+ K +P + P + T GY + + AI L+N Sbjct: 60 GRKRMSKSE--KRVPNTTSLAYPD-YTYWTLGYVLNRSGAIKLIN 101 >gi|149636351|ref|XP_001516115.1| PREDICTED: similar to C1orf17 [Ornithorhynchus anatinus] Length = 845 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + + Sbjct: 623 RPLTRGEIGCFLSHYSIWKEVIDRDLEKTLVIEDDVRFEHQFKKKLIKLMD-DIDRVQLD 681 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 682 WELIYIGRKRMQVQEPEKAVPNVVNLVEAD-YSYWTLGYVISLEGAQKLVGA 732 >gi|86150773|ref|ZP_01068989.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni subsp. jejuni 260.94] gi|85841943|gb|EAQ59189.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni subsp. jejuni 260.94] Length = 178 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 53/199 (26%), Positives = 76/199 (38%), Gaps = 57/199 (28%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + V++I+L S R+E + ++ L+F FF AI +N K Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKN-----------K 49 Query: 54 RQCQFK------------RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDE 100 +FK R LS E C+ SH LW+ + I IILEDD +FSDE Sbjct: 50 EHLEFKDHFPWWGSWVLGRELSDGEKACFASHYKLWQECVKLDEPI--IILEDDVEFSDE 107 Query: 101 FSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTL---------PGNFDIHQPRILSPR 151 F +NN D L K + LC L G + + ++ Sbjct: 108 F-----------LNNGAEYIDELLKSKYEYIRLCYLFDKRLYFLSEGGYYLSFEKL--AG 154 Query: 152 TTGYFIGKEAAIHLLNVRK 170 T GY + AA+ L K Sbjct: 155 TQGYVLQVSAAMKFLKYAK 173 >gi|313229149|emb|CBY23734.1| unnamed protein product [Oikopleura dioica] Length = 576 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 29/105 (27%), Positives = 52/105 (49%), Gaps = 13/105 (12%) Query: 5 VYVISLPFSHARREK--FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC------ 56 ++VI+L R+E+ FC R I +++ A+ G+ + I RQ Sbjct: 333 IFVINLERRQDRKERMDFCMRLQGI--KYTLVPAVDGKA-LTQDDIDGLGIRQLPGFSDP 389 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 R ++ E+GC++SH +LW+++ P ++LEDD F +F Sbjct: 390 HKPRTITRGEVGCFLSHYYLWQKMV--PGETYLVLEDDVRFGPDF 432 >gi|315044417|ref|XP_003171584.1| hypothetical protein MGYG_06124 [Arthroderma gypseum CBS 118893] gi|311343927|gb|EFR03130.1| hypothetical protein MGYG_06124 [Arthroderma gypseum CBS 118893] Length = 402 Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust. Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 6/93 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I GE + +R K Sbjct: 69 IYVVNLPSRTDRRDALVLMAALSGIKLHWIDGIMGET--VLDRALPPPASHKTMKN---- 122 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 IG + +HI+ + + + A+ILEDDAD+ Sbjct: 123 ANIGSWRAHINALQDMVENNISSALILEDDADW 155 >gi|297685475|ref|XP_002820323.1| PREDICTED: glycosyltransferase 25 family member 3-like [Pongo abelii] Length = 528 Score = 40.8 bits (94), Expect = 0.20, Method: Compositional matrix adjust. Identities = 28/111 (25%), Positives = 45/111 (40%), Gaps = 9/111 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQ 57 V+VISL RRE+ + + DA+ G N + + +Q Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGQVVDAVDGRMLNSSAIRNLGVDLLPGYQDPYS- 380 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R L+ E+GC++SH +W+ + ++ EDD F F L L Sbjct: 381 -GRTLTKGEVGCFLSHHSIWEEVVARGLARVLVFEDDVRFESNFRGRLERL 430 >gi|3132249|dbj|BAA28126.1| unnamed protein product [Actinobacillus actinomycetemcomitans] Length = 256 Score = 40.8 bits (94), Expect = 0.20, Method: Compositional matrix adjust. Identities = 34/121 (28%), Positives = 58/121 (47%), Gaps = 10/121 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 Y++SL RRE F + FS F AI ++ + +F+ K + + R ++ Sbjct: 4 YLVSLDKDVQRRELFFAQPGTAD--FSVFPAINTMQKDWDALSAVFNPVKFEQHYGRKVT 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL--SKCDINNIL 117 EIGC +SH+ ++++I + A++ EDDA F+ + L L C + IL Sbjct: 62 KGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCTADIIL 121 Query: 118 I 118 I Sbjct: 122 I 122 >gi|83944103|ref|ZP_00956559.1| putative glycosyltransferase [Sulfitobacter sp. EE-36] gi|83844970|gb|EAP82851.1| putative glycosyltransferase [Sulfitobacter sp. EE-36] Length = 281 Score = 40.8 bits (94), Expect = 0.20, Method: Compositional matrix adjust. Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 12/106 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN----PICNR--IFSHQKRQCQF 58 Y+I+L R + + A + L F F A+ G ++ P NR +QK+Q Sbjct: 3 TYLINLERRPDRLSEVSRKMAELGLPFKLFRAVDGHDDDAKLPDLNRHLFVINQKKQP-- 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL 104 EI C SHI +W++ + A+ILEDD D + + + Sbjct: 61 ----VRGEIACAASHIAVWRQFLETDEEYALILEDDIDIAPDLEMV 102 >gi|192292142|ref|YP_001992747.1| glycosyl transferase family 25 [Rhodopseudomonas palustris TIE-1] gi|192285891|gb|ACF02272.1| glycosyl transferase family 25 [Rhodopseudomonas palustris TIE-1] Length = 268 Score = 40.8 bits (94), Expect = 0.20, Method: Compositional matrix adjust. Identities = 47/190 (24%), Positives = 79/190 (41%), Gaps = 10/190 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + + V+SLP + ARR + L + +FDA P N + + + F R L Sbjct: 5 VKIVVLSLPTATARRSQMSAMLGNTSLDWQYFDAHASLMCP--NLHYDGAELRRHFGRNL 62 Query: 63 SLPEIGCYISHIH-LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 S PEI SH L + + + A ++LEDD F +F L + C N + + Sbjct: 63 SPPEIAVCSSHAAILHEFVDRNQADYILVLEDDVIFDTDFP-LDQFCAFCASNGL----E 117 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILS--PRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 +R K + L +D R + Y + + AA L+ ++I +D+ Sbjct: 118 YVRLFGKHYAKAVHLGFFYDRSIVRFTTSPAGAQAYLMSRMAARRFLDSYRSIRATVDLA 177 Query: 180 MKHWWEHNIP 189 M +W +P Sbjct: 178 MDSFWITQLP 187 >gi|313215923|emb|CBY37331.1| unnamed protein product [Oikopleura dioica] Length = 579 Score = 40.8 bits (94), Expect = 0.21, Method: Compositional matrix adjust. Identities = 29/105 (27%), Positives = 52/105 (49%), Gaps = 13/105 (12%) Query: 5 VYVISLPFSHARREK--FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC------ 56 ++VI+L R+E+ FC R I +++ A+ G+ + I RQ Sbjct: 333 IFVINLERRQDRKERMDFCMRLQGI--KYTLVPAVDGKA-LTQDDIDGLGIRQLPGFSDP 389 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 R ++ E+GC++SH +LW+++ P ++LEDD F +F Sbjct: 390 HKPRTITRGEVGCFLSHYYLWQKMV--PGETYLVLEDDVRFGPDF 432 >gi|326924714|ref|XP_003208570.1| PREDICTED: procollagen galactosyltransferase 2-like [Meleagris gallopavo] Length = 552 Score = 40.8 bits (94), Expect = 0.22, Method: Compositional matrix adjust. Identities = 53/227 (23%), Positives = 94/227 (41%), Gaps = 30/227 (13%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH ++WK + +++EDD F +F + L L DI + Sbjct: 330 RPLTRGEIGCFLSHYYIWKEVLNRGLEKTLVIEDDVRFEHQFKRKLMKLMD-DIEQAQLD 388 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY---------FIGKEAAIHLLN 167 ++ + RK+ + +P ++ + S T GY IG E +L Sbjct: 389 WELIYIGRKRMQVQQPEKAVPNVMNLVEAD-YSYWTLGYAISFQGAQKLIGAEPFSKMLP 447 Query: 168 VRKNI-----YRPIDMDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESR 213 V + + P+ M+++ ++ + EP VY DT STI ++ Sbjct: 448 VDEFLPVMYNKHPVAKYMEYYESRDLKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWDNE 507 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 V + + +++ Q +H A KD P ++ PSS L Sbjct: 508 TVST-DWDRTHSWKSR-QQGQIHSEAQNKDALPPQSSLNAPSSRDEL 552 >gi|317178813|dbj|BAJ56601.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F30] Length = 273 Score = 40.8 bits (94), Expect = 0.22, Method: Compositional matrix adjust. Identities = 46/182 (25%), Positives = 75/182 (41%), Gaps = 18/182 (9%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I + A + Sbjct: 152 RLMYLLYDANVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPLYF 225 P+D M + H + +LV +P + A D STI L+RK F L + Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYLKY 269 Query: 226 YR 227 ++ Sbjct: 270 WQ 271 >gi|301787505|ref|XP_002929168.1| PREDICTED: procollagen galactosyltransferase 2-like, partial [Ailuropoda melanoleuca] Length = 630 Score = 40.8 bits (94), Expect = 0.22, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 408 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLTKLMD-DIDQAQLD 466 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 467 WELIYIGRKRMQVQEPEKAVPNVGNLVEAD-YSYWTLGYVISLEGAQKLVGA 517 >gi|160395584|sp|Q7Q021|GLT25_ANOGA RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor Length = 592 Score = 40.4 bits (93), Expect = 0.22, Method: Compositional matrix adjust. Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 7/100 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ------F 58 +Y+I+L RR K + L F A+ G+ +++ R Sbjct: 321 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLS-DKKVYDMGIRFLPGYADPFH 379 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 KR +++ EIGC++SH ++W+R+ ++LEDD F Sbjct: 380 KRPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFE 419 >gi|261492320|ref|ZP_05988882.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261312003|gb|EEY13144.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 268 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 12/118 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLLS 63 Y+ISL RRE F + F F AI ++ N F + ++ R + Sbjct: 11 YLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYGRKTT 68 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQ----LLPHLSKCDI 113 E+GC +SH+ ++++IA I ++ EDDA F+D+F + LL K DI Sbjct: 69 KGEVGCTLSHLAVYQKIAEDRTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKVDI 126 >gi|293341373|ref|XP_001070927.2| PREDICTED: glycosyltransferase 25 domain containing 2 [Rattus norvegicus] gi|293352770|ref|XP_222718.5| PREDICTED: glycosyltransferase 25 domain containing 2 [Rattus norvegicus] gi|149058405|gb|EDM09562.1| glycosyltransferase 25 domain containing 2 (predicted) [Rattus norvegicus] Length = 625 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMD-DIDKARLD 461 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 462 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYVISLEGAQKLVGA 512 >gi|281346524|gb|EFB22108.1| hypothetical protein PANDA_019266 [Ailuropoda melanoleuca] Length = 635 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 413 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLTKLMD-DIDQAQLD 471 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 472 WELIYIGRKRMQVQEPEKAVPNVGNLVEAD-YSYWTLGYVISLEGAQKLVGA 522 >gi|158300399|ref|XP_320324.3| AGAP012208-PA [Anopheles gambiae str. PEST] gi|157013141|gb|EAA00118.3| AGAP012208-PA [Anopheles gambiae str. PEST] Length = 554 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 7/100 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ------F 58 +Y+I+L RR K + L F A+ G+ +++ R Sbjct: 321 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLS-DKKVYDMGIRFLPGYADPFH 379 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 KR +++ EIGC++SH ++W+R+ ++LEDD F Sbjct: 380 KRPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFE 419 >gi|226954450|ref|ZP_03824914.1| glycosyl transferase [Acinetobacter sp. ATCC 27244] gi|226834799|gb|EEH67182.1| glycosyl transferase [Acinetobacter sp. ATCC 27244] Length = 252 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 52/226 (23%), Positives = 91/226 (40%), Gaps = 11/226 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLEGSDQRLANATAQLQQAGWAFSRFPAYDGRGKALSEFENYDDAAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSK-CDIN 114 R L E+GCY+SH K+ + A ++LEDD +F + + L+ +L + ++ Sbjct: 61 RSLISSELGCYLSHYGCAKKFLETDADYLVVLEDDIQVLPNFKQKLNSLISYLDQHKELE 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 ++ A +KK KD + + G + I R G + A + + K + Sbjct: 121 WYVVNLAAKKKKLAKD--IVQIDG-YTIWHAYYFPIRGVGLVWSRAGAEAFVELGKTMQV 177 Query: 175 PIDMDMKHWWEHNIPSL-VTEPGAVYEAIDTND-STIEESRLVRKP 218 P+D+ + W N L V +P ID++ T+ + RK Sbjct: 178 PVDIFFQSWLSKNGKGLGVWQPFVQPAGIDSDILGTVATQGIQRKA 223 >gi|114769884|ref|ZP_01447494.1| putative beta1,4-galactosyltransferase [alpha proteobacterium HTCC2255] gi|114549589|gb|EAU52471.1| putative beta1,4-galactosyltransferase [alpha proteobacterium HTCC2255] Length = 263 Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust. Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 7/108 (6%) Query: 3 IPVYVISLPFSHARREKFCH-RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + +YVISL ++ R+KF +A L AI + N I + Q + R Sbjct: 1 MKIYVISLE-NYRDRQKFQQEQAMAFGLDIEIVSAI--DANSISEKKL--QDAANNWSRP 55 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 ++ ++ C++SH W+ I + A+I+EDD F+ + ++L H+S Sbjct: 56 INAKDVACFLSHKKTWE-IIHDKGERALIIEDDIVFNQQIVEVLNHIS 102 >gi|307213488|gb|EFN88897.1| Glycosyltransferase 25 family member [Harpegnathos saltator] Length = 195 Score = 40.4 bits (93), Expect = 0.24, Method: Compositional matrix adjust. Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 4/64 (6%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 KR ++ EIGC++SH +W+++ ++LEDD F F Q +L LS DI Sbjct: 10 KRPMTTGEIGCFLSHYVIWQKVLEHGYKDVMVLEDDVRFEPFFRQKVRYVLTELSDLDIK 69 Query: 115 NILI 118 L+ Sbjct: 70 WDLV 73 >gi|208434716|ref|YP_002266382.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori G27] gi|208432645|gb|ACI27516.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori G27] Length = 284 Score = 40.4 bits (93), Expect = 0.24, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 72/165 (43%), Gaps = 18/165 (10%) Query: 62 LSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 + E+GCY SH LW++ I + AI ILEDD + F + L + IN + Sbjct: 97 MGFGELGCYASHYSLWQKCIELNEAI--CILEDDIIVKERFKESLEFCYQ-HINELGYIR 153 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRP 175 L+ + K K + NF T GY + +AA LL K P Sbjct: 154 LMHLEENVAKQKTPVKGVSQILNFKD------GIGTQGYVLAPKAAQKLLKYSTKEWVMP 207 Query: 176 IDMDM-KHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRKP 218 ID M +H+W H + + V E A+ + ++ +S E+ R + P Sbjct: 208 IDCVMDRHYW-HGVKNYVLEEFAITCDEMNAQNSNTEKQRPKKLP 251 >gi|73960467|ref|XP_859640.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 2 [Canis familiaris] Length = 651 Score = 40.4 bits (93), Expect = 0.24, Method: Compositional matrix adjust. Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 7/113 (6%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ + + ++ Sbjct: 429 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 488 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R + K+ GN S T GY I E A L+ Sbjct: 489 ELIYIGRKRMQVKEPEKAVPNVGNL---VEADYSYWTLGYVISLEGAQKLVGA 538 >gi|21069183|gb|AAM33860.1|AF470658_3 LgtE [Neisseria meningitidis] gi|21069191|gb|AAM33866.1|AF470660_3 LgtE [Neisseria meningitidis] Length = 280 Score = 40.4 bits (93), Expect = 0.25, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRMMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH+ LWK+ + EDD + + L Sbjct: 62 EKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAE 103 >gi|229845713|ref|ZP_04465835.1| diadenosine tetraphosphatase [Haemophilus influenzae 6P18H1] gi|229811355|gb|EEP47062.1| diadenosine tetraphosphatase [Haemophilus influenzae 6P18H1] Length = 302 Score = 40.4 bits (93), Expect = 0.25, Method: Compositional matrix adjust. Identities = 51/199 (25%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 ---------------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 + R+L+ E GC ISH LW + I EDD + Sbjct: 63 INQSINQSNSILHNIEKSRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENA 122 Query: 102 SQLLPH----LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTT 153 L ++ D N+I I +P K +P NFDI + T Sbjct: 123 EVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TA 180 Query: 154 GYFIGKEAAIHLLNVRKNI 172 GY I + AA +++ KNI Sbjct: 181 GYIISQGAAKYVIEYLKNI 199 >gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial [Saccoglossus kowalevskii] Length = 576 Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust. Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 9/119 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +Y+I+L RRE+ + + + DA+ G+ + K F Sbjct: 339 IYMINLLRRPDRRERMVAAFDLLGIDATLVDAVDGQELTDEKVLELGIKMLPGFADPYHG 398 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDIN 114 R L+ EIGC++SH ++W+ + ++LEDD F +F + ++ L + IN Sbjct: 399 RELTKGEIGCFLSHYNIWQEVVSKNYSKVMVLEDDVRFKFKFLERLNAMMAELKELHIN 457 >gi|148825173|ref|YP_001289926.1| diadenosine tetraphosphatase [Haemophilus influenzae PittEE] gi|148715333|gb|ABQ97543.1| diadenosine tetraphosphatase [Haemophilus influenzae PittEE] Length = 302 Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust. Identities = 51/199 (25%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 ---------------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 + R+L+ E GC ISH LW + I EDD + Sbjct: 63 INQSINQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENA 122 Query: 102 SQLLPH----LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTT 153 L ++ D N+I I +P K +P NFDI + T Sbjct: 123 EVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TA 180 Query: 154 GYFIGKEAAIHLLNVRKNI 172 GY I + AA +++ KNI Sbjct: 181 GYIISQGAAKYVIEYLKNI 199 >gi|15602374|ref|NP_245446.1| hypothetical protein PM0509 [Pasteurella multocida subsp. multocida str. Pm70] gi|12720769|gb|AAK02593.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 261 Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust. Identities = 31/109 (28%), Positives = 52/109 (47%), Gaps = 8/109 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI--YGENNPICNRIFSHQKRQCQFKRLLS 63 ++ISL HARRE F A F+ F A+ E+ +F + Q + R ++ Sbjct: 4 WLISLEKDHARREHFF--AQPDTADFTVFSAMNTMQESWENLTALFDLNRFQQYYGRAVT 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL 108 E+GC +SH+ ++++I I ++ EDD F+ F + L L Sbjct: 62 KGEVGCTLSHLAVYQQIVGDATISENDYCLVCEDDVLFNQGFQRHLETL 110 >gi|261496106|ref|ZP_05992514.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308208|gb|EEY09503.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. OVINE] Length = 261 Score = 40.4 bits (93), Expect = 0.27, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 12/118 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRLLS 63 Y+ISL RRE F + F F AI ++ N F + ++ R + Sbjct: 4 YLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYGRKTT 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQ----LLPHLSKCDI 113 E+GC +SH+ ++++IA I ++ EDDA F+D+F + LL K DI Sbjct: 62 KGEVGCTLSHLAVYQKIAEDRTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKVDI 119 >gi|73960471|ref|XP_849763.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 1 [Canis familiaris] Length = 626 Score = 40.4 bits (93), Expect = 0.27, Method: Compositional matrix adjust. Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 7/113 (6%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ + + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R + K+ GN S T GY I E A L+ Sbjct: 464 ELIYIGRKRMQVKEPEKAVPNVGNLVEAD---YSYWTLGYVISLEGAQKLVGA 513 >gi|325996107|gb|ADZ51512.1| putative lipopolysaccharide biosynthesis protein/Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997703|gb|ADZ49911.1| putative lipo oligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 2017] Length = 273 Score = 40.4 bits (93), Expect = 0.27, Method: Compositional matrix adjust. Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 10/156 (6%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I I ILEDD ++F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELDEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I + A L + Sbjct: 152 RLMHLLYDASVKSEPLIHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSQKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 P+D M + H + +LV +P + A D STI Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTI 245 >gi|306844961|ref|ZP_07477542.1| glycosyl transferase family protein [Brucella sp. BO1] gi|306274593|gb|EFM56382.1| glycosyl transferase family protein [Brucella sp. BO1] Length = 239 Score = 40.4 bits (93), Expect = 0.29, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 77/205 (37%), Gaps = 25/205 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 N RI+ + ++ LS EI C++SH W+ I I+EDD ++ Sbjct: 43 GNDEIARIYKRRLHTPRYPFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEI 102 Query: 101 F----SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY- 155 F + HL + D R+ ++ F RI+ P G Sbjct: 103 FMGAYRAAVDHLEPGGFIRFTFRDD--REHGRE---------VFRDEAVRIIIPNPIGLG 151 Query: 156 ----FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIE 210 + +AA LL++ + RP+D ++ W + L PG V E + +TI+ Sbjct: 152 MVAQLVSYDAARKLLDITQTFDRPVDTTVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQ 211 Query: 211 E----SRLVRKPTFSPLYFYRNTCY 231 S + + P+Y R Y Sbjct: 212 HKKNFSDKLAREILRPIYRMRVRAY 236 >gi|73960469|ref|XP_859668.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 3 [Canis familiaris] Length = 643 Score = 40.0 bits (92), Expect = 0.29, Method: Compositional matrix adjust. Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 7/113 (6%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ + + ++ Sbjct: 421 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 480 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R + K+ GN S T GY I E A L+ Sbjct: 481 ELIYIGRKRMQVKEPEKAVPNVGNL---VEADYSYWTLGYVISLEGAQKLVGA 530 >gi|301630121|ref|XP_002944176.1| PREDICTED: glycosyltransferase 25 family member 3-like [Xenopus (Silurana) tropicalis] Length = 590 Score = 40.0 bits (92), Expect = 0.29, Method: Compositional matrix adjust. Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 1/64 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +WK I A++ EDD F F + + L DI + + Sbjct: 382 RTLTKGEVGCFLSHFQIWKEITERQLDTAVVFEDDVRFQPFFKRKMIRL-MGDIRSAELD 440 Query: 120 FDAL 123 +D + Sbjct: 441 WDLI 444 >gi|157164803|ref|YP_001467299.1| ExsB [Campylobacter concisus 13826] gi|112801958|gb|EAT99302.1| putative glycosyl transferase, family 25 [Campylobacter concisus 13826] Length = 253 Score = 40.0 bits (92), Expect = 0.31, Method: Compositional matrix adjust. Identities = 62/233 (26%), Positives = 92/233 (39%), Gaps = 36/233 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRL 61 +Y+ISL RRE + + F DAI G N +I S + + ++ Sbjct: 3 EIYLISLEKDTKRRELLQQKFGS-YDSFKLIDAIDGRELNAREYYKIISPSFKA--YGKV 59 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILI-- 118 LS E+GC +SH+ ++ S A A+I EDD D + SK N++LI Sbjct: 60 LSPAEVGCSLSHVKAYEAFLASDAKFALIFEDDVIGGDHAIKEAFLAASKMPENSVLICG 119 Query: 119 -------KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 +F A KK DS L H + R Y + K++A +LL + K Sbjct: 120 MQDGLEGRFSAFGKKV--DSSLSKPLWQVSKHSFSSI-YRAGAYVLTKKSAKNLLEIHKR 176 Query: 172 IYRPIDM--------DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ DM+ ++ L P D + S IE RL R Sbjct: 177 ALCTTDVWDYLLGVNDMQMYF----CDLFAHP------TDLSGSNIEGERLER 219 >gi|294651467|ref|ZP_06728780.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194] gi|292822617|gb|EFF81507.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194] Length = 252 Score = 40.0 bits (92), Expect = 0.31, Method: Compositional matrix adjust. Identities = 52/226 (23%), Positives = 90/226 (39%), Gaps = 11/226 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLEGSDQRLANATAQLQQAGWAFSRFPAYDGRGKALSEFENYDDAAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSK-CDIN 114 R L E+GCY+SH K+ + A ++LEDD +F + L+ +L + ++ Sbjct: 61 RSLISSELGCYLSHYGCAKKFLETDADYLVVLEDDIQVLPNFKQNLNSLISYLDQHKELE 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 ++ A +KK KD + + G + I R G + A + + K + Sbjct: 121 WYVVNLAAKKKKLAKD--IVQIDG-YTIWHAYYFPIRGVGLVWSRAGAEAFVELGKTMQV 177 Query: 175 PIDMDMKHWWEHNIPSL-VTEPGAVYEAIDTND-STIEESRLVRKP 218 P+D+ + W N L V +P ID++ T+ + RK Sbjct: 178 PVDIFFQSWLSKNGKGLGVWQPFVQPAGIDSDILGTVATQGIKRKA 223 >gi|319897736|ref|YP_004135933.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae F3031] gi|317433242|emb|CBY81617.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae F3031] Length = 282 Score = 40.0 bits (92), Expect = 0.32, Method: Compositional matrix adjust. Identities = 51/180 (28%), Positives = 74/180 (41%), Gaps = 18/180 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDA----IYGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F FFDA I E N F K Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITFDRSS-----KAT 89 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C +SHI LW + + I EDD + +LL ++ + ++K + Sbjct: 90 LCDAEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLE 148 Query: 122 ALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVR-KNIYRPID 177 A K K + C N +I + + GY I + A +LL N+R K +Y ID Sbjct: 149 AHGKIIYGKCEQIKC----NRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAID 204 >gi|149629292|ref|XP_001516534.1| PREDICTED: similar to Cerebral endothelial cell adhesion molecule 1 [Ornithorhynchus anatinus] Length = 431 Score = 40.0 bits (92), Expect = 0.33, Method: Compositional matrix adjust. Identities = 17/49 (34%), Positives = 27/49 (55%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R L+ E+GC++SH +W+ +A ++LEDD F F + L L Sbjct: 219 RTLTKGEVGCFLSHHAVWEEVAARGLGQVVVLEDDVRFEANFKRRLERL 267 >gi|158302599|ref|XP_561137.5| Anopheles gambiae str. PEST AGAP012933-PA [Anopheles gambiae str. PEST] gi|157021089|gb|EAL42272.3| AGAP012933-PA [Anopheles gambiae str. PEST] Length = 330 Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust. Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 7/100 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ------F 58 +Y+I+L RR K + L F A+ G+ +++ R Sbjct: 175 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLS-DKKVYDMGIRFLPGYADPFH 233 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 KR +++ EIGC++SH ++W+R+ ++LEDD F Sbjct: 234 KRPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFE 273 >gi|324508587|gb|ADY43624.1| Glycosyltransferase 25 family member [Ascaris suum] Length = 539 Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust. Identities = 39/168 (23%), Positives = 71/168 (42%), Gaps = 8/168 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPIC-NRIFSHQKRQCQFKR 60 +Y+I+L RR+K + + + A G EN + + +KR Sbjct: 294 IYLINLERRQERRQKMSEILKLMGIDHKLWRATDGKLLENEEFAADVVLLPGYEDPYYKR 353 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + EIGC++SH +W+ + I+ EDD F + +L L + D+++ + + Sbjct: 354 PMKTGEIGCFLSHYRIWRDVIDKSFERVIVFEDDLRFILNATNMLTELIE-DLDHTALPW 412 Query: 121 DA--LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 D L +K + + +PG + S T GY + + A LL Sbjct: 413 DLVYLGRKRLESARENWVPGQRHLSTVG-YSYWTLGYMLSQSGAKKLL 459 >gi|240948365|ref|ZP_04752751.1| putative glycosyltransferase [Actinobacillus minor NM305] gi|240297404|gb|EER47945.1| putative glycosyltransferase [Actinobacillus minor NM305] Length = 256 Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust. Identities = 30/108 (27%), Positives = 53/108 (49%), Gaps = 6/108 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+ISL RRE F + Q FS + + + + ++F + + + R ++ Sbjct: 4 YLISLEKDVQRRELFFSQPDTQDFQIFSAINTMSLSSEELA-KMFDVTQFEQHYHRAVTK 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHL 108 EIGC +SH+ +++ I +I A++ EDD FS F++ L L Sbjct: 63 GEIGCTMSHLKVYQWIVDDESIAEEDYALVCEDDVLFSANFNENLTAL 110 >gi|194210402|ref|XP_001489806.2| PREDICTED: glycosyltransferase 25 domain containing 2 [Equus caballus] Length = 578 Score = 40.0 bits (92), Expect = 0.35, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 356 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMD-DIDRAQLD 414 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 415 WELIYIGRKRMQVKEPEKAVPNVVNLVEAD-YSYWTLGYVISLEGAQKLVGA 465 >gi|297281262|ref|XP_002802062.1| PREDICTED: procollagen galactosyltransferase 2-like [Macaca mulatta] Length = 626 Score = 40.0 bits (92), Expect = 0.36, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMD-DIDQAQLD 462 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 WELIYIGRKRMQVKEPEKAVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|308184613|ref|YP_003928746.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori SJM180] gi|308060533|gb|ADO02429.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori SJM180] Length = 273 Score = 39.7 bits (91), Expect = 0.38, Method: Compositional matrix adjust. Identities = 47/187 (25%), Positives = 81/187 (43%), Gaps = 24/187 (12%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCD 112 + + +SL E+GCY SH LW++ I + + ILEDD + F + L H+ + Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPV--CILEDDITLKEGFKEGLDFLEKHIQELG 149 Query: 113 INNIL-IKFDA-LRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + +DA +R +P K+ + G + + T GY I + A Sbjct: 150 YARLMHLLYDASVRSEPLSHKNHEIQEHVGIIKAYSHGV---GTQGYVITPKIAKVFKKC 206 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTF 220 + P+D M + H + +LV +P + A D STI L+RK F Sbjct: 207 SRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHF 264 Query: 221 SPLYFYR 227 L +++ Sbjct: 265 KYLKYWQ 271 >gi|157326157|gb|ABV44311.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 39.7 bits (91), Expect = 0.39, Method: Compositional matrix adjust. Identities = 20/44 (45%), Positives = 24/44 (54%), Gaps = 1/44 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 75 LTLPELGCYLSHYLLWKECV-KLDQPVVILEDDVTLEPHFMQAL 117 >gi|332230643|ref|XP_003264502.1| PREDICTED: procollagen galactosyltransferase 2 [Nomascus leucogenys] Length = 626 Score = 39.7 bits (91), Expect = 0.39, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+ + Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRQLEKTLVIEDDVRFEHQFKKKLMKLMD-DIDQAQLD 462 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 WELIYIGRKRMQVKEPEKAVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|7465084|pir||C64597 lipopolysacharide biosynthesis glycosyl transferase - Helicobacter pylori (strain 26695) Length = 404 Score = 39.7 bits (91), Expect = 0.39, Method: Compositional matrix adjust. Identities = 40/123 (32%), Positives = 47/123 (38%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFK---------RLLSLP-EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K R LP E GCY+SH LWK + +ILEDDA F Sbjct: 57 DAQSMLKSDWFHSDWCRGELLPQEFGCYLSHYFLWKECVKTNQ-PVVILEDDAMLESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|38569122|gb|AAR24271.1| glycosyltransferase [Shigella boydii] Length = 261 Score = 39.7 bits (91), Expect = 0.40, Method: Compositional matrix adjust. Identities = 49/191 (25%), Positives = 82/191 (42%), Gaps = 13/191 (6%) Query: 4 PVYVISLPFSHARREKFCHR-AARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++SL + RR K + L FSFF+ IYG+ + + I+S + Q R Sbjct: 5 SIKILSLADAIERRRKVSREFELKSVLPFSFFNGIYGKTLSKEKLDTIYSSELAQKVLHR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIK 119 L+ EIG SH +++ ++LEDD F +L L K + + I Sbjct: 65 QLTAGEIGATYSHYLIFREAYERGDEFVVVLEDDCYIDRNFDTVLTSILDKKNPTDDEII 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG---YFIGKEAAIHLLNVRKNI---Y 173 F +++ ++S++ G+ I++ L R G YF+G I K + Y Sbjct: 125 F--IQRHTSENSHIIRSLGSEKINKNYTL-HRMLGSAQYFVGAYGYIVTRAAMKKMIDTY 181 Query: 174 RPIDMDMKHWW 184 PI HW+ Sbjct: 182 LPIYCVCDHWY 192 >gi|311978056|ref|YP_003987176.1| putative glycosyltransferase [Acanthamoeba polyphaga mimivirus] gi|82000081|sp|Q5UQ62|YR655_MIMIV RecName: Full=Putative glycosyltransferase R655 gi|55417266|gb|AAV50916.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308204975|gb|ADO18776.1| putative glycosyltransferase [Acanthamoeba polyphaga mimivirus] Length = 324 Score = 39.7 bits (91), Expect = 0.40, Method: Compositional matrix adjust. Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 12/115 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGEN----NPICNRIFSHQKRQCQ 57 + + VI+L RRE + ++ Q+ FF+A GE +PI +F H Sbjct: 4 LKIIVINLKRRTDRREIMEKKFQDENITQYEFFEAFDGETLRPEDPILG-VFKHGVHGLS 62 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAY-SPAIGAIILEDDADFSDEFSQLLPHLSKC 111 K + GC +SH +W++IA + ++LEDD +F F + L + K Sbjct: 63 RKGVA-----GCALSHYTVWQKIAADTSGTKYLVLEDDINFKPNFKENLSKVMKT 112 >gi|16272494|ref|NP_438708.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] gi|260580997|ref|ZP_05848820.1| diadenosine tetraphosphatase [Haemophilus influenzae RdAW] gi|1170778|sp|Q03974|LIC2A_HAEIN RecName: Full=Lipooligosaccharide biosynthesis protein lex-1 gi|1573535|gb|AAC22208.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] gi|260092356|gb|EEW76296.1| diadenosine tetraphosphatase [Haemophilus influenzae RdAW] Length = 302 Score = 39.7 bits (91), Expect = 0.41, Method: Compositional matrix adjust. Identities = 51/199 (25%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQC---- 56 I VIS+ + RR+ + L FSFF+A Y N N+ + Q Sbjct: 3 AIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQS 62 Query: 57 ---------------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 + R+L+ E GC ISH LW + I EDD + Sbjct: 63 INQSINQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENFEYLKIFEDDVILGENA 122 Query: 102 SQLLPH----LSKCDINNILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTT 153 L ++ D N+I I +P K +P NFDI + T Sbjct: 123 EVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWG--TA 180 Query: 154 GYFIGKEAAIHLLNVRKNI 172 GY I + AA +++ KNI Sbjct: 181 GYIISQGAAKYVIEYLKNI 199 >gi|332673581|gb|AEE70398.1| probable lipopolysaccharide biosynthesis protein [Helicobacter pylori 83] Length = 188 Score = 39.7 bits (91), Expect = 0.44, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 47/123 (38%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R A F FDAI ++ + Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCA-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLLREFGCYLSHYLLWKECVKTDQ-PVVILEDDVVLESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|296113314|ref|YP_003627252.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis RH4] gi|295921008|gb|ADG61359.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis RH4] Length = 254 Score = 39.7 bits (91), Expect = 0.45, Method: Compositional matrix adjust. Identities = 30/116 (25%), Positives = 49/116 (42%), Gaps = 21/116 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGE--NNPICNRIFSHQK 53 I +VIS+ + RRE + + F FFDA+ Y + + PI N Sbjct: 2 IQNFVISIKTATKRREHIMCEFGKQGIAFEFFDAVTPTDISKYAQKLSIPIIN------- 54 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 + L+ E C++SH+ LW+++ I EDD ++ Q L L+ Sbjct: 55 -----NQRLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELT 105 >gi|157959946|ref|YP_001499980.1| glycosyl transferase family protein [Shewanella pealeana ATCC 700345] gi|157844946|gb|ABV85445.1| glycosyl transferase family 25 [Shewanella pealeana ATCC 700345] Length = 247 Score = 39.7 bits (91), Expect = 0.46, Method: Compositional matrix adjust. Identities = 44/172 (25%), Positives = 68/172 (39%), Gaps = 39/172 (22%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL RR H + + F FFDAI P+ N +++ + LS Sbjct: 4 FVISLVTEDRRRSHISHEFSSQAIPFEFFDAI----TPVQNEAEANRLQILTQGTTLSKG 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E C +SH+ + ++I A I EDD HLSK DA Sbjct: 60 ETSCLLSHVAILQKIVDESIPYAAIFEDDI-----------HLSK----------DAYH- 97 Query: 126 KPKKDSYLCT---LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 YLC+ +P + ++ + + + G F+G H L K ++R Sbjct: 98 ------YLCSENWIPDSVELVKLEMFNHIAKGQFLG----THKLESGKELFR 139 >gi|198415096|ref|XP_002129882.1| PREDICTED: similar to GLT25D1 protein [Ciona intestinalis] Length = 594 Score = 39.7 bits (91), Expect = 0.46, Method: Compositional matrix adjust. Identities = 33/129 (25%), Positives = 60/129 (46%), Gaps = 16/129 (12%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 +R L+ EIGC++SH +W+ + + I+ EDD F F++ L ++ + +I + + Sbjct: 409 ERTLTRGEIGCFLSHYFIWQEVVKNKLDQVIVFEDDLRFEISFNRRLKNVMQ-EIEDAKL 467 Query: 119 KFDALR--------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 ++D + K+P+K C L D S T Y + A LL K Sbjct: 468 EWDLIYIGRKRMQIKEPEKSVPDCPLLVEAD------YSYWTLAYILSNSGAQKLL-AGK 520 Query: 171 NIYRPIDMD 179 ++R + +D Sbjct: 521 PLHRMVPVD 529 >gi|316973139|gb|EFV56766.1| glycosyltransferase 25 family member 1 [Trichinella spiralis] Length = 372 Score = 39.7 bits (91), Expect = 0.46, Method: Compositional matrix adjust. Identities = 19/57 (33%), Positives = 34/57 (59%), Gaps = 1/57 (1%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCDIN 114 KR ++L E+GC++SH ++W+ + A+I EDD F+ F Q+ +++ D N Sbjct: 45 KRPMTLGEVGCFLSHYNVWRDMLDRGYRRAVIFEDDLRFTRSFRRQVGVVMAELDAN 101 >gi|157326133|gb|ABV44299.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 51/123 (41%), Gaps = 27/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIF 49 I VY+ISL S R EKF R F FDAI + + + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCI-----FQIFDAISPKHQDFEKLLQELY 56 Query: 50 SHQK------RQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFS 102 Q L+LPE+GCY+SH LWK I + + +ILEDD F Sbjct: 57 DAQSLLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECIKLNQPV--VILEDDITLEPHFM 114 Query: 103 QLL 105 Q L Sbjct: 115 QAL 117 >gi|55960746|emb|CAI14721.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] Length = 234 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 12 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 71 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 72 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 121 >gi|74318875|gb|ABA02576.1| Lgt2B/C [Moraxella catarrhalis] gi|124245099|gb|ABM92441.1| Lgt2B/C [Moraxella catarrhalis] gi|326562971|gb|EGE13250.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 46P47B1] gi|326565312|gb|EGE15492.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 12P80B1] gi|326570192|gb|EGE20237.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC8] Length = 254 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 30/116 (25%), Positives = 49/116 (42%), Gaps = 21/116 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGE--NNPICNRIFSHQK 53 I +VIS+ + RRE + + F FFDA+ Y + + PI N Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAVTPTDISKYAQKLSIPIIN------- 54 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 + L+ E C++SH+ LW+++ I EDD ++ Q L L+ Sbjct: 55 -----NQRLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELT 105 >gi|332811368|ref|XP_524994.3| PREDICTED: procollagen galactosyltransferase 2 [Pan troglodytes] Length = 626 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|16506820|ref|NP_055916.1| procollagen galactosyltransferase 2 precursor [Homo sapiens] gi|74750765|sp|Q8IYK4|GT252_HUMAN RecName: Full=Procollagen galactosyltransferase 2; AltName: Full=Glycosyltransferase 25 family member 2; AltName: Full=Hydroxylysine galactosyltransferase 2; Flags: Precursor gi|12620188|gb|AAG60609.1|AF288389_1 C1orf17 [Homo sapiens] gi|23273043|gb|AAH35672.1| Glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|55957835|emb|CAI17872.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|55960747|emb|CAI14722.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|119611578|gb|EAW91172.1| glycosyltransferase 25 domain containing 2, isoform CRA_c [Homo sapiens] gi|168278659|dbj|BAG11209.1| glycosyltransferase 25 domain-containing protein 2 [synthetic construct] gi|325463379|gb|ADZ15460.1| glycosyltransferase 25 domain containing 2 [synthetic construct] Length = 626 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|145228085|gb|ABP48752.1| beta(1-4) galactosyltransferase [Moraxella catarrhalis] Length = 254 Score = 39.3 bits (90), Expect = 0.51, Method: Compositional matrix adjust. Identities = 30/116 (25%), Positives = 49/116 (42%), Gaps = 21/116 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGE--NNPICNRIFSHQK 53 I +VIS+ + RRE + + F FFDA+ Y + + PI N Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAVTPTDISKYAQKLSIPIIN------- 54 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 + L+ E C++SH+ LW+++ I EDD ++ Q L L+ Sbjct: 55 -----NQRLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELT 105 >gi|119611576|gb|EAW91170.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Homo sapiens] Length = 638 Score = 39.3 bits (90), Expect = 0.52, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 416 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 475 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 476 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 525 >gi|332673582|gb|AEE70399.1| jhp0562 family glycosyltransferase [Helicobacter pylori 83] Length = 332 Score = 39.3 bits (90), Expect = 0.53, Method: Compositional matrix adjust. Identities = 20/44 (45%), Positives = 24/44 (54%), Gaps = 1/44 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 75 LTLPELGCYLSHYLLWKECV-KINQPVVILEDDVALEFNFMQAL 117 >gi|332978467|gb|EGK15180.1| lipooligosaccharide biosynthesis protein LpsA [Psychrobacter sp. 1501(2011)] Length = 255 Score = 39.3 bits (90), Expect = 0.55, Method: Compositional matrix adjust. Identities = 47/215 (21%), Positives = 87/215 (40%), Gaps = 15/215 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I YVISL + RR+ + A+ +F F +A+ + I + F+ L Sbjct: 2 IKSYVISLEKEYERRDHIVKQFAQHDTEFQFLNAV--TPDTTFEEIQTLGLMDLNFES-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH---LSKCDINNILIK 119 + E+ C +SH+ LW I EDD +F++L+ L DI +K Sbjct: 59 TAGELACLLSHLKLWHLAVEENYDYIAIFEDDIHLGSDFNKLMNDSSWLKDLDI----VK 114 Query: 120 FDALRKKPKKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPI 176 + R + + G N +H + + T GY + + A +L+ K + I Sbjct: 115 LEKFRPTVELSFNSIKIEGTNRKLHILKGKNLGTGGYILSLKGANYLIRYFKRLGYVEAI 174 Query: 177 DMDM--KHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 D + + + ++P +P V + N+ ++ Sbjct: 175 DAVLFNERKYPKDLPIYQLQPAVVIQDSKLNEQSV 209 >gi|152979041|ref|YP_001344670.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150840764|gb|ABR74735.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 292 Score = 39.3 bits (90), Expect = 0.55, Method: Compositional matrix adjust. Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 8/92 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAI---YGENNPICNRIFSHQKRQCQFKRLL 62 YVISLP + +RR+ + + ++ F FFDAI G N I + + Q+ + L Sbjct: 26 YVISLPTAVSRRQHIRNVFQQQNVPFEFFDAISPDIGLNAAIDRFVPNLQQAEH-----L 80 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDD 94 + E C++SH+ LW++ I EDD Sbjct: 81 TAGEKSCFMSHVSLWQKCLDDNLPYIAIFEDD 112 >gi|148707509|gb|EDL39456.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Mus musculus] Length = 469 Score = 39.3 bits (90), Expect = 0.56, Method: Compositional matrix adjust. Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 1/64 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L + DI+ + Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLME-DIDKAQLD 461 Query: 120 FDAL 123 ++ + Sbjct: 462 WELI 465 >gi|119611577|gb|EAW91171.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Homo sapiens] gi|193787801|dbj|BAG53004.1| unnamed protein product [Homo sapiens] Length = 506 Score = 39.3 bits (90), Expect = 0.58, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 284 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 343 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 344 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 393 >gi|261837943|gb|ACX97709.1| beta-4-galactosyltransferase [Helicobacter pylori 51] Length = 273 Score = 39.3 bits (90), Expect = 0.59, Method: Compositional matrix adjust. Identities = 45/182 (24%), Positives = 74/182 (40%), Gaps = 18/182 (9%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +S E+GCY SH LW++ I + I ILEDD ++F + L H+ + Sbjct: 94 KFMSFGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I + A + Sbjct: 152 RLMYLLYDANVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPLYF 225 P+D M + H + +LV +P + A D STI L+RK F L + Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSSKIALMRKLHFKYLKY 269 Query: 226 YR 227 ++ Sbjct: 270 WQ 271 >gi|325919939|ref|ZP_08181924.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325920508|ref|ZP_08182433.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325548991|gb|EGD19920.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325549555|gb|EGD20424.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] Length = 269 Score = 39.3 bits (90), Expect = 0.60, Method: Compositional matrix adjust. Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL N +++ Sbjct: 67 RKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLRSTEWLPAGNSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K DS + + G Q R++ P T Sbjct: 127 LEPSYNRIKLDSKVADIGG----RQLRLVRPST 155 >gi|193785321|dbj|BAG54474.1| unnamed protein product [Homo sapiens] Length = 234 Score = 39.3 bits (90), Expect = 0.60, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 12 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMVNIDQAQLDW 71 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 72 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 121 >gi|260462736|ref|ZP_05810941.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] gi|259031380|gb|EEW32651.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] Length = 281 Score = 39.3 bits (90), Expect = 0.61, Method: Compositional matrix adjust. Identities = 42/165 (25%), Positives = 59/165 (35%), Gaps = 11/165 (6%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----RLL 62 VI+L S R ARI + F+ A ++P Q QF R L Sbjct: 5 VINLDRSPDRLAHITAEFARIGIAFARIVATDARDHP-------ELVLQPQFSIYAVRRL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+ C SH W IA A + EDD FS + LL + +K + Sbjct: 58 SSSEVACMHSHRACWSIIAQDDAPYGAVFEDDVVFSAKAGALLADSGWISADADAVKLET 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 K + F + + R T GY + ++ A LL Sbjct: 118 FFSKTMIQRKRIAVGNGFSLFRLRRSHMGTGGYIVSRQMARDLLE 162 >gi|254229481|ref|ZP_04922896.1| Glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] gi|151938052|gb|EDN56895.1| Glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] Length = 249 Score = 39.3 bits (90), Expect = 0.61, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 10/72 (13%) Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-- 106 +SH+ RQ + + EIGC+ +H W+ I S AI+LED+ +F E S+LL Sbjct: 72 YSHKFRQ----KAIMAGEIGCFKTHSQAWELIVAS-GTPAIVLEDNIEFIGEASRLLADD 126 Query: 107 ---HLSKCDINN 115 ++ C + N Sbjct: 127 VLSYIQSCGLVN 138 >gi|262392590|ref|YP_003284444.1| LPS biosynthesis glycosyltransferase [Vibrio sp. Ex25] gi|262336184|gb|ACY49979.1| glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] Length = 253 Score = 39.3 bits (90), Expect = 0.62, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 10/72 (13%) Query: 49 FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-- 106 +SH+ RQ + + EIGC+ +H W+ I S AI+LED+ +F E S+LL Sbjct: 76 YSHKFRQ----KAIMAGEIGCFKTHSQAWELIVAS-GTPAIVLEDNIEFIGEASRLLADD 130 Query: 107 ---HLSKCDINN 115 ++ C + N Sbjct: 131 VLSYIQSCGLVN 142 >gi|261839355|gb|ACX99120.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 52] Length = 273 Score = 39.3 bits (90), Expect = 0.63, Method: Compositional matrix adjust. Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 18/182 (9%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 + +SL E+GCY SH LW++ I + I ILEDD + F + L H+ + Sbjct: 94 KFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEGFKEGLDFLEKHIQELGYA 151 Query: 115 NIL-IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 ++ + +DA K + I + T GY I A + Sbjct: 152 RLMHLLYDANVKSEPLSHKNHEIQERVGIIKAYSEGMGTQGYVITPNIAKVFKKCSRKWV 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPLYF 225 P+D M + H + +LV +P + A D STI L+RK F L + Sbjct: 212 VPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYLKY 269 Query: 226 YR 227 ++ Sbjct: 270 WQ 271 >gi|157823499|ref|NP_001099537.1| procollagen galactosyltransferase 1 [Rattus norvegicus] gi|149036101|gb|EDL90767.1| glycosyltransferase 25 domain containing 1 (predicted) [Rattus norvegicus] gi|169642770|gb|AAI60899.1| Glycosyltransferase 25 domain containing 1 [Rattus norvegicus] Length = 617 Score = 39.3 bits (90), Expect = 0.63, Method: Compositional matrix adjust. Identities = 30/122 (24%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ + F +A+ G+ N Q R Sbjct: 338 VFMINLKRRLDRRERMLRALHEQEIDFQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 397 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMQ-DVEREGLD 456 Query: 120 FD 121 +D Sbjct: 457 WD 458 >gi|304388480|ref|ZP_07370583.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis ATCC 13091] gi|21069187|gb|AAM33863.1|AF470659_3 LgtE [Neisseria meningitidis] gi|304337477|gb|EFM03643.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis ATCC 13091] Length = 280 Score = 39.3 bits (90), Expect = 0.63, Method: Compositional matrix adjust. Identities = 29/102 (28%), Positives = 41/102 (40%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRMMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH LWK+ + EDD + Q L Sbjct: 62 EKACFMSHAVLWKQALDESLPYIAVFEDDVLLGEGAEQFLAE 103 >gi|194376002|dbj|BAG57345.1| unnamed protein product [Homo sapiens] Length = 554 Score = 38.9 bits (89), Expect = 0.69, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 513 >gi|193787780|dbj|BAG52983.1| unnamed protein product [Homo sapiens] Length = 363 Score = 38.9 bits (89), Expect = 0.69, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 141 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 200 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 201 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 250 >gi|55960748|emb|CAI14723.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] Length = 363 Score = 38.9 bits (89), Expect = 0.69, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 141 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 200 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 201 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 250 >gi|222086962|ref|YP_002545496.1| glycosyltransferase protein [Agrobacterium radiobacter K84] gi|221724410|gb|ACM27566.1| glycosyltransferase protein [Agrobacterium radiobacter K84] Length = 250 Score = 38.9 bits (89), Expect = 0.69, Method: Compositional matrix adjust. Identities = 41/201 (20%), Positives = 76/201 (37%), Gaps = 18/201 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKR 60 P+PVY+I++ + R + ++ ++ + + G P + H++ Q + R Sbjct: 5 PLPVYLINIDRATDRLAEVQRQSDEFGIRIERVNGVDGALLPQEQWVDVDHERFQRRHGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK- 119 ++ E GCY SH+ ++ + AII+EDD F + N +IK Sbjct: 65 VILPGEYGCYRSHLLALRQFLAGDSQSAIIIEDDIALDGNFLARAVAAQEAVPNAEVIKL 124 Query: 120 -------FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 F A + PK D L G T Y + + A ++ + Sbjct: 125 VNHRWGGFYATARSPKGDVIGRCLFGP---------QGSTACYLVTRRGAEKIVKSLAVM 175 Query: 173 YRPIDMDMKHWWEHNIPSLVT 193 P D+ ++ W+ T Sbjct: 176 SLPWDVAIERGWDMRTSIFTT 196 >gi|325201435|gb|ADY96889.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis M01-240149] Length = 275 Score = 38.9 bits (89), Expect = 0.73, Method: Compositional matrix adjust. Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 2/89 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDD 94 E C++SH LWK+ + EDD Sbjct: 62 EKACFMSHAVLWKQALDEGLPYIAVFEDD 90 >gi|322786020|gb|EFZ12636.1| hypothetical protein SINV_80063 [Solenopsis invicta] Length = 186 Score = 38.9 bits (89), Expect = 0.75, Method: Compositional matrix adjust. Identities = 18/68 (26%), Positives = 39/68 (57%), Gaps = 1/68 (1%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 +++ EIGC++SH +W+++ ++LEDD F F Q + ++ ++ ++ IK+D Sbjct: 1 MTMGEIGCFLSHYVVWQKVLKHGYKSVMVLEDDVRFEPFFRQKVNYV-LAELTDLGIKWD 59 Query: 122 ALRKKPKK 129 + K+ Sbjct: 60 LVYMGRKR 67 >gi|319779934|ref|YP_004139410.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317165822|gb|ADV09360.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 274 Score = 38.9 bits (89), Expect = 0.75, Method: Compositional matrix adjust. Identities = 41/171 (23%), Positives = 67/171 (39%), Gaps = 2/171 (1%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VI+L S R + ARI + F AI + P +++ Q + + L+ E Sbjct: 5 VINLDRSPDRLARVTAEFARIGVAFERVQAIDARDRPELDQL--PQNVGYRNRLPLTDGE 62 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 I C +SH W IA A I EDD F+ + LL + ++K ++ K Sbjct: 63 IACLLSHRACWAVIAQGDARYGAIFEDDIVFAAKAGALLRDSDWIPADAEVVKLESYFCK 122 Query: 127 PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + F + T GY I ++ A+ L+ + I +D Sbjct: 123 TVIRRKRIAIGHGFSTSKLDAEHIGTAGYIISRQTALDLIKATEEIGIAVD 173 >gi|91777876|ref|YP_553084.1| hypothetical protein Bxe_B2251a [Burkholderia xenovorans LB400] gi|91690536|gb|ABE33734.1| Hypothetical protein Bxe_B2251a [Burkholderia xenovorans LB400] Length = 241 Score = 38.9 bits (89), Expect = 0.75, Method: Compositional matrix adjust. Identities = 42/194 (21%), Positives = 83/194 (42%), Gaps = 21/194 (10%) Query: 35 DAIYGEN--NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILE 92 DA+ G + + ++ + ++ R ++ E+ C++SH +W++I + A++LE Sbjct: 3 DAVDGRSLAQSELDAVYDDAAARRRYGRSMTRAEVACFMSHRSVWRKIVET-GRAAVVLE 61 Query: 93 DDADFSDEFSQLLPHLSKCDINN----ILIKFDALRKKPKKDSYL------CTLPGNFDI 142 DDA F + + ++ +++ +L+ LR+ +Y G I Sbjct: 62 DDAMLEPAFFERVLRANESELSAVADIVLLGRSKLRRTASAWTYFNEPLRHVIGVGGLRI 121 Query: 143 HQP-RILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW-W---EHNIPSLVTEPGA 197 P + + + GY+I AA L + P+ + W W E + P A Sbjct: 122 GVPFKQWTSGSVGYWISAHAACAALAYSEG---PLGALLDDWPWHRDEGGARVVELRPYA 178 Query: 198 VYEAIDTNDSTIEE 211 V+E D S+IE+ Sbjct: 179 VWEDFDRLPSSIEQ 192 >gi|83944105|ref|ZP_00956561.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] gi|83844972|gb|EAP82853.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] Length = 252 Score = 38.9 bits (89), Expect = 0.76, Method: Compositional matrix adjust. Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 6/77 (7%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIG--AIILEDDADFSDEFSQLLPHLSK-CDIN 114 ++R L E C++SH+ W+R+A A+G A+ILEDD S LL D++ Sbjct: 38 WQRPLKETEKACFLSHVAAWERVA---ALGEPALILEDDVVLSRATQVLLKSCRGLVDVD 94 Query: 115 NILIKFDALRKKPKKDS 131 ++ ++ RK KD+ Sbjct: 95 HLTLEVRQRRKIVAKDA 111 >gi|317009368|gb|ADU79948.1| family 25 glycosyl transferase [Helicobacter pylori India7] Length = 337 Score = 38.9 bits (89), Expect = 0.79, Method: Compositional matrix adjust. Identities = 19/44 (43%), Positives = 24/44 (54%), Gaps = 1/44 (2%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L+LPE+GCY+SH LWK + +ILEDD Q L Sbjct: 75 LTLPELGCYLSHYFLWKECVKTNQ-PVVILEDDVALEFNLMQAL 117 >gi|15991376|gb|AAL12842.1|AF355193_6 LgtE [Neisseria meningitidis] gi|325127442|gb|EGC50372.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis N1568] gi|325129463|gb|EGC52293.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis OX99.30304] gi|325201436|gb|ADY96890.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M01-240149] gi|325207416|gb|ADZ02868.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis NZ-05/33] Length = 280 Score = 38.9 bits (89), Expect = 0.79, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 41/102 (40%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRVMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH LWK+ + EDD + + L Sbjct: 62 EKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAE 103 >gi|317177342|dbj|BAJ55131.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F16] Length = 273 Score = 38.5 bits (88), Expect = 0.85, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 74/184 (40%), Gaps = 18/184 (9%) Query: 58 FKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----C 111 + + +SL E+GCY SH LW++ I + I ILEDD ++F + L L K Sbjct: 92 YAKFMSLGELGCYASHYSLWEKCIELNEPI--CILEDDITLKEDFKEGLDFLEKHIQELG 149 Query: 112 DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 + + + +DA K + I + T GY I + A + Sbjct: 150 YVRLMYLLYDANVKSEPLSHKNHEIQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRK 209 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--------LVRKPTFSPL 223 P+D M + H + +LV + + A D STI L+RK F L Sbjct: 210 WVVPVDTIMDATFIHGVKNLVLQLFVI--ADDEQISTIARKEEPYSPKIALMRKLHFKYL 267 Query: 224 YFYR 227 +++ Sbjct: 268 KYWQ 271 >gi|3043692|dbj|BAA25510.1| KIAA0584 protein [Homo sapiens] Length = 738 Score = 38.5 bits (88), Expect = 0.85, Method: Compositional matrix adjust. Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 13/116 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F +L+ ++ + ++ Sbjct: 516 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 575 Query: 116 ILIKFDALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 LI R K+P+K +P ++ + S T GY I E A L+ Sbjct: 576 ELIYIGRKRMQVKEPEK-----AVPNVANLVEAD-YSYWTLGYVISLEGAQKLVGA 625 >gi|322489523|emb|CBZ24781.1| glycosyltransferase family-like protein [Leishmania mexicana MHOM/GT/2001/U1103] Length = 275 Score = 38.5 bits (88), Expect = 0.90, Method: Compositional matrix adjust. Identities = 17/42 (40%), Positives = 26/42 (61%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 L+ IGC +SH +W+R+ A+ILEDD +F +FS+ Sbjct: 92 LTKGAIGCALSHRAVWQRVVAEHRACALILEDDLEFHHQFSR 133 >gi|289669872|ref|ZP_06490947.1| glycosyltransferase [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 269 Score = 38.5 bits (88), Expect = 0.91, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL N +++ Sbjct: 67 RTWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDALPTLLRSTDWLPGGNSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K D+ + + G Q R++ P T Sbjct: 127 LEPSYNRIKLDAKVADVAGR----QLRLVRPST 155 >gi|109947310|ref|YP_664538.1| pbeta-1,4-galactosyltransferase [Helicobacter acinonychis str. Sheeba] gi|109714531|emb|CAJ99539.1| pbeta-1,4-galactosyltransferase [Helicobacter acinonychis str. Sheeba] Length = 275 Score = 38.5 bits (88), Expect = 0.92, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 76/188 (40%), Gaps = 14/188 (7%) Query: 60 RLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDI 113 + ++ E+GCY SH LW++ I + I ILEDD + F + L L K + Sbjct: 94 KFMNFGELGCYASHYSLWEKCIDLNEPI--CILEDDITLKENFKEGLDFLEKHIQELGYV 151 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + + +D K D + + I + T GY I + A L + Sbjct: 152 RLMHLLYDPNIKSEPLDHNNPEIQEHIGIIKAYSEGVGTQGYVITPKIAKVFLKYSRRWI 211 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQW 233 P+D M + H + +LV + + E + STIE R + +P + R ++ Sbjct: 212 VPVDTIMDATFIHGVKNLVLQSFVIAE--EEQISTIER-RGIEEPYSPKITLMRELHFK- 267 Query: 234 NLHYNAWR 241 H WR Sbjct: 268 --HLEWWR 273 >gi|303285286|ref|XP_003061933.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] gi|226456344|gb|EEH53645.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] Length = 354 Score = 38.5 bits (88), Expect = 0.94, Method: Compositional matrix adjust. Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 14/104 (13%) Query: 2 PIPVYVISL---PFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF 58 P+ ++V+S+ P AR + R++ + F A+ P R +R+ Sbjct: 36 PVKIFVVSVLDDPVGQARLARL--RSSLSSVAFEVVPAVDVRKTP--ERELRDIRRRVDG 91 Query: 59 KRL-------LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA 95 +R +SL E+GC +SH+ ++ + S AI LEDDA Sbjct: 92 RRFEEINRRRVSLAEVGCSLSHLRAYETMLESNHDAAIFLEDDA 135 >gi|329122713|ref|ZP_08251291.1| lipooligosaccharide biosynthesis protein [Haemophilus aegyptius ATCC 11116] gi|327472587|gb|EGF18017.1| lipooligosaccharide biosynthesis protein [Haemophilus aegyptius ATCC 11116] Length = 161 Score = 38.5 bits (88), Expect = 0.95, Method: Compositional matrix adjust. Identities = 32/104 (30%), Positives = 41/104 (39%), Gaps = 9/104 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDA----IYGENNPICNRIFSHQKRQCQFKRL 61 YVISL RR+ + ++ F FFDA I E N F K Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITFDRSS-----KAT 85 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L EI C +SHI LW + + I EDD + +LL Sbjct: 86 LCDAEIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL 129 >gi|170736608|ref|YP_001777868.1| glycosyl transferase family protein [Burkholderia cenocepacia MC0-3] gi|169818796|gb|ACA93378.1| glycosyl transferase family 25 [Burkholderia cenocepacia MC0-3] Length = 256 Score = 38.5 bits (88), Expect = 0.98, Method: Compositional matrix adjust. Identities = 37/101 (36%), Positives = 41/101 (40%), Gaps = 14/101 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V VISL SHARR F RA HL F F DAI G R S L Sbjct: 20 VQVISLRRSHARRHAF--RANNPHLDFEFVDAIDG-------RALSDADLAASGLFAPGL 70 Query: 65 P----EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 P G ++ LW IA + I EDDA F +F Sbjct: 71 PYTRGAFGIAMTTHRLWTDIASDSEL-VTIAEDDAIFRPDF 110 >gi|326930289|ref|XP_003211280.1| PREDICTED: glycosyltransferase 25 family member 3-like [Meleagris gallopavo] Length = 541 Score = 38.5 bits (88), Expect = 1.00, Method: Compositional matrix adjust. Identities = 17/49 (34%), Positives = 25/49 (51%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R L+ E+GC++SH ++WK I ++ EDD F F L L Sbjct: 334 RTLTKGEVGCFLSHYNIWKEIVSRGLQRVLVFEDDVRFEVSFPARLQRL 382 >gi|308184404|ref|YP_003928537.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] gi|308060324|gb|ADO02220.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] Length = 201 Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 46/123 (37%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 IPVY+ISL S R EKF R F FDAI ++ + Sbjct: 18 IPVYIISLKESQRRLDTEKLVLKSNEKFKGRYV-----FQIFDAISPKHQDFEKFVQELY 72 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK +ILEDDA F Sbjct: 73 DAQSMLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECV-KLNQPVVILEDDAMLESNFM 131 Query: 103 QLL 105 Q L Sbjct: 132 QAL 134 >gi|218461235|ref|ZP_03501326.1| putative glycosyltransferase protein [Rhizobium etli Kim 5] Length = 289 Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust. Identities = 47/203 (23%), Positives = 87/203 (42%), Gaps = 2/203 (0%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + I Y+I+L + RR + + I + F A+ G + R F Q + R Sbjct: 27 LRINTYLINLDRAELRRFRMERLLSEIGISFERVAAVDGVGISLPRRDFDGQSYLRRHGR 86 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + EIGCY+SH+ +R A+ILEDD DF D+F+++L + + +++ Sbjct: 87 TPNPFEIGCYLSHVECARRFLAGNGEFALILEDDLDFDDDFAEVLAAALEQHASWDILRL 146 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + K T + I R + Y I ++AA + +V + P D+ + Sbjct: 147 STVNSGRKHRVEPLTASRSLAIALTR--EKGSGAYLINRKAAGWIADVLVPMRLPYDLAL 204 Query: 181 KHWWEHNIPSLVTEPGAVYEAID 203 ++ + + +P V + D Sbjct: 205 DLEFDEGLRACFVDPVPVSQRAD 227 >gi|254249297|ref|ZP_04942617.1| Glycosyl transferase [Burkholderia cenocepacia PC184] gi|124875798|gb|EAY65788.1| Glycosyl transferase [Burkholderia cenocepacia PC184] Length = 327 Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust. Identities = 41/115 (35%), Positives = 44/115 (38%), Gaps = 18/115 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V VISL SHARR F RA HL F F DAI G R S L Sbjct: 91 VQVISLRRSHARRHAF--RANNPHLDFEFVDAIDG-------RALSDADLAASGLFAPGL 141 Query: 65 P----EIGCYISHIHLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSKC 111 P G ++ LW IA + I EDDA DF D Q L C Sbjct: 142 PYTRGAFGIAMTTHRLWTDIASDSEL-VTIAEDDAIFRPDFHDATLQFLRDNLGC 195 >gi|262371669|ref|ZP_06064948.1| glycosyl transferase, family 25 [Acinetobacter junii SH205] gi|262311694|gb|EEY92779.1| glycosyl transferase, family 25 [Acinetobacter junii SH205] Length = 252 Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust. Identities = 29/109 (26%), Positives = 44/109 (40%), Gaps = 1/109 (0%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLDGSDQRLASATAQLNQAGWDFSRFSAYDGRGKALSEFENYDDLAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R L E+GCY+SH ++ + A ++LEDD + F Q L L Sbjct: 61 RSLINSELGCYLSHYGCAEKFLETDADYLVVLEDDIEVLPNFKQKLESL 109 >gi|71650980|ref|XP_814177.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70879127|gb|EAN92326.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 275 Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust. Identities = 19/53 (35%), Positives = 28/53 (52%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 F L+ IGC + H +W+ I A++LEDD +F +F +LL L K Sbjct: 88 FGMDLTRGAIGCALGHRKIWEMIVAERRTRALVLEDDVEFHHKFGRLLGPLWK 140 >gi|71415375|ref|XP_809756.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70874187|gb|EAN87905.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] gi|322829519|gb|EFZ32876.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 275 Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust. Identities = 19/53 (35%), Positives = 28/53 (52%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 F L+ IGC + H +W+ I A++LEDD +F +F +LL L K Sbjct: 88 FGMDLTRGAIGCALGHRKIWEMIVAERRTRALVLEDDVEFHHKFGRLLGPLWK 140 >gi|242007885|ref|XP_002424748.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508251|gb|EEB12010.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 259 Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust. Identities = 18/65 (27%), Positives = 38/65 (58%), Gaps = 1/65 (1%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 KR ++ EIGC++SH +W+++ + ++LEDD F F + + L K ++ ++ + Sbjct: 75 KRAMTHGEIGCFLSHYFIWEKVLENKYDLIMVLEDDVQFEPYFREKITALLK-EVEDLKL 133 Query: 119 KFDAL 123 +D + Sbjct: 134 PWDLI 138 >gi|73986206|ref|XP_541950.2| PREDICTED: similar to glycosyltransferase 25 domain containing 1 [Canis familiaris] Length = 623 Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust. Identities = 29/122 (23%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ ++ +A+ G+ N Q R Sbjct: 344 VFMINLKRRQDRRERMLRALQEQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMR-DVEREGLD 462 Query: 120 FD 121 +D Sbjct: 463 WD 464 >gi|308390022|gb|ADO32342.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria meningitidis alpha710] Length = 280 Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 41/102 (40%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR R + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIADTFGRHDIPFQFFDALMPSEE--LNRMMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH LWK+ + EDD + + L Sbjct: 62 EKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAE 103 >gi|289664998|ref|ZP_06486579.1| glycosyltransferase [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 269 Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL N +++ Sbjct: 67 RTWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDALPTLLRSTHWLPGGNSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K D+ + + G Q R++ P T Sbjct: 127 LEPSYNRIKLDAKVADVAGR----QLRLVRPST 155 >gi|115920130|ref|XP_783019.2| PREDICTED: similar to Glycosyltransferase 25 domain containing 2 [Strongylocentrotus purpuratus] gi|115954234|ref|XP_001196894.1| PREDICTED: similar to Glycosyltransferase 25 domain containing 2 [Strongylocentrotus purpuratus] Length = 624 Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust. Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 + R+L+ EIGC++SH ++WK + +I EDD F F Sbjct: 374 WGRVLTKGEIGCFLSHYNIWKEVVERNLSRILIFEDDIRFGARF 417 >gi|163803283|ref|ZP_02197162.1| 3-deoxy-D-manno-octulosonic-acid kinase [Vibrio sp. AND4] gi|159172920|gb|EDP57758.1| 3-deoxy-D-manno-octulosonic-acid kinase [Vibrio sp. AND4] Length = 246 Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust. Identities = 42/188 (22%), Positives = 85/188 (45%), Gaps = 15/188 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS--HQKRQCQFKR 60 + ++VI+L S RRE + I F FF+ +NP + I+S + ++K Sbjct: 1 MKIFVINLKRSVDRREHVSNLLRGI--DFEFFEGEDIADNPE-HEIYSLYDASKTLKYKG 57 Query: 61 -LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 L+ PE+GC+ S I+LW+ +I ED+ + L L+ +IN+++ + Sbjct: 58 YTLTTPELGCFASQINLWRHCVKINE-PVLIFEDNIKL---YGDLKAQLN--NINSLVSQ 111 Query: 120 FDALRKK---PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + ++ +K + + L + + + + Y I E A L++ + + P+ Sbjct: 112 YGIVKLGNYFERKYTKVAVLDDTYSLVSCAESACGNSAYAITPEVANKYLSILQGFFEPV 171 Query: 177 DMDMKHWW 184 D + + W Sbjct: 172 DDFIDNEW 179 >gi|240949472|ref|ZP_04753812.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor NM305] gi|240296045|gb|EER46706.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor NM305] Length = 210 Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust. Identities = 46/179 (25%), Positives = 72/179 (40%), Gaps = 13/179 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR + ++ F FFDAI + + FS L+ Sbjct: 10 YVISLLTAKERRNHIIQEFGKQNIPFEFFDAITPDLIEEKAKEFSIDISNSP----LTKG 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALR 124 E+GC +SHI LW + I EDD + + L + DI+ I I+ Sbjct: 66 EMGCALSHIALWHFAKENKLDYICIFEDDIYLGENAKEFLTNSYVNHDIDVIKIE----- 120 Query: 125 KKPKKDSYLCTLPGNF---DIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 K K Y +F ++H+ + T GY + + +LL K + I +D+ Sbjct: 121 KHSSKIIYAANPEAHFCNRNLHKLKSKHTGTAGYILTAKGIKYLLEKTKVYHLSIPVDV 179 >gi|283458618|ref|YP_003363253.1| LPS biosynthesis glycosyltransferase [Rothia mucilaginosa DY-18] gi|283134668|dbj|BAI65433.1| glycosyltransferase involved in LPS biosynthesis [Rothia mucilaginosa DY-18] Length = 333 Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust. Identities = 38/176 (21%), Positives = 76/176 (43%), Gaps = 16/176 (9%) Query: 31 FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 F + A+ ++ + + F ++ ++ + E GC +SH H+WK S A A+ Sbjct: 90 FQRYSAVDNRDDHVTDAQFDSKQFLARYGKEPRPAEKGCALSHYHMWKDFLASDADWALF 149 Query: 91 LEDDADFS-------DEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 EDD S D + PH+ ++ +I +A + P+ D +L F Sbjct: 150 AEDDVLISPDLQPVVDRIIEKYPHVQMVNLGDIYAS-EAGKLTPQVDYPRLSLLSPFVYG 208 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVY 199 + R+ + + G AA++LL+ R R ++ + +P +V + ++Y Sbjct: 209 KYRMGNAYGSKPLYG--AALYLLS-RSGAERLVEC-----YGETVPGVVADDYSLY 256 >gi|291567094|dbj|BAI89366.1| putative glycosyl transferase [Arthrospira platensis NIES-39] Length = 283 Score = 37.7 bits (86), Expect = 1.5, Method: Compositional matrix adjust. Identities = 35/133 (26%), Positives = 54/133 (40%), Gaps = 17/133 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQCQFKRLLS 63 +YVI+LP RR + I L F I+ P FS S Sbjct: 16 IYVINLPERVDRRREMEKEIKSIGLNFDSEKVNIFPAIRPTEKLAFS------------S 63 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNILIK 119 + +GCY+SH+ + K +++EDD S F +QLL LS+ + + + + Sbjct: 64 IGMLGCYLSHLEIIKIAKRDQLSNILVMEDDLAISSRFCSVQTQLLDELSQVNWDLLFLG 123 Query: 120 FDALRKKPKKDSY 132 + A K D Y Sbjct: 124 YFAYHKLKLSDYY 136 >gi|157866262|ref|XP_001681837.1| glycosyl transferase [Leishmania major strain Friedlin] gi|68125136|emb|CAJ02854.1| glycosyltransferase family-like protein [Leishmania major strain Friedlin] Length = 275 Score = 37.7 bits (86), Expect = 1.5, Method: Compositional matrix adjust. Identities = 17/42 (40%), Positives = 26/42 (61%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 L+ IGC +SH +W+R+ A+ILEDD +F +FS+ Sbjct: 92 LTKGAIGCALSHRAVWQRVVAEHRECALILEDDLEFHHQFSR 133 >gi|78045758|ref|YP_361933.1| family 25 glycosyltransferase [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78034188|emb|CAJ21833.1| putative family 25 glycosyltransferase [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 318 Score = 37.7 bits (86), Expect = 1.5, Method: Compositional matrix adjust. Identities = 37/160 (23%), Positives = 61/160 (38%), Gaps = 16/160 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK--- 59 I Y+I++ S R + R + L F AI G + + +H R+ + Sbjct: 51 IKSYLINMQRSGDRLQAMSARFDALGLPFERIAAIDGAT--LTDEQIAHFVRERPLEGSG 108 Query: 60 -------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 R + IGC++SH W+ A S I EDD SD LL Sbjct: 109 DAFSAGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLGSTDWLP 168 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 +++ + + K D+ + + G Q R++ P T Sbjct: 169 GGRSIVRLEPSYNRIKLDAKVADVAGR----QLRLVRPST 204 >gi|294669020|ref|ZP_06734106.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291309012|gb|EFE50255.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 275 Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust. Identities = 28/101 (27%), Positives = 40/101 (39%), Gaps = 2/101 (1%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VISL + ARR+ + F FFDA+ NR + LLS E Sbjct: 5 VISLSSARARRKHIADTFGARGIPFRFFDALMPSEE--LNRAMEELVPGLAKQHLLSEVE 62 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 C++SH+ LWK+ + EDD + + L Sbjct: 63 KACFMSHVVLWKQALDEGLPYVAVFEDDVLLGENAEKFLAE 103 >gi|303291206|ref|XP_003064889.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] gi|226453560|gb|EEH50869.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] Length = 293 Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust. Identities = 36/130 (27%), Positives = 59/130 (45%), Gaps = 28/130 (21%) Query: 46 NRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 +R S+++++ +RL GC +SHI +W ++ S + I+LEDDA EF + Sbjct: 133 SRRLSNREQRVLHERL----RFGCSLSHIRIWMKLLSSTSDFFIVLEDDAIVVKEFER-- 186 Query: 106 PHLSKCDINNILIKFDALRKKPKK-----DSYLCT-LPGNF---DIHQPRILSPRTTGYF 156 +AL+ P Y+C+ LPG + +I Q R S T GY Sbjct: 187 ------------KTHEALKSLPDDWELFYIGYVCSALPGGYLSRNIRQLRGGSC-TKGYA 233 Query: 157 IGKEAAIHLL 166 + + A L+ Sbjct: 234 VSRRGAERLV 243 >gi|170719162|ref|YP_001784307.1| glycosyl transferase [Haemophilus somnus 2336] gi|168827291|gb|ACA32662.1| glycosyl transferase family 25 [Haemophilus somnus 2336] Length = 277 Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust. Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 19/185 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKRLLS 63 +VISL S RR+ ++ +SFF+A EN I K L+ Sbjct: 4 HVISLKSSVERRQHIEKYFGEQNIPYSFFNAFQPNEENENIIQYYVPSLN-----KTDLT 58 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINNILIK 119 + E GC++SH+ LW++ I EDD S Q L + D+N I Sbjct: 59 IGEKGCFMSHVLLWQKCVDENLPFIAIFEDDIILSPVAHQFLGQYDWLKERFDVNQYFIL 118 Query: 120 FDALRKKPKKDSYLCTLPGNFDIH-QPRILSPR--TTGYFIGKEAAIHLLNVRKN---IY 173 L +Y T F P+++S + T GY I + AA LL++ K+ Sbjct: 119 --RLETYLMNSAYRKTKIKFFQQRCFPQLMSQQWGTAGYIISQIAAKKLLDLFKSPDFSV 176 Query: 174 RPIDM 178 PID+ Sbjct: 177 EPIDV 181 >gi|283956549|ref|ZP_06374029.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. jejuni 1336] gi|283792269|gb|EFC31058.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. jejuni 1336] Length = 183 Score = 37.7 bits (86), Expect = 1.7, Method: Compositional matrix adjust. Identities = 36/140 (25%), Positives = 59/140 (42%), Gaps = 29/140 (20%) Query: 70 YISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPK 128 ++SH LW+ + I IILEDD +FSDEF +NN D L K Sbjct: 3 FVSHYKLWQECVKLDEPI--IILEDDVEFSDEF-----------LNNGEEYIDGLLKSEY 49 Query: 129 KDSYLCTL-------------PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + LC L G + + + T GY + AA +++ + IY Sbjct: 50 EYIRLCYLFDKRLYFLSESYGGGGYYLSFKNL--SGTQGYVLKPSAAKKIISKAQFIYMS 107 Query: 176 IDMDMKHWWEHNIPSLVTEP 195 +D M ++H + +++ +P Sbjct: 108 VDDYMDKIYKHKVLNIIKKP 127 >gi|269213729|ref|ZP_05982739.2| lacto-N-neotetraose biosynthesis glycosyl tranferase LgtE [Neisseria cinerea ATCC 14685] gi|269145630|gb|EEZ72048.1| lacto-N-neotetraose biosynthesis glycosyl tranferase LgtE [Neisseria cinerea ATCC 14685] Length = 295 Score = 37.7 bits (86), Expect = 1.8, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 65/182 (35%), Gaps = 37/182 (20%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIY---------GENNPICNRIFSHQKRQC 56 YVISL + RR R + F FFDA+ GE P R Sbjct: 19 YVISLASAKERRAHIADTFGRHDIPFQFFDALMPSEELEQAMGELVPGLVR--------- 69 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCD 112 LLS E C++SH+ LWK+ + EDD + + L + D Sbjct: 70 --HNLLSSVEKACFMSHVVLWKQALDEGVSYIAVFEDDVLLGEGAEKFLAKDGWLKERFD 127 Query: 113 INNILI-----KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR--TTGYFIGKEAAIHL 165 ++ I F + +P C +P H P + S T GY I +EA Sbjct: 128 PDSAFIVRLETMFIKVIARP------CGVPDYEGRHFPLLESEHWGTAGYIISREAMRFF 181 Query: 166 LN 167 L Sbjct: 182 LE 183 >gi|294627346|ref|ZP_06705931.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|294667590|ref|ZP_06732804.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292598301|gb|EFF42453.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292602585|gb|EFF46022.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 269 Score = 37.4 bits (85), Expect = 1.9, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL N +++ Sbjct: 67 RKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLRSTDWLPGGNSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K D+ + + G Q R++ P T Sbjct: 127 LEPSYNRIKLDAKVADVAG----RQLRLVRPST 155 >gi|209525525|ref|ZP_03274064.1| glycosyl transferase family 25 [Arthrospira maxima CS-328] gi|209494024|gb|EDZ94340.1| glycosyl transferase family 25 [Arthrospira maxima CS-328] Length = 287 Score = 37.4 bits (85), Expect = 1.9, Method: Compositional matrix adjust. Identities = 37/135 (27%), Positives = 56/135 (41%), Gaps = 21/135 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YVI+LP RR + I L F+ E I I +K L+ Sbjct: 20 IYVINLPERVDRRREMEKEIKSIGLNFN------SEKVKIFPAIKPTEK--------LAF 65 Query: 65 PEIG---CYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNIL 117 P IG CY+SH+ + K +++EDD S F +QLL LS+ + + + Sbjct: 66 PSIGVLGCYLSHLEIIKIAKTDKLSHILVMEDDLAISSRFCSVQTQLLDELSQVNWDLLF 125 Query: 118 IKFDALRKKPKKDSY 132 + + A K D Y Sbjct: 126 LGYLAYNKLKLSDYY 140 >gi|322514724|ref|ZP_08067750.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Actinobacillus ureae ATCC 25976] gi|322119319|gb|EFX91435.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Actinobacillus ureae ATCC 25976] Length = 276 Score = 37.4 bits (85), Expect = 1.9, Method: Compositional matrix adjust. Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 11/94 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL---- 61 +VISL ++ RR + ++ ++ F FFDA+ P + Q L Sbjct: 8 FVISLVNANVRRSHINNEFSKRNIPFQFFDAL----QP---SVLLEQYIATTLPGLKVAN 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA 95 LS E C++SH+ LW+++A S I EDD Sbjct: 61 LSEGEKACFMSHMALWEKLANSKNNYMAIFEDDV 94 >gi|284007091|emb|CBA72366.1| conserved hypothetical protein [Arsenophonus nasoniae] Length = 270 Score = 37.4 bits (85), Expect = 2.0, Method: Compositional matrix adjust. Identities = 29/110 (26%), Positives = 50/110 (45%), Gaps = 5/110 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P+++ISL RR + + + L F+ DAI G N I + + + + + Sbjct: 10 PIFIISLKKDTDRRTQISYALSSQALPFTIVDAIEGSKLTQNDIAGLKYPYYDKA--YGK 67 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 EI C +SH +++ I + A+ILEDD + S L+ L + Sbjct: 68 PNGKNEIACSLSHQSVYQHIIKNDIEWALILEDDVIIDKKLSPLIKALEQ 117 >gi|157367406|gb|ABV45559.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 435 Score = 37.4 bits (85), Expect = 2.0, Method: Compositional matrix adjust. Identities = 38/123 (30%), Positives = 46/123 (37%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKLLQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 R K + L E GCY+SH LWK + +ILEDDA F Sbjct: 57 DRSSLLKSDWFHSDYCYKELLPQEFGCYLSHYFLWKECVKTNQ-PIVILEDDAMLESHFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|323457338|gb|EGB13204.1| hypothetical protein AURANDRAFT_60431 [Aureococcus anophagefferens] Length = 2376 Score = 37.4 bits (85), Expect = 2.0, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 57/141 (40%), Gaps = 35/141 (24%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF---SDEFSQLLPHLSKCDINNILIK 119 ++ ++ C SH LW+R + P ++LEDDA F +D F + + Sbjct: 432 TVGQVACLCSHASLWRRFSACPKPHFVVLEDDARFPAGADAFRRAVD------------- 478 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRI----------LSP--RT---TGYFIGKEAAIH 164 D +R +P YL P ++ PR+ +P RT Y + K A Sbjct: 479 -DVVRGEPFDVCYLYVYPDHWP-SPPRVADDPNRFRVSTTPGFRTWCLLAYVVSKRGAEK 536 Query: 165 L--LNVRKNIYRPIDMDMKHW 183 L L + +Y PID + W Sbjct: 537 LRRLIETEEVYAPIDCMVADW 557 >gi|74024994|ref|XP_829063.1| glycosyl transferase [Trypanosoma brucei TREU927] gi|70834449|gb|EAN79951.1| glycosyl transferase-like protein [Trypanosoma brucei] Length = 307 Score = 37.4 bits (85), Expect = 2.0, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 54/112 (48%), Gaps = 15/112 (13%) Query: 5 VYVISLPFSHARRE---KFCHRAARIHLQFSFFDAIYGEN--------NPICNRIFSHQK 53 VYVI+L R E + RA +H ++ F A+ G N + +R+ + Sbjct: 47 VYVINLDRRPDRWEFATQQLSRAGFLHEEYVRFPAVDGRNVDLQKAHACGLISRLGLLRL 106 Query: 54 RQCQFKRLLSL----PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 ++ + +R+ + +GC +SH LW +IA S +++EDD+ F +F Sbjct: 107 QEPEHRRIWGMDLNPGAVGCALSHALLWAQIAASRHRSVLVVEDDSLFPQDF 158 >gi|68357136|ref|XP_694217.1| PREDICTED: procollagen galactosyltransferase 1 [Danio rerio] Length = 609 Score = 37.4 bits (85), Expect = 2.1, Method: Compositional matrix adjust. Identities = 18/46 (39%), Positives = 27/46 (58%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 R L+ E+GC++SH ++WK I ++ILEDD F F + L Sbjct: 390 RPLTKGELGCFLSHYNIWKEIVDRGLKTSLILEDDLRFEIFFKRRL 435 >gi|169234704|ref|NP_001108473.1| chromosome associated protein D3 [Bombyx mori] gi|18700451|dbj|BAB85193.1| hypothetical protein [Bombyx mori] gi|22474509|dbj|BAC10614.1| hypothetical protein [Bombyx mori] Length = 407 Score = 37.4 bits (85), Expect = 2.1, Method: Compositional matrix adjust. Identities = 43/175 (24%), Positives = 71/175 (40%), Gaps = 15/175 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFS----HQKRQCQFK 59 +Y+I+L RR + + F AI G N + + R +S K Sbjct: 211 IYMINLERRQERRFLMELSFKELGMSVQHFSAIDGRNLDMNDLREYSITLMPNYEDPYHK 270 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD----FSDEFSQLLPHLSKCDINN 115 R + E+GC++SH ++W+ I + A++LEDD F F QL+ +++ Sbjct: 271 RPMKAGEVGCFLSHYYIWQDIVKNHHRVALLLEDDIHFVPYFRHRFIQLMDEITQSLSAL 330 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR----TTGYFIGKEAAIHLL 166 + I ++ + L L G + P P T GY I + A LL Sbjct: 331 LYIIYNMFYFRYIGRKIL--LDGEEEYATPHTTRPLYSYWTLGYLISERGAKKLL 383 >gi|261335009|emb|CBH18003.1| glycosyl transferase-like protein, putative [Trypanosoma brucei gambiense DAL972] Length = 307 Score = 37.4 bits (85), Expect = 2.1, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 54/112 (48%), Gaps = 15/112 (13%) Query: 5 VYVISLPFSHARRE---KFCHRAARIHLQFSFFDAIYGEN--------NPICNRIFSHQK 53 VYVI+L R E + RA +H ++ F A+ G N + +R+ + Sbjct: 47 VYVINLDRRPDRWEFATQQLSRAGFLHEEYVRFPAVDGRNVDLQKAHACGLISRLGLLRL 106 Query: 54 RQCQFKRLLSL----PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 ++ + +R+ + +GC +SH LW +IA S +++EDD+ F +F Sbjct: 107 QEPEHRRIWGMDLNPGAVGCALSHALLWAQIAASRHRSVLVVEDDSLFPQDF 158 >gi|1944154|dbj|BAA19628.1| unnamed protein product [Aggregatibacter actinomycetemcomitans] Length = 221 Score = 37.4 bits (85), Expect = 2.4, Method: Compositional matrix adjust. Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 6/81 (7%) Query: 44 ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSD 99 + + +F+ K + + R ++ EIGC +SH+ ++++I + A++ EDDA F+ Sbjct: 7 VLSAVFNPVKFEQHYGRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNK 66 Query: 100 EFSQLLPHL--SKCDINNILI 118 + L L C + ILI Sbjct: 67 NLPENLTALLAQHCTADIILI 87 >gi|311249255|ref|XP_003123541.1| PREDICTED: procollagen galactosyltransferase 1-like [Sus scrofa] Length = 623 Score = 37.0 bits (84), Expect = 2.5, Method: Compositional matrix adjust. Identities = 39/182 (21%), Positives = 69/182 (37%), Gaps = 28/182 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RR++ ++ +A+ G+ N Q R Sbjct: 344 VFMINLKRRQDRRDRMLRALQEQEIESRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 462 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPR------------TTGYFIGKEAAIHLLN 167 +D + K+ + P PR T Y I + A LL Sbjct: 463 WDLIYVGRKR----------MQVEHPEKAVPRVRNLVEADYSYWTLAYVISLQGARKLLA 512 Query: 168 VR 169 R Sbjct: 513 AR 514 >gi|149944687|ref|NP_001092425.1| procollagen galactosyltransferase 1 precursor [Bos taurus] gi|160395520|sp|A5PK45|GT251_BOVIN RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|148744100|gb|AAI42351.1| GLT25D1 protein [Bos taurus] gi|296486064|gb|DAA28177.1| glycosyltransferase 25 domain containing 1 precursor [Bos taurus] Length = 623 Score = 37.0 bits (84), Expect = 2.6, Method: Compositional matrix adjust. Identities = 29/122 (23%), Positives = 53/122 (43%), Gaps = 6/122 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ + +A+ G+ N Q R Sbjct: 344 VFMINLKRRQDRRERMLRALEEQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMQ-DVEREGLD 462 Query: 120 FD 121 +D Sbjct: 463 WD 464 >gi|160333551|ref|NP_001103992.1| procollagen galactosyltransferase 1 precursor [Danio rerio] gi|160395521|sp|A5PMF6|GT251_DANRE RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|148725420|emb|CAN87888.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 604 Score = 37.0 bits (84), Expect = 2.6, Method: Compositional matrix adjust. Identities = 38/165 (23%), Positives = 74/165 (44%), Gaps = 19/165 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++W I ++++EDD F F + L +L + ++ + + Sbjct: 385 RPLTKGELGCFLSHYNIWNEIVDRGLQSSLVIEDDLRFEVFFKRRLQNLMQ-EVQSQQLD 443 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN---VRKNI- 172 +D + RK+ + + ++P + + S T GY I A LL ++K + Sbjct: 444 WDLIYIGRKRMQVERPEKSVPRIHSLVEADY-SYWTLGYVISLRGAQKLLRAEPLKKMLP 502 Query: 173 ----------YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 PI+ M H+ + ++ + EP +Y T D Sbjct: 503 VDEFLPVMYNKHPIEEYMSHFPQRDLRAFSAEPLLIYPTHYTGDQ 547 >gi|325579209|ref|ZP_08149165.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus parainfluenzae ATCC 33392] gi|325159444|gb|EGC71578.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus parainfluenzae ATCC 33392] Length = 282 Score = 37.0 bits (84), Expect = 2.7, Method: Compositional matrix adjust. Identities = 43/173 (24%), Positives = 67/173 (38%), Gaps = 18/173 (10%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR---QCQFKRLLS 63 VIS+ + RR + + + F FFDA P +R+ H +R + L+ Sbjct: 10 VISISTADKRRNHIIDQFGQKKIPFEFFDAF----TP-SDRLDVHLQRYLPNVEATSKLT 64 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 E GC +SH LWK+ I EDD + + L + + + L Sbjct: 65 AGEKGCLMSHFMLWKKCVDDNLAYISIFEDDILLGENAEKFLANDEWLKVRFNFQEIFVL 124 Query: 124 R--------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 R + K+ L N DI + + T GY I AA +L+N+ Sbjct: 125 RLETFLMPVQLEKQQQILPFQERNIDILKSKHFG--TAGYIISNGAAKYLINL 175 >gi|261378165|ref|ZP_05982738.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria cinerea ATCC 14685] gi|269145629|gb|EEZ72047.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria cinerea ATCC 14685] Length = 279 Score = 37.0 bits (84), Expect = 2.9, Method: Compositional matrix adjust. Identities = 49/198 (24%), Positives = 72/198 (36%), Gaps = 41/198 (20%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIY---------GENNPICNRIFSHQKRQCQ 57 +ISL + RR R + F FFDA+ GE P R Sbjct: 5 IISLASAKERRAHIADTFGRHDIPFQFFDALMPSEELEQAMGELVPGLVR---------- 54 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDI 113 LLS E C++SH+ LWK+ + EDD + + L + D Sbjct: 55 -HNLLSSVEKACFMSHVVLWKQALDEGVSYIAVFEDDVLLGEGAEKFLAKDGWLKERFDP 113 Query: 114 NNILI-----KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR--TTGYFIGKEAA---- 162 ++ I F + +P C +P H P + S T GY I +EA Sbjct: 114 DSAFIVRLETMFIKVIARP------CGVPDYEGRHFPLLESEHWGTAGYIISREAMRFFL 167 Query: 163 IHLLNVRKNIYRPIDMDM 180 H ++ +PID+ M Sbjct: 168 QHFAMLQPERIKPIDLMM 185 >gi|157367399|gb|ABV45556.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 442 Score = 37.0 bits (84), Expect = 2.9, Method: Compositional matrix adjust. Identities = 38/123 (30%), Positives = 44/123 (35%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR----------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K L E GCY+SH LWK S +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKSNQ-PVVILEDDVTLESHFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|260582744|ref|ZP_05850531.1| LOW QUALITY PROTEIN: diadenosine tetraphosphatase [Haemophilus influenzae NT127] gi|260094194|gb|EEW78095.1| LOW QUALITY PROTEIN: diadenosine tetraphosphatase [Haemophilus influenzae NT127] Length = 269 Score = 37.0 bits (84), Expect = 2.9, Method: Compositional matrix adjust. Identities = 35/121 (28%), Positives = 49/121 (40%), Gaps = 10/121 (8%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 R+L+ E GC ISH LW + I EDD + L ++ D N+ Sbjct: 48 RILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFND 107 Query: 116 ILIKFDALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 I I +P K +P NFDI + T GY I + AA +++ KN Sbjct: 108 IFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKS--THRGTAGYIISQGAAKYVIEYLKN 165 Query: 172 I 172 I Sbjct: 166 I 166 >gi|292621863|ref|XP_002664798.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Danio rerio] Length = 535 Score = 37.0 bits (84), Expect = 3.0, Method: Compositional matrix adjust. Identities = 38/165 (23%), Positives = 74/165 (44%), Gaps = 19/165 (11%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++W I ++++EDD F F + L +L + ++ + + Sbjct: 316 RPLTKGELGCFLSHYNIWNEIVDRGLQSSLVIEDDLRFEVFFKRRLQNLMQ-EVQSQQLD 374 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN---VRKNI- 172 +D + RK+ + + ++P + + S T GY I A LL ++K + Sbjct: 375 WDLIYIGRKRMQVERPEKSVPRIHSLVEAD-YSYWTLGYVISLRGAQKLLRAEPLKKMLP 433 Query: 173 ----------YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 PI+ M H+ + ++ + EP +Y T D Sbjct: 434 VDEFLPVMYNKHPIEEYMSHFPQRDLRAFSAEPLLIYPTHYTGDQ 478 >gi|283458617|ref|YP_003363252.1| LPS biosynthesis glycosyltransferase [Rothia mucilaginosa DY-18] gi|283134667|dbj|BAI65432.1| glycosyltransferase involved in LPS biosynthesis [Rothia mucilaginosa DY-18] Length = 273 Score = 37.0 bits (84), Expect = 3.0, Method: Compositional matrix adjust. Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 31 FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 F + A+ +N + + F + + ++ R S E GC +SH+ +W+ S A A+I Sbjct: 30 FIRYSAVDNRDNHVADEEFDVEFFRSRWHRDPSPAEKGCMLSHVKMWRDFVASDADWALI 89 Query: 91 LEDDADFSDEFSQLLPHLSKCDINNILIKF 120 EDD LL ++ +N I+ K+ Sbjct: 90 AEDDI--------LLSPDAEAVVNAIITKY 111 >gi|315125453|ref|YP_004067456.1| glycosyl transferase family 25 [Pseudoalteromonas sp. SM9913] gi|315013966|gb|ADT67304.1| glycosyl transferase family 25 [Pseudoalteromonas sp. SM9913] Length = 172 Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust. Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 4/127 (3%) Query: 74 IHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYL 133 + LW+++ ++LEDD ++ K N LIK R P DS Sbjct: 1 MRLWQKMVNENMPFCVVLEDDLFIEASLKDVVDAAIKLK-NWDLIKLSDNRNFPFIDS-- 57 Query: 134 CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVT 193 L N + + T GY I A LL RK +RP+D+DM+ E + + Sbjct: 58 APLENNLTVGNYKKAPNGTQGYIISLSGAKKLLQ-RKPFFRPVDVDMQFHTEVGLSMIGI 116 Query: 194 EPGAVYE 200 +P + E Sbjct: 117 KPYPIAE 123 >gi|284053776|ref|ZP_06383986.1| glycosyl transferase family protein [Arthrospira platensis str. Paraca] Length = 284 Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust. Identities = 37/135 (27%), Positives = 56/135 (41%), Gaps = 21/135 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YVI+LP RR + I L F N +IF K + L+ Sbjct: 17 IYVINLPERVDRRREMEKEIKSIGLNF----------NSEKVKIFPALKPTEK----LAF 62 Query: 65 PEIG---CYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNIL 117 P IG CY+SH+ + K +++EDD S F +QLL LS+ + + + Sbjct: 63 PSIGVLGCYLSHLEIIKIAKRDKLSHILVMEDDLAISSRFCSVQTQLLDELSQVNWDLLF 122 Query: 118 IKFDALRKKPKKDSY 132 + + A K D Y Sbjct: 123 LGYFAYHKLKLSDYY 137 >gi|296233252|ref|XP_002761953.1| PREDICTED: procollagen galactosyltransferase 1-like [Callithrix jacchus] Length = 738 Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust. Identities = 26/111 (23%), Positives = 49/111 (44%), Gaps = 5/111 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V++I+L RRE+ ++ +A+ G+ N + Q Sbjct: 459 VFMINLKRRQDRRERMLRALQEQGIECRLVEAVDGKAMNTSQVEALGIQMLPGYQDPYHG 518 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 519 RPLTKGELGCFLSHYNIWKEVVDRGLEKSLVFEDDLRFEIFFKRRLMNLMR 569 >gi|301753877|ref|XP_002912839.1| PREDICTED: procollagen galactosyltransferase 1-like [Ailuropoda melanoleuca] Length = 542 Score = 36.6 bits (83), Expect = 3.5, Method: Compositional matrix adjust. Identities = 29/122 (23%), Positives = 53/122 (43%), Gaps = 6/122 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ + +A+ G+ N Q R Sbjct: 263 VFMINLKRRQDRRERMLRALQAQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 322 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 323 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMR-DVEREGLD 381 Query: 120 FD 121 +D Sbjct: 382 WD 383 >gi|268574800|ref|XP_002642379.1| Hypothetical protein CBG18383 [Caenorhabditis briggsae] Length = 497 Score = 36.6 bits (83), Expect = 3.5, Method: Compositional matrix adjust. Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 5/99 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-----NPICNRIFSHQKRQCQFK 59 +Y+I+L R ++ + +++S +A G+ + N + K Sbjct: 255 IYLINLKRRSERLDRMQKIFDLLGIEYSLLEATDGQKLDQLPEDLKNYHILDKYLDPITK 314 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 R + EIGC++SH +W+ + + I+ EDD FS Sbjct: 315 RPMKNGEIGCFLSHYRIWQDVVKNKYEKVIVFEDDLRFS 353 >gi|308497624|ref|XP_003110999.1| hypothetical protein CRE_04808 [Caenorhabditis remanei] gi|308242879|gb|EFO86831.1| hypothetical protein CRE_04808 [Caenorhabditis remanei] Length = 531 Score = 36.6 bits (83), Expect = 3.6, Method: Compositional matrix adjust. Identities = 15/40 (37%), Positives = 23/40 (57%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 KR + EIGC++SH +W+ + + I+ EDD FS Sbjct: 348 KRPMKKGEIGCFLSHYRIWQDVVKNKLKKVIVFEDDLRFS 387 >gi|149757348|ref|XP_001499949.1| PREDICTED: glycosyltransferase 25 domain containing 1 [Equus caballus] Length = 548 Score = 36.6 bits (83), Expect = 3.6, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 35/62 (56%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D++ + Sbjct: 329 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVDREGLD 387 Query: 120 FD 121 +D Sbjct: 388 WD 389 >gi|281343517|gb|EFB19101.1| hypothetical protein PANDA_000528 [Ailuropoda melanoleuca] Length = 535 Score = 36.6 bits (83), Expect = 3.7, Method: Compositional matrix adjust. Identities = 27/111 (24%), Positives = 48/111 (43%), Gaps = 5/111 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ + +A+ G+ N Q R Sbjct: 256 VFMINLKRRQDRRERMLRALQAQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 315 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 316 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMR 366 >gi|309357488|emb|CAP35849.2| hypothetical protein CBG_18383 [Caenorhabditis briggsae AF16] Length = 523 Score = 36.6 bits (83), Expect = 3.7, Method: Compositional matrix adjust. Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 5/99 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-----NPICNRIFSHQKRQCQFK 59 +Y+I+L R ++ + +++S +A G+ + N + K Sbjct: 281 IYLINLKRRSERLDRMQKIFDLLGIEYSLLEATDGQKLDQLPEDLKNYHILDKYLDPITK 340 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 R + EIGC++SH +W+ + + I+ EDD FS Sbjct: 341 RPMKNGEIGCFLSHYRIWQDVVKNKYEKVIVFEDDLRFS 379 >gi|297276457|ref|XP_001114885.2| PREDICTED: procollagen galactosyltransferase 1-like [Macaca mulatta] Length = 474 Score = 36.6 bits (83), Expect = 3.8, Method: Compositional matrix adjust. Identities = 27/111 (24%), Positives = 49/111 (44%), Gaps = 5/111 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQK----RQCQFK 59 V++I+L RRE+ ++ +A+ G+ N Q R Sbjct: 195 VFMINLTRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 254 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 255 RPLTKGELGCFLSHYNIWKEVCPEXLQKSLVFEDDLLFEIFFKRRLMNLMR 305 >gi|30060992|gb|AAP19870.1| glycosyl transferase [Haemophilus influenzae] gi|30060994|gb|AAP19871.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 36.6 bits (83), Expect = 3.9, Method: Compositional matrix adjust. Identities = 34/115 (29%), Positives = 46/115 (40%), Gaps = 10/115 (8%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINNILIKFD 121 E GC ISH LW + I EDD + L ++ D N+I I Sbjct: 4 EKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFIIRL 63 Query: 122 ALRKKPKKDSYLCTLP----GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 +P K +P NFDI + L T GY I + AA +++ KNI Sbjct: 64 ETFLQPVKLEKQTKIPPFYSRNFDILKSTHLG--TAGYIISQGAAKYVIEYLKNI 116 >gi|157367391|gb|ABV45552.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 416 Score = 36.6 bits (83), Expect = 4.0, Method: Compositional matrix adjust. Identities = 38/123 (30%), Positives = 44/123 (35%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ I Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFIQELY 56 Query: 53 KRQCQFKR----------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALKSNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|126322946|ref|XP_001368839.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 623 Score = 36.6 bits (83), Expect = 4.0, Method: Compositional matrix adjust. Identities = 31/124 (25%), Positives = 53/124 (42%), Gaps = 6/124 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQ---FK 59 V++I+L RRE+ ++ DAI G N + Q Sbjct: 344 VFMINLKRRLDRRERMLRTLHEQEIECKIVDAIDGRAMNTSQVEALGIRMLPGYQDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +WK + +++ EDD F F + L +L DI ++ Sbjct: 404 RPLTKGELGCFLSHHQIWKEVVERGLEKSLVFEDDLRFEIFFKRRLMNL-MYDIEEEGLE 462 Query: 120 FDAL 123 +D + Sbjct: 463 WDLI 466 >gi|126297741|ref|XP_001367449.1| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Monodelphis domestica] Length = 592 Score = 36.6 bits (83), Expect = 4.1, Method: Compositional matrix adjust. Identities = 15/49 (30%), Positives = 25/49 (51%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 R L+ E+GC++SH +W+ + ++ EDD F F + L L Sbjct: 380 RTLTKGEVGCFLSHYSIWEEMVTRGLERVVVFEDDVRFEAGFRKRLERL 428 >gi|152978999|ref|YP_001344628.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150840722|gb|ABR74693.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 257 Score = 36.6 bits (83), Expect = 4.1, Method: Compositional matrix adjust. Identities = 19/65 (29%), Positives = 40/65 (61%), Gaps = 4/65 (6%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIG----AIILEDDADFSDEFSQLLPHLSKCDI 113 ++R ++ EIGC +SH++++K++ I +I EDDA F+++F++ L L + ++ Sbjct: 56 YRRDVTKGEIGCTLSHLNVYKKVTEDETIEEEDYVLICEDDALFAEKFNENLTALLRQNL 115 Query: 114 NNILI 118 +I Sbjct: 116 TADII 120 >gi|312116343|ref|YP_004013939.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] gi|311221472|gb|ADP72840.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] Length = 322 Score = 36.2 bits (82), Expect = 4.3, Method: Compositional matrix adjust. Identities = 30/125 (24%), Positives = 54/125 (43%), Gaps = 1/125 (0%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRL 61 IPV++I+L + R + + + + + F AI G P F+ ++ + R Sbjct: 8 IPVFLINLDRAPHRLARMDEKLRSLGIAYERFAAIDGRALPPDSVETFAPRETWTKPTRR 67 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 E+ C+ SH+ + + I A +LEDD D + +F L K N +K + Sbjct: 68 PIANEVACFQSHLSVLQIIRDRSYERACVLEDDLDLAPDFGTFLDPTLKFPPNTDFLKLE 127 Query: 122 ALRKK 126 + K Sbjct: 128 VTQVK 132 >gi|332853929|ref|XP_512497.3| PREDICTED: procollagen galactosyltransferase 1, partial [Pan troglodytes] Length = 478 Score = 36.2 bits (82), Expect = 4.5, Method: Compositional matrix adjust. Identities = 18/66 (27%), Positives = 36/66 (54%), Gaps = 1/66 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 461 Query: 120 FDALRK 125 +D + + Sbjct: 462 WDLMGR 467 >gi|146080914|ref|XP_001464112.1| glycosyl transferase [Leishmania infantum JPCM5] gi|134068202|emb|CAM66488.1| glycosyltransferase family-like protein [Leishmania infantum JPCM5] gi|322497507|emb|CBZ32581.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 275 Score = 36.2 bits (82), Expect = 4.5, Method: Compositional matrix adjust. Identities = 16/42 (38%), Positives = 25/42 (59%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 L+ IGC +SH +W+R+ +ILEDD +F +FS+ Sbjct: 92 LTKGAIGCALSHRAVWQRVVAEHRECVLILEDDLEFHHQFSR 133 >gi|48734956|gb|AAH71684.1| GLT25D1 protein [Homo sapiens] Length = 222 Score = 36.2 bits (82), Expect = 4.7, Method: Compositional matrix adjust. Identities = 16/51 (31%), Positives = 29/51 (56%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 120 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR 170 >gi|15611630|ref|NP_223281.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155113|gb|AAD06145.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 444 Score = 36.2 bits (82), Expect = 5.0, Method: Compositional matrix adjust. Identities = 18/40 (45%), Positives = 20/40 (50%), Gaps = 1/40 (2%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 E GCY+SH LWK S +ILEDD F Q L Sbjct: 93 EFGCYLSHYFLWKECVKSNQ-PVVILEDDVTLESHFMQAL 131 >gi|291336902|gb|ADD96431.1| LosA [uncultured organism MedDCM-OCT-S09-C426] Length = 209 Score = 36.2 bits (82), Expect = 5.0, Method: Compositional matrix adjust. Identities = 48/159 (30%), Positives = 62/159 (38%), Gaps = 42/159 (26%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+L EI C I HI K IA +ILEDD F+++F K Sbjct: 57 RELNLAEISCTIKHILGIKSIAEKCNSYGLILEDDVVFAEDFKS---------------K 101 Query: 120 FDA-LRKKPKK-DSYLCTLPGNFDIHQPRILS---------PRTTG---YFIGKEAAIHL 165 F++ L K P+ D+ G I QP I P + G Y + KEAA + Sbjct: 102 FNSFLEKTPEDWDAIFMGCCGGLRIPQPYIKEDKVAYQVNHPASRGGDSYLLKKEAAQKI 161 Query: 166 LNVRKNIYRPIDM---------DMK-HWWEHNIPSLVTE 194 K D DMK +WWE P LV + Sbjct: 162 YETIKPFNTVSDWELSYQFYLHDMKVYWWE---PPLVIQ 197 >gi|325916198|ref|ZP_08178482.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas vesicatoria ATCC 35937] gi|325537618|gb|EGD09330.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas vesicatoria ATCC 35937] Length = 282 Score = 36.2 bits (82), Expect = 5.1, Method: Compositional matrix adjust. Identities = 24/93 (25%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL + +++ Sbjct: 80 RKWTASNIGCFLSHQAAWRIAAASEDAYTAIFEDDMHLSDALPALLRSTDWLPPGSSVVR 139 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K D+ + + G Q R++ P T Sbjct: 140 LEPSYNRIKLDTKVADIAG----RQLRLVRPST 168 >gi|312376728|gb|EFR23731.1| hypothetical protein AND_12341 [Anopheles darlingi] Length = 323 Score = 36.2 bits (82), Expect = 5.2, Method: Compositional matrix adjust. Identities = 12/37 (32%), Positives = 24/37 (64%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 +++ EIGC++SH ++W+++ ++LEDD F Sbjct: 83 MTMGEIGCFLSHYYIWEKMVRLGLAEVLVLEDDIRFE 119 >gi|47220022|emb|CAG12170.1| unnamed protein product [Tetraodon nigroviridis] Length = 635 Score = 35.8 bits (81), Expect = 5.6, Method: Compositional matrix adjust. Identities = 29/112 (25%), Positives = 53/112 (47%), Gaps = 5/112 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK I ++++EDD F F + L L ++ + Sbjct: 416 RPLTKGELGCFLSHYNIWKEIVKRRLHTSLVIEDDLRFEVFFKRRLMDL-MTEVEEEGLD 474 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +D + RK+ + + +P ++ + S T GY I + A LL Sbjct: 475 WDLIYIGRKRMQVERPEKAVPNIHNLVEAD-YSYWTLGYMISLQGAEKLLKA 525 >gi|308182779|ref|YP_003926906.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] gi|308064964|gb|ADO06856.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] Length = 200 Score = 35.8 bits (81), Expect = 5.7, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTDQ-PVVILEDDVALESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|237750809|ref|ZP_04581289.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter bilis ATCC 43879] gi|229373254|gb|EEO23645.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter bilis ATCC 43879] Length = 172 Score = 35.8 bits (81), Expect = 5.7, Method: Compositional matrix adjust. Identities = 42/172 (24%), Positives = 69/172 (40%), Gaps = 19/172 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAA---------RIHLQFSFFDAIYGENNPICNRIFSHQK 53 + ++VI+L S R+ + + + L +SFF+AI ++ Sbjct: 1 MKIFVINLERSKDRKAHMQEKLSLLEKDPLFKELDLSYSFFNAIDSKSESFKEYKAMFNP 60 Query: 54 RQCQF--KRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 C R+L EI CY SH LW+ + I I+LEDD F F L + + Sbjct: 61 MLCYLWHGRILIDNEIACYASHYSLWEECVRLDEPI--IVLEDDIFFEKHFLSALQDMKR 118 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 + ++F +D Y+ + N + H + T GY+I AA Sbjct: 119 TTFS--FVRF--FTSARSRDKYIYKI-DNSNYHYSLKNTNGTLGYYITPSAA 165 >gi|157367403|gb|ABV45558.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 437 Score = 35.8 bits (81), Expect = 5.8, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 44/123 (35%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKDRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFKR----------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|255327326|ref|ZP_05368400.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] gi|255295606|gb|EET74949.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] Length = 272 Score = 35.8 bits (81), Expect = 6.0, Method: Compositional matrix adjust. Identities = 39/176 (22%), Positives = 79/176 (44%), Gaps = 16/176 (9%) Query: 31 FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 F + A+ ++ + + F + + ++ R S E GC +SH ++WK S A A++ Sbjct: 30 FVPYSAVDNRDDHVTDEEFDVEVFRKRWHRDPSPAEKGCMLSHYNMWKDFLASDADWALL 89 Query: 91 LEDDADFSDEFSQLL-------PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 EDD S + ++ PH+ ++ +I +A + P+ D +L F Sbjct: 90 AEDDVLISPDLQPVVERIIEKYPHVQMVNLGDIYAS-EAGKLNPQVDYPRLSLLSPFVYG 148 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVY 199 + R+ S + G AA++LL+ R R ++ + +P +V + ++Y Sbjct: 149 KYRMGSAYGSKPLYG--AALYLLS-RSGAERLVEC-----FGETVPGVVADDYSLY 196 >gi|18088114|gb|AAH20492.1| GLT25D1 protein [Homo sapiens] Length = 231 Score = 35.8 bits (81), Expect = 6.0, Method: Compositional matrix adjust. Identities = 16/51 (31%), Positives = 29/51 (56%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 12 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR 62 >gi|119605029|gb|EAW84623.1| glycosyltransferase 25 domain containing 1, isoform CRA_c [Homo sapiens] Length = 565 Score = 35.8 bits (81), Expect = 6.1, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 461 Query: 120 FD 121 +D Sbjct: 462 WD 463 >gi|22760716|dbj|BAC11307.1| unnamed protein product [Homo sapiens] Length = 622 Score = 35.8 bits (81), Expect = 6.2, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 461 Query: 120 FD 121 +D Sbjct: 462 WD 463 >gi|146421867|ref|XP_001486877.1| hypothetical protein PGUG_00254 [Meyerozyma guilliermondii ATCC 6260] gi|146387998|gb|EDK36156.1| hypothetical protein PGUG_00254 [Meyerozyma guilliermondii ATCC 6260] Length = 151 Score = 35.8 bits (81), Expect = 6.3, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 5/117 (4%) Query: 149 SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 +P+T F K I LL ++KN++ P ++ +W N+P+L V + + T +T Sbjct: 31 TPQTAYKFDPKYTKIELLLIQKNVFAPA-FGLRRFWRQNLPTLKFHNDDV-DFVLTRIAT 88 Query: 209 IEESRLVRKPTFSPLYFYRNTCYQW---NLHYNAWRKDLPPVSTTKFLPSSSSSLIK 262 + L + PT L+ + + N H N K+L +S K +P ++K Sbjct: 89 KTKEDLAKCPTKIILHHADGSVQELDCSNKHSNKILKELVALSEAKAVPKEEIPVVK 145 >gi|297379818|gb|ADI34705.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori v225d] Length = 200 Score = 35.8 bits (81), Expect = 6.3, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|148697003|gb|EDL28950.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Mus musculus] Length = 478 Score = 35.8 bits (81), Expect = 6.3, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 408 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 466 Query: 120 FD 121 +D Sbjct: 467 WD 468 >gi|325995898|gb|ADZ51303.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 2018] Length = 200 Score = 35.8 bits (81), Expect = 6.4, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 47/123 (38%), Gaps = 30/123 (24%) Query: 5 VYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIFSH 51 VY+ISL S R EKF R F FDAI Y + + ++ Sbjct: 4 VYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKYQDFEKLVQELYDS 58 Query: 52 ---------QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 C ++ LL E GCY+SH LWK + +ILEDD F Sbjct: 59 SSLLKSDWFHSDYC-YQELLP-QEFGCYLSHYFLWKECVKTNQ-PVVILEDDVALESHFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|170784829|ref|NP_666323.2| procollagen galactosyltransferase 1 precursor [Mus musculus] gi|160395574|sp|Q8K297|GT251_MOUSE RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|34785210|gb|AAH56951.1| Glycosyltransferase 25 domain containing 1 [Mus musculus] gi|148697002|gb|EDL28949.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Mus musculus] Length = 617 Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 456 Query: 120 FD 121 +D Sbjct: 457 WD 458 >gi|21595163|gb|AAH32165.1| Glycosyltransferase 25 domain containing 1 [Mus musculus] Length = 617 Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 456 Query: 120 FD 121 +D Sbjct: 457 WD 458 >gi|31377697|ref|NP_078932.2| procollagen galactosyltransferase 1 precursor [Homo sapiens] gi|74715064|sp|Q8NBJ5|GT251_HUMAN RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|22761754|dbj|BAC11684.1| unnamed protein product [Homo sapiens] gi|80478641|gb|AAI08309.1| Glycosyltransferase 25 domain containing 1 [Homo sapiens] gi|119605028|gb|EAW84622.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Homo sapiens] Length = 622 Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 461 Query: 120 FD 121 +D Sbjct: 462 WD 463 >gi|10438672|dbj|BAB15308.1| unnamed protein product [Homo sapiens] Length = 243 Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust. Identities = 16/51 (31%), Positives = 29/51 (56%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 24 RPLTKGELGCFLSHYNIWKEVVDRGPQKSLVFEDDLRFEIFFKRRLMNLMR 74 >gi|119605027|gb|EAW84621.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Homo sapiens] Length = 645 Score = 35.8 bits (81), Expect = 6.6, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 426 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 484 Query: 120 FD 121 +D Sbjct: 485 WD 486 >gi|9963922|gb|AAG09766.1|AF240672_3 galactosyl transferase [Neisseria subflava] gi|325204879|gb|ADZ00333.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M01-240355] Length = 280 Score = 35.8 bits (81), Expect = 6.8, Method: Compositional matrix adjust. Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 2/102 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR + F FFDA+ NR+ + + LLS Sbjct: 4 HVISLASAAERRAHIAATFGARGIPFQFFDALMPSEE--LNRMMAELVPGLAKQHLLSEV 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 E C++SH LWK+ + EDD + + L Sbjct: 62 EKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAE 103 >gi|332253465|ref|XP_003275861.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1-like [Nomascus leucogenys] Length = 626 Score = 35.8 bits (81), Expect = 6.9, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 407 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 465 Query: 120 FD 121 +D Sbjct: 466 WD 467 >gi|74217150|dbj|BAE43293.1| unnamed protein product [Mus musculus] Length = 617 Score = 35.8 bits (81), Expect = 6.9, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D+ + Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVEREGLD 456 Query: 120 FD 121 +D Sbjct: 457 WD 458 >gi|149912506|ref|ZP_01901040.1| glycosyl transferase, family 25 [Roseobacter sp. AzwK-3b] gi|149812912|gb|EDM72738.1| glycosyl transferase, family 25 [Roseobacter sp. AzwK-3b] Length = 250 Score = 35.8 bits (81), Expect = 7.0, Method: Compositional matrix adjust. Identities = 28/102 (27%), Positives = 43/102 (42%), Gaps = 21/102 (20%) Query: 5 VYVISLPFSHARREKFCHRAARI-----HLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 VY+I+LP RR + + AR+ H SFF A+ + +H Sbjct: 22 VYIINLPERADRRREIEGQLARLGLSSAHPSISFFKAVKPSEAGGWPSVGAH-------- 73 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 GC++SH+ + + S +ILEDD D+S F Sbjct: 74 --------GCFMSHLGVLEEALESRFDNVLILEDDMDWSPRF 107 >gi|229846632|ref|ZP_04466740.1| dimethyladenosine transferase [Haemophilus influenzae 7P49H1] gi|229810725|gb|EEP46443.1| dimethyladenosine transferase [Haemophilus influenzae 7P49H1] Length = 274 Score = 35.8 bits (81), Expect = 7.0, Method: Compositional matrix adjust. Identities = 48/215 (22%), Positives = 86/215 (40%), Gaps = 32/215 (14%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRI---FSHQKRQCQFKR 60 YVISL ++ RRE ++ ++ F FFDA+ E + + H K Sbjct: 17 YVISLTSAYQRREHIQKEFSQQNIPFEFFDALKPSKELTSLIEKFIPNLLHAK------- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-----INN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPK-KDSYLCTLPGNFDIHQPRILSPR--TTGYFIGKEAA----IHLLNV 168 +++F+ K KD + G + +++S T GY I +EA H+ ++ Sbjct: 129 FILRFETFLDFSKCKDKKIKPYSGRKIL---KLVSENCGTAGYVISREAVKQLNAHICSL 185 Query: 169 RKNIYRPIDMDMKHWWEHNI----PSLVTEPGAVY 199 N ID+ M + + P + + G++Y Sbjct: 186 TSNHLLAIDLLMFNIFNQFTYQVSPGVCVQEGSLY 220 >gi|208434539|ref|YP_002266205.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] gi|208432468|gb|ACI27339.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] Length = 186 Score = 35.4 bits (80), Expect = 7.5, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|166710181|ref|ZP_02241388.1| glycosyltransferase [Xanthomonas oryzae pv. oryzicola BLS256] Length = 269 Score = 35.4 bits (80), Expect = 7.7, Method: Compositional matrix adjust. Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL N +++ Sbjct: 67 RKWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDTLPALLASTDWLPGGNSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K + + + G Q R++ P T Sbjct: 127 LEPSYNRIKLHAKVADVAG----RQLRLVRPST 155 >gi|325997493|gb|ADZ49701.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 2017] Length = 404 Score = 35.4 bits (80), Expect = 7.7, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 47/123 (38%), Gaps = 30/123 (24%) Query: 5 VYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAI---YGENNPICNRIFSH 51 VY+ISL S R EKF R F FDAI Y + + ++ Sbjct: 4 VYIISLKESQRRLDTEKLILESNEKFKGRCV-----FQIFDAISPKYQDFEKLVQELYDS 58 Query: 52 ---------QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 C ++ LL E GCY+SH LWK + +ILEDD F Sbjct: 59 SSLLKSDWFHSDYC-YQELLP-QEFGCYLSHYFLWKECVKTNQ-PVVILEDDVALESHFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|308063591|gb|ADO05478.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 202 Score = 35.4 bits (80), Expect = 7.8, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 18 IQVYIISLKESQRRLDTEKLVSESNEKFKGRCV-----FQIFDAISPKHEDFEKFVQELY 72 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK + +ILEDD F Sbjct: 73 DAQSMLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFM 131 Query: 103 QLL 105 Q L Sbjct: 132 QAL 134 >gi|310817025|ref|YP_003964989.1| glycosyl transferase, family 25 [Ketogulonicigenium vulgare Y25] gi|308755760|gb|ADO43689.1| glycosyl transferase, family 25 [Ketogulonicigenium vulgare Y25] Length = 261 Score = 35.4 bits (80), Expect = 8.0, Method: Compositional matrix adjust. Identities = 49/214 (22%), Positives = 84/214 (39%), Gaps = 10/214 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFK 59 + I YVI+L + R F ++ + A+ G + +P +S + F Sbjct: 12 LQISYYVINLDDNPERLRAFAAHIQPENIAYERVAAVDGRHVSPASFPSYSAKAAVAFFG 71 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L E+ C +SHI R A AII EDDA +++F+ + + + L Sbjct: 72 RGLLGGEVACCLSHIDALNRFMAGDAPYAIIFEDDAWVAEDFTAKVER-AIALFDEKLPG 130 Query: 120 FDALRKKPK--KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + L P+ + T G + + T + A +L IY P D Sbjct: 131 WQLLHLYPRTLRGHSPITAEGGHTLFAAHEFAVGTAAIVYSRAGAKAVLETAVPIYAPWD 190 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEA-IDTNDSTIE 210 + +K + + + GA+ A I T+D+ E Sbjct: 191 VHLK-----EVTARHGKSGAITPALIQTSDAPSE 219 >gi|257464901|ref|ZP_05629272.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor 202] gi|257450561|gb|EEV24604.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor 202] Length = 263 Score = 35.4 bits (80), Expect = 8.1, Method: Compositional matrix adjust. Identities = 38/127 (29%), Positives = 59/127 (46%), Gaps = 11/127 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR + ++ F FFDAI + + F L+ Sbjct: 10 YVISLLTAKERRNHIIQEFGKQNIPFEFFDAITPDLIEEKAKEFGIDISSSP----LTKG 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIKFD-- 121 EI C +SHI LW +A + I + +D + E ++ L++ INN +IKF+ Sbjct: 66 EIACALSHIALW-HLAKEKELDYICIFEDDIYLGENAKCF--LTEAYINNDVDIIKFETF 122 Query: 122 ALRKKPK 128 +L K+PK Sbjct: 123 SLDKQPK 129 >gi|192360086|ref|YP_001981136.1| glycosyl transferase [Cellvibrio japonicus Ueda107] gi|190686251|gb|ACE83929.1| glycosyl transferase, putative, gt25A [Cellvibrio japonicus Ueda107] Length = 250 Score = 35.4 bits (80), Expect = 8.4, Method: Compositional matrix adjust. Identities = 34/131 (25%), Positives = 57/131 (43%), Gaps = 10/131 (7%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 +R Q K+ ++GC++SH +W+RI S A + EDD + +LL Sbjct: 52 QRPRQGKKSWLRGQMGCFLSHYGVWQRIVNSRVRFAAVFEDDIHIAPGLYELLEGDDWIA 111 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQP---RILSPR--TTGYFIGKEAAIHLLN 167 + +I+ D + + L P H R+LS T GY I + A LL+ Sbjct: 112 DSVDIIRLDTSTNRVR----LGPEPA-LQCHSHYFYRLLSTSWCTGGYIIHQRTARQLLD 166 Query: 168 VRKNIYRPIDM 178 + ++P D+ Sbjct: 167 LAPRYHQPSDV 177 >gi|261839553|gb|ACX99318.1| hypothetical protein HPKB_0724 [Helicobacter pylori 52] Length = 184 Score = 35.4 bits (80), Expect = 8.6, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 26/123 (21%) Query: 3 IPVYVISLPFSHAR----------REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ 52 I VY+ISL S R EKF R F FDAI ++ + Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-----FQIFDAISPKHQDFEKFVQELY 56 Query: 53 KRQCQFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q K + L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQ-PVVILEDDVALESNFM 115 Query: 103 QLL 105 Q L Sbjct: 116 QAL 118 >gi|325925713|ref|ZP_08187089.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325926867|ref|ZP_08188149.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325542768|gb|EGD14228.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325543882|gb|EGD15289.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] Length = 271 Score = 35.4 bits (80), Expect = 8.7, Method: Compositional matrix adjust. Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 4/93 (4%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + IGC++SH W+ A S I EDD SD LL +++ Sbjct: 67 RKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLGSTDWLPGGRSIVR 126 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + K D+ + + G Q R++ P T Sbjct: 127 LEPSYNRIKLDAKVADVAG----RQLRLVRPST 155 >gi|256081803|ref|XP_002577157.1| cerebral cell adhesion molecule related [Schistosoma mansoni] gi|238662453|emb|CAZ33394.1| cerebral cell adhesion molecule related [Schistosoma mansoni] Length = 680 Score = 35.0 bits (79), Expect = 9.8, Method: Compositional matrix adjust. Identities = 16/42 (38%), Positives = 24/42 (57%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 R L EIGC++SH ++W + + +ILEDD F+ F Sbjct: 505 RSLKFGEIGCFLSHYNIWIEMINNGYNRILILEDDLRFAPAF 546 Searching..................................................done Results from round 2 >gi|254781114|ref|YP_003065527.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] gi|254040791|gb|ACT57587.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] Length = 263 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 263/263 (100%), Positives = 263/263 (100%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR Sbjct: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF Sbjct: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM Sbjct: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW Sbjct: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 Query: 241 RKDLPPVSTTKFLPSSSSSLIKS 263 RKDLPPVSTTKFLPSSSSSLIKS Sbjct: 241 RKDLPPVSTTKFLPSSSSSLIKS 263 >gi|315122483|ref|YP_004062972.1| glycosyl transferase family protein [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495885|gb|ADR52484.1| glycosyl transferase family protein [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 253 Score = 304 bits (779), Expect = 8e-81, Method: Composition-based stats. Identities = 184/252 (73%), Positives = 212/252 (84%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + I YVISLPFSHARREKFCHRA+ I+LQFSF DAIYGEN+ IC +IF +Q RQ FKR Sbjct: 2 ISISNYVISLPFSHARREKFCHRASLINLQFSFVDAIYGENDSICKKIFYNQNRQHHFKR 61 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LLSLPEIGCY+SHI+LWK+IA S A+GAIILEDDADFS F QL+ HL+ CDI NILIKF Sbjct: 62 LLSLPEIGCYMSHINLWKKIASSDALGAIILEDDADFSVGFPQLISHLNHCDIGNILIKF 121 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 DALRK+PK+ +LC +PGNF I QPRILSPRTTGYFIGK+AA HLL+VRK+I+RP+DMDM Sbjct: 122 DALRKRPKETDFLCKVPGNFGILQPRILSPRTTGYFIGKQAAAHLLDVRKDIFRPVDMDM 181 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 KHWWEH +PSLVTEPGAVYEA +T+DS+IE SRL +K TFSPLYFY+N YQ NLHY A Sbjct: 182 KHWWEHTVPSLVTEPGAVYEAANTDDSSIEASRLQKKRTFSPLYFYQNMRYQLNLHYQAR 241 Query: 241 RKDLPPVSTTKF 252 +K LP VS F Sbjct: 242 KKTLPYVSPEIF 253 >gi|91791576|ref|YP_561227.1| glycosyl transferase family protein [Shewanella denitrificans OS217] gi|91713578|gb|ABE53504.1| glycosyl transferase, family 25 [Shewanella denitrificans OS217] Length = 256 Score = 242 bits (618), Expect = 4e-62, Method: Composition-based stats. Identities = 66/254 (25%), Positives = 115/254 (45%), Gaps = 6/254 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M ++VI+L S R E + +++ ++F A+ G+N + +++ ++ Sbjct: 1 MSFKIFVINLDSSVDRIENMQAQCSKLGIEFERVSAVRGKNLSSAEKAAVYNRDVNLAKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L+ EIGCY+SH W++I ++ILEDDA + E + + L+ I Sbjct: 61 DKELNDGEIGCYMSHARCWEQILIQKLDYSLILEDDAILTPEITAYIAKLADSTSEWDYI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K RK PK L + Q L TTG F+ + A LL I RPID+ Sbjct: 121 KLSHGRK-PKNILNAIDLGDGLSLGQCIKLPSTTTGQFVSQTGAKKLLQHAYPIARPIDI 179 Query: 179 DMKHWWEHNIPSLVTEPGAVYEA-IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHY 237 D++ W+E ++ S V P + + + + + R V K F+ + ++ ++ NL Sbjct: 180 DIQFWYEKSLRSFVVRPFPILNGDFGSEINQVTDRRQVDKRQFARI--WQKAKFELNLLL 237 Query: 238 NAWRKDLPPVSTTK 251 + R P S K Sbjct: 238 HRGRLGALPQSLVK 251 >gi|258625904|ref|ZP_05720779.1| Putative glycosyltransferase [Vibrio mimicus VM603] gi|258581868|gb|EEW06742.1| Putative glycosyltransferase [Vibrio mimicus VM603] Length = 277 Score = 233 bits (595), Expect = 2e-59, Method: Composition-based stats. Identities = 69/248 (27%), Positives = 106/248 (42%), Gaps = 14/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + A L F FFDAI G NP + + KR Sbjct: 36 MKIYVISLKNSLDRRASIEEQMANYGLDFEFFDAIDGRINPPHPIFSNYDYAKRLWFTSG 95 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ S E+GCY SH +W++ A IILEDDA FS L + ++ Sbjct: 96 KMPSKGELGCYASHYLMWQKCAELGE-SIIILEDDAKVLPCFSDYLNAIRVKTQEYGFLR 154 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + ++ + +L NF+I+ Y + E+A L+ + P+D Sbjct: 155 LEQAYERSRL--FLKEKTDNFEIYFLTNNFGGARAYALSPESAKKLIENSQCWSMPVDNY 212 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQWN 234 + + HN+PS + P V E D ++T + + L RKPT YR + Sbjct: 213 IGSLYLHNMPSFIFHPSIV-ENPDYFETTFQNEKQPSAPLYRKPTRELYSLYRKV--RMA 269 Query: 235 LHYNAWRK 242 L N ++K Sbjct: 270 LANNVYKK 277 >gi|262404971|ref|ZP_06081523.1| putative glycosyltransferase WavM [Vibrio sp. RC586] gi|262348810|gb|EEY97951.1| putative glycosyltransferase WavM [Vibrio sp. RC586] Length = 242 Score = 230 bits (587), Expect = 1e-58, Method: Composition-based stats. Identities = 64/248 (25%), Positives = 99/248 (39%), Gaps = 14/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRVDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ +ILEDDA F L + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELGEP-IVILEDDAKILPSFINYLNIIRVKTQEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + +L +F I Y + E+A L+ + P+D Sbjct: 120 LE--EAYDRSRLFLKEKTNDFKISFLTNNFGGARAYSLSPESAKKLIKGSQRWSMPVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQWN 234 M + H++PS V P V E D ++T + R L RKPT YR + Sbjct: 178 MGSLYLHDMPSFVFYPSVV-ENPDHFETTFQNDRQPSAPLYRKPTRELYSLYRKI--KMA 234 Query: 235 LHYNAWRK 242 L N ++K Sbjct: 235 LANNVYKK 242 >gi|229524815|ref|ZP_04414220.1| beta-1,4-galactosyltransferase [Vibrio cholerae bv. albensis VL426] gi|229338396|gb|EEO03413.1| beta-1,4-galactosyltransferase [Vibrio cholerae bv. albensis VL426] Length = 242 Score = 229 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 10/239 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + A L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH LW++ A ++LEDD ++ F + L + ++ Sbjct: 61 KMPMRGELGCYASHYLLWQKCVELNAP-IVVLEDDVIINENFPKNLSIIKDKTNEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + K + S GN+ I T Y I E+A L+ + +D Sbjct: 120 LEPEVGKCRLFSKESK--GNYSIAFMDNNWGGTRAYSISPESARKLILGSQKWSMAVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLYFYRNTCYQWN 234 + + H +PS + P V ++ + E R L RKPT Y+ + Sbjct: 178 IGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRKPTREIYSLYKKIRLKLF 236 >gi|262192416|ref|ZP_06050568.1| putative glycosyltransferase WavM [Vibrio cholerae CT 5369-93] gi|262031680|gb|EEY50266.1| putative glycosyltransferase WavM [Vibrio cholerae CT 5369-93] Length = 242 Score = 227 bits (579), Expect = 1e-57, Method: Composition-based stats. Identities = 61/247 (24%), Positives = 96/247 (38%), Gaps = 14/247 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ +ILEDDA F L + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELGEP-IVILEDDAKILPSFINYLNIIRVKTQEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + +L +F+I Y + E+A L+ + P+D Sbjct: 120 LE--EAYDRSRLFLKEKTNDFEISFLTNNFGGARAYSLSPESAKKLIKGSQRWSMPVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES-----RLVRKPTFSPLYFYRNTCYQWN 234 M + H +PS V P V + D+T + L RKPT YR + Sbjct: 178 MGSLYLHGMPSFVFHPSVV-DNPANFDTTFQHEVQRPTPLYRKPTRELYSLYRKI--RMA 234 Query: 235 LHYNAWR 241 L N ++ Sbjct: 235 LANNVYK 241 >gi|18653286|gb|AAL77358.1|AF444794_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 242 Score = 225 bits (574), Expect = 4e-57, Method: Composition-based stats. Identities = 58/241 (24%), Positives = 93/241 (38%), Gaps = 10/241 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH LW++ A ++LEDD ++ F Q L + ++ Sbjct: 61 KMPMRGELGCYASHYLLWQKCVELNAP-IVVLEDDVIINENFPQYLSIMKDKTNEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + K S N+ I T Y I ++A L+ + +D Sbjct: 120 LEPEVGKCSLFSKESK--ENYSIAFMDNNWGGTRAYSISPDSARKLILGSQKWSMAVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLYFYRNTCYQWNL 235 + + H +PS + P V ++ + E R L RKPT Y+ Sbjct: 178 IGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRKPTREIYSVYKKIRIMMFA 237 Query: 236 H 236 + Sbjct: 238 N 238 >gi|229521046|ref|ZP_04410467.1| beta-1,4-galactosyltransferase [Vibrio cholerae TM 11079-80] gi|229341931|gb|EEO06932.1| beta-1,4-galactosyltransferase [Vibrio cholerae TM 11079-80] Length = 242 Score = 225 bits (573), Expect = 5e-57, Method: Composition-based stats. Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 14/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ IILEDD S + L + + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELNEP-IIILEDDIKISPMLVEYLADIKHEVSSFGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A ++ D +L + I GY I ++A LLN + P+D Sbjct: 120 LQAAYER--SDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWCMPVDNF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQWN 234 + + H +PS + P V E + ++T + R L RKPT YR + Sbjct: 178 IGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSAPLYRKPTREIYSLYRKV--KMA 234 Query: 235 LHYNAWRK 242 + N ++K Sbjct: 235 IANNVYKK 242 >gi|153829559|ref|ZP_01982226.1| putative glycosyltransferase WavM [Vibrio cholerae 623-39] gi|148874955|gb|EDL73090.1| putative glycosyltransferase WavM [Vibrio cholerae 623-39] Length = 242 Score = 225 bits (573), Expect = 7e-57, Method: Composition-based stats. Identities = 59/241 (24%), Positives = 94/241 (39%), Gaps = 10/241 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + A L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASYGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH LW++ A ++LEDD ++ F Q L + ++ Sbjct: 61 KMPMRGELGCYASHYLLWQKCVELNAP-IVVLEDDVIINENFPQYLSIIKDKTNEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + K + S N+ I T Y I +A L+ + +D Sbjct: 120 LEPEVGKCRLFSKESK--ENYSIAFIDNNWGGTRAYSISPVSARKLILGSQKWSMAVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLYFYRNTCYQWNL 235 + + H +PS + P V ++ + E R L RKPT Y+ Sbjct: 178 IGCTYIHKMPSYIFSPSMVEHGVEFETTFQNEKRIRVPLYRKPTREIYSVYKKIRIMMFA 237 Query: 236 H 236 + Sbjct: 238 N 238 >gi|254225693|ref|ZP_04919300.1| putative glycosyltransferase WavM [Vibrio cholerae V51] gi|125621813|gb|EAZ50140.1| putative glycosyltransferase WavM [Vibrio cholerae V51] Length = 242 Score = 224 bits (572), Expect = 9e-57, Method: Composition-based stats. Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 14/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ IILEDD S + L + + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELNEP-IIILEDDIKISPMLVEYLADIKHEVSSFGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A ++ D +L + I GY I ++A LLN + P+D Sbjct: 120 LQAAYER--SDIFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWCMPVDNF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQWN 234 + + H +PS + P V E + ++T + R L RKPT YR + Sbjct: 178 IGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSTPLYRKPTREIYSLYRKV--KMA 234 Query: 235 LHYNAWRK 242 + N ++K Sbjct: 235 IANNVYKK 242 >gi|229527261|ref|ZP_04416654.1| beta-1,4-galactosyltransferase [Vibrio cholerae 12129(1)] gi|229335269|gb|EEO00753.1| beta-1,4-galactosyltransferase [Vibrio cholerae 12129(1)] Length = 242 Score = 224 bits (571), Expect = 9e-57, Method: Composition-based stats. Identities = 65/248 (26%), Positives = 103/248 (41%), Gaps = 14/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + A L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMASYGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ IILEDD S + L + + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVEINEP-IIILEDDIKISPMLVEYLADIKHEVSSFGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A ++ D +L + I GY I ++A LLN + P+D Sbjct: 120 LQAAYER--SDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWCMPVDNF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRNTCYQWN 234 + + H +PS + P V E + ++T + R L RKPT YR + Sbjct: 178 IGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSAPLYRKPTREIYSLYRKV--KMA 234 Query: 235 LHYNAWRK 242 + N ++K Sbjct: 235 IANNVYKK 242 >gi|241765554|ref|ZP_04763514.1| glycosyl transferase family 25 [Acidovorax delafieldii 2AN] gi|241364643|gb|EER59681.1| glycosyl transferase family 25 [Acidovorax delafieldii 2AN] Length = 244 Score = 220 bits (562), Expect = 1e-55, Method: Composition-based stats. Identities = 64/249 (25%), Positives = 113/249 (45%), Gaps = 9/249 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 MP+P+ I+L RR + R+ LQ DA++ P + ++S + Q+ Sbjct: 1 MPLPLCYINLARDEERRARIESELRRLCLQGDRIDAVWWAGVPTAQQSALYSEALNRRQY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L E GCY SH+ W+++ S A I+LEDD D F ++ +++ + ++ Sbjct: 61 YKPLVAGEKGCYASHLVAWQKLLDSTAPAMIVLEDDIRLDDRFLEVTDAIAQLEEPWDMV 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K ++ +K L ++ + R + T GY + + A LL RK RPID+ Sbjct: 121 KLMG--REHEKVRAHRPLTAGTELVEYRRIPSMTAGYVVSRSGAAKLLASRKPFGRPIDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL-HY 237 D++ WWE+++ L P + + S+I + + P +R + L Sbjct: 179 DLRFWWENDLRILGVVPPVLILDDTSLVSSIGQ----KPPRLGVGAKWRKFRMKLRLTAL 234 Query: 238 NAWRKDLPP 246 NAW + PP Sbjct: 235 NAWHQVRPP 243 >gi|114565077|ref|YP_752591.1| glycosyl transferase family protein [Shewanella frigidimarina NCIMB 400] gi|114336370|gb|ABI73752.1| glycosyl transferase, family 25 [Shewanella frigidimarina NCIMB 400] Length = 250 Score = 220 bits (562), Expect = 1e-55, Method: Composition-based stats. Identities = 59/247 (23%), Positives = 106/247 (42%), Gaps = 6/247 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 M V++I+L S R + ++ +++ A+YG++ + +++ Q ++ Sbjct: 1 MKFKVFLINLDNSTERFTFMDEQLKQLGVEYQRISAVYGKDLHDTEIAKVYDPQTNLQKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L+L EIGCY+SH+ W+ I A+ILEDD+ ++ H++ + I Sbjct: 61 DKKLNLGEIGCYLSHVQCWQMIIEQQLDYALILEDDSILDPALMTVIQHINNLSADWDYI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K RK PK L F + L T G + A LL I RP+D+ Sbjct: 121 KLCHGRK-PKGIVKSIVLDDRFSLSTCLKLPASTRGQCVSLAGAQKLLATAYPIARPVDI 179 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 D++ W+E + V P V D + + R L ++ ++ NL Sbjct: 180 DIQFWFEKQLRCFVVRPFPVI-GTDLDSDINRQGRRTNVERHHLLRIWQKMKFELNLL-- 236 Query: 239 AWRKDLP 245 +++ LP Sbjct: 237 KYKQQLP 243 >gi|157373252|ref|YP_001471852.1| glycosyl transferase family protein [Shewanella sediminis HAW-EB3] gi|157315626|gb|ABV34724.1| glycosyl transferase, family 25 [Shewanella sediminis HAW-EB3] Length = 252 Score = 220 bits (562), Expect = 1e-55, Method: Composition-based stats. Identities = 62/251 (24%), Positives = 108/251 (43%), Gaps = 8/251 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M VY+I+L S R E+ + ++ +++ A+ G R ++ ++ Sbjct: 1 MKFKVYIINLDSSVDRWEQLSSQCDQMGIEYERVSAVRGSELTEQERAKVYDLPTNLEKY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-INNIL 117 + L+ EIGCY+SHI W++I A+ILEDD + + + D + Sbjct: 61 DKRLNDGEIGCYLSHIKCWEKIIEEELDFALILEDDVSLRENIKECIAQFFSMDSSDWDY 120 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 IK RK+ K L +H+ L T F+ + A LL I RP+D Sbjct: 121 IKLFH-RKRLKMKVASLNLDEGVQLHKVAKLPASTCSQFVSQVGAKKLLKHSFPIARPVD 179 Query: 178 MDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESR-LVRKPTFSPLYFYRNTCYQWNL 235 +D+++W+E + V P AV D+ I + R V++ F ++ Y+ N+ Sbjct: 180 VDIQYWFERELRCFVVSPPVAVPAGFDSEIDIIGQHRDYVQRRAFKRIWL--RVIYEINV 237 Query: 236 HYNAWRKDLPP 246 N ++ P Sbjct: 238 LLNTFKLSPFP 248 >gi|221069650|ref|ZP_03545755.1| glycosyl transferase family 25 [Comamonas testosteroni KF-1] gi|220714673|gb|EED70041.1| glycosyl transferase family 25 [Comamonas testosteroni KF-1] Length = 254 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 102/252 (40%), Gaps = 6/252 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQFK 59 +P+ I+L RRE+ + A++ L S A++ + + F + ++ Sbjct: 3 SLPIVFINLSKDAERRERMTAQFAQMGLAASRLPAVWWADLSPAEQRHYFCAPQSHGRYF 62 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 + LS E GCY SH+ W+++ A ++ EDD + Q L + + Sbjct: 63 KPLSNGEKGCYASHLRSWQQLLDGDAPAMVVFEDDVRLLPDLPQALAAIEALPADGSWDM 122 Query: 120 FDALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 ++P+K + L G+ + + + GY I + A +L R RP+D+ Sbjct: 123 IKLYGREPEKIANQGPLVEGSLQLISYQRVPSFAAGYVISRSGARKMLEARVPFDRPVDV 182 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 D++ W+E+++ P + + S+I + S L R + L++ Sbjct: 183 DIRFWFENDLRVYGVYPSVIALDDTSEVSSIWAQQAA---PTSRLQKMRKFKMKLALNWG 239 Query: 239 AWRKDLPPVSTT 250 R P VS Sbjct: 240 NLRAAKPQVSAV 251 >gi|262163773|ref|ZP_06031513.1| putative glycosyltransferase WavM [Vibrio mimicus VM223] gi|262027753|gb|EEY46418.1| putative glycosyltransferase WavM [Vibrio mimicus VM223] Length = 242 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 62/247 (25%), Positives = 103/247 (41%), Gaps = 14/247 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + A L F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMADYGLGFEFFDAIDGRVDPPHPLFANYDYAKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ A +ILEDDA FS L + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCAELSEP-IVILEDDAKILPSFSTYLEVIKYKTQCYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + K+ + +L +++I Y + ++A LLN P+D Sbjct: 120 LEEADKRSQI--FLKEKASDYEISFLSHNFGGARAYALSPQSAQKLLNGSLRWSMPVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE-----ESRLVRKPTFSPLYFYRNTCYQWN 234 M + H +PS + P V E + ++T + ++ L RKPT YR + Sbjct: 178 MGSLYLHGVPSFLFSPS-VIENPEFFETTFQNNWQSKTPLYRKPTREIYSLYRKI--KMA 234 Query: 235 LHYNAWR 241 + + ++ Sbjct: 235 MANHVYK 241 >gi|18653277|gb|AAL77351.1|AF444792_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 230 Score = 219 bits (558), Expect = 3e-55, Method: Composition-based stats. Identities = 62/234 (26%), Positives = 96/234 (41%), Gaps = 12/234 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ IILEDD S + L + + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELNEP-IIILEDDIKISPMLVEYLADIKHEVSSFGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A ++ D +L + I GY I ++A LLN + P+D Sbjct: 120 LQAAYER--SDVFLKKETERYSILFLSNNFGGAIGYAIAPQSAAKLLNSSERWCMPVDNF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-----LVRKPTFSPLYFYRN 228 + + H +PS + P V E + ++T + R L RKPT YR Sbjct: 178 IGSLYLHEMPSFLFSPIMV-ENPENFETTFQNDRQPSTPLYRKPTREIYSLYRK 230 >gi|299533415|ref|ZP_07046797.1| glycosyl transferase, family 25 [Comamonas testosteroni S44] gi|298718621|gb|EFI59596.1| glycosyl transferase, family 25 [Comamonas testosteroni S44] Length = 254 Score = 217 bits (552), Expect = 2e-54, Method: Composition-based stats. Identities = 52/252 (20%), Positives = 104/252 (41%), Gaps = 6/252 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQFK 59 +P+ I+L RRE+ + A++ L S A++ + + F + ++ Sbjct: 3 SLPIVFINLSKDAERRERMTAQFAQMGLTASRLPAVWWGDLSEAEQKTFFCAPQSHGRYF 62 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 + LS E GCY SH+ W+++ S A ++ EDD + Q L + + Sbjct: 63 KPLSNGEKGCYASHLRSWQQLMDSDAPAMVVFEDDVRLLPDLPQALAAIEALPTDGSWDM 122 Query: 120 FDALRKKPKKDSYLCTL-PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 ++P+K + L G+ + + + GY I + A +L R RP+D+ Sbjct: 123 IKLYGREPEKIADQGPLVQGSLQLISYQRVPSFAAGYVISRSGARKMLEARVPFDRPVDV 182 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 D++ W+E+++ P + + S+I ++ + L R + L + Sbjct: 183 DIRFWFENDLRVYGVYPSVISLDDTSEISSIWAQ---KEAPVNRLQKMRKFKMKLALSWG 239 Query: 239 AWRKDLPPVSTT 250 +R P V+ Sbjct: 240 NFRASKPLVTAV 251 >gi|145628361|ref|ZP_01784162.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 22.1-21] gi|144980136|gb|EDJ89795.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 22.1-21] Length = 232 Score = 214 bits (546), Expect = 9e-54, Method: Composition-based stats. Identities = 58/229 (25%), Positives = 103/229 (44%), Gaps = 10/229 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 D+ M ++E+ +P P + + + + E+ + R S Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQRSLKNSDYER 230 >gi|145636691|ref|ZP_01792358.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittHH] gi|229845801|ref|ZP_04465913.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 7P49H1] gi|145270217|gb|EDK10153.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittHH] gi|226192503|gb|ACO37527.1| Lex2B [Haemophilus influenzae] gi|226192506|gb|ACO37529.1| Lex2B [Haemophilus influenzae] gi|226192515|gb|ACO37535.1| Lex2B [Haemophilus influenzae] gi|226192536|gb|ACO37549.1| Lex2B [Haemophilus influenzae] gi|226192542|gb|ACO37553.1| Lex2B [Haemophilus influenzae] gi|229810805|gb|EEP46522.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 7P49H1] Length = 247 Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats. Identities = 58/220 (26%), Positives = 103/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TTGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTINEKQKKQR 221 >gi|148826601|ref|YP_001291354.1| 50S ribosomal protein L31 [Haemophilus influenzae PittEE] gi|148716761|gb|ABQ98971.1| 50S ribosomal protein L31 [Haemophilus influenzae PittEE] gi|226192500|gb|ACO37525.1| Lex2B [Haemophilus influenzae] gi|226192521|gb|ACO37539.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats. Identities = 58/220 (26%), Positives = 103/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P V P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYVIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|224999306|gb|ACN78416.1| galactosyltransferase A [Pasteurella multocida] gi|325559791|gb|ADZ31082.1| putative bi-functional 1,4 and 1,6 beta-D-galactosyltransferase [Pasteurella multocida] gi|325559798|gb|ADZ31088.1| putative bi-functional 1,4 and 1,6 beta-D-galactosyltransferase [Pasteurella multocida] gi|329756856|gb|AEC04647.1| galactosyltransferase [Pasteurella multocida] gi|329756863|gb|AEC04653.1| galactosyltransferase [Pasteurella multocida] gi|329756870|gb|AEC04659.1| galactosyltransferase [Pasteurella multocida] gi|329756877|gb|AEC04665.1| galactosyltransferase [Pasteurella multocida] Length = 225 Score = 213 bits (543), Expect = 2e-53, Method: Composition-based stats. Identities = 67/224 (29%), Positives = 101/224 (45%), Gaps = 15/224 (6%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 M +P + VISL S RR+ HR + + L F FFDA+YG++ + + + C Sbjct: 1 MKLPKIIVISLKNS-PRRQIISHRLSGLGLDFEFFDAVYGKDLTKEELEKIDYEFFPKYC 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINN 115 K L+L EIGC +SHI +++ I + IILEDDA S F ++ L L K Sbjct: 60 GSKGALTLGEIGCAMSHIKIYEHIVANNLEQVIILEDDAIVSLYFEEIVLAALQKLPNRR 119 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNV 168 ++ D K K ++ LP + + + R S RTT Y I E A LL Sbjct: 120 EILFLD--HGKAKVYPFMRNLPERYRLARYRKPSKHSKRFIVRTTAYLITLEGAKKLLKH 177 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 I P D +I + EP V+ +++ + +E Sbjct: 178 AYPIRMPSDFLTGLLQLTHINAYGIEPSCVFGGVESEINEMERR 221 >gi|226192539|gb|ACO37551.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 58/220 (26%), Positives = 103/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TTGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKRR 221 >gi|145630497|ref|ZP_01786277.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae R3021] gi|144983887|gb|EDJ91329.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae R3021] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 TKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192551|gb|ACO37559.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNP--ICNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHSLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192491|gb|ACO37519.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L++ ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTMGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TTGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKRR 221 >gi|148827944|ref|YP_001292697.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittGG] gi|148719186|gb|ABR00314.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittGG] gi|226192488|gb|ACO37517.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTINEKQKKQR 221 >gi|145634835|ref|ZP_01790543.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittAA] gi|260581568|ref|ZP_05849365.1| glycosyltransferase [Haemophilus influenzae NT127] gi|145268001|gb|EDK07997.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittAA] gi|226192494|gb|ACO37521.1| Lex2B [Haemophilus influenzae] gi|226192497|gb|ACO37523.1| Lex2B [Haemophilus influenzae] gi|226192509|gb|ACO37531.1| Lex2B [Haemophilus influenzae] gi|226192512|gb|ACO37533.1| Lex2B [Haemophilus influenzae] gi|226192524|gb|ACO37541.1| Lex2B [Haemophilus influenzae] gi|260095161|gb|EEW79052.1| glycosyltransferase [Haemophilus influenzae NT127] gi|309973750|gb|ADO96951.1| beta-(1-4)-glucosyltransferase Lex2B [Haemophilus influenzae R2846] Length = 247 Score = 212 bits (541), Expect = 3e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|68249350|ref|YP_248462.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 86-028NP] gi|145632107|ref|ZP_01787842.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 3655] gi|145638450|ref|ZP_01794060.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittII] gi|329122700|ref|ZP_08251278.1| UDP-glucose-lipooligosaccharide glucosyltransferase [Haemophilus aegyptius ATCC 11116] gi|68057549|gb|AAX87802.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 86-028NP] gi|144987014|gb|EDJ93544.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 3655] gi|145272779|gb|EDK12686.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae PittII] gi|226192485|gb|ACO37515.1| Lex2B [Haemophilus influenzae] gi|309751585|gb|ADO81569.1| beta-(1-4)-glucosyltransferase Lex2B [Haemophilus influenzae R2866] gi|327472574|gb|EGF18004.1| UDP-glucose-lipooligosaccharide glucosyltransferase [Haemophilus aegyptius ATCC 11116] Length = 247 Score = 212 bits (540), Expect = 4e-53, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192554|gb|ACO37561.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 212 bits (540), Expect = 4e-53, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 102/221 (46%), Gaps = 10/221 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICNRI--FSHQKRQ 55 IP+Y+I+L S R+ + ++ +L + FF+AIYG++NP ++ KR Sbjct: 4 IPIYIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFHRYNENKRL 63 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 64 NAKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTF 122 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 R K K+ L + GN I+Q TGY++ +AA L K Y Sbjct: 123 EFFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLT 180 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 +D+ M ++E+ +P P + + + + E+ + R Sbjct: 181 VDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|311766115|emb|CBW29072.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 10810] Length = 247 Score = 212 bits (539), Expect = 5e-53, Method: Composition-based stats. Identities = 56/220 (25%), Positives = 99/220 (45%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRA-----ARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + + + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|20977179|gb|AAM33309.1|AF503507_2 Lex2B [Haemophilus influenzae] Length = 248 Score = 212 bits (539), Expect = 5e-53, Method: Composition-based stats. Identities = 56/220 (25%), Positives = 99/220 (45%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRA-----ARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + + + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|507354|gb|AAA60375.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 211 bits (538), Expect = 6e-53, Method: Composition-based stats. Identities = 56/220 (25%), Positives = 99/220 (45%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRA-----ARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + + + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFELLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 F L + K L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 F--FWLLPDRLKNKRKLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|226192518|gb|ACO37537.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 211 bits (537), Expect = 1e-52, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLNLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|319775306|ref|YP_004137794.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3047] gi|317449897|emb|CBY86109.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3047] Length = 247 Score = 211 bits (537), Expect = 1e-52, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKRR 221 >gi|260779644|ref|ZP_05888534.1| putative glycosyltransferase WavM [Vibrio coralliilyticus ATCC BAA-450] gi|260604453|gb|EEX30757.1| putative glycosyltransferase WavM [Vibrio coralliilyticus ATCC BAA-450] Length = 241 Score = 211 bits (537), Expect = 1e-52, Method: Composition-based stats. Identities = 64/238 (26%), Positives = 99/238 (41%), Gaps = 12/238 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQ-CQFK 59 + V+VISL S RRE + ++++ F FFDA+ G P +++ KR Sbjct: 1 MKVFVISLQRSPERREYIKKQLDQLNIDFEFFDAVDGRAEPPHKLFENYNYAKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+ S E+G Y SH LWK+ I+ EDDA F L + + ++ Sbjct: 61 RMPSKGELGVYGSHYLLWKKSIELDEP-IIVFEDDASILPSFPSSLKVVQEKIDRYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 ++ K K L NF I Y I AA L++ + P+D Sbjct: 120 LES--KTDKGVLCLKEEDDNFSISFMTNNYGGLRCYAISPYAAKKLVSNSERWCMPVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE-----ESRLVRKPTFSPLYFYRNTCYQ 232 + + H++PS + P V E +TI+ ++RL RKPT YR + Sbjct: 178 VGSLYLHDMPSYILSPAVV-ENPQEFATTIQLGEESKARLYRKPTRELYSLYRKIALR 234 >gi|226192530|gb|ACO37545.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 210 bits (535), Expect = 1e-52, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKRYPLTLGQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + R Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|54307438|ref|YP_128458.1| putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium profundum SS9] gi|46911858|emb|CAG18656.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium profundum SS9] Length = 253 Score = 210 bits (535), Expect = 2e-52, Method: Composition-based stats. Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 7/227 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKRLL 62 +Y+I+L S R+E ++ I++ F+FF AI G +P+ + + Q Q R L Sbjct: 6 IYIINLLSSPERKENIANQLNAINVPFTFFPAIDGRKEAHPLFKK-YDDQCSQLYRGRSL 64 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILIKFD 121 + ++GCY SH LW++ A I+LEDDA S EF + + I+ Sbjct: 65 NKGQLGCYASHYLLWEKCAEKGEP-IIVLEDDAIISPIEFKAFCTSATSLNSQFKFIRLH 123 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 ++K + S T +IH+ T GYF+ A LL ++ + P+D+ M Sbjct: 124 KHQRK-RFSSKKATEIDKLEIHRFSKGHMGTIGYFLTPSGAKKLLEHSQHWFLPVDIFMD 182 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRN 228 +W I S T P + D+ +S I ++ R Sbjct: 183 RFWITKIDSYGTIPPCLSHD-DSYESNIGYVEYEKRAKRHLYTKIRR 228 >gi|226192533|gb|ACO37547.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 104/221 (47%), Gaps = 10/221 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICN--RIFSHQKRQ 55 IP+Y+I+L S R+ + ++ +L + FF+AIYG++NP + ++ KR Sbjct: 4 IPIYIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQRYNENKRL 63 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 64 NAKGYPLTLSQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTF 122 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 R K K+ L + GN I+Q TTGY++ +AA L K Y Sbjct: 123 KFFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLT 180 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 +D+ M ++E+ +P P + + + + E+ + + Sbjct: 181 VDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQK 221 >gi|226192545|gb|ACO37555.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 103/221 (46%), Gaps = 10/221 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICN--RIFSHQKRQ 55 IP+Y+I+L S R+ + ++ +L + FF+AIYG++NP + ++ KR Sbjct: 4 IPIYIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQRYNENKRL 63 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 64 NAKGYPLTLSQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTF 122 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 R K K+ L + GN I+Q TGY++ +AA L K Y Sbjct: 123 EFFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLT 180 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 +D+ M ++E+ +P P + + + + E+ + R Sbjct: 181 VDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 221 >gi|329756897|gb|AEC04682.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 209 bits (533), Expect = 2e-52, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 95/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P++VISL S ARRE R + L F F D + G+ N + ++ ++ ++ Sbjct: 6 PIFVISLKNS-ARREIIAQRLNGLKLNFQFIDGVNGKTLSQNELNKVDYTFYPQRFAARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K N ++ Sbjct: 65 PLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 +D K K + TL N+ + R S T Y I A LL + I Sbjct: 125 YD--HGKAKSYCWKKTLVENYRLVHYRKPSKTSKRAIMCATAYLITLSGAQKLLQIAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V++ + +E+ Sbjct: 183 RMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQR 222 >gi|145638325|ref|ZP_01793935.1| LosA [Haemophilus influenzae PittII] gi|145272654|gb|EDK12561.1| LosA [Haemophilus influenzae PittII] Length = 222 Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ +Q ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQQFAVRK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 L++ EIGC +SH++++ + AIILEDDA S EF + L L+K ++ Sbjct: 65 PLTVGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFEHIVLDSLNKVPNTMDILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ K K L + + + RTT Y I +E A LL + I Sbjct: 125 YE--HGKAKTYFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V+ ID+ +E+ Sbjct: 183 RMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQR 222 >gi|226192548|gb|ACO37557.1| Lex2B [Haemophilus influenzae] Length = 247 Score = 209 bits (532), Expect = 4e-52, Method: Composition-based stats. Identities = 56/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICN--RIFSHQKRQC 56 P++VI+L S R+ + ++ +L + FF+AIYG++NP + ++ KR Sbjct: 5 PIFVINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLSQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + + Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQK 221 >gi|229844622|ref|ZP_04464761.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 6P18H1] gi|229812336|gb|EEP48026.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae 6P18H1] Length = 247 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 60/227 (26%), Positives = 105/227 (46%), Gaps = 11/227 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICN--RIFSHQKRQC 56 P++VI+L S R+ + ++ +L + FF+AIYG++NP + ++ KR Sbjct: 5 PIFVINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLSQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 D+ M ++E+ +P P + + + STI E + +K + Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIE-STINEKQKKQKSLKIVI 227 >gi|329756950|gb|AEC04727.1| glucosyltransferase [Pasteurella multocida] gi|329756956|gb|AEC04732.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 60/220 (27%), Positives = 91/220 (41%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + R Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSEEQKKCVDYHFYPETFAANR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 L++ EIGC +SH++++ + AIILEDDA S EF + L L+K ++ Sbjct: 65 PLTIGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFEHIVLDSLNKVPNTMDILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 ++ K K L + + R S RTT Y I +E A LL + I Sbjct: 125 YE--HGKAKTYFCKKNLIEGYRLVHYRTPSKRSKRTITRTTAYLITQEGANKLLKLAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V+ ID+ +E+ Sbjct: 183 RMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQR 222 >gi|309751471|gb|ADO81455.1| Lipooligosaccharide galactosyltransferase LosA2 [Haemophilus influenzae R2866] Length = 222 Score = 209 bits (531), Expect = 5e-52, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ +Q ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQQFAVRK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 L++ EIGC +SH++++ + AIILEDDA S EF + L L+K ++ Sbjct: 65 PLTVGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFEHIVLDSLNKVPNTMDILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ K K L + + + RTT Y I +E A LL + I Sbjct: 125 YE--HGKAKTYFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V+ ID+ +E+ Sbjct: 183 LMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQR 222 >gi|18653266|gb|AAL77342.1|AF443846_3 putative glycosyltransferase WavM [Vibrio cholerae] Length = 207 Score = 208 bits (530), Expect = 7e-52, Method: Composition-based stats. Identities = 53/209 (25%), Positives = 85/209 (40%), Gaps = 7/209 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQ-CQFK 59 + +YVISL S RR + L+F FFDAI G +P + + KR Sbjct: 1 MKIYVISLKNSLDRRASIEQQMTSHGLKFEFFDAIDGRIDPPHPLFANYDYTKRLWLTSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 ++ E+GCY SH +W++ +ILEDDA F L + ++ Sbjct: 61 KMPMRGELGCYASHYLMWQKCVELGEP-IVILEDDAKILPSFINYLNIIRVKTQEYGFLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + +L +F+I Y + E+A L+ + P+D Sbjct: 120 LE--EAYDRSRLFLKEKTNDFEISFLTNNFGGARAYSLSPESAKKLIKGSQRWSMPVDNY 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 M + H++PS V P V E D ++T Sbjct: 178 MGSLYLHDMPSFVFYPSVV-ENPDHFETT 205 >gi|319897747|ref|YP_004135944.1| subname: full=udp-glucose--lipooligosaccharide glucosyltransferase; subname: full=udp-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3031] gi|317433253|emb|CBY81628.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Haemophilus influenzae F3031] Length = 247 Score = 208 bits (529), Expect = 8e-52, Method: Composition-based stats. Identities = 55/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPICN--RIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FF+AIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFNAIYGKSNPNYPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L ++GCY SH +W++ I+LEDDA F + F ++L ++ Sbjct: 65 AKGYPLTLSQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFE 123 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 R K K+ L + GN I+Q TGY++ +AA L K Y + Sbjct: 124 FFWLQPDRLKNKR--KLISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQSKEWYLTV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ M ++E+ +P P + + + + E+ + + Sbjct: 182 DVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQK 221 >gi|307245888|ref|ZP_07527973.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307254861|ref|ZP_07536687.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307259303|ref|ZP_07541032.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|4809236|gb|AAD30156.1|AF143905_1 putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853248|gb|EFM85468.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306862232|gb|EFM94200.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306866670|gb|EFM98529.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 224 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 68/225 (30%), Positives = 96/225 (42%), Gaps = 16/225 (7%) Query: 1 MPI--PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQ 55 M I P+ +ISL S RR R + L F FFDA YG++ + F ++ Sbjct: 1 MKIFPPILIISLKNSS-RRAIISQRLTSLGLSFEFFDAFYGKDLTDEELQKIDFEFYPKK 59 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDIN 114 ++ L+L EIGC +SHI +++ + + AIILEDDA S F ++ L K Sbjct: 60 FDARKPLTLGEIGCALSHIKVYEYMVENNIEQAIILEDDAIVSLYFESIINEALKKVPSR 119 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLN 167 +I D K K ++ LP + + + RI S RTTGYFI A LL Sbjct: 120 KEIIFLD--HGKAKVWPFMRNLPERYRLARYRIPSKNSKRSIIRTTGYFITLTGAKKLLK 177 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 I P D I + EP V+ + IE Sbjct: 178 YAYPIRMPSDFLTGLLQLTGINAYGIEPPCVFGGELSEIDQIENR 222 >gi|52424501|ref|YP_087638.1| hypothetical protein MS0446 [Mannheimia succiniciproducens MBEL55E] gi|52306553|gb|AAU37053.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 240 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 64/226 (28%), Positives = 100/226 (44%), Gaps = 17/226 (7%) Query: 1 MPI---PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKR 54 M I +++ISL S RR+ R + ++F FFDAIYG++ + + Sbjct: 16 MKIELPKIFIISLKNS-PRRDVIAQRFNALGIKFEFFDAIYGKDLSQEELSKIDREFAVK 74 Query: 55 QCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDI 113 + K+ L+L EIGC +SHI +++ I + AII EDDA EF +++ LSK Sbjct: 75 RFSTKKPLTLGEIGCALSHIAVYEHILKNNIEQAIIFEDDAIIHHEFKKIVEETLSKVPS 134 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLL 166 +I F+ K K ++ + + + R S RTT Y I A LL Sbjct: 135 RREIIFFE--HGKAKSWFCKRSIHEGYKLVRYRSPSKNSKRCIFRTTSYLITLSGAKKLL 192 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 N + P D I + EP V+ +D+ + IE+ Sbjct: 193 NHAYPVRMPSDYLTGGLQITQINAYGIEPPCVFCGVDSEINAIEDR 238 >gi|303251083|ref|ZP_07337269.1| hypothetical protein APP6_0382 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252616|ref|ZP_07534510.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302650093|gb|EFL80263.1| hypothetical protein APP6_0382 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306859938|gb|EFM91957.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 224 Score = 207 bits (527), Expect = 1e-51, Method: Composition-based stats. Identities = 65/225 (28%), Positives = 94/225 (41%), Gaps = 16/225 (7%) Query: 1 MPI--PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQ 55 M I P+ +ISL S RR R + L F FFDA YG++ + F ++ Sbjct: 1 MKIFPPILIISLKNSS-RRAIISQRLTSLGLSFEFFDAFYGKDLTNEELQKIDFEFYPKK 59 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDIN 114 ++ L+L EIGC +SHI +++ + + AIILEDDA S F ++ L K Sbjct: 60 FDARKPLTLGEIGCALSHIKVYEYMVENNIEQAIILEDDAIVSLYFESIINEALKKVPSR 119 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLN 167 +I D K K ++ LP + + + + RTTGYFI A LL Sbjct: 120 KEIIFLD--HGKAKVWPFMRNLPERYRLARYCIPSKNSKRSIIRTTGYFITLTGAKKLLK 177 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 I P D I + EP V+ + IE Sbjct: 178 YAYPIRMPSDFLTGLLQLTGINAYGIEPPCVFGGELSEIDQIENR 222 >gi|329756911|gb|AEC04694.1| glucosyltransferase [Pasteurella multocida] gi|329756919|gb|AEC04701.1| glucosyltransferase [Pasteurella multocida] gi|329756926|gb|AEC04707.1| glucosyltransferase [Pasteurella multocida] gi|329756944|gb|AEC04722.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 207 bits (526), Expect = 2e-51, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 90/220 (40%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + R Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSEEQKKCVDYHFYPETFAANR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 L++ EIGC +SH++++ + AIILEDDA S EF + L L+K ++ Sbjct: 65 PLTIGEIGCAMSHLYIYNMMQEQNIDKAIILEDDAIVSQEFEHIVLDSLNKVPNTMDILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 ++ K K L + + R S R T Y I +E A LL + I Sbjct: 125 YE--HGKAKTYFCKKNLIEGYRLVHYRTPSKRSKRTITRATAYLITQEGANKLLKLAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V+ ID+ +E+ Sbjct: 183 RMPADYLTGALQLTGLNAYGVEPPCVFRGIDSEIDAMEQR 222 >gi|15603008|ref|NP_246080.1| LosA [Pasteurella multocida subsp. multocida str. Pm70] gi|12721490|gb|AAK03227.1| LosA [Pasteurella multocida subsp. multocida str. Pm70] gi|329756891|gb|AEC04677.1| glucosyltransferase [Pasteurella multocida] gi|329756904|gb|AEC04688.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 206 bits (525), Expect = 2e-51, Method: Composition-based stats. Identities = 58/220 (26%), Positives = 95/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ ++ ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFYPQRFAARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K N ++ Sbjct: 65 PLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 +D K K + TL N+ + R S T Y I A LL + I Sbjct: 125 YD--HGKAKSYCWKKTLVENYRLVHYRKPSKTSKRAIMCATAYLITLSGAQKLLQIAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V++ + +E+ Sbjct: 183 RMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQR 222 >gi|315633563|ref|ZP_07888853.1| LPS biosynthesis protein [Aggregatibacter segnis ATCC 33393] gi|315477605|gb|EFU68347.1| LPS biosynthesis protein [Aggregatibacter segnis ATCC 33393] Length = 225 Score = 206 bits (524), Expect = 3e-51, Method: Composition-based stats. Identities = 70/219 (31%), Positives = 104/219 (47%), Gaps = 14/219 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKRL 61 +++ISL S RR +R +++ LQF FFDAIYG++ + F + K+ Sbjct: 8 IFIISLVNS-PRRSIIKNRLSKLGLQFEFFDAIYGKDLSKEQLSQIDFQFYPQNFSAKKP 66 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKF 120 L+L EIGC +SHI L++ I + AIILEDDA S F +++ L+K ++ Sbjct: 67 LTLGEIGCAMSHIMLYEHIVKNNIREAIILEDDAIVSLYFEEIVTAALNKVSSKKEILFL 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNIY 173 D K K ++ LP + + + R S RTTGY I E A LL+ I Sbjct: 127 D--HGKAKVYPFMKRLPERYRLAKYRSPSKKSKRSIIRTTGYLITYEGAKKLLSYAYPIR 184 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D NI + EP V+ +D+ +IE Sbjct: 185 MPSDFLTGLLQLTNINAYGIEPACVFGDVDSEIDSIENR 223 >gi|260912783|ref|ZP_05919269.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260633161|gb|EEX51326.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 222 Score = 205 bits (523), Expect = 4e-51, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 95/221 (42%), Gaps = 15/221 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQ 57 MP P++VISL S RR+ R ++L F F D I G+ + + + ++ Sbjct: 4 MP-PIFVISLKGS-PRRDVISQRLNGLNLNFRFIDGINGKELTQDELNKVDYEFYLKRFN 61 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNI 116 ++ L++ E+GC +SH+ +++ I + AIILEDDA S F L+ + K N Sbjct: 62 SRKPLTIGEVGCALSHLSIYEMIVQNKIEKAIILEDDAIVSQVFESLVKDSIRKSPDNTE 121 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 +I +D K K + T+ N+ + R S T Y I A LL V Sbjct: 122 IIFYD--HGKAKSYCWRKTISENYRLVHYRRPSKNSKRVIICATAYLITLSGAQKLLKVA 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 + P D + + EP V++ + +E Sbjct: 180 YPLRMPADYLTGALQLTKLKAYGVEPPCVFKGTISEIDAME 220 >gi|39545765|gb|AAR27965.1| putative LPS biosynthesis protein [Aeromonas hydrophila] Length = 221 Score = 205 bits (523), Expect = 4e-51, Method: Composition-based stats. Identities = 55/220 (25%), Positives = 88/220 (40%), Gaps = 17/220 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN---PICNRIFSHQKRQCQFK 59 IPV+VISL S RR + A + + F F DA G+ + F K C Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGIDFEFVDATDGKALSSAELAKVDFELAKEVC--G 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS EIGC +SHI L++RI ++LEDD ++ + + I+ Sbjct: 60 HALSFGEIGCAMSHIRLYERIVAEGIERCVVLEDDIYLHMHSKAIIEAIVHSCKSEIVF- 118 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 K K+ +L +LPG + + + + T GY + A LL + Sbjct: 119 --LHHGKAKRWPFLSSLPGGYRLARYLTPSRTSKRGVLSTAGYVLSLTGAKKLLQCAYPL 176 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + + EP + +D +TI++ Sbjct: 177 RMPSDYLTGRLQLNGLTASGVEPSCL--DVDLFTTTIDDR 214 >gi|117620908|ref|YP_858651.1| putative lipooligosaccharide biosynthesis protein LpsA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562315|gb|ABK39263.1| putative lipooligosaccharide biosynthesis protein LpsA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 221 Score = 205 bits (523), Expect = 4e-51, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 89/220 (40%), Gaps = 17/220 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IPV+VISL S RR + A + + F F DA G+ N + + K C Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGIDFEFVDATDGKTLSNAELAKVDLALAKEVC--G 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS EIGC +SHI +++R+ ++LEDD ++ + + I+ Sbjct: 60 HALSFGEIGCAMSHIRVYERVVAEGIDRCVVLEDDIYLHMHSKAIIETIVHSCQSEIVF- 118 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 K K+ +L +LPG + + + + T GY + A LL + Sbjct: 119 --LHHGKAKRWPFLSSLPGGYRLARYLTPSRSSKRGVLSTAGYVLTLAGAKKLLQCAYPL 176 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + + EP + +D +TI++ Sbjct: 177 RMPSDYLTGRLQLNGLSASGVEPCCL--DVDLFTTTIDDR 214 >gi|167855172|ref|ZP_02477943.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis 29755] gi|167853717|gb|EDS24960.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis 29755] Length = 224 Score = 205 bits (521), Expect = 6e-51, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQKRQCQFKR 60 +++ISL S RRE R + + L F FFDA+YG++ F + ++ Sbjct: 6 KIFIISLKNS-PRREFISKRLSDLKLNFEFFDAVYGKDLSEEKLKEIDFEFYPKHFSARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 ++L E+GC +SHI L++ I + AIILEDDA S F +++ L+K ++ Sbjct: 65 QMTLGEVGCAMSHIKLYEYIVENNIDNAIILEDDAIVSLYFKEIISDALNKISPKYEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 D K K + L + + + + RTTGY I + A LL I Sbjct: 125 LD--HGKAKIYPFPKNLVERYRLARYISPSKNSKRTIIRTTGYIITQAGAKRLLEYAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D +I + EP V+ + + +E+ Sbjct: 183 RMPSDFLTGSLQMTHIRAYGIEPSCVFGGSHSEINEMEDR 222 >gi|219871212|ref|YP_002475587.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis SH0165] gi|219691416|gb|ACL32639.1| lipooligosaccharide galactosyltransferase I [Haemophilus parasuis SH0165] Length = 224 Score = 204 bits (520), Expect = 9e-51, Method: Composition-based stats. Identities = 61/220 (27%), Positives = 96/220 (43%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFSHQKRQCQFKR 60 ++VISL S RRE R + + L F FFDA+YG++ F + ++ Sbjct: 6 KIFVISLINS-PRREFISKRLSDLKLNFEFFDAVYGKDLSEEQLKEIDFKFYPEKYAARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 ++L E+GC +SHI L++ I + AIILEDDA S F ++ L L+K ++ Sbjct: 65 AMTLGEVGCAMSHIKLYEHIVENNIDNAIILEDDAIVSLYFKEIILDALNKISPKYEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 D K K + L + + + + RTTGY I + A LL I Sbjct: 125 LD--HGKAKIYPFPKNLVERYRLARYISPSKNSKRTIIRTTGYIITQAGAKRLLEYAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D +I + EP V+ + + +E+ Sbjct: 183 RMPSDFLTGSLQMTHIRAYGIEPSCVFGGSHSEINEMEDR 222 >gi|332290185|ref|YP_004421037.1| Glycosyltransferase family 25 [Gallibacterium anatis UMN179] gi|330433081|gb|AEC18140.1| Glycosyltransferase family 25 [Gallibacterium anatis UMN179] Length = 246 Score = 204 bits (519), Expect = 1e-50, Method: Composition-based stats. Identities = 53/236 (22%), Positives = 92/236 (38%), Gaps = 7/236 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 +P+YVI + + R + + + ++F FF AI + P ++ +K + R Sbjct: 2 LPIYVIHIDSATERADSIQQQFDNLKIKFEFFPAINAKKTPNHPLFSHYNAKKHFQRKGR 61 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E+GCY SH WK+ I+LEDD + F + + + + Sbjct: 62 NLSSGELGCYASHYSTWKKCLELNQP-IIVLEDDVTILENFKDIYTNAERIIQKYDFVWL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + D + F I + GY I +AA LL + P+D M Sbjct: 121 HKNHR--SDDKVIVESIDTFSITKFYRDYFCAQGYLITPKAAKQLLAYCEEWIYPVDDQM 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLH 236 ++E+ I + P + + I + +S I + R +K S R + H Sbjct: 179 GRFYENKIENYAVYPACI-DHIASMESLIGDDRRGKK-KLSLTSKIRREYFNLKDH 232 >gi|264676250|ref|YP_003276156.1| glycosyl transferase, family 25 [Comamonas testosteroni CNB-2] gi|262206762|gb|ACY30860.1| glycosyl transferase, family 25 [Comamonas testosteroni CNB-2] Length = 254 Score = 203 bits (517), Expect = 2e-50, Method: Composition-based stats. Identities = 50/252 (19%), Positives = 98/252 (38%), Gaps = 6/252 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQFK 59 +P+ I+L RRE+ + A++ L S A++ + + F + ++ Sbjct: 3 SLPIVFINLSKDAERRERMTAQFAQMGLTASRLPAVWWADLSEAEQKAFFCAPQSHGRYF 62 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 + LS E GCY SH+ W+++ A ++ EDD + Q L + + Sbjct: 63 KPLSNGEKGCYASHLRSWQQLLGGDAPAMVVFEDDVRLLPQLPQALAAIEALPADGSWDM 122 Query: 120 FDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 ++P+K + L G + + + GY I + A +L R RP+D+ Sbjct: 123 IKLYGREPEKIANQGPLVAGALQLISYQRVPSFAAGYVISRSGARKMLETRMPFDRPVDV 182 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 D++ W+E+++ P + + S+I + L R + L + Sbjct: 183 DIRFWFENDLRVYGVYPSVIALDDTSEVSSIWAQKEAPADLAQRL---RKFKMKLELTWG 239 Query: 239 AWRKDLPPVSTT 250 P V+ Sbjct: 240 NSLARTPHVAEV 251 >gi|251793913|ref|YP_003008645.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter aphrophilus NJ8700] gi|247535312|gb|ACS98558.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter aphrophilus NJ8700] Length = 228 Score = 203 bits (516), Expect = 2e-50, Method: Composition-based stats. Identities = 64/217 (29%), Positives = 95/217 (43%), Gaps = 13/217 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFKRLL 62 P+++ISL S RRE R + LQF FFDA+YG+ + + L Sbjct: 9 PIFIISLKNS-PRREFMAKRLNGLGLQFEFFDAVYGKELSEEALSKVDYSFYKTYNPHPL 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCDINNILIKFD 121 +L EIGC SHI +++ +A + AIILEDDA S F +++ + SK + L+ D Sbjct: 68 TLGEIGCATSHIKIYEHMAKNNISSAIILEDDAIVSQFFKEIIKDIISKIHHSYELVFLD 127 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKNIYR 174 K K + L + + R S T Y I A LLN I Sbjct: 128 --HGKVKSYPFKKKLIEGYRLAHYRSPSKNSKRCIIYATAYLITLSGAQKLLNYAYPIRM 185 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 P D + + + EP ++ +DT DS I++ Sbjct: 186 PADYLTGLIQKTKVNAYGVEPPCIFRGLDT-DSEIDQ 221 >gi|145628223|ref|ZP_01784024.1| LosA [Haemophilus influenzae 22.1-21] gi|144979998|gb|EDJ89657.1| LosA [Haemophilus influenzae 22.1-21] Length = 221 Score = 203 bits (516), Expect = 2e-50, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 15/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + N ++ ++ ++ Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSQDELNNIDYTFCPQRFA-RK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 L++ EIGC +SH+ +++ + AIILEDDA S EF ++ L K N ++ Sbjct: 64 PLTVGEIGCAMSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILF 123 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 +D K K + TL N+ + R S TT Y I A LL + I Sbjct: 124 YD--HGKAKSYCWKKTLVKNYRLVHYRKPSKTSKRAIMCTTAYLITLSGAQKLLQIAYPI 181 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V++ + +E+ Sbjct: 182 RMPADYLTGALQLTGLKAYGVEPPCVFKGAISEIDAMEQR 221 >gi|269959343|ref|ZP_06173726.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269835780|gb|EEZ89856.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 240 Score = 203 bits (516), Expect = 3e-50, Method: Composition-based stats. Identities = 55/221 (24%), Positives = 89/221 (40%), Gaps = 6/221 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS-HQKRQCQFKRL 61 + V+V+SL S RRE+ + + ++F FFDA+ G + + S Q + Sbjct: 1 MKVFVVSLARSLGRRERIEEQLKQQGIEFEFFDAVDGSIDGFLHSEKSCPAITQRRKGYQ 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ C+ SH LW R IILED+ D + Q+L H IK Sbjct: 61 LKTSEVACFSSHYELWVRCTELDEP-IIILEDNVDPVEGLKQVLAHTLPLTSTYDYIKLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A +K+ + + + + + + TT Y + +AA + + P+D M+ Sbjct: 120 ATQKRA--FTPIIPIDDSHMLGGYSAGTCGTTAYIVTPKAASKFVKHAQEFIEPVDDYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 W HN+ + P A STI R + P Sbjct: 178 KPWRHNVQTYSVAPDLFTRAQIA--STIGAKRKDKSKMTVP 216 >gi|90414920|ref|ZP_01222884.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] gi|90323976|gb|EAS40572.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] Length = 249 Score = 202 bits (515), Expect = 3e-50, Method: Composition-based stats. Identities = 56/236 (23%), Positives = 93/236 (39%), Gaps = 11/236 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-ICNRIFSHQKRQCQFK 59 M +Y+I+L S R+ + + L ++ F AI G +P ++ + Q Sbjct: 1 MTCNIYIINLASSLDRKVNITKQLDNMSLPYTIFSAIDGRKDPHPLFDMYDDKLSQQYRG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILI 118 + LS ++GCY SH LW A II+EDDA F + H +K + Sbjct: 61 KSLSKGQLGCYASHFLLWGECVRINAP-IIIIEDDALIDPVPFLDFIKHNTKIAAKYKCV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + ++K S + +I + TTGY + EAA L+ + Y +D+ Sbjct: 120 RLFDNKRKNFCFSPKEQF-DSLNICKFNKGHMSTTGYLLTPEAARKLIMHSQRWYMAVDI 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 M +W + + T P + DS I + SP F+ +W Sbjct: 179 YMDRFWVNEVECFGTSPACLTNDP-IFDSEIGYGKK------SPRSFFSRVKREWF 227 >gi|307247978|ref|ZP_07530009.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307261465|ref|ZP_07543136.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|307263648|ref|ZP_07545259.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|306855532|gb|EFM87704.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306868860|gb|EFN00666.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306871025|gb|EFN02758.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 222 Score = 202 bits (515), Expect = 3e-50, Method: Composition-based stats. Identities = 63/224 (28%), Positives = 94/224 (41%), Gaps = 15/224 (6%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 1 MPIPPIFVISLANSV-RRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 60 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 119 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNV 168 +I FD K K L + + + R S TGY++ E A LLN Sbjct: 120 QIIFFD--HGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMIATGYYLTLEGAKLLLNH 177 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 + P D I + EP ++ + + IE+ Sbjct: 178 AYPLRMPADFLTGAIQMTGIKAYGVEPPCIFISPGSEIDEIEKR 221 >gi|89076364|ref|ZP_01162697.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium sp. SKA34] gi|89047935|gb|EAR53526.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Photobacterium sp. SKA34] Length = 242 Score = 202 bits (514), Expect = 4e-50, Method: Composition-based stats. Identities = 63/249 (25%), Positives = 96/249 (38%), Gaps = 17/249 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF--KR 60 + ++VI+LP S RR + I + F FF AI G + + + R Sbjct: 1 MKIFVINLPSSSDRRPFISKKMEDIGIDFEFFSAIDGRKGLPQDLANCPDDKHRIYFRSR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN-NILIK 119 LS E GCY SH LW++ +ILEDD + F Q+ L + ++ Sbjct: 61 PLSSGEKGCYASHYRLWQKCIELQQP-IVILEDDCIPTSFFKQIHTKLPELHNEGYEYLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + K L + I + T GY I A L+ K P+D Sbjct: 120 LQGQQGQFKVIEQLS----DLQIVLWKENLIGTVGYSISPAGAKKLIAKSKKWRCPVDNF 175 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE------ESRLVRKPTFSPLYFYRNTCYQW 233 + + N+ +P A+ E D STI+ E L+ K YRN + Sbjct: 176 IGESYRTNLLCTGIQPYAI-EHDDNQTSTIQPTSAKSEVSLIYKSFREIYRSYRNI--RM 232 Query: 234 NLHYNAWRK 242 LHY+ +K Sbjct: 233 TLHYHLTKK 241 >gi|156972987|ref|YP_001443894.1| glycosyl transferase [Vibrio harveyi ATCC BAA-1116] gi|156524581|gb|ABU69667.1| hypothetical protein VIBHAR_00665 [Vibrio harveyi ATCC BAA-1116] Length = 241 Score = 202 bits (514), Expect = 4e-50, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 97/241 (40%), Gaps = 9/241 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + +VISL + RR + A + F FFDAI G +P+ ++ ++ +KR Sbjct: 1 MRTFVISLESAVERRAHIEAQFAGVEQDFEFFDAIDGRKGEHPLFSK-YNVEKRLRVKGY 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 ++ E+GC+ SH LW++ +++EDDA + F + ++++ + F Sbjct: 60 GMTPGELGCFASHYLLWEKCLELNEP-IVVIEDDAQLEECFDDSMKNINELQPYGYVRLF 118 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 RK+P + + T YF+ AA + + P+D M Sbjct: 119 VNGRKRPFVKIGSYKSHD---VVEYLRGPGATRSYFVTPMAAKKFIESAQEWLLPVDDYM 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 +W + + PG V D DS++ + K Y C W ++ + Sbjct: 176 DQFWSNEVACRGIMPGIVKNETD-FDSSVGKLEKSNKMNRLTRELYSIQCALWRK-WHLF 233 Query: 241 R 241 R Sbjct: 234 R 234 >gi|165976409|ref|YP_001652002.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|190150305|ref|YP_001968830.1| LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|303253576|ref|ZP_07339714.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307250230|ref|ZP_07532186.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|165876510|gb|ABY69558.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|189915436|gb|ACE61688.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|302647496|gb|EFL77714.1| lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306857759|gb|EFM89859.1| Lipooligosaccharide galactosyltransferase I [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 227 Score = 202 bits (514), Expect = 4e-50, Method: Composition-based stats. Identities = 63/224 (28%), Positives = 94/224 (41%), Gaps = 15/224 (6%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 6 MPIPPIFVISLANSV-RRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 65 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 124 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNV 168 +I FD K K L + + + R S TGY++ E A LLN Sbjct: 125 QIIFFD--HGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMIATGYYLTLEGAKLLLNH 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 + P D I + EP ++ + + IE+ Sbjct: 183 AYPLRMPADFLTGAIQMTGIKAYGVEPPCIFISPGSEIDEIEKR 226 >gi|145297203|ref|YP_001140044.1| LPS biosynthesis glycosyltransferase [Aeromonas salmonicida subsp. salmonicida A449] gi|142849975|gb|ABO88296.1| LPS biosynthesis glycosyltransferase [Aeromonas salmonicida subsp. salmonicida A449] gi|224995183|gb|ACN76671.1| WasB [Aeromonas salmonicida] Length = 227 Score = 202 bits (513), Expect = 5e-50, Method: Composition-based stats. Identities = 53/221 (23%), Positives = 94/221 (42%), Gaps = 17/221 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFK 59 IP++VISL S ARR + + L+F+FFDA+ G+ + + + F + C Sbjct: 2 IPIFVISLSRSQARRAMLIKQMKHLGLEFTFFDAVDGKALSDAELQHVDFPLARETC--G 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 LS+ E+GC +SHI L++ + ++LEDD F+ ++ + I+ Sbjct: 60 HDLSMGEVGCAMSHIRLYEMMVEKNIERCVVLEDDIYLHMHFNAIMESAIAKSHSEIIFL 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLNVRKNI 172 + K K ++ LP + + + R S T GY + A LL I Sbjct: 120 Y---HGKAKCWPWMKPLPEGYRLARYRTPSCSSKRGIISTAGYILTLSGAKKLLRKAYPI 176 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 P D + + + EP + + ++TI++ Sbjct: 177 RMPSDYLTGRLQLNGLTASGVEPSCM--DVGLFETTIDDRN 215 >gi|262276308|ref|ZP_06054117.1| putative glycosyltransferase WavM [Grimontia hollisae CIP 101886] gi|262220116|gb|EEY71432.1| putative glycosyltransferase WavM [Grimontia hollisae CIP 101886] Length = 247 Score = 202 bits (513), Expect = 5e-50, Method: Composition-based stats. Identities = 64/246 (26%), Positives = 99/246 (40%), Gaps = 10/246 (4%) Query: 1 MPI-PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-NNPICNRIFSHQKRQ-CQ 57 M + VYVISL S RREK A ++ F+FFDA+ G P + ++KR Sbjct: 1 MSVSNVYVISLKRSVERREKIAKEMAEKNIAFTFFDAVDGHLGEPELAADYDYRKRLWLT 60 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 + S E+GCY SH LW + II EDD + + ++ + Sbjct: 61 SGKTPSKGELGCYASHYALWLKCIELGEP-IIICEDDIELHENAKSIVEYALMAANEYGF 119 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 I+ ++ K + G F +H R Y I +AA L+ P+D Sbjct: 120 IRMQSVCKGSATTEVI--KDGEFSLHLMRDNYGGLCAYAISPKAASRLIKH--RWCFPVD 175 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY-QWNLH 236 + + H S EP + + I STI+ ++ + P + L T Y + L Sbjct: 176 CFVGANYIHGQYSYQLEPCFMADHIGNE-STIQFEKMEKPPFYRKLSRELYTLYKKIMLS 234 Query: 237 YNAWRK 242 W+K Sbjct: 235 IMYWKK 240 >gi|148976943|ref|ZP_01813598.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrionales bacterium SWAT-3] gi|145963817|gb|EDK29077.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrionales bacterium SWAT-3] Length = 250 Score = 202 bits (513), Expect = 6e-50, Method: Composition-based stats. Identities = 56/234 (23%), Positives = 94/234 (40%), Gaps = 11/234 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKRL 61 ++VI+L S RRE H+ + L F F AI G ++P I ++ Q + Sbjct: 3 KIFVINLESSTERRENISHQLDELALPFELFSAIDGRSSPHHPLIGRYNDGLSQTYRAKT 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA-DFSDEFSQLLPHLSKCDINNILIKF 120 LS ++GCY SH LW++ I++EDDA F EF L + + + ++ Sbjct: 63 LSAGQLGCYASHYLLWQKCVELNQP-IIVIEDDALIFKKEFLNFLQDIPEIPESIECVRL 121 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + K+ K DSY + IH+ T YF+ A LL K Y +D+ M Sbjct: 122 FS-NKRRKYDSYEVFEATSTSIHKFTKGHMSATAYFLRPSGAKKLLEHSKEWYMAVDIYM 180 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 +W++ + T + + + R + + +W Sbjct: 181 DRFWQNEVECYGTAAPCLTNDPKFDSDIGYDKRTNTR------SLLKKCKREWF 228 >gi|325579204|ref|ZP_08149160.1| glycosyltransferase involved in LPS biosynthesis [Haemophilus parainfluenzae ATCC 33392] gi|325159439|gb|EGC71573.1| glycosyltransferase involved in LPS biosynthesis [Haemophilus parainfluenzae ATCC 33392] Length = 225 Score = 201 bits (512), Expect = 7e-50, Method: Composition-based stats. Identities = 66/223 (29%), Positives = 102/223 (45%), Gaps = 16/223 (7%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQC 56 M IP + VISL S RRE R + + L FSFFDA G+ P + + + + Sbjct: 1 MAIPKIIVISLKHST-RRENIAKRLSGLGLDFSFFDATDGKKLPASVLESVDYDFYPKHY 59 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 + L+L EIGC ISHI +++ + + AIILEDDA S F +++ L+K + N+ Sbjct: 60 LSPKPLTLGEIGCAISHIKVYEYMVENNIKSAIILEDDAIVSQHFKEIVEDTLNKINKNH 119 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNV 168 LI FD K K + + + + + S T Y I A LLN Sbjct: 120 ELIFFD--HGKVKSHFFKKRIVEGYRLAHYKAPSKNSRRCIIYATAYLITLSGAKKLLNY 177 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 I P D + + + EP ++ +++ DS I++ Sbjct: 178 AYPIRLPADYLTGLIQKTRVDTYGIEPPCIFRGLNS-DSEIDK 219 >gi|91226583|ref|ZP_01261307.1| putative Lex2B [Vibrio alginolyticus 12G01] gi|91189057|gb|EAS75339.1| putative Lex2B [Vibrio alginolyticus 12G01] Length = 240 Score = 201 bits (512), Expect = 7e-50, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 85/217 (39%), Gaps = 6/217 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-RQCQFKRL 61 + V+V+SL S RRE+ + + + F FFDA+ G + + S + Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQDITFEFFDAVDGSVDRFLHSEKSRPTITLRRKGYQ 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ C+ SH LW R +ILED+ D + Q+L H N IK Sbjct: 61 LKTSEVACFASHYELWVRCVELDEP-IVILEDNVDPVEGLKQVLEHTLPLTSNYDYIKLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A +K+ + + + N + + TT Y + +AA + + P+D M+ Sbjct: 120 ATQKR--VFTPIVPIDDNHMLGGYSAGTCGTTAYIVTPKAASKFVMHAQEFIEPVDDYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 W H + P A STI R + Sbjct: 178 KPWRHGVQVYSVSPDLFTRAEIA--STIGGKRKDKSK 212 >gi|168188181|gb|ACA14476.1| WahB [Aeromonas hydrophila] Length = 221 Score = 201 bits (512), Expect = 8e-50, Method: Composition-based stats. Identities = 52/218 (23%), Positives = 85/218 (38%), Gaps = 13/218 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 IPV+VISL S RR + A + + F F DA G+ + + Sbjct: 2 IPVFVISLARSVERRAMVVQQMAHLGINFEFVDATDGKALSSAELAKVDLELAKEVCGHE 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS EIGC +SHI +++RI + LEDD ++ + + + I+ Sbjct: 62 LSFGEIGCAMSHIRVYERIVAEGIDRCVALEDDIYLHMHSKAIIETIVRSCHSEIVF--- 118 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 K K+ L +LPG + + + R T GY + A LL + Sbjct: 119 LHHGKAKRWPILSSLPGGYRLARYLAPSRTSRRGILSTAGYVLTLAGAKKLLQCAYPLRM 178 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + + EP + +D +TI++ Sbjct: 179 PSDYLTGRLQLNGLAASGVEPCCL--DVDLFTTTIDDR 214 >gi|329756963|gb|AEC04738.1| glucosyltransferase [Pasteurella multocida] Length = 222 Score = 201 bits (511), Expect = 8e-50, Method: Composition-based stats. Identities = 61/220 (27%), Positives = 92/220 (41%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFSHQKRQCQFKR 60 P++VISL S RR+ R + L F F D I G+ + + + R Sbjct: 6 PIFVISLKNS-PRRDIIAQRLNGLKLNFKFIDGINGKTLSEEQKKCVDYQFYPKTFAANR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIK 119 L++ EIGC +SH+ ++ + AIILEDDA S EF + L L K + ++ Sbjct: 65 PLTIGEIGCAMSHLFIYNMMQDEDIAKAIILEDDAIVSQEFEHIILDSLKKVPKSVDILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNI 172 ++ K K L + + + R S R T YFI KEAA LL + I Sbjct: 125 YE--HGKAKSYFCKKKLIEGYRLVRYRSPSKHSKRTITRATAYFITKEAASKLLKLAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D + + EP V+ +D+ +E Sbjct: 183 RMPADYLTGALQLTGLNAYGVEPPCVFRGVDSEIDAMEHR 222 >gi|153834850|ref|ZP_01987517.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Vibrio harveyi HY01] gi|148868721|gb|EDL67798.1| UDP-glucose--lipooligosaccharide glucosyltransferase [Vibrio harveyi HY01] Length = 241 Score = 201 bits (511), Expect = 9e-50, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 97/241 (40%), Gaps = 9/241 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + +VISL + RR + A + F FFDAI G +P+ ++ ++ +KR Sbjct: 1 MRTFVISLESAVERRAHIEAQFAGVEQDFEFFDAIDGRKGEHPLFSK-YNVEKRLRVKGY 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 ++ E+GC+ SH LW++ +++EDDA + F + ++++ + F Sbjct: 60 GMTPGELGCFASHYLLWEKCLELNEP-IVVIEDDAQLEECFDDSMKNINELQPYGYVRLF 118 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 RK+P + + T YF+ AA + + P+D M Sbjct: 119 VNGRKRPFVKIGSYKSHD---VVEYLRGPGATRSYFVTPMAAKRFIESAQEWLLPVDDYM 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 +W + + PG V D DS++ + K Y C W ++ + Sbjct: 176 DQFWSNEVACRGIMPGIVKNETD-FDSSVGKLEKSNKMNRLTRELYSIQCALWRK-WHLF 233 Query: 241 R 241 R Sbjct: 234 R 234 >gi|119470917|ref|ZP_01613501.1| glycosyl transferase, family 25 [Alteromonadales bacterium TW-7] gi|119445939|gb|EAW27219.1| glycosyl transferase, family 25 [Alteromonadales bacterium TW-7] Length = 265 Score = 201 bits (511), Expect = 1e-49, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 90/228 (39%), Gaps = 5/228 (2%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKRLLS 63 +VI+L S R + AR++L F DA+ G + +S Q + Q+ LS Sbjct: 5 FVINLARSTQRLANTQTQLARVNLPFERIDAVDGAQLSTLQKYLHYSPQLNKQQYHYPLS 64 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 +IGCY+SH W++I AI+LEDD D + ++ + + LIK A Sbjct: 65 NGQIGCYLSHRKAWQKIVDEKLKYAIVLEDDFYIDDSIHNAIKNIEQLNQPWQLIKLAAY 124 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 + + +Y L + + + L I E A LL RP+D D++H Sbjct: 125 ENRTRPIAYQQNLNNHQRLVIHKKLMTGCCATAISYEGAKQLLKATAQFGRPVDCDLQHI 184 Query: 184 WEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 WE ++ P V + D + K + R Sbjct: 185 WETHVNGFSLMPYPVSQNTDIQSDI---ASCATKTKVKKAFLRRKAQQ 229 >gi|149909363|ref|ZP_01898019.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Moritella sp. PE36] gi|149807680|gb|EDM67628.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Moritella sp. PE36] Length = 246 Score = 200 bits (510), Expect = 1e-49, Method: Composition-based stats. Identities = 50/223 (22%), Positives = 90/223 (40%), Gaps = 5/223 (2%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRLLS 63 ++V++L S R+E + + +++ F A+ G + + + Q + L+ Sbjct: 4 IFVVNLEHSVERKEYISAQLDSMPVKYEMFPAVNGHESRLSILDMYDDGLSQAYRSKSLT 63 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNILIKFDA 122 +IGCY SH LW++ I++EDD + F + + + D ++ A Sbjct: 64 KGQIGCYASHYLLWQKCVELNKP-IIVIEDDIAIDKKRFIEFYDLVPELDSKYEFVRLSA 122 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 R++ + + + T+ GN IH+ T GYF+ A L Y +D+ M Sbjct: 123 HRRRKARLTTIETI-GNLSIHRASKGHMSTMGYFLTPNGAKKFLKYSTAWYMAVDIYMDR 181 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 +W H EP V E I S I + + L Sbjct: 182 FWIHGAEPFGLEPPCVSEDIHFE-SDIGYEKNASRSLSGRLRR 223 >gi|222109426|ref|YP_002551690.1| glycosyl transferase family 25 [Acidovorax ebreus TPSY] gi|221728870|gb|ACM31690.1| glycosyl transferase family 25 [Acidovorax ebreus TPSY] Length = 252 Score = 200 bits (508), Expect = 2e-49, Method: Composition-based stats. Identities = 58/252 (23%), Positives = 103/252 (40%), Gaps = 10/252 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFK 59 IP+ I+L R+++ + +R+ L+ A+ + FS + Q+ Sbjct: 3 NIPIVYINLAKDTERQQRMQAQFSRLGLETFRLPAVRWSELSAEAQAHHFSAELNSHQYF 62 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN-NILI 118 + + E GCY SHI W+++ S A + EDD + L +S + ++ Sbjct: 63 KPMGSGEKGCYCSHITAWRQLLDSAAAAMAVFEDDVRLLPAMPEALRAISTLPADCWDMV 122 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K ++ +K + L G+ ++ R + GY I + A +L R+ RP+D+ Sbjct: 123 KL--FGREREKIASRRPLVGSLELITYRRVPSFAAGYVISRSGARKMLESRQPFGRPVDV 180 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 DM+ W+E+ + P V + S+I R R L R + L + Sbjct: 181 DMRFWFENGMRVFGVHPSVVALDDTSKVSSIWPDREPRALIMQRL---RKLKMKMELIWG 237 Query: 239 AWR--KDLPPVS 248 R D P VS Sbjct: 238 NARHESDAPHVS 249 >gi|269103933|ref|ZP_06156630.1| putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium damselae subsp. damselae CIP 102761] gi|268163831|gb|EEZ42327.1| putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium damselae subsp. damselae CIP 102761] Length = 249 Score = 199 bits (507), Expect = 3e-49, Method: Composition-based stats. Identities = 55/234 (23%), Positives = 99/234 (42%), Gaps = 8/234 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRL 61 I ++VI++P S R+E + +++L F F+A+ G +++ ++ Q + Sbjct: 2 IDIFVINIPSSVERKENITQQLVKLNLDFKLFEAVNGHKDSSPLFNLYDEALSQQYRGKS 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPHLSKCDINNILIKF 120 LS ++GCY SH LW++ IILEDDA S+ F ++ + SK + I+ Sbjct: 62 LSKGQLGCYASHYLLWQKCVEINKP-IIILEDDALIYSEPFLDIVENASKLAEHFECIRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 ++K + + LP +H+ TGYF+ A L + Y +D+ M Sbjct: 121 FDNKRKTFRHAKEYPLPHT-AVHKFSKGHMSATGYFLTPTGAKKFLAHSQAWYLAVDIYM 179 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 +W +++ T P + DS I KP + + Sbjct: 180 DRFWINDVNVHGTVPACLSNDP-KFDSDIGYG---AKPKRDLYCRLKREVFNLA 229 >gi|33152738|ref|NP_874091.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi 35000HP] gi|1388151|gb|AAB49623.1| LOS biosynthesis enzyme LBGA [Haemophilus ducreyi] gi|2198846|gb|AAC45592.1| LosA [Haemophilus ducreyi] gi|33148962|gb|AAP96480.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi 35000HP] Length = 224 Score = 199 bits (507), Expect = 3e-49, Method: Composition-based stats. Identities = 62/220 (28%), Positives = 94/220 (42%), Gaps = 14/220 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKR 60 P+++ISL S R+ R + L F+FFDAIYG+ + + ++ ++ Sbjct: 6 PIFIISLKDS-PRKNIIAERLNALGLSFNFFDAIYGKELSKKQLSEINYDFYIQKFNARK 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 L+L EIGC ISHI L++ I + AIILEDDA S F ++ L K ++ Sbjct: 65 RLTLGEIGCAISHIKLYEYIVKNNINEAIILEDDAIVSIYFKDIICDALKKISCKYEILF 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNI 172 D K K + +L + + + + + RTT Y I A LLN I Sbjct: 125 LD--HGKAKIFPFTRSLVERYRLAKYISPSKKSKRIIIRTTAYIITLNGARKLLNYAYPI 182 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 P D I + EP V+ + IE+ Sbjct: 183 RMPSDYLTGTLQLTGINAYGIEPPCVFGGPISEIDQIEDR 222 >gi|328471783|gb|EGF42660.1| glycosyl transferase family protein [Vibrio parahaemolyticus 10329] Length = 240 Score = 199 bits (505), Expect = 4e-49, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 89/217 (41%), Gaps = 6/217 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQ-KRQCQFKRL 61 + V+V+SL S RRE+ + + ++F FFDA+ G + + S Q + Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQGIEFEFFDAVDGSIDGFLHSEKSRPVITQRRKGYQ 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ C+ SH LW R IILED+ D + Q+L + IK Sbjct: 61 LKTSEVACFSSHYELWVRCTELDEP-IIILEDNVDPVEGLKQVLAYTLPLTSTYDYIKLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A +K+ + + ++ + + + TT Y + +AA + + P+D M+ Sbjct: 120 ATQKRA--FTPIISIDDSHMLGGYSAGTCGTTAYIVTPKAASKFVKHAQEFIEPVDDYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 W HN+ + P A STI R + Sbjct: 178 KPWRHNVQTYSVAPDLFTRAQIA--STIGAKRKDKSK 212 >gi|239831175|ref|ZP_04679504.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] gi|239823442|gb|EEQ95010.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] Length = 255 Score = 199 bits (505), Expect = 5e-49, Method: Composition-based stats. Identities = 48/232 (20%), Positives = 100/232 (43%), Gaps = 10/232 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I L + R+ + ++ ++ +A+ G + +R++ + ++ Sbjct: 18 VKAFIIHLERATDRQPQVEALIRKLPIETDVVNAVDGRTLDAQTISRVYRRSAHKPRYPF 77 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS EI C++SH W+ I ++LEDD + + EF+ + + I+F Sbjct: 78 QLSTNEIACFLSHRKAWQAIVDQGLDAGLVLEDDVELTPEFAAAYSASCQLLKAHSFIRF 137 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + ++ + I +P + +G+EAA LL+ + RP+D Sbjct: 138 PF---RERESGRVVLTTEGVRIIEPVPVGLGMVAQLVGREAAQRLLSATEVFDRPVDTTA 194 Query: 181 KHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYR 227 + W + L PG V E ++ STI++SR +++ PLY ++ Sbjct: 195 QMNWVTGLKPLSVLPGGVKEISVQLGGSTIQKSRSLPDKLKREILRPLYRWK 246 >gi|260877887|ref|ZP_05890242.1| glycosyltransferase, family 25 [Vibrio parahaemolyticus AN-5034] gi|193787963|dbj|BAG50472.1| putative Lex2B [Vibrio parahaemolyticus] gi|308089865|gb|EFO39560.1| glycosyltransferase, family 25 [Vibrio parahaemolyticus AN-5034] Length = 240 Score = 198 bits (504), Expect = 5e-49, Method: Composition-based stats. Identities = 54/217 (24%), Positives = 88/217 (40%), Gaps = 6/217 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK-RQCQFKRL 61 + V+V+SL S RRE+ + + ++F FFDA+ G + + S Q + Sbjct: 1 MKVFVVSLARSLDRRERIEEKLKQQGIEFDFFDAVDGSIDGFLHSEKSRPAITQRRKGYQ 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ C+ SH LW R IILED+ D + Q+L H IK Sbjct: 61 LKTSEVACFSSHYELWVRCTELDEP-IIILEDNVDPVEGLKQVLAHTLPLTSTYDYIKLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A +K+ + + + + + + TT Y + +AA + + P+D M+ Sbjct: 120 ATQKR--VFTPIIPIDDSHMLGGYSAGTCGTTAYIVTPKAASKFVKHAQEFIEPVDDYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 W HN+ + P A STI R + Sbjct: 178 KPWRHNVQTYSVAPDLFTRAQIA--STIGAKRKDKSK 212 >gi|149189214|ref|ZP_01867501.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio shilonii AK1] gi|148836968|gb|EDL53918.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio shilonii AK1] Length = 258 Score = 197 bits (502), Expect = 1e-48, Method: Composition-based stats. Identities = 60/236 (25%), Positives = 95/236 (40%), Gaps = 13/236 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 + ++VISLP + RRE+ + + QF+FFDA+ G R + R+ R Sbjct: 12 MRIFVISLPDATERRERVKRIFSETNYQFTFFDAVNGRKGLPERLERRVNDTYRKVFRSR 71 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E GC+ SH+ LW++ ++LEDD ++ F L + ++ Sbjct: 72 PLSPGEKGCFASHLLLWEKCLELDEP-IVLLEDDFLPTESFENAFSQLPLLIKDIEYLRL 130 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + K+ K + G F+ S TTGY + + A LL + +D + Sbjct: 131 E---KRTSKWFPI-NSVGGFERRFLFDNSCGTTGYVLTPQGAQKLLQHSEEWLCAVDNYI 186 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP-----TFSPLYFYRNTCY 231 + H + S P A+ EA S I+ R P T FYR Sbjct: 187 SESFRHGMYSFNISPPAI-EAPHDMGSLIQCEEKTRVPLYFKLTRELNRFYRFVRM 241 >gi|329756884|gb|AEC04671.1| glucosyltransferase [Pasteurella multocida] Length = 227 Score = 197 bits (502), Expect = 1e-48, Method: Composition-based stats. Identities = 59/208 (28%), Positives = 93/208 (44%), Gaps = 13/208 (6%) Query: 16 RREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFKRLLSLPEIGCYIS 72 RR R + + L++ FFDA+YG++ + F + ++ L+L EIGC +S Sbjct: 20 RRPVISQRLSGLGLEYQFFDAVYGKDLTEEQLKEXDFEFYPKTYNARKALTLGEIGCAMS 79 Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALRKKPKKDS 131 HI++++ I + AIILEDDA S F ++L L K +I FD K K Sbjct: 80 HINVYEYIVKNNIDQAIILEDDAVVSLYFEEILASVLKKVPQRREIIFFD--HGKAKVYP 137 Query: 132 YLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 ++ +LP + + + R S RTT Y I ++ A LL + P D Sbjct: 138 FMRSLPERYRLARYRSPSKDSKRTIIRTTAYLITQQGAKKLLKHAYPVRMPSDFLTGSLQ 197 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEES 212 I + EP V+ + + IE Sbjct: 198 LTGIHAYGVEPACVFGDVISEIDQIENR 225 >gi|293390962|ref|ZP_06635296.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951496|gb|EFE01615.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D7S-1] Length = 230 Score = 197 bits (501), Expect = 1e-48, Method: Composition-based stats. Identities = 64/220 (29%), Positives = 98/220 (44%), Gaps = 13/220 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLL 62 P++VISL S RRE R + LQF FFDA+YG+ +Q Q + L Sbjct: 9 PIFVISLKNS-PRREFIAKRLNGLGLQFEFFDAVYGKALSEEELAKVDYQYYQDFDNKRL 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIKFD 121 +L EIGC +SHI +++ I + AIILEDDA S F +L + K ++ FD Sbjct: 68 TLGEIGCALSHIQVYEHIKKNNIAEAIILEDDAIVSTHFKAILQAAIEKLPSRYEILFFD 127 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNIYR 174 + K C LP + + + R S + T Y + + LL + Sbjct: 128 HGKAKSYPLIKKC-LPEGYKLVRYRYPSKNSRRSIMKATAYMVNQAGVEKLLKYAYPLRM 186 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEA--IDTNDSTIEES 212 P D + I + EP V+E +++ ++IE+ Sbjct: 187 PADFVTGFIQKTRIHAYGVEPSCVFEGLAVESEINSIEDR 226 >gi|262273482|ref|ZP_06051296.1| beta-1,4-galactosyltransferase [Grimontia hollisae CIP 101886] gi|262222460|gb|EEY73771.1| beta-1,4-galactosyltransferase [Grimontia hollisae CIP 101886] Length = 246 Score = 197 bits (500), Expect = 2e-48, Method: Composition-based stats. Identities = 55/243 (22%), Positives = 98/243 (40%), Gaps = 9/243 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF---K 59 + + ++SL S RR+K + + L F FFDA+ G+N ++IF H + + Sbjct: 1 MRIIIVSLKHSLERRKKIEKTLSALKLPFCFFDAVDGKNEH--HKIFDHYNNKKRVFIKG 58 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 L E+GC+ SH LW+R ++LEDD + D+F + + + + Sbjct: 59 YPLKPGELGCFASHYSLWERCVELNEP-LLVLEDDIELEDDFLNVYKKMHLLAESIPYFR 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + K + D+ T GY I EAA ++ N Y P+D Sbjct: 118 IGRVL--DAKYKIFKQIDDCHDLVLYTKPVRSTQGYMITPEAAKKFIDAAYNWYEPVDDF 175 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY-FYRNTCYQWNLHYN 238 M+ W H + + EP + + + E ++ + YR+ L Y+ Sbjct: 176 MEKEWLHGVLNFGIEPPLIRHDFNIHSEIGERAKPYIPLNVRVIRELYRSYESIRKLCYS 235 Query: 239 AWR 241 ++ Sbjct: 236 FYK 238 >gi|90414922|ref|ZP_01222886.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] gi|90323978|gb|EAS40574.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Photobacterium profundum 3TCK] Length = 247 Score = 197 bits (500), Expect = 2e-48, Method: Composition-based stats. Identities = 60/235 (25%), Positives = 92/235 (39%), Gaps = 12/235 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRL 61 + ++V+SL S RR+ L F F D I G + +P + Q + R Sbjct: 1 MRIFVLSLVDSTERRKNVSSLLNEYQLDFEFIDGIDGRKKDPTLLAQYDEQAFIYNYGRK 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 ++ E+GCY SH+ WK+ II EDD Q + + + I+ + Sbjct: 61 AAMGELGCYASHMLAWKKCVELDEP-IIIFEDDFRLKQGIHQAIDECKEVISDFHFIRLE 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 KP+ Y + + + T Y I AA LL NI P+D+ ++ Sbjct: 120 DGNIKPQ---YKVKAQNDMTVFNYLKVPQCATCYAISPVAARKLLAKSNNIAFPVDVFIR 176 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEES-RLVRKPTFSPL----YFYRNTCY 231 + W H P EP V D S I + R +K F+ L Y RN + Sbjct: 177 NVWVHRQPIFSLEPFFVTTGTD--ISIIGDRKRNQKKSLFTRLMCIGYKIRNVTF 229 >gi|153007760|ref|YP_001368975.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC 49188] gi|151559648|gb|ABS13146.1| glycosyl transferase family 25 [Ochrobactrum anthropi ATCC 49188] Length = 255 Score = 196 bits (499), Expect = 2e-48, Method: Composition-based stats. Identities = 49/232 (21%), Positives = 100/232 (43%), Gaps = 10/232 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I L + R+ + R+ ++ +A+ G + +R++ + ++ Sbjct: 18 VKAFIIHLERATDRQPQVEELVRRLPVETDVVNAVDGRTLDAQTISRVYRRSAHKPRYPF 77 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS EI C++SH W+ I ++LEDD + + EF+ + + I+F Sbjct: 78 QLSTNEIACFLSHRKAWQAIVDQELDAGLVLEDDVELTPEFAAAYSAACQILKADSFIRF 137 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + ++ + I +P + +G+EAA LL+ + RP+D Sbjct: 138 PF---RERESGRVVLNTETLRIIEPVPVGLGMVAQLVGREAAQRLLSATEVFDRPVDTTA 194 Query: 181 KHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYR 227 + W + L PG V E ++ STI++SR +++ PLY ++ Sbjct: 195 QMNWVTGLKPLSVLPGGVKEISVQLGGSTIQKSRSLPDKLKREILRPLYRWK 246 >gi|148832478|gb|ABR14155.1| LosA [Haemophilus influenzae] Length = 225 Score = 195 bits (496), Expect = 5e-48, Method: Composition-based stats. Identities = 66/222 (29%), Positives = 93/222 (41%), Gaps = 16/222 (7%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQCQFKR 60 + VISL S RR R + L+F FFDA YG+ P + + + + + Sbjct: 5 KIIVISLKNSS-RRAIISKRLLGLGLEFEFFDATYGKTLPQEILDSVDYEFYPKNYLSPK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIK 119 L+L EIGC ISHI ++ I + AIILEDDA S +++ L+K N LI Sbjct: 64 PLTLGEIGCAISHIRAYEHIVENKIESAIILEDDAIISHFLKEIVQDALNKVKKNYDLIF 123 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKNI 172 D K K + L + + R S TT Y I + A LLN + Sbjct: 124 LD--HGKAKSYPFKKKLYEGYRLAHYRAPSKNSKRCIIYTTAYLITQLGASKLLNYAYPV 181 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEA--IDTNDSTIEES 212 P D + I + EP V+ ID+ + IE Sbjct: 182 RMPSDYLTGLIQKTKINAYGIEPPCVFRGLSIDSEINQIEHR 223 >gi|146294977|ref|YP_001185401.1| glycosyl transferase family protein [Shewanella putrefaciens CN-32] gi|145566667|gb|ABP77602.1| glycosyl transferase, family 25 [Shewanella putrefaciens CN-32] Length = 236 Score = 195 bits (496), Expect = 5e-48, Method: Composition-based stats. Identities = 55/240 (22%), Positives = 93/240 (38%), Gaps = 9/240 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRL 61 + ++VISL S RR + + A+ ++F FF+A+ + + + Sbjct: 1 MKIFVISLERSIERRTQMIAKFAKAGIEFEFFNAVDASEEGFTLSDRVAPNVTKKRKGYE 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ CY SH LW++ +I+ED A+ +D+F L + IK Sbjct: 61 LLNSEVACYASHYLLWEKCVEMNQP-IVIVEDHAELTDDFKATLAIAFQHINEFGYIKLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A K +K + I + TTGY + +AA + I P+D M+ Sbjct: 120 AP-LKDRKFIEDKKIDTFHSIGHYTKNTCFTTGYIVSPDAAKCFIKASDRIVEPVDDFME 178 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP----TFSPLYFYRNTCYQWNLHY 237 W H + + P Y A STI +R ++ + +R+ L Y Sbjct: 179 KPWLHRVRAYSLMPFICYRA--NLPSTIGSARKIKHKLGLVKKLYIECFRSYERILRLIY 236 >gi|254448265|ref|ZP_05061727.1| putative Lex2B [gamma proteobacterium HTCC5015] gi|198262132|gb|EDY86415.1| putative Lex2B [gamma proteobacterium HTCC5015] Length = 241 Score = 195 bits (495), Expect = 6e-48, Method: Composition-based stats. Identities = 57/238 (23%), Positives = 93/238 (39%), Gaps = 6/238 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI-CNRIFSHQKRQCQFKRL 61 + + VISL S RR+ + + + F FFDAI ++ R +K R Sbjct: 1 MKICVISLKNSQERRQAVKQQMDILGIDFEFFDAITPDDPSCRFYRRLDRRKYLINTGRH 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 S EI CY SH LW S + +I+EDD + +F L I+ Sbjct: 61 PSPTEIACYASHYALWLECIRSKSP-MLIMEDDFTLTPDFKNALVACQSLADQYGYIRLQ 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 ++ + S G F + + + Y I A + + + + P+D+ +K Sbjct: 120 ---EEYRARSKAILKHGEFTLSRYTRVPHCMMAYSINPSTAKAFVRLSETLSAPVDVMLK 176 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 WEH P +V + ++DSTI+ + RKP L + T W + Sbjct: 177 KHWEHKQAIYGLTPYSVIASPHSSDSTIKGRQKHRKPLHIQLQRF-LTKATWPVRNAL 233 >gi|66796392|gb|AAY56464.1| probable transglycosylase LosA [Haemophilus influenzae] gi|309972300|gb|ADO95501.1| beta-1,4-galactosyltransferase [Haemophilus influenzae R2846] Length = 225 Score = 195 bits (495), Expect = 7e-48, Method: Composition-based stats. Identities = 64/219 (29%), Positives = 94/219 (42%), Gaps = 15/219 (6%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQCQFKR 60 + VISL S RR R + L+F FFDA YG+ P + + + + + Sbjct: 5 KIIVISLKNSS-RRAIISKRLLGLGLEFEFFDATYGKTLPQEILDSVDYEFYPKHYLSPK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIK 119 L+L EIGC ISHI ++ I + AIILEDDA S +++ + L+K N LI Sbjct: 64 PLTLGEIGCAISHIRAYEHIVENKIESAIILEDDAIISHFLKEIVQYALNKVKKNYDLIF 123 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKNI 172 D K K + L + + R S T Y I A LLN I Sbjct: 124 LD--HGKAKSYPFKKKLYEGYRLAHYRAPSKNSRRCIIYATAYLITLSGAKKLLNYAYPI 181 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 P D + + + EP V+ +++ DS I++ Sbjct: 182 RLPADYLTGLIQKTRVNTYGIEPPCVFRGLNS-DSEIDK 219 >gi|85713329|ref|ZP_01044347.1| putative glycosyltransferase [Idiomarina baltica OS145] gi|85692849|gb|EAQ30829.1| putative glycosyltransferase [Idiomarina baltica OS145] Length = 240 Score = 194 bits (494), Expect = 1e-47, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 97/241 (40%), Gaps = 7/241 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQF 58 M + +++I+L S R EK + R+++ F A+ G + F ++ + + Sbjct: 1 MNVKIFLINLDRSTERLEKAAQQLNRLNVPFERVSAVDGSKLTGAEIDAAFDTEQAERRT 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI--NNI 116 L++ EIGCY+SH+ W+RI AIILEDD D+F + + + Sbjct: 61 AYNLTIGEIGCYLSHVECWRRIITDKLDFAIILEDDLVLDDQFPHAISAIEQFTHSNEWD 120 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 IK KK + T F + T + + A LL+ R +R I Sbjct: 121 YIKLANPFKKRPYQT--RTALDRFTLVDYPKAPTGTVAQAVSFQGAETLLSHRPPFFRAI 178 Query: 177 DMDMKHWWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL 235 D+D++ WE I P ++ + + + +++ ++ L + Sbjct: 179 DVDLQWEWELGIKVQGLVPYVANVSDAESEIQQMAQRKDLKQRRWTKLKESIKFKLKKAS 238 Query: 236 H 236 H Sbjct: 239 H 239 >gi|113460783|ref|YP_718850.1| Lob1 protein [Haemophilus somnus 129PT] gi|112822826|gb|ABI24915.1| Lob1 protein [Haemophilus somnus 129PT] Length = 297 Score = 194 bits (494), Expect = 1e-47, Method: Composition-based stats. Identities = 50/243 (20%), Positives = 94/243 (38%), Gaps = 18/243 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPI--CNRIFSHQKRQ 55 ++VI+L + R+ H+ + + + FF + G P ++ +KR Sbjct: 52 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAKYNQKKRY 111 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + ++L ++GCY SH LW++ I+LEDDA F + + Sbjct: 112 YRKGNEITLGQLGCYASHYLLWEKCVQLQQP-IIVLEDDAILQPNFLAVYQFCFSAENQF 170 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 K + TLP + + Q TTGY++ +AA L+ + Sbjct: 171 EFFWLTHSNSSKIKTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDFSQEWIYN 230 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-------LVRKPTFSPLYFYRN 228 +D+ M ++E+++ L P V S I ++ +R+ F+ L + Sbjct: 231 VDIFMDRFYENHVALLGVNPPCVKPDFSKQ-SQITMNKNNRTFWVKLRREYFALLERIKR 289 Query: 229 TCY 231 Y Sbjct: 290 FVY 292 >gi|170719164|ref|YP_001784309.1| glycosyl transferase [Haemophilus somnus 2336] gi|168827293|gb|ACA32664.1| glycosyl transferase family 25 [Haemophilus somnus 2336] Length = 293 Score = 194 bits (493), Expect = 1e-47, Method: Composition-based stats. Identities = 49/243 (20%), Positives = 94/243 (38%), Gaps = 18/243 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPI--CNRIFSHQKRQ 55 ++VI+L + R+ H+ + + + FF + G P ++ +KR Sbjct: 48 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAKYNQKKRY 107 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + ++L ++GCY SH LW++ I+LEDDA F + + Sbjct: 108 QRKGNEITLGQLGCYASHYLLWEKCVQLQQP-IIVLEDDAILQPNFLAVYQFCFSAENQF 166 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + + TLP + + Q TTGY++ +AA L+ + Sbjct: 167 EFFWLTHSNSSKIRTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDSSQEWIYN 226 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-------LVRKPTFSPLYFYRN 228 +D+ M ++E+++ L P V S I ++ +R+ F+ L + Sbjct: 227 VDIFMDRFYENHVALLGVNPPCVKPDFSKQ-SQITMNKNNRTFWVKLRREYFALLERIKR 285 Query: 229 TCY 231 Y Sbjct: 286 FVY 288 >gi|2078280|gb|AAC62824.1| Lob1 [Histophilus somni] Length = 287 Score = 194 bits (492), Expect = 2e-47, Method: Composition-based stats. Identities = 49/243 (20%), Positives = 94/243 (38%), Gaps = 18/243 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPI--CNRIFSHQKRQ 55 ++VI+L + R+ H+ + + + FF + G P ++ +KR Sbjct: 42 IFVINLEKATERKHFISHQFTALQEQHPDIVINYQFFTGVNGNTQPNHPLFAKYNQKKRY 101 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + ++L ++GCY SH LW++ I+LEDDA F + + Sbjct: 102 QRKGNEITLGQLGCYASHYLLWEKCVQLQQP-IIVLEDDAILQPNFLAVYQFCFSAENQF 160 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + + TLP + + Q TTGY++ +AA L+ + Sbjct: 161 QFFWLTHSNSSKIRTKLIHTLPDSTKLEQHYFGYSNTTGYYLTPQAAQKFLDSSQEWIYN 220 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR-------LVRKPTFSPLYFYRN 228 +D+ M ++E+++ L P V S I ++ +R+ F+ L + Sbjct: 221 VDIFMDRFYENHVALLGVNPPCVKPDFSKQ-SQITMNKNNRTFWVKLRREYFALLERIKR 279 Query: 229 TCY 231 Y Sbjct: 280 FVY 282 >gi|118594621|ref|ZP_01551968.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Methylophilales bacterium HTCC2181] gi|118440399|gb|EAV47026.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Methylophilales bacterium HTCC2181] Length = 260 Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats. Identities = 56/232 (24%), Positives = 95/232 (40%), Gaps = 7/232 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-NRIFSHQKRQCQFKR 60 I V+VISL RRE R I F FFDAI G+ + + + KR+ + + Sbjct: 7 DIKVFVISLAREKQRREAIQKRLNGICNTFEFFDAIDGKKEQLSQHNDYIGLKRRLFYGK 66 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E+GC+ SH L +I A+ILEDDA ++F + L + ++F Sbjct: 67 DLSDGELGCFFSHRALINKIVQEKIPFAVILEDDAILLNDFKPTVESLLNTSYSWDFVRF 126 Query: 121 DALRK-KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K + K + + L N+ + + Y + + A L + ++ P D Sbjct: 127 LTKPKIQRKTQTIVANLFDNYQLLRIATAPGGAYAYIVSMKGAKKLQRSMEKVWCPNDTL 186 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 M + L+ P + + + +S I + R + + L + Y Sbjct: 187 MGQPLRTHAEILIVYPP-IADWDKSFNSAIGDDRFKK----NRLSGWEKAVY 233 >gi|261867139|ref|YP_003255061.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412471|gb|ACX81842.1| lipooligosaccharide galactosyltransferase I [Aggregatibacter actinomycetemcomitans D11S-1] Length = 230 Score = 191 bits (486), Expect = 8e-47, Method: Composition-based stats. Identities = 64/220 (29%), Positives = 97/220 (44%), Gaps = 13/220 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLL 62 P++VISL SH RRE R + LQF FF A+YG+ +Q Q + L Sbjct: 9 PIFVISLKNSH-RREFIAKRLNGLGLQFEFFYAVYGKALSEEELTKVDYQYYQDFDNKRL 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIKFD 121 +L EIGC +SHI +++ I + AIILEDDA S F +L + K ++ FD Sbjct: 68 TLGEIGCALSHIQVYEHIKKNNIAEAIILEDDAIVSTHFKAILQAAIEKLPSRYEILFFD 127 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNIYR 174 + K C LP + + + R S + T Y + LL + Sbjct: 128 HGKAKSYPLIKKC-LPEGYKLVRYRYPSKNSRRSIMKATAYMVNLAGVEKLLKYAYPLRM 186 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEA--IDTNDSTIEES 212 P D + I + EP V+E +++ ++IE+ Sbjct: 187 PADFVTGFIQKTRIHAYGVEPSCVFEGLAVESEINSIEDR 226 >gi|153007953|ref|YP_001369168.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC 49188] gi|151559841|gb|ABS13339.1| glycosyl transferase family 25 [Ochrobactrum anthropi ATCC 49188] Length = 255 Score = 191 bits (485), Expect = 1e-46, Method: Composition-based stats. Identities = 61/251 (24%), Positives = 98/251 (39%), Gaps = 12/251 (4%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFKR 60 +PVYVI+L S R ++ A + + +A+ G+ P F + + Sbjct: 4 SVPVYVINLARSRDRWDRLKSNADALSIDLRRVEAVEGKLLSPEELTDFDDAGFRRWHGK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + EIGCY SHI + + IA +P A+I+EDD F+ F L +L+K +K Sbjct: 64 TVLPAEIGCYFSHIQVLEIIAAAPESFAVIVEDDVIFTPAFQPFLTYLTKVT-GWDAVKL 122 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 R F I + + Y + +E A +L + + P D+ Sbjct: 123 VNHR--TAAFRPFQKAGDQFSIGRCLHGPLGSSAAYVVTREGASKMLKALRPMRLPYDVA 180 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY------FYRNTCYQW 233 ++ W T+ V E D STI + R P+Y +R T Y Sbjct: 181 LERGWAGAYQVFTTDKPVV-EFSDIAISTIAQGRSAYAKKRLPVYKRITTLLFRATDYVR 239 Query: 234 NLHYNAWRKDL 244 + Y +K L Sbjct: 240 RIAYALDKKGL 250 >gi|46143791|ref|ZP_00134259.2| COG3306: Glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208454|ref|YP_001053679.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae L20] gi|126097246|gb|ABN74074.1| putative LPS biosynthesis protein [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 213 Score = 189 bits (480), Expect = 4e-46, Method: Composition-based stats. Identities = 61/208 (29%), Positives = 87/208 (41%), Gaps = 15/208 (7%) Query: 1 MPIP-VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQC 56 MPIP ++VISL S RR+ R + L F F DA+ G + + + ++ Sbjct: 6 MPIPPIFVISLANSV-RRKIISARLNSLGLAFEFIDAVNGRDLTEEQLSQVDYDFYPQRF 64 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINN 115 K L L EIGC +SHI ++ + + AIILEDDA S F +++ L K Sbjct: 65 GGKHRLKLGEIGCAMSHIRAYQHLVDNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRK 124 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNV 168 +I FD K K L + + + R S TGY++ E A LLN Sbjct: 125 QIIFFD--HGKAKYWPITRPLKEGYKLVRYRYPSKNSKRIIMIATGYYLTLEGAKLLLNH 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG 196 + P D I + EP Sbjct: 183 AYPLRMPADFLTGAIQMTGIKAYGVEPP 210 >gi|83593566|ref|YP_427318.1| glycosyl transferase family protein [Rhodospirillum rubrum ATCC 11170] gi|83576480|gb|ABC23031.1| Glycosyl transferase, family 25 [Rhodospirillum rubrum ATCC 11170] Length = 255 Score = 188 bits (477), Expect = 8e-46, Method: Composition-based stats. Identities = 50/226 (22%), Positives = 97/226 (42%), Gaps = 10/226 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFK 59 M +PV+V++LP ARRE+ AR+ L F D + + + + + + Sbjct: 1 MSLPVFVLNLPRDGARRERMTAELARVGLDGHFIDGVDAGTLSQADWDRYDRSRCRAIYG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN---NI 116 + E+ CY++H ++++I A+ILEDD +++ +L L Sbjct: 61 VDMLATELACYLAHERIFRKIVDENIDAALILEDDCQIANDLPSVLDALMATPPAARLWQ 120 Query: 117 LIKFDALRK-----KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 +++ +R+ + ++ L L + +++ GY + A +L+ + Sbjct: 121 VVRLSTMREGKISAQARRLRPLRRLGDDRGLYRVHTHVLGLQGYVMTAAGARRMLDYGRK 180 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 I+ PID + +WE+ I + P V D STI E R+ Sbjct: 181 IFMPIDHTLDRYWENRITPFIVHPCPVVHH-DEGHSTIGERDPRRR 225 >gi|167837452|ref|ZP_02464335.1| glycosyl transferase, family 25 [Burkholderia thailandensis MSMB43] Length = 261 Score = 187 bits (476), Expect = 1e-45, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 90/251 (35%), Gaps = 16/251 (6%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 ISL + RR K + A + FFDA + + RQ ++ LS E Sbjct: 9 CISLTRAQDRRTKMVEQFANHGINARFFDAFDLKGTVEAIPGYDAAGRQRRYGWQLSRGE 68 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 +GCY+SH W ++ S ++EDD D F L + +++ + ++ Sbjct: 69 VGCYLSHRAAWLQLVQSGKEAMCVMEDDITLLDGFKAATLELYAARQHWDMVRLMWINER 128 Query: 127 PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEH 186 + S LP + Y I + AA +L+ I ID+ WEH Sbjct: 129 --QQSEYARLPSGTRLM-WMENPVGLQCYMITRTAAQRMLDYTAKITHAIDIAFDRNWEH 185 Query: 187 NIPSLVTEPGAVYEA------IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 VT P V + D DS RL K +YR + + +N Sbjct: 186 GQRMYVTSPQFVADTGAPTTITDRPDSRTLVQRLKAK-------YYRKVERRTSRRFNDE 238 Query: 241 RKDLPPVSTTK 251 + P+ Sbjct: 239 HRPKRPIQIEI 249 >gi|109639335|gb|ABG36532.1| lipooligosaccharide galactosyltransferase I [Haemophilus ducreyi] Length = 249 Score = 187 bits (474), Expect = 2e-45, Method: Composition-based stats. Identities = 54/225 (24%), Positives = 102/225 (45%), Gaps = 11/225 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARI--HLQFSFFDAIYGENNPICN--RIFSHQKRQCQF 58 +P+++I+L S R++ + + + + F FF+AIYG+ NP + ++ +R + Sbjct: 2 LPIFIINLEKSVERKKIIQQQFSHLATEIPFEFFNAIYGKENPDFYLFKKYNESERIKRK 61 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 ++L ++GC+ SH LW++ IILEDDA D F + LS + + Sbjct: 62 GNQMNLAQLGCFASHYLLWEKCLELNQP-IIILEDDAILHDNFIEAYDFLSDKENIFEFL 120 Query: 119 KFD----ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 A R + K Y + + TTGY+I AA LLN + Sbjct: 121 WLSPPAPARRGQQGKKLYQINAKE-LSVFKFYKCWGNTTGYYITPAAAKKLLNYCQEWIY 179 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT 219 +D+ ++ +W +++ L P + + + +S I + ++ T Sbjct: 180 DLDITIERYWANDLACLALIPACLEPDL-SLESNIPVDKGKQQRT 223 >gi|28896984|ref|NP_796589.1| putative Lex2B [Vibrio parahaemolyticus RIMD 2210633] gi|153839479|ref|ZP_01992146.1| putative Lex2B [Vibrio parahaemolyticus AQ3810] gi|260362359|ref|ZP_05775317.1| glycosyltransferase family 25 [Vibrio parahaemolyticus K5030] gi|260897637|ref|ZP_05906133.1| glycosyltransferase family 25 [Vibrio parahaemolyticus Peru-466] gi|260899568|ref|ZP_05907963.1| glycosyltransferase family 25 [Vibrio parahaemolyticus AQ4037] gi|28805192|dbj|BAC58473.1| putative Lex2B [Vibrio parahaemolyticus RIMD 2210633] gi|149747008|gb|EDM57996.1| putative Lex2B [Vibrio parahaemolyticus AQ3810] gi|308087546|gb|EFO37241.1| glycosyltransferase family 25 [Vibrio parahaemolyticus Peru-466] gi|308108818|gb|EFO46358.1| glycosyltransferase family 25 [Vibrio parahaemolyticus AQ4037] gi|308115137|gb|EFO52677.1| glycosyltransferase family 25 [Vibrio parahaemolyticus K5030] Length = 239 Score = 185 bits (471), Expect = 4e-45, Method: Composition-based stats. Identities = 55/246 (22%), Positives = 100/246 (40%), Gaps = 14/246 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 + +YVI+L S RRE+ + + F FFDA + +P ++ +K + Sbjct: 1 MKIYVINLKESVERREQVRKVLS--SVDFEFFDAENIKKDPEHFIYTLYDEKKTRKYKGY 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++PE+GC+ SHI LWK+ +ILED+ + E + L ++ L+K Sbjct: 59 TLTIPELGCWASHISLWKKCVSDHTPF-LILEDNIELFGELVEQLKNIENLTKKYGLVKL 117 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + ++ K + + + + + T+ Y I + A L+ + P+D M Sbjct: 118 GNIFER--KHIEIVKIDEKYRLVSNLKGACGTSAYAITPKVAAAYLSQINGFFEPVDDFM 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP----TFSPLYFYRNTCYQWNLH 236 + W +P V +N S+ R V+ + YR + Sbjct: 176 DNEWRTRQTVYSYKPQLVSR---SNTSSTIGQRKVKSNNGLLNKINIELYRLFKQLNQVI 232 Query: 237 YNAWRK 242 YN +K Sbjct: 233 YNKRKK 238 >gi|269965925|ref|ZP_06180018.1| conserved hypothetical protein [Vibrio alginolyticus 40B] gi|269829478|gb|EEZ83719.1| conserved hypothetical protein [Vibrio alginolyticus 40B] Length = 241 Score = 185 bits (471), Expect = 4e-45, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 84/215 (39%), Gaps = 6/215 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRL 61 + V+VI++ S RR + + + ++ F F +AI + + +F Sbjct: 1 MKVFVINMKRSVDRRVRMENALNKANIPFEFIEAIDSSAPNFLYSERSNDDLTRKRFGYR 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EI C+ SH W++ ++LED+ D EF + H +K Sbjct: 61 LVENEIACFSSHHLAWEKCLELNEP-ILVLEDNCDLLPEFFDVFSHFDTLAEQYDFLKLA 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A KPK + ++ +I + + GY + AA + P+D M+ Sbjct: 120 AT--KPKSFKVISSVNNKLNIVRFNKRTCGIMGYILTPHAASKFIKNATQFIEPVDDYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 ++H + + V P V A STI SR + Sbjct: 178 KPYKHGVSTYVLRPDIVKRADI--PSTIGSSRKDK 210 >gi|323497081|ref|ZP_08102104.1| glycosyl transferase family protein [Vibrio sinaloensis DSM 21326] gi|323317925|gb|EGA70913.1| glycosyl transferase family protein [Vibrio sinaloensis DSM 21326] Length = 235 Score = 185 bits (470), Expect = 5e-45, Method: Composition-based stats. Identities = 54/215 (25%), Positives = 85/215 (39%), Gaps = 6/215 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-SHQKRQCQFKRL 61 + ++VISL S RR + +QF F +A+ + +F Sbjct: 1 MRIFVISLERSTDRRARVIEALGSKDIQFEFLNAVDASRPEYQYSERRCDALTRKRFGYT 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ EI C+ SH W++ ++LED+ DFS+ F LL +L + IK Sbjct: 61 LTTGEIACFASHHLAWEKCLELNEP-IMVLEDNCDFSERFFTLLSNLDTPLKSYSHIKLA 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 A R +P L I + + T GY I AA ++ P+D M+ Sbjct: 120 ATRSQPPIIK--EQLSQELAIVNYKRRTCGTMGYLITPNAAARFISGASLFIEPVDNYME 177 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 ++H + + P VY A STI R + Sbjct: 178 KPYKHGVKTFCFYPDLVYRAKIQ--STIGSQRKNK 210 >gi|284105044|ref|ZP_06386173.1| Glycosyl transferase, family 25 [Candidatus Poribacteria sp. WGA-A3] gi|283830167|gb|EFC34427.1| Glycosyl transferase, family 25 [Candidatus Poribacteria sp. WGA-A3] Length = 248 Score = 185 bits (469), Expect = 7e-45, Method: Composition-based stats. Identities = 65/248 (26%), Positives = 114/248 (45%), Gaps = 13/248 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRL 61 P YVI+L + R + + + F DA+ G + + + + K Sbjct: 3 PSYVINLAENTVRMDNTARQLTAQRIPFERIDAVNGWALSESEISLAYDAEVNRRRGKHP 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L PEIGCY+SH+ W+R+A + G I EDD +++ +L LS+ + + ++K Sbjct: 63 LLRPEIGCYLSHVTAWRRVAEGESAGGFIFEDDFLATEDLPDVLSRLSEDERDWDMVKLF 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 + + P+ + P ++ P + T GY + +EA+ HL+ +RP+D D K Sbjct: 123 SFDQGPRTVTRRQLGP-RHEVVVPFRVPSCTVGYGLTREASRHLVRRATPFFRPVDEDQK 181 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDS-TIEESR--LVRKPTF----SPLYFYRNTCYQWN 234 +WE + + P V + TI E R LV++ + S +Y +R YQ Sbjct: 182 FFWETGLRVALVLPSPVLAGDQQAVTRTIREERRLLVKRGSMQAWHSLIYQFR---YQAL 238 Query: 235 LHYNAWRK 242 LH++ +K Sbjct: 239 LHWHWMQK 246 >gi|86356685|ref|YP_468577.1| putative glycosyltransferase protein [Rhizobium etli CFN 42] gi|86280787|gb|ABC89850.1| putative glycosyltransferase protein [Rhizobium etli CFN 42] Length = 298 Score = 183 bits (464), Expect = 2e-44, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 104/258 (40%), Gaps = 13/258 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L + RR + A L + A+ G + + F + + R Sbjct: 21 VNTYLINLDRAPLRRFRMERLLAGFGLAYERVAAVDGAAISLPHPGFDEKSYLSRHGRRP 80 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGCY+SH+ +R S A A+ILEDD DF +F++LL + + +++ Sbjct: 81 NPFEIGCYLSHVECARRFLASNAEFALILEDDLDFDGDFAELLEAALRHQAHWDILRLST 140 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + K + L + + + Y I ++AA + + P D+ Sbjct: 141 VNSGRKHR--VEPLTASRSLAIALTREKGSGAYLINRQAAGWIAGALLPMRLPYDLAFDL 198 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYF-----YRNTCYQW 233 ++ + + +P V + D S I+ RL R+ +S L F R ++ Sbjct: 199 EFDDGLRACFVDPLPVGQRADPC-SQIQAGLSAYRLGRRRPWSVLPFRAAAEIRRFVARF 257 Query: 234 NLHYNAWRKDLPPVSTTK 251 AWR L P S + Sbjct: 258 G-RLAAWRLTLSPASMGR 274 >gi|262371064|ref|ZP_06064386.1| glycosyl transferase, family 25 [Acinetobacter johnsonii SH046] gi|262313950|gb|EEY94995.1| glycosyl transferase, family 25 [Acinetobacter johnsonii SH046] Length = 254 Score = 183 bits (464), Expect = 3e-44, Method: Composition-based stats. Identities = 51/251 (20%), Positives = 96/251 (38%), Gaps = 13/251 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R E+ + +++ F F A+ G + + Q Sbjct: 1 MKVVTYLINLDGSQERLERATQQLNQVNWPFERFSAVDGRGKDLTEFANYDDQGANDVLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKC-DIN 114 R L E+GCY+SH ++ + A ++LEDD + +F ++ +L + +++ Sbjct: 61 RRLMNSELGCYLSHYGCAEQFLSTDADYLVVLEDDMKINQDFKDKLDGVIEYLDQHKELD 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI +KK KD T + R G ++ A + + + Sbjct: 121 WYLINIATKKKKLAKD---ITNIDGMSVWHAFYFPIRGLGLVWSRQGAEAFMQAGRKMTM 177 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND--STIEESRLVRKPTFSPLYFY--RNTC 230 P+D+ + W N L P V A +D T+ + RK + + + Sbjct: 178 PVDIFFQTWLSKNGKGLGVWPALVKPAGLDSDILGTVAAQGISRKEKEGRSFSHGLKKQK 237 Query: 231 YQWNLHYNAWR 241 W + A + Sbjct: 238 RIWRDKFYAMK 248 >gi|152978207|ref|YP_001343836.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150839930|gb|ABR73901.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 265 Score = 182 bits (463), Expect = 4e-44, Method: Composition-based stats. Identities = 55/253 (21%), Positives = 92/253 (36%), Gaps = 19/253 (7%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGE---NNPICNRIFSH 51 I + +I+L S RR+ + + L + FF + G N+P+ + ++ Sbjct: 16 SINILIINLKKSTQRRQFMQKQFDELQKCFPDFHLNYQFFTGVNGNEEANHPLFKK-YNS 74 Query: 52 QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC 111 +KR + + +SL ++GC+ SH L ++ IILEDDA D F +L + Sbjct: 75 KKRLARKGQDMSLSQLGCFASHYLLLEKCVQLNEP-VIILEDDAILLDGFYDVLKYAPTV 133 Query: 112 DINNILIKFD--ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + + K N I Q TTGYF+ AA L Sbjct: 134 ANYFEFFRLSNRSGGNNVKSAPLFQIPNTNLIISQVYKGWANTTGYFVTPRAAKKFLTQM 193 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR----KPTFSPLYF 225 K +D+ M +WE+ + P V + S I+ ++ R K Sbjct: 194 KEWIYNVDITMDRYWENKVHFCALLPNVVRPQDEFL-SQIQMNKTKRSISVKLKREIFAA 252 Query: 226 YRNTCYQWNLHYN 238 Y + Sbjct: 253 YDRVNKCLFDLLH 265 >gi|317050484|ref|YP_004111600.1| glycosyl transferase family 25 [Desulfurispirillum indicum S5] gi|316945568|gb|ADU65044.1| glycosyl transferase family 25 [Desulfurispirillum indicum S5] Length = 248 Score = 182 bits (463), Expect = 4e-44, Method: Composition-based stats. Identities = 60/230 (26%), Positives = 90/230 (39%), Gaps = 16/230 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 +PV+VISL S RR+ + F F DAI G + P+ +R F K + R Sbjct: 5 LPVFVISLVSSEKRRKSSTELLLSQGISFEFIDAIDGRKDRHPLLDR-FRPDKFLVRHGR 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + E GCY SH W++ I+ EDD D + L I+ Sbjct: 64 PSAPGEAGCYASHFLAWQKCVELN-CPIIVFEDDFAVRDHIYDIFSFLPDLMSYYPFIRL 122 Query: 121 DA----LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + L KK K+ G++ + + + R T Y I AA + K P+ Sbjct: 123 EDNDPVLHKKIKQF-------GDYTLVRFLRIPQRATCYAISPFAAAAFIKASKEFVYPV 175 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTND-STIEESRLVRKPTFSPLYF 225 D+ ++H H IP P VY A + + S I + P + L Sbjct: 176 DVFVRHQNIHKIPIYGLLPYPVYPADPSGNYSEIGNRHKDKGPLWCKLTR 225 >gi|306844604|ref|ZP_07477191.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. BO1] gi|306275048|gb|EFM56811.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. BO1] Length = 261 Score = 182 bits (461), Expect = 6e-44, Method: Composition-based stats. Identities = 50/238 (21%), Positives = 98/238 (41%), Gaps = 6/238 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + A++ F A+ G P ++ L Sbjct: 1 MRFYLINLDRSSDRLEHMTSQFAKLGADFVRIPAVDGREMPPEALKAVTAAKRPW-AAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC++SH + IA + A+++EDD F+D+ +LL H + ++K + Sbjct: 60 TPTEIGCFLSHRRCLEAIARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIET 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM-- 180 KK + + + + + GY + ++AA ++ + + PID + Sbjct: 120 QGKKVLIGKPIACASTRYSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLFN 179 Query: 181 -KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF--SPLYFYRNTCYQWNL 235 + + + P +A T+ E S+L +KP F L + T + ++ Sbjct: 180 APYGVFNQLSVYQCTPALCKQAGLTSTLQSERSQLYQKPPFVGRVLREIKRTVKRASI 237 >gi|167626112|ref|YP_001676406.1| glycosyl transferase family protein [Shewanella halifaxensis HAW-EB4] gi|167356134|gb|ABZ78747.1| glycosyl transferase family 25 [Shewanella halifaxensis HAW-EB4] Length = 238 Score = 181 bits (460), Expect = 8e-44, Method: Composition-based stats. Identities = 60/245 (24%), Positives = 100/245 (40%), Gaps = 12/245 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 + V+VISL S R +K + FSFFDA ++NP ++ +K + Sbjct: 1 MKVFVISLKSSIERHKKVAEALKE--VPFSFFDAENIKDNPQHSIFSLYDSRKTKRYKGY 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++PE+GC+ SHI LWK+ + +ILED+ + L ++ ++K Sbjct: 59 ELTIPELGCFASHISLWKQCLADNEVF-LILEDNLALFGDLESQLDNIESLTHKYGVVKL 117 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + ++ D + + + + + T+ Y I +AA + LN + P+D M Sbjct: 118 GNVFERKFVD--IEPIDEQYKLISNLKGACGTSAYAITPQAAQNYLNQVDGFFEPVDDFM 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK-PTFSPL--YFYRNTCYQWNLHY 237 + W N P V + S I + K F+ L FYR Y Sbjct: 176 DNEWRTNQTLYSYHPKLVSRSAAA--SIIGLRKNKTKLGIFNKLTVEFYRLLKQFKQSRY 233 Query: 238 NAWRK 242 N K Sbjct: 234 NKNHK 238 >gi|145588437|ref|YP_001155034.1| glycosyl transferase family protein [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145046843|gb|ABP33470.1| glycosyl transferase, family 25 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 239 Score = 179 bits (454), Expect = 4e-43, Method: Composition-based stats. Identities = 56/230 (24%), Positives = 93/230 (40%), Gaps = 11/230 (4%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 I VISL S +R+EK + HL++ F DA+ G + +K + Sbjct: 3 DIQALVISLAGSDSRQEKVRAELGKTHLEWRFLDAVRGSALVEVPAEYKPEKVKALLGFE 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ E+GC++SH W+ II EDD F F + + LS + I+ Sbjct: 63 LTPNELGCFLSHKKAWQECVDKNIP-TIIFEDDFYFLPHFEEAIHFLSTQCNDWDAIRLQ 121 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 L P++ G + + + T Y + AA L+ +IY P+D ++ Sbjct: 122 GLSDVPQE---CILENGQMSLVRNVGDAVGATAYLLKPTAAKKLIAASNDIYEPLDHFLE 178 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 H+ +H++ L P V I STI + +P P+ + Sbjct: 179 HYQKHHVEFLAISPYPV--DITRVQSTIAD-----RPERLPIKGWAKLKR 221 >gi|110634463|ref|YP_674671.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110285447|gb|ABG63506.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 258 Score = 179 bits (454), Expect = 4e-43, Method: Composition-based stats. Identities = 51/241 (21%), Positives = 90/241 (37%), Gaps = 24/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +++I+L R + L F+ A+ G P+ Q L Sbjct: 1 MKIFLINLDRRPDRLRHMTEILDGLGLPFTRLSAVDGMQLPMEEV---------QRLGSL 51 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 C++SH WKR+ A++LEDD + + + LL H ++K + Sbjct: 52 LPGATACFLSHRECWKRVVDEALPHAVVLEDDLHLAPDAASLLSHGEWVPTEADVVKLET 111 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + D + G+ +H+ R T GY + ++ A LL + + + P+D M + Sbjct: 112 RLCRTRVDKGVAAAIGSRSLHRLRSSHMGTGGYIVTRKGAERLLALSERLEAPVDHFMFN 171 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTFSPLYFYRNTCYQW 233 H+ SL T + A+ DS I +V +P P R W Sbjct: 172 AGLHSAASLTT--FQMVPAVCVQDSYIGRQSMVLGIESDLLDERPPTKP----RGIRKLW 225 Query: 234 N 234 Sbjct: 226 R 226 >gi|17986701|ref|NP_539335.1| lipooligosaccharide biosynthesis protein lic2B [Brucella melitensis bv. 1 str. 16M] gi|237815992|ref|ZP_04594989.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus str. 2308 A] gi|260755315|ref|ZP_05867663.1| glycosyl transferase [Brucella abortus bv. 6 str. 870] gi|260758538|ref|ZP_05870886.1| glycosyl transferase [Brucella abortus bv. 4 str. 292] gi|260762361|ref|ZP_05874704.1| glycosyl transferase [Brucella abortus bv. 2 str. 86/8/59] gi|260884334|ref|ZP_05895948.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|265991653|ref|ZP_06104210.1| glycosyl transferase [Brucella melitensis bv. 1 str. Rev.1] gi|17982323|gb|AAL51599.1| lipooligosaccharide biosynthesis protein lic2b [Brucella melitensis bv. 1 str. 16M] gi|237789290|gb|EEP63501.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus str. 2308 A] gi|260668856|gb|EEX55796.1| glycosyl transferase [Brucella abortus bv. 4 str. 292] gi|260672793|gb|EEX59614.1| glycosyl transferase [Brucella abortus bv. 2 str. 86/8/59] gi|260675423|gb|EEX62244.1| glycosyl transferase [Brucella abortus bv. 6 str. 870] gi|260873862|gb|EEX80931.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|263002437|gb|EEZ15012.1| glycosyl transferase [Brucella melitensis bv. 1 str. Rev.1] Length = 268 Score = 178 bits (451), Expect = 9e-43, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 94/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 9 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 68 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 127 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + + P+D + Sbjct: 128 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKVSAPVDHFLFD 186 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 187 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 236 >gi|170724445|ref|YP_001758471.1| glycosyl transferase family protein [Shewanella woodyi ATCC 51908] gi|169809792|gb|ACA84376.1| glycosyl transferase family 25 [Shewanella woodyi ATCC 51908] Length = 229 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 51/212 (24%), Positives = 87/212 (41%), Gaps = 5/212 (2%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 + + + A+ G + ++ ++ ++L+ EIGCY+SHI W Sbjct: 1 MTKQCEPLGITPERVSAVRGSLLSEKEKSDVYQLSANLSKYDKILNDGEIGCYMSHIRCW 60 Query: 78 KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLP 137 ++I A++LEDDA +D + + L+ + IK K K L Sbjct: 61 EQIVEQELDFALVLEDDAILTDNIIKYVEKLAPSSADWDYIKLSHG-SKVKSAVDSMDLG 119 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGA 197 + + L TTG I A LL I RPIDMD+++W+E ++ V P Sbjct: 120 DGLTLRKELKLPSTTTGQLISLSGAKKLLAHAYPITRPIDMDIQYWFEKSLRCFVVAPFP 179 Query: 198 VYEA-IDTNDSTIEESRLVRK-PTFSPLYFYR 227 + + + ++ R V+K P L R Sbjct: 180 ILNGDFGSEINKTDDRRFVKKRPIRRVLQKLR 211 >gi|210135004|ref|YP_002301443.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] gi|210132972|gb|ACJ07963.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] Length = 278 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 58/276 (21%), Positives = 98/276 (35%), Gaps = 49/276 (17%) Query: 3 IPVYVISLP----FSHARRE-KFCHRAARI---HLQFSFFDAIYGENNPI---------- 44 + V++I L + + +E + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKFQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCKNEKHPTCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD D F + L K I+ L + K + G I Sbjct: 119 NE-AICILEDDIIIKDRFKESLEFCDKHINELGYIRLMHLEENVAKQKT--PVKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLTPKAAQKLLKYSTKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++T +S E+ R + P + ++ QWN+ Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRIGRSLHKGAIKQWNI 270 >gi|306844605|ref|ZP_07477192.1| glycosyl transferase family protein [Brucella sp. BO1] gi|306275049|gb|EFM56812.1| glycosyl transferase family protein [Brucella sp. BO1] Length = 269 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 10 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 68 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 69 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 128 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 129 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 187 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 188 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 237 >gi|261219724|ref|ZP_05934005.1| glycosyl transferase [Brucella ceti M13/05/1] gi|261322614|ref|ZP_05961811.1| glycosyl transferase [Brucella ceti M644/93/1] gi|260924813|gb|EEX91381.1| glycosyl transferase [Brucella ceti M13/05/1] gi|261295304|gb|EEX98800.1| glycosyl transferase [Brucella ceti M644/93/1] Length = 269 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 10 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 68 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 69 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 128 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 129 YGHEVLISTPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 187 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 188 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 237 >gi|62290485|ref|YP_222278.1| glycosyl transferase family protein [Brucella abortus bv. 1 str. 9-941] gi|82700408|ref|YP_414982.1| glycosyl transferase family protein [Brucella melitensis biovar Abortus 2308] gi|189024710|ref|YP_001935478.1| glycosyl transferase, family 25 [Brucella abortus S19] gi|225853077|ref|YP_002733310.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis ATCC 23457] gi|254689784|ref|ZP_05153038.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 6 str. 870] gi|254694273|ref|ZP_05156101.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 3 str. Tulya] gi|254697928|ref|ZP_05159756.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 2 str. 86/8/59] gi|254730818|ref|ZP_05189396.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 4 str. 292] gi|256045220|ref|ZP_05448118.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis bv. 1 str. Rev.1] gi|256114172|ref|ZP_05454926.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis bv. 3 str. Ether] gi|256258037|ref|ZP_05463573.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella abortus bv. 9 str. C68] gi|256263440|ref|ZP_05465972.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|260547014|ref|ZP_05822753.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260565183|ref|ZP_05835667.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|261214582|ref|ZP_05928863.1| glycosyl transferase [Brucella abortus bv. 3 str. Tulya] gi|265995491|ref|ZP_06108048.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|297248867|ref|ZP_06932585.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|62196617|gb|AAX74917.1| glycosyl transferase, family 25 [Brucella abortus bv. 1 str. 9-941] gi|82616509|emb|CAJ11576.1| Glycosyl transferase, family 25 [Brucella melitensis biovar Abortus 2308] gi|189020282|gb|ACD73004.1| Glycosyl transferase, family 25 [Brucella abortus S19] gi|225641442|gb|ACO01356.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis ATCC 23457] gi|260096064|gb|EEW79941.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260151251|gb|EEW86345.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260916189|gb|EEX83050.1| glycosyl transferase [Brucella abortus bv. 3 str. Tulya] gi|262766604|gb|EEZ12393.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|263093448|gb|EEZ17498.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|297176036|gb|EFH35383.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|326409620|gb|ADZ66685.1| Glycosyl transferase, family 25 [Brucella melitensis M28] gi|326539323|gb|ADZ87538.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella melitensis M5-90] Length = 260 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 94/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKVSAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|110634465|ref|YP_674673.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110285449|gb|ABG63508.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 250 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 92/236 (38%), Gaps = 6/236 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + +VI L + R + + + + DA+ E + R++ + ++ Sbjct: 1 MKVEAFVIHLARATGRAPQVERLRQNLTIPVTVIDAVDAEQLSEEEIVRVYRPGLHRPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L E+ C++SH W+ I +I+EDD + L C ++ Sbjct: 61 PFPLRRTEVACFLSHRKAWQTIMERGLDAGLIIEDDVELLPGVHHLFDSGLACATTKDVL 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F KK ++ G I +PR++ +G++AA LL + RP+D Sbjct: 121 RFP---KKARERGAALNENGANRIFEPRLVGLGMQAQLVGRDAAGELLAFTREFDRPVDT 177 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 ++ W H + L + P A+ E T + + + + L Y+++ Sbjct: 178 TIQMRWLHGVRVLSSSPVAIREVAAALGGTTVQGK-GKSHAETLLREMLRARYRFS 232 >gi|225628038|ref|ZP_03786074.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella ceti str. Cudo] gi|294852916|ref|ZP_06793589.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|225617201|gb|EEH14247.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Brucella ceti str. Cudo] gi|294821505|gb|EFG38504.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 269 Score = 177 bits (449), Expect = 1e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 10 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 68 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 69 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 128 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 129 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 187 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 188 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 237 >gi|118581167|ref|YP_902417.1| glycosyl transferase family protein [Pelobacter propionicus DSM 2379] gi|118503877|gb|ABL00360.1| glycosyl transferase, family 25 [Pelobacter propionicus DSM 2379] Length = 251 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. Identities = 51/250 (20%), Positives = 93/250 (37%), Gaps = 15/250 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +++++L + RR C R + L+ F A+ G + + + R Sbjct: 5 MIPKLFILNLERAAQRRHIMCQRLCALGLEAEFLTAVDGASLDMDALPRGTEPR------ 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E GCY+SH+ W + A+ILEDD FS ++ + + ++ Sbjct: 59 -LSPGEKGCYLSHVAAWNTVVERGLSHAVILEDDVILGSNFSAVVAEIIALGMPFDAVRL 117 Query: 121 DALRKKPKKD----SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 AL+ + TL N + P+ + T GY + A L + P+ Sbjct: 118 SALKPDLLMPEHGAVPVATLASNARLVLPKKMPSGTQGYLVSYSGAKRLRSKISVPREPV 177 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL----YFYRNTCYQ 232 D +W+H + V P V E + + R + + + +F + Sbjct: 178 DDVFDTYWKHGLCIPVLFPTVVEEDPSQESTIVGRIRWTYRTKKTLIQRFMWFVEKKRRK 237 Query: 233 WNLHYNAWRK 242 ++ A R Sbjct: 238 ISISLIARRL 247 >gi|124265921|ref|YP_001019925.1| LPS biosynthesis glycosyltransferase [Methylibium petroleiphilum PM1] gi|124258696|gb|ABM93690.1| glycosyltransferase involved in LPS biosynthesis-like protein [Methylibium petroleiphilum PM1] Length = 253 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. Identities = 58/244 (23%), Positives = 94/244 (38%), Gaps = 12/244 (4%) Query: 10 LPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGC 69 + +H RRE+F AA ++ FFDA+ P + +F R L E+GC Sbjct: 1 MASAHQRREQFAAGAAHGTARWEFFDAL---RLPAQGLHYDEPLTVRRFGRALKPGEVGC 57 Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKK 129 Y SH +W++ S A ++ EDD + L D +++ + Sbjct: 58 YASHYEVWRQFLMSAADQLLVFEDDVMVDWPLIEQLCAHRLADHGIDVLRLYTSHPINVR 117 Query: 130 DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIP 189 + L + + Q R T Y + + A LL+ + + P+D M +W +P Sbjct: 118 IAKYKLLSDHSHLMQVRGYLYGTQAYVLSRRGAEALLSACRVMTMPVDWAMSRYWVCGMP 177 Query: 190 SLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR----KDLP 245 + P V E STIE ++ + S L +R + W L A R Sbjct: 178 AFTLFPFPVLE--RHGPSTIEHAQQIGA---SSLASHRVDRFLWRLRDRAARAWFDLTSS 232 Query: 246 PVST 249 P S Sbjct: 233 PASP 236 >gi|254714819|ref|ZP_05176630.1| glycosyl transferase family protein [Brucella ceti M644/93/1] gi|254717877|ref|ZP_05179688.1| glycosyl transferase family protein [Brucella ceti M13/05/1] Length = 260 Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISTPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|265984639|ref|ZP_06097374.1| glycosyl transferase [Brucella sp. 83/13] gi|306837736|ref|ZP_07470604.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. NF 2653] gi|264663231|gb|EEZ33492.1| glycosyl transferase [Brucella sp. 83/13] gi|306407192|gb|EFM63403.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. NF 2653] Length = 261 Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats. Identities = 50/238 (21%), Positives = 96/238 (40%), Gaps = 6/238 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + A++ F A+ G P + L Sbjct: 1 MRFYLINLDRSSDRLEHMTSQFAKLGADFVKIPAVDGREMPPEALKAVTATERPW-AAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC++SH + IA + A+++EDD F+D+ +LL H + ++K + Sbjct: 60 TPTEIGCFLSHRRCLEAIARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIET 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM-- 180 KK + + + + + GY + ++AA ++ + + PID + Sbjct: 120 QGKKVLIGKPIACASTRYSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLFN 179 Query: 181 -KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF--SPLYFYRNTCYQWNL 235 + + + P +A T+ E S+L +KP F L + T + + Sbjct: 180 APYGVFNQLSVYQCTPALCKQAGLTSTLQSERSQLYQKPPFVGRVLREIKRTFKRAGI 237 >gi|262374910|ref|ZP_06068144.1| glycosyl transferase, family 25 [Acinetobacter lwoffii SH145] gi|262309923|gb|EEY91052.1| glycosyl transferase, family 25 [Acinetobacter lwoffii SH145] Length = 253 Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats. Identities = 60/251 (23%), Positives = 100/251 (39%), Gaps = 13/251 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFK 59 M + Y+I+L S R + + +F F A G + I ++ + Sbjct: 1 MKVVTYLINLDGSDERLAAATQQLNSVSWRFERFAAYDGRGKALSEFINYNDDQTNKVLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKC-DIN 114 R L E+GCY+SH ++ + A ++LEDD + +F ++ +L + D+N Sbjct: 61 RSLMNSELGCYLSHYGCAEKFLATDADYLVVLEDDLKINSDFKIKIDSIIEYLHQHQDLN 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI A +KK KD T G+ + R G ++ A + V K I Sbjct: 121 WYLINIAAKKKKLAKD---ITQIGDISLWHAFYFPIRGLGLIWSRQGAQSFVEVGKTINM 177 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND--STIEESRLVRK--PTFSPLYFYRNTC 230 P+D+ + W N L P V A +D T+ + RK S Y ++ Sbjct: 178 PVDIFFQSWLSQNGKGLGIWPPLVKPAGVDSDILGTVATQGISRKALENRSSSYGFKKQK 237 Query: 231 YQWNLHYNAWR 241 W Y+A + Sbjct: 238 RMWRDRYHALK 248 >gi|254719630|ref|ZP_05181441.1| glycosyl transferase family protein [Brucella sp. 83/13] gi|265984640|ref|ZP_06097375.1| glycosyl transferase [Brucella sp. 83/13] gi|306837735|ref|ZP_07470603.1| glycosyl transferase family protein [Brucella sp. NF 2653] gi|264663232|gb|EEZ33493.1| glycosyl transferase [Brucella sp. 83/13] gi|306407191|gb|EFM63402.1| glycosyl transferase family protein [Brucella sp. NF 2653] Length = 260 Score = 176 bits (447), Expect = 2e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|148559939|ref|YP_001259470.1| glycosyl transferase family protein [Brucella ovis ATCC 25840] gi|254702321|ref|ZP_05164149.1| glycosyl transferase family protein [Brucella suis bv. 5 str. 513] gi|254710635|ref|ZP_05172446.1| glycosyl transferase family protein [Brucella pinnipedialis B2/94] gi|256032128|ref|ZP_05445742.1| glycosyl transferase family protein [Brucella pinnipedialis M292/94/1] gi|256061650|ref|ZP_05451789.1| glycosyl transferase family protein [Brucella neotomae 5K33] gi|256160324|ref|ZP_05458018.1| glycosyl transferase family protein [Brucella ceti M490/95/1] gi|256255529|ref|ZP_05461065.1| glycosyl transferase family protein [Brucella ceti B1/94] gi|256370018|ref|YP_003107529.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260169263|ref|ZP_05756074.1| glycosyl transferase, family 25 [Brucella sp. F5/99] gi|261222739|ref|ZP_05937020.1| glycosyl transferase [Brucella ceti B1/94] gi|261318207|ref|ZP_05957404.1| glycosyl transferase [Brucella pinnipedialis B2/94] gi|261325660|ref|ZP_05964857.1| glycosyl transferase [Brucella neotomae 5K33] gi|261752886|ref|ZP_05996595.1| glycosyl transferase [Brucella suis bv. 5 str. 513] gi|261758776|ref|ZP_06002485.1| glycosyl transferase [Brucella sp. F5/99] gi|265989240|ref|ZP_06101797.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] gi|265998701|ref|ZP_06111258.1| glycosyl transferase [Brucella ceti M490/95/1] gi|148371196|gb|ABQ61175.1| glycosyl transferase, family 25 [Brucella ovis ATCC 25840] gi|256000181|gb|ACU48580.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260921323|gb|EEX87976.1| glycosyl transferase [Brucella ceti B1/94] gi|261297430|gb|EEY00927.1| glycosyl transferase [Brucella pinnipedialis B2/94] gi|261301640|gb|EEY05137.1| glycosyl transferase [Brucella neotomae 5K33] gi|261738760|gb|EEY26756.1| glycosyl transferase [Brucella sp. F5/99] gi|261742639|gb|EEY30565.1| glycosyl transferase [Brucella suis bv. 5 str. 513] gi|262553325|gb|EEZ09159.1| glycosyl transferase [Brucella ceti M490/95/1] gi|264661437|gb|EEZ31698.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] Length = 260 Score = 176 bits (447), Expect = 2e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|206563391|ref|YP_002234154.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] gi|198039431|emb|CAR55397.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] Length = 255 Score = 176 bits (447), Expect = 2e-42, Method: Composition-based stats. Identities = 59/250 (23%), Positives = 99/250 (39%), Gaps = 18/250 (7%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 ISL +H RR A+ ++ FFDA + + R ++ LLS E Sbjct: 9 CISLARAHDRRAGMVEEFAKQNINARFFDAFELKGCGHGIPGYDTGARLRRYGWLLSPGE 68 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL-RK 125 +GCY+SH +W+++ S + I+EDD D F L + +++ L R+ Sbjct: 69 LGCYLSHRAVWRQLVESGSNACCIMEDDVALLDGFKSATEELYATREHWDMVRLMWLNRR 128 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE 185 + + ++L + + P Y I +EAA +L+ I ID+ WE Sbjct: 129 EQTEYAHLSSGTKLMWMESP----VGLQCYMITREAAEAMLDYTSRILHAIDIAFDRNWE 184 Query: 186 HNIPSLVTEPGAVYEA------IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 H VTEP V + +D +D+ RL K ++R Y Sbjct: 185 HEQRMFVTEPQFVQDTGAPTTILDRSDARTLVQRLKAK-------YHRKVDRIAARAYAK 237 Query: 240 WRKDLPPVST 249 + P++ Sbjct: 238 KHRPSSPITL 247 >gi|153952416|ref|YP_001397749.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. doylei 269.97] gi|152939862|gb|ABS44603.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. doylei 269.97] Length = 254 Score = 176 bits (446), Expect = 3e-42, Method: Composition-based stats. Identities = 60/251 (23%), Positives = 95/251 (37%), Gaps = 28/251 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHMQKQIQKLFEKNPSLKNKLEFVFFKAIDAKNKEHLEFKDHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF ++ + L Sbjct: 61 WGSWVLGRELSDGEKACFASHYKLWQECVKLDEP-IIILEDDVEFSDEFLNNGAEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K I+ L K G + + ++ GY + AA+ L Sbjct: 120 LK--SEYEYIRLCYLFDKRLYF----LSEGGYYLSFEKL--AGAQGYVLQVSAAMKFLKC 171 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRN 228 KN +D M ++ HN+ ++V +P + S + RL K F YR Sbjct: 172 AKNWIYAVDDYMDMFYRHNVLNIVKKPLLLKHDCRIESSISQAGRLFLKAKF-----YRK 226 Query: 229 TCYQWNLHYNA 239 + Y Sbjct: 227 IIREIFRLYRN 237 >gi|109947473|ref|YP_664701.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] gi|109714694|emb|CAJ99702.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] Length = 281 Score = 176 bits (446), Expect = 3e-42, Method: Composition-based stats. Identities = 55/274 (20%), Positives = 90/274 (32%), Gaps = 45/274 (16%) Query: 3 IPVYVISLPFSHARREKFCHR--------AARIHLQFSFFDAIYGENNP----------- 43 + V++I L R + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCRNFSLKETHIAPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 44 ------ICNRIFSHQKRQ------------CQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 F K+ + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCKDKKHPPCMLKNFFYALKHCGKRMGFGELGCYASHYSLWQKCVELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD D F + L + I+ L + K T Sbjct: 121 -AVCILEDDITLKDNFKESLKFCYQHINELGYIRLMHLEENVAKKK---TSIKGVSQILN 176 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 T GY + +AA LL K PID M + H + + V E A++ + ++ Sbjct: 177 FKDGIGTQGYVLAPKAAQKLLKYSAKKWVMPIDCVMDRHYWHGVKNYVLEEFAIFCDEMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 +S E+ R + P + +++ QWN+ Sbjct: 237 AQNSNTEKQRPKKLPLSIRIGRSLHKSAIKQWNI 270 >gi|57238023|ref|YP_179272.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni RM1221] gi|12802847|gb|AAK08089.1|AF334378_2 unknown [Campylobacter jejuni] gi|57166827|gb|AAW35606.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni RM1221] gi|315058580|gb|ADT72909.1| Beta-1,4-galactosyltransferase [Campylobacter jejuni subsp. jejuni S3] Length = 254 Score = 176 bits (446), Expect = 3e-42, Method: Composition-based stats. Identities = 61/251 (24%), Positives = 96/251 (38%), Gaps = 28/251 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHMKKQIQKLFEKNPSLKNKLEFIFFKAIDAKNKEHLEFKDHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF + + L Sbjct: 61 WGSWVLGRELSDGEKACFASHYKLWQECVKLDEP-IIILEDDVEFSDEFLIMGQEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K I+ L K G + + ++ T GY + AA+ L Sbjct: 120 LKSK--YEYIRLCYLFDKRLYF----LSEGGYYLSFEKL--AGTQGYVLQVSAAMKFLKY 171 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRN 228 KN +D M +++HN+ ++V +P + S + RL K F YR Sbjct: 172 AKNWIYAVDDYMDMFYKHNVLNIVKKPLLLKHDCRIESSISQARRLFLKAKF-----YRK 226 Query: 229 TCYQWNLHYNA 239 + Y Sbjct: 227 IIREIFRLYRN 237 >gi|317178835|dbj|BAJ56623.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F30] Length = 278 Score = 175 bits (445), Expect = 4e-42, Method: Composition-based stats. Identities = 56/276 (20%), Positives = 94/276 (34%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K TL Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSTKKWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++T +S E+ R + P + +++T QWN+ Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRIGRSLHKSTVKQWNV 270 >gi|188527351|ref|YP_001910038.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143591|gb|ACD48008.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 278 Score = 175 bits (444), Expect = 5e-42, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 93/276 (33%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLWAFCKNKKHPPCTLKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K TL Sbjct: 119 NE-AICILEDDIIVKEYFKESLEFCYQHINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSTKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++T +S E+ R + P + +++ QWN+ Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRIGRSLHKSALKQWNV 270 >gi|15611808|ref|NP_223459.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155297|gb|AAD06313.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 292 Score = 175 bits (444), Expect = 5e-42, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 91/277 (32%), Gaps = 49/277 (17%) Query: 2 PIPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI--------- 44 + V++I L + + + FDAIY + +P Sbjct: 11 NMRVFIIHLSPKTCKNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEH 70 Query: 45 ----------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY 82 + F + + C + + E+GCY SH LW++ Sbjct: 71 LHPSFMVEDLLAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIE 128 Query: 83 SPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI 142 ILEDD + F + L + I+ L + K Sbjct: 129 LNE-AICILEDDITLKEHFKESLEFCRQHINELGYIRLMHLEENVAKQKTPVK---GVSQ 184 Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-E 200 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 185 ILNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACD 244 Query: 201 AIDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 I+ +S E+ R R P + + +++ QWN+ Sbjct: 245 GINAQNSNTEKQRPKRLPLSIRIGRFLHKSAIQQWNI 281 >gi|307637495|gb|ADN79945.1| Beta-1,4-galactosyl transferase [Helicobacter pylori 908] gi|325996084|gb|ADZ51489.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997680|gb|ADZ49888.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2017] Length = 281 Score = 175 bits (444), Expect = 6e-42, Method: Composition-based stats. Identities = 55/274 (20%), Positives = 94/274 (34%), Gaps = 45/274 (16%) Query: 3 IPVYVISLP----FSHARRE-KFC---HRAARIHLQFSFFDAIYGENNP----------- 43 + V++I L + + +E + + FDAIY +++P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKLQGISYEIFDAIYSKSSPTQLHPLILEHL 60 Query: 44 ------ICNRIFSHQKRQ------------CQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 F K+ + + E+GCY SH LW++ Sbjct: 61 HPSFVIEDLLAFCENKKHPPCALKNFFYAIKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L K I+ L + K Sbjct: 121 -AICILEDDIVLKERFKESLEFCDKHINELGYIRLMHLEENVAKQKTPVK---GVSQILN 176 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 T GY + +AA LL K PID M ++ H + + V E A+ + ++ Sbjct: 177 FKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRYYWHGVKNYVLEEFAIACDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 +S E+ R R P+ + + ++ QWN+ Sbjct: 237 AQNSNTEKQRPKRLPSSIRIGRFLHKGAIKQWNI 270 >gi|261755545|ref|ZP_05999254.1| glycosyl transferase [Brucella suis bv. 3 str. 686] gi|261745298|gb|EEY33224.1| glycosyl transferase [Brucella suis bv. 3 str. 686] Length = 268 Score = 175 bits (444), Expect = 6e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 9 MKCYLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLE-LASFTQISKEWPAPL 67 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 68 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 127 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 128 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 186 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 187 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 236 >gi|110633728|ref|YP_673936.1| glycosyl transferase family protein [Mesorhizobium sp. BNC1] gi|110284712|gb|ABG62771.1| glycosyl transferase, family 25 [Chelativorans sp. BNC1] Length = 241 Score = 175 bits (443), Expect = 7e-42, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 97/244 (39%), Gaps = 8/244 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFK 59 M IPV VI+L S R E RAA I + A+ G P R +K Q Sbjct: 1 MDIPVLVINLDRSKERWEGLAARAAAIDIFPERVQAVDGNAIEPSERRELQLRKFQLWHG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R E GCY+SH+ R+ + A+ILEDDA+F +F+ L L++ D+ L+K Sbjct: 61 RRPMGGEYGCYMSHMRALDRVIAAGWPYAVILEDDAEFLPDFAPRLRALTRLDLKPDLVK 120 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT-GYFIGKEAAIHLLNVRKNIYRPIDM 178 R K + + ++ GY + E A L ++ P D+ Sbjct: 121 LYNHR--TKGFLTKAKTEAGDHVGRCIHGPLGSSMGYLVSGEGAAKLRAGALPMFLPYDI 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT--FSPLYFYRNTCYQWNLH 236 ++ W H + +T + A + STI + + P P +R Y Sbjct: 179 ALERGWAHGLRVYITRKPLLKPA--RSGSTIGGYKKTKFPFYMRIPTALFRGQDYLRRTL 236 Query: 237 YNAW 240 Y + Sbjct: 237 YAVF 240 >gi|23502468|ref|NP_698595.1| glycosyl transferase family protein [Brucella suis 1330] gi|254704849|ref|ZP_05166677.1| glycosyl transferase family protein [Brucella suis bv. 3 str. 686] gi|260565910|ref|ZP_05836380.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|23348460|gb|AAN30510.1| glycosyl transferase, family 25 [Brucella suis 1330] gi|260155428|gb|EEW90508.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] Length = 260 Score = 174 bits (442), Expect = 8e-42, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|317177365|dbj|BAJ55154.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 278 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 54/276 (19%), Positives = 93/276 (33%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCLEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 + ILEDD + F + L + I+ L + K TL Sbjct: 119 NEV-ICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ R + P + +++ QWN+ Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRSLHKSAVKQWNV 270 >gi|40063395|gb|AAR38206.1| glycosyl transferase, family 25 [uncultured marine bacterium 580] Length = 257 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 49/220 (22%), Positives = 87/220 (39%), Gaps = 4/220 (1%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 +P +VI++P R+ K + + + FF A + KR+ F R Sbjct: 7 KLPTFVITMPDEINRQNKSKKQLKGVCDNYEFFFAN--SKPNKFPTTYVKWKRRFYFGRD 64 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK-F 120 LS E GC+ SH + ++I A++LEDD F F + + L KC + L++ Sbjct: 65 LSYGEFGCFNSHKLVLEKIVSKEIPKALVLEDDFIFLPNFEKSISDLLKCSYDWELVRLL 124 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + + L N+ + + Y + A LL+ I+ P+D+ M Sbjct: 125 SKPKLNNRMKKNVANLGNNYKLVRIATAPGGAYAYIVTLRGAKKLLSAMNKIWCPVDLVM 184 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 W ++ L P V + +S I R ++ Sbjct: 185 GQPWRTDLEILTVIPS-VATWDQSFNSAIGNERFKKRQLR 223 >gi|261839380|gb|ACX99145.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 52] Length = 278 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 53/274 (19%), Positives = 92/274 (33%), Gaps = 45/274 (16%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPICNRIF----- 49 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 50 ------------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 + + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNKKHPPCTLTNFFYALKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L + I+ L + K + G I Sbjct: 121 -AICILEDDIIVKERFKESLEFCCQHINELGYIRLMHLEENVAKQKT--PVKGVSQILNF 177 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 + T GY + +AA LL K PID M + H + + V E A+ + ++ Sbjct: 178 KD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 T +S E+ R + P + + +++ QWN+ Sbjct: 237 TQNSNTEKQRPKKLPLSIRIGRFLHKSAVKQWNV 270 >gi|317014215|gb|ADU81651.1| family 25 glycosyl transferase [Helicobacter pylori Gambia94/24] Length = 281 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 56/276 (20%), Positives = 97/276 (35%), Gaps = 49/276 (17%) Query: 3 IPVYVISLP----FSHARRE-KFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + +E + + FDAIY +++P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKLQGISYEIFDAIYSKSSPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F H + C + + E+GCY SH LW++ Sbjct: 61 HPSFVIEDLLAFCENEKHPPCALKNFFHAIKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDITLKECFKESLEFCRQHINELGYIRLMHLEENVAKQKT--PVKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ R + P + ++ QWN+ Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRSLHKGAIKQWNI 270 >gi|225626847|ref|ZP_03784886.1| Glycosyltransferase 25 family member 2 precursor [Brucella ceti str. Cudo] gi|256030977|ref|ZP_05444591.1| glycosyl transferase family protein [Brucella pinnipedialis M292/94/1] gi|260168111|ref|ZP_05754922.1| glycosyl transferase, family 25 [Brucella sp. F5/99] gi|261757563|ref|ZP_06001272.1| glycosyl transferase [Brucella sp. F5/99] gi|265988042|ref|ZP_06100599.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] gi|225618504|gb|EEH15547.1| Glycosyltransferase 25 family member 2 precursor [Brucella ceti str. Cudo] gi|261737547|gb|EEY25543.1| glycosyl transferase [Brucella sp. F5/99] gi|264660239|gb|EEZ30500.1| glycosyl transferase [Brucella pinnipedialis M292/94/1] Length = 239 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVKAEVIEAVDSRALNKDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVDHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + + +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQLVSYDAAQKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRARAY 236 >gi|261837965|gb|ACX97731.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 289 Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 92/277 (33%), Gaps = 49/277 (17%) Query: 2 PIPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIY--------------- 38 + V++I L + + + FDAIY Sbjct: 11 NMRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEH 70 Query: 39 ----------------GENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY 82 +N P + F + + C + + E+GCY SH LW++ Sbjct: 71 LHPSFMIEDLWAFCKNKKNPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIE 128 Query: 83 SPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI 142 ILEDD + F + L + I+ L +K K Sbjct: 129 LNE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEEKVAKQKTFIK---GVSQ 184 Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-E 200 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 185 ILNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACD 244 Query: 201 AIDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ R + P + +++T QW++ Sbjct: 245 GMNAQNSNTEKQRPKKLPLSIRIGRSLHKSTVKQWSV 281 >gi|23501228|ref|NP_697355.1| glycosyl transferase family protein [Brucella suis 1330] gi|148558876|ref|YP_001258359.1| glycosyl transferase family protein [Brucella ovis ATCC 25840] gi|161618304|ref|YP_001592191.1| glycosyl transferase family protein [Brucella canis ATCC 23365] gi|163842608|ref|YP_001627012.1| glycosyl transferase family protein [Brucella suis ATCC 23445] gi|254701140|ref|ZP_05162968.1| glycosyl transferase family protein [Brucella suis bv. 5 str. 513] gi|254703688|ref|ZP_05165516.1| glycosyl transferase family protein [Brucella suis bv. 3 str. 686] gi|254707939|ref|ZP_05169767.1| glycosyl transferase family protein [Brucella pinnipedialis M163/99/10] gi|254709483|ref|ZP_05171294.1| glycosyl transferase family protein [Brucella pinnipedialis B2/94] gi|254713099|ref|ZP_05174910.1| glycosyl transferase family protein [Brucella ceti M644/93/1] gi|254716548|ref|ZP_05178359.1| glycosyl transferase family protein [Brucella ceti M13/05/1] gi|256060468|ref|ZP_05450639.1| glycosyl transferase family protein [Brucella neotomae 5K33] gi|256159022|ref|ZP_05456859.1| glycosyl transferase family protein [Brucella ceti M490/95/1] gi|256254380|ref|ZP_05459916.1| glycosyl transferase family protein [Brucella ceti B1/94] gi|256368780|ref|YP_003106286.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260567060|ref|ZP_05837530.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|261218343|ref|ZP_05932624.1| glycosyl transferase family 25 [Brucella ceti M13/05/1] gi|261221542|ref|ZP_05935823.1| glycosyl transferase [Brucella ceti B1/94] gi|261315430|ref|ZP_05954627.1| glycosyl transferase family 25 [Brucella pinnipedialis M163/99/10] gi|261317005|ref|ZP_05956202.1| glycosyl transferase family 25 [Brucella pinnipedialis B2/94] gi|261320808|ref|ZP_05960005.1| glycosyl transferase family 25 [Brucella ceti M644/93/1] gi|261324459|ref|ZP_05963656.1| glycosyl transferase [Brucella neotomae 5K33] gi|261751674|ref|ZP_05995383.1| glycosyl transferase family 25 protein [Brucella suis bv. 5 str. 513] gi|261754328|ref|ZP_05998037.1| glycosyl transferase family 25 protein [Brucella suis bv. 3 str. 686] gi|265997505|ref|ZP_06110062.1| glycosyl transferase [Brucella ceti M490/95/1] gi|294851709|ref|ZP_06792382.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|23347110|gb|AAN29270.1| glycosyl transferase, family 25 [Brucella suis 1330] gi|148370133|gb|ABQ60112.1| glycosyl transferase, family 25 [Brucella ovis ATCC 25840] gi|161335115|gb|ABX61420.1| glycosyl transferase family 25 [Brucella canis ATCC 23365] gi|163673331|gb|ABY37442.1| glycosyl transferase family 25 [Brucella suis ATCC 23445] gi|255998938|gb|ACU47337.1| glycosyl transferase, family 25 [Brucella microti CCM 4915] gi|260156578|gb|EEW91658.1| glycosyl transferase, family 25 [Brucella suis bv. 4 str. 40] gi|260920126|gb|EEX86779.1| glycosyl transferase [Brucella ceti B1/94] gi|260923432|gb|EEX90000.1| glycosyl transferase family 25 [Brucella ceti M13/05/1] gi|261293498|gb|EEX96994.1| glycosyl transferase family 25 [Brucella ceti M644/93/1] gi|261296228|gb|EEX99724.1| glycosyl transferase family 25 [Brucella pinnipedialis B2/94] gi|261300439|gb|EEY03936.1| glycosyl transferase [Brucella neotomae 5K33] gi|261304456|gb|EEY07953.1| glycosyl transferase family 25 [Brucella pinnipedialis M163/99/10] gi|261741427|gb|EEY29353.1| glycosyl transferase family 25 protein [Brucella suis bv. 5 str. 513] gi|261744081|gb|EEY32007.1| glycosyl transferase family 25 protein [Brucella suis bv. 3 str. 686] gi|262551973|gb|EEZ07963.1| glycosyl transferase [Brucella ceti M490/95/1] gi|294820298|gb|EFG37297.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 239 Score = 173 bits (440), Expect = 1e-41, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVKAEVIEAVDSRALNKDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVDHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + + +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQLVSYDAAQKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|217032900|ref|ZP_03438377.1| hypothetical protein HPB128_152g17 [Helicobacter pylori B128] gi|298736508|ref|YP_003729034.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|216945394|gb|EEC24062.1| hypothetical protein HPB128_152g17 [Helicobacter pylori B128] gi|298355698|emb|CBI66570.1| glycosyl transferase, family 25 [Helicobacter pylori B8] Length = 281 Score = 173 bits (440), Expect = 2e-41, Method: Composition-based stats. Identities = 56/276 (20%), Positives = 99/276 (35%), Gaps = 49/276 (17%) Query: 3 IPVYVISLP----FSHARRE-KFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + +E + + FDAIY +++P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKLQGISYEIFDAIYSKSSPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F H + C + + E+GCY SH LW++ Sbjct: 61 HPSFVIEDLLAFCENEKHPPCALKNFFHAIKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIVLKERFKESLEFCRQHINELGYIRLMHLEENVAKQKT--PVKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ R + P + + +++ QWN+ Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRFLHKSAIQQWNI 270 >gi|62289315|ref|YP_221108.1| glycosyl transferase family protein [Brucella abortus bv. 1 str. 9-941] gi|62195447|gb|AAX73747.1| glycosyl transferase, family 25 [Brucella abortus bv. 1 str. 9-941] Length = 239 Score = 173 bits (440), Expect = 2e-41, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 87/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLVKELPVKAEVIEAVDSRALNKDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVDHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + F+ +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQFVSYDAAQKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|254718514|ref|ZP_05180325.1| glycosyl transferase family protein [Brucella sp. 83/13] gi|265983483|ref|ZP_06096218.1| glycosyl transferase family 25 [Brucella sp. 83/13] gi|306838322|ref|ZP_07471168.1| glycosyl transferase family protein [Brucella sp. NF 2653] gi|264662075|gb|EEZ32336.1| glycosyl transferase family 25 [Brucella sp. 83/13] gi|306406613|gb|EFM62846.1| glycosyl transferase family protein [Brucella sp. NF 2653] Length = 239 Score = 173 bits (440), Expect = 2e-41, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVRAEVIEAVDSRALGNDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVNHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + + +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRNEAVRIIIPNPIGLGMVAQLVSYDAARKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|17987885|ref|NP_540519.1| glycosyltransferase [Brucella melitensis bv. 1 str. 16M] gi|82699245|ref|YP_413819.1| glycosyl transferase family protein [Brucella melitensis biovar Abortus 2308] gi|189023568|ref|YP_001934336.1| glycosyl transferase, family 25 [Brucella abortus S19] gi|225851867|ref|YP_002732100.1| glycosyltransferase 25 family protein [Brucella melitensis ATCC 23457] gi|237814806|ref|ZP_04593804.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus str. 2308 A] gi|254688634|ref|ZP_05151888.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 6 str. 870] gi|254693115|ref|ZP_05154943.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 3 str. Tulya] gi|254696757|ref|ZP_05158585.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 2 str. 86/8/59] gi|254729666|ref|ZP_05188244.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 4 str. 292] gi|256044048|ref|ZP_05446959.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis bv. 1 str. Rev.1] gi|256112844|ref|ZP_05453765.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis bv. 3 str. Ether] gi|256256881|ref|ZP_05462417.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus bv. 9 str. C68] gi|256264619|ref|ZP_05467151.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|260545931|ref|ZP_05821672.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260563407|ref|ZP_05833893.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260754106|ref|ZP_05866454.1| glycosyl transferase family 25 [Brucella abortus bv. 6 str. 870] gi|260757326|ref|ZP_05869674.1| glycosyl transferase family 25 [Brucella abortus bv. 4 str. 292] gi|260761150|ref|ZP_05873493.1| glycosyl transferase family 25 [Brucella abortus bv. 2 str. 86/8/59] gi|260883131|ref|ZP_05894745.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|261213352|ref|ZP_05927633.1| glycosyl transferase family 25 [Brucella abortus bv. 3 str. Tulya] gi|265990459|ref|ZP_06103016.1| glycosyl transferase family 25 [Brucella melitensis bv. 1 str. Rev.1] gi|265994287|ref|ZP_06106844.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|297247731|ref|ZP_06931449.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|17983618|gb|AAL52783.1| glycosyltransferase [Brucella melitensis bv. 1 str. 16M] gi|82615346|emb|CAJ10307.1| Glycosyl transferase, family 25 [Brucella melitensis biovar Abortus 2308] gi|189019140|gb|ACD71862.1| Glycosyl transferase, family 25 [Brucella abortus S19] gi|225640232|gb|ACO00146.1| Glycosyltransferase 25 family member 2 precursor [Brucella melitensis ATCC 23457] gi|237789643|gb|EEP63853.1| Glycosyltransferase 25 family member 2 precursor [Brucella abortus str. 2308 A] gi|260097338|gb|EEW81213.1| glycosyl transferase, family 25 [Brucella abortus NCTC 8038] gi|260153423|gb|EEW88515.1| glycosyl transferase, family 25 [Brucella melitensis bv. 1 str. 16M] gi|260667644|gb|EEX54584.1| glycosyl transferase family 25 [Brucella abortus bv. 4 str. 292] gi|260671582|gb|EEX58403.1| glycosyl transferase family 25 [Brucella abortus bv. 2 str. 86/8/59] gi|260674214|gb|EEX61035.1| glycosyl transferase family 25 [Brucella abortus bv. 6 str. 870] gi|260872659|gb|EEX79728.1| glycosyl transferase [Brucella abortus bv. 9 str. C68] gi|260914959|gb|EEX81820.1| glycosyl transferase family 25 [Brucella abortus bv. 3 str. Tulya] gi|262765400|gb|EEZ11189.1| glycosyl transferase [Brucella melitensis bv. 3 str. Ether] gi|263001243|gb|EEZ13818.1| glycosyl transferase family 25 [Brucella melitensis bv. 1 str. Rev.1] gi|263094986|gb|EEZ18694.1| glycosyl transferase [Brucella melitensis bv. 2 str. 63/9] gi|297174900|gb|EFH34247.1| glycosyl transferase [Brucella abortus bv. 5 str. B3196] gi|326408363|gb|ADZ65428.1| Glycosyl transferase, family 25 [Brucella melitensis M28] gi|326538078|gb|ADZ86293.1| glycosyltransferase 25 family member 2 precursor [Brucella melitensis M5-90] Length = 239 Score = 173 bits (439), Expect = 2e-41, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 87/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVKAEVIEAVDSRALNKDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVDHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + F+ +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQFVSYDAAQKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|306842412|ref|ZP_07475065.1| glycosyl transferase family protein [Brucella sp. BO2] gi|306287461|gb|EFM58936.1| glycosyl transferase family protein [Brucella sp. BO2] Length = 239 Score = 173 bits (439), Expect = 2e-41, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVRAEVIEAVDSRALGNDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTENFMGAYRAAVNHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + + +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQLVSYDAARKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|308063408|gb|ADO05295.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 278 Score = 173 bits (439), Expect = 2e-41, Method: Composition-based stats. Identities = 54/276 (19%), Positives = 94/276 (34%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESFKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K TL Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++T +S E+ + + P + + +++ QWN+ Sbjct: 235 MNTQNSNTEKQKPKKLPLSIRIGRFLHKSALKQWNV 270 >gi|297380003|gb|ADI34890.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori v225d] Length = 285 Score = 172 bits (437), Expect = 3e-41, Method: Composition-based stats. Identities = 60/300 (20%), Positives = 100/300 (33%), Gaps = 62/300 (20%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNEKHSPCALKNFFYALKHC--GKRMGFGELGCYASHYLLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQKT--PIKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLEYSTKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSS 259 ++T +S E+ R + P + +++ QWN+ F P SSS Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRIGRSLHKSALKQWNV-------------LKSFFPHQSSS 281 >gi|161619545|ref|YP_001593432.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtE [Brucella canis ATCC 23365] gi|161336356|gb|ABX62661.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtE [Brucella canis ATCC 23365] Length = 260 Score = 172 bits (437), Expect = 3e-41, Method: Composition-based stats. Identities = 50/232 (21%), Positives = 92/232 (39%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLE-LASFTQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ L ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R + GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQMGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|283954707|ref|ZP_06372225.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 414] gi|283793899|gb|EFC32650.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 414] Length = 253 Score = 172 bits (437), Expect = 4e-41, Method: Composition-based stats. Identities = 61/262 (23%), Positives = 105/262 (40%), Gaps = 30/262 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHMQKQIQKLFLKNPNLKNKLEFIFFKAIDAKNKEHLEFKEHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ +ILEDD +FSDEF + + L Sbjct: 61 WDSWVLGRELSDGEKACFASHYRLWQECVKLNEP-IMILEDDVEFSDEFLNHGEEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 SK + I I + D L L G + + ++ T GY + A +++ Sbjct: 120 SKSEYEYIRILYIF-------DKRLYLLRGGYYLSFEKL--AGTQGYILKPSGAKKIISK 170 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR------LVRKPTFSP 222 K IY P+D M +++H + +++ +P + ++ R + RK Sbjct: 171 AKFIYMPVDDYMDKFYKHKVLNIIKKPLLLKHNLEFQSDISRLGRHNRKIKIYRKIIREI 230 Query: 223 LYFYRNTCYQWNLHYNAWRKDL 244 FY + ++ Y R L Sbjct: 231 CRFYPSILSLLSILYIRNRLKL 252 >gi|308182961|ref|YP_003927088.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] gi|308065146|gb|ADO07038.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] Length = 278 Score = 172 bits (437), Expect = 4e-41, Method: Composition-based stats. Identities = 53/276 (19%), Positives = 91/276 (32%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K TL Sbjct: 119 NE-AICILEDDIIIKERFKESLEFCYQHINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ + + P + ++ QWN+ Sbjct: 235 MNAQNSNTEKQKPKKLPLSIRIGRSLHKGALKQWNV 270 >gi|217034149|ref|ZP_03439569.1| hypothetical protein HP9810_868g42 [Helicobacter pylori 98-10] gi|216943433|gb|EEC22889.1| hypothetical protein HP9810_868g42 [Helicobacter pylori 98-10] Length = 300 Score = 172 bits (435), Expect = 6e-41, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 96/276 (34%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQKT--PIKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLEYSAKKWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++T +S E+ R + P + + +++ QWN+ Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRIGRFLHKSAVKQWNV 270 >gi|308062125|gb|ADO04013.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Cuz20] Length = 278 Score = 171 bits (434), Expect = 7e-41, Method: Composition-based stats. Identities = 51/274 (18%), Positives = 88/274 (32%), Gaps = 45/274 (16%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPICNRIF----- 49 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 50 ------------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 + + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLWAFCKNEKHPPCVLENFFYALKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L + I+ L + K T Sbjct: 121 -AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQK---TSVKGVSQILN 176 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 T GY + +AA LL K PID M + H + + V E A+ + ++ Sbjct: 177 FKDGIGTQGYVLAPKAAQKLLEYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 T +S E+ + + P + +++ QWN+ Sbjct: 237 TQNSNTEKQKPKKLPLSIRIGRSLHKSALKQWNV 270 >gi|163843853|ref|YP_001628257.1| lipooligosaccharide biosynthesis protein lic2B [Brucella suis ATCC 23445] gi|163674576|gb|ABY38687.1| Lipooligosaccharide biosynthesis protein lic2B [Brucella suis ATCC 23445] Length = 260 Score = 171 bits (434), Expect = 8e-41, Method: Composition-based stats. Identities = 50/232 (21%), Positives = 92/232 (39%), Gaps = 7/232 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L S R E + R+ QF +A+ G + S + ++ L Sbjct: 1 MKCYLINLDKSRDRLEFMASQFERLGAQFKRVEAVNGRAMSPLE-LASFIQISKEWPAPL 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC++SH ++IA A + EDD S S+ ++K DA Sbjct: 60 SPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFWASDHWIPKQADIVKIDA 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + G + I + R +T GY + +EAA LL + + P+D + Sbjct: 120 YGHEVLISNPVKN-EGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFD 178 Query: 183 WWE---HNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 + ++ P ++ +STI ++R ++ +R Sbjct: 179 PNDGPFNDFEIYQISPAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 228 >gi|306844961|ref|ZP_07477542.1| glycosyl transferase family protein [Brucella sp. BO1] gi|306274593|gb|EFM56382.1| glycosyl transferase family protein [Brucella sp. BO1] Length = 239 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 9/238 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 M + ++I L + R + + ++ +A+ N RI+ + ++ Sbjct: 1 MNVKAFIIHLKRATDRAPQVEKLIKELPVRAEVIEAVDSRALGNDEIARIYKRRLHTPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LS EI C++SH W+ I I+EDD ++ F I Sbjct: 61 PFALSRNEIACFLSHRKAWQAIIDRKLDAGFIVEDDIALTEIFMGAYRAAVDHLEPGGFI 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + ++ I P + + +AA LL++ + RP+D Sbjct: 121 RFTF--RDDREHGREVFRDEAVRIIIPNPIGLGMVAQLVSYDAARKLLDITQTFDRPVDT 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 ++ W + L PG V E + +TI+ + + + P+Y R Y Sbjct: 179 TVQMRWVTGLQPLSVIPGGVKEISSQLGGTTIQHKKNFSDKLAREILRPIYRMRVRAY 236 >gi|83951263|ref|ZP_00959996.1| glycosyl transferase, family 25 [Roseovarius nubinhibens ISM] gi|83839162|gb|EAP78458.1| glycosyl transferase, family 25 [Roseovarius nubinhibens ISM] Length = 250 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. Identities = 54/241 (22%), Positives = 92/241 (38%), Gaps = 10/241 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M +++ L + ARRE L + A+ G ++ ++ Sbjct: 1 MTGHAFILHLTRASARRENAHRLRDSCGLPAEIWPAVDGAAMDDRELRARVDAGLFAPRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNIL 117 L + EIGC++SH +W I A+ILEDDA F+Q L Sbjct: 61 PFPLRMGEIGCFLSHRQIWAEILRRDLDHALILEDDAGLDPEHFAQALTLGRDHLDRFGY 120 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 I+F + + + L G + P++ RTT + + +A LL RP+D Sbjct: 121 IQFQT--RPARGPARLIATQGACQLLLPQMAGLRTTAQLVTRASAEQLLAASDRFDRPVD 178 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAID-TNDSTIEESRLVRKPTFSPLYF-YRNTCYQWNL 235 ++ W+ + + P + E D STI+ +R KP L + Y+ + Sbjct: 179 TFVQSHWQTGLRPAMILPSGISEIADQLEGSTIQSAR---KPVLEKLRREWLRGRYRRAV 235 Query: 236 H 236 Sbjct: 236 R 236 >gi|114768964|ref|ZP_01446590.1| glycosyl transferase, family 25 [alpha proteobacterium HTCC2255] gi|114549881|gb|EAU52762.1| glycosyl transferase, family 25 [alpha proteobacterium HTCC2255] Length = 249 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. Identities = 55/241 (22%), Positives = 100/241 (41%), Gaps = 10/241 (4%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQK-RQCQF 58 I ++I L ++ RR+ + + DAI G P N I S++K ++ Sbjct: 7 NIKCFIIHLKRANKRRKFVDEIVNNMPIASEIIDAIDGSLLPEKEINSILSNKKICNPKY 66 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPHLSKCDINNIL 117 ++L EIGC++SH W+RI +I+EDD + F++ L + Sbjct: 67 PFNINLGEIGCFLSHREAWQRIVDQKLEAGLIIEDDVRVNTPIFNKSLNFTLRYIRKYKY 126 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 I+F ++P + + + I P + RT+ + EAA+ LLN K I RP+D Sbjct: 127 IQFQV--REPNRRADIIETENKLKILNPLPVHLRTSAQLVSFEAAVELLNKTKKIDRPVD 184 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAI-DTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLH 236 ++ +WE I + + + ST+ + + + Y+ + Sbjct: 185 TTLQMFWETKIKCYCVNQSGISDHTLEAGGSTLAHKNHSKHKILRNI---KRLEYRLKIF 241 Query: 237 Y 237 Y Sbjct: 242 Y 242 >gi|260779651|ref|ZP_05888541.1| putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio coralliilyticus ATCC BAA-450] gi|260604460|gb|EEX30764.1| putative Lex2B protein (lipooligosaccharide 5G8 epitope biosynthesis-associated protein) [Vibrio coralliilyticus ATCC BAA-450] Length = 249 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. Identities = 56/240 (23%), Positives = 88/240 (36%), Gaps = 13/240 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNP-ICNRIFSHQKRQCQFKR 60 + ++VI+LP S RR + ++ + + FF+AI G P + R+ R Sbjct: 1 MKIFVINLPRSVDRRISMEAQLKKLGVANYEFFEAIDGSQMPSYLADKVDDEHRKKFRSR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS E G Y S+ LW++ +I+EDD D F + + K ++ Sbjct: 61 PLSYGERGIYASNYLLWEKCVQQNEPF-LIMEDDVVLEDCFVSICNRVEKLHSKGF--EY 117 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 L + +H S T Y I A LL + +D + Sbjct: 118 FRLGISDTSEVPQVIDVNENIVHWCDNQSGSTRCYSISPRGAAKLLKRSEKWICAVDNYI 177 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIE-----ESRLVRKPTFSPLYFYRNTCY-QWN 234 + IP L P A +A +STI+ + L+ K FYR WN Sbjct: 178 GEAYRTKIPCLGIMPYASSKA--EYESTIQLGEKSKVSLINKLNRELYRFYRFLRMSVWN 235 >gi|317009185|gb|ADU79765.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori India7] Length = 281 Score = 170 bits (432), Expect = 1e-40, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 96/276 (34%), Gaps = 49/276 (17%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFIIEDLWAFCKNKKHPPCTLKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQKT--PIKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLEYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAISCDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCYQWNL 235 ++ +S E+ R + P + + +++T QWN+ Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRFLHKSTIKQWNI 270 >gi|163803283|ref|ZP_02197162.1| 3-deoxy-D-manno-octulosonic-acid kinase [Vibrio sp. AND4] gi|159172920|gb|EDP57758.1| 3-deoxy-D-manno-octulosonic-acid kinase [Vibrio sp. AND4] Length = 246 Score = 170 bits (431), Expect = 2e-40, Method: Composition-based stats. Identities = 49/252 (19%), Positives = 92/252 (36%), Gaps = 18/252 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 + ++VI+L S RRE + + F FF+ +NP ++ K Sbjct: 1 MKIFVINLKRSVDRREHVSNLLR--GIDFEFFEGEDIADNPEHEIYSLYDASKTLKYKGY 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+ PE+GC+ S I+LW+ +I ED+ + L +++ ++K Sbjct: 59 TLTTPELGCFASQINLWRHCVKINEP-VLIFEDNIKLYGDLKAQLNNINSLVSQYGIVKL 117 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 ++ K + + L + + + + Y I E A L++ + + P+D + Sbjct: 118 GNYFER--KYTKVAVLDDTYSLVSCAESACGNSAYAITPEVANKYLSILQGFFEPVDDFI 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR------KPTFSPLYFYRNTCYQWN 234 + W + P V + STI ++ K YR + Sbjct: 176 DNEWRTGQTLVSYSPNLVARSD--TCSTIGIRKIKGKISCLNKVNVEAYRLYRQLRQSF- 232 Query: 235 LHYNAWRKDLPP 246 YN K P Sbjct: 233 --YNIRHKRNIP 242 >gi|224437264|ref|ZP_03658236.1| hypothetical protein HcinC1_04787 [Helicobacter cinaedi CCUG 18818] Length = 267 Score = 169 bits (429), Expect = 3e-40, Method: Composition-based stats. Identities = 57/273 (20%), Positives = 106/273 (38%), Gaps = 24/273 (8%) Query: 3 IPVYVISLPFSHARREKFCHRA----ARIHLQFS--FFDAIYGENNP-ICNRIFSHQKRQ 55 + +++I+LP + R+ + +++ ++ FF+AI + + + +S K Sbjct: 1 MKLFIINLPQATERKALMSKQFENLDSKLKEKYEIIFFNAINAQAKEHLNFKQYSKIKSI 60 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + LS E C+ SH LW++ S I+LEDD + F Q L + + Sbjct: 61 LFRGKELSDGERACFASHYTLWQKCVESNEP-IIVLEDDVEILPHFWQELERIEQ--SAF 117 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 ++ L K K F+ T GY++ AA + K+ YRP Sbjct: 118 AYVRLMYLITKAKFYDLQKGFHFTFNNV------AGTQGYYLTPTAAKAFIESAKSWYRP 171 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLYFYRNTCY 231 +D M ++ H IP + +P +T S + + + L+ R + Sbjct: 172 VDDYMDMFYIHKIPIVCVKPVIQEVRANTTISGRWQKPSLLLKIVRECVRLLFQTRRFVF 231 Query: 232 QWNLHYNAWRKDLPPVSTTKF-LPSSSSSLIKS 263 + LP S K L +S +++S Sbjct: 232 LFFFKNTLL---LPKESLQKIHLGRENSEILRS 261 >gi|126740757|ref|ZP_01756442.1| glycosyl transferase, family 25 [Roseobacter sp. SK209-2-6] gi|126718053|gb|EBA14770.1| glycosyl transferase, family 25 [Roseobacter sp. SK209-2-6] Length = 237 Score = 169 bits (429), Expect = 3e-40, Method: Composition-based stats. Identities = 49/233 (21%), Positives = 89/233 (38%), Gaps = 8/233 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 + +I +P S R R+ +A+ G + + + Sbjct: 1 MRSLIIHMPSSGKRAANVAQLLERLP-NAEVAEAVNGREVVATNAQELRQGDMHRPHYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNILIK 119 LS E+GC++SH W+RI + +I+EDD E + L +S I+ Sbjct: 60 PLSPGEVGCFLSHRRCWQRIVDAGWDYGMIVEDDLSLEPEQWRDTLALISAHADAESFIR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A K+ + + G + P+++ +T +G++AA LL + + RP+D Sbjct: 120 LPA--KRREAPREVIAEEGQSKLFLPKVIGLQTVAQVVGRKAAKRLLTATEVLDRPVDTF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAI-DTNDSTIEESRLVRKPTFS-PLYFYRNTC 230 ++ W H P V E + STI++ + K + YR Sbjct: 178 LQMHWIHGQRIQTILPNGVSELTQELGGSTIQKRKSSNKISREFKRSLYRAQV 230 >gi|57168030|ref|ZP_00367169.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter coli RM2228] gi|57020404|gb|EAL57073.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter coli RM2228] Length = 253 Score = 168 bits (427), Expect = 5e-40, Method: Composition-based stats. Identities = 58/250 (23%), Positives = 97/250 (38%), Gaps = 29/250 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICN-RIFSHQ 52 + ++I+L S R+E + ++ L+F FF A+ +N + Sbjct: 1 MKAFIINLERSLDRKEYMQRQNQKLFEKNPSLKNKLEFIFFKAVDAKNKEYLEFKQHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF + + L Sbjct: 61 WASWVLGRELSDGEKACFASHYKLWQECIKLDKP-IIILEDDVEFSDEFLNNGEEYIEEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 SK ++F L D L N+ + ++ T GY + AA ++ Sbjct: 120 SK--SEYEYVRFCYLF-----DKKFYPLNENYILSFEKL--AGTQGYVLKPSAANKFISK 170 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRN 228 K IY P+D M +++H + ++V +P + R +K L +R Sbjct: 171 AKFIYTPVDDYMDMFYKHKVLNIVKKPLLLKHNCKLESEISNLGRTNKK-----LKIHRK 225 Query: 229 TCYQWNLHYN 238 ++ Y Sbjct: 226 IVREFFRLYR 235 >gi|187920688|ref|YP_001889720.1| glycosyl transferase family 25 [Burkholderia phytofirmans PsJN] gi|187719126|gb|ACD20349.1| glycosyl transferase family 25 [Burkholderia phytofirmans PsJN] Length = 275 Score = 168 bits (427), Expect = 5e-40, Method: Composition-based stats. Identities = 65/278 (23%), Positives = 113/278 (40%), Gaps = 28/278 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 +PV+VISLP S RR + + F DA+ G + N ++ + ++ R Sbjct: 6 VPVHVISLPRS-GRRGAIAALLSGRGVAFRIEDAVDGRLLTHSELNAVYDDTAARRRYGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN----I 116 ++ E+ C++SH +W++I A+ILEDDA F + + H + + + Sbjct: 65 SMTSAEVACFLSHRSVWRKIV-DNGCAAVILEDDAILEPAFFERVLHADESQLAAVAGIV 123 Query: 117 LIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLNVR 169 L+ LR+ +YL G + P + + + GY+I +AA L Sbjct: 124 LLGRSKLRRAASFWTYLNEPLRWRASVGGLRVGVPFKQWTSGSVGYWISAQAARQALAYS 183 Query: 170 KNIYRPIDMDMKHW-W---EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 + PI + W W E P AV+E + S+IE+ R R + L+ Sbjct: 184 EG---PIGALLDDWPWHRDEGGARVAELRPYAVWEDFERLPSSIEQERKARIKPRASLH- 239 Query: 226 YRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSLIKS 263 W L V+ + SS S ++++ Sbjct: 240 ---VVALWPLRLARTAVRWSVVALQRL--SSGSDVVRA 272 >gi|315452598|ref|YP_004072868.1| Beta-1,4-galactosyltransferase [Helicobacter felis ATCC 49179] gi|315131650|emb|CBY82278.1| Beta-1,4-galactosyltransferase [Helicobacter felis ATCC 49179] Length = 263 Score = 168 bits (426), Expect = 6e-40, Method: Composition-based stats. Identities = 53/260 (20%), Positives = 96/260 (36%), Gaps = 43/260 (16%) Query: 3 IPVYVISLPFSH-------ARR-EKFCHRAARIHLQFSFFDAIYGENNPICN-------- 46 + +++I L + R E + F+AIY + + + Sbjct: 1 MRIFIIHLSKATCTQWGLKERNIEGLLQSLEQNKHPVEVFEAIYSQTSEGLHPLVAKHLH 60 Query: 47 -------------RIFSHQKRQCQF-----KRLLSLPEIGCYISHIHLWKRIAYSPAIGA 88 + + + C + +SL E+GC+ SH LW+R Sbjct: 61 PFFIHPSVQASTPKNWLSYLQACWYAFQQEGVPMSLGELGCFASHYALWQRCVALQEP-I 119 Query: 89 IILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL 148 ILEDD F + L ++ ++ L P + + + G F+I +P Sbjct: 120 CILEDDIALEPHFVENLDYIESYIPRLHWVRLMHLFDYPLETTPIL---GVFEI-KPFTW 175 Query: 149 SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 T GY I +AA L + P+D M++ + H + + V +P + E ++ Sbjct: 176 GSGTQGYIITPKAASKFLKASQKWVMPVDCVMENTYLHGVKNYVIKPFVIRE--NSTTGN 233 Query: 209 IEESRLVRKPTFSPLYFYRN 228 I R V P +PL F++ Sbjct: 234 ITRDRNVTCP--APLRFFKK 251 >gi|113866928|ref|YP_725417.1| glycosyltransferase involved in LPS biosynthesis [Ralstonia eutropha H16] gi|113525704|emb|CAJ92049.1| glycosyltransferase involved in LPS biosynthesis [Ralstonia eutropha H16] Length = 257 Score = 168 bits (425), Expect = 1e-39, Method: Composition-based stats. Identities = 51/241 (21%), Positives = 93/241 (38%), Gaps = 11/241 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 I YVI+L + RRE+ + A + + F F A+ G + ++ R Sbjct: 2 IGAYVINLDSARERRERISGQLAGLGIPFQVFRAVDGRVLAEQEVASKYDAAAAAARY-R 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILI 118 +S E+GC +SH+ +++++ A A++LEDDA + + +L L ++ Sbjct: 61 PMSRGEVGCGLSHLGVYRKMLDDGAAFALVLEDDASPGPDLATVLADLESRLDPDRAEVV 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + K + + L G + + R GY + + AA LL + ++ D Sbjct: 121 LLSHVDKYTRWGA--RRLGGERRLVRRYGEWWRAHGYVVTRAAARRLLAGLQPMWCAADY 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV----RKPTFSPLYFYRNTCYQWN 234 + + P + S++E R K S Y+ R YQ Sbjct: 179 WSAFEKQGLVSVSAVVPYCIGLTELAETSSLETHRADLDATDKAQRSIGYYLRRYVYQRF 238 Query: 235 L 235 L Sbjct: 239 L 239 >gi|317011021|gb|ADU84768.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SouthAfrica7] Length = 283 Score = 167 bits (424), Expect = 1e-39, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 91/270 (33%), Gaps = 45/270 (16%) Query: 3 IPVYVISLP----FSHARRE-KFC---HRAARIHLQFSFFDAIYGENNP----------- 43 + V++I L + + +E + + FDAIY +++P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKLQGISYEIFDAIYSKSSPTQLHPLILEHL 60 Query: 44 ------ICNRIFSHQKRQ------------CQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 F K+ + + E+GCY SH LW++ Sbjct: 61 HLSFMIEDLLAFCENKKHPPCAFKNFFHAIKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L + I+ L + K Sbjct: 121 -AICILEDDITLKEHFKESLEFCRQHINELGYIRLMHLEENVAKQKTPVK---GVSQILN 176 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 T GY + +AA LL K PID M + H + + V E A++ + ++ Sbjct: 177 FKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIFCDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 T +S E+ R + P + + +++ Sbjct: 237 TQNSNTEKQRPKKLPLSIRIGRFLHKSAVK 266 >gi|222084495|ref|YP_002543024.1| glycosyltransferase protein [Agrobacterium radiobacter K84] gi|221721943|gb|ACM25099.1| glycosyltransferase protein [Agrobacterium radiobacter K84] Length = 279 Score = 167 bits (423), Expect = 1e-39, Method: Composition-based stats. Identities = 59/251 (23%), Positives = 92/251 (36%), Gaps = 11/251 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRL 61 + +YVI+L S R E+ C +AAR L AI G P +RI F Q+ R Sbjct: 25 VCIYVINLDRSRERWERLCGQAARYGLNVIRVAAIDGAKIPEGDRIDFQQQQFVYHNGRK 84 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E GCY SH+ ++ S AII+EDD + ++ L+K Sbjct: 85 LLAGEYGCYRSHLLALQQFIDSGDKMAIIMEDDVELNERLIPRALSAMNSVCGARLVKLV 144 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 R L N + + Y + + AA LL K + P D+ + Sbjct: 145 NHRL--VGFKPLSETTENDVVGRCMHGPQGSAACYIVNRSAAKKLLITLKPMLLPYDIAL 202 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL----- 235 + W + + T + +D+TI + R + R T Y + Sbjct: 203 ERGWSTGVETFSTLENIADFSPHRSDTTIGKRVHYRAVKRH--FLLRATAYWFRACDQLH 260 Query: 236 HYNAWRKDLPP 246 ++ K P Sbjct: 261 RWHYTVKAKPQ 271 >gi|308184590|ref|YP_003928723.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SJM180] gi|308060510|gb|ADO02406.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori SJM180] Length = 284 Score = 167 bits (422), Expect = 2e-39, Method: Composition-based stats. Identities = 53/272 (19%), Positives = 96/272 (35%), Gaps = 49/272 (18%) Query: 3 IPVYVISLP----FSHARRE-KFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + +E + + FDAIY +++P Sbjct: 1 MRVFIIHLSPKTCHNFSLKETHITPLLESLKLQGVSYEIFDAIYSKSSPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFVIEDLLAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L K I+ L + K + G I Sbjct: 119 NE-AICILEDDIVLKECFKESLEFCDKHINELGYIRLMHLEENVAKQKT--PVKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSTKEWVMPIDCVMDRHYWHGVKNYVLEEFAITCDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 +++ +S E+ R + P + + +++ Sbjct: 235 MNSQNSNTEKQRPKKLPLSIRIGRFLHKSAVK 266 >gi|163757900|ref|ZP_02164989.1| glycosyl transferase, family 25 [Hoeflea phototrophica DFL-43] gi|162285402|gb|EDQ35684.1| glycosyl transferase, family 25 [Hoeflea phototrophica DFL-43] Length = 252 Score = 166 bits (421), Expect = 3e-39, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 87/218 (39%), Gaps = 7/218 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQ-CQF 58 I ++I L + ARR + H + DA+ G ++S + + Sbjct: 6 DIEAFIIHLARATARRPQVDHLLKACPVPARVLDAVDGRAMSEAEIDAVYSRKSMHAPHY 65 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNIL 117 ++ E+GC++SH W+ I +++EDD + D F+ L + + + Sbjct: 66 PFEMTAGEVGCFLSHRKAWQAIVACGRDAGLVIEDDVEIDADVFATALEFAREHVSAHGI 125 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 ++F ++ + L + +P I+ R + + AA LL RPID Sbjct: 126 VQFQV--RRIAEPGPLIASTDAVVLARPMIIPLRASCTLYSRSAAERLLAQTGRFDRPID 183 Query: 178 MDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRL 214 ++ W + L+ P V + A D +TI+ + Sbjct: 184 GHVQLHWVTGLRPLIAVPSGVRDKAADIGGTTIQARNV 221 >gi|84515249|ref|ZP_01002611.1| hypothetical protein SKA53_01286 [Loktanella vestfoldensis SKA53] gi|84510532|gb|EAQ06987.1| hypothetical protein SKA53_01286 [Loktanella vestfoldensis SKA53] Length = 256 Score = 166 bits (421), Expect = 3e-39, Method: Composition-based stats. Identities = 53/248 (21%), Positives = 96/248 (38%), Gaps = 14/248 (5%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFK 59 + +++I+L +R + + + L ++ F AI G + + R+ + Sbjct: 8 DLGIWLINLDRDTSRLARMQQQLDAMGLDYTRFGAIDGRARADELAPRV-NAAAYARNMG 66 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 + +S IGCY SH+ +W ++A + A+ILEDD F D+F L ++ Sbjct: 67 QAVSPGHIGCYASHVAVWDQVAQATQPVALILEDDVVFHDDFLDSLDAALAGAAAWDTLR 126 Query: 120 FDALRKK-PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 F+ +R K P YL G + ++ Y + ++ A L R D Sbjct: 127 FNCIRAKWPVCQGYL----GRYRLNAYIGPFTGNATYLVKRDVAGRLAPAFWPQTRAHDH 182 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI-----EESRLVRKPTFSPLYFYRNTCYQW 233 +M ++ H+ EP + D STI ++ R P Y + Y Sbjct: 183 EMNRFFHHDFRQFGLEPFS-SHVDDGGVSTITGTGFDQVRKFAWYRRLPHYRLKAGNYLR 241 Query: 234 NLHYNAWR 241 + A Sbjct: 242 RAWWLARN 249 >gi|315586530|gb|ADU40911.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 35A] Length = 280 Score = 166 bits (420), Expect = 3e-39, Method: Composition-based stats. Identities = 52/270 (19%), Positives = 91/270 (33%), Gaps = 45/270 (16%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPIC--------- 45 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 46 NRIF--------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 + F + + + E+GCY SH LW++ Sbjct: 61 HSSFMIEDLWAFCKNEKHPPCTLKNFFYALKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L + I+ L + K + G I Sbjct: 121 -AICILEDDIIVKEHFKESLEFCYQHINELGYIRLMHLEENVAKQKT--PIKGVSQILNF 177 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 + T GY + +AA LL K PID M + H + + V E A+ + ++ Sbjct: 178 KD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 T +S E+ R + P + + +++T Sbjct: 237 TQNSNTEKQRPKKLPLSIRIGRFLHKSTIK 266 >gi|126735046|ref|ZP_01750792.1| hypothetical protein RCCS2_14254 [Roseobacter sp. CCS2] gi|126715601|gb|EBA12466.1| hypothetical protein RCCS2_14254 [Roseobacter sp. CCS2] Length = 261 Score = 166 bits (420), Expect = 4e-39, Method: Composition-based stats. Identities = 56/247 (22%), Positives = 100/247 (40%), Gaps = 12/247 (4%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFKR 60 + V++I+L R + + + L F AI G+ ++ Sbjct: 9 DLGVWLINLDRDADRLQHMEDQLRPLDLPCQRFPAIDGKQQVDTLSKRADAAAYARNMGS 68 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + ++GCY SHI +W+ S + A+ILEDD F D+F Q L + + ++F Sbjct: 69 PILPGKMGCYASHIAVWEAFLASDSDVALILEDDVVFHDDFLQSLDLALEASDHWDTVRF 128 Query: 121 DALRKK-PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + +R K P + Y+ G++ ++ Y I ++ A +L R D + Sbjct: 129 NCIRAKLPVRQGYV----GDYSLNAYVGPFTGNAAYLIKRDVAQRVLPNLWPQTRAFDHE 184 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR--LVRKPT---FSPLYFYRNTCYQWN 234 + ++ H+ EP + D N+S+I S LV+K P Y + Y Sbjct: 185 LNRFFLHDFRQCGLEPFS-THVDDGNESSITGSNFALVKKFKWYQRLPHYRLKAANYFRR 243 Query: 235 LHYNAWR 241 + A R Sbjct: 244 AGWLARR 250 >gi|148652794|ref|YP_001279887.1| glycosyl transferase family protein [Psychrobacter sp. PRwf-1] gi|148571878|gb|ABQ93937.1| glycosyl transferase, family 25 [Psychrobacter sp. PRwf-1] Length = 254 Score = 166 bits (420), Expect = 4e-39, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 93/251 (37%), Gaps = 13/251 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R E+ + + + F A G + + ++ + Sbjct: 1 MKVLTYLINLDGSDKRLERSTEQLQQQNWDFERISAYDGRGKALSEFVDYDDEQTRSNLG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-----LSKCDIN 114 R L EIGCY+SH ++ + A I+LEDD S F+ ++ + +++ Sbjct: 61 RSLLNSEIGCYLSHYQCAQKFLETDADMLIVLEDDMKMSSAFAPVVKEAVSYLYTHPELD 120 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 L+ A +KK KD T + R G ++ A + K I Sbjct: 121 WYLVNIAAKKKKLAKD---ITTINEHTLWHAYYFPIRGLGLIWSRKGAEEFVKHGKPITM 177 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRLVRKPTFSPL---YFYRNTC 230 P+D+ + W N L P + ID++ + S+ +++ Y + Sbjct: 178 PVDIFFQRWLSQNGKGLGVWPAVISPMGIDSDILGTQSSQNIQRKDKEGRDGSYNLKKQK 237 Query: 231 YQWNLHYNAWR 241 A + Sbjct: 238 RMLQDKAAAVK 248 >gi|332673390|gb|AEE70207.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 83] Length = 280 Score = 165 bits (419), Expect = 4e-39, Method: Composition-based stats. Identities = 52/270 (19%), Positives = 94/270 (34%), Gaps = 45/270 (16%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNP----------- 43 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 44 ----ICNRIFSHQKRQCQF--------------KRLLSLPEIGCYISHIHLWKRIAYSPA 85 + +++ K + + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNKKHLPCALKNFFYVLKHCGKRMGFGELGCYASHYSLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L + I+ L + K + G I Sbjct: 121 -AICILEDDIIVKERFKESLEFCCQHINELGYIRLMHLEENVAKQKT--PIKGVSQILNF 177 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 + T GY + +AA LL K PID M + H + + V E A+ + ++ Sbjct: 178 KD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDGMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 T +S E+ R + P + + +++T Sbjct: 237 TQNSNTEKQRPKKLPLSIRIGRFLHKSTIK 266 >gi|300021717|ref|YP_003754328.1| glycosyl transferase family 25 [Hyphomicrobium denitrificans ATCC 51888] gi|299523538|gb|ADJ22007.1| glycosyl transferase family 25 [Hyphomicrobium denitrificans ATCC 51888] Length = 244 Score = 165 bits (419), Expect = 5e-39, Method: Composition-based stats. Identities = 52/233 (22%), Positives = 92/233 (39%), Gaps = 16/233 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V+VI+L + R + I F +A+ GE P K L Sbjct: 1 MQVFVINLASATDRLAYISGQ---IGGPFERIEAVRGEAVPE------RLKANFSGTVSL 51 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-CDINNILIKFD 121 EIGCY SH+ + I A++LEDDA + +F +++ K ++ Sbjct: 52 LPGEIGCYASHLIAAENIVARGLPYAVVLEDDAILASDFHEVVETCVKHLPAGWDIVALS 111 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 ++ P L L G + + TT Y + + LL + RP+D+D++ Sbjct: 112 DVKSLP--HCRLSQLSGERWLVRYAHFPKTTTAYVLSQSGCRKLL-ASRPRTRPVDVDIR 168 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 + WE + P + + +S+I +S VR+ + + R + N Sbjct: 169 YGWEMGLNGYGILPPPAGPS-EKFESSIPKS--VRRRFYWRSHPLRYLKGRIN 218 >gi|254476209|ref|ZP_05089595.1| glycosyl transferase, family 25 [Ruegeria sp. R11] gi|214030452|gb|EEB71287.1| glycosyl transferase, family 25 [Ruegeria sp. R11] Length = 238 Score = 165 bits (419), Expect = 5e-39, Method: Composition-based stats. Identities = 61/233 (26%), Positives = 93/233 (39%), Gaps = 10/233 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC--QFKRL 61 YVI L S AR + + DA+ G R ++ Sbjct: 3 KSYVIHLAASTARTDLVSALVQTLP-GGQVLDAVDGRKMSAQERAQVSVPHLLDPRYPFP 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF- 120 L EIGC++SH WK IA A + EDD +D F+ L LI+F Sbjct: 62 LMPSEIGCFLSHRKAWKAIAEGEADFGFVAEDDVVTTDGFADALQIALDHADRQSLIRFP 121 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 A R+KPK + G+ + +P+ + Y +GK+AA L + + RP+D + Sbjct: 122 MAPREKPKS---VVATKGDVTLFRPQEIGLTAALYLLGKDAAHSLWSQSERFDRPVDTWL 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDT-NDSTIEE--SRLVRKPTFSPLYFYRNTC 230 + WE + SL P + A + STI++ SR+ + YR Sbjct: 179 QMRWETGVDSLTLWPSCIKSAAASHGGSTIQKKRSRMDQLKRTWHRARYRAAI 231 >gi|254779221|ref|YP_003057326.1| putative glycosyl transferase, family 25; putative LPS biosynthesis protein [Helicobacter pylori B38] gi|254001132|emb|CAX29090.1| Putative glycosyl transferase, family 25; putative LPS biosynthesis protein [Helicobacter pylori B38] Length = 284 Score = 165 bits (418), Expect = 6e-39, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 85/270 (31%), Gaps = 45/270 (16%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNP----------- 43 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 44 ------ICNRIFSHQKRQ------------CQFKRLLSLPEIGCYISHIHLWKRIAYSPA 85 F K+ + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCENKKHPPCTFKNFFYALKHCGKRMGFGELGCYASHYLLWQKCIELNE 120 Query: 86 IGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP 145 ILEDD + F + L K I+ L + K Sbjct: 121 -AICILEDDIIVKERFKESLEFCDKHINELGYIRLMHLEENVAKQKTPVK---GVSQILN 176 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAID 203 T GY + +AA LL K PID M + H + + V E A+ + ++ Sbjct: 177 FKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDEMN 236 Query: 204 TNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 T +S E+ R + P + +++ Sbjct: 237 TQNSNTEKQRPKKLPLSIRIGRSLHKSAVK 266 >gi|121613507|ref|YP_001000825.1| LPS glycosyltransferase subfamily protein [Campylobacter jejuni subsp. jejuni 81-176] gi|167005737|ref|ZP_02271495.1| LPS glycosyltransferase subfamily protein [Campylobacter jejuni subsp. jejuni 81-176] gi|12837556|gb|AAK08966.1|AF334961_2 unknown [Campylobacter jejuni] gi|40218030|gb|AAR82939.1| unknown [Campylobacter jejuni subsp. jejuni 81-176] gi|87249290|gb|EAQ72251.1| LPS glycosyltransferase subfamily [Campylobacter jejuni subsp. jejuni 81-176] Length = 251 Score = 165 bits (417), Expect = 8e-39, Method: Composition-based stats. Identities = 57/235 (24%), Positives = 98/235 (41%), Gaps = 26/235 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKNKEHLEFKDHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF ++ + L Sbjct: 61 WGSWVLGRELSDGEKACFASHYKLWQECVKLDEP-IIILEDDVEFSDEFLNNGAEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K K++ +R + L NF + + T GY I +A+ ++ Sbjct: 120 LKS-------KYEYIRLCYLTQGKMLKLNDNF-LFTLDSIG-GTQGYLIKPISALKFIDN 170 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 K +P+D M ++ + ++ P + +STIEE + ++P F + Sbjct: 171 LKFWIKPVDNVMDMYYYSKVFTITYTPLLLKTTDI--NSTIEERKKDKRPIFKRI 223 >gi|47213906|emb|CAF95848.1| unnamed protein product [Tetraodon nigroviridis] Length = 601 Score = 165 bits (417), Expect = 8e-39, Method: Composition-based stats. Identities = 55/265 (20%), Positives = 98/265 (36%), Gaps = 39/265 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +++I+L RR K A + L F+ DA+ G+ + ++ Sbjct: 320 IFLINLKRRLDRRTKMLKTFAALGLHFTLTDAVDGKALNTSQLQALGIEMLPRYKDPYSG 379 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 R+L+ EIGC++SH +W ++ ++LEDD F F + L + + ++ Sbjct: 380 RVLTRGEIGCFLSHHSIWTQVLERGLEKVLVLEDDVRFEPRFKRRLQAIMDDIDRAQLDW 439 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + ++ G ++ + S T GY + + A LL Sbjct: 440 DLI--YVGRKRMQVQQPERSVDGVNNLVEA-DYSYWTLGYALSRRGARKLL-AADPFTKM 495 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M H+ ++ + EP +Y DT STI + Sbjct: 496 LPVDEFLPILYNKHPNTEYMSHFEPRDLRAFSVEPLLIYPTHYTGEPGYISDTETSTIWD 555 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLH 236 V T + T Q + Sbjct: 556 DEAVA--TDWDRQQAQKTAQQGRIR 578 >gi|225025752|ref|ZP_03714944.1| hypothetical protein EIKCOROL_02656 [Eikenella corrodens ATCC 23834] gi|224941533|gb|EEG22742.1| hypothetical protein EIKCOROL_02656 [Eikenella corrodens ATCC 23834] Length = 264 Score = 164 bits (416), Expect = 9e-39, Method: Composition-based stats. Identities = 56/258 (21%), Positives = 100/258 (38%), Gaps = 23/258 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQK---RQCQFK 59 IP Y+ISL RRE AR+ ++ +F DA+ + + ++ + + Sbjct: 2 IPAYIISLANQQDRREHMRQECARVGIEATFIDAVDMRQASQSDIECLSSRLLHKKTKKQ 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH------LSKCDI 113 R L+ E+GC +SH ++ I A++LEDDA+F LL ++ Sbjct: 62 RWLTRGELGCALSHHQVYAHIVRQQHPYALVLEDDAEFIRNPQPLLNEGYLKALSAQYPF 121 Query: 114 NNILIKF---------DALRKKPKKDSYLCTLPGN--FDIHQPRILSPRTTGYFIGKEAA 162 + +++ + R+ P K GN + T Y I +E A Sbjct: 122 DILILGYVKTLPHQLPYYYRRIPIKHRAKMNTDGNTIYFGTPWEQYGCGTVAYIISREGA 181 Query: 163 IHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPT 219 L ++ D + H + L + P V EA++ DSTI + + L K + Sbjct: 182 EKLCRATQSPCATADDWLYFEQHHGLRILHSRPAFVLEALEKFDSTIRQEKAGFLQPKTS 241 Query: 220 FSPLYFYRNTCYQWNLHY 237 + + ++Y Sbjct: 242 SVIIRSIKGYLKHIGMNY 259 >gi|294678480|ref|YP_003579095.1| family 25 glycosyl transferase [Rhodobacter capsulatus SB 1003] gi|294477300|gb|ADE86688.1| glycosyl transferase, family 25 [Rhodobacter capsulatus SB 1003] Length = 258 Score = 164 bits (415), Expect = 1e-38, Method: Composition-based stats. Identities = 53/245 (21%), Positives = 91/245 (37%), Gaps = 8/245 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKR-QCQFKR 60 +PV VISL + AR+ + ARI + ++ F+ I G + + Sbjct: 7 DLPVQVISLARATARQAQVAQEFARIGMGYTIFEGIDGAAHQAELLGRTDLAAWHRNMGA 66 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 +S +GCY SH+ LW++I +I EDD F+ +F Q L +++F Sbjct: 67 PISAGHMGCYASHVELWRQIGAEGPEIVLICEDDVTFTADFPQALAAGLAMAERWDILRF 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 +R K + G F + Y I + A L I R D ++ Sbjct: 127 SCIRAKGRLPQARL---GPFTLMAYWGPFTGNGCYLIKRAVAARLTERFYPIRRAHDHEL 183 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTN----DSTIEESRLVRKPTFSPLYFYRNTCYQWNLH 236 +++H+I + EP + + ++ K P Y + Y L Sbjct: 184 NRFFDHDIRLMGLEPFTAPPRDRGESFITGTAMAGAKKFPKYKRLPHYRQKLANYARRLI 243 Query: 237 YNAWR 241 + A Sbjct: 244 WLARH 248 >gi|108563215|ref|YP_627531.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPAG1] gi|107836988|gb|ABF84857.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPAG1] Length = 294 Score = 164 bits (415), Expect = 1e-38, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 88/273 (32%), Gaps = 49/273 (17%) Query: 2 PIPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI--------- 44 + V++I L + + + FDAIY + +P Sbjct: 11 NMRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEH 70 Query: 45 ----------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY 82 + F + + C + + E+GCY SH LWK+ Sbjct: 71 LHPSFMVEDLWAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWKKCIE 128 Query: 83 SPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI 142 ILEDD + F + L + I+ L + K Sbjct: 129 LNE-AICILEDDIIVKERFKESLEFCRQHINELGYIRLMHLEENVAKQKTPVK---GVSQ 184 Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-E 200 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 185 ILNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACD 244 Query: 201 AIDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 +++ +S E+ R + P + + ++N Sbjct: 245 GMNSQNSNTEKQRPKKLPLSIRIGRFLHKNAVK 277 >gi|126729290|ref|ZP_01745104.1| Glycosyl transferase, family 25 [Sagittula stellata E-37] gi|126710280|gb|EBA09332.1| Glycosyl transferase, family 25 [Sagittula stellata E-37] Length = 265 Score = 163 bits (414), Expect = 2e-38, Method: Composition-based stats. Identities = 50/246 (20%), Positives = 100/246 (40%), Gaps = 8/246 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR-IFSHQKRQCQFKRL 61 + YVI+L S RR+ + AR +F +A+ G +R ++S +F R Sbjct: 1 MLTYVINLERSVDRRQVLTEKLARDSWPHTFVNAVDGRALSAQDRALYSPLLAFLRFGRG 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNIL 117 L E+ CY+SH+ ++ + ++LEDD D++ ++ L++ + Sbjct: 61 LKAGELACYLSHLKCLRQFLATDEPVCLVLEDDVDWTPGTGAVVQELAQVLGQHGEPWEV 120 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + K+P + + + + + + T +E A H L+ + I+ P+D Sbjct: 121 VNLGHVSKRPALAAPIARIAEDNAVSRIFDFPLTTHALLWNREGARHFLHRSRRIFDPVD 180 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT-FSPLYFYRNTCYQWNLH 236 + + + L V++A DS I+ S T P+ +R + Sbjct: 181 VAARRIYSGTGLGLGLIARLVFQAD--FDSEIDASGSAGSGTASKPVEAWRRLRARQLRK 238 Query: 237 YNAWRK 242 Y+ R Sbjct: 239 YHLIRA 244 >gi|15645424|ref|NP_207598.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] gi|2313935|gb|AAD07853.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] Length = 284 Score = 163 bits (413), Expect = 2e-38, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 86/272 (31%), Gaps = 49/272 (18%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLWAFCKNKKHPPCALKNFFYAIKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD D F + L I+ L + K Sbjct: 119 NE-AICILEDDIIIKDRFKESLEFCRHHINELGYIRLMHLEENVAKQKTPVK---GVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ++ +S E+ R + P + +++T Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRSLHKSTIK 266 >gi|294651467|ref|ZP_06728780.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194] gi|292822617|gb|EFF81507.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194] Length = 252 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 87/252 (34%), Gaps = 7/252 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLEGSDQRLANATAQLQQAGWAFSRFPAYDGRGKALSEFENYDDAAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L E+GCY+SH K+ + A ++LEDD F Q L L + ++ Sbjct: 61 RSLISSELGCYLSHYGCAKKFLETDADYLVVLEDDIQVLPNFKQNLNSLISYLDQHKELE 120 Query: 120 FDALRKKPKKDSYL--CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + KK + I R G + A + + K + P+D Sbjct: 121 WYVVNLAAKKKKLAKDIVQIDGYTIWHAYYFPIRGVGLVWSRAGAEAFVELGKTMQVPVD 180 Query: 178 MDMKHWWEHNIPSLVTE-PGAVYEAIDTND-STIEESRLVRK--PTFSPLYFYRNTCYQW 233 + + W N L P ID++ T+ + RK S + ++ W Sbjct: 181 IFFQSWLSKNGKGLGVWQPFVQPAGIDSDILGTVATQGIKRKALENRSASHGFKKQKRMW 240 Query: 234 NLHYNAWRKDLP 245 + A R L Sbjct: 241 RDRFYAIRNLLS 252 >gi|226954450|ref|ZP_03824914.1| glycosyl transferase [Acinetobacter sp. ATCC 27244] gi|226834799|gb|EEH67182.1| glycosyl transferase [Acinetobacter sp. ATCC 27244] Length = 252 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 86/252 (34%), Gaps = 7/252 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLEGSDQRLANATAQLQQAGWAFSRFPAYDGRGKALSEFENYDDAAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L E+GCY+SH K+ + A ++LEDD F Q L L + ++ Sbjct: 61 RSLISSELGCYLSHYGCAKKFLETDADYLVVLEDDIQVLPNFKQKLNSLISYLDQHKELE 120 Query: 120 FDALRKKPKKDSYL--CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + KK + I R G + A + + K + P+D Sbjct: 121 WYVVNLAAKKKKLAKDIVQIDGYTIWHAYYFPIRGVGLVWSRAGAEAFVELGKTMQVPVD 180 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTND--STIEESRLVRK--PTFSPLYFYRNTCYQW 233 + + W N L V A +D T+ + RK S + ++ W Sbjct: 181 IFFQSWLSKNGKGLGVWQPFVQPAGIDSDILGTVATQGIQRKALENRSASHGFKKQKRMW 240 Query: 234 NLHYNAWRKDLP 245 + A R L Sbjct: 241 RDRFYAIRNLLS 252 >gi|208434716|ref|YP_002266382.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori G27] gi|208432645|gb|ACI27516.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori G27] Length = 284 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. Identities = 50/272 (18%), Positives = 88/272 (32%), Gaps = 49/272 (18%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQKTPVK---GVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKAAQKLLKYSTKEWVMPIDCVMDRHYWHGVKNYVLEEFAITCDE 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ++ +S E+ R + P + + +++T Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRFLHKSTVK 266 >gi|296161257|ref|ZP_06844065.1| glycosyl transferase family 25 [Burkholderia sp. Ch1-1] gi|295888415|gb|EFG68225.1| glycosyl transferase family 25 [Burkholderia sp. Ch1-1] Length = 275 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 61/274 (22%), Positives = 112/274 (40%), Gaps = 26/274 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 +PV+VISL S RR+ F DA+ G + + ++ + ++ R Sbjct: 6 VPVHVISLSRS-GRRDAIARFLTDHGAAFRIEDAVDGRSLAQSELDAVYDDAAARRRYGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI----NNI 116 ++ E+ C++SH +W++I + A++LEDDA F + + ++ ++ + + Sbjct: 65 SMTRAEVACFMSHRSVWRKIVETG-RAAVVLEDDAMLEPAFFEHVLRANEPELSAVADIV 123 Query: 117 LIKFDALRKKPKKDSYL------CTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLNVR 169 L+ LR+ +Y T G + P + + GY+I AA L Sbjct: 124 LLGRSKLRRTASSWTYFNEPLRRVTGVGGLRVGVPFKQWTSGAVGYWISAHAARRALAYS 183 Query: 170 KNIYRPIDMDMKHW-W---EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 + RP+ + W W E + P AV+E D S+IE+ R R S + Sbjct: 184 E---RPLGALLDDWPWHRDEGGARVVELRPYAVWEDFDRLPSSIEDER--RANIRSRASW 238 Query: 226 YRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSS 259 + W L V+ +F P + ++ Sbjct: 239 H--DVALWPLRLARTAARWSVVALQRFSPVNEAA 270 >gi|262371669|ref|ZP_06064948.1| glycosyl transferase, family 25 [Acinetobacter junii SH205] gi|262311694|gb|EEY92779.1| glycosyl transferase, family 25 [Acinetobacter junii SH205] Length = 252 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 47/248 (18%), Positives = 88/248 (35%), Gaps = 7/248 (2%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK 59 M + Y+I+L S R + + FS F A G + + Q Sbjct: 1 MKVVTYLINLDGSDQRLASATAQLNQAGWDFSRFSAYDGRGKALSEFENYDDLAAQKILG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L E+GCY+SH ++ + A ++LEDD + F Q L L + + Sbjct: 61 RSLINSELGCYLSHYGCAEKFLETDADYLVVLEDDIEVLPNFKQKLESLLSYLDQHKELD 120 Query: 120 FDALRKKPKKDSYLCTLPG--NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + KK + ++ + R G ++ A + + K + P+D Sbjct: 121 WYTVNIAAKKKKLAKDIVEFDHYTLWHAYYFPIRGVGLVWSRKGAEQFVQLGKIMTVPVD 180 Query: 178 MDMKHWWEHNIPSLVTE-PGAVYEAIDTND-STIEESRLVRK--PTFSPLYFYRNTCYQW 233 + + W L P +D++ T+ + RK S + ++ W Sbjct: 181 IFFQSWLSKTGKGLGVWQPFVQPAGLDSDILGTVAAQGIKRKQLENRSASHGFKKQKRMW 240 Query: 234 NLHYNAWR 241 + A + Sbjct: 241 RDRFYAIK 248 >gi|57241911|ref|ZP_00369851.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] gi|57017103|gb|EAL53884.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] Length = 243 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 53/239 (22%), Positives = 101/239 (42%), Gaps = 25/239 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + +++I+L S R+E + +++ HL+F FF+A+ G+ + F Sbjct: 1 MKIFIINLLRSTHRKESIRKQISKLFSENPSLKDHLEFIFFEAVDGDGGEEFAK-FHPAW 59 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCD 112 + R LS E GC+ SH LW++ G ++EDD +F+ F ++ L + K Sbjct: 60 AKYLNGRKLSPGEKGCFASHYKLWQKCLELNE-GIFVIEDDVEFTQAFNAKNLEKILKSP 118 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ +R+K L +F+I + T GYFI AA + + + Sbjct: 119 --YEYVRIHYIREKSS-----YALNNHFNITFSNV--AGTRGYFIKPSAAKKFIKHTRIL 169 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRLVRKPTFSPLYFYRNTC 230 PID M + + +++ +P + + ++ + E K T +RN Sbjct: 170 INPIDDSMDY---STVSNIIFKPLFLQDFDTASDIAATRERERNYKLTRELYRLWRNVK 225 >gi|301608466|ref|XP_002933810.1| PREDICTED: procollagen galactosyltransferase 2-like [Xenopus (Silurana) tropicalis] Length = 616 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 59/289 (20%), Positives = 109/289 (37%), Gaps = 39/289 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +++I+L RR + +Q FDA+ G+ + + Sbjct: 334 IFMINLKRRQDRRVRMLRSLYEQEIQVKIFDAVDGKALNASQLKAMKIEVIPGYHDPYSG 393 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ EIGC++SH ++WK + ++++EDD F F +L+ + + ++ Sbjct: 394 RTLTSGEIGCFLSHYYIWKEVVDRGLEKSLVIEDDVRFEPLFKHKLMKLMNDIEEAEVEW 453 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T GY I ++ A L+ + Sbjct: 454 DLIYIG--RKRMQVERPEKAVPSVMNLVEA-DYSYWTLGYAISRQGAEKLI-AAEPFNKM 509 Query: 174 RPIDM-------------DMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D ++++ N+ + EP VY DT STI + Sbjct: 510 LPVDEFLPVMFNKHPVKKYLEYYESRNLKAFSAEPLLVYPTHYTGQPGYFSDTETSTIWD 569 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 + + + Q +H +A KD P T SS L Sbjct: 570 NETLVTDWDRRYSL--KSREQGQIHSDAQNKDALPPQPTLETTSSRDEL 616 >gi|171318340|ref|ZP_02907499.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] gi|171096482|gb|EDT41380.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] Length = 272 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 66/260 (25%), Positives = 101/260 (38%), Gaps = 31/260 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQF 58 M ++VISL S RR R + F F DA C+ + + ++ Sbjct: 1 MTFAIHVISLADS-PRRATIRRAVGRHGVDFDFEDAFDARALDAHACSAMTDTARVIARY 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHLSKCDIN 114 R LS E+GC++SH+ +W++I S AI+LEDDA D F L + Sbjct: 60 GRPLSRGEVGCFMSHVRVWEKIVRSG-RAAIVLEDDAMLDDALFERFRSTPNELLSDHAD 118 Query: 115 NILIKFDALRKKPKKDSYLC------TLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLN 167 +L+ L ++ +YL G I P + + GY+I + A L Sbjct: 119 LVLLGRSKLSRERAAQAYLYEPLKRARRIGGLQIGVPFKQWTSGAVGYWISADGARKALE 178 Query: 168 VRKNIYRPIDMDMKHW-WEH---NIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT---- 219 + P+ + W W + P AV+EA +T S IE R P Sbjct: 179 HARG---PVGALLDDWPWHRDHGGLRIAELRPYAVWEAFETMPSAIEAGRAALSPRRHGA 235 Query: 220 ----FSPLYFYRNTCYQWNL 235 F PL R ++W + Sbjct: 236 LELLFKPLRAVR-AVFRWLI 254 >gi|317180568|dbj|BAJ58354.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 284 Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats. Identities = 52/272 (19%), Positives = 92/272 (33%), Gaps = 49/272 (18%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDSWAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCCQHINELGYIRLMHLEENVAKQKT--PIKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ++T +S E+ R + + + +++T Sbjct: 235 MNTQNSNTEKQRPKKLHLSIRIGRFLHKSTIK 266 >gi|32265822|ref|NP_859854.1| hypothetical protein HH0323 [Helicobacter hepaticus ATCC 51449] gi|32261871|gb|AAP76920.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449] Length = 239 Score = 162 bits (410), Expect = 5e-38, Method: Composition-based stats. Identities = 57/242 (23%), Positives = 92/242 (38%), Gaps = 25/242 (10%) Query: 20 FCHRAARIH----LQFS--FFDAIYG-ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYIS 72 + A++ QF FFDAI G + + +S + +S E C+ S Sbjct: 1 MQQQFAQMSEQQRAQFEIIFFDAIDGQKGEHLAFSQYSKGASIAFRGKEMSDGERACFAS 60 Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 H LW++ I+LEDD + F L +SK + ++ L K K Sbjct: 61 HYSLWQKCVDLNEP-IIVLEDDVMILEHFWTELTRISK--SEYVYVRLTFLEDKVKAFV- 116 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLV 192 LP F I R+ T GY++ A + K YRP+D M ++ H+IP + Sbjct: 117 ---LPNEFYITFSRV--TGTQGYYLTPIGAQGFIASAKTWYRPVDDYMDMFYIHHIPIIC 171 Query: 193 TEPGAVYEAIDTNDSTIEESRLVRKPTFSPL-----YFYRNTCYQWNLHYNAWRKDLPPV 247 +P V ++T T R + P + + Y L +N +P Sbjct: 172 IKP--VLGTMETE--TTIAGRWSKPPLYLKIVRECTRLYFQLKRLIFLTFNRKSLLMPKE 227 Query: 248 ST 249 + Sbjct: 228 AL 229 >gi|222087061|ref|YP_002545596.1| glycosyl transferase protein [Agrobacterium radiobacter K84] gi|221724509|gb|ACM27665.1| glycosyl transferase protein [Agrobacterium radiobacter K84] Length = 278 Score = 162 bits (410), Expect = 5e-38, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 85/236 (36%), Gaps = 9/236 (3%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRLLS 63 ++I L + R + AA + ++ A+ P + ++ ++ +S Sbjct: 43 FIIHLDRARDRLPQVERLAAMLPVRSDIIQAVDASAFPDADIDLVYRRHLHTPRYPFEMS 102 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 + EI C++SH W I ++ EDD + F C I+F Sbjct: 103 VGEIACFLSHRKAWAAIVEQGVDAGLVFEDDVEIDASFHAAFAAAQACLTPGAFIRFPF- 161 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 + K+ G + QP + + ++AAI LL RP+D ++ Sbjct: 162 -RMGKEHGECVLTHGQASVIQPGRVGLGMVAQLVSRDAAIRLLEATALFDRPVDTTVQMR 220 Query: 184 WEHNIPSLVTEPGAVYE-AIDTNDSTIEESR----LVRKPTFSPLYFYRNTCYQWN 234 W + L PG V+E + STI+ + + + PLY + Sbjct: 221 WITGLSPLAVLPGGVHEISSQLGGSTIKSRKTVFEKLSREILRPLYRAKIAIRSRR 276 >gi|317012616|gb|ADU83224.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori Lithuania75] Length = 283 Score = 161 bits (408), Expect = 8e-38, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 91/272 (33%), Gaps = 49/272 (18%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMVEDLLAFCKNKKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYLLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K + G I Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQHINELGYIRLMHLEENVAKQKT--PIKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 + T GY + +AA LL K PID M + H + + V E A+ + Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAIACDE 234 Query: 202 IDTNDSTIEESRLVRKPTFSPL--YFYRNTCY 231 ++ +S E+ R + P + + +++ Sbjct: 235 MNAQNSNTEKQRPKKLPLSIRIGRFLHKSAVK 266 >gi|325268042|ref|ZP_08134688.1| lipooligosaccharide biosynthesis protein [Kingella denitrificans ATCC 33394] gi|324980427|gb|EGC16093.1| lipooligosaccharide biosynthesis protein [Kingella denitrificans ATCC 33394] Length = 263 Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats. Identities = 52/242 (21%), Positives = 92/242 (38%), Gaps = 21/242 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS---HQKRQCQFK 59 IP Y+ISLP RR+ A+ + DA+ + ++ + + Sbjct: 2 IPAYIISLPSETRRRQHVQAECAKYGIPAEMVDAVDMRHIEEAKLRELCSLPLHKKPKKQ 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL------SKCDI 113 R L+ E+GC +SH +++ + A+ILEDDA F + + ++ +CD Sbjct: 62 RYLTKGELGCALSHHQIYQTMLARQQEFALILEDDATFIQDPTPVVQAAFLQRIKQQCDF 121 Query: 114 NNILIKFDALRKKPKKDSYL-CTLPGNFDIHQPRI----------LSPRTTGYFIGKEAA 162 + +L+ + + Y + + P+ Y I ++ A Sbjct: 122 DVLLLGYVKTVARYLPYYYRRIPIKKRVTMSTPQQDFVFGTPWEQFGCGAVAYVITQKGA 181 Query: 163 IHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFS 221 LL++ K P D + + + L P V E ++ DSTI KP S Sbjct: 182 EKLLDITKKPCVPADDWLYFEQQCGLKVLHCRPTLVLEDLENLDSTIRTECPDFWKPKLS 241 Query: 222 PL 223 L Sbjct: 242 SL 243 >gi|147899177|ref|NP_001088623.1| procollagen galactosyltransferase 1-B precursor [Xenopus laevis] gi|82179978|sp|Q5U483|G251B_XENLA RecName: Full=Procollagen galactosyltransferase 1-B; AltName: Full=Glycosyltransferase 25 family member 1-B; AltName: Full=Hydroxylysine galactosyltransferase 1-B; Flags: Precursor gi|55153756|gb|AAH85226.1| Glt25d1a protein [Xenopus laevis] Length = 611 Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats. Identities = 65/270 (24%), Positives = 104/270 (38%), Gaps = 39/270 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + + F DA+YG+ N K + Sbjct: 332 VFLINLKHRQDRRERMKRTLYELQIDFKLVDAVYGKMLNQSNVTEMGIKMLPGYKDPYHG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R L+ E+GC++SH ++WK I+ + +LEDD F F + LL L ++ Sbjct: 392 RPLTRGEMGCFLSHYNIWKEISERNLEVSAVLEDDLRFEIFFKRRLQTLLHDLEIAKLDW 451 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + D +PG ++ S T GY I A LLN + + Sbjct: 452 DLIYLG--RKRMQVDEPEEPVPGVRNLVV-SDYSYWTLGYLISLRGARKLLN-AEPLGKM 507 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ ++ + EP +Y DT S + + Sbjct: 508 LPVDEFLPVMYDKHPISDYSSHFSTRDLRAFSVEPLLLYPTHYTGDKGYISDTETSVLWD 567 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 + V +PT R T Q L A Sbjct: 568 N--VTQPTDWDRAKSRKTHQQEKLRSEALN 595 >gi|209884650|ref|YP_002288507.1| glycosyl transferase [Oligotropha carboxidovorans OM5] gi|209872846|gb|ACI92642.1| glycosyl transferase [Oligotropha carboxidovorans OM5] Length = 256 Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 81/229 (35%), Gaps = 9/229 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M IP Y+I+L R + R + + F+ DA+ G+ Sbjct: 1 MTIPPYLINLARDPERLAEMQARFRALGIAFTRVDAVDGKAMTAAELDTVRAPHPAWL-- 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L E+ C +SH W+ IA A + EDD FS L + ++ ++K Sbjct: 59 PLKPNEVACALSHRKCWQMIADGAASHGCVFEDDMLFSPRLPAFLSDTAWIPVDADIVKI 118 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + D + F + + R R Y + + A LL + + I P+D+ + Sbjct: 119 EESFNRVWIDVPPREVGQGFRLGRVRSTHYRAGAYIVSRTGAKRLLAMTERISLPLDLII 178 Query: 181 KHWWEHN---IPSLVTEPGAVY----EAIDTNDSTIEESRLVRKPTFSP 222 + + + P A++ S + S R+ + Sbjct: 179 FDYALGKASRLATYQMFPALAVQKKGPALEEAGSAVGRSDFKRRGELAR 227 >gi|159184922|ref|NP_354784.2| hypothetical protein Atu1799 [Agrobacterium tumefaciens str. C58] gi|159140201|gb|AAK87569.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 247 Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats. Identities = 53/229 (23%), Positives = 96/229 (41%), Gaps = 7/229 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKR 60 P+Y++S+ + AR EK A+ + L + + G+ +P R F + + + R Sbjct: 6 TFPIYIVSIARARARLEKMLEGASGLGLDLRPVEGVDGKTISPDDWRDFDRRGFELRNGR 65 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 E GCY SHI + + A A+I+EDD F+ +FS+ + + ++ ++K Sbjct: 66 HALPGEYGCYASHIKALETFLATDAPVAVIVEDDVAFTPDFSERVKAMVAAMPDDAIVKL 125 Query: 121 DALRKKPKKDSYLCTLPGNFD--IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 R+K K L F I+ P+ + Y I + AA L + + P D Sbjct: 126 TNHRRKGFKARKTSALGDTFGRCIYGPQ---GSSACYIISRRAAERFLKAARRMTLPFDR 182 Query: 179 DMKHWWEHNIPSLVTEP-GAVYEAIDTNDSTIEESRLVRKPTFSPLYFY 226 ++ W + +T+ + DT T +E R + + Y Sbjct: 183 ALECGWGYGTEVYITDKDFLPFGDPDTLVGTRDEYRGSKFARLKRVPAY 231 >gi|317182117|dbj|BAJ59901.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F57] Length = 280 Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 87/272 (31%), Gaps = 49/272 (18%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFMIEDLWAFCKNKKHPPCTLKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD + F + L + I+ L + K TL Sbjct: 119 NE-AICILEDDIIVKERFKESLEFCYQYINELGYIRLMHLEENVAKQK---TLIKGVSQI 174 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EA 201 T GY + +A LL K PID M + H + + V E A+ + Sbjct: 175 LNFKDGIGTQGYVLAPKATQKLLEYSTKKWVMPIDCVMDRHYWHGVKNYVLEEFAIACDG 234 Query: 202 IDTNDSTIEESRLVRKPTFSP--LYFYRNTCY 231 ++T +S E+ R + P +++ Sbjct: 235 MNTQNSNTEKQRPKKLPLSIRVGRSLHKSAVK 266 >gi|194272156|ref|NP_001123548.1| procollagen galactosyltransferase 2 [Danio rerio] gi|159570814|emb|CAP19485.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 613 Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats. Identities = 58/270 (21%), Positives = 104/270 (38%), Gaps = 39/270 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +++I+L RRE+ + A + L+ + DA+ G+ + + Sbjct: 332 IFLINLKRRFDRRERMLNTMAVLGLEATLVDAVDGKTLNTSQLQALGIEMMPGYKDPYSG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINN 115 R+L+ EIGC++SH WK++ ++LEDD F F + L + K +N Sbjct: 392 RVLTRGEIGCFLSHHFTWKQVLERGLRHVLVLEDDVRFEPRFKRRLQTIMKDVEKTQLNW 451 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + ++ G ++ + S T GY + ++ A LL + + Sbjct: 452 DLI--YVGRKRMQVAQPEVSVEGVNNLVEA-DYSYWTLGYALSQQGAKKLL-AAQPLGKM 507 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M H+ ++ + EP +Y DT STI + Sbjct: 508 LPVDEFLPVMFNKHPNSAYMSHFDPRDLRAFSVEPLLLYPTHYTGEPGYFSDTETSTIWD 567 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 V T + + T Q + A Sbjct: 568 DESVN--TDWDRQYAQKTAQQAQIRSVAQN 595 >gi|56698209|ref|YP_168581.1| glycosyl transferase family protein [Ruegeria pomeroyi DSS-3] gi|56679946|gb|AAV96612.1| glycosyl transferase, family 25 [Ruegeria pomeroyi DSS-3] Length = 273 Score = 160 bits (404), Expect = 3e-37, Method: Composition-based stats. Identities = 48/216 (22%), Positives = 81/216 (37%), Gaps = 8/216 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQF 58 M + VI L + RR + +++ A+ G R + + Sbjct: 38 MNLMGLVIHLERATDRRAQANALVSQLPFAA-LLPAVDGAAMGADARARAVQPGLLKPAY 96 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L EIGC++SH W R+ S A A+I+EDD F+ L + I Sbjct: 97 PFELRPGEIGCFLSHRAAWARLVESEAEAALIIEDDMALEAGFADALALAQRNVAQMGYI 156 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 +F + G + QP + RT+G + + AA LL + + RP+D Sbjct: 157 QFQT----RPVTAPAIDQQGQSVLMQPGLTPLRTSGQLVARWAAERLLALTEPFDRPVDT 212 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAID-TNDSTIEESR 213 ++ W + V P + + + STI + Sbjct: 213 FLQMHWHTGLRLGVISPSGLSDRTEVIGGSTIGAKK 248 >gi|172062730|ref|YP_001810381.1| glycosyl transferase family protein [Burkholderia ambifaria MC40-6] gi|171995247|gb|ACB66165.1| glycosyl transferase family 25 [Burkholderia ambifaria MC40-6] Length = 272 Score = 160 bits (404), Expect = 3e-37, Method: Composition-based stats. Identities = 67/260 (25%), Positives = 103/260 (39%), Gaps = 31/260 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQF 58 M ++VISL S RR R + F F DA + C+ + + ++ Sbjct: 1 MTFAIHVISLADS-PRRATIRRAVGRHGVDFDFEDAFDARALDADACSAMTDTVRVIARY 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLP---HLSKCDIN 114 R LS E+GC++SH+ +W++I S AI+LEDDA D F + L L + Sbjct: 60 GRPLSRGEVGCFMSHVRVWEKIVRSG-RAAIVLEDDAMLDDALFERFLSTPGELLSDHAD 118 Query: 115 NILIKFDALRKKPKKDSYLC------TLPGNFDIHQP-RILSPRTTGYFIGKEAAIHLLN 167 +L+ L + +YL G I P + + GY+I A L Sbjct: 119 FVLLGRSKLSRDRAAQAYLYEPLKRARRIGGLQIGVPFKQWTSGAVGYWISVHGARKALE 178 Query: 168 VRKNIYRPIDMDMKHW-WEH---NIPSLVTEPGAVYEAIDTNDSTIEESRLV---RKP-- 218 + P+ + W W + P AV+EA +T S IE R R+ Sbjct: 179 HARG---PVGAVLDDWPWHRDHGGLRIAELRPYAVWEAFETMPSAIEAGRAALSSRRHGT 235 Query: 219 ---TFSPLYFYRNTCYQWNL 235 F PL R ++W + Sbjct: 236 LELLFKPLRVVR-AVFRWLI 254 >gi|91205319|ref|YP_537674.1| glycosyltransferase [Rickettsia bellii RML369-C] gi|157826924|ref|YP_001495988.1| glycosyltransferase [Rickettsia bellii OSU 85-389] gi|91068863|gb|ABE04585.1| Glycosyltransferase [Rickettsia bellii RML369-C] gi|157802228|gb|ABV78951.1| Glycosyltransferase [Rickettsia bellii OSU 85-389] Length = 290 Score = 159 bits (402), Expect = 4e-37, Method: Composition-based stats. Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 31/238 (13%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR-------------- 47 I ++VI+L S R + I+L + F A+ G I N+ Sbjct: 51 DIKIFVINLDHSTNRYNNINKQFIDINLSYERFSAVDGYELSIANQKGEKFTGLDLKNNS 110 Query: 48 ----------IFSHQKRQCQF--KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA 95 IF + + R L+ E GCY SH +W ++ A+ILEDDA Sbjct: 111 SLLSLDNNYTIFCPSENINYYALNRTLTAGEFGCYCSHREIWLKMVEENIPYALILEDDA 170 Query: 96 DFSDEF-SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG 154 + F + + + L+ + +K + P I+ R T G Sbjct: 171 ILKENFYKKFSTIIKSLPSDWDLVYLFLMYLPRQKFYKIYNNPYLKKINTQRYFYTST-G 229 Query: 155 YFIGKEAAIHLLNVRKNIYRPIDMDM-KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 Y I +AA LL K+ PID + + + I + +T P + D DSTI E Sbjct: 230 YLINLKAATKLLKYSKSFSEPIDDSIAQEVADSRIQAYITTPFLI--NTDFKDSTIIE 285 >gi|194288948|ref|YP_002004855.1| glycosyl transferase [Cupriavidus taiwanensis LMG 19424] gi|193222783|emb|CAQ68786.1| glycosyl transferase [Cupriavidus taiwanensis LMG 19424] Length = 256 Score = 159 bits (402), Expect = 4e-37, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 94/241 (39%), Gaps = 12/241 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQFKR 60 I YVI+L + ARR++ + R+ + F F A+ G R + Q + R Sbjct: 2 IGAYVINLEAAEARRQRIAGQLTRLGVPFQVFPAVNGRALAEDEVARRYDAQAASASY-R 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILI 118 +S EIGC +SH+ +++++ A A++LEDDA D+ +L L+ + ++ Sbjct: 61 PMSRGEIGCALSHLGVYRKMLEDGASLALVLEDDALLGDDVPAVLEALASKMDPDSADVV 120 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + K + + G + + R GY + + AA L+ + + D Sbjct: 121 LLSHVDKFTR---WGIKPLGQRKLVRRYGEWWRAHGYVVTRAAAERLVAGLQPTWCAADY 177 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV----RKPTFSPLYFYRNTCYQWN 234 + P + S++E R K S Y+ R YQ Sbjct: 178 WSAFERRGLVSVRAVVPYCIGLTELAEASSLETHRADLDATDKARRSVGYYLRRYVYQRF 237 Query: 235 L 235 L Sbjct: 238 L 238 >gi|315638778|ref|ZP_07893951.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] gi|315481187|gb|EFU71818.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] Length = 215 Score = 158 bits (401), Expect = 6e-37, Method: Composition-based stats. Identities = 52/222 (23%), Positives = 98/222 (44%), Gaps = 25/222 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQK 53 + +++I+L S R+E + +++ L+F FF+A+ G + F Sbjct: 1 MKIFIINLLRSTHRKESMKEQISKLFSENPSLKDRLEFIFFEAVDGGGGEEFAK-FHPAW 59 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ-LLPHLSKCD 112 + F R LSL E GC+ SH LW++ G ++EDD +F+ F+ L + K Sbjct: 60 AKYLFGRELSLGEKGCFASHYKLWQKCLELNE-GIFVIEDDVEFTQAFNANNLEKILKSP 118 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 ++ +R K++ L NF I + T GY++ AA + + + Sbjct: 119 -------YEYVRIYHIKEAKFYDLELNFKITFSNV--AGTQGYYLKPSAARKFIKKCQFL 169 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR 213 +P+D M + + ++V P + E + ++ +T E R Sbjct: 170 IKPVDDSMDY---SEVLNIVFTPLFLKELPLQSDIATRERER 208 >gi|332970427|gb|EGK09419.1| glycosyltransferase [Kingella kingae ATCC 23330] Length = 263 Score = 158 bits (400), Expect = 7e-37, Method: Composition-based stats. Identities = 58/242 (23%), Positives = 88/242 (36%), Gaps = 21/242 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKRQCQFK 59 I Y+ISL RR +A R L +FFDA+ I + + ++ + + Sbjct: 2 ISTYIISLASETQRRAHMKAQAERYQLNAAFFDAVDMRQATQTDIEHLSVLPKHKKPKKQ 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL--PHLSKCDINNIL 117 R LS E+GC +SH +++ + A ILEDDA F LL +L K Sbjct: 62 RWLSKGELGCALSHHQIYQEMINKQLDYAFILEDDARFLQSPKALLLPENLRKIAAQYDF 121 Query: 118 --------------IKFDALRKKPKKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAA 162 + + R KK + L P Y I K+ A Sbjct: 122 DILILGYVKTLEHQLPYYHRRIPIKKRATLQLPEQTIQFGTPWEQYGCGAVAYVITKKGA 181 Query: 163 IHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFS 221 LLN+ + P D + + L P V E ++ STI + +P S Sbjct: 182 EKLLNITQKPCVPADDWLYFEQHCGVKVLHARPTFVLEDLEQLISTIRVEKANFLQPKLS 241 Query: 222 PL 223 + Sbjct: 242 SI 243 >gi|157073889|ref|NP_001096660.1| procollagen galactosyltransferase 1-A precursor [Xenopus laevis] gi|160385807|sp|A0JPH3|G251A_XENLA RecName: Full=Procollagen galactosyltransferase 1-A; AltName: Full=Glycosyltransferase 25 family member 1-A; AltName: Full=Hydroxylysine galactosyltransferase 1-A; Flags: Precursor gi|117558235|gb|AAI27423.1| Glt25d1b protein [Xenopus laevis] Length = 611 Score = 158 bits (400), Expect = 7e-37, Method: Composition-based stats. Identities = 61/270 (22%), Positives = 102/270 (37%), Gaps = 39/270 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + + + DA+YG+ K + Sbjct: 332 VFLINLKHRQDRRERMKRTLYELQIDYKLVDAVYGKTLNQTQVSELGIKMLPDYKDPYHG 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R L+ E+GC++SH ++WK I+ + + EDD F F + LL L ++ Sbjct: 392 RPLTRGEMGCFLSHYNIWKEISERNLAVSAVFEDDLRFEIYFKRRLQTLLHDLETAKLDW 451 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + D +PG ++ S T GY I A LLN + + Sbjct: 452 DLIYLG--RKRMQVDEPEEPVPGVRNLVV-SDYSYWTLGYLISLRGAKKLLN-AEPLVKM 507 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ ++ + EP +Y DT S + + Sbjct: 508 LPVDEFLPVMYDKHPISDYSSHFSPRDLLAFSVEPLLLYPTHYTGDEGYISDTETSVLWD 567 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 + + +PT R T Q L A Sbjct: 568 N--LTEPTDWDRAKSRKTQQQEKLRSEALN 595 >gi|304391248|ref|ZP_07373192.1| glycosyl transferase family 25 [Ahrensia sp. R2A130] gi|303296604|gb|EFL90960.1| glycosyl transferase family 25 [Ahrensia sp. R2A130] Length = 276 Score = 158 bits (399), Expect = 9e-37, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 87/229 (37%), Gaps = 11/229 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y+I+L S R E + A++ L F +A+ G + + + + ++ Sbjct: 22 IYLINLDRSPDRLETQREQFAKLGLSFERVEAVDGSGLTFPHPDLDEKGFLRRHGKTINP 81 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 E+GC +SH+ + S A+ILEDDAD +D+F Q++ ++K + Sbjct: 82 NELGCTLSHMRAIRMFGDSNNDFAVILEDDADLADDFEQVVEAAIAQANRWDMVKLNQRH 141 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 + L ++ + GY I ++AA + + P D + Sbjct: 142 SGM--PLTVSQLTPDYSLVTYLAKQSGAVGYLINRKAAQQYIKNLLPMRVPYDHEFDRGH 199 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIE--------ESRLVRKPTFSPLYF 225 + + P V + + STI + V KP P Y Sbjct: 200 AYGLRVFGVLPPVVKDNLAA-GSTIGYGYVSDTTGEKPVTKPRKLPRYK 247 >gi|225025759|ref|ZP_03714951.1| hypothetical protein EIKCOROL_02663 [Eikenella corrodens ATCC 23834] gi|224941540|gb|EEG22749.1| hypothetical protein EIKCOROL_02663 [Eikenella corrodens ATCC 23834] Length = 270 Score = 158 bits (399), Expect = 1e-36, Method: Composition-based stats. Identities = 60/244 (24%), Positives = 93/244 (38%), Gaps = 19/244 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 Y+ISL SH RR+ + + FSFFDAI ++ N + LSL Sbjct: 5 YIISLATSHTRRQHIFSQFNSKSIPFSFFDAI--SSSEQLNSATTRLLPLFSEVDYLSLG 62 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NNILIKF 120 E GC++SHI LW+ I EDD ++ ++ L + S +I+ Sbjct: 63 EKGCFMSHISLWQHCLDQNLPYIGIFEDDIWLGEQANKFLTNDSWLQEFFPINEPFIIRL 122 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + + + + S N I++ T GY I + A LL+ +N + PI Sbjct: 123 ETVYEPCQIKSMPRNSYHNHQIYKLCSPHHGTGGYIISQAAIKWLLSYLRNTHLDDFTPI 182 Query: 177 DMDMKHWW--EHNIPSLVTEPG-AVYEAI-----DTNDSTIEESRLVRKPTFSPLYFYRN 228 D+ M I P + EAI + S IE R + P S + Sbjct: 183 DVLMFEHLIPHTKINVYQLNPAICIQEAILHPQESSLGSQIEVERSKKNPHISKRKLPQK 242 Query: 229 TCYQ 232 + Sbjct: 243 IKRE 246 >gi|148652793|ref|YP_001279886.1| glycosyl transferase family protein [Psychrobacter sp. PRwf-1] gi|148571877|gb|ABQ93936.1| glycosyl transferase, family 25 [Psychrobacter sp. PRwf-1] Length = 255 Score = 157 bits (398), Expect = 1e-36, Method: Composition-based stats. Identities = 53/249 (21%), Positives = 93/249 (37%), Gaps = 14/249 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VISL + RR+ A+ + + FFDA+ P ++ + L Sbjct: 1 MKKIVISLSSATDRRQHIESEFAKHQVDYEFFDAL----TPDVATPYAKKLGLDVSSANL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C +SH+ LW++I A I EDD ++ S LL + +IK +A Sbjct: 57 TPGEVACMMSHVALWEKIINEDLEYATIFEDDIYLGEDSSALLNRTDWIKPDWHIIKIEA 116 Query: 123 LRKKP-KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN-IYRPIDMDM 180 KK + L + Q + + T GY + K+ A L+ K P+D M Sbjct: 117 FAKKTYLANKTHEVLANKRHVTQLKGKNLGTAGYILSKQGAKIYLDYLKRHALLPLDRVM 176 Query: 181 KHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQ 232 + H +P+L + +++ S + + R R + Sbjct: 177 FDDFVHQGAEPVQQLVPALCIQEMLLHKNNPALPSALLKERQDRMKAEKKKGLAKVNKEA 236 Query: 233 WNLHYNAWR 241 L A + Sbjct: 237 SRLITQAKK 245 >gi|260582795|ref|ZP_05850581.1| glycosyltransferase [Haemophilus influenzae NT127] gi|260094121|gb|EEW78023.1| glycosyltransferase [Haemophilus influenzae NT127] Length = 257 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 89/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRELFFSQKNIE--DFQIFSAINTMQKDCNELATIFNIEQFKAHYGR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNEISEDSYALVCEDDALFHPNFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|327277409|ref|XP_003223457.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2-like [Anolis carolinensis] Length = 631 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 59/289 (20%), Positives = 111/289 (38%), Gaps = 39/289 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ + +A+ G+ + ++ Sbjct: 349 IFLINLKRRKDRRDRMLQTLHEQEIAVKIVEAVDGKTLNTSQLKALNIDMLPGYQDPYSS 408 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP----HLSKCDINN 115 R+L+ EIGC++SH ++WK + +++EDD F +F + L + + ++ Sbjct: 409 RVLTRGEIGCFLSHYYIWKEVVDRELEKTLVIEDDVRFEHQFKKKLTKLMDDIERAQLDW 468 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ +P S T GY I + A L+ + Sbjct: 469 ELIYIG--RKRMQVERPEKAVPNVVNLVEP-DYSYWTLGYAISLQGAQKLIE-AQPFSKM 524 Query: 174 RPIDM-------------DMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+++ ++ + EP VY DT STI + Sbjct: 525 LPVDEFLPVMYNKHPVAKYMEYYEPRDLKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWD 584 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 + V T R T Q +H +A KD P + S+ L Sbjct: 585 NETVA--TDWDRTLSRKTRQQGQIHTDAQNKDALPPQPSLGTSSARDEL 631 >gi|163744085|ref|ZP_02151451.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis 2.10] gi|161382632|gb|EDQ07035.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis 2.10] Length = 237 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 50/242 (20%), Positives = 86/242 (35%), Gaps = 9/242 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC--QFKR 60 + Y+I + R A + +A+ G + + Sbjct: 1 MRSYIIHMAGDQKRAPNVARLLADLP-HAQVVEAVDGRAVMAAGDVAIRTGDLHAPHYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDINNILIK 119 LS E+GC++SH W+ IA +I+EDD +L + + +I+ Sbjct: 60 PLSGGEVGCFLSHRKCWQLIANGIDDYGLIVEDDMATDPGIWRDVLALIDSHAGPDSMIR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A K+ + S + G + PR + +T +GK AA LL K + RP+D Sbjct: 120 LPA--KQRETASTVIAAHGAAQLFLPRRIGLQTVAQVVGKTAAARLLTATKVLDRPVDTF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 ++ W H+ P V E + +SR V L + Y+ + Sbjct: 178 LQMHWVHDQTIHTVLPNGVSEETAALGGSTIQSRPVGGKLAREL---KRMLYRAQVALRP 234 Query: 240 WR 241 R Sbjct: 235 QR 236 >gi|82779115|ref|YP_405464.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae Sd197] gi|293417086|ref|ZP_06659713.1| beta1,4-galactosyltransferase WaaX [Escherichia coli B185] gi|309784398|ref|ZP_07679037.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae 1617] gi|81243263|gb|ABB63973.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae Sd197] gi|291431117|gb|EFF04110.1| beta1,4-galactosyltransferase WaaX [Escherichia coli B185] gi|308927905|gb|EFP73373.1| putative beta1,4-galactosyltransferase WaaX [Shigella dysenteriae 1617] Length = 257 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 94/241 (39%), Gaps = 15/241 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +P+Y++SL RR K R+++ F FFDAI ++ I + R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFQRLNINFDFFDAIDAKDPQNKEII--DKMRLSGVGA 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 ++ EI C +SH +++ + A+ILEDD +++F + L + + + + Sbjct: 59 EMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLSEKDKLKHNN 118 Query: 115 -NILIKFDALRKKPKKDSYLCTLPG-----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +L L P L + ++ + RT Y + K+ A LL + Sbjct: 119 LYLLGGQKGLHDYPVLGQSLFSKVKISTCTFRRVNFNKNKIRRTCSYLMNKDMAQKLLKL 178 Query: 169 RKNIYRP-IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 K+ D H I + ++ ++ +S +E RL+ P + Sbjct: 179 TKDYGTYRADSWKLMHQHHIIKEFYLDEIILHPILNEFNSHLESERLLTSEKKQPRTRLQ 238 Query: 228 N 228 Sbjct: 239 K 239 >gi|48243736|gb|AAT40841.1| putative Lex2B [Haemophilus influenzae] Length = 175 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 39/152 (25%), Positives = 68/152 (44%), Gaps = 3/152 (1%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 ++GCY SH +W++ I+LEDDA F + F ++L ++ R Sbjct: 1 GQLGCYASHYSMWEKCVELDYP-IIVLEDDAKFKNNFLEVLDFINSDKNTFEFFWLQPDR 59 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 K K+ L + GN I+Q TTGY++ +AA L K Y +D+ M ++ Sbjct: 60 LKNKR--KLISNFGNLSIYQFSKGFAGTTGYYLTPQAARKFLTQSKEWYLTVDVTMDRFF 117 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 E+ +P P + + + + E+ + R Sbjct: 118 ENKVPPYAIVPFCLEDDGEIESTIYEKQKKRR 149 >gi|119385481|ref|YP_916537.1| glycosyl transferase family protein [Paracoccus denitrificans PD1222] gi|119375248|gb|ABL70841.1| glycosyl transferase, family 25 [Paracoccus denitrificans PD1222] Length = 262 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. Identities = 58/246 (23%), Positives = 90/246 (36%), Gaps = 9/246 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 +PVY+I+L S R + + F A G I + R Sbjct: 11 VPVYLINLDGSDERLRSATRQLDEAGIPFERVPAFDGRALRIEEFPDYDPAGAMAYMGRP 70 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINNIL 117 L EIGCY+SH+ +R S A ++ EDD F++ L L + D + L Sbjct: 71 LRGGEIGCYLSHLDCARRFLDSGAEYGVVFEDDMQLKPGFAKGLRILSDWLDRHDRDWDL 130 Query: 118 IKFDALRKKPKKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I A + K + G + D+ + TTG ++ A ++ + I + Sbjct: 131 INIGAGQHKIFTPVMGFEVAGRHHDLTRAHYFPMTTTGLIWSRQGAETFVSTHRRITAGV 190 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT-FSPLYFYRNTCYQWNL 235 D +HW + L P V +S IE+ + R + PLY Q Sbjct: 191 DNHFRHWLTRSDRGLAVWPPLVTTTG--VESQIEDGKGKRSASGRHPLYGLIKQRRQLIN 248 Query: 236 HYNAWR 241 AWR Sbjct: 249 KLIAWR 254 >gi|163739820|ref|ZP_02147227.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis BS107] gi|161386854|gb|EDQ11216.1| glycosyl transferase, family 25 [Phaeobacter gallaeciensis BS107] Length = 237 Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats. Identities = 50/242 (20%), Positives = 87/242 (35%), Gaps = 9/242 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC--QFKR 60 + Y+I + R A + +A+ G + + Sbjct: 1 MRSYIIHMAGDQKRAPNVARLLADLP-HAQVVEAVDGRAVMAAGDVAIRTGDLHAPHYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDINNILIK 119 LS E+GC++SH W+ IA +I+EDD +L + + +I+ Sbjct: 60 PLSGGEVGCFLSHRKCWQLIANGIDDYGLIVEDDMATDPGIWRDVLALIDSHAGPDSMIR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 A K+ + S + G + PR + +T +GK AA+ LL K + RP+D Sbjct: 120 LPA--KQRETASTVIAAHGAAQLFLPRRIGLQTVAQVVGKTAAVRLLTATKVLDRPVDTF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 ++ W H+ P V E + +SR V L + Y+ + Sbjct: 178 LQMHWVHDQTIHTVLPNGVSEETAALGGSTIQSRPVGGKLAREL---KRMLYRAQVALRP 234 Query: 240 WR 241 R Sbjct: 235 QR 236 >gi|301611908|ref|XP_002935453.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1-B-like [Xenopus (Silurana) tropicalis] Length = 610 Score = 155 bits (393), Expect = 4e-36, Method: Composition-based stats. Identities = 59/270 (21%), Positives = 102/270 (37%), Gaps = 39/270 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +++I+L RRE+ + + + DA+YG+ K + Sbjct: 331 IFLINLKHRQDRRERMKRTLYELQIDYKLVDAVYGKTLNQTQVDKMGIKMLPGYKDPYHG 390 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R L+ E+GC++SH ++WK I+ + + EDD F F + LL L ++ Sbjct: 391 RPLTRGEMGCFLSHYNIWKEISERSLEASAVFEDDLRFEIFFKRRLQTLLHDLEVAKLDW 450 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +PG ++ S T GY I A LL+ + + Sbjct: 451 DLIYLG--RKRMQVEEPEEPVPGVRNLVV-SDYSYWTLGYLISLRGARKLLD-AEPLVKM 506 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ ++ + EP +Y DT S + + Sbjct: 507 LPVDEFLPVMYDKHPNSNYSAHFSPRDLLAFSVEPLLLYPTHYTGDEGYISDTETSVLWD 566 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 + V +PT R T Q L A Sbjct: 567 N--VTQPTDWDRDKSRKTQQQEKLRSEALN 594 >gi|325473997|gb|EGC77185.1| hypothetical protein HMPREF9353_01535 [Treponema denticola F0402] Length = 210 Score = 155 bits (393), Expect = 4e-36, Method: Composition-based stats. Identities = 48/210 (22%), Positives = 96/210 (45%), Gaps = 3/210 (1%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +V++L + R++ I + + FF A+YG++ ++ + + + + KR L Sbjct: 1 MRTFVLNLEHNTERKKYMQDILKGIPIDYEFFPAVYGKSITDIDQFYDSKLAEKRAKRQL 60 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 ++ EIGC +SH ++K++ A+ILEDD F F ++ +S+ ++ N ++ Sbjct: 61 NVGEIGCALSHKAIYKKMIDENISQALILEDDISFLPNFFEVYTAMSQFNVGNKVVLLGT 120 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 KKP K + L N+ ++ T GY IG +AA + + ++ D Sbjct: 121 TAKKPMKKVWKKNLFNNYSMYLVLNSYGGTYGYIIGLDAAKRIYYHNEKVFIEADKW--K 178 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 ++ + P V E + S I + Sbjct: 179 YYRRLSQIWLVSPSIV-EVNEVFPSEIGDD 207 >gi|239831361|ref|ZP_04679690.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] gi|239823628|gb|EEQ95196.1| Glycosyltransferase 25 family member precursor [Ochrobactrum intermedium LMG 3301] Length = 194 Score = 155 bits (393), Expect = 4e-36, Method: Composition-based stats. Identities = 46/179 (25%), Positives = 74/179 (41%), Gaps = 5/179 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKR 60 +PVYVI+L S R ++ A + ++ +A+ G+ F + + Sbjct: 4 SVPVYVINLARSRDRWDRLKSNADALSIELRRVEAVEGKLLSSEELTDFDEAGFRRWHGK 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + EIGCY SHI + IA +P A+I+EDD F+ EF L HL+K +K Sbjct: 64 IAMPAEIGCYFSHIRALEIIADAPEPFAVIVEDDIVFTPEFKPFLTHLTKA-AGWDAVKL 122 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDM 178 R + F I + + Y + +E A LL + + P D+ Sbjct: 123 VNHR--TAAFQSFQRIDSQFSIGRCLHGPLGSSAAYVVTREGASKLLKALRPMRLPYDV 179 >gi|332992059|gb|AEF02114.1| glycosyl transferase family protein [Alteromonas sp. SN2] Length = 227 Score = 155 bits (392), Expect = 5e-36, Method: Composition-based stats. Identities = 41/178 (23%), Positives = 76/178 (42%), Gaps = 2/178 (1%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 F A +FS AI G ++ + ++ +S + + + + L+L EIGC++SH W Sbjct: 2 FTQSAKHYSAEFSVITAIKGSSDNVMHQGYSKRANKLNYYKKLTLGEIGCFLSHRKAWSF 61 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN 139 I A++LEDD F F QL + K + I + + +K + + Sbjct: 62 IVEHKLAFAVVLEDDVMFEHSFQQLDKVIDKINEPWDYINLNEVHEKRS--ATTVFMSEG 119 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGA 197 + + L + I A LL ++ RP+D++++ I +L +P Sbjct: 120 VSVVKYNKLPIGASAQVISYSGAEKLLKYTESFSRPLDVELQWLAVDKIKALGIKPYP 177 >gi|254487426|ref|ZP_05100631.1| glycosyl transferase, family 25 [Roseobacter sp. GAI101] gi|214044295|gb|EEB84933.1| glycosyl transferase, family 25 [Roseobacter sp. GAI101] Length = 238 Score = 154 bits (390), Expect = 9e-36, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 86/237 (36%), Gaps = 8/237 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + +I +P S ARR + DA+ G + + F+ + Sbjct: 1 MKSLIIHMPSSTARRPNADRLLKDLP-DAQLVDAVDGRDPAQIAGVQTFNGNLHSPPYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILIK 119 L EIG + SH W+ I A+I+EDD S ++ L + ++ ++ Sbjct: 60 ALKPAEIGVFQSHRKCWQLIVDQGWDYAMIVEDDLAVSPPHMARALALIDAHMTPDMYLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K + + + G+ + P + + +G++AA LL + I RP+D Sbjct: 120 LPP--KMRETPARVIAQDGDMRLILPERIGLQCCCQVVGRKAAERLLAGSEKIDRPVDTW 177 Query: 180 MKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEES-RLVRKPTFSPLYFYRNTCYQWN 234 ++ W P P E A STI++ R K + T + Sbjct: 178 LQMHWITGQPVHALLPNGNAEIAGQIGGSTIQQKTRTSGKLMRELKRAWYRTRVRMR 234 >gi|71065765|ref|YP_264492.1| glycosyl transferase family protein [Psychrobacter arcticus 273-4] gi|71038750|gb|AAZ19058.1| probable glycosyl transferase family 25, LPS biosynthesis [Psychrobacter arcticus 273-4] Length = 255 Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 91/238 (38%), Gaps = 14/238 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR+ + ++ F FFDA+ P ++ L Sbjct: 1 MQNHVISLTTATDRRQHITQEFTKQNISFEFFDAL----IPDLAHTYAQTLPIDLENIAL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C++SH +W+++ I EDD ++ LL S +IK +A Sbjct: 57 TGGELACFMSHASVWQQMIDQNIPYLAIFEDDVYLGEDAEALLTTTSWIKPEWHIIKIEA 116 Query: 123 LRKKPKKDSYLCTLPGNFD-IHQPRILSPRTTGYFIGKEAAIHLLNV-RKNIYRPIDMDM 180 +K S + + I Q + + T GY + A L+ KN RP+D M Sbjct: 117 FSEKVFLSSNSSKIISDRRCIAQLKGRNLGTAGYILSLRGAQVYLDYISKNKLRPLDELM 176 Query: 181 -KHWWEHN-------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTC 230 + +H P+L + + + + S + + R R + ++ Sbjct: 177 FDDFIKHGAEPVYQMTPALCIQEMLLNKGNPSLPSALIKDRQGRMKSEKKPAMFKAKR 234 >gi|254708263|ref|ZP_05170091.1| glycosyl transferase family protein [Brucella pinnipedialis M163/99/10] Length = 245 Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 85/217 (39%), Gaps = 7/217 (3%) Query: 18 EKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 E + R+ QF +A+ G + S + ++ LS EIGC++SH Sbjct: 1 EFMASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPLSPAEIGCFLSHRKCL 59 Query: 78 KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLP 137 ++IA A + EDD S S+ L ++K DA + + + Sbjct: 60 EKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYGHEVLISNPVKN-E 118 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE---HNIPSLVTE 194 G + I + R +T GY + +EAA LL + + P+D + + ++ Sbjct: 119 GPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFDPNDGPFNDFEIYQIS 178 Query: 195 PGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 P ++ +STI ++R ++ +R Sbjct: 179 PAICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 213 >gi|167412429|gb|ABZ79878.1| unknown [Campylobacter jejuni] Length = 259 Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats. Identities = 52/222 (23%), Positives = 91/222 (40%), Gaps = 14/222 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + +I+L + R+E + L + +A+ G+ I + K KR Sbjct: 1 MKFLIINLKKAKQRKEYISKLCQKYQLDYEIIEAVEGKAISKQEYLNIVDYDKMLNFHKR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L L E+GC +SH +++I A+ILEDDA F + + L + ++ + L+ Sbjct: 61 ELGLGELGCSLSHKKCYEKILQEKLKYAVILEDDAYFDENLLEFLQYFNEFPKDLELLLL 120 Query: 121 DALRKKPKKDSYLCTLP---------GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 R+ D + P N+ I + T GYFI K A+ LL+ + Sbjct: 121 GHQRQVYSDDGFRIESPYSRRFAKKISNYKIRRLIARGNGTYGYFITKNGALKLLSHLEK 180 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDT-NDSTIEES 212 IY PID + + + P VY + +S+ ++ Sbjct: 181 IYLPIDALTCNEKV--LNTYALFPVLVYTHENFMFESSTQDD 220 >gi|16273584|ref|NP_439839.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae Rd KW20] gi|260581038|ref|ZP_05848860.1| conserved hypothetical protein [Haemophilus influenzae RdAW] gi|7387855|sp|P71398|LSG4_HAEIN RecName: Full=Lsg locus putative protein 4 gi|1574550|gb|AAC23343.1| lipopolysaccharide biosynthesis protein, putative [Haemophilus influenzae Rd KW20] gi|260092278|gb|EEW76219.1| conserved hypothetical protein [Haemophilus influenzae RdAW] Length = 257 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 90/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRELFFSQKNTE--DFQVFSAINTMQKDWDELAAIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDIN-- 114 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 115 ---------NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 N FD P S+LC G+ + P T GY I K AA Sbjct: 120 IILLGQSKINDFNDFDLEINYPTTFSFLCKKTGDVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLIVIENP-VLISNLESVR 229 >gi|325293185|ref|YP_004279049.1| glycosyl transferase, family 25 [Agrobacterium sp. H13-3] gi|325061038|gb|ADY64729.1| Glycosyl transferase, family 25 [Agrobacterium sp. H13-3] Length = 247 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 51/227 (22%), Positives = 90/227 (39%), Gaps = 5/227 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 PVY+IS+ + AR EK + A+ + L + + G+ P F+ + + + R Sbjct: 7 FPVYIISIARARARLEKMLNGASGLRLDLRPVEGVDGKTIPAAEWTDFNRRGFELRNGRH 66 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 E GCY SHI + + A A+I+EDD F+ +FS + + N ++K Sbjct: 67 ALPGEYGCYASHIKALEIFLATDAPVAVIVEDDVTFTPDFSDRVKAMVAIMPENSIVKLT 126 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG-YFIGKEAAIHLLNVRKNIYRPIDMDM 180 R + + L + + ++ Y I + A L + + P D + Sbjct: 127 NHRHRGFRARKTSALGD--KLGRCIYGPQGSSACYIISRGGAEEFLKAARVMTLPFDRAL 184 Query: 181 KHWWEHNIPSLVTEP-GAVYEAIDTNDSTIEESRLVRKPTFSPLYFY 226 + W + VTE + DT T E R + + + Y Sbjct: 185 ECGWGYGTHVYVTEKDFLPFGDPDTLVGTRAEYRGSKFARLNRVPAY 231 >gi|191167804|ref|ZP_03029610.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli B7A] gi|309797494|ref|ZP_07691885.1| LPS glycosyltransferase [Escherichia coli MS 145-7] gi|190902147|gb|EDV61890.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli B7A] gi|308118930|gb|EFO56192.1| LPS glycosyltransferase [Escherichia coli MS 145-7] Length = 257 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 94/241 (39%), Gaps = 15/241 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +P+Y++SL RR K R+++ F FFDAI ++ I + R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEII--DKMRLSGVGA 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 ++ EI C +SH ++K + A+ILEDD +++F + L + + + + Sbjct: 59 EMTDGEIACTLSHQLIYKDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLSEKDKLKHNN 118 Query: 115 -NILIKFDALRKKPKKDSYLCTLPG-----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +L L P L + ++ + RT Y + K+ A LL + Sbjct: 119 LYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTCSYLMNKDMAQKLLKL 178 Query: 169 RKNIYRP-IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 K+ D + I + ++ ++ +S +E RL+ P + Sbjct: 179 TKDYGTYRADSWKLMHQHNIIKEFYLDEIILHPILNEFNSHLESERLLTSEKKQPRTRLQ 238 Query: 228 N 228 Sbjct: 239 K 239 >gi|254719629|ref|ZP_05181440.1| lipooligosaccharide biosynthesis protein lic2B [Brucella sp. 83/13] Length = 244 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 87/221 (39%), Gaps = 6/221 (2%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 + A++ F A+ G P + L+ EIGC++SH + Sbjct: 1 MTSQFAKLGADFVKIPAVDGREMPPEALKAVTATERPW-AAPLTPTEIGCFLSHRRCLEA 59 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN 139 IA + A+++EDD F+D+ +LL H + ++K + KK + Sbjct: 60 IARNDDPYAVVVEDDVIFADDAGKLLLHSEWVPHDADIVKIETQGKKVLIGKPIACASTR 119 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM---KHWWEHNIPSLVTEPG 196 + + + + GY + ++AA ++ + + PID + + + + P Sbjct: 120 YSVARLYSVHILAAGYIVSRDAARRIVAEMEKVSAPIDHFLFNAPYGVFNQLSVYQCTPA 179 Query: 197 AVYEAIDTNDSTIEESRLVRKPTF--SPLYFYRNTCYQWNL 235 +A T+ E S+L +KP F L + T + + Sbjct: 180 LCKQAGLTSTLQSERSQLYQKPPFVGRVLREIKRTFKRAGI 220 >gi|301018954|ref|ZP_07183177.1| LPS glycosyltransferase [Escherichia coli MS 69-1] gi|3821849|gb|AAC69683.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli] gi|300399448|gb|EFJ82986.1| LPS glycosyltransferase [Escherichia coli MS 69-1] Length = 257 Score = 153 bits (388), Expect = 2e-35, Method: Composition-based stats. Identities = 52/241 (21%), Positives = 94/241 (39%), Gaps = 15/241 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +P+Y++SL RR K R+++ F FFDAI ++ I + R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEII--DKMRLSGVGA 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-----SKCDINN 115 ++ EI C +SH ++K + A+ILEDD +++F + L + K NN Sbjct: 59 EMTDGEIACTLSHQLIYKDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLPEKDKLKHNN 118 Query: 116 ILI--KFDALRKKPKKDSYLCTLPG-----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + L P L + ++ + RT Y + K+ A LL + Sbjct: 119 LYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTCSYLMNKDMAQKLLKL 178 Query: 169 RKNIYRP-IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 K+ D + I + ++ ++ +S +E RL+ P + Sbjct: 179 TKDYGTYRADSWKLMHQHNIIKEFYLDEIILHPILNEFNSHLESERLLTSEKKQPRTRLQ 238 Query: 228 N 228 Sbjct: 239 K 239 >gi|238919223|ref|YP_002932738.1| glycosyltransferase [Edwardsiella ictaluri 93-146] gi|19113667|gb|AAL25629.2| putative glycosyltransferase [Edwardsiella ictaluri 93-146] gi|238868792|gb|ACR68503.1| putative glycosyltransferase [Edwardsiella ictaluri 93-146] Length = 261 Score = 153 bits (388), Expect = 2e-35, Method: Composition-based stats. Identities = 50/248 (20%), Positives = 95/248 (38%), Gaps = 21/248 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY---GENNPICN-RIFSHQKRQCQF 58 + ++ISL ARR R F FFDA+ ++ + + + + Sbjct: 1 MKTFIISLKDEVARRNSISDRLHHH--DFEFFDAVDLRVASDDVLSAVKCKNINYKHPAI 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHLSKCDIN 114 + ++ EIGC +SH+ L+K+I A ++EDDA + + + L +I+ Sbjct: 59 REHMTKGEIGCALSHMQLYKKIVDDNLAFARVIEDDAVILNHDEKILNDFILALEMKNID 118 Query: 115 NILIKFDALRKKP--------KKDSYLCTLPGNFDIHQP-RILSPRTTGYFIGKEAAIHL 165 ++ + K K+ G + + P R + T GY + + A + Sbjct: 119 WDIMLLGYSKLKACDSFGFYLKEPIKNIVKSGAYSLGIPFRNWTCGTVGYLVSQSGAKKM 178 Query: 166 LNVRK--NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 +N + D + +N+ L P V E ++ +S+IE R S + Sbjct: 179 INNAAFGKVCTVADDWLFFEKNYNLKILHIRPLIVLEDFESYESSIETDRKTYVNHESHI 238 Query: 224 YFYRNTCY 231 F+ Sbjct: 239 SFFLRVMR 246 >gi|226954444|ref|ZP_03824908.1| glycosyltransferase LpsA [Acinetobacter sp. ATCC 27244] gi|226834793|gb|EEH67176.1| glycosyltransferase LpsA [Acinetobacter sp. ATCC 27244] Length = 270 Score = 153 bits (388), Expect = 2e-35, Method: Composition-based stats. Identities = 55/253 (21%), Positives = 97/253 (38%), Gaps = 21/253 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL ++ RR+ + + F FFDA+ P ++ K L Sbjct: 1 MKNYVISLTYADQRRDHILSIFQKQAIDFCFFDAV----TPATMEQIANVLGLDITKTDL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C +SH LW++ I EDD +E + L +IK +A Sbjct: 57 AKSEVACLLSHAALWQKAIDENLSHIAIFEDDIHLGEEATIFLTKSDWIPEGCSIIKLEA 116 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDM 180 K DS +LP + + + GY + ++AA LLN P+D + Sbjct: 117 FYPKVGVDSSFRSLPSKRKLSLLKTVHMGCGGYILSQQAARDLLNLLATYEQLIPVDHIV 176 Query: 181 KHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS-----PLYFYR 227 + +P+L + + + ++ S +E+ R +RK + L F + Sbjct: 177 FKDYMTKFPNQIFQMLPALCIQDHLLGQHVN-FPSFLEKDRNIRKGEYEYQIKPKLNFVQ 235 Query: 228 NTCYQWNLHYNAW 240 + L +A Sbjct: 236 KIQRE-RLRLSAQ 247 >gi|145631596|ref|ZP_01787362.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] gi|144982796|gb|EDJ90322.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] Length = 270 Score = 153 bits (387), Expect = 2e-35, Method: Composition-based stats. Identities = 66/267 (24%), Positives = 98/267 (36%), Gaps = 31/267 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI +N IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRELFFSQKNTE--DFQIFSAINTMQKNWDELAAIFNIEQFKAYYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I ++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAENSYTLVCEDDALFHLDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR------LV 215 + K + D + NI + V P V E S +E R L Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVRGSLSNNLF 237 Query: 216 RKPTFSPLYFYRNTCYQWNLHYNAWRK 242 +K PL W + + K Sbjct: 238 KKLMKYPLKKSLRLRKIWLIKGRNYAK 264 >gi|145637636|ref|ZP_01793291.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittHH] gi|145269159|gb|EDK09107.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittHH] Length = 257 Score = 153 bits (387), Expect = 2e-35, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 90/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI +N IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKNWDELAAIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLIVIENP-VLISNLESVR 229 >gi|319776043|ref|YP_004138531.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3047] gi|319896851|ref|YP_004135046.1| udp-glcnac--lipooligosaccharide n-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3031] gi|317432355|emb|CBY80710.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3031] gi|317450634|emb|CBY86854.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae F3047] Length = 257 Score = 153 bits (387), Expect = 2e-35, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 89/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI +N IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKNWDELAAIFNIEQFKAYYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHQNFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|294651461|ref|ZP_06728774.1| glycosyltransferase LpsA [Acinetobacter haemolyticus ATCC 19194] gi|292822611|gb|EFF81501.1| glycosyltransferase LpsA [Acinetobacter haemolyticus ATCC 19194] Length = 270 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 54/253 (21%), Positives = 97/253 (38%), Gaps = 21/253 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL ++ RR+ + + F FFDA+ P ++ K L Sbjct: 1 MKNYVISLTYADQRRDHILSIFQKQAIDFCFFDAV----TPATMEQIANALGLDITKTDL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C +SH LW++ I EDD +E + L +IK +A Sbjct: 57 AKSEVACLLSHAALWQKAIDENLSHIAIFEDDIHLGEEATIFLTKSDWIPEGCSIIKLEA 116 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDM 180 K +S +LP + + + GY + ++AA LLN P+D + Sbjct: 117 FYPKVGVESSFRSLPSKRKLSLLKTVHMGCGGYILSQQAARDLLNLLATYEQLIPVDHIV 176 Query: 181 KHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS-----PLYFYR 227 + +P+L + + + ++ S +E+ R +RK + L F + Sbjct: 177 FKDYMTKFPNQIFQMLPALCIQDHLLGQHVN-FPSFLEKDRNIRKGEYEYQIKPKLNFVQ 235 Query: 228 NTCYQWNLHYNAW 240 + L +A Sbjct: 236 KIQRE-RLRLSAQ 247 >gi|218461235|ref|ZP_03501326.1| putative glycosyltransferase protein [Rhizobium etli Kim 5] Length = 289 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 107/264 (40%), Gaps = 8/264 (3%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I Y+I+L + RR + + I + F A+ G + R F Q + R Sbjct: 29 INTYLINLDRAELRRFRMERLLSEIGISFERVAAVDGVGISLPRRDFDGQSYLRRHGRTP 88 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGCY+SH+ +R A+ILEDD DF D+F+++L + + +++ Sbjct: 89 NPFEIGCYLSHVECARRFLAGNGEFALILEDDLDFDDDFAEVLAAALEQHASWDILRLST 148 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + K + L + + + Y I ++AA + +V + P D+ + Sbjct: 149 VNSGRKHR--VEPLTASRSLAIALTREKGSGAYLINRKAAGWIADVLVPMRLPYDLALDL 206 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYFYRNTCYQWNLHYN 238 ++ + + +P V + D S I+ RL R+ +S L YR Sbjct: 207 EFDEGLRACFVDPVPVSQRADP-HSQIQAGLSSYRLGRRRPWSVLP-YRAAAEVRRFSVR 264 Query: 239 AWRKDLPPVSTTKFLPSSSSSLIK 262 R + K +++ SL K Sbjct: 265 FVRLAALRMRGWKDDTAAAGSLSK 288 >gi|255264180|ref|ZP_05343522.1| lipooligosaccharide galactosyltransferase I [Thalassiobium sp. R2A62] gi|255106515|gb|EET49189.1| lipooligosaccharide galactosyltransferase I [Thalassiobium sp. R2A62] Length = 255 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 61/241 (25%), Positives = 93/241 (38%), Gaps = 13/241 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRL 61 P++VISLP RR + L F DAI G + H Q R Sbjct: 9 PIFVISLPDRLDRRAAITAQLTAHDLPFEIIDAIDGRSGLTEAQKSAVDHAAVQRHLGRP 68 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS PEI C SH +++R+ GA+ILEDDA + F+ L + +++F Sbjct: 69 LSDPEIACAQSHQSVYQRVITLDLGGAVILEDDATLTTAFATLYRA--RAYQRFDILQFS 126 Query: 122 ALRKKPKKDSYLCT-LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 R + +L L + + + S + Y + + AA +L + K I P D Sbjct: 127 YRRARVFPSRWLWPHLTEDIRAARVLLNSEVASTYSVNQRAARYLSHGPKPIGGPADWPR 186 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 + + VT P + DT S IE R+ + +R L W Sbjct: 187 DTF---GLRHAVTVPQTADQNCDTLSSDIEFGRVDLASRVNKDRSFRRL-----LRLQYW 238 Query: 241 R 241 + Sbjct: 239 K 239 >gi|329123202|ref|ZP_08251771.1| glycosyltransferase [Haemophilus aegyptius ATCC 11116] gi|327471547|gb|EGF16991.1| glycosyltransferase [Haemophilus aegyptius ATCC 11116] Length = 267 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 89/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI +N IF+ ++ + + R Sbjct: 12 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKNWDELAAIFNIEQFKAYYFR 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F F Q L L + Sbjct: 70 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHQNFQQNLTALLSEKLEAE 129 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 130 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 189 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 190 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 239 >gi|145633631|ref|ZP_01789358.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 3655] gi|145635446|ref|ZP_01791147.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittAA] gi|229845232|ref|ZP_04465365.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 6P18H1] gi|144985508|gb|EDJ92324.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 3655] gi|145267320|gb|EDK07323.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittAA] gi|229811827|gb|EEP47523.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 6P18H1] Length = 257 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 91/232 (39%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINSFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQDFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|74318870|gb|ABA02572.1| Lgt2A [Moraxella catarrhalis] gi|326563616|gb|EGE13868.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 103P14B1] gi|326566382|gb|EGE16532.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC1] gi|326573223|gb|EGE23191.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 101P30B1] Length = 254 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 95/259 (36%), Gaps = 26/259 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISIKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ E C++SH+ LW+++ I EDD ++ + + + +IK + Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDAACFIKN-DWLYFEFDIIKLE 116 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID 177 + + GN ++ + T GY I + A LL K+I Y ID Sbjct: 117 TQHELVHIGKSIH-HHGNRTLNPLKSTHVGTAGYIISQSGAKRLLEFIKSIDEYEYYAID 175 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEA---IDTNDSTIEESR------------LVRKPTFSP 222 M + L P ++ I +S +E+ R L K Sbjct: 176 HVMFGAYLSKGKVLQLCPALCQQSDSQIKNLESQLEQDRKNHQYIYHANESLYTKLNKIV 235 Query: 223 LYFYRNTCYQWNLHYNAWR 241 FYR+ ++ A+R Sbjct: 236 KRFYRSFGKRFFYITVAFR 254 >gi|325293427|ref|YP_004279291.1| glycosyltransferase 25 family member 1 [Agrobacterium sp. H13-3] gi|325061280|gb|ADY64971.1| Glycosyltransferase 25 family member 1 [Agrobacterium sp. H13-3] Length = 270 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 75/220 (34%), Gaps = 3/220 (1%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + V++I++ + R E R I L+ + G FS R Sbjct: 10 KLAVFLINMDSATKRLEDMNARLDAIGLKAERVPGVNGRELNYPIPEFSELSYMLMHGRR 69 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 S PEIGCY+SH+ R S A A+ILEDD F +F + + + L++ Sbjct: 70 TSPPEIGCYLSHVACANRFMGSDADIALILEDDVVFEKDFLEAIDEAIVNGSDWDLLRLT 129 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 + K L G + + Y + + A ++ + D+ Sbjct: 130 TVSNGRK--FPFRKLVGGRSLAIALTREKGSGAYLVNRRAGKW-ISKLIPMRLAYDIAFD 186 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 + + + P + D R+ R P + Sbjct: 187 LEYLAGLKAAFIYPLCATQDADGESQIQNNLRIYRLPRWR 226 >gi|83855233|ref|ZP_00948763.1| glycosyl transferase, family 25 [Sulfitobacter sp. NAS-14.1] gi|83843076|gb|EAP82243.1| glycosyl transferase, family 25 [Sulfitobacter sp. NAS-14.1] Length = 238 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 90/250 (36%), Gaps = 16/250 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + +I + S AR + +A+ G + RI S + Sbjct: 1 MKSLIIHMSTSTARHANVARLLQDLP-DAQVVEAVNGRDPTQVADVRIHSGNLHSPSYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILIK 119 L+ EIG + SH W+ I A+I+EDD S E + L + + ++ ++ Sbjct: 60 ALTPAEIGVFQSHRKCWQMIVDQGWDYAMIVEDDLAVSRPELDRALQLIHQNMTPDMYLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K + + + G+ + P+ + + +G++AA LL+ I RP+D Sbjct: 120 LPP--KMRELPADVVARDGDMALILPKRIGLQCCCQVVGRDAAQRLLDGSTQIDRPVDTW 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAID-TNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 ++ W P + E + STI++ T + R W Y Sbjct: 178 LQMHWVTGQKLHALLPNGLSEIANQIGGSTIQQ------KTRTSGKLMRELKRAW---YR 228 Query: 239 AWRKDLPPVS 248 K P + Sbjct: 229 TQVKLRPQKA 238 >gi|296233252|ref|XP_002761953.1| PREDICTED: procollagen galactosyltransferase 1-like [Callithrix jacchus] Length = 738 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 101/275 (36%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 459 VFMINLKRRQDRRERMLRALQEQGIECRLVEAVDGKAMNTSQVEALGIQMLPGYQDPYHG 518 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D++ Sbjct: 519 RPLTKGELGCFLSHYNIWKEVVDRGLEKSLVFEDDLRFEIFFKRRLMNLMRDVEREDLDW 578 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL+ K + Sbjct: 579 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLLD-AKPLSKM 634 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ N+ + EP VY DT S + Sbjct: 635 LPVDEFLPVMFDKHPVSEYKAHFSLRNLRAFSVEPLLVYPTHYTGDDGYVSDTETSVVWN 694 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V+ T + Q L A D+ Sbjct: 695 NEQVK--TDWDRAKSQKMREQQALSREAKNSDVLQ 727 >gi|332345593|gb|AEE58927.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 257 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 95/241 (39%), Gaps = 15/241 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +P+Y++SL RR K R+++ F FFDAI ++ I + R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFHRLNINFDFFDAIDAKDPQNKEII--DKMRLSGVGA 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 ++ EI C +SH +++ + A+ILEDD +++F + L + + + + Sbjct: 59 EMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLSEKDKLKHNN 118 Query: 115 -NILIKFDALRKKPKKDSYLCTLPG-----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +L L P L + ++ + RT Y + K+ A +LL + Sbjct: 119 LYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTCSYLMNKDMAQNLLKL 178 Query: 169 RKNIYRP-IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 K+ D + I + ++ ++ +S +E RL+ P + Sbjct: 179 TKDYGTYRADSWKLMHQHNIIKEFYLDEIILHPILNEFNSHLESERLLTSEKKQPRTRLQ 238 Query: 228 N 228 Sbjct: 239 K 239 >gi|148935|gb|AAA24981.1| ORF 4 [Haemophilus influenzae] Length = 257 Score = 152 bits (385), Expect = 3e-35, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 90/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHLDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSNINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|187929477|ref|YP_001899964.1| glycosyl transferase family 25 [Ralstonia pickettii 12J] gi|309781748|ref|ZP_07676481.1| LPS glycosyltransferase subfamily [Ralstonia sp. 5_7_47FAA] gi|187726367|gb|ACD27532.1| glycosyl transferase family 25 [Ralstonia pickettii 12J] gi|308919389|gb|EFP65053.1| LPS glycosyltransferase subfamily [Ralstonia sp. 5_7_47FAA] Length = 260 Score = 152 bits (385), Expect = 3e-35, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 87/245 (35%), Gaps = 12/245 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 +P + I+L RRE + + L F +YG+ P R + H + Q R Sbjct: 5 VPTFFINLDHDAGRREALERQLDALGLPHQRFPGVYGKTLPADELARHYDHARATSQS-R 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ E+GC +SH+ +++ + A+ILEDDA + +L L++ + + Sbjct: 64 ELTVGEVGCALSHLGVYRAMIEQNLPYALILEDDAKLGPDVPAVLDALAQSVSPDEPVVT 123 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 L + + GYF+ + AA ++ ++ D Sbjct: 124 LLTHIDRYYKRSARPLTADHRTVKLANYQWLAHGYFVTRAAAKRMVEQLYPVWLAADYWY 183 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF---------SPLYFYRNTCY 231 K E + P + + S +E R ++ F R Y Sbjct: 184 KFEREGIVQMRAVVPYVIGVQDFDSGSNLEADRAIKSREADDRLGLGHHLRQIFVRKFLY 243 Query: 232 QWNLH 236 Q + Sbjct: 244 QIFVR 248 >gi|207092597|ref|ZP_03240384.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori HPKX_438_AG0C1] Length = 232 Score = 152 bits (385), Expect = 4e-35, Method: Composition-based stats. Identities = 49/236 (20%), Positives = 76/236 (32%), Gaps = 46/236 (19%) Query: 3 IPVYVISL-----PFSHARREKFC---HRAARIHLQFSFFDAIYGENNPI---------- 44 + V++I L + + + FDAIY + +P Sbjct: 1 MRVFIIHLSPKTCQNFSLKETHITPLLESLKLQGISYEIFDAIYSKISPTQLHPLILEHL 60 Query: 45 ---------------------CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + F + + C + + E+GCY SH LW++ Sbjct: 61 HPSFVVEDLLAFCKNEKHPPCALKNFFYALKHC--GKRMGFGELGCYASHYSLWQKCIEL 118 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIH 143 ILEDD D F + L K I+ L + K + G I Sbjct: 119 NE-AICILEDDIIIEDRFKESLEFCDKHINELGYIRLMHLEENVAKQKT--PVKGVSQIL 175 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 + T GY + +AA LL K PID M + H + + V E A+ Sbjct: 176 NFKD-GIGTQGYVLAPKAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAI 230 >gi|261315764|ref|ZP_05954961.1| glycosyl transferase [Brucella pinnipedialis M163/99/10] gi|261304790|gb|EEY08287.1| glycosyl transferase [Brucella pinnipedialis M163/99/10] Length = 243 Score = 152 bits (385), Expect = 4e-35, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 84/215 (39%), Gaps = 7/215 (3%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 + R+ QF +A+ G + S + ++ LS EIGC++SH ++ Sbjct: 1 MASQFERLGAQFERVEAVNGRAMSPLE-LASFTQISKEWPAPLSPAEIGCFLSHRKCLEK 59 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN 139 IA A + EDD S S+ L ++K DA + + + G Sbjct: 60 IAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYGHEVLISNPVKN-EGP 118 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE---HNIPSLVTEPG 196 + I + R +T GY + +EAA LL + + P+D + + ++ P Sbjct: 119 YSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFDPNDGPFNDFEIYQISPA 178 Query: 197 AVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 ++ +STI ++R ++ +R Sbjct: 179 ICRQSG--MESTIGQNRRPKQRPSLLGLVWREAKR 211 >gi|301170452|emb|CBW30059.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 10810] Length = 257 Score = 152 bits (385), Expect = 4e-35, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 90/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIVEDNDIAEDSYALVCEDDALFHLDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|309972907|gb|ADO96108.1| Lipopolysaccharide biosynthesis protein LsgD [Haemophilus influenzae R2846] Length = 257 Score = 152 bits (385), Expect = 4e-35, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 89/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRELFFSQKNTE--DFQIFSAINTMQKDWDELAAIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I ++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|68250301|ref|YP_249413.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 86-028NP] gi|229847318|ref|ZP_04467420.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 7P49H1] gi|68058500|gb|AAX88753.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 86-028NP] gi|229809743|gb|EEP45467.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 7P49H1] Length = 257 Score = 152 bits (385), Expect = 4e-35, Method: Composition-based stats. Identities = 59/232 (25%), Positives = 90/232 (38%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQNFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|71149096|gb|AAZ29047.1| Lgt2 [Moraxella catarrhalis] gi|326570929|gb|EGE20953.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC7] gi|326575875|gb|EGE25798.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis CO72] Length = 254 Score = 152 bits (384), Expect = 5e-35, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 95/259 (36%), Gaps = 26/259 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L+ E C++SH+ LW+++ I EDD ++ + + + +IK + Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDAACFIKN-DWLYFEFDIIKLE 116 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID 177 + + GN ++ + T GY I + A LL K+I Y ID Sbjct: 117 TQHELVHIGKSIH-HHGNRTLNPLKSTHVGTAGYIISQSGAKRLLEFIKSIDEYEYYAID 175 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEA---IDTNDSTIEESR------------LVRKPTFSP 222 M + L P ++ I +S +E+ R L K Sbjct: 176 HVMFGAYLSKGKVLQLCPALCQQSDSQIKNLESQLEQDRKNHQYIYHANESLYTKLNKIV 235 Query: 223 LYFYRNTCYQWNLHYNAWR 241 FYR+ ++ A+R Sbjct: 236 KRFYRSFGKRFFYITVAFR 254 >gi|86133854|ref|ZP_01052436.1| glycosyltransferase family 25 (LPS biosynthesis protein) [Polaribacter sp. MED152] gi|85820717|gb|EAQ41864.1| glycosyltransferase family 25 (LPS biosynthesis protein) [Polaribacter sp. MED152] Length = 253 Score = 152 bits (384), Expect = 5e-35, Method: Composition-based stats. Identities = 54/230 (23%), Positives = 94/230 (40%), Gaps = 18/230 (7%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ---FKR 60 Y I+L S RR + ++++ + A YG++ + +I +KR+ Sbjct: 9 KFYYINLDKSTERRAFMEEQFNKLNIPITRISATYGKD--LDPKILKEEKRKHNILAHYP 66 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L + EIG ++H LW+ ++ P +I+LEDDA + F++ L L N I Sbjct: 67 LPNDGEIGICLTHFKLWEFLSKQPEDFSIVLEDDALIQNNFTEDLEQLLSQITINDFIDI 126 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 L N + + + G IGK AA L N + Y PID+ Sbjct: 127 SG-------KKGFYALEKNTLLTKFLMPPVLMIGQVIGKNAAQKLTNNLSDYYAPIDVMK 179 Query: 181 KHWWEHNIPSLVTEPGAVYEAI-DTNDSTIEESRLVR-----KPTFSPLY 224 + ++H +P T V STI+++ ++ + F P + Sbjct: 180 QDVYKHKVPLFSTNKQYVKSIDKQMGGSTIQQNNMIAWKKVIREIFRPFW 229 >gi|331685288|ref|ZP_08385874.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli H299] gi|331077659|gb|EGI48871.1| putative beta1,4-galactosyltransferase WaaX [Escherichia coli H299] Length = 257 Score = 152 bits (384), Expect = 5e-35, Method: Composition-based stats. Identities = 51/249 (20%), Positives = 98/249 (39%), Gaps = 18/249 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M +P+Y++SL RR K R+++ F FFDAI ++ I + R Sbjct: 1 MNLPIYIVSLKRDIERRNKINDVFYRLNINFDFFDAIDAKDPQNKEII--DKMRLSGVGA 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 ++ EI C +SH +++ + A+ILEDD +++F + L + + + + Sbjct: 59 EMTDGEIACTLSHQLIYQDMIDKNIEWAVILEDDVIVNEKFKKFLQYFNLSEKDKLKHNN 118 Query: 115 -NILIKFDALRKKPKKDSYLCTLPG-----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 +L L P L + ++ + RT Y + K+ A +LL + Sbjct: 119 LYLLGGQKGLHDYPVLGQSLFSKVKVSTCTFRRVNFNKNKIRRTCSYLMNKDMAQNLLKL 178 Query: 169 RKNIYRP-IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---RKPTFSPLY 224 K+ D + I + ++ ++ +S +E RL+ +K + L Sbjct: 179 TKDYGTYRADSWKLMHQHNIIKEFYLDEIILHPILNEFNSHLESERLLTSGKKQPRTRLQ 238 Query: 225 FYRNTCYQW 233 W Sbjct: 239 KRMKFIRSW 247 >gi|145629208|ref|ZP_01785007.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.1-21] gi|145639146|ref|ZP_01794753.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittII] gi|144978711|gb|EDJ88434.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.1-21] gi|145271708|gb|EDK11618.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittII] gi|309750727|gb|ADO80711.1| Lipopolysaccharide biosynthesis protein LsgD [Haemophilus influenzae R2866] Length = 257 Score = 152 bits (383), Expect = 6e-35, Method: Composition-based stats. Identities = 56/232 (24%), Positives = 91/232 (39%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWNELAAIFNIEQFKAHYGR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I ++ EDDA F +F + L L + + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHPDFQKNLTALLAEKLESE 119 Query: 117 LI-----------KFDALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P S+LC GN + P T GY I K AA Sbjct: 120 IILVGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + + + D + + NI + V P V E S +E R Sbjct: 180 FIQQISQNEPFWLA-DDFLLFEQDFNIRNKVVRPLMVIENP-VLISNLESIR 229 >gi|156120717|ref|NP_001095505.1| glycosyltransferase 25 family member 3 precursor [Bos taurus] gi|160395522|sp|A7MB73|GT253_BOVIN RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|154425666|gb|AAI51374.1| CERCAM protein [Bos taurus] Length = 595 Score = 152 bits (383), Expect = 7e-35, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 78/227 (34%), Gaps = 29/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-----ICNRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSVMRTLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVTRGLAQVVVFEDDVRFESNFKGRLEQLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + + R Sbjct: 442 DLIYLGRKQVNPEEEAVVEGLPH----LVAAGYSYWTLAYVLSLAGARKLL-ASQPLRRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+W ++ + P DS Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLQAFSARPLLAAPTHYAGDS 543 >gi|241663595|ref|YP_002981955.1| glycosyl transferase family 25 [Ralstonia pickettii 12D] gi|240865622|gb|ACS63283.1| glycosyl transferase family 25 [Ralstonia pickettii 12D] Length = 260 Score = 151 bits (382), Expect = 9e-35, Method: Composition-based stats. Identities = 50/246 (20%), Positives = 92/246 (37%), Gaps = 14/246 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 +P + I+L RRE + + L F +YG+ P R + H + Q R Sbjct: 5 VPTFFINLDHDAGRREALERQLDALGLPHRRFPGVYGKTLPADELARHYDHARATGQS-R 63 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L++ E+GC +SH+ +++ + A+ILEDDA + +L L++ + + Sbjct: 64 ELTVGEVGCALSHLGVYRAMIEQDLPYALILEDDAKLGPDVPAVLDALARSVSPDEPVVT 123 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 L N + GYF+ + AA ++ ++ D Sbjct: 124 LLTHIDRYYKRSARPLTANHRTVKLANYQWLAHGYFVTRAAAKRMVEQLYPVWLAADYWY 183 Query: 181 KHWWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLVR-KPTFSPLY--------FYRNTC 230 K E + P + + D+ S +E R ++ + + L F R Sbjct: 184 KFEREGIVQMRAVVPYVIGVQNFDS-GSNLEADRAIKSREADTRLGLGHHLRQIFVRKFL 242 Query: 231 YQWNLH 236 YQ + Sbjct: 243 YQIFVR 248 >gi|126724376|ref|ZP_01740219.1| Glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] gi|126705540|gb|EBA04630.1| Glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] Length = 247 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 55/244 (22%), Positives = 98/244 (40%), Gaps = 13/244 (5%) Query: 10 LPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRLLSLPEI 67 + + R E ++ A ++ F +A+ G + ++++ + K L PEI Sbjct: 1 MAENTTRMENVSNQMAEQNIAFVRVNAVDGRKFDEAETSKVYDAAANKKYGKYPLVGPEI 60 Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 GCY+SHI+ W+ IA A G I EDD +L LS + + ++K A Sbjct: 61 GCYLSHINAWRTIAEGDAEGGFIFEDDFQSDQYLKTVLTALSNDERDWDMVKLFAFNPAV 120 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPIDMDMKHWWE 185 + P + I P + GY + K AA L+ +RP+D D K WE Sbjct: 121 PLANGRELTPDHI-IGIPYRVPTCLIGYGVTKLAAQSLVKRADQARFFRPVDEDQKFVWE 179 Query: 186 HNIP-SLVTEPGAVYEAIDTNDSTIEESRL------VRKPTFSPLYFYRNTCYQWNLHYN 238 + +LV P T+ +R + L+ + +++N H + Sbjct: 180 TGLRVALVARPPIQIGDQTAQSGTVSNARRGLAHNGISAKIRKALHVF-WYAFRYNYHLH 238 Query: 239 AWRK 242 +++ Sbjct: 239 KYKR 242 >gi|194225890|ref|XP_001499943.2| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Equus caballus] Length = 584 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 76/220 (34%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 311 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSILRSLGVDLLPGYQDPYSG 370 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 371 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVFEDDVRFESNFRGRLEQLMEEVEAEKLPW 430 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + + R Sbjct: 431 DLIYLGRKQVNPEEEAVVEGLPH----LVVAGYSYWTLAYVLSLAGARKLL-ASQPLRRM 485 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + P Sbjct: 486 LPVDEFLPIMFDQHPNEQYKAHFWPRDLRAFSARPLLAAP 525 >gi|306842115|ref|ZP_07474785.1| LPSA protein [Brucella sp. BO2] gi|306287788|gb|EFM59216.1| LPSA protein [Brucella sp. BO2] Length = 726 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 24 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 82 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 83 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 142 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 143 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 201 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 202 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKYPWYKRLRTSLFRTFDYY 261 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 262 VRVHHTLLQPQNPDGSSIK 280 >gi|254718764|ref|ZP_05180575.1| LPSA protein [Brucella sp. 83/13] gi|265983742|ref|ZP_06096477.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306840196|ref|ZP_07472976.1| LPSA protein [Brucella sp. NF 2653] gi|264662334|gb|EEZ32595.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306404727|gb|EFM61026.1| LPSA protein [Brucella sp. NF 2653] Length = 703 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSIK 257 >gi|306845221|ref|ZP_07477797.1| LPSA protein [Brucella sp. BO1] gi|306274380|gb|EFM56187.1| LPSA protein [Brucella sp. BO1] Length = 703 Score = 150 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 50/259 (19%), Positives = 88/259 (33%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP------TFSPLYFYRNTCYQ 232 M+ +W H ++ + + + S I + +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKYPWYKRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSIK 257 >gi|254713752|ref|ZP_05175563.1| LPSA protein [Brucella ceti M644/93/1] gi|254717190|ref|ZP_05179001.1| LPSA protein [Brucella ceti M13/05/1] gi|261219006|ref|ZP_05933287.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261321494|ref|ZP_05960691.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260924095|gb|EEX90663.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261294184|gb|EEX97680.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 703 Score = 150 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|254701409|ref|ZP_05163237.1| LPSA protein [Brucella suis bv. 5 str. 513] gi|261751948|ref|ZP_05995657.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261741701|gb|EEY29627.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] Length = 703 Score = 150 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|62289577|ref|YP_221370.1| hypothetical protein BruAb1_0634 [Brucella abortus bv. 1 str. 9-941] gi|82699507|ref|YP_414081.1| glycosyl transferase family O-antigen polymerase [Brucella melitensis biovar Abortus 2308] gi|225852140|ref|YP_002732373.1| glycosyltransferase 25 family protein [Brucella melitensis ATCC 23457] gi|254693379|ref|ZP_05155207.1| LPSA protein [Brucella abortus bv. 3 str. Tulya] gi|254697030|ref|ZP_05158858.1| LPSA protein [Brucella abortus bv. 2 str. 86/8/59] gi|254707667|ref|ZP_05169495.1| LPSA protein [Brucella pinnipedialis M163/99/10] gi|254709749|ref|ZP_05171560.1| LPSA protein [Brucella pinnipedialis B2/94] gi|254729929|ref|ZP_05188507.1| LPSA protein [Brucella abortus bv. 4 str. 292] gi|256031239|ref|ZP_05444853.1| LPSA protein [Brucella pinnipedialis M292/94/1] gi|256060748|ref|ZP_05450911.1| LPSA protein [Brucella neotomae 5K33] gi|256159346|ref|ZP_05457132.1| LPSA protein [Brucella ceti M490/95/1] gi|256254647|ref|ZP_05460183.1| LPSA protein [Brucella ceti B1/94] gi|256257146|ref|ZP_05462682.1| LPSA protein [Brucella abortus bv. 9 str. C68] gi|256369054|ref|YP_003106562.1| hypothetical protein BMI_I614 [Brucella microti CCM 4915] gi|260168375|ref|ZP_05755186.1| LPSA protein [Brucella sp. F5/99] gi|260545662|ref|ZP_05821403.1| LpsA protein [Brucella abortus NCTC 8038] gi|260757602|ref|ZP_05869950.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260761428|ref|ZP_05873771.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260883408|ref|ZP_05895022.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|261213629|ref|ZP_05927910.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|261221822|ref|ZP_05936103.1| conserved hypothetical protein [Brucella ceti B1/94] gi|261315164|ref|ZP_05954361.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261317280|ref|ZP_05956477.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261324737|ref|ZP_05963934.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261757835|ref|ZP_06001544.1| LPSA protein [Brucella sp. F5/99] gi|265988318|ref|ZP_06100875.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|265997786|ref|ZP_06110343.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|62195709|gb|AAX74009.1| hypothetical membrane protein [Brucella abortus bv. 1 str. 9-941] gi|82615608|emb|CAJ10595.1| Glycosyl transferase, family 25:O-antigen polymerase [Brucella melitensis biovar Abortus 2308] gi|225640505|gb|ACO00419.1| Glycosyltransferase 25 family member 3 precursor [Brucella melitensis ATCC 23457] gi|255999214|gb|ACU47613.1| hypothetical protein BMI_I614 [Brucella microti CCM 4915] gi|260097069|gb|EEW80944.1| LpsA protein [Brucella abortus NCTC 8038] gi|260667920|gb|EEX54860.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260671860|gb|EEX58681.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260872936|gb|EEX80005.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|260915236|gb|EEX82097.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|260920406|gb|EEX87059.1| conserved hypothetical protein [Brucella ceti B1/94] gi|261296503|gb|EEX99999.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261300717|gb|EEY04214.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261304190|gb|EEY07687.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261737819|gb|EEY25815.1| LPSA protein [Brucella sp. F5/99] gi|262552254|gb|EEZ08244.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|264660515|gb|EEZ30776.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|326408640|gb|ADZ65705.1| LPSA protein [Brucella melitensis M28] gi|326538362|gb|ADZ86577.1| glycosyltransferase 25 family member 3 precursor [Brucella melitensis M5-90] Length = 703 Score = 150 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|294851984|ref|ZP_06792657.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] gi|294820573|gb|EFG37572.1| glycosyl transferase, family 25 [Brucella sp. NVSL 07-0026] Length = 759 Score = 150 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 115 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 116 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 175 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 176 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 234 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 235 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 294 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 295 VRVHHTLLQPQNPDGSSMK 313 >gi|189023836|ref|YP_001934604.1| LPSA protein [Brucella abortus S19] gi|225627122|ref|ZP_03785160.1| Glycosyltransferase 25 family member 3 [Brucella ceti str. Cudo] gi|237815074|ref|ZP_04594072.1| Glycosyltransferase 25 family member 3 [Brucella abortus str. 2308 A] gi|265999554|ref|ZP_05466880.2| LPSA protein [Brucella melitensis bv. 2 str. 63/9] gi|297247992|ref|ZP_06931710.1| glycosyl transferase, family 25 [Brucella abortus bv. 5 str. B3196] gi|189019408|gb|ACD72130.1| LPSA protein [Brucella abortus S19] gi|225617957|gb|EEH15001.1| Glycosyltransferase 25 family member 3 [Brucella ceti str. Cudo] gi|237789911|gb|EEP64121.1| Glycosyltransferase 25 family member 3 [Brucella abortus str. 2308 A] gi|263094622|gb|EEZ18420.1| LPSA protein [Brucella melitensis bv. 2 str. 63/9] gi|297175161|gb|EFH34508.1| glycosyl transferase, family 25 [Brucella abortus bv. 5 str. B3196] Length = 759 Score = 150 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 115 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 116 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 175 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 176 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 234 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 235 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 294 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 295 VRVHHTLLQPQNPDGSSMK 313 >gi|163842890|ref|YP_001627294.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis ATCC 23445] gi|163673613|gb|ABY37724.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis ATCC 23445] Length = 703 Score = 150 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|83941755|ref|ZP_00954217.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] gi|83847575|gb|EAP85450.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] Length = 238 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 48/250 (19%), Positives = 89/250 (35%), Gaps = 16/250 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + +I + S AR + +A+ G + R S + Sbjct: 1 MKSLIIHMSTSTARHANVARLLQDLP-DAQVVEAVNGRDPTQVADVRTHSGNLHSPSYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILIK 119 L+ EIG + SH W+ I A+I+EDD S E + L + + ++ ++ Sbjct: 60 ALTPAEIGVFQSHRKCWQMIVDQGWDYAMIVEDDLAVSRPELDRALQLIHQNMTPDMYLR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K + + + G+ + P+ + + +G++AA LL+ I RP+D Sbjct: 120 LPP--KMRELPADVVARDGDMALILPKRIGLQCCCQVVGRDAAQLLLDGSTQIDRPVDTW 177 Query: 180 MKHWWEHNIPSLVTEPGAVYEAID-TNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 ++ W P + E + STI++ T + R W Y Sbjct: 178 LQMHWVTGQKLHALLPNGLSEIANQIGGSTIQQ------KTRTSGKLMREFKRAW---YR 228 Query: 239 AWRKDLPPVS 248 K P + Sbjct: 229 TQVKLRPQKA 238 >gi|281349465|gb|EFB25049.1| hypothetical protein PANDA_003209 [Ailuropoda melanoleuca] Length = 569 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 75/220 (34%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 296 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSIMRRLGVDLLPGYQDPYSG 355 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINN 115 R L+ E+GC++SH +W+ + ++LEDD F F L L + Sbjct: 356 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVLEDDVRFESNFRGRLERLMEEVEAEKLPW 415 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + + R Sbjct: 416 DLIYLGRKQVNPEEEAAVEGLPH----LVVAGYSYWTLAYVLSLAGARKLL-ASQPLRRM 470 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + P Sbjct: 471 LPVDEFLPIMFDQHPNEQYKAHFWPRDLRAFSARPLLAAP 510 >gi|297270127|ref|XP_001111820.2| PREDICTED: glycosyltransferase 25 family member 3-like [Macaca mulatta] Length = 714 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 76/220 (34%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 441 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 500 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + + Sbjct: 501 RTLTKGEVGCFLSHYSIWEEVVARGLSQVLVFEDDVRFESNFRGRLERLMEDVEAEKLPW 560 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 561 DLIYLGRKQVNPEKEAAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRRM 615 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + P Sbjct: 616 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSARPLLAAP 655 >gi|157823499|ref|NP_001099537.1| procollagen galactosyltransferase 1 [Rattus norvegicus] gi|149036101|gb|EDL90767.1| glycosyltransferase 25 domain containing 1 (predicted) [Rattus norvegicus] gi|169642770|gb|AAI60899.1| Glycosyltransferase 25 domain containing 1 [Rattus norvegicus] Length = 617 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 98/275 (35%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFK 59 V++I+L RRE+ + F +A+ G+ R Sbjct: 338 VFMINLKRRLDRRERMLRALHEQEIDFQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 397 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMQDVEREGLDW 457 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL K + Sbjct: 458 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGAQKLL-AAKPLAKM 513 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ N+ + EP +Y DT S + Sbjct: 514 LPVDEFLPVMFDKHPMSEYKSHFSPRNLRAFSVEPLLIYPTHYTGDDGYVSDTETSVVWN 573 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V+ T + Q L A D+ Sbjct: 574 NEQVK--TDWDRAKSQKMREQQALSREAKNSDVLQ 606 >gi|301758784|ref|XP_002915272.1| PREDICTED: glycosyltransferase 25 family member 3-like [Ailuropoda melanoleuca] Length = 590 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 75/220 (34%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 317 VFVISLARRPDRRERMLSSLWEMEISGRVVDAVDGRTLNSSIMRRLGVDLLPGYQDPYSG 376 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINN 115 R L+ E+GC++SH +W+ + ++LEDD F F L L + Sbjct: 377 RTLTKGEVGCFLSHYSIWEEVVARGLAQVLVLEDDVRFESNFRGRLERLMEEVEAEKLPW 436 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + + R Sbjct: 437 DLIYLGRKQVNPEEEAAVEGLPH----LVVAGYSYWTLAYVLSLAGARKLL-ASQPLRRM 491 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + P Sbjct: 492 LPVDEFLPIMFDQHPNEQYKAHFWPRDLRAFSARPLLAAP 531 >gi|256113154|ref|ZP_05454031.1| LPSA protein [Brucella melitensis bv. 3 str. Ether] gi|265994567|ref|ZP_06107124.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|262765680|gb|EEZ11469.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] Length = 703 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPHNPDGSSMK 257 >gi|296190930|ref|XP_002743398.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Callithrix jacchus] Length = 595 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 77/215 (35%), Gaps = 29/215 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F + L L + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRRRLERLMEDVEAEKLRW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 442 DLIYLGRKQVNPEKEAAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEP 195 P+D + H+W ++ + P Sbjct: 497 LPVDEFLPIMFDQHPNEEYKAHFWPRDLLAFSARP 531 >gi|160395571|sp|Q5U309|GT253_RAT RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule Length = 572 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 80/227 (35%), Gaps = 29/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RR + + + DA+ G + Sbjct: 299 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRTLNSSILKHLGVDLLPGYQDPYSG 358 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + ++ Sbjct: 359 RTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLMEDVLTQKLSW 418 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+++ + LPG S T Y + A LL + ++R Sbjct: 419 DLIYLGRKQVNPEEEVAVEGLPG----LVVAGYSYWTLAYTLSLAGARKLL-ASQPLHRM 473 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+W ++ + P D+ Sbjct: 474 LPVDEFLPVMFDRHPNDQYKAHFWPRDLQAFSARPLLASPTHYAGDT 520 >gi|23501502|ref|NP_697629.1| hypothetical protein BR0615 [Brucella suis 1330] gi|161618590|ref|YP_001592477.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella canis ATCC 23365] gi|254703954|ref|ZP_05165782.1| lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella suis bv. 3 str. 686] gi|260566797|ref|ZP_05837267.1| LPSA protein [Brucella suis bv. 4 str. 40] gi|261754606|ref|ZP_05998315.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|23347409|gb|AAN29544.1| membrane protein, putative [Brucella suis 1330] gi|161335401|gb|ABX61706.1| Lacto-N-neotetraose biosynthesis glycosyl transferase lgtB [Brucella canis ATCC 23365] gi|260156315|gb|EEW91395.1| LPSA protein [Brucella suis bv. 4 str. 40] gi|261744359|gb|EEY32285.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] Length = 703 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDTIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|253989141|ref|YP_003040497.1| hypothetical protein PAU_01661 [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253780591|emb|CAQ83753.1| conserved hypothetical protein [Photorhabdus asymbiotica] Length = 261 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 60/267 (22%), Positives = 104/267 (38%), Gaps = 24/267 (8%) Query: 3 IPVYVISLPFSHA-RREKFCHRAARIHLQFSFFDAIYGENNPICNRI---FSHQKRQCQF 58 + +VISL ++ RR + ++ + F FFDAI I N + F + Sbjct: 1 MNNFVISLSQNNEKRRNHIVEQFSKKSIPFEFFDAIDKTKIHIANDLGVSFDNP------ 54 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LSL E GC +SH+ LWK++ A I EDD S E L + + + +I Sbjct: 55 --NLSLGEKGCILSHVMLWKKVIDENLPMATIFEDDIYLSKEAENYLKNYDWINPDWHVI 112 Query: 119 KFDALRKKPKKD-SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRP 175 K + +K K S + T + I + R GY I + A +L P Sbjct: 113 KIERADEKVKTAISPVKTFNKHEGIFKLRGEHLGAGGYIITNKGAKYLFEKITSSPFKDP 172 Query: 176 IDMDMKHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 ID ++ + + ++ IP+L + + + D S++E R+ ++ + + Sbjct: 173 IDYEIFNNFIYDKNYFICQFIPALCMQDYTINKCHDKFPSSLENERIHKEKIITKNSSNK 232 Query: 228 NTCYQWNLHYNAWRKDLPPVSTTKFLP 254 + L + LP Sbjct: 233 LIREMSRVKKQLINLVLKKQK-ERKLP 258 >gi|149039147|gb|EDL93367.1| rCG45647, isoform CRA_a [Rattus norvegicus] Length = 596 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 80/227 (35%), Gaps = 29/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RR + + + DA+ G + Sbjct: 323 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRTLNSSILKHLGVDLLPGYQDPYSG 382 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + ++ Sbjct: 383 RTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLMEDVLTQKLSW 442 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+++ + LPG S T Y + A LL + ++R Sbjct: 443 DLIYLGRKQVNPEEEVAVEGLPG----LVVAGYSYWTLAYTLSLAGARKLL-ASQPLHRM 497 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+W ++ + P D+ Sbjct: 498 LPVDEFLPVMFDRHPNDQYKAHFWPRDLQAFSARPLLASPTHYAGDT 544 >gi|296190932|ref|XP_002743399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2 [Callithrix jacchus] Length = 548 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 77/215 (35%), Gaps = 29/215 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 275 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 334 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F + L L + + Sbjct: 335 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRRRLERLMEDVEAEKLRW 394 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 395 DLIYLGRKQVNPEKEAAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRRM 449 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEP 195 P+D + H+W ++ + P Sbjct: 450 LPVDEFLPIMFDQHPNEEYKAHFWPRDLLAFSARP 484 >gi|148560689|ref|YP_001258608.1| hypothetical protein BOV_0614 [Brucella ovis ATCC 25840] gi|148371946|gb|ABQ61925.1| putative membrane protein [Brucella ovis ATCC 25840] Length = 759 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 53/259 (20%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 57 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 115 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + A +ILEDD F++ S + + K + ++ Sbjct: 116 GREMLPGEYGCYRSHLKALESFLSDGAPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 175 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 176 KLVNHRS-PLFMSLLETDAGDRIGRVIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 234 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 235 AMERFWYHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 294 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 295 VRVHHTLLQPQNPDGSSMK 313 >gi|254688898|ref|ZP_05152152.1| LPSA protein [Brucella abortus bv. 6 str. 870] gi|260754383|ref|ZP_05866731.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260674491|gb|EEX61312.1| predicted protein [Brucella abortus bv. 6 str. 870] Length = 703 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 93/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH+ + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ + Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSSAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|304398828|ref|ZP_07380698.1| glycosyl transferase family 25 [Pantoea sp. aB] gi|304353532|gb|EFM17909.1| glycosyl transferase family 25 [Pantoea sp. aB] Length = 257 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 49/252 (19%), Positives = 95/252 (37%), Gaps = 18/252 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++I+L S RR + + + F A+ G+ Q FK Sbjct: 1 MKAFIINLDSSTKRRATIAAQCQNAGISYEFIQAVNGKQLSNEEIALHTQPVNYAFKA-- 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP--HLSKCDINNILIKF 120 EIGC +SHI++++++ A+ILEDDA ++ +L L L+ Sbjct: 59 --GEIGCALSHINIYRKMVDEKIPQALILEDDALLTEALPPVLASPALQLSAAKPTLVLL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 ++ K + + T+ + I+ P + + Y + EAA LL + ++ D Sbjct: 117 SSVNKY--VNKPIATVTESARIY-PVYSATTSHAYVVNFEAAKRLLTLLYPVWIAADKWC 173 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE------SRLVRKPTFSPLYFYRNTCYQWN 234 L P V +TI++ +++ +S L R + Sbjct: 174 VFEEYGAFRLLAVHPAPVLLHEVAQQTTIQQIDDFENHNRLKRELWSKLMAKRPLRAKLR 233 Query: 235 LHYNAWRKDLPP 246 + +R+ + P Sbjct: 234 ---HRYRRAMLP 242 >gi|68357136|ref|XP_694217.1| PREDICTED: procollagen galactosyltransferase 1 [Danio rerio] Length = 609 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 56/280 (20%), Positives = 101/280 (36%), Gaps = 39/280 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ A+ G+ + + Sbjct: 330 VFIINLKRRGDRRERMLRALTEQEIECKIIAAVDGKAMNVSEIHALGIHMLPGYSDPYHG 389 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH----LSKCDINN 115 R L+ E+GC++SH ++WK I ++ILEDD F F + L + L ++ Sbjct: 390 RPLTKGELGCFLSHYNIWKEIVDRGLKTSLILEDDLRFEIFFKRRLQNLLLELQSQSLDW 449 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + D +P ++ + S T GY + + A LL + + Sbjct: 450 DLIYIG--RKRMQVDRPEKAVPNIRNLVEA-DYSYWTLGYMMSLQGARKLLK-AQPLSKM 505 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+ + ++ + EP V+ DT ST+ + Sbjct: 506 LPVDEFLPVMFNKHPVSDYMEQFETRDLKAFSAEPLLVFPTHYTGEAGYISDTETSTVWD 565 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + V T T Q + A D+ + Sbjct: 566 NEKV--HTDWDRERRGKTQEQGEISTEAQNSDVLQSPLDR 603 >gi|296113314|ref|YP_003627252.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis RH4] gi|295921008|gb|ADG61359.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis RH4] Length = 254 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 55/249 (22%), Positives = 95/249 (38%), Gaps = 24/249 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISIKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIK 119 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +IK Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIK 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----P 175 + +K + L + + + T GY I ++ A +L+ + P Sbjct: 118 LETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL-- 223 ID + + L P V +A DT S IE R R+ T + Sbjct: 177 IDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYGK 236 Query: 224 YFYRNTCYQ 232 +YR+ + Sbjct: 237 KYYRSIGKR 245 >gi|296482048|gb|DAA24163.1| glycosyltransferase 25 family member 3 precursor [Bos taurus] Length = 531 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 76/215 (35%), Gaps = 29/215 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-----ICNRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGRMLNSSVMRTLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F QL+ + + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVTRGLAQVVVFEDDVRFESNFKGRLEQLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + + R Sbjct: 442 DLIYLGRKQVNPEEEAVVEGLPH----LVAAGYSYWTLAYVLSLAGARKLL-ASQPLRRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEP 195 P+D + H+W ++ + P Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLQAFSARP 531 >gi|58865502|ref|NP_001011962.1| glycosyltransferase 25 family member 3 [Rattus norvegicus] gi|55249709|gb|AAH85782.1| Cerebral endothelial cell adhesion molecule [Rattus norvegicus] Length = 517 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 80/227 (35%), Gaps = 29/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RR + + + DA+ G + Sbjct: 244 VFVISLARRPQRRARMLSSLWEMEISARVVDAVDGRTLNSSILKHLGVDLLPGYQDPYSG 303 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F D F + L L + ++ Sbjct: 304 RTLTKGEVGCFLSHYSIWEEVVAKGLARVVVFEDDVRFEDNFRKRLERLMEDVLTQKLSW 363 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+++ + LPG S T Y + A LL + ++R Sbjct: 364 DLIYLGRKQVNPEEEVAVEGLPG----LVVAGYSYWTLAYTLSLAGARKLL-ASQPLHRM 418 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+W ++ + P D+ Sbjct: 419 LPVDEFLPVMFDRHPNDQYKAHFWPRDLQAFSARPLLASPTHYAGDT 465 >gi|308187262|ref|YP_003931393.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Pantoea vagans C9-1] gi|308057772|gb|ADO09944.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Pantoea vagans C9-1] Length = 258 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 50/252 (19%), Positives = 88/252 (34%), Gaps = 18/252 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++I+L S RR + L + F A+ G + Sbjct: 1 MKTFIINLASSTGRRATISAQCDAAGLDYEFISAVNGYALTDAE----IAQHTRTVNYAF 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP--HLSKCDINNILIKF 120 EIGC +SHI +++++ A+ILEDDA F+ + +L + N L+ Sbjct: 57 KPGEIGCALSHIAIYRKMKDEKIPQALILEDDALFTGQLGAVLSSPAMQLSPDNPTLVLL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + K T + P + Y I EAAI LL + ++ D Sbjct: 117 SRVNRYVDKAIAAVTETSHL---YPVYSATTAHAYVINLEAAIRLLKLLYPVWMVADKWC 173 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIE------ESRLVRKPTFSPLYFYRNTCYQWN 234 I L P V +TI+ + ++ ++ L R + Sbjct: 174 LFEEYGAIKLLAVHPAPVLLHDLAQQTTIQHFSDVAQHNQQKRDIWAGLMARRPFSVRLR 233 Query: 235 LHYNAWRKDLPP 246 + +R+ L P Sbjct: 234 ---HRYRRALLP 242 >gi|59896171|gb|AAX11432.1| Lgt2 [Moraxella catarrhalis 7169] gi|326560301|gb|EGE10689.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 7169] Length = 254 Score = 149 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 55/249 (22%), Positives = 95/249 (38%), Gaps = 24/249 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC-QFKRL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPITNNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIK 119 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +IK Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIK 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----P 175 + +K + L + + + T GY I ++ A +L+ + P Sbjct: 118 LETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL-- 223 ID + + L P V +A DT S IE R R+ T + Sbjct: 177 IDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYGK 236 Query: 224 YFYRNTCYQ 232 +YR+ + Sbjct: 237 KYYRSIGKR 245 >gi|17987609|ref|NP_540243.1| hypothetical protein BMEI1326 [Brucella melitensis bv. 1 str. 16M] gi|17983318|gb|AAL52507.1| lpsa protein [Brucella melitensis bv. 1 str. 16M] Length = 721 Score = 149 bits (377), Expect = 4e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 91/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 19 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 77 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH + + +ILEDD F++ S + + K + ++ Sbjct: 78 GREMLPGEYGCYRSHSKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 137 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 138 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 196 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 197 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 256 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 257 VRVHHTLLQPQNPDGSSMK 275 >gi|217032517|ref|ZP_03438008.1| hypothetical protein HPB128_180g16 [Helicobacter pylori B128] gi|298736530|ref|YP_003729056.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|216945795|gb|EEC24418.1| hypothetical protein HPB128_180g16 [Helicobacter pylori B128] gi|298355720|emb|CBI66592.1| glycosyl transferase, family 25 [Helicobacter pylori B8] Length = 273 Score = 149 bits (376), Expect = 4e-34, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 93/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSF-----FDAIYGENNP----- 43 + V++ISL +K C + + ++ + FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFDLVFRDTTTLLNNINATRHKAQIFDAIYSKTFEGGLHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWQKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S + Sbjct: 115 IELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDTSVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A L + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|114626942|ref|XP_001157210.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Pan troglodytes] Length = 595 Score = 149 bits (376), Expect = 4e-34, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 77/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + Sbjct: 442 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRCM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 536 >gi|256044317|ref|ZP_05447221.1| hypothetical protein Bmelb1R_07451 [Brucella melitensis bv. 1 str. Rev.1] gi|260563671|ref|ZP_05834157.1| LPSA protein [Brucella melitensis bv. 1 str. 16M] gi|265990729|ref|ZP_06103286.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] gi|260153687|gb|EEW88779.1| LPSA protein [Brucella melitensis bv. 1 str. 16M] gi|263001513|gb|EEZ14088.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] Length = 703 Score = 149 bits (376), Expect = 4e-34, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 91/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY SH + + +ILEDD F++ S + + K + ++ Sbjct: 60 GREMLPGEYGCYRSHSKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|321463619|gb|EFX74634.1| hypothetical protein DAPPUDRAFT_199801 [Daphnia pulex] Length = 623 Score = 149 bits (376), Expect = 5e-34, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 87/238 (36%), Gaps = 38/238 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ LQ +A+ G++ + F Sbjct: 324 IFLINLERRPERRQRMEWSLNQLGLQHKLINAVDGKSLNDSYVASLGIRMLPNFADPYHH 383 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINN 115 R +++ EIGC++SH +W+ I +II EDD F F + L L + ++ Sbjct: 384 RAMTMGEIGCFLSHYAIWQEIVDRQLSSSIIFEDDIRFESNFVKKLADLVNEVDRLQVDW 443 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--Y 173 LI R K + ++++ + S T Y + K A LL + Sbjct: 444 DLIYLGRKRLKHENETWV----EGGQLLVNVEYSYWTLSYILSKRGAEKLLKG-EPFGQL 498 Query: 174 RPIDMDM--------KHWWE-----HNIPSLVTEPGAVYEA---------IDTNDSTI 209 P+D + + W N+ + P VY DT S I Sbjct: 499 VPVDEYLPIMFDRHPESRWREAFPNRNLKAFSIAPLLVYPTHYTGEAGYISDTESSEI 556 >gi|74318875|gb|ABA02576.1| Lgt2B/C [Moraxella catarrhalis] gi|124245099|gb|ABM92441.1| Lgt2B/C [Moraxella catarrhalis] gi|326562971|gb|EGE13250.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 46P47B1] gi|326565312|gb|EGE15492.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis 12P80B1] gi|326570192|gb|EGE20237.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis BC8] Length = 254 Score = 148 bits (375), Expect = 5e-34, Method: Composition-based stats. Identities = 55/249 (22%), Positives = 95/249 (38%), Gaps = 24/249 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIK 119 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +IK Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIK 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----P 175 + +K + L + + + T GY I ++ A +L+ + P Sbjct: 118 LETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL-- 223 ID + + L P V +A DT S IE R R+ T + Sbjct: 177 IDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYGK 236 Query: 224 YFYRNTCYQ 232 +YR+ + Sbjct: 237 KYYRSIGKR 245 >gi|301598260|ref|ZP_07243268.1| LPS glycosyltransferase subfamily protein [Acinetobacter baumannii AB059] Length = 238 Score = 148 bits (375), Expect = 5e-34, Method: Composition-based stats. Identities = 58/235 (24%), Positives = 99/235 (42%), Gaps = 21/235 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKR 54 I + +SL S+ R+EK + + + F FFDAIYG+ P IF + R Sbjct: 3 KILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFQFFDAIYGKKLPNEYLIFLNLSR 62 Query: 55 QCQF--KRLLSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHL 108 Q L E+GC++SH+ +W+R+A II+EDD F ++ Q L L Sbjct: 63 QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHSL 122 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + + + + Y+ F++ P+ L T Y + +E A +N Sbjct: 123 LETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDFVNK 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRLV--RKPTF 220 + ID W++ + ++ P +E D S I R + +KP Sbjct: 183 QIKKLTYIDD-----WKYLLEGDISTPFYYCFEHDDEMQSHIASDRKIYMKKPNR 232 >gi|332832911|ref|XP_003312339.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2 [Pan troglodytes] Length = 548 Score = 148 bits (375), Expect = 6e-34, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 77/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 275 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGRMLNSSAIRSLGVDLLPGYQDPYSG 334 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 335 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 394 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + Sbjct: 395 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRCM 449 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 450 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 489 >gi|157136453|ref|XP_001656834.1| hypothetical protein AaeL_AAEL003481 [Aedes aegypti] gi|122095142|sp|Q17FB8|GLT25_AEDAE RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor gi|108881003|gb|EAT45228.1| conserved hypothetical protein [Aedes aegypti] Length = 607 Score = 148 bits (375), Expect = 6e-34, Method: Composition-based stats. Identities = 42/185 (22%), Positives = 72/185 (38%), Gaps = 13/185 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 +Y+I+L RR K + + L FF A+ G F K Sbjct: 325 IYMINLERRPERRNKMFNNFDELGLDVEFFPAVDGRQLSDDKLRDIGVKFLPGYADPYHK 384 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI---NNI 116 R +++ EIGC++SH ++W+++ ++LEDD F F + + + Sbjct: 385 RPMTMGEIGCFLSHYYIWEKMVAMNQEEVLVLEDDIRFEPYFKRRVAQVLDDARRIGGWD 444 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRP 175 LI F R + + ++ G+ + S T GY I + A LL P Sbjct: 445 LIYFGRKRLQEDDEKWVV---GSETLVVA-GYSYWTLGYLISLQGAKKLLEEKPLEKLVP 500 Query: 176 IDMDM 180 +D + Sbjct: 501 VDEYI 505 >gi|108563237|ref|YP_627553.1| beta-1,4-galactosyltransferase [Helicobacter pylori HPAG1] gi|107837010|gb|ABF84879.1| beta-1,4-galactosyltransferase [Helicobacter pylori HPAG1] Length = 273 Score = 148 bits (375), Expect = 6e-34, Method: Composition-based stats. Identities = 54/280 (19%), Positives = 88/280 (31%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDTFGLVFRDT---TTLLNSINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFIAQNIKDMGITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWQKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD + F + L L K ++ L P S + + Sbjct: 118 NE-AICILEDDITLKEGFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLSHKNQEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ ++ Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|13474441|ref|NP_106009.1| glycosyl transferase [Mesorhizobium loti MAFF303099] gi|14025194|dbj|BAB51795.1| glycosyl transferase [Mesorhizobium loti MAFF303099] Length = 265 Score = 148 bits (373), Expect = 9e-34, Method: Composition-based stats. Identities = 46/246 (18%), Positives = 75/246 (30%), Gaps = 18/246 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VI+L S R ARI + F + G+ +P + R L Sbjct: 1 MKRLVINLDRSPERLAHMSKEFARIGIGFERVAGVDGKEHPDLML---QPQHAIHATRPL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EI C SH W IA A + EDD FS + LL S + ++K + Sbjct: 58 SGSEIACLHSHRACWTIIAQDEAPYGAVFEDDMVFSSKAGALLGDTSWVPADADVVKLET 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + F + + R P GY + + A L + +D + Sbjct: 118 FFSRTVIQRRRISAGHGFSMFRLRKGHPGAGGYLLSRRTARDFLEATAQVNIAVDDLIFD 177 Query: 183 WWEH-NIPSLVTEPGAVYEA---IDTNDSTIEESR-----------LVRKPTFSPLYFYR 227 P + S +++ R +KP + Sbjct: 178 PIISAGKTIYQLVPALCAQDQFVGGRLPSLLDQERDAARTASGLMTRQKKPMAVRISREA 237 Query: 228 NTCYQW 233 +W Sbjct: 238 GRTARW 243 >gi|148825759|ref|YP_001290512.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittEE] gi|148715919|gb|ABQ98129.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittEE] Length = 257 Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats. Identities = 56/232 (24%), Positives = 88/232 (37%), Gaps = 25/232 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRKLFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYFR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I A++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAEDSYALVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I D P + GN + P T GY I K AA Sbjct: 120 IILLGQSKINSFNDTDLEINYPTTFLFYAKKQGNVNYAFPYKSYFAGTVGYLIKKSAARR 179 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + K + D + + NI + V P V E S +E R Sbjct: 180 FIQQISQNKPFWLA-DDFLLFEQDFNIRNKVVRPLMVIENP-VLISNLESVR 229 >gi|226328064|ref|ZP_03803582.1| hypothetical protein PROPEN_01955 [Proteus penneri ATCC 35198] gi|225203768|gb|EEG86122.1| hypothetical protein PROPEN_01955 [Proteus penneri ATCC 35198] Length = 245 Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats. Identities = 60/246 (24%), Positives = 100/246 (40%), Gaps = 24/246 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL- 61 + +VI+L +R+ ++ I + + F +AI G + +Q + Sbjct: 1 MNTFVINLKDDLLKRDFMRNQLDNIGISYQFIEAIDGRIMS------DNTIKQLAYDYPD 54 Query: 62 --LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL---PHLSKCDINNI 116 L+ EIGC +SH+ ++K++ A+ILEDDA L ++ K N+ Sbjct: 55 CYLTKGEIGCTLSHMAIYKKMIEDNIEIALILEDDALLPSNIYSKLIDIKNIDKSSKPNV 114 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I K +SY+ N +I+ +S T GY I K AA LL+ I Sbjct: 115 YIL-------TKTESYIKNKQLNNNIYYAYSVS-GTYGYVINKAAAKSLLSKLTPIKYEA 166 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAI-DTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL 235 DM + I P + D++ S+I + RL+RK YR+ + Sbjct: 167 DMWSTFKRQGLINLYCHIPHLIDNNDKDSSQSSIHQERLIRKNKRE---HYRHLLKKKEP 223 Query: 236 HYNAWR 241 +Y R Sbjct: 224 NYQFNR 229 >gi|73968112|ref|XP_851283.1| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Canis familiaris] Length = 595 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 75/220 (34%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLSSLWEMEVSGRVVDAVDGRTLNSSLMRSLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARRLARILVFEDDVRFESNFRGRLERLMEEVEAEKLPW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P++++ + LP S T Y + A LL + ++R Sbjct: 442 DLIYLGRKQVNPEEEAAVEGLPH----LVVAGYSYWTLAYVLSLAGARKLL-ASQPLHRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + P Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLRAFSARPLLAAP 536 >gi|46411176|ref|NP_997181.1| glycosyltransferase 25 family member 3 precursor [Mus musculus] gi|160395523|sp|A3KGW5|GT253_MOUSE RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|126540790|emb|CAM46219.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus] gi|148676479|gb|EDL08426.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus] gi|187953029|gb|AAI38848.1| Cerebral endothelial cell adhesion molecule [Mus musculus] Length = 592 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 87/264 (32%), Gaps = 33/264 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 V+VISL RR + + + DA+ G + Sbjct: 319 VFVISLARRPQRRARMLSSLWEMEISAQVVDAVDGRTLNSSILKHLGVDLLPGYQDPYSG 378 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 L+ E+GC++SH +W+ + ++ EDD F D F + L L + ++ Sbjct: 379 HTLTKGEVGCFLSHYSIWEEVVARGLARVVVFEDDVRFKDNFRRRLERLMEDVLIQKLSW 438 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+++ + LPG S T Y + A LL + ++R Sbjct: 439 DLIYLGRKQVNPEEEVAVEGLPG----LVVAGYSYWTLAYTLSLAGARKLL-ASQPLHRM 493 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTND----STIEESRLVR 216 P+D + ++W ++ + P + D S E S Sbjct: 494 LPVDEFLPVMFDRHPNDQYKAYFWPRDLQAFSARPLLASPTHYSGDAEWLSDTETSSPWD 553 Query: 217 KPTFSPLYFYRNTCYQWNLHYNAW 240 + + + H + Sbjct: 554 DDSGRLISQTGSQKALRGPHLHLT 577 >gi|317182139|dbj|BAJ59923.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F57] Length = 275 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 92/287 (32%), Gaps = 59/287 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD + F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKENFKEGLDFLEKHIQELGYARLMHLLYDASVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 + A D STI P + R +++ ++ + + Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQLYNQ 275 >gi|260432180|ref|ZP_05786151.1| glycosyl transferase family 25 [Silicibacter lacuscaerulensis ITI-1157] gi|260416008|gb|EEX09267.1| glycosyl transferase family 25 [Silicibacter lacuscaerulensis ITI-1157] Length = 239 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 90/247 (36%), Gaps = 14/247 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKR 60 + ++I + S ARR A + +A+ G + + + ++ Sbjct: 1 MRSFIIHMSSSTARRPNAEKLCAELP-NAELIEAVDGRDPAQIAGVTLHPGNLHRPRYPF 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNILIK 119 +L EIG + SH +W+++ AI EDD + + L L + I+ Sbjct: 60 VLRPAEIGVFESHRRIWRKVVDEGIDLAITAEDDLRIDPDRMNAALDLLRPIATPDHYIR 119 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K+ + + + P+++ + +G++AA LL I RP+D Sbjct: 120 IPV--KQRETAAQTLAEHDGLRLILPKVIGLQCVCQVVGRKAAERLLAATDQIDRPVDTF 177 Query: 180 MKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRLVRKPTF--SPLYFYRNTCYQWNLH 236 ++ W P E A + STI+ KP ++ Y+ LH Sbjct: 178 LQMHWITGQPVHSLLGSGNREVAGEIGGSTIQ-----TKPPLTDKLTREFKRAAYRAQLH 232 Query: 237 YNAWRKD 243 + R D Sbjct: 233 LHPQRPD 239 >gi|260912785|ref|ZP_05919271.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Pasteurella dagmatis ATCC 43325] gi|260633163|gb|EEX51328.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Pasteurella dagmatis ATCC 43325] Length = 284 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 92/245 (37%), Gaps = 20/245 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL + RR+ + ++ F FFDA+ + + Q + L Sbjct: 7 LKNYVISLSSATDRRKHILEEFYQHNIPFLFFDAL---SPSLEFNQLVEQLIPNLSRCNL 63 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NNIL 117 + E GC +SH+ LW + + EDD + + L +N + Sbjct: 64 TNGEKGCLMSHLSLWHKCVEENLPYIGVFEDDILLGKDAEEFLASDEWIHSRFQHGDNFI 123 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 I+ + +K D+ + + D + R T GY I AA + LN+ N+ Sbjct: 124 IRLETFLQKVLCDATNISPYLDRDFLKLRSTHFGTAGYIISFSAAKYFLNIFANMKEDEI 183 Query: 174 RPIDMD-------MKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 PID M ++ + + P++ + V + S +E+ R++ + Sbjct: 184 EPIDQLIFDRFLPMDNFQVYQLSPAICIQELQVNQEHSLLSSQLEDERIIARLDRDRKRK 243 Query: 226 YRNTC 230 + Sbjct: 244 QKKIK 248 >gi|317009163|gb|ADU79743.1| beta-1,4-galactosyltransferase [Helicobacter pylori India7] Length = 273 Score = 147 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 91/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQFSF-----FDAIYGENNP----- 43 + V++ISL ++ C + ++ + FDAIY + Sbjct: 1 MRVFIISLN------QRVCDQFGLVFRDTTTLLHNINATRHKAQIFDAIYSKTFEGGLHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKEHLHPYFTTQNIKDMGITTNLISGVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD + F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKENFKEGLDFLEKNIQELGYARLMHLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A L + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFLKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|301344884|ref|ZP_07225625.1| LPS glycosyltransferase subfamily protein [Acinetobacter baumannii AB056] Length = 271 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. Identities = 58/235 (24%), Positives = 99/235 (42%), Gaps = 21/235 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKR 54 I + +SL S+ R+EK + + + F FFDAIYG+ P IF + R Sbjct: 3 KILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFQFFDAIYGKKLPNEYLIFLNLSR 62 Query: 55 QCQF--KRLLSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHL 108 Q L E+GC++SH+ +W+R+A II+EDD F ++ Q L L Sbjct: 63 QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHSL 122 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + + + + Y+ F++ P+ L T Y + +E A +N Sbjct: 123 LETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDFVNK 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRLV--RKPTF 220 + ID W++ + ++ P +E D S I R + +KP Sbjct: 183 QIKKLTYIDD-----WKYLLEGDISTPFYYCFEHDDEMQSHIASDRKIYMKKPNR 232 >gi|262380196|ref|ZP_06073351.1| LPS glycosyltransferase subfamily [Acinetobacter radioresistens SH164] gi|262298390|gb|EEY86304.1| LPS glycosyltransferase subfamily [Acinetobacter radioresistens SH164] Length = 271 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. Identities = 58/235 (24%), Positives = 99/235 (42%), Gaps = 21/235 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKR 54 I + +SL S+ R+EK + + + F FFDAIYG+ P IF + R Sbjct: 3 KILILCVSLQGSNERQEKIKQQHQNLQSTIHDIQIDFEFFDAIYGKKLPNEYLIFLNLSR 62 Query: 55 QCQF--KRLLSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHL 108 Q L E+GC++SH+ +W+R+A II+EDD F + Q L L Sbjct: 63 QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYTAYDRIIIIEDDVIFQTKHIQQRLHSL 122 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + D + + + + + Y+ F++ P+ L T Y + +E A ++ Sbjct: 123 LQTDPSFAFLGGHSEPSRRRIRGYISNDELYFNMTGPKDLYTATYAYSLTRETAQDFIHK 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRLV--RKPTF 220 + ID W++ + ++ P +E D S I R + +KP Sbjct: 183 QIKKLTYIDD-----WKYLLEGDISTPFYYCFEHDDEMQSYIASDRKIYMKKPNR 232 >gi|47220022|emb|CAG12170.1| unnamed protein product [Tetraodon nigroviridis] Length = 635 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 93/241 (38%), Gaps = 37/241 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + +A+ G+ I + Sbjct: 356 VFMINLKRRTDRRERMLRALYEQKISCKVVEAVDGKAMNISEIHALGIHMLPGYSDPYHG 415 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ E+GC++SH ++WK I ++++EDD F F + L + + ++ Sbjct: 416 RPLTKGELGCFLSHYNIWKEIVKRRLHTSLVIEDDLRFEVFFKRRLMDLMTEVEEEGLDW 475 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI RK+ + + +P ++ + S T GY I + A LL + + R Sbjct: 476 DLIYIG--RKRMQVERPEKAVPNIHNLVEA-DYSYWTLGYMISLQGAEKLLK-AEPLKRV 531 Query: 175 -PID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+ + ++ + EP VY DT ST+ + Sbjct: 532 LPVDEFLPIMYNKHPESDYMEQFESRDLKAFSVEPLLVYPTHYTGDLGYISDTETSTVWD 591 Query: 212 S 212 + Sbjct: 592 N 592 >gi|188527328|ref|YP_001910015.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Shi470] gi|188143568|gb|ACD47985.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Shi470] Length = 273 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 92/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWQKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSQKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|307637518|gb|ADN79968.1| lipo oligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 908] Length = 273 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 86/280 (30%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDKFGLVFRDT---TTLLNNINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFITQNIKGMGIATNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDS-----YLCTLPG 138 ILEDD ++F + L L K + L S + Sbjct: 118 DEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVKSEPLIHKNHEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFLKHSQKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ ++ Sbjct: 237 --ADDEQISTIARKEESYSPK---IALMRELHFKFLKYWQ 271 >gi|145228085|gb|ABP48752.1| beta(1-4) galactosyltransferase [Moraxella catarrhalis] Length = 254 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 54/249 (21%), Positives = 95/249 (38%), Gaps = 24/249 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIK 119 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +IK Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIK 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----P 175 + +K + L + + + T GY I ++ A +L+ + P Sbjct: 118 LETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL-- 223 ID + + L P V +A DT S +E R R+ T + Sbjct: 177 IDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLLETQRKQNVNQHRRRTLADYGK 236 Query: 224 YFYRNTCYQ 232 +YR+ + Sbjct: 237 KYYRSIGKR 245 >gi|7959265|dbj|BAA96026.1| KIAA1502 protein [Homo sapiens] Length = 560 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 321 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 380 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 381 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 440 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 441 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 495 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 496 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 535 >gi|317014235|gb|ADU81671.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Gambia94/24] Length = 273 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 86/280 (30%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDKFGLVFRDT---TTLLNNINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFITQNIKGMGIATNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD ++F + L L K + L S + Sbjct: 118 DEP-ICILEDDITLKEDFKEGLDFLEKHIHELGYARLMHLLYDASVKSEPLNHKNQEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ ++ Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|317180590|dbj|BAJ58376.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F32] Length = 273 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 90/283 (31%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKVQIFDAIYSKAFESELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDANVKSEPLNHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKAFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|15611832|ref|NP_223483.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155330|gb|AAD06344.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 273 Score = 147 bits (370), Expect = 2e-33, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 85/280 (30%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDKFGLVFRDT---TTLLNNINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFITQNIKDMGIATSLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWQKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD ++F + L L K + L S + Sbjct: 118 NE-AICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVKSEPLNHKNQEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFLKHSRKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ + Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRELHFKFLKWWQ 271 >gi|313143720|ref|ZP_07805913.1| lipooligosaccharide 5G8 epitope biosynthesis-protein [Helicobacter cinaedi CCUG 18818] gi|313128751|gb|EFR46368.1| lipooligosaccharide 5G8 epitope biosynthesis-protein [Helicobacter cinaedi CCUG 18818] Length = 250 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 95/256 (37%), Gaps = 24/256 (9%) Query: 20 FCHRA----ARIHLQFS--FFDAIYGENNP-ICNRIFSHQKRQCQFKRLLSLPEIGCYIS 72 + +++ ++ FF+AI + + + +S K + LS E C+ S Sbjct: 1 MSKQFENLDSKLKEKYEIIFFNAINAQAKEHLNFKQYSKIKSILFRGKELSDGERACFAS 60 Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 H LW++ S I+LEDD + F Q L + + ++ L K K Sbjct: 61 HYTLWQKCVESNEP-IIVLEDDVEILPHFWQELERIEQ--SAFAYVRLMYLITKAKFYDL 117 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLV 192 F+ T GY++ AA + K+ YRP+D M ++ H IP + Sbjct: 118 QKGFHFTFNNV------AGTQGYYLTPTAAKAFIESAKSWYRPVDDYMDMFYIHKIPIVC 171 Query: 193 TEPGAVYEAIDTNDSTIEESR----LVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVS 248 +P +T S + + + L+ R + + LP S Sbjct: 172 VKPVIQEVRANTTISGRWQKPSLLLKIVRECVRLLFQTRRFVFLFFFKNTLL---LPKES 228 Query: 249 TTKF-LPSSSSSLIKS 263 K L +S +++S Sbjct: 229 LQKIHLGRENSEILRS 244 >gi|71149101|gb|AAZ29051.1| Lgt2 [Moraxella catarrhalis] Length = 254 Score = 146 bits (369), Expect = 3e-33, Method: Composition-based stats. Identities = 55/249 (22%), Positives = 94/249 (37%), Gaps = 24/249 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC-QFKRL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISVKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPITNNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIK 119 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +IK Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIK 117 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----P 175 + +K + L + + + T GY I ++ A +L+ + P Sbjct: 118 LETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFP 176 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL-- 223 ID + L P V +A DT S IE R R+ T + Sbjct: 177 IDHVIFDAIISKKSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYGK 236 Query: 224 YFYRNTCYQ 232 +YR+ + Sbjct: 237 KYYRSIGKR 245 >gi|229844633|ref|ZP_04464772.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 6P18H1] gi|229812347|gb|EEP48037.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 6P18H1] Length = 257 Score = 146 bits (369), Expect = 3e-33, Method: Composition-based stats. Identities = 50/218 (22%), Positives = 85/218 (38%), Gaps = 12/218 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ K L Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNII-FDRSSKATLCDA 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ D + ++K + L Sbjct: 68 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NVDYIDSDTDILKLETLYG 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPIDMDM--K 181 K + + +I + GY I + A +LLN K + ID + Sbjct: 127 K-IMLGKMYQIKCERNIFHLKFKHTGMGGYSITAKGAKYLLNKIKNTRLDLAIDTLIFDD 185 Query: 182 HWWEHNIPSLVTEPG-----AVYEAIDTNDSTIEESRL 214 +++ + P + +S+++E RL Sbjct: 186 LLSQNDYKVMQLFPAVCIQDCILHGKSHFESSLDEGRL 223 >gi|157885917|emb|CAP09301.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 598 Score = 146 bits (369), Expect = 3e-33, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 96/240 (40%), Gaps = 35/240 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 VY+I+L RR++ + + + DA+ G+ + FK Sbjct: 332 VYLINLKRREDRRDRMLRSLEVLGIDVTLTDAVDGKAMNSTQLRTLGIEMLPGFKDPYSD 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+L+ EIGC++SH ++WK++ ++LEDD F F L + + D+ ++ Sbjct: 392 RVLTKGEIGCFLSHYNIWKKVVELQQQQVLVLEDDVRFETNFKSRLNTIME-DVKRSGLQ 450 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--R 174 +D + RK+ + + G ++ P S T GY + + A LL + + Sbjct: 451 WDLIYVGRKRLQIKHPERWVEGVKNLVSP-DYSYWTLGYALSLQGAKILLE-AQPLSKML 508 Query: 175 PID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEES 212 P+D M H+ N+ + +P ++ DT S+I + Sbjct: 509 PVDEFLPVMFNKHPKKEYMSHFQLRNLKAFSVQPLLLFPTHYTGEAGYFSDTETSSIWDD 568 >gi|308182985|ref|YP_003927112.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori PeCan4] gi|308065170|gb|ADO07062.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori PeCan4] Length = 273 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 55/283 (19%), Positives = 94/283 (33%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIY-----GENNP 43 + V++ISL +K C + ++ FDAIY GE +P Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 ICNRIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLIGRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQEHVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI + P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEELYSPK---IALMRKLHFKYLKYWQ 271 >gi|325996107|gb|ADZ51512.1| putative lipopolysaccharide biosynthesis protein/Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997703|gb|ADZ49911.1| putative lipo oligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 2017] Length = 273 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 86/280 (30%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDKFGLVFRDT---TTLLNNINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFITQNIKGMGIATNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDS-----YLCTLPG 138 ILEDD ++F + L L K + L S + Sbjct: 118 DEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVKSEPLIHKNHEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFLKHSQKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ ++ Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRELHFKFLKYWQ 271 >gi|170038076|ref|XP_001846879.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167881499|gb|EDS44882.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 496 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 40/182 (21%), Positives = 72/182 (39%), Gaps = 7/182 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-----ICNRIFSHQKRQCQFK 59 +Y+I+L RR K + + L+ +F A+ G+ F K Sbjct: 211 IYMINLERRPERRLKMVNNFDALGLEVEYFPAVDGKKLNDDVLREIGVEFLPGYTDPYHK 270 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH ++W+R+ ++LEDD F F + + + Sbjct: 271 RPMTMGEIGCFLSHYYIWERMVELGQEEVLVLEDDIRFEPYFKRRVYQVLNDARRIGGWD 330 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDM 178 +K ++ + G+ + Q S T GY I + A LL P+D Sbjct: 331 LIYFGRKRLQEDDERWIEGSDSLVQA-GYSYWTLGYLINLQGAKKLLAENPLKKLVPVDE 389 Query: 179 DM 180 + Sbjct: 390 YI 391 >gi|18129284|emb|CAD20562.1| putative glycosyltransferase [Brucella abortus] Length = 703 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 50/259 (19%), Positives = 92/259 (35%), Gaps = 10/259 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFS--FFDAIYGENNPICNRIFSHQKRQCQF 58 M +PV+VI++ A + F DA+ G + Sbjct: 1 MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTAT-QRIGIDDARFDAIN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R + E GCY +H+ + + +ILEDD F++ S + + + + ++ Sbjct: 60 GREMLPGEYGCYRNHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIESLPDFDVV 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 K R P S L T G+ Y + +E A LL+ + P D+ Sbjct: 120 KLVNHRS-PLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL----VRKPTFSPL--YFYRNTCYQ 232 M+ +W H ++ + + + S I + + P + L +R Y Sbjct: 179 AMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYY 238 Query: 233 WNLHYNAWRKDLPPVSTTK 251 +H+ + P S+ K Sbjct: 239 VRVHHTLLQPQNPDGSSMK 257 >gi|126322946|ref|XP_001368839.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 623 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 47/226 (20%), Positives = 89/226 (39%), Gaps = 26/226 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ DAI G + + Sbjct: 344 VFMINLKRRLDRRERMLRTLHEQEIECKIVDAIDGRAMNTSQVEALGIRMLPGYQDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +WK + +++ EDD F F + L +L DI ++ Sbjct: 404 RPLTKGELGCFLSHHQIWKEVVERGLEKSLVFEDDLRFEIFFKRRLMNLM-YDIEEEGLE 462 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--R 174 +D + RK+ + + +P ++ + S T Y + + A LL + + Sbjct: 463 WDLIYVGRKRMQVEHPEKAVPHVRNLVEA-DYSYWTLAYVLSLQGAQKLL-AAQPLSKML 520 Query: 175 PIDMDM-------------KHWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + +H+ + ++ + EP VY T D Sbjct: 521 PVDEFLPVMFDKHPVSDYKEHFEKRDLLAFSVEPLLVYPTHYTGDD 566 >gi|325268033|ref|ZP_08134679.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] gi|324980418|gb|EGC16084.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] Length = 277 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 85/234 (36%), Gaps = 26/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YV+S+ + RR+ + + F FFDA+ E + I H L Sbjct: 1 MENYVVSIRTAAQRRQHVAAEFNKHQIAFHFFDAVTPET--LAESIAEHCPNLA--DAFL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN----ILI 118 + E GC++SH+ LW + I EDD F S+ L D +I Sbjct: 57 TGGEKGCFMSHVCLWAKCVADDLPYIGIFEDDVIFGQNSSRFLNDTKWLDERFQNQSFII 116 Query: 119 KFDALRKK--PKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI---- 172 + + K I + + T Y I ++ AI LLN + + Sbjct: 117 RMETFLKANPVALSKSGVRPFNGRKILRLQSFGFGTAAYLISQQTAITLLNWIREVAPEK 176 Query: 173 YRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 PID + + + P+L + + A + ST+E+ RL R Sbjct: 177 LEPIDNMLFNAASEIPEIQMYQIS--PALCIQELQLNRADSSLSSTLEDGRLAR 228 >gi|319779934|ref|YP_004139410.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317165822|gb|ADV09360.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 274 Score = 146 bits (368), Expect = 4e-33, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 79/220 (35%), Gaps = 5/220 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VI+L S R + ARI + F AI + P +++ + + + L Sbjct: 1 MKRLVINLDRSPDRLARVTAEFARIGVAFERVQAIDARDRPELDQLPQNVGYRNRL--PL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EI C +SH W IA A I EDD F+ + LL + ++K ++ Sbjct: 59 TDGEIACLLSHRACWAVIAQGDARYGAIFEDDIVFAAKAGALLRDSDWIPADAEVVKLES 118 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM-- 180 K + F + T GY I ++ A+ L+ + I +D + Sbjct: 119 YFCKTVIRRKRIAIGHGFSTSKLDAEHIGTAGYIISRQTALDLIKATEEIGIAVDDLVFN 178 Query: 181 -KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT 219 + P + D ++ L+R+ Sbjct: 179 PEFSVLRGETIYQLVPALCAQDQFLGDRALQLPSLLREKR 218 >gi|308062146|gb|ADO04034.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Cuz20] Length = 273 Score = 146 bits (368), Expect = 4e-33, Method: Composition-based stats. Identities = 55/283 (19%), Positives = 93/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIY-----GENNP 43 + V++ISL +K C + ++ FDAIY GE +P Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 ICNRIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISGVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S + Sbjct: 115 IELNEP-VCILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIVRKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|326664713|ref|XP_686329.4| PREDICTED: procollagen galactosyltransferase 2 [Danio rerio] Length = 584 Score = 146 bits (368), Expect = 4e-33, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 96/240 (40%), Gaps = 35/240 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 VY+I+L RR++ + + + DA+ G+ + FK Sbjct: 318 VYLINLKRREDRRDRMLRSLEVLGIDVTLTDAVDGKAMNSTQLRTLGIEMLPGFKDPYSD 377 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R+L+ EIGC++SH ++WK++ ++LEDD F F L + + D+ ++ Sbjct: 378 RVLTKGEIGCFLSHYNIWKKVVELQQQQVLVLEDDVRFETNFKSRLNTIME-DVKRSGLQ 436 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--R 174 +D + RK+ + + G ++ P S T GY + + A LL + + Sbjct: 437 WDLIYVGRKRLQIKHPERWVEGVKNLVSP-DYSYWTLGYALSLQGAKILLE-AQPLSKML 494 Query: 175 PID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEES 212 P+D M H+ N+ + +P ++ DT S+I + Sbjct: 495 PVDEFLPVMFNKHPKKEYMSHFQLRNLKAFSVQPLLLFPTHYTGEAGYFSDTETSSIWDD 554 >gi|262371063|ref|ZP_06064385.1| glycosyl transferase [Acinetobacter johnsonii SH046] gi|262313949|gb|EEY94994.1| glycosyl transferase [Acinetobacter johnsonii SH046] Length = 256 Score = 145 bits (367), Expect = 4e-33, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 84/225 (37%), Gaps = 15/225 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VISL + ARRE + ++ F FFDA+ P R + + + L Sbjct: 1 MKNIVISLKTATARREHIVQEFGKQNIGFEFFDAL----TPDLARPLAEKMQLNVEDEYL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C++SH+ +W+++ I EDD + + S + +IK +A Sbjct: 57 TGGELACFMSHVSIWQKMVDEQLPYVAIFEDDVFLGERAFDIFNSSSWIQEDWHIIKTEA 116 Query: 123 LRKKPKKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-PIDMDM 180 +K + + I + + T GY + + A+ LN K P+D M Sbjct: 117 FSEKVLLAKEVYRINDTKRQITRLTGKNLGTAGYILSLKGAMAYLNYIKKSSLIPLDELM 176 Query: 181 KHWWEH--NIPSLVTEP-GAVYEAI------DTNDSTIEESRLVR 216 + H P + E + S + + R R Sbjct: 177 FRDFIHAKQFNVYQMSPASCIQEMMLYPEKKTILSSDLIDERKAR 221 >gi|240949472|ref|ZP_04753812.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor NM305] gi|240296045|gb|EER46706.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor NM305] Length = 210 Score = 145 bits (367), Expect = 5e-33, Method: Composition-based stats. Identities = 47/201 (23%), Positives = 78/201 (38%), Gaps = 9/201 (4%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR + ++ F FFDAI + + FS L+ Sbjct: 10 YVISLLTAKERRNHIIQEFGKQNIPFEFFDAITPDLIEEKAKEFSIDISNS----PLTKG 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E+GC +SHI LW + I EDD + + L + S + + +IK + Sbjct: 66 EMGCALSHIALWHFAKENKLDYICIFEDDIYLGENAKEFLTN-SYVNHDIDVIKIEKHSS 124 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PIDMDM--K 181 K + N ++H+ + T GY + + +LL K + P+D+ M K Sbjct: 125 KIIYAANPEAHFCNRNLHKLKSKHTGTAGYILTAKGIKYLLEKTKVYHLSIPVDVLMFEK 184 Query: 182 HWWEHNIPSLVTEPGAVYEAI 202 + + L P + Sbjct: 185 FLKKSDYKVLQLIPAVCIQDD 205 >gi|193078329|gb|ABS90266.2| hypothetical protein A1S_3841 [Acinetobacter baumannii ATCC 17978] Length = 271 Score = 145 bits (367), Expect = 5e-33, Method: Composition-based stats. Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 21/235 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKR 54 I + +SL S+ R+EK + + + F FF+AIYG+ P F + R Sbjct: 3 KILILCVSLKGSNERQEKIKQQHQSLQSVIHDIQIDFEFFEAIYGKKLPNEYLTFLNLSR 62 Query: 55 QCQF--KRLLSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHL 108 Q L E+GC++SH+ +W+R+A II+EDD F ++ Q L L Sbjct: 63 QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHSL 122 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + + + + Y+ F++ P+ L T Y + +E A +N Sbjct: 123 LETNPAFAFLGGHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDFVNK 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRL--VRKPTF 220 + ID W++ + ++ P +E D S I R ++KP Sbjct: 183 QIKKLTYIDD-----WKYLLEGDISTPFYYCFEHDDEMQSHIASDRKTYMKKPNR 232 >gi|217034127|ref|ZP_03439547.1| hypothetical protein HP9810_868g20 [Helicobacter pylori 98-10] gi|216943411|gb|EEC22867.1| hypothetical protein HP9810_868g20 [Helicobacter pylori 98-10] Length = 273 Score = 145 bits (366), Expect = 5e-33, Method: Composition-based stats. Identities = 54/284 (19%), Positives = 92/284 (32%), Gaps = 61/284 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL------RKKPKKDSYLC 134 ILEDD ++F + L L K + L R +P Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDANVRSEPLSHKNHE 173 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTE 194 + I + T GY I + A + P+D M + H + +LV + Sbjct: 174 -IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQ 232 Query: 195 PGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 P + A D STI P + R +++ ++ Sbjct: 233 PFVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKFVKYWQ 271 >gi|6630626|dbj|BAA88524.1| beta-1,4-galactosyltransferase [Helicobacter pylori] Length = 273 Score = 145 bits (366), Expect = 6e-33, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 87/280 (31%), Gaps = 53/280 (18%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H Q FDAIY + Sbjct: 1 MRVFIISLNQKVCDTFGLVFRDT---TTLLNNINATHHQAQIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + + + +SL E+GCY SH LW++ Sbjct: 58 KHLHPYFITQNIKDMGITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKCIEL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD ++F + L L K ++ L P S + Sbjct: 118 NE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLNHKNHEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A + P+D M + H + +LV +P + Sbjct: 177 RVGIIKAYSHGVGTQGYVITPKIAKVFKKHSRKWVVPVDTIMDATFIHGVKNLVLQPFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 A D STI P + R +++ ++ Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|15645445|ref|NP_207619.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] gi|2313960|gb|AAD07876.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 26695] Length = 273 Score = 145 bits (366), Expect = 6e-33, Method: Composition-based stats. Identities = 44/205 (21%), Positives = 73/205 (35%), Gaps = 11/205 (5%) Query: 39 GENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 G + + + + +SL E+GCY SH LW++ ILEDD Sbjct: 73 GITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKCIELNE-AICILEDDITLK 131 Query: 99 DEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPGNFDIHQPRILSPRTT 153 ++F + L L K I+ L S + + I + T Sbjct: 132 EDFKEGLDFLEKHIQELGYIRLMHLLYDASVKSEPLSHKNHEIQERVGIIKAYSEGVGTQ 191 Query: 154 GYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 GY I + A L + P+D M + H + +LV +P + A D STI Sbjct: 192 GYVITPKIAKVFLKCSRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKE 249 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYN 238 P + R +++ ++ Sbjct: 250 EPYSPK---IALMRELHFKYLKYWQ 271 >gi|297380029|gb|ADI34916.1| Hypothetical protein HPV225_0845 [Helicobacter pylori v225d] Length = 273 Score = 145 bits (366), Expect = 6e-33, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 92/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQFSF-----FDAIYGENNP----- 43 + V++ISL +K C + ++ + FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATRHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|326930289|ref|XP_003211280.1| PREDICTED: glycosyltransferase 25 family member 3-like [Meleagris gallopavo] Length = 541 Score = 145 bits (366), Expect = 6e-33, Method: Composition-based stats. Identities = 43/241 (17%), Positives = 78/241 (32%), Gaps = 38/241 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 +++I+L RR++ + + DA+ G + Sbjct: 274 IFLINLVRRPDRRQRMLESLQELEIAVRVVDAVDGSTLNSSDIKVLGVDLLPGYYDPFSG 333 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK----CDINN 115 R L+ E+GC++SH ++WK I ++ EDD F F L L + + Sbjct: 334 RTLTKGEVGCFLSHYNIWKEIVSRGLQRVLVFEDDVRFEVSFPARLQRLMEELEGARHDW 393 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 L+ + + ++ + S T Y I A LL + + Sbjct: 394 DLVYLGRKQVNAEDEAPV----EGVRNLVVAGYSYWTLAYAISLRGAQKLL-AAEPLSKM 448 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +H+ ++ P VY DT STI + Sbjct: 449 LPVDEFLPIMYDKHPNEDYKRHFAPRDLQVFSAHPLLVYPTHYAGDSNWLSDTETSTIWD 508 Query: 212 S 212 Sbjct: 509 D 509 >gi|308184613|ref|YP_003928746.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori SJM180] gi|308060533|gb|ADO02429.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori SJM180] Length = 273 Score = 145 bits (366), Expect = 7e-33, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 91/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + +++ISL +K C + ++ FDAIY + Sbjct: 1 MRIFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD + F + L L K + L S + Sbjct: 115 IELNEP-VCILEDDITLKEGFKEGLDFLEKHIQELGYARLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQEHVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|162951747|gb|ABY21735.1| LD07116p [Drosophila melanogaster] Length = 639 Score = 145 bits (366), Expect = 7e-33, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 100/281 (35%), Gaps = 28/281 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +++I+L RREK I ++ F A+ G+ + F Sbjct: 362 IFMINLKRRPERREKMERLFDEIGIEAEHFPAVDGKELSTERLLEMGVRFLPGYEDPYHH 421 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH ++W + +ILEDD F F Q + N Sbjct: 422 RAMTMGEIGCFLSHYNIWVMMVRKQLKEVLILEDDIRFEPYFRQNAVRILNQARNAAQYD 481 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PID 177 +K K+ + ++ S T GY I + A+ LL K + + P+D Sbjct: 482 LIYFGRKRLKEESEPAVENADNLVHA-GYSYWTLGYVISLQGALKLL-AAKPLDKLIPVD 539 Query: 178 MDMKHWW-------------EHNIPSLVTEPGAVYEAIDTN----DSTIEESRLVRKPTF 220 + + + N+ + P +Y T S E+S+ + T Sbjct: 540 EFLPLMFDRHPNKTWTEAFPKRNLVAFSASPLLLYPIYYTGESGYISDTEDSQQISVETS 599 Query: 221 SPL-YFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 ++ Q H ++ + P + + L S L Sbjct: 600 EEGEARLKSDREQVFDHEQEFKLN-PELKLGESLSKSHQEL 639 >gi|119608196|gb|EAW87790.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_d [Homo sapiens] Length = 534 Score = 145 bits (366), Expect = 7e-33, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 261 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 320 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 321 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 380 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 381 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 435 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 436 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 475 >gi|119608194|gb|EAW87788.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_b [Homo sapiens] Length = 539 Score = 145 bits (365), Expect = 7e-33, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 266 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 325 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 326 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 385 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 386 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 440 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 441 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 480 >gi|308063385|gb|ADO05272.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori Sat464] Length = 273 Score = 145 bits (365), Expect = 7e-33, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 91/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMHLLYDASVRSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|309751470|gb|ADO81454.1| Probable lipooligosaccharide biosynthesis protein [Haemophilus influenzae R2866] Length = 280 Score = 145 bits (365), Expect = 7e-33, Method: Composition-based stats. Identities = 55/270 (20%), Positives = 90/270 (33%), Gaps = 30/270 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-K 59 MP+ YVISL + RR+ + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRQHVMNEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNG 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DIN 114 L+ E GC ISH+ LW + I EDD + L + Sbjct: 57 TSLTDGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDD 116 Query: 115 NILIKFDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI- 172 +I+ + ++ + + D + T GY I AA +LL + KN+ Sbjct: 117 IFIIRLETFLQETICEELPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNME 176 Query: 173 ---YRPIDMDMKHWWEHN--IPSLVTEPGAVYEAIDTN------DSTIEESRLVRKPTFS 221 PID + + + + P + + N DS +E R Sbjct: 177 SNNIFPIDHLIFNRFLAGEELMVYQLSPALCIQEVQLNENESLLDSQLESERKN------ 230 Query: 222 PLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + + P K Sbjct: 231 -YRLAEKARKKKTWREKVYHIFTKPQRMLK 259 >gi|24581946|ref|NP_723087.1| CG31915 [Drosophila melanogaster] gi|74864910|sp|Q8IPK4|GLT25_DROME RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor gi|22945672|gb|AAN10543.1| CG31915 [Drosophila melanogaster] Length = 612 Score = 145 bits (365), Expect = 8e-33, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 100/281 (35%), Gaps = 28/281 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +++I+L RREK I ++ F A+ G+ + F Sbjct: 335 IFMINLKRRPERREKMERLFDEIGIEAEHFPAVDGKELSTERLLEMGVRFLPGYEDPYHH 394 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R +++ EIGC++SH ++W + +ILEDD F F Q + N Sbjct: 395 RAMTMGEIGCFLSHYNIWVMMVRKQLKEVLILEDDIRFEPYFRQNAVRILNQARNAAQYD 454 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PID 177 +K K+ + ++ S T GY I + A+ LL K + + P+D Sbjct: 455 LIYFGRKRLKEESEPAVENADNLVHA-GYSYWTLGYVISLQGALKLL-AAKPLDKLIPVD 512 Query: 178 MDMKHWW-------------EHNIPSLVTEPGAVYEAIDTN----DSTIEESRLVRKPTF 220 + + + N+ + P +Y T S E+S+ + T Sbjct: 513 EFLPLMFDRHPNKTWTEAFPKRNLVAFSASPLLLYPIYYTGESGYISDTEDSQQISVETS 572 Query: 221 SPL-YFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 ++ Q H ++ + P + + L S L Sbjct: 573 EEGEARLKSDREQVFDHEQEFKLN-PELKLGESLSKSHQEL 612 >gi|193788560|ref|NP_057258.3| glycosyltransferase 25 family member 3 precursor [Homo sapiens] gi|74744901|sp|Q5T4B2|GT253_HUMAN RecName: Full=Glycosyltransferase 25 family member 3; AltName: Full=Cerebral endothelial cell adhesion molecule; Flags: Precursor gi|55958827|emb|CAI13494.1| cerebral endothelial cell adhesion molecule [Homo sapiens] gi|119608193|gb|EAW87787.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_a [Homo sapiens] Length = 595 Score = 145 bits (365), Expect = 8e-33, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 442 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 536 >gi|83318248|gb|AAI08699.1| CERCAM protein [Homo sapiens] Length = 558 Score = 145 bits (365), Expect = 9e-33, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 382 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 442 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 496 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 497 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 536 >gi|260462737|ref|ZP_05810942.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] gi|259031381|gb|EEW32652.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] Length = 265 Score = 145 bits (365), Expect = 9e-33, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 75/246 (30%), Gaps = 18/246 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VI+L S R ARI + F I + +P + + R L Sbjct: 1 MKRLVINLDRSPDRLVHMTSEFARIGIGFERVAGIDAQEHP---NLVQQPQHAIHAIRPL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EI C SH W IA A + EDD FS LL S + ++K + Sbjct: 58 AGSEIACLHSHRACWTIIAQDDAPYGAVFEDDMVFSGRAGALLGDTSWVPEDADVVKLET 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + F + R P GY + ++ A L + +D + + Sbjct: 118 FFSRTVIQRRRAPAGHGFSTVRLRKGHPGAGGYLLSRQMAREFLEATAQVNIAVDDLIFN 177 Query: 183 WWEH-NIPSLVTEPGAVYEA---IDTNDSTIEESR-----------LVRKPTFSPLYFYR 227 P + S +++ R RKPT + Sbjct: 178 PAMSAGRNIYQLVPALCAQDQFVGYRLPSLLDQERDAARTASGLMTRQRKPTAVRISREA 237 Query: 228 NTCYQW 233 +W Sbjct: 238 GRTMRW 243 >gi|322379338|ref|ZP_08053714.1| Beta-1,4-galactosyltransferase [Helicobacter suis HS1] gi|321148251|gb|EFX42775.1| Beta-1,4-galactosyltransferase [Helicobacter suis HS1] Length = 269 Score = 145 bits (365), Expect = 9e-33, Method: Composition-based stats. Identities = 52/244 (21%), Positives = 86/244 (35%), Gaps = 42/244 (17%) Query: 3 IPVYVISL-----------PFSHARREKFCHRAARIHLQFSFFDAIYGEN---------- 41 + +++I L + +K +++ + FDAIY + Sbjct: 10 MRIFLIHLSSKTCQENQLFERDISDLKKSLSTSSKHSV--EVFDAIYSKTKTGLHPLVKA 67 Query: 42 -------NPICNRIFSHQKRQCQF-------KRLLSLPEIGCYISHIHLWKRIAYSPAIG 87 +P C + + + + +SL E+GCY SH LW R Sbjct: 68 HLHPYFTHPSCEVTHNLKDTLSAYFYGLKYKGKYMSLGELGCYASHFCLWYRCLEYNEP- 126 Query: 88 AIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRI 147 +ILEDD + F Q L L + ++ L F I P + Sbjct: 127 IVILEDDIELEPCFWQSLDFLEEHIYTLGYVRLMHLDLFELVKKPTRFTA-VFQIVGP-V 184 Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 T GY + + A+ + P+D M + H I +LV EP A+ E + +S Sbjct: 185 AGNGTQGYCLTPQVAMAFIKASAKWVIPVDNLMDCTYLHGISNLVLEPFAIAEKPN--NS 242 Query: 208 TIEE 211 IE Sbjct: 243 NIER 246 >gi|332852212|ref|ZP_08434017.1| LPS glycosyltransferase [Acinetobacter baumannii 6013150] gi|332868180|ref|ZP_08438059.1| LPS glycosyltransferase [Acinetobacter baumannii 6013113] gi|332729342|gb|EGJ60682.1| LPS glycosyltransferase [Acinetobacter baumannii 6013150] gi|332733524|gb|EGJ64692.1| LPS glycosyltransferase [Acinetobacter baumannii 6013113] Length = 271 Score = 145 bits (365), Expect = 9e-33, Method: Composition-based stats. Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 21/235 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKR 54 I + +SL S+ R+EK + + + F FF+AIYG+ P F + R Sbjct: 3 KILILCVSLKGSNERQEKIKQQHQSLQSVIHDIQIDFEFFEAIYGKKLPNEYLTFLNLSR 62 Query: 55 QCQF--KRLLSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHL 108 Q L E+GC++SH+ +W+R+A II+EDD F ++ Q L L Sbjct: 63 QFAGLCDHELGPSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHSL 122 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + + + + + Y+ F++ P+ L T Y + +E A +N Sbjct: 123 LETNPAFAFLGGHSEPSRRRIRGYVSKDELYFNMTGPKDLYTATYAYSLTRETAQDFVNK 182 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRL--VRKPTF 220 + ID W++ + ++ P +E D S I R ++KP Sbjct: 183 QIKKLTYIDD-----WKYLLEGDISTPFYYCFEHDDEMQSHIASDRKTYMKKPNR 232 >gi|208434740|ref|YP_002266406.1| beta-1,4-galactosyl transferase [Helicobacter pylori G27] gi|208432669|gb|ACI27540.1| beta-1,4-galactosyl transferase [Helicobacter pylori G27] Length = 273 Score = 144 bits (364), Expect = 9e-33, Method: Composition-based stats. Identities = 54/283 (19%), Positives = 94/283 (33%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSF-----FDAIYGEN-----NP 43 + V++ISL +K C + + ++ + FDAIY + +P Sbjct: 1 MRVFIISLN------QKVCDKFGLVFRDTTTLLNNINATRHKAQIFDAIYSKTFEGGLHP 54 Query: 44 ICNRIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFIVQNIKDMGITTNLISGVSNFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L P S Sbjct: 115 IELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLNHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKHSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|329756961|gb|AEC04736.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 144 bits (364), Expect = 9e-33, Method: Composition-based stats. Identities = 54/252 (21%), Positives = 92/252 (36%), Gaps = 18/252 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-K 59 MP+ YVISL + RR+ + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRQHVMNEFSKHHVPFQFFDAV----SPSLQLDFLIQRLVPNLNG 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DIN 114 L+ E GC ISH+ LW + I EDD E + L + Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGREAKKFLAEDDWLFSLFNCDD 116 Query: 115 NILIKFDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI- 172 +I+ + ++ + + D + T GY I AA +LL++ +N+ Sbjct: 117 VFIIRLETFLQETMCRALPHPISYHGRDFLALKEEHLGTAGYIISLGAAKYLLSILRNME 176 Query: 173 ---YRPIDMDMKHWWE--HNIPSLVTEPG-AVYEAIDTNDSTIEESRLVRKPTFSPLYFY 226 PID + + + + P + E + +I +S+L + L Sbjct: 177 SKDIFPIDHLIFNRFLESEKLMVYQLSPALCIQELQLNENESILDSQLESERKNYRLAEK 236 Query: 227 RNTCYQWNLHYN 238 W Sbjct: 237 ARKKKTWREKLQ 248 >gi|111185604|gb|AAI19700.1| Cerebral endothelial cell adhesion molecule [Homo sapiens] Length = 517 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 303 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 304 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 363 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 364 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 418 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 419 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 458 >gi|22760015|dbj|BAC11036.1| unnamed protein product [Homo sapiens] gi|22760023|dbj|BAC11040.1| unnamed protein product [Homo sapiens] gi|55958829|emb|CAI13496.1| cerebral endothelial cell adhesion molecule [Homo sapiens] gi|111185706|gb|AAI19699.1| Cerebral endothelial cell adhesion molecule [Homo sapiens] gi|119608195|gb|EAW87789.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_c [Homo sapiens] gi|127802779|gb|AAH98432.2| Cerebral endothelial cell adhesion molecule [Homo sapiens] Length = 517 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 303 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 304 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 363 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 364 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 418 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 419 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 458 >gi|194388556|dbj|BAG60246.1| unnamed protein product [Homo sapiens] Length = 548 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 275 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 334 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 335 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 394 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 395 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 449 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 450 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 489 >gi|73986206|ref|XP_541950.2| PREDICTED: similar to glycosyltransferase 25 domain containing 1 [Canis familiaris] Length = 623 Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 344 VFMINLKRRQDRRERMLRALQEQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMRDVEREGLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 464 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAQPLSKM 519 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 520 LPVDEFLPVMFDKHPVSEYKAHFSPRNLRAFSVEPLLIYPTHYTGDD 566 >gi|312113955|ref|YP_004011551.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] gi|311219084|gb|ADP70452.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] Length = 273 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 67/178 (37%), Gaps = 7/178 (3%) Query: 1 MP-IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 MP +PV++++L + R + + L F F A+ G+ + + Sbjct: 1 MPQLPVFLLNLDSAPHRLAAMKEQLDALGLAFERFPAVAGKLLSADELEAVAPSHLWEGR 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + EIGC++SH + + I A ILEDD S +F+ +L ++K Sbjct: 61 RRRNPGEIGCFLSHRAILETIVARNLPLACILEDDVHLSPDFATILDAARHLPPQVDVLK 120 Query: 120 FDALRKKPKK-DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + + K + G + P P Y + + A L+ P+ Sbjct: 121 LEIALPRAKIPFIEVSAFAGRDLVFLPSGGWPGAAAYIVTQRGAKALIA-----RMPV 173 >gi|86139388|ref|ZP_01057957.1| glycosyl transferase, family 25 [Roseobacter sp. MED193] gi|85823891|gb|EAQ44097.1| glycosyl transferase, family 25 [Roseobacter sp. MED193] Length = 241 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 51/235 (21%), Positives = 88/235 (37%), Gaps = 9/235 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFK 59 P+ ++I +P S AR+ + + FDA+ G + + + Sbjct: 3 PMRSFIIHMPDSTARQANAERLCDDLP-SGTLFDAVNGRDPAQIGAVKYHGGDLHSPNYP 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILI 118 L EIG + SH +W+++ A+I EDD F + L L + + I Sbjct: 62 FALRPAEIGVFQSHRRIWQKMVDEGIDLALITEDDLQIDPVLFPKALELLQRHATPDHYI 121 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + K+ + + + + PR++ + +G+ AA LL K I RP+D Sbjct: 122 RLPV--KQRETPAKVLEDKDGLQLILPRVIGLQCICQCVGRHAAARLLQATKEIDRPVDT 179 Query: 179 DMKHWWEHNIPSLVTEPGAVYE-AIDTNDSTIEESRLVR-KPTFS-PLYFYRNTC 230 ++ W P E A STI++ + K T FYR Sbjct: 180 FLQMHWITQQPVHALLGTGNQEVAAQIGGSTIQQKTPKKGKLTRELKRAFYRAQI 234 >gi|281343517|gb|EFB19101.1| hypothetical protein PANDA_000528 [Ailuropoda melanoleuca] Length = 535 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 86/227 (37%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + +A+ G+ + + Sbjct: 256 VFMINLKRRQDRRERMLRALQAQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 315 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 316 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMRDVEREGLDW 375 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 376 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAQPLSKM 431 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP VY T D Sbjct: 432 LPVDEFLPVMFDKHPVSEYKAHFSPRNLRAFSVEPLLVYPTHYTGDD 478 >gi|301753877|ref|XP_002912839.1| PREDICTED: procollagen galactosyltransferase 1-like [Ailuropoda melanoleuca] Length = 542 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 86/227 (37%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + +A+ G+ + + Sbjct: 263 VFMINLKRRQDRRERMLRALQAQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 322 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 323 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLLNLMRDVEREGLDW 382 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 383 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAQPLSKM 438 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP VY T D Sbjct: 439 LPVDEFLPVMFDKHPVSEYKAHFSPRNLRAFSVEPLLVYPTHYTGDD 485 >gi|317012636|gb|ADU83244.1| beta-1,4-galactosyltransferase [Helicobacter pylori Lithuania75] Length = 273 Score = 143 bits (362), Expect = 2e-32, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 93/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSF-----FDAIYGENNP----- 43 + V++ISL +K C + + ++ + FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLNNINATRHKAQIFDAIYSKTFEGGLHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L P S + Sbjct: 115 IELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKVYSEGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTITRKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|329756909|gb|AEC04692.1| hypothetical protein [Pasteurella multocida] gi|329756917|gb|AEC04699.1| hypothetical protein [Pasteurella multocida] gi|329756924|gb|AEC04705.1| hypothetical protein [Pasteurella multocida] gi|329756931|gb|AEC04711.1| hypothetical protein [Pasteurella multocida] gi|329756937|gb|AEC04716.1| hypothetical protein [Pasteurella multocida] gi|329756942|gb|AEC04720.1| hypothetical protein [Pasteurella multocida] gi|329756948|gb|AEC04725.1| hypothetical protein [Pasteurella multocida] gi|329756954|gb|AEC04730.1| hypothetical protein [Pasteurella multocida] Length = 282 Score = 143 bits (362), Expect = 2e-32, Method: Composition-based stats. Identities = 51/242 (21%), Positives = 89/242 (36%), Gaps = 21/242 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ ++ ++ ++ F FFDAI + + R L+ Sbjct: 6 YVISLSSAVERRQHIRNQFSQKNIPFQFFDAISPSPL-LDQLVLQFFPRLADS--SLTGG 62 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKF 120 E C++SH+ LW + ++ EDD + + L +I+ Sbjct: 63 EKACFMSHLSLWHKCVEENLPYIVVFEDDIVLGKDADKFLIGDEWLFSRFDPEEIFIIRL 122 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + +K +S + D + T GY I + AA LL++ KNI PI Sbjct: 123 ETFLQKVVCESTHIAPYTHRDFLSLKSAHFGTAGYVISQGAAKFLLDIFKNISNEHIAPI 182 Query: 177 DMDM--KHWWEHNIPSLVTEPGAVYEAIDTND------STIEESR-LVRKPTFSPLYFYR 227 D + + +++ P + + N+ S +E R R L R Sbjct: 183 DELIFNQFLVKNSFNVYQLSPAICVQELQLNNESSALQSQLELERNKFRNKKSEELKRNR 242 Query: 228 NT 229 Sbjct: 243 KN 244 >gi|40225472|gb|AAH11811.2| CERCAM protein [Homo sapiens] Length = 275 Score = 143 bits (362), Expect = 2e-32, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 2 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDGWMLNSSAIRNLGVDLLPGYQDPYSG 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 62 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 121 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 122 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 176 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 177 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 216 >gi|315586508|gb|ADU40889.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 35A] Length = 273 Score = 143 bits (362), Expect = 2e-32, Method: Composition-based stats. Identities = 54/284 (19%), Positives = 91/284 (32%), Gaps = 61/284 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFESGLHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL------RKKPKKDSYLC 134 ILEDD ++F + L L K + L R +P Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMYLLYDANVRSEPLSHKNHE 173 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTE 194 + I + T GY I + A + P+D M + H + +LV + Sbjct: 174 -IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQ 232 Query: 195 PGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 P + A D STI P + R +++ ++ Sbjct: 233 PFVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|317178813|dbj|BAJ56601.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F30] Length = 273 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 90/283 (31%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMYLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|109947310|ref|YP_664538.1| pbeta-1,4-galactosyltransferase [Helicobacter acinonychis str. Sheeba] gi|109714531|emb|CAJ99539.1| pbeta-1,4-galactosyltransferase [Helicobacter acinonychis str. Sheeba] Length = 275 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 55/283 (19%), Positives = 91/283 (32%), Gaps = 54/283 (19%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGEN-----NPICN 46 + V++ISL + H + FDAIY + +P+ Sbjct: 1 MRVFIISLNKKVCDTFGLVFRDT---TTLLNNINATHHRAEIFDAIYSKAFEGGLHPLVK 57 Query: 47 RIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + ++ E+GCY SH LW++ Sbjct: 58 KHLHPYFITQSIKDMGIAINPISEISKFYYALKYHAKFMNFGELGCYASHYSLWEKCIDL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD + F + L L K ++ L P S + Sbjct: 118 NEP-ICILEDDITLKENFKEGLDFLEKHIQELGYVRLMHLLYDPNIKSEPLDHNNPEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 + I + T GY I + A L + P+D M + H + +LV + + Sbjct: 177 HIGIIKAYSEGVGTQGYVITPKIAKVFLKYSRRWIVPVDTIMDATFIHGVKNLVLQSFVI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 E + STIE R + +P + R ++ H WR Sbjct: 237 AE--EEQISTIE-RRGIEEPYSPKITLMRELHFK---HLEWWR 273 >gi|261837943|gb|ACX97709.1| beta-4-galactosyltransferase [Helicobacter pylori 51] Length = 273 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 91/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFESELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + +S E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYHAKFMSFGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYARLMYLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI +P S + R +++ ++ Sbjct: 234 FVI--ADDEQISTIAR---KEEPYSSKIALMRKLHFKYLKYWQ 271 >gi|332673368|gb|AEE70185.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori 83] Length = 273 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 88/283 (31%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFESELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + R+ + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLIGRVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD + F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKENFKEGLDFLEKHIQELDYARLMYLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|322380695|ref|ZP_08054834.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter suis HS5] gi|321146863|gb|EFX41624.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter suis HS5] Length = 173 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 43/156 (27%), Positives = 65/156 (41%), Gaps = 7/156 (4%) Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 + +SL E+GCY SH LW R +ILEDD + F Q L L + Sbjct: 2 KYKGKYMSLGELGCYASHFCLWYRCLEYNEP-IVILEDDIELEPCFWQSLDFLEEHIHTL 60 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 ++ L + KK + G I ++ T GY + + A+ + P Sbjct: 61 GYVRLMHLFELVKKPTRF---TGVLQIVGA-VVGNGTQGYCLTPQVAMAFIKASAKWVIP 116 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 +D M + H I +LV EP A+ E + +S IE Sbjct: 117 VDNLMDCTYLHGISNLVLEPFAIAEKPN--NSNIER 150 >gi|329756902|gb|AEC04686.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 56/270 (20%), Positives = 91/270 (33%), Gaps = 30/270 (11%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-K 59 MP+ YVISL + RR + ++ H+ F FFDA+ +P F Q+ Sbjct: 1 MPMTNYVISLSSARERRRHVMNEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNG 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DIN 114 L+ E GC ISH+ LW + I EDD + L + Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARAFLAEDEWLFSRFNCDD 116 Query: 115 NILIKFDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI- 172 +I+ + ++ ++ + D + T GY I AA +LL + KN+ Sbjct: 117 IFIIRLETFLQETICEALPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNME 176 Query: 173 ---YRPIDMDMKHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 PID + + + P+L + + E DS +E R Sbjct: 177 SNNIFPIDHLIFNRFLAGEELMVYQLSPALCIQELQLNENESLLDSQLESERKN------ 230 Query: 222 PLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + + P K Sbjct: 231 -YRLAEKARKKKTWREKVYHIFTKPQRMLK 259 >gi|149944687|ref|NP_001092425.1| procollagen galactosyltransferase 1 precursor [Bos taurus] gi|160395520|sp|A5PK45|GT251_BOVIN RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|148744100|gb|AAI42351.1| GLT25D1 protein [Bos taurus] gi|296486064|gb|DAA28177.1| glycosyltransferase 25 domain containing 1 precursor [Bos taurus] Length = 623 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 86/227 (37%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + +A+ G+ + + Sbjct: 344 VFMINLKRRQDRRERMLRALEEQEIACRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMQDVEREGLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 464 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AARPLSKM 519 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ ++ + EP +Y T D Sbjct: 520 LPVDEFLPVMFDKHPVSEYKAHFSPRDLRAFSVEPLLIYPTHYTGDD 566 >gi|319784667|ref|YP_004144143.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170555|gb|ADV14093.1| glycosyl transferase family 25 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 696 Score = 143 bits (361), Expect = 3e-32, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 82/253 (32%), Gaps = 10/253 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-ICNRIFSHQKRQCQFK 59 M +P++ I+L R + A L A+ G + Sbjct: 1 MNLPIFAINLDRETQRWSELLASAEAAGLTLQRIAAVDGHALAKEAWTEIDLPAARKLSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + E CY SHI + + +I+EDD FS++ + + + + +IK Sbjct: 61 RDILSGEYACYRSHIQALETFVAGGSAHGLIVEDDVLFSEDTMRRIEAVIAAVPDFDVIK 120 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + + +I + Y + +E A LL+ + P D+ Sbjct: 121 L--INHRISFFMRAVETAEGDEIGRALHGPQGSAAAYLVTREGAQGLLSALAVMKMPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK------PTFSPLYFYRNTCYQ 232 ++ +W+ + + A + S I K P+ R + Sbjct: 179 ALERFWDTGLKVYSARQNVLGFAASSEISGIAGPSGSYKSARLGGPSRLRAGVLRASDEL 238 Query: 233 WNLHYNAWRKDLP 245 LH+ R LP Sbjct: 239 RRLHHVLLRPPLP 251 >gi|260460273|ref|ZP_05808525.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] gi|259033918|gb|EEW35177.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] Length = 696 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 81/253 (32%), Gaps = 10/253 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-ICNRIFSHQKRQCQFK 59 M +P++ I+L R A L A+ G P + Sbjct: 1 MNLPIFAINLDRETCRWSGLLASAEAAGLTLQRIAAVDGRTLPKEDWTEIDLPTAKKLSG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R + E CY SHI + + +I+EDD F ++ + + + + +IK Sbjct: 61 RDILPGEYACYRSHIQALETFLAGGSAHGLIVEDDVLFGEDTIRRVEAIIAAVPDFDVIK 120 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDM 178 R +I + Y + +E A LL+ + P D+ Sbjct: 121 LTNHR--MSFFMRAVKTTQGDEIGRALHGPQGSAAAYLVTREGAKGLLSALAVMKMPWDV 178 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI----EESRLVRKPTFSPLY--FYRNTCYQ 232 ++ +W+ + + A + S I + R +S L R Sbjct: 179 ALERFWDTGLKVYSVRQNLLGFAKSSAISGIAGPSGSYKSARLGGWSRLSAGMSRAWDEL 238 Query: 233 WNLHYNAWRKDLP 245 LH+ R LP Sbjct: 239 HRLHHVFLRPPLP 251 >gi|261839355|gb|ACX99120.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori 52] Length = 273 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 88/283 (31%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MHVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKAQIFDAIYSKTFEGGLHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITINLISRVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD + F + L L K + L S + Sbjct: 115 IELNEP-ICILEDDITLKEGFKEGLDFLEKHIQELGYARLMHLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSEGMGTQGYVITPNIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|160333551|ref|NP_001103992.1| procollagen galactosyltransferase 1 precursor [Danio rerio] gi|160395521|sp|A5PMF6|GT251_DANRE RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|148725420|emb|CAN87888.1| novel protein similar to vertebrate glycosyltransferase 25 domain containing 1 (GLT25D1) [Danio rerio] Length = 604 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 56/275 (20%), Positives = 100/275 (36%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + A+ G+ K + Sbjct: 325 VFMINLLRRSDRRERMLRTLYEQEIACKIITAVDGKALNASQVEALGIKMLPGYSDPYHG 384 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK----CDINN 115 R L+ E+GC++SH ++W I ++++EDD F F + L +L + ++ Sbjct: 385 RPLTKGELGCFLSHYNIWNEIVDRGLQSSLVIEDDLRFEVFFKRRLQNLMQEVQSQQLDW 444 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIY 173 LI RK+ + + ++P + + S T GY I A LL K + Sbjct: 445 DLIYIG--RKRMQVERPEKSVPRIHSLVEA-DYSYWTLGYVISLRGAQKLLRAEPLKKM- 500 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M H+ + ++ + EP +Y DT S++ + Sbjct: 501 LPVDEFLPVMYNKHPIEEYMSHFPQRDLRAFSAEPLLIYPTHYTGDQGYISDTETSSVWD 560 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V T R + Q L A D+ Sbjct: 561 NESVL--TDWDRARSRKSREQEELSSEAQNTDVLQ 593 >gi|317011045|gb|ADU84792.1| pbeta-1,4-galactosyltransferase [Helicobacter pylori SouthAfrica7] Length = 273 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 55/274 (20%), Positives = 84/274 (30%), Gaps = 53/274 (19%) Query: 3 IPVYVISLP-----------FSHARREKFCHRAARIHLQFSFFDAIYGENNP-------- 43 + V++ISL + H + FDAIY + Sbjct: 1 MRVFIISLNQKVCDTFGLVFRDT---TTLLNNINATHHKAEIFDAIYSKTFEGGLHPLVK 57 Query: 44 --------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 +RI + +SL E+GCY SH LW++ Sbjct: 58 KRLHPYFIAQNIKDMGIAIDPISRISKFYYALKYHAKFMSLGELGCYASHYSLWEKCIDL 117 Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-----LPG 138 ILEDD + F + L L K ++ L P S + Sbjct: 118 NEP-ICILEDDITLKENFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLDHQNQEIQE 176 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T GY I + A L + P+D M + H + +LV + A+ Sbjct: 177 YIGIIKAYSEGVGTQGYVITPKIAKVFLKYSQRWVVPVDTIMDATFIHGVKNLVLQSFAI 236 Query: 199 YEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQ 232 A D STI P + R ++ Sbjct: 237 --ADDEQISTIARKEEPYSPK---IALMRKLHFK 265 >gi|311249255|ref|XP_003123541.1| PREDICTED: procollagen galactosyltransferase 1-like [Sus scrofa] Length = 623 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RR++ ++ +A+ G+ + + Sbjct: 344 VFMINLKRRQDRRDRMLRALQEQEIESRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 404 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 464 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AARPLSKM 519 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 520 LPVDEFLPVMFDKHPVSEYKAHFSPRNLRAFSVEPLLIYPTHYTGDD 566 >gi|149629292|ref|XP_001516534.1| PREDICTED: similar to Cerebral endothelial cell adhesion molecule 1 [Ornithorhynchus anatinus] Length = 431 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 82/241 (34%), Gaps = 38/241 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VI+L RRE+ + + + +A+ G + Sbjct: 159 VFVINLARRPDRRERMLNSLHELEIVGRVVEAVDGSALNSSSIKSLGVDLLTGYYDPYSG 218 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ E+GC++SH +W+ +A ++LEDD F F +LL + + Sbjct: 219 RTLTKGEVGCFLSHHAVWEEVAARGLGQVVVLEDDVRFEANFKRRLERLLEEVEAKQLEW 278 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI ++ ++ N + T Y + A LL + + R Sbjct: 279 DLIYLGR-KQVTSEEEEAVADVHNLVVAHYSYW---TLAYALSLSGARKLL-AAQPLGRM 333 Query: 175 -PIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +H+ ++ + P + DT STI + Sbjct: 334 LPVDEFLPIMYDRHPNEDYKQHFRPRDLRAFAMRPSLAFPTHYTGDAQWLSDTETSTIWD 393 Query: 212 S 212 Sbjct: 394 D 394 >gi|260462736|ref|ZP_05810941.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] gi|259031380|gb|EEW32651.1| glycosyl transferase family 25 [Mesorhizobium opportunistum WSM2075] Length = 281 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. Identities = 49/248 (19%), Positives = 77/248 (31%), Gaps = 20/248 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + VI+L S R ARI + F+ A ++P + + R L Sbjct: 1 MKCLVINLDRSPDRLAHITAEFARIGIAFARIVATDARDHPE---LVLQPQFSIYAVRRL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+ C SH W IA A + EDD FS + LL + +K + Sbjct: 58 SSSEVACMHSHRACWSIIAQDDAPYGAVFEDDVVFSAKAGALLADSGWISADADAVKLET 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 K + F + + R T GY + ++ A LL D + + Sbjct: 118 FFSKTMIQRKRIAVGNGFSLFRLRRSHMGTGGYIVSRQMARDLLEATAQASAAADDLLFN 177 Query: 183 W-WEHNIP--SLVTEPGAVYEA---IDTNDSTIEESR-----------LVRKPTFSPLYF 225 + + P + D S + E R RKP + Sbjct: 178 PVFPTSASKTIYQLVPALCAQDQFVGDRLPSLLYEEREAEWVASGLTIKRRKPLAEKIRV 237 Query: 226 YRNTCYQW 233 +W Sbjct: 238 EIRRVVEW 245 >gi|110006041|gb|ABG48523.1| LpsB [Haemophilus influenzae] gi|110006043|gb|ABG48524.1| LpsB [Haemophilus influenzae] Length = 256 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + T GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLKEGR 221 >gi|52424960|ref|YP_088097.1| hypothetical protein MS0905 [Mannheimia succiniciproducens MBEL55E] gi|52307012|gb|AAU37512.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 261 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. Identities = 51/235 (21%), Positives = 87/235 (37%), Gaps = 26/235 (11%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFK 59 + +++SL RRE F + F F+AI ++ +F K + Sbjct: 4 KMKKFLVSLEKDIQRRELFFSQRNTQ--DFEVFNAINTMTQDLTSLGNLFDIIKFAQYYG 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPA----IGAIILEDDADFSDEFSQLLPHLSKCDINN 115 R ++ EIGC +SH+ ++++IA A++ EDDA F++ F Q++ + K + Sbjct: 62 RNVTKGEIGCTLSHLAIYQKIADDETINERDYALVCEDDALFAENFQQVIQEIVKQPMGA 121 Query: 116 ILI-----------KFDALRKKPKKDSYL---CTLPGNFDIHQPRILSPRTTGYFIGKEA 161 +I + P +L G + R T Y I K A Sbjct: 122 DIILTGQSKILEFNHIELEINYPSTFKFLQKKIANSGYRYSYPYRNYFAGTVCYLITKAA 181 Query: 162 AIHLLNVRKNIYRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 A L N P D + + + + + P E S +E SR Sbjct: 182 AKRFLAELTNGRLPFWLADDFILFNEKFKLNTAIIRPLLAIENP-VLTSNLENSR 235 >gi|110006033|gb|ABG48519.1| LpsD [Haemophilus influenzae] gi|110006035|gb|ABG48520.1| LpsD [Haemophilus influenzae] gi|110006037|gb|ABG48521.1| LpsD [Haemophilus influenzae] Length = 256 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 87/217 (40%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI + + F+ K L Sbjct: 9 YVISLTTEQKRRKNITEEFGKQNIPFAFFDAITPDLIEETAKKFNIT-FDRSSKATLCDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 68 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--K 181 + N +I + + T GY I + A +LLN +N +Y ID + + Sbjct: 127 --IIYGKREQIKCNRNISRLKFKHTGTAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDE 184 Query: 182 HWWEHNIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + + P + + +S++ R Sbjct: 185 LLSQKDYKVMQLSPAICTQSFILHDENYFESSLHNGR 221 >gi|110006045|gb|ABG48525.1| LpsB [Haemophilus influenzae] gi|110006047|gb|ABG48526.1| LpsB [Haemophilus influenzae] Length = 256 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 96/220 (43%), Gaps = 19/220 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK---- 181 K ++ + +++ + T GY + + A +LL + KN +P+D+ + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPLDVAVDSLVF 182 Query: 182 ----HWWEHNI----PSLVTEPGAVYEAIDTNDSTIEESR 213 H+ ++ I P + + V+ + +S+++E R Sbjct: 183 EDFLHFKDYKIVQLSPGICVQDFVVHPN-NHFESSLKEGR 221 >gi|149757348|ref|XP_001499949.1| PREDICTED: glycosyltransferase 25 domain containing 1 [Equus caballus] Length = 548 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 44/225 (19%), Positives = 89/225 (39%), Gaps = 24/225 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RR + ++ +A+ G+ + + Sbjct: 269 VFMINLKRRQDRRTRMLEALRAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 328 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + D++ + Sbjct: 329 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMR-DVDREGLD 387 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-P 175 +D + RK+ + + +P ++ + S T Y + ++ A LL + P Sbjct: 388 WDLIYVGRKRMQVERPEKAVPRVRNLVEA-DYSYWTLAYVLSQQGARKLLAAQPLAKMLP 446 Query: 176 IDMDM-------------KHWWEHNIPSLVTEPGAVYEAIDTNDS 207 +D + H+ N+ + EP VY T D Sbjct: 447 VDEFLPVMFDKHPVSEYKSHFSPRNLHAFSVEPLLVYPTHYTGDD 491 >gi|145634824|ref|ZP_01790532.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittAA] gi|110006031|gb|ABG48518.1| LpsD [Haemophilus influenzae] gi|145267990|gb|EDK07986.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittAA] Length = 256 Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 87/217 (40%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI + + F+ K L Sbjct: 9 YVISLTTEQKRRKNITEEFGKQNIPFAFFDAITPDLIEETAKKFNIT-FDRSSKATLCDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 68 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--K 181 + N +I + + T GY I + A +LLN +N +Y ID + + Sbjct: 127 --IIYGKREQIKCNRNISRLKFKHTGTAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDE 184 Query: 182 HWWEHNIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + + P + + +S++ R Sbjct: 185 LLSQKDYKVMQLSPAICTQSFILHDENYFESSLHNGR 221 >gi|297276457|ref|XP_001114885.2| PREDICTED: procollagen galactosyltransferase 1-like [Macaca mulatta] Length = 474 Score = 142 bits (358), Expect = 5e-32, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 195 VFMINLTRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 254 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 255 RPLTKGELGCFLSHYNIWKEVCPEXLQKSLVFEDDLLFEIFFKRRLMNLMRDVEREGLDW 314 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 315 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 370 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 371 LPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 417 >gi|5764665|gb|AAD51367.1|AF177203_1 cerebral cell adhesion molecule [Homo sapiens] Length = 517 Score = 142 bits (358), Expect = 5e-32, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 77/220 (35%), Gaps = 29/220 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ + Sbjct: 244 VFVISLARRPDRRERMLASLWEMEISGRVVDAVDAGWLNSSAIRNLGVDLLPGYQDPYSG 303 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH +W+ + ++ EDD F F L L + ++ Sbjct: 304 RTLTKGEVGCFLSHYSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLMEDVEAEKLSW 363 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + P+K++ + LPG S T Y + A LL + + R Sbjct: 364 DLIYLGRKQVNPEKETAVEGLPG----LVVAGYSYWTLAYALRLAGARKLL-ASQPLRRM 418 Query: 175 -PIDMDMK-------------HWWEHNIPSLVTEPGAVYE 200 P+D + H+W ++ + +P Sbjct: 419 LPVDEFLPIMFDQHPNEQYKAHFWPRDLVAFSAQPLLAAP 458 >gi|110006081|gb|ABG48543.1| LpsA [Haemophilus influenzae] Length = 256 Score = 142 bits (358), Expect = 5e-32, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 221 >gi|192360086|ref|YP_001981136.1| glycosyl transferase [Cellvibrio japonicus Ueda107] gi|190686251|gb|ACE83929.1| glycosyl transferase, putative, gt25A [Cellvibrio japonicus Ueda107] Length = 250 Score = 142 bits (358), Expect = 5e-32, Method: Composition-based stats. Identities = 48/248 (19%), Positives = 87/248 (35%), Gaps = 18/248 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 IP Y+I+L R + + F A+ G +R Q K+ Sbjct: 2 IPSYLINLDRDQERLAFVLAHFQQQGINPERFAAVDGRQYSELEYQRFIAQRPRQGKKSW 61 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 ++GC++SH +W+RI S A + EDD + +LL + +I+ D Sbjct: 62 LRGQMGCFLSHYGVWQRIVNSRVRFAAVFEDDIHIAPGLYELLEGDDWIADSVDIIRLDT 121 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + ++ S T GY I + A LL++ ++P D+ + + Sbjct: 122 STNRVRLGPEPALQCHSHYFYRLLSTSWCTGGYIIHQRTARQLLDLAPRYHQPSDVLLFN 181 Query: 183 WWE----HNIPSLVTEPGAV-----YEAIDTNDSTIEES---------RLVRKPTFSPLY 224 E + L P + T S IE R + + L Sbjct: 182 HEESAIARQLKVLQASPALCVQDKHHRGAVTFSSNIEAGATRDWRLYCRQCLQGGYRYLA 241 Query: 225 FYRNTCYQ 232 Y+ ++ Sbjct: 242 GYKRVEFK 249 >gi|218676979|ref|YP_002395798.1| Glycosyltransferase [Vibrio splendidus LGP32] gi|218325247|emb|CAV27221.1| Glycosyltransferase [Vibrio splendidus LGP32] Length = 262 Score = 142 bits (358), Expect = 6e-32, Method: Composition-based stats. Identities = 49/257 (19%), Positives = 86/257 (33%), Gaps = 17/257 (6%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYG---ENNPICNRIFSHQKRQCQ 57 + +ISL R + I L + +++P+ NR + Sbjct: 4 TVKSIIISLERKSERFLSSQEQLLNIKKLDIEKLSGVDASLSKDHPLMNR-YDEAAFYAL 62 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILE-DDADFSDEFSQLLPHLSKCDINNI 116 R+ EIGCY SH W++ II E D D F + + H S+ Sbjct: 63 NGRVAVPGEIGCYSSHYLAWEKCIELNQP-IIIFEDDVMVDVDVFEKTVQHASEHIEECG 121 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + K + + + L + + T Y I A + + P+ Sbjct: 122 YIRLENYSNKREYNYVVENLDDEQSLVRHIKTPLCMTAYMITPHVAKTFIEKSERFLYPV 181 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES-----RLVRKPTFSPLYFYRNTCY 231 D+ +++ W H P+ P + DS I E + +R T L +R+ Sbjct: 182 DVFIRNVWLHKQPTYGVSPAGLTGGAA--DSVIGERTFRIKKSLRIKTLKFLSKFRDVAM 239 Query: 232 QWNLHYNAW---RKDLP 245 + +K P Sbjct: 240 NGLFNLTYTVIIKKSRP 256 >gi|145640853|ref|ZP_01796435.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae R3021] gi|145274367|gb|EDK14231.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 22.4-21] Length = 286 Score = 142 bits (357), Expect = 6e-32, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 96/220 (43%), Gaps = 19/220 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 39 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 97 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 98 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 156 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK---- 181 K ++ + +++ + T GY + + A +LL + KN +P+D+ + Sbjct: 157 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPLDVAVDSLVF 212 Query: 182 ----HWWEHNI----PSLVTEPGAVYEAIDTNDSTIEESR 213 H+ ++ I P + + V+ + +S+++E R Sbjct: 213 EDFLHFKDYKIVQLSPGICVQDFVVHPN-NHFESSLKEGR 251 >gi|292621863|ref|XP_002664798.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Danio rerio] Length = 535 Score = 142 bits (357), Expect = 7e-32, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 100/275 (36%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ + A+ G+ + + Sbjct: 256 VFMINLLRRSDRRERMLRTLYEQEIACKIITAVDGKALNASQVEALGIEMLPGYSDPYHG 315 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK----CDINN 115 R L+ E+GC++SH ++W I ++++EDD F F + L +L + ++ Sbjct: 316 RPLTKGELGCFLSHYNIWNEIVDRGLQSSLVIEDDLRFEVFFKRRLQNLMQEVQSQQLDW 375 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIY 173 LI RK+ + + ++P + + S T GY I A LL K + Sbjct: 376 DLIYIG--RKRMQVERPEKSVPRIHSLVEA-DYSYWTLGYVISLRGAQKLLRAEPLKKM- 431 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M H+ + ++ + EP +Y DT S++ + Sbjct: 432 LPVDEFLPVMYNKHPIEEYMSHFPQRDLRAFSAEPLLIYPTHYTGDQGYISDTETSSVWD 491 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V T R + Q L A D+ Sbjct: 492 NESVL--TDWDRARSRKSREQEELSSEAQNTDVLQ 524 >gi|254361948|ref|ZP_04978079.1| glycosyltransferase LpsA [Mannheimia haemolytica PHL213] gi|1346467|sp|Q05770|LPSA_PASHA RecName: Full=Lipooligosaccharide biosynthesis protein lpsA gi|561691|gb|AAA80283.1| LpsA [Mannheimia haemolytica] gi|153093495|gb|EDN74475.1| glycosyltransferase LpsA [Mannheimia haemolytica PHL213] Length = 263 Score = 141 bits (356), Expect = 8e-32, Method: Composition-based stats. Identities = 49/223 (21%), Positives = 84/223 (37%), Gaps = 15/223 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL + RR+ + ++ F FFDAI + + F+ L Sbjct: 1 MNNYVISLTSAQERRKHIEAEFGKQNIPFQFFDAITPDLIKEKAKAFNIDISNTN----L 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EI C +SHI LW I EDD + +LL + N ++K + Sbjct: 57 TKGEIACALSHIALWHLAKQQNLDYICIFEDDIYLGNNAFELLK-TNYIPENTHIVKLET 115 Query: 123 L-RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 L + + + N + + T GY + + A L+N+ K + PID + Sbjct: 116 LPFDRINRFNKTEKYILNRRLFKLNSRHVGTAGYILTNKGAEFLINILKTLNIPIDDLIF 175 Query: 182 HWWEHNIPSLVT---EPGAVYEAIDTND-----STIEESRLVR 216 + I P + N S++++ R +R Sbjct: 176 DEYL-KIKEYKVLQMSPALCVQDFILNSKTNFKSSLQDDRAIR 217 >gi|160395584|sp|Q7Q021|GLT25_ANOGA RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor Length = 592 Score = 141 bits (356), Expect = 8e-32, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 76/193 (39%), Gaps = 15/193 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +Y+I+L RR K + L F A+ G+ F K Sbjct: 321 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLSDKKVYDMGIRFLPGYADPFHK 380 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R +++ EIGC++SH ++W+R+ ++LEDD F F + +L + Sbjct: 381 RPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFEPFFRRRAYGVLADARRIG-GW 439 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYR 174 LI F R + + + ++ + S T GY I E A LL + Sbjct: 440 DLIYFGRKRLQEEDEKWI----DGSEYLVKAGYSYWTLGYVITLEGAKKLLREQPLSKLL 495 Query: 175 PIDMDMKHWWEHN 187 P+D + ++++ Sbjct: 496 PVDEYLPIMFDNH 508 >gi|110006039|gb|ABG48522.1| LpsB [Haemophilus influenzae] Length = 256 Score = 141 bits (356), Expect = 8e-32, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + T GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLKEGR 221 >gi|30995400|ref|NP_438924.2| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] Length = 278 Score = 141 bits (356), Expect = 9e-32, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 206 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 207 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 243 >gi|301169482|emb|CBW29083.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 10810] Length = 278 Score = 141 bits (356), Expect = 9e-32, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 96/220 (43%), Gaps = 19/220 (8%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK---- 181 K ++ + +++ + T GY + + A +LL + KN +P+D+ + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKN--KPLDVAVDSLVF 204 Query: 182 ----HWWEHNI----PSLVTEPGAVYEAIDTNDSTIEESR 213 H+ ++ I P + + V+ + +S+++E R Sbjct: 205 EDFLHFKDYKIVQLSPGICVQDFVVHPN-NHFESSLKEGR 243 >gi|254779201|ref|YP_003057306.1| putative glycosyltransferase, family 25 [Helicobacter pylori B38] gi|254001112|emb|CAX29067.1| Putative glycosyltransferase, family 25 [Helicobacter pylori B38] Length = 273 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 54/286 (18%), Positives = 88/286 (30%), Gaps = 65/286 (22%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQF-----------------SFFDAIYGENNP-- 43 + V++ISL + L F FDAIY + Sbjct: 1 MRVFIISLN---------QKVCDKFGLVFRDTTTLLNNIHATRHKAQIFDAIYSKTFEGG 51 Query: 44 --------------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 + + + + +SL E+GCY SH LW Sbjct: 52 LHPLVKKHLHPYFITQNIKDMGITTNLISEVSKFYYALKYHAKFMSLGELGCYASHYSLW 111 Query: 78 KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT-- 135 ++ ILEDD ++F + L L K I+ L S + Sbjct: 112 QKCIELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYIRLMHLLYDSSVKSEPLSHK 170 Query: 136 ---LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLV 192 + I + T GY I + A + P+DM M + H + +LV Sbjct: 171 NHEIQERVGIIKAYSKGVGTQGYVITPKIAKVFKKHSRKWVVPVDMIMDATFIHGVKNLV 230 Query: 193 TEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 +P + A D STI P + R +++ ++ Sbjct: 231 LQPFVI--ADDEQISTIARKEEPYSPK---IALMRELHFKYLKYWQ 271 >gi|151337713|gb|ABS01288.1| glycosyltransferase [Haemophilus influenzae] Length = 278 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 206 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 207 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 243 >gi|2497668|sp|Q57125|Y765_HAEIN RecName: Full=Putative glycosyltransferase HI_0765 gi|1573773|gb|AAC22423.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] Length = 282 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 94 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 152 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 153 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 210 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 211 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 247 >gi|269139614|ref|YP_003296315.1| putative beta 1,4-galactosyltransferase [Edwardsiella tarda EIB202] gi|267985275|gb|ACY85104.1| putative beta 1,4-galactosyltransferase [Edwardsiella tarda EIB202] gi|304559490|gb|ADM42154.1| Beta-1,4-galactosyltransferase [Edwardsiella tarda FL6-60] Length = 251 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 89/223 (39%), Gaps = 5/223 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V++I+L ++ + +R+++ +A+ G + K + L Sbjct: 1 MKVFIINLKRDLQKKHEIISECSRLNINHEIVEAVSGIDLSAAEVDRLIDKDAQIY---L 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC +SH+ ++++I A+ILEDDA D+ ++++ + + Sbjct: 58 TKGEIGCSLSHLRIYQKIISEALPYALILEDDAILHDDLNEVIQAIENVIDKESSHAYLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + L + +++ + T GY + + A L + + D Sbjct: 118 YKTGCVYGNQRIKLSEKYSLYE-SNVPTCTHGYVVTNKTARLLTAINTPVRFEADAWRNF 176 Query: 183 WWEHNIPSLVTEPGAVYEAIDT-NDSTIEESRLVRKPTFSPLY 224 ++E NI + + +STIEE RL R P L Sbjct: 177 YFEKNIRPYSLNINLIDSRDQSKENSTIEEERLSRTPIQITLR 219 >gi|158300399|ref|XP_320324.3| AGAP012208-PA [Anopheles gambiae str. PEST] gi|157013141|gb|EAA00118.3| AGAP012208-PA [Anopheles gambiae str. PEST] Length = 554 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 76/193 (39%), Gaps = 15/193 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +Y+I+L RR K + L F A+ G+ F K Sbjct: 321 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLSDKKVYDMGIRFLPGYADPFHK 380 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R +++ EIGC++SH ++W+R+ ++LEDD F F + +L + Sbjct: 381 RPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFEPFFRRRAYGVLADARRIG-GW 439 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYR 174 LI F R + + + ++ + S T GY I E A LL + Sbjct: 440 DLIYFGRKRLQEEDEKWI----DGSEYLVKAGYSYWTLGYVITLEGAKKLLREQPLSKLL 495 Query: 175 PIDMDMKHWWEHN 187 P+D + ++++ Sbjct: 496 PVDEYLPIMFDNH 508 >gi|110006077|gb|ABG48541.1| LpsA [Haemophilus influenzae] Length = 256 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNIALDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 221 >gi|260579856|ref|ZP_05847686.1| conserved hypothetical protein [Haemophilus influenzae RdAW] gi|260093140|gb|EEW77073.1| conserved hypothetical protein [Haemophilus influenzae RdAW] Length = 282 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 94 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 152 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 153 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 210 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 211 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 247 >gi|110006069|gb|ABG48537.1| LpsA [Haemophilus influenzae] gi|110006071|gb|ABG48538.1| LpsA [Haemophilus influenzae] gi|110006073|gb|ABG48539.1| LpsA [Haemophilus influenzae] gi|110006075|gb|ABG48540.1| LpsA [Haemophilus influenzae] gi|110006079|gb|ABG48542.1| LpsA [Haemophilus influenzae] Length = 256 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 221 >gi|329756889|gb|AEC04675.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 53/236 (22%), Positives = 88/236 (37%), Gaps = 23/236 (9%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-K 59 MP+ YVISL + RR + ++ H+ F FDA+ +P + F Q+ Sbjct: 1 MPMTNYVISLSSARERRRHVMNEFSKHHVPFQIFDAV----SPSSHLDFLIQRLVPNLNG 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DIN 114 L+ E GC ISH+ LW + I EDD + L + Sbjct: 57 TSLTGGEKGCLISHVALWHKCIQDNLPYIAIFEDDILLGRDARAFLAEDEWLFSRFNCDD 116 Query: 115 NILIKFDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI- 172 +I+ + ++ ++ + D + T GY I AA +LL + KN+ Sbjct: 117 IFIIRLETFLQETICEALPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNME 176 Query: 173 ---YRPIDMDMKHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 PID + + + P+L + + E DS +E R + Sbjct: 177 SNNIFPIDHLIFNRFLAGEELMVYQLSPALCIQELQLNENESLLDSQLESERKNYR 232 >gi|210135026|ref|YP_002301465.1| beta-1,4-galactosyltransferase [Helicobacter pylori P12] gi|210132994|gb|ACJ07985.1| beta-1,4-galactosyltransferase [Helicobacter pylori P12] Length = 273 Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats. Identities = 53/284 (18%), Positives = 98/284 (34%), Gaps = 61/284 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSF-----FDAIYGEN-----NP 43 + V++ISL +K C + + ++ + FDA+Y + +P Sbjct: 1 MRVFIISLN------QKVCDKFGLVFRDTTTLLNNINATRHKAQIFDAVYSKTFEGGLHP 54 Query: 44 ICNRIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISGVSKFYYALKYHAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S + Sbjct: 115 IELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDASVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSP-LYFYRNTCYQWNLHYN 238 + A D STI + ++ +SP + R +++ ++ Sbjct: 234 FVI--ADDEQISTI----VRKEEPYSPKIALMRELHFKYLKYWQ 271 >gi|57790492|gb|AAW56186.1| Cj81-127 [Campylobacter jejuni subsp. jejuni 81-176] Length = 180 Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats. Identities = 48/190 (25%), Positives = 78/190 (41%), Gaps = 24/190 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKNKEHLEFKDHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF ++ + L Sbjct: 61 WGSWVLGRELSDGEKACFASHYKLWQECVKLDEP-IIILEDDVEFSDEFLNNGAEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K K++ +R + L NF + + T GY I +A+ ++ Sbjct: 120 LKS-------KYEYIRLCYLTQGKMLKLNDNF-LFTLDSIG-GTQGYLIKPISALKFIDN 170 Query: 169 RKNIYRPIDM 178 K +P+D Sbjct: 171 LKFWIKPVDN 180 >gi|25989453|gb|AAL82722.1| putative beta1,4-galactosyltransferase [Edwardsiella tarda] Length = 251 Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 89/223 (39%), Gaps = 5/223 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V++I+L ++ + +R+++ +A+ G + K + L Sbjct: 1 MKVFIINLKRDLQKKHEIISECSRLNINHEIVEAVSGIDLSAAEVDRLIDKDAQIY---L 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC +SH+ ++++I A+ILEDDA D+ ++++ + + Sbjct: 58 TKGEIGCSLSHLRIYQKIISEALPYALILEDDAILHDDLNEVIQAIENVIDKESGHAYLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + L + +++ + T GY + + A L + + D Sbjct: 118 YKTGCVYGNQRIKLSEKYSLYE-SNVPTCTHGYVVTNKTARLLTAINTPVRFEADAWRNF 176 Query: 183 WWEHNIPSLVTEPGAVYEAIDT-NDSTIEESRLVRKPTFSPLY 224 ++E NI + + +STIEE RL R P L Sbjct: 177 YFEKNIRPYSLNINLIDSRDQSKENSTIEEERLSRTPIQITLR 219 >gi|323144164|ref|ZP_08078800.1| LPS glycosyltransferase [Succinatimonas hippei YIT 12066] gi|322416072|gb|EFY06770.1| LPS glycosyltransferase [Succinatimonas hippei YIT 12066] Length = 265 Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats. Identities = 61/265 (23%), Positives = 101/265 (38%), Gaps = 33/265 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQ------ 55 +P++++SL RR+K + + +F A+ G + K Q Sbjct: 2 LPIFLVSLEHHQNRRDKVFAGLQKYGINDAIWFPAVNGNKEDLTKFNIDRSKHQRFWHNI 61 Query: 56 ---CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD-----ADFSDEFSQLLPH 107 C + R+ + E GC +SH+HL+++I A+I+EDD AD + Q+ Sbjct: 62 ISCCSYSRVYTNGEYGCALSHLHLYQKIVDENISAALIIEDDVVPRNADVTAFLEQIKDV 121 Query: 108 LSKCDINNILIKFDALRKKPKK--------DSYLCTLPGNFDIHQPRILSPRTTGYFIGK 159 K + + +D K +P + R RT+GY I K Sbjct: 122 QEKTGFEILFLDYDDRLNSFSKATISFGSIQVKRIGIPNWDWLFNRRKNVYRTSGYIITK 181 Query: 160 EAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT 219 E A LL + P D+ + + + T P T ST E S+L R Sbjct: 182 EGAKKLLAKGYPVRMPADILSGLIAYNKLNAWKTLPL-------TFVSTGEPSQLQR--- 231 Query: 220 FSPLYFYRNTCYQWNLHYNAWRKDL 244 F+ L + + + NL R+ L Sbjct: 232 FTTLPENQKSWLRKNLTLKNIRRVL 256 >gi|148826592|ref|YP_001291345.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittEE] gi|148716752|gb|ABQ98962.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittEE] Length = 278 Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRS-SKAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 206 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 207 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 243 >gi|20159755|gb|AAM12037.1| putative galactosyl transferase Lic2B [Haemophilus influenzae] Length = 266 Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 81/240 (33%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 1 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 57 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 58 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 116 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + GY I +EA L N Sbjct: 117 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGAAGYVISREAVKQLSAHICSLTSN 176 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ M + + N + PG +Y S +E R Sbjct: 177 HLLAIDLLMFNIF--NQSTYQVSPGVCVQEGQLYPKDIKLHSQLETERQKYLSVKKKRTL 234 >gi|319775530|ref|YP_004138018.1| putative galactosyl transferase Lic2B [Haemophilus influenzae F3047] gi|317450121|emb|CBY86335.1| putative galactosyl transferase Lic2B [Haemophilus influenzae F3047] Length = 336 Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 81/240 (33%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 65 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 121 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 122 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 180 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + GY I +EA L N Sbjct: 181 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGAAGYVISREAVKQLSTHICSLTSN 240 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ M + + N + PG +Y S +E R Sbjct: 241 HLLAIDLLMFNIF--NQSTYQVSPGVCVQEEQLYPNDIKLHSQLETERQKYLSVKKKRTL 298 >gi|237654429|ref|YP_002890743.1| glycosyl transferase family 25 [Thauera sp. MZ1T] gi|237625676|gb|ACR02366.1| glycosyl transferase family 25 [Thauera sp. MZ1T] Length = 265 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 45/247 (18%), Positives = 95/247 (38%), Gaps = 10/247 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M I +++I+LP + RR++ A + L + +A+ G + R + + ++ Sbjct: 1 MSIGIFIINLPEAVERRQRVSGHLAALGLDATVIEAVRGSTLSVAERASVADDPRSVGRY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL---PHLSKCDINN 115 R+L+ E+GC +SH+ ++++ S +ILEDDA + + LL + + Sbjct: 61 GRVLTPGELGCAMSHVRAYEQLLCSGHQFGLILEDDAVLLPDVANLLVSAENCLWLQSPH 120 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + + + G + R GY + + AA + + + Sbjct: 121 PRLLLMTPIRAFLARGAVPFAAGYRRVKVRRAWEG--YGYLVNRAAADAMRRINSPAWLS 178 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE--ESRLVRKPTFSPLYFYRNTCYQW 233 D + + +I +P + +DT S +E R+ S R +Q Sbjct: 179 ADDWVAYRRFGSIELCGLDPFCIGY-LDTAPSQLEYDRRRVETASGRSKSLKARVEKWQR 237 Query: 234 NLHYNAW 240 + + Sbjct: 238 QIMDAVY 244 >gi|110006029|gb|ABG48517.1| LpsC [Haemophilus influenzae] Length = 256 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 85/217 (39%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ K L Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNII-FDRSSKATLCDA 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 68 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--K 181 + N +I + + GY I + A +LLN +N +Y ID + + Sbjct: 127 --IIYGKCEQIKCNRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDE 184 Query: 182 HWWEHNIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + + P + + +S++ R Sbjct: 185 LLGKKDYKVMQLSPAICAQSFILHDENYFESSLHNGR 221 >gi|164604787|gb|ABY61954.1| lipopolysaccharide glycosyltransferase [Haemophilus influenzae] gi|309973739|gb|ADO96940.1| Lipooligosaccharide glucosyltransferase LpsA [Haemophilus influenzae R2846] Length = 278 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + T GY + + A +LL + K + +D + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGTAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 206 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 207 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLKEGR 243 >gi|255323434|ref|ZP_05364565.1| LPS glycosyltransferase [Campylobacter showae RM3277] gi|255299471|gb|EET78757.1| LPS glycosyltransferase [Campylobacter showae RM3277] Length = 251 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 64/250 (25%), Positives = 94/250 (37%), Gaps = 21/250 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M V+VISL ARREK R +F +A G + R Sbjct: 1 MKKLVFVISLKSDEARREKLKERFKNYG-EFKLVEATDGRTMSAKEYYGYALPSLEAYDR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD-FSDEFSQLLPHLSKCDINNILI- 118 LLS EIGC +SH+ ++ S A A+ILEDD D + +K D + LI Sbjct: 60 LLSPSEIGCSLSHVCAYEEFLKSDAKFALILEDDVIGDEDGVKKAFETAAKMDEGSALIC 119 Query: 119 --------KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 +F A KK + D +L + I+ R Y + + AA +L K Sbjct: 120 GAQDGLEGRFSAFGKKLEDDFWLVSKRSYGTIY-------RAAAYVLDRRAAEKILQTHK 172 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR-KPTFSPLYFYRNT 229 D ++ + ++ A D DS I+ R+ R + SPL + Sbjct: 173 RALCVADFWRILLLQNGLKMYFSDIFA--HPTDLTDSNIQAERVQRTQVKVSPLARLNSL 230 Query: 230 CYQWNLHYNA 239 Y + A Sbjct: 231 KYVAATRFEA 240 >gi|301169264|emb|CBW28861.1| putative galactosyl transferase lic2b [Haemophilus influenzae 10810] Length = 278 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 81/240 (33%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + GY I +EA L N Sbjct: 129 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGAAGYVISREAVKQLSAHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ M + + N + PG +Y S +E R Sbjct: 189 HLLAIDLLMFNIF--NQSTYQVSPGVCVQEGQLYPKDIKLHSQLETERQKYLSVKKKRTL 246 >gi|319897736|ref|YP_004135933.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae F3031] gi|317433242|emb|CBY81617.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae F3031] Length = 282 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 85/217 (39%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ K L Sbjct: 35 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNIT-FDRSSKATLCDA 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 94 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 152 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--K 181 + N +I + + GY I + A +LLN +N +Y ID + + Sbjct: 153 --IIYGKCEQIKCNRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDE 210 Query: 182 HWWEHNIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + + P + + +S++ R Sbjct: 211 LLGKKDYKVMQLSPAICAQSFILHDENYFESSLHNGR 247 >gi|238920443|ref|YP_002933958.1| beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] gi|238870013|gb|ACR69724.1| putative beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] Length = 246 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 93/245 (37%), Gaps = 16/245 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V++I+L ++ + R+ ++F DA+ G + + + L Sbjct: 1 MQVFIINLKHDIQKKNAIQRQCKRLRIKFEIIDAVCGIDLTDSDLDSFIDADARSY---L 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+GC +SH L++RI A+ILEDDA ++ + ++ + K I Sbjct: 58 SRGEMGCALSHNSLYQRIIAEQLPFALILEDDAILHNDTAHVIALIEKRISKEDNIALLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + L TL ++ I T GY I +AA LL + + D Sbjct: 118 YKTEYIYKNKLITLSDDY-IFYESNTPTLTHGYVITNKAARTLLTLNTPVRVEADAWRYF 176 Query: 183 WWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLVRK-----------PTFSPLYFYRNTC 230 ++ + + + +D STIE+ R ++ S + Y Sbjct: 177 YFTRFVRAYSLNTDLILSHDVDKQTSTIEKERQLKSDAQKHKRKSLDRKNSVIKIYHKLI 236 Query: 231 YQWNL 235 + L Sbjct: 237 RRMFL 241 >gi|268574800|ref|XP_002642379.1| Hypothetical protein CBG18383 [Caenorhabditis briggsae] Length = 497 Score = 140 bits (353), Expect = 2e-31, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 87/239 (36%), Gaps = 38/239 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF----- 58 +Y+I+L R ++ + +++S +A G+ + ++ Sbjct: 254 KIYLINLKRRSERLDRMQKIFDLLGIEYSLLEATDGQKLDQLPEDLKNYHILDKYLDPIT 313 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHLSKCDIN 114 KR + EIGC++SH +W+ + + I+ EDD FS + ++L L Sbjct: 314 KRPMKNGEIGCFLSHYRIWQDVVKNKYEKVIVFEDDLRFSHDGLTRVREVLQDLGASGKE 373 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI ++ ++ ++ + S T GY + A LL + + + Sbjct: 374 WDLIYLGRKKQSDSEELWISYHRHLSSVE----YSYWTLGYMLSLRGAEKLL-AAEPLNK 428 Query: 175 --PIDMDM------------KHWWEH-NIPSLVTEPGAVYE---------AIDTNDSTI 209 P+D + +EH N+ + P V+ DT DS I Sbjct: 429 MVPVDEYLPIMFDKHPNKDWSSQFEHRNLNAFTLYPLVVFPQRYTNQPGYISDTEDSVI 487 >gi|37526907|ref|NP_930251.1| hypothetical protein plu3019 [Photorhabdus luminescens subsp. laumondii TTO1] gi|36786340|emb|CAE15393.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii TTO1] Length = 251 Score = 140 bits (352), Expect = 2e-31, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 97/242 (40%), Gaps = 12/242 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +++I+L R+ +A+ + L A+ G+ + + + + + Sbjct: 1 MKIFIINLESDVERKYSMLQQASSLRLDVEIIKAVNGKQLSKDEVM---KLSRDFYNNGM 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI----LI 118 +L E+GC +SH+ +++RI A+I+EDDA+ + S +L L K + N +I Sbjct: 58 TLGELGCSLSHLLVYQRIVDENIPLALIMEDDAEINKNISDVLSALDKFNTKNPNKPNII 117 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + + D++ + G + + + T GY I AA LL+ + ++ D Sbjct: 118 LLNKTNEY--IDTFKKNITGQYYLVNVIEAAC-TYGYVINNYAAQCLLDFLQPVWLEADK 174 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 I P + S + R +K + F+R + +L+ Sbjct: 175 WRFLNERRIIKVKAVVPPVISTTPLYLQSNLALERKKQKQSRQ--EFFRIQRKRRSLYVK 232 Query: 239 AW 240 + Sbjct: 233 LY 234 >gi|149636351|ref|XP_001516115.1| PREDICTED: similar to C1orf17 [Ornithorhynchus anatinus] Length = 845 Score = 140 bits (352), Expect = 2e-31, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 102/273 (37%), Gaps = 39/273 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + ++ Sbjct: 563 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIDMLPGYRDPYSS 622 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 623 RPLTRGEIGCFLSHYSIWKEVIDRDLEKTLVIEDDVRFEHQFKKKLIKLMDDIDRVQLDW 682 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 683 ELIYIG--RKRMQVQEPEKAVPNVVNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 738 Query: 174 RPIDM-------------DMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+++ ++ + EP +Y DT STI + Sbjct: 739 LPVDEFLPVMYNKHPVAKYMEYYEPRDLKAFSVEPLLIYPTHYTGQPGYLSDTETSTIWD 798 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDL 244 + V T R T Q ++H NA D Sbjct: 799 NETVA--TDWDRKHSRKTQEQEHIHSNAQNTDA 829 >gi|309751574|gb|ADO81558.1| Lipooligosaccharide glucosyltransferase LpsA [Haemophilus influenzae R2866] Length = 278 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 31 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 90 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 148 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 149 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 206 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 207 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 243 >gi|219871215|ref|YP_002475590.1| lipooligosaccharide biosynthesis protein lpsA [Haemophilus parasuis SH0165] gi|219691419|gb|ACL32642.1| lipooligosaccharide biosynthesis protein lpsA [Haemophilus parasuis SH0165] Length = 263 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 87/222 (39%), Gaps = 15/222 (6%) Query: 3 IPVYVISLPFSHA-RREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + +VISL +S+ RR + + ++ F FFDAI P N +++ + Sbjct: 1 MKCFVISLKYSNDKRRTHMKNEFSSHNIPFEFFDAI----TPNENAQYANLFKINLDNTK 56 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-DINNILIKF 120 L+ EI C SH+ LWK++ + I EDD S +L + + +IK Sbjct: 57 LTQGEISCLFSHLTLWKQMIDNNLDRIAIFEDDIYLSSSAKDMLDSIESISTTDFDVIKL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPID- 177 + ++ + N +H + + GY I AA ++ + NI PID Sbjct: 117 EKSLERVLASKFYKIRMRNSSLHILKDSHLGSAGYVITNSAARKIITYIQNENIIAPIDI 176 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAI------DTNDSTIEESR 213 + + N + P + + S++E R Sbjct: 177 IVFDKLIKSNYKIMQIFPAFCIQDFIKNKCDNNFPSSLENER 218 >gi|209548307|ref|YP_002280224.1| glycosyl transferase family 25 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534063|gb|ACI53998.1| glycosyl transferase family 25 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 294 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 53/248 (21%), Positives = 91/248 (36%), Gaps = 8/248 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L + RR + I L F A+ G + + F + R + Sbjct: 23 TYLINLDRAPLRRFRMERLLVGIGLAFERVAAVDGAGLSLPHPDFDEAAYLSRHGRRPNP 82 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 EIGC+ SH+ +R A A+ILEDD DF ++ + ++ + +++ + Sbjct: 83 FEIGCFFSHVECARRFLSGNAEFALILEDDLDFDEDLADVIEAALQHQTRWDILRLSTVN 142 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 K T + I R Y I ++AA + + P D+ + Sbjct: 143 TGRKHRVEALTASRSLAIALTREKGSG--AYLINRKAAGWIAGSMLPMRLPYDLAFDLEF 200 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 + + + +P V + D S I+ RL R+ +S L YR Sbjct: 201 DEGLSACFVDPLPVSQRADAC-SQIQAGLSAYRLGRRRPWSVLP-YRAAAEIRRFAARFG 258 Query: 241 RKDLPPVS 248 R VS Sbjct: 259 RLVAWRVS 266 >gi|317177342|dbj|BAJ55131.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter pylori F16] Length = 273 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 91/283 (32%), Gaps = 59/283 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAAR---------IHLQF-----SFFDAIYGENNP----- 43 + V++ISL +K C + ++ FDAIY + Sbjct: 1 MRVFIISLN------QKVCDQFGLVFRDTTTLLHNINATHHKVQIFDAIYSKTFEGELHP 54 Query: 44 -----------------------ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + +R+ + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISRVSKFYYALKYYAKFMSLGELGCYASHYSLWEKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L S + Sbjct: 115 IELNEP-ICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMYLLYDANVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV + Sbjct: 174 IQERVGIIKAYSHGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQL 233 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 234 FVI--ADDEQISTIARKEEPYSPK---IALMRKLHFKYLKYWQ 271 >gi|229846632|ref|ZP_04466740.1| dimethyladenosine transferase [Haemophilus influenzae 7P49H1] gi|229810725|gb|EEP46443.1| dimethyladenosine transferase [Haemophilus influenzae 7P49H1] Length = 274 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 48/211 (22%), Positives = 80/211 (37%), Gaps = 16/211 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL----LNVRKN 171 +++F+ K I + + T GY I +EA L ++ N Sbjct: 129 FILRFETFLDFSKCKDKKIKPYSGRKILKLVSENCGTAGYVISREAVKQLNAHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPG-AVYEA 201 ID+ M + + N + PG V E Sbjct: 189 HLLAIDLLMFNIF--NQFTYQVSPGVCVQEG 217 >gi|33151689|ref|NP_873042.1| lipooligosaccharide galactosyltransferase II [Haemophilus ducreyi 35000HP] gi|6942295|gb|AAF32397.1|AF224466_4 galactosyltransferase II [Haemophilus ducreyi] gi|33147910|gb|AAP95431.1| lipooligosaccharide galactosyltransferase II [Haemophilus ducreyi 35000HP] Length = 280 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 67/260 (25%), Positives = 98/260 (37%), Gaps = 24/260 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + ARR+ A+ ++ F FFDA ENN + I H K LS Sbjct: 9 YVISLKTADARRQHIIQEFAKHNIPFQFFDACSIENN-LYMDIEKHLPMLLNSK--LSNS 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKF 120 E GC +SH LWK+ I EDD SDE ++ + S + ILIKF Sbjct: 66 EKGCLMSHFLLWKKCVLDDIPYMTIFEDDIILSDESNEFISDYSWVNNRFYEQKEILIKF 125 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + + +Q T Y I KEAA +L+ + + + + Sbjct: 126 ETFLMPVIRKFTKINEYKERAFNQLISRHFGTASYLISKEAAKYLILLYEKLPADELIAV 185 Query: 177 DMDMKHWWEHN--IPSLVTEPGAVYEAIDTN------DSTIEESR----LVRKPTFSPLY 224 D + + ++ + EP + + N S IEE R KP + Sbjct: 186 DESIFNVLLNDKNLCVYQLEPAICVQELQFNKENSLLISQIEEDRYKNQKQNKPRKTLKQ 245 Query: 225 FYRNTCYQWNLHYNAWRKDL 244 CY N + +K Sbjct: 246 RIIKVCYNINRIFKKIKKTK 265 >gi|145636679|ref|ZP_01792346.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittHH] gi|145638436|ref|ZP_01794046.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittII] gi|145270205|gb|EDK10141.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittHH] gi|145272765|gb|EDK12672.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae PittII] Length = 282 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 94 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 152 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 153 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 210 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 211 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 247 >gi|68249361|ref|YP_248473.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 86-028NP] gi|68057560|gb|AAX87813.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 86-028NP] Length = 282 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 35 YVISLTTEQKRRKHITDEFGKQNIPFEFFDAITPDIIEETAKKFNIALDRSP-KAKLSDG 93 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 94 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 152 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 153 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 210 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 211 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 247 >gi|126297741|ref|XP_001367449.1| PREDICTED: similar to cerebral endothelial cell adhesion molecule 1 [Monodelphis domestica] Length = 592 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 77/241 (31%), Gaps = 38/241 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-----ENNPICNRIFSHQKRQCQFK 59 V+VISL RR++ + + +A+ G Sbjct: 320 VFVISLARRPERRQRMLSSLWEMEVAGRVLEAVDGGALNSSTIKSLGVDLLPGYYDPYSG 379 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS----KCDINN 115 R L+ E+GC++SH +W+ + ++ EDD F F + L L + + Sbjct: 380 RTLTKGEVGCFLSHYSIWEEMVTRGLERVVVFEDDVRFEAGFRKRLERLMEEVAQEQLPW 439 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI L +K + G + S T Y + A LL + + Sbjct: 440 DLI---YLGRKQVTWDEEPAVEGVRHLVVANY-SYWTLAYALSLSGAGKLL-AAQPLGKM 494 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ ++ + P + DT STI + Sbjct: 495 LPVDEFLPIMYDQHPNEDYKGHFSPRDLRAFAVRPLLAFPTHYAGDAQWLSDTETSTIWD 554 Query: 212 S 212 Sbjct: 555 D 555 >gi|309357488|emb|CAP35849.2| hypothetical protein CBG_18383 [Caenorhabditis briggsae AF16] Length = 523 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 87/239 (36%), Gaps = 38/239 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF----- 58 +Y+I+L R ++ + +++S +A G+ + ++ Sbjct: 280 KIYLINLKRRSERLDRMQKIFDLLGIEYSLLEATDGQKLDQLPEDLKNYHILDKYLDPIT 339 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHLSKCDIN 114 KR + EIGC++SH +W+ + + I+ EDD FS + ++L L Sbjct: 340 KRPMKNGEIGCFLSHYRIWQDVVKNKYEKVIVFEDDLRFSHDGLTRVREVLQDLGASGKE 399 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI ++ ++ ++ + S T GY + A LL + + + Sbjct: 400 WDLIYLGRKKQSDSEELWISYHRHLSSVE----YSYWTLGYMLSLRGAEKLL-AAEPLNK 454 Query: 175 --PIDMDM------------KHWWEH-NIPSLVTEPGAVYE---------AIDTNDSTI 209 P+D + +EH N+ + P V+ DT DS I Sbjct: 455 MVPVDEYLPIMFDKHPNKDWSSQFEHRNLNAFTLYPLVVFPQRYTNQPGYISDTEDSVI 513 >gi|260582747|ref|ZP_05850534.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae NT127] gi|260094197|gb|EEW78098.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae NT127] Length = 278 Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 82/240 (34%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + + K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLMEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + T GY I +EA L N Sbjct: 129 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGTAGYVISREAVKQLSAHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ + + + N + PG +Y S +E R Sbjct: 189 HLLAIDLLIFNIF--NQSTYQVSPGVCVQEEQLYPKDIKLHSQLETERQKYLSVKKKRTL 246 >gi|238920436|ref|YP_002933951.1| beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] gi|238870005|gb|ACR69716.1| putative beta1,4-galactosyltransferase [Edwardsiella ictaluri 93-146] Length = 247 Score = 139 bits (351), Expect = 4e-31, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 89/223 (39%), Gaps = 5/223 (2%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V++I+L ++ + + ++++ +A+ G + K + L Sbjct: 1 MKVFIINLKRDLQKKHEIINECDQLNISHEIIEAVSGIDLSATEIDHLIDKDAQIY---L 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGC +SH+ ++++I A+ILEDDA D+ S+++ + + Sbjct: 58 TKGEIGCSLSHLRIYQKIISEDLPYALILEDDAILHDDLSEVIHAIENVIDKEKSHAYLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + L + +++ T GY + + A + + + D Sbjct: 118 YKTNCVYSNKEIKLSEKYSLYE-SNAPTYTHGYVVTNKTARLITEINTPVRLEADAWRNF 176 Query: 183 WWEHNIPSLVTEPGAVYEAIDT-NDSTIEESRLVRKPTFSPLY 224 ++E NI S + + +STIEE RL R P L Sbjct: 177 YFEKNIRSYSLNIDLITSRDQSKENSTIEEERLSRSPRQISLR 219 >gi|304312996|ref|YP_003812594.1| Putative galactosyl transferase [gamma proteobacterium HdN1] gi|301798729|emb|CBL46962.1| Putative galactosyl transferase [gamma proteobacterium HdN1] Length = 283 Score = 139 bits (351), Expect = 4e-31, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 89/230 (38%), Gaps = 7/230 (3%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICN--RIFSHQKRQCQ 57 M + ++I+L + + S A+ G P + + Sbjct: 20 MNVTAWIIALNPENPVANALERSLRDQGILASIRPAVDGRKEMPSLQGRERLNQRDAVLY 79 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 + L+ E+GCY+SH L + + + EDD LL ++ + L Sbjct: 80 RQTELTATEVGCYLSHYRLIQEAYDTGLSHICLFEDDVVAEHGLGDLLREIAALEDTFHL 139 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + +L K +K L + + +P + T GY + + +L+ NIY PID Sbjct: 140 TRLMSL--KIRKRKLARPLSHGYSVVRPLRGALGTQGYVVNRTGMKEILDFGANIYMPID 197 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 ++ +N+ EP A+YE + +++E++R + S ++ Sbjct: 198 KLYDSFFLYNLNCYSVEPHAIYET--SRKTSVEKTRQPGRRNLSTFILWQ 245 >gi|148827740|ref|YP_001292493.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae PittGG] gi|148718982|gb|ABR00110.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae PittGG] Length = 278 Score = 139 bits (351), Expect = 4e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 82/240 (34%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + T GY I +EA L N Sbjct: 129 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGTAGYVISREAVKQLSAHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ + + + N + PG +Y S +E R Sbjct: 189 HLLAIDLLIFNIF--NQSTYQVSPGVCVQEEQLYPKDIKLHSQLETERQKYLSVKKKRTL 246 >gi|71983853|ref|NP_001021233.1| hypothetical protein D2045.9 [Caenorhabditis elegans] gi|30424338|emb|CAA84699.2| C. elegans protein D2045.9, confirmed by transcript evidence [Caenorhabditis elegans] Length = 534 Score = 139 bits (351), Expect = 4e-31, Method: Composition-based stats. Identities = 46/239 (19%), Positives = 85/239 (35%), Gaps = 38/239 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF----- 58 +Y+++L R + + +++S +A G+ + + + Sbjct: 291 KIYLVNLKRRQERLNRMQKIFDILGIEYSLLEATDGQKLDELPEELKNYQILEGYLDPIS 350 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHLSKCDIN 114 KR + EIGC++SH +W+ + I+ EDD FS + ++L L Sbjct: 351 KRPMKNGEIGCFLSHYRVWQDVVQHNYEKVIVFEDDLRFSHDGLTRIREVLQDLDASHKP 410 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL--NVRKNI 172 LI ++ ++ ++ + S T GY + A LL N K + Sbjct: 411 WDLIYLGRKKQSENEELWISQHRHLSSVE----YSYWTLGYMLSLNGARKLLRPNPLKKM 466 Query: 173 YRPIDMDM---------KHWWEH----NIPSLVTEPGAVYE---------AIDTNDSTI 209 P+D + K W H ++ + P V+ DT DS I Sbjct: 467 V-PVDEYLPIMFNKHPNKDWSLHFEPLDVEAFTLYPLVVFPQKYTNEPGYVSDTEDSVI 524 >gi|261493664|ref|ZP_05990183.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261310664|gb|EEY11848.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 263 Score = 139 bits (350), Expect = 4e-31, Method: Composition-based stats. Identities = 48/223 (21%), Positives = 83/223 (37%), Gaps = 15/223 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL + RR+ + ++ F FFDAI + + F+ L Sbjct: 1 MNNYVISLTSAQERRKHIEAEFGKQNIPFQFFDAITPDLIKEKAKAFNIDISNTN----L 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EI C +SHI LW I EDD + +LL + N ++K + Sbjct: 57 TKGEIACALSHIALWHLAKQQNLDYICIFEDDIYLDNNAFELLK-TNYIPENTHIVKLET 115 Query: 123 L-RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 L + + + N + + GY + + A L+N+ K + PID + Sbjct: 116 LPFDRINRFNKTEKYILNRRLFKLNSRHVGMAGYILTNKGAEFLINILKTLNIPIDDLIF 175 Query: 182 HWWEHNIPSLVT---EPGAVYEAIDTND-----STIEESRLVR 216 + I P + N S++++ R +R Sbjct: 176 DEYL-KIKEYKVLQMSPALCVQDFILNSKTNFKSSLQDDRAIR 217 >gi|2497662|sp|Q57394|LIC2B_HAEIN RecName: Full=Lipooligosaccharide biosynthesis protein lic2B gi|1145193|gb|AAA84947.1| Lic2B protein [Haemophilus influenzae] gi|1150403|emb|CAA90892.1| Lic2B protein [Haemophilus influenzae] Length = 266 Score = 139 bits (350), Expect = 4e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 81/240 (33%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 1 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 57 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 58 -LTEGEKACFMSHYMLWQKCFSEDLPYIYIFEDDVLLGENADKFLAEDEWLEEAFKQTDK 116 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + GY I +EA L N Sbjct: 117 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGAAGYVISREAVKQLSAHICSLTSN 176 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ M + + N + PG +Y S +E R Sbjct: 177 HLLAIDLLMFNIF--NQSTYQVSPGVCVQEGQLYPKDIKLHSQLETERQKYLSVKKKRTL 234 >gi|260182079|gb|ACX35572.1| beta-1,4 galactosyltransferase [Pasteurella multocida] Length = 280 Score = 139 bits (350), Expect = 4e-31, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 79/213 (37%), Gaps = 14/213 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ + ++ + F FFDAI +P+ N++ S L+ Sbjct: 7 YVISLASAVERRQHISEQFSQYDIPFQFFDAI--SPSPLLNQLVSQF-FPSLADSSLTDG 63 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDINNILIKF 120 E GC+ISH+ LW + ++ EDD + L + +++ Sbjct: 64 EKGCFISHLSLWHKCVEKNLPYIVVFEDDILLGKNADKFLIEDEWFFSRFNTNDVFIVRL 123 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + +K N ++ + T GY I AA LL++ + PI Sbjct: 124 ETFLQKVYCQPSYIKSYYNRELLTLKSTHFGTAGYIISLGAAKFLLSLFNKMHIEEVAPI 183 Query: 177 DMDM--KHWWEHNIPSLVTEPGAVYEAIDTNDS 207 D + K + P + + N S Sbjct: 184 DELLFNKFLERKDFTVYQFSPALCIQELQLNKS 216 >gi|170719162|ref|YP_001784307.1| glycosyl transferase [Haemophilus somnus 2336] gi|168827291|gb|ACA32662.1| glycosyl transferase family 25 [Haemophilus somnus 2336] Length = 277 Score = 139 bits (350), Expect = 4e-31, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 75/206 (36%), Gaps = 13/206 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL S RR+ ++ +SFF+A + K L++ Sbjct: 4 HVISLKSSVERRQHIEKYFGEQNIPYSFFNAFQPNEENENIIQY---YVPSLNKTDLTIG 60 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E GC++SH+ LW++ I EDD S Q L + ++ LR Sbjct: 61 EKGCFMSHVLLWQKCVDENLPFIAIFEDDIILSPVAHQFLGQYDWLKERFDVNQYFILRL 120 Query: 126 KPK--KDSYLCTLPGNFDIHQPRILSP---RTTGYFIGKEAAIHLLNVRKNIY---RPID 177 + +Y T F L T GY I + AA LL++ K+ PID Sbjct: 121 ETYLMNSAYRKTKIKFFQQRCFPQLMSQQWGTAGYIISQIAAKKLLDLFKSPDFSVEPID 180 Query: 178 MDM--KHWWEHNIPSLVTEPGAVYEA 201 + + E P P V + Sbjct: 181 VILFESLLKEEQYPVYQISPAIVIQD 206 >gi|332525260|ref|ZP_08401433.1| glycosyl transferase family 25 [Rubrivivax benzoatilyticus JA2] gi|332108542|gb|EGJ09766.1| glycosyl transferase family 25 [Rubrivivax benzoatilyticus JA2] Length = 248 Score = 139 bits (350), Expect = 5e-31, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 74/217 (34%), Gaps = 9/217 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 + +VI+L ARR + L + + G + + Sbjct: 1 MRTFVITLAGDDARRAPLLAALSAQGLDWELMLGVDGRRGLPAPLEAEVDRAAARTRSGA 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 ++ E+ C +SH +++RI A++LEDDA + L L+ Sbjct: 61 PMTDGELACALSHRAVYRRIVDEGLDAALVLEDDAIVHPRLGEFLAA--WDPAVADLLLL 118 Query: 121 DALRK-KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 D R K+ ++ + P TGY + + A LL + D Sbjct: 119 DHSRAWGEKRPRFVSGAFAVHRLGLPYYNPSFATGYVLTRRGAERLLQACSPVRAVADWP 178 Query: 180 MKHWWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLV 215 + H ++ ++ P V ++ DT S + R Sbjct: 179 V-HLYDFG--AMAVTPRLVDHQDPDTGHSNLRVERQA 212 >gi|260182073|gb|ACX35567.1| beta-1,4 galactosyltransferase [Pasteurella multocida] Length = 283 Score = 139 bits (350), Expect = 5e-31, Method: Composition-based stats. Identities = 47/213 (22%), Positives = 80/213 (37%), Gaps = 14/213 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ + ++ + F FFDAI +P+ N++ S L+ Sbjct: 7 YVISLASAVERRQHISEQFSQYDIPFQFFDAI--SPSPLLNQLVSQF-FPSLADSSLTDG 63 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-----CDINNILIKF 120 E GC+ISH+ LW + ++ EDD + L + +++ Sbjct: 64 EKGCFISHLSLWHKCVEKNLPYIVVFEDDILLGKNADKFLIEDEWFFSRFNTNDVFIVRL 123 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + +K N ++ + T GY I AA LL++ N+ PI Sbjct: 124 ETFLQKVYCQPSHIKSYYNRELLTLKSTHFGTAGYIISLGAAKFLLSLFNNMHIEEVAPI 183 Query: 177 DMDM--KHWWEHNIPSLVTEPGAVYEAIDTNDS 207 D + K + P + + N S Sbjct: 184 DELLFNKFLERKDFTVYQFSPALCIQELQLNKS 216 >gi|90419935|ref|ZP_01227844.1| possible glycosyl transferase [Aurantimonas manganoxydans SI85-9A1] gi|90335976|gb|EAS49724.1| possible glycosyl transferase [Aurantimonas manganoxydans SI85-9A1] Length = 260 Score = 139 bits (350), Expect = 5e-31, Method: Composition-based stats. Identities = 52/244 (21%), Positives = 90/244 (36%), Gaps = 14/244 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLLSL 64 +VI+L S R + AA I + + G P+ R H + + R++ Sbjct: 20 FVINLDRSTERWHEIAQSAADIGVDVTRVPGTDGAAIPVAEWRDVDHDRFRRSHGRIILP 79 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 E GCY SH+ + + S A+I EDD E + + + + ++K R Sbjct: 80 GEYGCYRSHLAALQAVIDSGETVAVICEDDIILKPETPAAVAAILEQKPDLHVLKLANHR 139 Query: 125 KKPKKDSYLCTLPGNFD--IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + S+ G F +H P+ Y + +E A L + + ++ P D+ + Sbjct: 140 IRGFV-SHGGEGVGKFGRCVHGPQ---GSAACYAVTREGARILHDALQTMWLPWDIAFER 195 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV-RKPTFSPLY------FYRNTCYQWNL 235 W N + + + STI SR K + PL+ +R Y Sbjct: 196 GWALNARIYTVPRKVIGFSKLSRGSTITTSRSSDYKSSKIPLFQRTPTLLFRALDYVRRA 255 Query: 236 HYNA 239 Y Sbjct: 256 AYAL 259 >gi|229844291|ref|ZP_04464431.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae 6P18H1] gi|229812540|gb|EEP48229.1| lipooligosaccharide galactosyltransferase II [Haemophilus influenzae 6P18H1] Length = 330 Score = 138 bits (349), Expect = 5e-31, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 82/240 (34%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 65 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 121 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 122 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 180 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + T GY I +EA L N Sbjct: 181 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGTAGYVISREAVKQLSTHICSLTSN 240 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ + + + N + PG +Y S +E R Sbjct: 241 HLLAIDLLIFNIF--NQSTYQVSPGVCVQEEQLYPKDIKLHSQLETERQKYLSVKKKRTL 298 >gi|329756895|gb|AEC04680.1| hypothetical protein [Pasteurella multocida] Length = 280 Score = 138 bits (349), Expect = 6e-31, Method: Composition-based stats. Identities = 57/256 (22%), Positives = 91/256 (35%), Gaps = 27/256 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--NPICNRIFSHQKRQCQF 58 MP+ YVISL + RR+ + ++ + F FFDAI + + + + + Sbjct: 1 MPMTNYVISLLSAKERRQHVINEFSKHQVPFQFFDAISPSSQLDSLIQELIPNL-----N 55 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 L+ E GC ISH+ LW + + I EDD + + L Sbjct: 56 GASLTGGEKGCLISHLALWHKCIQDNSPYVTIFEDDILLGRDARKFLGEDEWLFSLFNCD 115 Query: 119 KFDALRKKPKKDSYLCTLPGN------FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 +R + LC P N D + T GY I AA +LL + KN+ Sbjct: 116 DIFIIRLETFLQPTLCQTPPNPISYCGRDFLVLKDEHLGTAGYIISLGAAKYLLEIFKNM 175 Query: 173 ----YRPIDMDMKHWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRK--P 218 PID + + + P+L + + E DS +E R + Sbjct: 176 ESNNIFPIDHLIFNRFLAGEELMVYQLSPALCIQELQLNENESLLDSQLESERKNYRLAE 235 Query: 219 TFSPLYFYRNTCYQWN 234 +R Y+ Sbjct: 236 KARKKKTWREKVYRIF 251 >gi|167855175|ref|ZP_02477946.1| probable glycosyl transferase family 25, LPS biosynthesis [Haemophilus parasuis 29755] gi|167853720|gb|EDS24963.1| probable glycosyl transferase family 25, LPS biosynthesis [Haemophilus parasuis 29755] Length = 263 Score = 138 bits (349), Expect = 6e-31, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 86/222 (38%), Gaps = 15/222 (6%) Query: 3 IPVYVISLPFSHA-RREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + +VISL +S+ RR + + ++ F FFDAI P N +++ + Sbjct: 1 MKCFVISLKYSNDKRRTHMKNEFSSHNIPFEFFDAI----TPNENAQYANLFKINLDNTE 56 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-DINNILIKF 120 L+ EI C SH+ LWK++ + I EDD S +L + + +IK Sbjct: 57 LTQGEISCLFSHLTLWKQMIDNNLDRIAIFEDDIYLSSSAKDMLDSIESISTTDFDVIKL 116 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPID- 177 + ++ + N +H + + GY I AA + + NI PID Sbjct: 117 EKSLERVLASKFYKIRMRNSSLHILKDSHLGSAGYVITNSAARKTITYIQNENIIAPIDI 176 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAI------DTNDSTIEESR 213 + + N + P + + S++E R Sbjct: 177 IVFDKLIKSNYKIMQIFPAFCIQDFIKNKCDNNFPSSLENER 218 >gi|110006057|gb|ABG48531.1| LpsA [Haemophilus influenzae] Length = 256 Score = 138 bits (349), Expect = 6e-31, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 89/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +I+ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGAVYEAI-----DTNDSTIEESR 213 + H + + PG + + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPENPFESSLKEGR 221 >gi|222086962|ref|YP_002545496.1| glycosyltransferase protein [Agrobacterium radiobacter K84] gi|221724410|gb|ACM27566.1| glycosyltransferase protein [Agrobacterium radiobacter K84] Length = 250 Score = 138 bits (348), Expect = 7e-31, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 82/243 (33%), Gaps = 9/243 (3%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKR 60 P+PVY+I++ + R + ++ ++ + + G P H++ Q + R Sbjct: 5 PLPVYLINIDRATDRLAEVQRQSDEFGIRIERVNGVDGALLPQEQWVDVDHERFQRRHGR 64 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 ++ E GCY SH+ ++ + AII+EDD F + N +IK Sbjct: 65 VILPGEYGCYRSHLLALRQFLAGDSQSAIIIEDDIALDGNFLARAVAAQEAVPNAEVIKL 124 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPR-TTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 R P I + T Y + + A ++ + P D+ Sbjct: 125 VNHRWGG--FYATARSPKGDVIGRCLFGPQGSTACYLVTRRGAEKIVKSLAVMSLPWDVA 182 Query: 180 MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT-----FSPLYFYRNTCYQWN 234 ++ W+ T + + I R P + +R + Sbjct: 183 IERGWDMRTSIFTTRANIAGFSPLQRTTMIGWRRDYHAAKSTALRRIPAHLFRTFDFFRR 242 Query: 235 LHY 237 + Y Sbjct: 243 IGY 245 >gi|261868375|ref|YP_003256297.1| hypothetical protein D11S_1715 [Aggregatibacter actinomycetemcomitans D11S-1] gi|261413707|gb|ACX83078.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D11S-1] Length = 258 Score = 138 bits (348), Expect = 7e-31, Method: Composition-based stats. Identities = 51/237 (21%), Positives = 88/237 (37%), Gaps = 25/237 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQF 58 M + Y++SL RRE F + FS F AI ++ + +F+ K + + Sbjct: 1 MIMNKYLVSLDKDVQRRELFFAQPGT--ADFSVFPAINTMQKDWDALSAVFNPVKFEQHY 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R ++ EIGC +SH+ ++++I A++ EDDA F+ + L L Sbjct: 59 GRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCT 118 Query: 115 NILIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAA 162 +I + P ++L G+ P T Y I K AA Sbjct: 119 ADIILIGQSKIAGFDDAELEINYPTTFTFLRKKLGDVTYAYPYKSYFAGTVAYLIKKSAA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L + + + D + + + + V P V E S +E R + Sbjct: 179 RTFLAILEQETPFWLA-DDFLLFEIQFKVSNQVVRPLMVIENPQLV-SNLEAVRRSK 233 >gi|329122427|ref|ZP_08251014.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus aegyptius ATCC 11116] gi|327473709|gb|EGF19128.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus aegyptius ATCC 11116] Length = 282 Score = 138 bits (348), Expect = 8e-31, Method: Composition-based stats. Identities = 53/271 (19%), Positives = 88/271 (32%), Gaps = 39/271 (14%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I EDD + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDDVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + + GY I +EA L N Sbjct: 129 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGSAGYVISREAVKQLSTHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESR------------ 213 ID+ M + + N PG +Y S +E R Sbjct: 189 HLLAIDLLMFNIF--NQSRYQVSPGVCVQEEQLYPNDIKLHSQLETERQKYLSVKKKRTL 246 Query: 214 ------LVRKPTFSPLYFYRNTCYQWNLHYN 238 L KP R + + + Sbjct: 247 KTVLISLAGKPKKILRKILRKIYRKLFISKH 277 >gi|293392083|ref|ZP_06636417.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D7S-1] gi|290952617|gb|EFE02736.1| Lsg locus putative protein 4 [Aggregatibacter actinomycetemcomitans D7S-1] Length = 258 Score = 138 bits (348), Expect = 8e-31, Method: Composition-based stats. Identities = 51/237 (21%), Positives = 88/237 (37%), Gaps = 25/237 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQF 58 M + Y++SL RRE F + FS F AI ++ + +F+ K + + Sbjct: 1 MIMNKYLVSLDKDVQRRELFFAQPGT--ADFSVFPAINTMQKDWDALSAVFNPVKFEQHY 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R ++ EIGC +SH+ ++++I A++ EDDA F+ + L L Sbjct: 59 GRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCT 118 Query: 115 NILIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAA 162 +I + P +L G+ P T Y I K AA Sbjct: 119 ADIILIGQSKIAGFDDAELEINYPTTFMFLRKKLGDVTYAYPYKSYFSGTVAYLIKKSAA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L + + + D + + + + V P V E S +E +R + Sbjct: 179 RTFLAILEQEPPFWLA-DDFLLFETQFKLNNNVVRPLMVIENPQLV-SNLEAARGSK 233 >gi|291326788|ref|ZP_06125861.2| putative beta1,4-galactosyltransferase [Providencia rettgeri DSM 1131] gi|291312941|gb|EFE53394.1| putative beta1,4-galactosyltransferase [Providencia rettgeri DSM 1131] Length = 232 Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 92/227 (40%), Gaps = 18/227 (7%) Query: 19 KFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL---LSLPEIGCYISHIH 75 K + + L++ F +A+ G+N + ++ L+ EIGC +SH+ Sbjct: 2 KMQAQLDKTKLKYEFINAVNGKNLS------DTELKKATHDYPNCMLTKGEIGCALSHLS 55 Query: 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT 135 ++K++A A++LEDDA ++ + I+ + K DSY+ Sbjct: 56 IYKKMANENIEQALVLEDDAILPHNIEDIISQIKI----FDKIRKPNIFLLSKIDSYIRN 111 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 N +I + + + Y I +AA +++ ++ I DM + + P Sbjct: 112 QNLNDNIFKVYQ-AIGSHAYVINLKAAKNIIKIQSPIKYESDMWRYFRYFNCANIYGHIP 170 Query: 196 GAVYEAIDT-NDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 V ++ +S++EE R P YR+ + +Y +R Sbjct: 171 TLVISDDESKLNSSLEEGR---APLLKARERYRSNLKKMFKNYQYYR 214 >gi|324508587|gb|ADY43624.1| Glycosyltransferase 25 family member [Ascaris suum] Length = 539 Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats. Identities = 47/243 (19%), Positives = 90/243 (37%), Gaps = 37/243 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQCQFKR 60 +Y+I+L RR+K + + + A G+ + + +KR Sbjct: 294 IYLINLERRQERRQKMSEILKLMGIDHKLWRATDGKLLENEEFAADVVLLPGYEDPYYKR 353 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF----SDEFSQLLPHLSKCDINNI 116 + EIGC++SH +W+ + I+ EDD F ++ ++L+ L + Sbjct: 354 PMKTGEIGCFLSHYRIWRDVIDKSFERVIVFEDDLRFILNATNMLTELIEDLDHTALPWD 413 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYR 174 L+ R + +++++ PG + S T GY + + A LL K + Sbjct: 414 LVYLGRKRLESARENWV---PGQRHLSTV-GYSYWTLGYMLSQSGAKKLLKAEPLKKMI- 468 Query: 175 PIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEES 212 P+D + + + E ++ + P V DT DS I + Sbjct: 469 PVDEYLPIMFDQHPNGKWKEAFPERDLIAYTIYPTIVVPERYTNEYGYISDTEDSRIVQQ 528 Query: 213 RLV 215 V Sbjct: 529 SKV 531 >gi|118094236|ref|XP_422290.2| PREDICTED: similar to C1orf17 [Gallus gallus] Length = 627 Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats. Identities = 58/289 (20%), Positives = 107/289 (37%), Gaps = 39/289 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ + +A+ G+ ++ Sbjct: 345 IFMINLKRRKDRRDRMLRTLYEQEIAVKIVEAVDGKALNTSQLKALSIDMLPGYRDPYSS 404 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH ++WK + +++EDD F +F + L + + ++ Sbjct: 405 RPLTRGEIGCFLSHYYIWKEVVNRGLEKTLVIEDDVRFEHQFKRKLMKLMDDIEQAQLDW 464 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I + A L+ + Sbjct: 465 ELIYIG--RKRMQVQQPEKAVPNVMNLVEA-DYSYWTLGYAISFQGAQKLI-GAEPFSKM 520 Query: 174 RPIDM-------------DMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+++ ++ + EP VY DT STI + Sbjct: 521 LPVDEFLPVMYNKHPVAKYMEYYESRDLKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWD 580 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPSSSSSL 260 + V T + Q +H A KD P ++ PSS L Sbjct: 581 NETVS--TDWDRTHSWKSRQQGQIHSEAQNKDALPPQSSLNAPSSRDEL 627 >gi|213157910|ref|YP_002320708.1| LPS glycosyltransferase subfamily [Acinetobacter baumannii AB0057] gi|213057070|gb|ACJ41972.1| LPS glycosyltransferase subfamily [Acinetobacter baumannii AB0057] Length = 261 Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats. Identities = 55/224 (24%), Positives = 94/224 (41%), Gaps = 21/224 (9%) Query: 13 SHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKRQCQF--KRLLS 63 S+ R+EK + + + F FFDAIYG+ P IF + RQ L Sbjct: 4 SNERQEKIKQQHQNLQSTIHDIQIDFQFFDAIYGKKLPNEYLIFLNLSRQFAGLCDHELG 63 Query: 64 LPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADF-SDEFSQLLPHLSKCDINNILIK 119 E+GC++SH+ +W+R+A II+EDD F ++ Q L L + + + Sbjct: 64 PSELGCWLSHMIIWQRLAQGDYAAYDRIIIIEDDVIFQTEHIQQKLHSLLETNPAFAFLG 123 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 + + + Y+ F++ P+ L T Y + +E A +N + ID Sbjct: 124 GHSEPSRRRIRGYVSNDELYFNMTGPKDLYTATYAYSLTRETAQDFVNKQIKKLTYIDD- 182 Query: 180 MKHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRLV--RKPTF 220 W++ + ++ P +E D S I R + +KP Sbjct: 183 ----WKYLLEGDISTPFYYCFEHDDEMQSHIASDRKIYMKKPNR 222 >gi|195998972|ref|XP_002109354.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens] gi|190587478|gb|EDV27520.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens] Length = 546 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 57/238 (23%), Positives = 87/238 (36%), Gaps = 38/238 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 VYVI+L RR+ I L FF+AI G + Sbjct: 305 VYVINLLRRKERRQYMKALLDEICLDAEFFEAIDGRQLNEEKIAKLGIQILPGYLDPFHQ 364 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD----FSDEFSQLLPHLSKCDINN 115 R + EIGC++SH LWK++ +ILEDD F + SQ++ + + Sbjct: 365 RPMKYGEIGCFLSHYQLWKKVIQFNYSSILILEDDVKVKMGFHRQLSQVMQEVQSLSLPW 424 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 +I RKK ++ + N+ I S T GY + A LL + + Sbjct: 425 DIIYIG--RKKLLSEAEIPVKGANYLIW--PDYSYWTVGYILSASGAKKLL-ASQPLTHM 479 Query: 174 RPIDMDMKHWW-------------EHNIPSLVTEPGAVYEAI---------DTNDSTI 209 ID + + E N+ ++ EP + A DT DS+I Sbjct: 480 MAIDEFLPIMYDKHPRKDWKEIFPERNLIAISAEPLLIEPAFYVADKDYVSDTEDSSI 537 >gi|148825176|ref|YP_001289929.1| dimethyladenosine transferase [Haemophilus influenzae PittEE] gi|148715336|gb|ABQ97546.1| dimethyladenosine transferase [Haemophilus influenzae PittEE] Length = 278 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 81/240 (33%), Gaps = 21/240 (8%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I YVISL ++ RRE ++ ++ F FFDA+ + + + I K Sbjct: 13 MHIN-YVISLTSAYQRREHIQKEFSQQNIPFEFFDAL-KPSKELTSLIEKFIPNLLHAK- 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 L+ E C++SH LW++ I ED+ + + L + + Sbjct: 70 -LTEGEKACFMSHYMLWQKCLSEDLPYIYIFEDNVLLGENADKFLAEDEWLEERFKQTDK 128 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKN 171 +++F+ K I + + GY I +EA L N Sbjct: 129 FILRFETFLNFSKCKDKKIKPYSGRKILKLVSENCGAAGYVISREAVKQLSAHICSLTSN 188 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGA------VYEAIDTNDSTIEESRLVRKPTFSPLYF 225 ID+ M + + N + PG +Y S +E R Sbjct: 189 HLLAIDLLMFNIF--NQSTYQVSPGVCVQEGQLYPKDIKLHSQLETERQKYLSVKKKRTL 246 >gi|300717598|ref|YP_003742401.1| LPS biosynthesis glycosyltransferase [Erwinia billingiae Eb661] gi|299063434|emb|CAX60554.1| Putative LPS biosynthesis glycosyltransferase [Erwinia billingiae Eb661] Length = 256 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 55/251 (21%), Positives = 95/251 (37%), Gaps = 16/251 (6%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 PI ++VI+L S RRE R ++L F +A+ G N + Sbjct: 4 PIRIFVINLKSSVTRRESLQRRLDELNLTAEFIEAVNGHNLSESELAEHTLPVNYAY--- 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 E+GC +SH ++K++ A+ILEDD D +L + + Sbjct: 61 -MKGEVGCALSHQLIYKKMVAGNIATALILEDDVYLPDNLPVILSQIKLTAHLPEVTLLS 119 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID---M 178 + K ++ + L ++ +H P + GY I ++AA +LL I+ D + Sbjct: 120 RVNKFCNRNHRI--LTPDYTLH-PVHQATTAHGYVITRQAAANLLAALYPIWMVADKWRL 176 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + W I P V + ++DSTI + + + + L Sbjct: 177 FEDYGW---IKVNAIVPALVTLSAASSDSTINSQKGSEEVNNKKKALWEELMRKRPLRVK 233 Query: 239 A---WRKDLPP 246 A R+ L P Sbjct: 234 AKCRLRRALVP 244 >gi|229845811|ref|ZP_04465923.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae 7P49H1] gi|229810815|gb|EEP46532.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae 7P49H1] Length = 277 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 87/217 (40%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI + ++ F+ K L Sbjct: 30 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPDLIEETSKKFNIT-FDRSSKATLCDA 88 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 89 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 147 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--K 181 + N +I + + GY I + A +LLN +N +Y ID + + Sbjct: 148 --IIYGKREQIKCNRNISRLKFKHTGMAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDE 205 Query: 182 HWWEHNIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + + P + + +S++ R Sbjct: 206 LLSQKDYKVMQLSPAICTQSFILHDENYFESSLHNGR 242 >gi|293390960|ref|ZP_06635294.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951494|gb|EFE01613.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter actinomycetemcomitans D7S-1] Length = 282 Score = 137 bits (346), Expect = 1e-30, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 75/228 (32%), Gaps = 20/228 (8%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +VIS+ + RR + ++ F FFDA + + L+ Sbjct: 7 KNFVISISTAEQRRNHIIEQFTHQNIPFEFFDAFTPSDKLTDH--LQRYLPNVANAAQLT 64 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 + E GC +SH LWK+ + EDD + ++ L + + L Sbjct: 65 MGEKGCLMSHFMLWKKCIDENLDYITLFEDDILLGENANKFLAEGDWLKVRFNFQEIFVL 124 Query: 124 RKKP------KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--- 174 R + + DI T GY I + AA +L+ + + + Sbjct: 125 RLETFLMPVQLEKQTQIPPFQQRDIDILTSKHFGTAGYVISQGAAKYLIALFEKLTTEEI 184 Query: 175 -PIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTND------STIEESR 213 PID M + + P + + N S +E+ R Sbjct: 185 KPIDEIMFNQQINATDYRVYQLNPAICVQELQLNQEASLLVSNLEQER 232 >gi|145632118|ref|ZP_01787853.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 3655] gi|110006049|gb|ABG48527.1| LpsA [Haemophilus influenzae] gi|144987025|gb|EDJ93555.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae 3655] Length = 256 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNITFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +++ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGAVYEAI-----DTNDSTIEESR 213 + H + ++ PG + + +S+++E R Sbjct: 185 FLHFKDYKTVQLSPGICVQDFVLHPENPFESSLKEGR 221 >gi|201067868|ref|ZP_03217759.1| hypothetical protein CJBH_L15 [Campylobacter jejuni subsp. jejuni BH-01-0142] gi|200004562|gb|EDZ05035.1| hypothetical protein CJBH_L15 [Campylobacter jejuni subsp. jejuni BH-01-0142] Length = 252 Score = 136 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 85/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINVYALYPVVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|254452105|ref|ZP_05065542.1| LPS glycosyltransferase subfamily [Octadecabacter antarcticus 238] gi|198266511|gb|EDY90781.1| LPS glycosyltransferase subfamily [Octadecabacter antarcticus 238] Length = 169 Score = 136 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 44/165 (26%), Positives = 67/165 (40%), Gaps = 4/165 (2%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRL 61 P Y I+L + AR + + + F +A+ G + + + K Sbjct: 3 PSYPINLADNTARPDNSTAQFMAQAIPFERINAVNGWALTPSQIAQNYDVAANKTHAKAP 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L EIGCY+SHI W RIA A G + EDD D L L+ + ++K Sbjct: 63 LVASEIGCYLSHIAAWTRIANGDAAGGFVFEDDFLADDTSGATLADLNVAQSDWDMVKLF 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + + PK + T G + P + GY + K+AA LL Sbjct: 123 SFDQSPK--AVFETTHGTKRLVVPYRVPTCLIGYGMTKQAAQKLL 165 >gi|261867136|ref|YP_003255058.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412468|gb|ACX81839.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter actinomycetemcomitans D11S-1] Length = 282 Score = 136 bits (344), Expect = 2e-30, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 75/228 (32%), Gaps = 20/228 (8%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +VIS+ + RR + ++ F FFDA + + L+ Sbjct: 7 KNFVISISTAEQRRNHIIEQFTHQNIPFEFFDAFTPSDKLTDH--LQRYLPNVANAAQLT 64 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 + E GC +SH LWK+ + EDD + ++ L + + L Sbjct: 65 MGEKGCLMSHFMLWKKCIDENLDYVTLFEDDILLGENANKFLAEDEWLKVRFNFQEIFVL 124 Query: 124 RKKP------KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--- 174 R + + DI T GY I + AA +L+ + + + Sbjct: 125 RLETFLMPVQLEKQAQIPPFQQRDIDILTSKHFGTAGYVISQGAAKYLIALFEKLTTEEI 184 Query: 175 -PIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTND------STIEESR 213 PID M + + P + + N S +E+ R Sbjct: 185 KPIDEIMFNQQINATDYRVYQLNPAICVQELQLNQEASLLVSNLEQER 232 >gi|47500150|gb|AAT28923.1| unknown [Escherichia coli] Length = 260 Score = 136 bits (343), Expect = 3e-30, Method: Composition-based stats. Identities = 64/231 (27%), Positives = 102/231 (44%), Gaps = 22/231 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARI-HLQFSFFDAIYGENNP--ICNRIFSHQKRQCQF 58 + + V+SLP ++ARR KF R + + L+F FFD +YG+N P I I+ +K + + Sbjct: 3 NLEIRVLSLPDAYARRTKFQERFSLVSKLKFQFFDGVYGKNIPDEILKSIYDDKKAKLKI 62 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDIN 114 R +++ EIG SH ++K I+LEDD+ + F ++LL ++ D Sbjct: 63 NRSMTVGEIGATYSHYLIYKDAYEKKLDYLIVLEDDSFVDENFDDVINRLLVKITPDDDA 122 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQP---RILSPRTTGYFIGKEAAIHLLNVRKN 171 I I+ L K L F++ + + GY + K++ ++ Sbjct: 123 IIFIQKHTLDSKVIFSRKKDILKNGFELVKMLGSSQYFVGSYGYILTKKSINKIIQN--- 179 Query: 172 IYRPIDMDMKHWW----EHNIPS-LVTEPGAVY---EAIDTNDSTIEESRL 214 Y PI HW+ + I S P VY E I DS I E R Sbjct: 180 -YLPIYCVCDHWFFIKKDSKIESFYCVSPSLVYTNDEDIRLVDSFINEERK 229 >gi|205355890|ref|ZP_03222659.1| hypothetical protein Cj8421_1178 [Campylobacter jejuni subsp. jejuni CG8421] gi|205346324|gb|EDZ32958.1| hypothetical protein Cj8421_1178 [Campylobacter jejuni subsp. jejuni CG8421] Length = 257 Score = 136 bits (343), Expect = 3e-30, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 86/231 (37%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I + P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINAYALYPVVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|157415408|ref|YP_001482664.1| hypothetical protein C8J_1088 [Campylobacter jejuni subsp. jejuni 81116] gi|13123730|gb|AAK12951.1|AF343914_4 unknown [Campylobacter jejuni] gi|108514948|gb|ABF93270.1| hypothetical protein [Campylobacter jejuni] gi|157386372|gb|ABV52687.1| hypothetical protein C8J_1088 [Campylobacter jejuni subsp. jejuni 81116] gi|167412365|gb|ABZ79825.1| unknown [Campylobacter jejuni] gi|167412381|gb|ABZ79839.1| unknown [Campylobacter jejuni] gi|307748049|gb|ADN91319.1| Putative uncharacterized protein [Campylobacter jejuni subsp. jejuni M1] gi|315932286|gb|EFV11229.1| glycosyltransferase [Campylobacter jejuni subsp. jejuni 327] Length = 257 Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 85/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINVYALYPVVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|86150773|ref|ZP_01068989.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni subsp. jejuni 260.94] gi|85841943|gb|EAQ59189.1| lipooligosaccharide biosynthesis glycosyltransferase, putative [Campylobacter jejuni subsp. jejuni 260.94] Length = 178 Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats. Identities = 47/187 (25%), Positives = 72/187 (38%), Gaps = 23/187 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNP-ICNRIFSHQ 52 + V++I+L S R+E + ++ L+F FF AI +N + + Sbjct: 1 MKVFIINLERSLDRKEHIQKQIQKLFEKNLSLKNKLEFIFFKAIDAKNKEHLEFKDHFPW 60 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHL 108 R LS E C+ SH LW+ IILEDD +FSDEF ++ + L Sbjct: 61 WGSWVLGRELSDGEKACFASHYKLWQECVKLDEP-IIILEDDVEFSDEFLNNGAEYIDEL 119 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K I+ L K G + + ++ T GY + AA+ L Sbjct: 120 LKSK--YEYIRLCYLFDKRLYF----LSEGGYYLSFEKL--AGTQGYVLQVSAAMKFLKY 171 Query: 169 RKNIYRP 175 K P Sbjct: 172 AKIGSMP 178 >gi|62868792|gb|AAY17580.1| putative glycosyltransferase [Campylobacter jejuni] gi|108514881|gb|ABF93232.1| hypothetical protein [Campylobacter jejuni] Length = 257 Score = 136 bits (342), Expect = 4e-30, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 85/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINVYALYPIVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|300796728|ref|NP_001178231.1| procollagen galactosyltransferase 2 [Bos taurus] gi|297484385|ref|XP_002694258.1| PREDICTED: glycosyltransferase 25 domain containing 2 [Bos taurus] gi|296478943|gb|DAA21058.1| glycosyltransferase 25 domain containing 2 [Bos taurus] Length = 626 Score = 136 bits (342), Expect = 4e-30, Method: Composition-based stats. Identities = 58/285 (20%), Positives = 109/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDRVQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + ++P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKSVPNVVNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 519 Query: 174 RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D M+H+ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPIMYNKHPVAEYMEHYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q +H NA + LPP ++ PS Sbjct: 580 NETVA--TDWDRTHAWKSRKQSRIHRNAKNTEALPPPASLDTAPS 622 >gi|3132249|dbj|BAA28126.1| unnamed protein product [Actinobacillus actinomycetemcomitans] Length = 256 Score = 136 bits (342), Expect = 4e-30, Method: Composition-based stats. Identities = 50/235 (21%), Positives = 87/235 (37%), Gaps = 25/235 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y++SL RRE F + FS F AI ++ + +F+ K + + R Sbjct: 1 MNKYLVSLDKDVQRRELFFAQPGT--ADFSVFPAINTMQKDWDALSAVFNPVKFEQHYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I A++ EDDA F+ + L L Sbjct: 59 KVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDALFNKNLPENLTALLAQHCTAD 118 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I + P ++L G+ P T Y I K AA Sbjct: 119 IILIGQSKIAGFDDAELEINYPTTFTFLRKKLGDVTYAYPYKSYFAGTVAYLIKKSAART 178 Query: 165 LLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L + + + D + + + + V P V E S +E R + Sbjct: 179 FLAILEQETPFWLA-DDFLLFEIQFKVSNQVVRPLMVIENPQLV-SNLEAVRRSK 231 >gi|251793915|ref|YP_003008647.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter aphrophilus NJ8700] gi|247535314|gb|ACS98560.1| lipooligosaccharide biosynthesis protein lex-1 [Aggregatibacter aphrophilus NJ8700] Length = 282 Score = 136 bits (342), Expect = 4e-30, Method: Composition-based stats. Identities = 54/260 (20%), Positives = 91/260 (35%), Gaps = 25/260 (9%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKRL 61 +VIS+P + RR + + + F FFDA N R + + Sbjct: 7 KNFVISIPTADKRRNHIIQQFGQKKIPFEFFDAFTPSERLNDHLQRYLPNVAATPR---- 62 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL---- 117 L++ E GC +SH LWK+ + EDD + Q L + Sbjct: 63 LTMGEKGCLMSHFMLWKKCVDDGLDYITLFEDDILLGENAEQFLAEDEWLKVRFNFQEIF 122 Query: 118 -IKFDALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 ++ + K + LP +I + T GY I AA +L+ V + Sbjct: 123 VLRLETFLMPVKIEKQQGILPFQQREIDILKSKHFGTAGYVISHGAAKYLIEVFEKFSSE 182 Query: 174 --RPIDMDM--KHWWEHNIPSLVTEPG-AVYEAIDTNDSTIEESRLVRKPTFSPLY---- 224 +PID M + P V E ++++ ES L ++ + + Sbjct: 183 EVKPIDEIMFNQLIDISGYQVYQLNPAICVQELQLNQENSVLESGLQKERKKNTVSHTKK 242 Query: 225 --FYRNTCYQWNLHYNAWRK 242 YR T + N+ +K Sbjct: 243 TLKYRLTRMKENILRALNKK 262 >gi|328789321|ref|XP_397154.3| PREDICTED: glycosyltransferase 25 family member-like [Apis mellifera] Length = 567 Score = 135 bits (341), Expect = 4e-30, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 85/238 (35%), Gaps = 36/238 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQF 58 +Y+I+L RR + + ++ DA+ G + Sbjct: 321 NIYMINLLRRPERRNRMHKLFKELGIRVETHDAVDGRALNQSILEKMGIKIMPEYTDPYH 380 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 R +++ EIGC++SH ++W ++ + IILEDD F F Q +L L + Sbjct: 381 DRPMTMGEIGCFLSHYNIWNKVIENDFKSVIILEDDVRFEPFFCQKLNYILTELKDLHLE 440 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK-NIY 173 L+ RKK +++ + +H T GY + A L+ Sbjct: 441 WDLVYLG--RKKLAENAEPWIDESKYLVHAAYSY--WTLGYILSASGARKLIEAMPLKQL 496 Query: 174 RPIDMDM--------KHWW-----EHNIPSLVTEPGAVYEA---------IDTNDSTI 209 P+D + + W + N+ L P +Y DT +STI Sbjct: 497 IPVDEYLPILSDVHPRKDWKIYYPKRNLIILSANPLLIYPTHYTGEQGYVSDTENSTI 554 >gi|125381152|gb|ABN41495.1| putative glycosyltransferase [Campylobacter jejuni] Length = 266 Score = 135 bits (341), Expect = 5e-30, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 85/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 10 MKFFIINLKIAEDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 69 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 70 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 129 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 130 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 189 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 190 KIIFPYDHCTSN--DNIINVYALYPVVITTDEIFGAQTYVQDDNKRRYRKR 238 >gi|160395573|sp|Q29NU5|GLT25_DROPS RecName: Full=Glycosyltransferase 25 family member; Flags: Precursor Length = 626 Score = 135 bits (341), Expect = 5e-30, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 74/186 (39%), Gaps = 15/186 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +++I+L RR+K + I LQ F A+ G+ F Sbjct: 344 IFMINLERRPERRQKMENLFEEIGLQVEHFPAVDGKELNADRVQEMGIRFLPGYEDPYHH 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ---LLPHLSKCDINNI 116 R +++ EIGC++SH +W R+ +ILEDD F F + + ++ + Sbjct: 404 RAMTMGEIGCFLSHYRIWVRMVQLELKEVLILEDDIRFDPYFRANAVRVLNQARSVVEYD 463 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YR 174 LI F R K + + ++ S T GY + + A+ LL K + Sbjct: 464 LIYFGRKRLKEESEPWVADADSLVHAG----YSYWTLGYVLSLQGALKLL-AAKPLEKLI 518 Query: 175 PIDMDM 180 P+D + Sbjct: 519 PVDEFL 524 >gi|39936414|ref|NP_948690.1| glycosyl transferase [Rhodopseudomonas palustris CGA009] gi|39650269|emb|CAE28792.1| possible glycosyl transferase [Rhodopseudomonas palustris CGA009] Length = 268 Score = 135 bits (341), Expect = 5e-30, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 75/202 (37%), Gaps = 6/202 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + + V+SLP + ARR + L + +FDA + N + + + F R Sbjct: 4 SVKIVVLSLPTATARRSQMSAMLGNTSLDWQYFDA--HASLMCPNLHYDGAELRRHFGRN 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSP-AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS PEI SH + + A ++LEDD F +F L +N L Sbjct: 62 LSPPEIAVCSSHAAILQEFVDRNQADYILVLEDDVIFDTDFP--LDQFCAFCASNGLEYV 119 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 K K +L I + Y + + AA L+ ++I +D+ M Sbjct: 120 RLFGKHYAKAVHLGFFYDR-SIVRFTTSPAGAQAYLMSRMAARRFLDSYQSIRATVDLAM 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAI 202 +W +P P + E Sbjct: 179 DSFWITQLPIYSIFPYPIIERF 200 >gi|46487347|gb|AAS99068.1| Tgh001 [Campylobacter jejuni] gi|58585452|gb|AAW79073.1| unknown [Campylobacter jejuni] Length = 257 Score = 135 bits (340), Expect = 6e-30, Method: Composition-based stats. Identities = 46/231 (19%), Positives = 85/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL EIGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGEIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 + + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYKQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINVYALYPVVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|307245938|ref|ZP_07528021.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853157|gb|EFM85379.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 256 Score = 135 bits (339), Expect = 8e-30, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 84/234 (35%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--ADFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD F+ F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLNAD 118 Query: 117 LI-----------KFDALRKKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ + P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFNDTELAINYPTTFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|332978467|gb|EGK15180.1| lipooligosaccharide biosynthesis protein LpsA [Psychrobacter sp. 1501(2011)] Length = 255 Score = 135 bits (339), Expect = 8e-30, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 85/212 (40%), Gaps = 9/212 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I YVISL + RR+ + A+ +F F +A+ + + + L Sbjct: 2 IKSYVISLEKEYERRDHIVKQFAQHDTEFQFLNAVTPDTTFEEIQTLGLMDLNFES---L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ C +SH+ LW I EDD +F++L+ S + ++K + Sbjct: 59 TAGELACLLSHLKLWHLAVEENYDYIAIFEDDIHLGSDFNKLMNDSSWLK-DLDIVKLEK 117 Query: 123 LRKKPKKDSYLCTLPG-NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDMD 179 R + + G N +H + + T GY + + A +L+ K + ID Sbjct: 118 FRPTVELSFNSIKIEGTNRKLHILKGKNLGTGGYILSLKGANYLIRYFKRLGYVEAIDAV 177 Query: 180 M--KHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 + + + ++P +P V + N+ ++ Sbjct: 178 LFNERKYPKDLPIYQLQPAVVIQDSKLNEQSV 209 >gi|110006061|gb|ABG48533.1| LpsA [Haemophilus influenzae] Length = 256 Score = 135 bits (339), Expect = 9e-30, Method: Composition-based stats. Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDVALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +I+ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQP--QSVKCDRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGAVYEAI-----DTNDSTIEESR 213 + H + L PG + + +S+++E R Sbjct: 185 FLHFKDYKILQLSPGICVQDFVLHPENPFESSLKEGR 221 >gi|32033957|ref|ZP_00134213.1| COG3306: Glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208502|ref|YP_001053727.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae L20] gi|307257074|ref|ZP_07538849.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|126097294|gb|ABN74122.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae serovar 5b str. L20] gi|306864445|gb|EFM96353.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 256 Score = 135 bits (339), Expect = 9e-30, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 84/234 (35%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--ADFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD F+ F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLNAD 118 Query: 117 LI-----------KFDALRKKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ + P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFDDTELAINYPTTFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|307250276|ref|ZP_07532230.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306857717|gb|EFM89819.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 256 Score = 135 bits (339), Expect = 9e-30, Method: Composition-based stats. Identities = 51/234 (21%), Positives = 84/234 (35%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A + F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--VDFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD + F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLLAANFQQNLTALLNEKLNAD 118 Query: 117 LI-----------KFDALRKKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ + P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFDDTELAINYPTTFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|308497624|ref|XP_003110999.1| hypothetical protein CRE_04808 [Caenorhabditis remanei] gi|308242879|gb|EFO86831.1| hypothetical protein CRE_04808 [Caenorhabditis remanei] Length = 531 Score = 134 bits (338), Expect = 9e-30, Method: Composition-based stats. Identities = 48/239 (20%), Positives = 86/239 (35%), Gaps = 38/239 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF----- 58 +Y+I+L R + + +++S +A G+ + + Sbjct: 288 KIYLINLKRRSERLNRMHKIFDLLGVEYSLLEATDGQQLEDLPADLKNYHILNGYLDPIT 347 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS----DEFSQLLPHLSKCDIN 114 KR + EIGC++SH +W+ + + I+ EDD FS ++L L Sbjct: 348 KRPMKKGEIGCFLSHYRIWQDVVKNKLKKVIVFEDDLRFSYNGLTRIKEVLQDLDASQKE 407 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL--NVRKNI 172 LI L +K + D+ +P + + S T GY + A LL K + Sbjct: 408 WDLI---YLGRKKQSDNEELWIPLHRHLSSV-EYSYWTLGYMLSLSGAEKLLVPEPLKKV 463 Query: 173 YRPIDMDMK-HWWEH------------NIPSLVTEPGAVYE---------AIDTNDSTI 209 P+D + + +H ++ + P V+ DT DS I Sbjct: 464 V-PVDEYLPIMFNKHPNKEWSSHFEPLDLNAFTLYPLVVFPQRYTNEPGYVSDTEDSVI 521 >gi|165976453|ref|YP_001652046.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876554|gb|ABY69602.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 256 Score = 134 bits (338), Expect = 9e-30, Method: Composition-based stats. Identities = 54/234 (23%), Positives = 85/234 (36%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--ADFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD F+ F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLNAD 118 Query: 117 LI--------KFDALR---KKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ FD P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFDDTELAINYPATFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|303251039|ref|ZP_07337225.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252666|ref|ZP_07534558.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307261511|ref|ZP_07543180.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|302650049|gb|EFL80219.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306859842|gb|EFM91863.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306868794|gb|EFN00602.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 256 Score = 134 bits (338), Expect = 1e-29, Method: Composition-based stats. Identities = 54/234 (23%), Positives = 85/234 (36%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--ADFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD F+ F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLFAANFQQNLTALLNEKLNAD 118 Query: 117 LI--------KFDALR---KKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ FD P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFDDTELAINYPTTFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|116250924|ref|YP_766762.1| glycosyl transferase [Rhizobium leguminosarum bv. viciae 3841] gi|115255572|emb|CAK06652.1| putative glycosyl transferase [Rhizobium leguminosarum bv. viciae 3841] Length = 267 Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats. Identities = 58/249 (23%), Positives = 100/249 (40%), Gaps = 13/249 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L + RR + A L F A+ G + + F + R Sbjct: 21 VNTYLINLDRAPLRRFRMERLLASFGLAFERVAAVDGAGLSLPHPGFDDAAYLSRHGRRP 80 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGCY+SH+ KR SPA A+ILEDD DF D+ ++LL +++ Sbjct: 81 NPFEIGCYLSHVECAKRFLGSPAEFALILEDDLDFDDDLAELLDAALDHQARWDILRLST 140 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + K + L + + + Y I ++AA + V + P D+ Sbjct: 141 VNSGKK--HKVEPLTASRSLAIALTREKGSGAYLINRKAAGWIAGVLVPMRLPYDLAFDL 198 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYF-----YRNTCYQW 233 ++ + + +P V + D S I+ RL R+ +S L + R ++ Sbjct: 199 EFDDGLSACFVDPLPVSQRADPC-SQIQAGLSAYRLGRRRPWSVLPYRAAAELRRFAARF 257 Query: 234 NLHYNAWRK 242 + AWR Sbjct: 258 S-RLAAWRL 265 >gi|145637013|ref|ZP_01792677.1| LosA [Haemophilus influenzae PittHH] gi|145269871|gb|EDK09810.1| LosA [Haemophilus influenzae PittHH] Length = 148 Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats. Identities = 38/150 (25%), Positives = 60/150 (40%), Gaps = 10/150 (6%) Query: 71 ISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILIKFDALRKKPKK 129 +SH++++ + AIILEDDA S EF + L L+K ++ ++ K K Sbjct: 1 MSHLYIYNMMQEQNIDKAIILEDDAIVSQEFEHIVLDSLNKVPNTMDILFYE--HGKAKT 58 Query: 130 DSYLCTLPGNFDIHQP-------RILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 L + + + RTT Y I +E A LL + I P D Sbjct: 59 YFCKKNLIEGYRLVHYCTPSKRSKRTITRTTAYLITQEGANKLLKLAYPIRMPADYLTGA 118 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 + + EP V+ ID+ IE+ Sbjct: 119 LQLTGLNAYGVEPPCVFRGIDSEIDAIEQR 148 >gi|198415096|ref|XP_002129882.1| PREDICTED: similar to GLT25D1 protein [Ciona intestinalis] Length = 594 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 87/242 (35%), Gaps = 37/242 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF----- 58 +++I+L R + ++F+ F A+ + + Sbjct: 349 KIFMINLERREDRYYRMSKALELQGIEFTHFKAVDSKTLNTTYLKHMGIDMLPGYVDPYR 408 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDIN 114 +R L+ EIGC++SH +W+ + + I+ EDD F F++ L + + Sbjct: 409 ERTLTRGEIGCFLSHYFIWQEVVKNKLDQVIVFEDDLRFEISFNRRLKNVMQEIEDAKLE 468 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 LI RK+ + ++P + + S T Y + A LL K ++R Sbjct: 469 WDLIYIG--RKRMQIKEPEKSVP-DCPLLVEADYSYWTLAYILSNSGAQKLL-AGKPLHR 524 Query: 175 --PIDMDMK-HWWEH------------NIPSLVTEPGAVYEA---------IDTNDSTIE 210 P+D + + +H ++ + P +Y DT S+I Sbjct: 525 MVPVDEYLPIMFNKHPIEKYMTPFEKRDLKAFSVNPLLIYPTHYTGQINYFSDTETSSIW 584 Query: 211 ES 212 E Sbjct: 585 ED 586 >gi|21069171|gb|AAM33851.1|AF470655_3 LgtE [Neisseria meningitidis] Length = 280 Score = 133 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 76/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++ISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHIISLASAAERRAHIAATFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHVVLWKQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|190150353|ref|YP_001968878.1| glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|303253661|ref|ZP_07339799.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307248072|ref|ZP_07530101.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307254912|ref|ZP_07536734.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307259353|ref|ZP_07541080.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|307263696|ref|ZP_07545304.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189915484|gb|ACE61736.1| putative glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|302647581|gb|EFL77799.1| glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306855470|gb|EFM87644.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306862153|gb|EFM94125.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306866589|gb|EFM98450.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306870950|gb|EFN02686.1| Lsg locus putative protein 4 [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 256 Score = 133 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 51/234 (21%), Positives = 83/234 (35%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + ++ISL RRE F +A F+ F AI N + F K + ++ R Sbjct: 1 MKKFLISLDKDAQRRELFFSQADT--ADFTVFSAINTMNVEQSELEQRFDFAKFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD + F Q L L +N Sbjct: 59 KVTKGEIGCTMSHLGVYELIVNDESINEDDYALVCEDDCLLAANFQQNLTALLNEKLNAD 118 Query: 117 LI-----------KFDALRKKPKKDSYLCTLPG--NFDIHQPRI-LSPRTTGYFIGKEAA 162 ++ + P +L G + P T Y I K A Sbjct: 119 IVLVGQSKILSFDDTELAINYPTTFRFLQKKIGETEYRYAYPYKNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I +LV P E + S +E R Sbjct: 179 RKFLAQKATELPFWLA-DDYILFGQQFAINTLVVRPLMAIENPNLV-SNLENLR 230 >gi|326924714|ref|XP_003208570.1| PREDICTED: procollagen galactosyltransferase 2-like [Meleagris gallopavo] Length = 552 Score = 133 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 54/262 (20%), Positives = 96/262 (36%), Gaps = 39/262 (14%) Query: 32 SFFDAIYGENNPICNRIFSHQKRQCQFK-----RLLSLPEIGCYISHIHLWKRIAYSPAI 86 +A+ G+ ++ R L+ EIGC++SH ++WK + Sbjct: 297 KIVEAVDGKALNTSQLKALSIDMLPGYRDPYSSRPLTRGEIGCFLSHYYIWKEVLNRGLE 356 Query: 87 GAIILEDDADFSDEFSQLL----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI 142 +++EDD F +F + L + + ++ LI RK+ + +P ++ Sbjct: 357 KTLVIEDDVRFEHQFKRKLMKLMDDIEQAQLDWELIYIG--RKRMQVQQPEKAVPNVMNL 414 Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDM-------------DMKHWWEHN 187 + S T GY I + A L+ + P+D M+++ + Sbjct: 415 VEA-DYSYWTLGYAISFQGAQKLI-GAEPFSKMLPVDEFLPVMYNKHPVAKYMEYYESRD 472 Query: 188 IPSLVTEPGAVYEA---------IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + + EP VY DT STI ++ V T + Q +H Sbjct: 473 LKAFSAEPLLVYPTHYTGQPGYLSDTETSTIWDNETVS--TDWDRTHSWKSRQQGQIHSE 530 Query: 239 AWRKDLPPVSTTKFLPSSSSSL 260 A KD P ++ PSS L Sbjct: 531 AQNKDALPPQSSLNAPSSRDEL 552 >gi|126306516|ref|XP_001375578.1| PREDICTED: similar to C1orf17 [Monodelphis domestica] Length = 680 Score = 133 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 60/289 (20%), Positives = 114/289 (39%), Gaps = 48/289 (16%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + +K Sbjct: 398 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIDMLPGYKDPYSS 457 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ EIGC++SH +WK + +++EDD F +F + L L DI+++ + Sbjct: 458 RPLTRGEIGCFLSHYSIWKEVIDRELEKTLVIEDDVRFEHQFKKKLVKLM-DDIDHVQLD 516 Query: 120 FDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--R 174 ++ + RK+ + +P ++ + S T GY I E A L+ Sbjct: 517 WELIYIGRKRMQVQEPEKAVPNVMNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKML 574 Query: 175 PIDM-------------DMKHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEES 212 P+D M+++ ++ + EP +Y DT STI ++ Sbjct: 575 PVDEFLPIMYNKHPVAKYMEYYEPRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWDN 634 Query: 213 RLV-----RKPTFSPLYFYRNTCYQWNLHYNAWRKD-LPPVSTTKFLPS 255 V RK ++ + Q ++ NA D LPP ++ PS Sbjct: 635 ETVSTDWDRKHSW-------KSRKQEHIQSNAQNTDALPPTTSLGTAPS 676 >gi|154174436|ref|YP_001407702.1| putative glycosyltransferase involved in LPS biosynthesis [Campylobacter curvus 525.92] gi|112803649|gb|EAU00993.1| putative glycosyltransferase involved in LPS biosynthesis [Campylobacter curvus 525.92] Length = 260 Score = 133 bits (336), Expect = 2e-29, Method: Composition-based stats. Identities = 57/244 (23%), Positives = 95/244 (38%), Gaps = 23/244 (9%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 PV++ISL RRE R +F+ DA+ G + ++LS Sbjct: 3 PVFLISLAKDAGRREALKQRFKSYD-KFTLIDAVDGRQMNAREYFGYVLPSFRAYGKILS 61 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILI---- 118 E+GC +SH+ ++ S A A++LEDD D + + N++LI Sbjct: 62 PSEVGCSLSHVKAYEAFLASEAKFALVLEDDVIGDDVGVAAAFELAEQMPQNSLLICGCQ 121 Query: 119 -----KFDALRKKPKKDSYLCTLPGNFDIHQPRILS----PRTTGYFIGKEAAIHLLNVR 169 +F A K+ D T P N ++ S R Y + +++A +LL + Sbjct: 122 DGLAGRFSAFGKRVNLD----TNPENRELWLVSRYSHSSIYRAGAYVLSRQSAQNLLRIH 177 Query: 170 KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR--KPTFSPLYFYR 227 K D+ + ++ ++ A ID + S IE R+ R KP F Sbjct: 178 KRALCTTDVWEYLLRQDDMTMYFSDIFA--HPIDLSGSNIEAERIDRGYKPNFKARLRSL 235 Query: 228 NTCY 231 + Sbjct: 236 KFLF 239 >gi|258620626|ref|ZP_05715663.1| Putative glycosyltransferase [Vibrio mimicus VM573] gi|258587141|gb|EEW11853.1| Putative glycosyltransferase [Vibrio mimicus VM573] Length = 252 Score = 133 bits (335), Expect = 2e-29, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 97/256 (37%), Gaps = 37/256 (14%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRL 61 + +YV+SL S RR ++ ++++ F F DA+ G P +++ Q +++R Sbjct: 1 MGIYVVSLLDSS-RRINIINQFEKLNINFEFVDAVNGRLLPQAIVDSVNNKDVQSRYRRK 59 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF- 120 + L EIGC +SH +++ IA S AII EDD SD S ++ + ++ Sbjct: 60 IGLGEIGCALSHQKIYQAIAESSVDYAIIFEDDIQLSDNISAVIEYFKSTAVSLGDKNLY 119 Query: 121 -----------DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL-LNV 168 D L + K+ + ++ RT Y + ++ A L Sbjct: 120 ILGGQEGLSSQDLLVRSVKEQINVSNDVSFVKLNDSAKYIYRTCSYVVSRDLAKELKCTY 179 Query: 169 RKNIYRPIDM-------DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 + Y D K + N+ Y D S IE+ R ++ Sbjct: 180 DEAFYIADDWNYLSTLGVFKKLYLTNV---------AYHPEDLVGSVIEQERQAKQ---- 226 Query: 222 PLYFYRNTCYQWNLHY 237 ++ + + L Sbjct: 227 --SSWKRSKLRALLRT 240 >gi|21069194|gb|AAM33868.1|AF470661_2 LgtE [Neisseria meningitidis] Length = 280 Score = 133 bits (335), Expect = 3e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + Sbjct: 59 SEVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAEDGWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|269213729|ref|ZP_05982739.2| lacto-N-neotetraose biosynthesis glycosyl tranferase LgtE [Neisseria cinerea ATCC 14685] gi|269145630|gb|EEZ72048.1| lacto-N-neotetraose biosynthesis glycosyl tranferase LgtE [Neisseria cinerea ATCC 14685] Length = 295 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 49/261 (18%), Positives = 83/261 (31%), Gaps = 24/261 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL + RR R + F FFDA+ + + + + LL Sbjct: 16 MRNYVISLASAKERRAHIADTFGRHDIPFQFFDALMP-SEELEQAMGELVPGLVRHN-LL 73 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + Sbjct: 74 SSVEKACFMSHVVLWKQALDEGVSYIAVFEDDVLLGEGAEKFLAKDGWLKERFDPDSAFI 133 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K T GY I +EA L + I Sbjct: 134 VRLETMFIKVIARPCGVPDYEGRHFPLLESEHWGTAGYIISREAMRFFLERFAVLPAERI 193 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D M ++ + +P P + + S +EESR + Sbjct: 194 -KAVDWMMFTYFFDKEGMPVYQVSPALCAQELHFSMYNRQISRLGSDLEESREKGRKHCR 252 Query: 222 PLYFYRNTCYQWNLHYNAWRK 242 L + N W +K Sbjct: 253 LLKLFFNVRRAWGKFDREKKK 273 >gi|261393257|emb|CAX50883.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis 8013] Length = 280 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + Sbjct: 59 SEVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAEDGWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKILNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|240948365|ref|ZP_04752751.1| putative glycosyltransferase [Actinobacillus minor NM305] gi|240297404|gb|EER47945.1| putative glycosyltransferase [Actinobacillus minor NM305] Length = 256 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 51/234 (21%), Positives = 85/234 (36%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + ++F + + + R Sbjct: 1 MKKYLISLEKDVQRRELFFSQPDTQ--DFQIFSAINTMSLSSEELAKMFDVTQFEQHYHR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I A++ EDD FS F++ L L + Sbjct: 59 AVTKGEIGCTMSHLKVYQWIVDDESIAEEDYALVCEDDVLFSANFNENLTALLNEKLTAD 118 Query: 117 LIK--------FDALR---KKPKKDSYLCTLPGNFDI---HQPRILSPRTTGYFIGKEAA 162 ++ FD + P ++ G D + + T Y I K AA Sbjct: 119 IVLVGQSKIPTFDDVELSINYPTTFKWMQKKIGQTDYCYAYPYKNYFAGTVAYLIKKSAA 178 Query: 163 IHLL---NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + D + + ++ LV P E T S +E R Sbjct: 179 RRFLMEQQKALPFWLA-DDFIWFGEKFDMDILVVRPLMAIENP-TLVSNLETLR 230 >gi|192292142|ref|YP_001992747.1| glycosyl transferase family 25 [Rhodopseudomonas palustris TIE-1] gi|192285891|gb|ACF02272.1| glycosyl transferase family 25 [Rhodopseudomonas palustris TIE-1] Length = 268 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 74/202 (36%), Gaps = 6/202 (2%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + + V+SLP + ARR + L + +FDA + N + + + F R Sbjct: 4 SVKIVVLSLPTATARRSQMSAMLGNTSLDWQYFDA--HASLMCPNLHYDGAELRRHFGRN 61 Query: 62 LSLPEIGCYISHIHLWKRIAYSP-AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LS PEI SH + A ++LEDD F +F L +N L Sbjct: 62 LSPPEIAVCSSHAAILHEFVDRNQADYILVLEDDVIFDTDFP--LDQFCAFCASNGLEYV 119 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 K K +L I + Y + + AA L+ ++I +D+ M Sbjct: 120 RLFGKHYAKAVHLGFFYDR-SIVRFTTSPAGAQAYLMSRMAARRFLDSYRSIRATVDLAM 178 Query: 181 KHWWEHNIPSLVTEPGAVYEAI 202 +W +P P + E Sbjct: 179 DSFWITQLPIYSIFPYPIIERF 200 >gi|157326137|gb|ABV44301.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 66/331 (19%), Positives = 110/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDVTLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKELVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A ++ + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIDATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + DSTI++ + +K F Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLSYVPCPVSLSEHALDSTIQKPQKKGLKSYPFPPKKSAF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + L ++ +RK P+ T K Sbjct: 300 KNLFYYSINAKKRLLAFHRYRKRYAPLKTPK 330 >gi|218678647|ref|ZP_03526544.1| glycosyl transferase family 25 [Rhizobium etli CIAT 894] Length = 276 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 51/230 (22%), Positives = 91/230 (39%), Gaps = 8/230 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 Y+I+L + RR + L F A+ G + + F + R + Sbjct: 23 TYLINLDRAPLRRFRMERLLTDFGLAFERVAAVDGAGLSLPHPGFDEAAYLSRHGRRPNP 82 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 EIGCY+SH+ +R A A++LEDD DF D+ ++LL + +++ + Sbjct: 83 FEIGCYLSHVECARRFLAGHAEFALVLEDDLDFDDDLTELLEAALEHHERWDILRLSTVN 142 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 K + TL + + + Y I ++AA + + P D+ + Sbjct: 143 SGKKHR--VETLTASRSLAIALTREKGSGAYLINRKAAGWITGALLPMRLPYDLAFDLEF 200 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYFYRNTC 230 + + + +P V + D S I+ RL R+ +S L YR Sbjct: 201 DAGLNACFVDPLPVSQRAD-GHSQIQAGLSAYRLGRRRPWSVLP-YRTAA 248 >gi|110006051|gb|ABG48528.1| LpsA [Haemophilus influenzae] gi|110006053|gb|ABG48529.1| LpsA [Haemophilus influenzae] gi|110006063|gb|ABG48534.1| LpsA [Haemophilus influenzae] gi|110006065|gb|ABG48535.1| LpsA [Haemophilus influenzae] gi|110006067|gb|ABG48536.1| LpsA [Haemophilus influenzae] Length = 256 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 90/217 (41%), Gaps = 13/217 (5%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 68 EIGCALSHIVLWDVALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 126 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +I+ + GY + + A +LL + K + +D + Sbjct: 127 MFFKQP--QSVKCDRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 184 Query: 184 WEH--NIPSLVTEPGAVYEAI-----DTNDSTIEESR 213 + H + + PG + + +S+++E R Sbjct: 185 FLHFKDYKIVQLSPGICVQDFVLHPENPFESSLKEGR 221 >gi|21069183|gb|AAM33860.1|AF470658_3 LgtE [Neisseria meningitidis] gi|21069191|gb|AAM33866.1|AF470660_3 LgtE [Neisseria meningitidis] Length = 280 Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + Sbjct: 59 SEVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAEDGWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|193787780|dbj|BAG52983.1| unnamed protein product [Homo sapiens] Length = 363 Score = 132 bits (333), Expect = 4e-29, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 81 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 140 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 141 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 200 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--Y 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 201 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKV 256 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 257 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 316 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 317 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 359 >gi|145642433|ref|ZP_01797988.1| LosA [Haemophilus influenzae R3021] gi|145272871|gb|EDK12762.1| LosA [Haemophilus influenzae 22.4-21] Length = 148 Score = 132 bits (333), Expect = 4e-29, Method: Composition-based stats. Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 10/150 (6%) Query: 71 ISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALRKKPKK 129 +SH+ +++ + AIILEDDA S EF ++ L K N ++ +D K K Sbjct: 1 MSHLSIYQMMCNEKIARAIILEDDAIVSHEFEAIVKDSLKKVSKNVEILFYD--HGKAKS 58 Query: 130 DSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + TL N+ + R S T Y I A LL + I P D Sbjct: 59 YCWKKTLVENYRLVHYRKPSKTSKRAIMCATAYLITLSGAQKLLQIAYPIRMPADYLTGA 118 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 + + EP V++ + +E+ Sbjct: 119 LQLTGLKAYGVEPPCVFQGTISEIDAMEQR 148 >gi|301156387|emb|CBW15858.1| unnamed protein product [Haemophilus parainfluenzae T3T1] Length = 256 Score = 132 bits (333), Expect = 5e-29, Method: Composition-based stats. Identities = 56/235 (23%), Positives = 88/235 (37%), Gaps = 25/235 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + +F+ K + + R Sbjct: 1 MNKYLISLDKDVQRRELFFAQPDT--ADFVVFSAINTMQKEWEELAEVFNPTKFEQHYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHL--SKCDIN 114 ++ EIGC +SH+ ++++I A++ EDDA F+ S L KCD + Sbjct: 59 NVTKGEIGCTLSHLAVYRQIVEDQNVTENDYALVCEDDALFNANLSPKTIALLTEKCDAD 118 Query: 115 NILIKFDALRK---------KPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +LI + + P S+L G+ + P T Y I K AA Sbjct: 119 IVLIGQSKIAEFNDVELEINYPTTFSFLRQTIGDVTVAYPYKSYFAGTVAYLIKKSAARA 178 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K + D + + I + V P E S +E R + Sbjct: 179 FLQQLEQEKPFWLA-DDFLLFETQFQINNNVVRPLVAIENPQLV-SNLEAIRGSK 231 >gi|3043692|dbj|BAA25510.1| KIAA0584 protein [Homo sapiens] Length = 738 Score = 132 bits (332), Expect = 5e-29, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 456 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 515 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 516 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 575 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 576 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 631 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 632 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 691 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 692 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 734 >gi|325578527|ref|ZP_08148627.1| glycosyltransferase [Haemophilus parainfluenzae ATCC 33392] gi|325159763|gb|EGC71893.1| glycosyltransferase [Haemophilus parainfluenzae ATCC 33392] Length = 256 Score = 132 bits (332), Expect = 5e-29, Method: Composition-based stats. Identities = 56/235 (23%), Positives = 88/235 (37%), Gaps = 25/235 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + +F+ K + + R Sbjct: 1 MNKYLISLDKDVQRRELFFAQPDT--ADFVVFSAINTMQKEWDELAEVFNPTKFEQHYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHL--SKCDIN 114 ++ EIGC +SH+ ++++I A++ EDDA F+ S L KCD + Sbjct: 59 NVTKGEIGCTLSHLAVYRQIVEDQNVTENDYALVCEDDALFNANLSPKTIALLTEKCDAD 118 Query: 115 NILIKFDALRK---------KPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +LI + + P S+L G+ + P T Y I K AA Sbjct: 119 IVLIGQSKIAEFNNVELEINYPTTFSFLRQTIGDVTVAYPYKSYFAGTVAYLIKKSAARA 178 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K + D + + I + V P E S +E R + Sbjct: 179 FLKQLEQEKPFWLA-DDFLLFETQFQINNNVVRPLVAIENPQLV-SNLEAIRGSK 231 >gi|257465064|ref|ZP_05629435.1| putative glycosyltransferase [Actinobacillus minor 202] gi|257450724|gb|EEV24767.1| putative glycosyltransferase [Actinobacillus minor 202] Length = 256 Score = 132 bits (332), Expect = 5e-29, Method: Composition-based stats. Identities = 50/234 (21%), Positives = 84/234 (35%), Gaps = 27/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN--PICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + +F + + + R Sbjct: 1 MKKYLISLEKDVQRRELFFSQPDTQ--DFQIFSAINTMSLSSEELENMFDVTQFEQHYHR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+++++ I A++ EDD FS F++ L L + Sbjct: 59 AVTKGEIGCTMSHLNVYQWIVDDESIAEEDYALVCEDDVLFSANFNENLTALLNEKLTAD 118 Query: 117 LIK--------FDALR---KKPKKDSYLCTLPG---NFDIHQPRILSPRTTGYFIGKEAA 162 ++ FD + P ++ G + + T Y I K AA Sbjct: 119 IVLVGQSKIPTFDDVELSINYPTTFKWMQKKIGKTDYCYAYPYQNYFAGTVAYLIKKSAA 178 Query: 163 IHLL---NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + D + + ++ LV P E T S +E R Sbjct: 179 RRFLMEQQKTSPFWLA-DDFIWFGEKFDMDILVVRPLMAIENP-TLVSNLEALR 230 >gi|308061950|gb|ADO03838.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Cuz20] Length = 332 Score = 132 bits (332), Expect = 5e-29, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 103/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ICVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L + Sbjct: 61 MLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTDQP-VVILEDDVALESNFMQALEN 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTCVPCPVSLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RELFFYSLNAKKRLKAFHKYSK 321 >gi|55960748|emb|CAI14723.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] Length = 363 Score = 132 bits (332), Expect = 6e-29, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 81 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 140 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 141 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 200 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 201 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 256 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 257 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 316 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 317 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 359 >gi|307340800|gb|ADN43861.1| WcnL [Escherichia coli] Length = 248 Score = 131 bits (331), Expect = 6e-29, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 77/215 (35%), Gaps = 3/215 (1%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I +Y I+L +ARREK A ++ L +AI G N + + R L Sbjct: 2 IKIYTINLFTENARREKILSLAKKLELPIKIENAIIGSNLSKEYLDNLTNESIDKIGRNL 61 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+GCY+SHI K S AIILEDD + + + Sbjct: 62 SPGEVGCYLSHIKCLKNFLSSGDEFAIILEDDVTLDYRIKDFISTIEFKETTLFFDVMLL 121 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDMK 181 + + + + + Y + ++ A ++ N I+ P D Sbjct: 122 GYRNGYGSYWGKKKWNSHKLLRFPDCGYGAHAYLVTRKGAEKIVYNNAIPIW-PYDYVTG 180 Query: 182 HWWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLV 215 + I E V + +++ S++E R Sbjct: 181 GRADKTIRVYGVEKKIVDLDFYNSSCSSLEAERNY 215 >gi|315634104|ref|ZP_07889393.1| glycosyltransferase [Aggregatibacter segnis ATCC 33393] gi|315477354|gb|EFU68097.1| glycosyltransferase [Aggregatibacter segnis ATCC 33393] Length = 256 Score = 131 bits (331), Expect = 6e-29, Method: Composition-based stats. Identities = 55/235 (23%), Positives = 87/235 (37%), Gaps = 25/235 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + +F+ K + + R Sbjct: 1 MKKYLISLDKDVQRRELFFSQPNT--ADFEVFSAINTMQKEWDALAEVFNPTKFEQHYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++RI A+I EDDA F+ + S+ L + Sbjct: 59 NVTKGEIGCTLSHLEVYRRIVADERIAETDYALICEDDALFNADLSEKTTALLTQHCSAD 118 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +I + P S+L + + P T Y I K AA Sbjct: 119 IILIGQSKIATFNDVELEINYPTTFSFLRQKVADVTVAYPYKSYFAGTVAYLIKKSAARR 178 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L+V K + D + H + + V P E T S +E R + Sbjct: 179 FLDVLIQEKAFWLADDFLLFETQFH-LNNKVVRPLMAIENP-TLMSNLEAIRGSK 231 >gi|108514913|gb|ABF93251.1| hypothetical protein [Campylobacter jejuni] Length = 257 Score = 131 bits (331), Expect = 7e-29, Method: Composition-based stats. Identities = 46/231 (19%), Positives = 84/231 (36%), Gaps = 17/231 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKR 60 + ++I+L + R+E + + +L + A+ G+ ++ I + R Sbjct: 1 MKFFIINLKIAKDRKEYMQNLCLKHNLDYEIIQAVDGKELDSDFVKNISDFSLSEKYLGR 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL IGC +SH ++R+ +ILEDDA F ++ + +L K + L Sbjct: 61 TLSLGGIGCALSHKKCFERMFELNLNECLILEDDAYFDEKLNYILSLKDKFPKDLELFLL 120 Query: 121 DALR----------KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 R + P Y + + + T GY+I K A+ + + Sbjct: 121 GHYRQVYLDDGFRIESPFSLRYDYKIDDFYHAKRLVGGGNGTHGYYINKNGALKMYKYLE 180 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES---RLVRKP 218 I P D + ++ I P + T + R RK Sbjct: 181 KIIFPYDHCTSN--DNIINVYALYPIVITTDEIFGAQTYVQDDNKRRYRKR 229 >gi|237750809|ref|ZP_04581289.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter bilis ATCC 43879] gi|229373254|gb|EEO23645.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein [Helicobacter bilis ATCC 43879] Length = 172 Score = 131 bits (331), Expect = 8e-29, Method: Composition-based stats. Identities = 41/174 (23%), Positives = 65/174 (37%), Gaps = 17/174 (9%) Query: 3 IPVYVISLPFSHARREKFCHR---------AARIHLQFSFFDAIYGENNPICN--RIFSH 51 + ++VI+L S R+ + + L +SFF+AI ++ +F+ Sbjct: 1 MKIFVINLERSKDRKAHMQEKLSLLEKDPLFKELDLSYSFFNAIDSKSESFKEYKAMFNP 60 Query: 52 QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC 111 R+L EI CY SH LW+ I+LEDD F F L + + Sbjct: 61 MLCYLWHGRILIDNEIACYASHYSLWEECVRLDEP-IIVLEDDIFFEKHFLSALQDMKRT 119 Query: 112 DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 ++F + K Y N + H + T GY+I AA Sbjct: 120 T--FSFVRFFTSARSRDKYIYKI---DNSNYHYSLKNTNGTLGYYITPSAAKAF 168 >gi|15991376|gb|AAL12842.1|AF355193_6 LgtE [Neisseria meningitidis] gi|325127442|gb|EGC50372.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis N1568] gi|325129463|gb|EGC52293.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis OX99.30304] gi|325201436|gb|ADY96890.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M01-240149] gi|325207416|gb|ADZ02868.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis NZ-05/33] Length = 280 Score = 131 bits (330), Expect = 8e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 79/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|188527533|ref|YP_001910220.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143773|gb|ACD48190.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 332 Score = 131 bits (330), Expect = 9e-29, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 102/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ICVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYQSPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RELFFYSLNAKKRLKAFHKYSK 321 >gi|304388480|ref|ZP_07370583.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis ATCC 13091] gi|21069187|gb|AAM33863.1|AF470659_3 LgtE [Neisseria meningitidis] gi|304337477|gb|EFM03643.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis ATCC 13091] Length = 280 Score = 131 bits (330), Expect = 9e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + Q L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDESLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHFGMAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|9963922|gb|AAG09766.1|AF240672_3 galactosyl transferase [Neisseria subflava] gi|325204879|gb|ADZ00333.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M01-240355] Length = 280 Score = 131 bits (330), Expect = 9e-29, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|308390022|gb|ADO32342.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria meningitidis alpha710] Length = 280 Score = 131 bits (330), Expect = 9e-29, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 79/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|119611576|gb|EAW91170.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Homo sapiens] Length = 638 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 356 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 415 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 416 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 475 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 476 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 531 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 532 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 591 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 592 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 634 >gi|73960469|ref|XP_859668.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 3 [Canis familiaris] Length = 643 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 104/285 (36%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ + +A+ G+ + + ++ Sbjct: 361 IFMINLKRRKDRRDRMLRTLYEQEIDVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 420 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 421 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 480 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI R + K+ GN T GY I E A L+ Sbjct: 481 ELIYIGRKRMQVKEPEKAVPNVGNLVEADYSYW---TLGYVISLEGAQKLV-GADPFGKM 536 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 537 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 596 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++H NA + LP ++ +PS Sbjct: 597 NETVA--TDWDRTHSWKSRKQSHIHRNAKNTEALPSPTSVDTVPS 639 >gi|332230643|ref|XP_003264502.1| PREDICTED: procollagen galactosyltransferase 2 [Nomascus leucogenys] Length = 626 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 109/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRQLEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 519 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPIMYNKHPVAEYKQYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 580 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 622 >gi|156537912|ref|XP_001608141.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 617 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 51/228 (22%), Positives = 92/228 (40%), Gaps = 29/228 (12%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQF 58 VY+I+L RR + + +Q DA+ G N + Sbjct: 322 NVYMINLLRRPERRRRMMSCLNELGIQVEIIDAVDGRVLNESLLESWNIKMMPGYKDPYH 381 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDIN 114 KR +++ E+GC++SH +W RI ++ILEDD F F Q +L L + + Sbjct: 382 KRPMTMGEVGCFLSHYIVWNRIVEDGDKISLILEDDVKFEPYFRQKIKLILNELERFKKD 441 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI-- 172 L+ RK+ ++D+ + +H S T GY + + A L++ K + Sbjct: 442 WDLVYLG--RKQMQRDTESWVEGSRYLVHA--GYSYWTVGYMLSAKGAKKLID-AKPLES 496 Query: 173 YRPIDMDM-------------KHWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + +++ ++ +L EP ++ T DS Sbjct: 497 LIPVDEYLPIMYDKHPREDWKEYFNGRDLIALSAEPLVLFPTHYTGDS 544 >gi|16506820|ref|NP_055916.1| procollagen galactosyltransferase 2 precursor [Homo sapiens] gi|74750765|sp|Q8IYK4|GT252_HUMAN RecName: Full=Procollagen galactosyltransferase 2; AltName: Full=Glycosyltransferase 25 family member 2; AltName: Full=Hydroxylysine galactosyltransferase 2; Flags: Precursor gi|12620188|gb|AAG60609.1|AF288389_1 C1orf17 [Homo sapiens] gi|23273043|gb|AAH35672.1| Glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|55957835|emb|CAI17872.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|55960747|emb|CAI14722.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] gi|119611578|gb|EAW91172.1| glycosyltransferase 25 domain containing 2, isoform CRA_c [Homo sapiens] gi|168278659|dbj|BAG11209.1| glycosyltransferase 25 domain-containing protein 2 [synthetic construct] gi|325463379|gb|ADZ15460.1| glycosyltransferase 25 domain containing 2 [synthetic construct] Length = 626 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 519 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 580 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 622 >gi|319775527|ref|YP_004138015.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae F3047] gi|329122430|ref|ZP_08251017.1| lipooligosaccharide biosynthesis protein lex-1 [Haemophilus aegyptius ATCC 11116] gi|317450118|emb|CBY86332.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae F3047] gi|327473712|gb|EGF19131.1| lipooligosaccharide biosynthesis protein lex-1 [Haemophilus aegyptius ATCC 11116] Length = 290 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/263 (20%), Positives = 90/263 (34%), Gaps = 27/263 (10%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-----IYGENNPICNRIFSHQKRQCQF--- 58 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 8 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSNSILHNIEE 67 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL- 117 R+L+ E GC ISH LW + I EDD + L Sbjct: 68 SRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFN 127 Query: 118 ----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI 172 I+ + + K + P N + T GY I + AA +L+ KNI Sbjct: 128 DIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYLIEYLKNI 187 Query: 173 ----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 +D +D+ ++ + + P++ + ++ S +E+ R R Sbjct: 188 PSDEIVAVDQLIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKERQKRPKIR 247 Query: 221 SPLYF-YRNTCYQWNLHYNAWRK 242 R T + N+ RK Sbjct: 248 KKKTLKQRLTRIKENIIRALNRK 270 >gi|73960467|ref|XP_859640.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 2 [Canis familiaris] Length = 651 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 104/285 (36%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ + +A+ G+ + + ++ Sbjct: 369 IFMINLKRRKDRRDRMLRTLYEQEIDVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 428 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 429 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 488 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI R + K+ GN T GY I E A L+ Sbjct: 489 ELIYIGRKRMQVKEPEKAVPNVGNLVEADYSYW---TLGYVISLEGAQKLV-GADPFGKM 544 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 545 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 604 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++H NA + LP ++ +PS Sbjct: 605 NETVA--TDWDRTHSWKSRKQSHIHRNAKNTEALPSPTSVDTVPS 647 >gi|332811368|ref|XP_524994.3| PREDICTED: procollagen galactosyltransferase 2 [Pan troglodytes] Length = 626 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 519 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 580 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 622 >gi|293341373|ref|XP_001070927.2| PREDICTED: glycosyltransferase 25 domain containing 2 [Rattus norvegicus] gi|293352770|ref|XP_222718.5| PREDICTED: glycosyltransferase 25 domain containing 2 [Rattus norvegicus] gi|149058405|gb|EDM09562.1| glycosyltransferase 25 domain containing 2 (predicted) [Rattus norvegicus] Length = 625 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 57/285 (20%), Positives = 109/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRDPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMDDIDKARLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 518 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 519 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 578 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q +H NA + LPP ++ +PS Sbjct: 579 NETVA--TDWDRTHSWKSRKQGPIHSNAKNTEALPPPTSLDTVPS 621 >gi|99078489|ref|YP_611747.1| glycosyl transferase family protein [Ruegeria sp. TM1040] gi|99035627|gb|ABF62485.1| glycosyl transferase family 25 [Ruegeria sp. TM1040] Length = 279 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 50/266 (18%), Positives = 85/266 (31%), Gaps = 20/266 (7%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M + Y+ISL + R L F A+ + +QC+ Sbjct: 1 MKLATYMISLDRAKGRHAAMQATLRDAGLDAEFVSAVD-----LAETSEEELLQQCKSFG 55 Query: 61 LLSL---PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 + + C +SH +W++ S A A+I EDD S E Q LS L Sbjct: 56 PWGVFAHGNMACTLSHAKVWEQFLASDADVALIFEDDVFISAELRQWTEDLSWWPQGCEL 115 Query: 118 IKFDALRKKPKK--DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 + + R + K + + +P + GY + ++ A LL + Sbjct: 116 VNLEFWRSETLKVMVGTSASQHLGRQVAPMLSRNPGSAGYLVSRKGAEALL-ASRPFTMS 174 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-----VRKPTFSPLYFYRNTC 230 ID + + + EP + A+ T +T E RKP + + Sbjct: 175 IDQLLFNPLVSPLAV-SLEPHQITPALVTQGNTPPEEGTFLGHAQRKPKGALYRKQKRLR 233 Query: 231 YQWNLH---YNAWRKDLPPVSTTKFL 253 L + R + L Sbjct: 234 GMAELRALPLHLLRLISGRARLVRVL 259 >gi|73960471|ref|XP_849763.1| PREDICTED: similar to glycosyltransferase 25 domain containing 2 isoform 1 [Canis familiaris] Length = 626 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 104/285 (36%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ + +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIDVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI R + K+ GN T GY I E A L+ Sbjct: 464 ELIYIGRKRMQVKEPEKAVPNVGNLVEADYSYW---TLGYVISLEGAQKLV-GADPFGKM 519 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++H NA + LP ++ +PS Sbjct: 580 NETVA--TDWDRTHSWKSRKQSHIHRNAKNTEALPSPTSVDTVPS 622 >gi|119611577|gb|EAW91171.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Homo sapiens] gi|193787801|dbj|BAG53004.1| unnamed protein product [Homo sapiens] Length = 506 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 110/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 224 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 283 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 284 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 343 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 344 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 399 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 400 LPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 459 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ NA + LPP ++ +PS Sbjct: 460 NETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 502 >gi|317177550|dbj|BAJ55339.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 332 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 63/331 (19%), Positives = 106/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKTDQP-IVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P + + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPISLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + ++ + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFHKYSKQYAPLKTPK 330 >gi|317179019|dbj|BAJ56807.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F30] Length = 332 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 63/322 (19%), Positives = 104/322 (32%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQN 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGTGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKLQKKSLKSYPSPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RELFFYSLNAKKRLKAFHKYSK 321 >gi|301787505|ref|XP_002929168.1| PREDICTED: procollagen galactosyltransferase 2-like, partial [Ailuropoda melanoleuca] Length = 630 Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 52/275 (18%), Positives = 103/275 (37%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 348 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 407 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP----HLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 408 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLTKLMDDIDQAQLDW 467 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 468 ELIYIG--RKRMQVQEPEKAVPNVGNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 523 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 524 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSVEPLLIYPTHYTGQPGYLSDTETSTIWD 583 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V T + Q ++H NA + P Sbjct: 584 NETVA--TDWDRTHSWKSRKQSHIHRNAKNTEALP 616 >gi|157326159|gb|ABV44312.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 66/333 (19%), Positives = 111/333 (33%), Gaps = 85/333 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK S +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGASLTLPELGCYLSHYLLWKECVKSNQP-VVILEDDVTLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYR-NTCYQWNLHYNAWRKDLPPVSTTKF 252 L++Y N + N ++ P + +F Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPKEF 332 >gi|253989060|ref|YP_003040416.1| beta1,4-galactosyltransferase waax [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253780510|emb|CAQ83672.1| putative beta1,4-galactosyltransferase waax [Photorhabdus asymbiotica] Length = 251 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 51/248 (20%), Positives = 98/248 (39%), Gaps = 34/248 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++VI+LP R+ ++A R+ L F +A+ G++ + + + + Sbjct: 1 MKIFVINLPKDKERKASIQYQADRLGLNVEFIEAVNGKDLSEDEI---NILSKDFHQHGM 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + +GC +SHI +++++ A+ILEDDA ++ + + + N Sbjct: 58 THGVLGCSLSHIKIYEKMIKDNIDIALILEDDALLNENIIESYNLIESYNYKN------- 110 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPR-----------TTGYFIGKEAAIHLLNVRKN 171 K K + YL ++ + LS + T GY + +AA +LLN Sbjct: 111 ---KDKPNVYLLSVVNEYIDTFKTKLSTKYNLVNVIDADYTYGYMLNIKAANNLLN---- 163 Query: 172 IYRPIDMDMKHW---WEHN-IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 P+ ++ W EH I P + + ST+E R + + F+ Sbjct: 164 FLTPVWIEADKWRFMREHGAIKLKAIIPHVIDVTPLSAVSTLESDRSITLEKR--IAFFN 221 Query: 228 NTCYQWNL 235 NL Sbjct: 222 EQYQNRNL 229 >gi|15677756|ref|NP_274920.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis MC58] gi|9911086|sp|Q51117|LGTE_NEIMB RecName: Full=Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE gi|7227183|gb|AAF42255.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis MC58] gi|316985549|gb|EFV64496.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtE [Neisseria meningitidis H44/76] gi|325139580|gb|EGC62120.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis CU385] gi|325200978|gb|ADY96433.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis H44/76] Length = 280 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIAATFGVRGIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|148827119|ref|YP_001291872.1| putative UDP-GlcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittGG] gi|148718361|gb|ABQ99488.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae PittGG] Length = 191 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 70/174 (40%), Gaps = 20/174 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI ++ IF+ ++ + + R Sbjct: 2 LKKYLISLDKDIQRRELFFSQKNTE--DFQIFSAINTMQKDWDELASIFNIEQFKAHYGR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I I ++ EDDA F +F Q L L + Sbjct: 60 NVTKGEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDALFHSDFQQNLTALLSEKLEAE 119 Query: 117 LIKF-----------DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIG 158 +I D P S+LC GN + P T GY I Sbjct: 120 IILLGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAFPYKSYFAGTVGYLIK 173 >gi|317180389|dbj|BAJ58175.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 332 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 102/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RKLFFYSFNAKKRLKAFHKYSK 321 >gi|281346524|gb|EFB22108.1| hypothetical protein PANDA_019266 [Ailuropoda melanoleuca] Length = 635 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 52/275 (18%), Positives = 103/275 (37%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 353 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 412 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP----HLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 413 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLTKLMDDIDQAQLDW 472 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 473 ELIYIG--RKRMQVQEPEKAVPNVGNLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 528 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 529 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSVEPLLIYPTHYTGQPGYLSDTETSTIWD 588 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V T + Q ++H NA + P Sbjct: 589 NETVA--TDWDRTHSWKSRKQSHIHRNAKNTEALP 621 >gi|251792806|ref|YP_003007532.1| hypothetical protein NT05HA_1079 [Aggregatibacter aphrophilus NJ8700] gi|247534199|gb|ACS97445.1| Lsg locus putative protein 4 [Aggregatibacter aphrophilus NJ8700] Length = 256 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 59/235 (25%), Positives = 92/235 (39%), Gaps = 25/235 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F+ F AI + +F+ K + + R Sbjct: 1 MNKYLISLDKDVRRRELFFAQPDT--ADFTVFSAINTMQKEWDELAEVFNSTKFEQHYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHL--SKCDIN 114 ++ EIGC +SH+ ++++I A++ EDDA F+ S L KCD + Sbjct: 59 NVTKGEIGCTLSHLGVYRQIVEDRNVTENEYALVCEDDALFNANLSPKTTALLTEKCDAD 118 Query: 115 NILIK------FDALRKK---PKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIH 164 +LI FD + + P S+L G+ + P T Y I K AA Sbjct: 119 IVLIGQSKIAEFDDVELEINYPTTFSFLRKTFGDVTVAYPYKSYFAGTVAYLIKKSAARV 178 Query: 165 LLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L V K + D + + I + V P E S +E R + Sbjct: 179 FLTVLENEKPFWLA-DDFLLFETQFQINNNVVRPLVAIENPQLV-SNLEAIRGSK 231 >gi|152979041|ref|YP_001344670.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150840764|gb|ABR74735.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 292 Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats. Identities = 50/260 (19%), Positives = 88/260 (33%), Gaps = 25/260 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISLP + +RR+ + + ++ F FFDAI + N Q L+ Sbjct: 26 YVISLPTAVSRRQHIRNVFQQQNVPFEFFDAISPDIG--LNAAIDRFVPNLQQAEHLTAG 83 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 E C++SH+ LW++ I EDD + L + L + L+ Sbjct: 84 EKSCFMSHVSLWQKCLDDNLPYIAIFEDDILLGENAGDFLSDYQWFESRFDLFRPIILKL 143 Query: 126 KPKKDSYLCTLPGNFDIHQPR--------ILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 + + D ++ R GY I + A LL K + Sbjct: 144 ETALGKTNFKPLHHIDAYKSRQFMRLTAGRAHIGMAGYIISQATAKALLRYIKTLSAQTL 203 Query: 174 RPID-MDMKHWWEH-NIPSLVTEPG-AVYEAIDTND-----STIEESR---LVRKPTFSP 222 + ID + ++ + + P V ++ + S++E R +P P Sbjct: 204 KAIDLILFDDLLKNPDFQLVQLYPAICVQDSFYNKNNVTLLSSLEVERAHFKKVRPYTKP 263 Query: 223 LYFYRNTCYQWNLHYNAWRK 242 + Y + K Sbjct: 264 KSLFDVVRKIVTTPYRKYLK 283 >gi|148827955|ref|YP_001292708.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae PittGG] gi|148719197|gb|ABR00325.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae PittGG] Length = 237 Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 48/194 (24%), Positives = 80/194 (41%), Gaps = 8/194 (4%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F+FFDAI E + F+ + K LS Sbjct: 30 YVISLTTEQKRRKHITEEFGKQNIPFAFFDAITPEIIEETAKKFNITLDRSP-KAKLSDE 88 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD + +LL + + ++K +A K Sbjct: 89 EIGCALSHIVLWDVALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLEANGK 147 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 K ++ + +I+ + GY + + A +LL + K + +D + Sbjct: 148 MFFKQP--QSVKCDRNIYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 205 Query: 184 WEH--NIPSLVTEP 195 + H + L P Sbjct: 206 FLHFKDYKILQLSP 219 >gi|261838140|gb|ACX97906.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 332 Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 101/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RKLFFYSLNAKKRLKAFHKYSK 321 >gi|308184403|ref|YP_003928536.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] gi|308060323|gb|ADO02219.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] Length = 332 Score = 130 bits (326), Expect = 3e-28, Method: Composition-based stats. Identities = 63/322 (19%), Positives = 103/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 54 R------QCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWYHSYVGAGLTLPELGCYLSHYFLWKECVKLNQP-VVILEDDVALESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFSPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYAAHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTCVPCPVSLSEHSLDSTIQKPQKKSLKSYPSPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RELFFYNLNAKKRLKAFHKYSK 321 >gi|45768794|gb|AAH68118.1| Glycosyltransferase 25 domain containing 2 [Mus musculus] Length = 625 Score = 130 bits (326), Expect = 3e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRDPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMEDIDKAQLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYAISLEGAQKLV-GADPFGKM 518 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 519 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 578 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ A + LPP ++ +PS Sbjct: 579 NETVA--TDWDRTHSWKSRKQGHIRSTAKNTEALPPPTSVDTVPS 621 >gi|261839554|gb|ACX99319.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 52] Length = 332 Score = 130 bits (326), Expect = 3e-28, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 101/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYAAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 300 RKLFFYSLNAKKRLKAFHKYSK 321 >gi|308063592|gb|ADO05479.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 332 Score = 129 bits (325), Expect = 3e-28, Method: Composition-based stats. Identities = 61/320 (19%), Positives = 101/320 (31%), Gaps = 84/320 (26%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNRIF 49 VY+ISL S R + + + F FDAI ++ + Sbjct: 4 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQSML 62 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 63 KSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALEDCL 121 Query: 110 KCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS---------- 149 K + + + KF L K+ + N + + + Sbjct: 122 KSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNLST 181 Query: 150 ------------------------------------PRTTGYFIGKEAAIHLLNVRKNI- 172 T GY++ + A + + Sbjct: 182 HKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATERFK 241 Query: 173 -YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTFSP 222 P+DM M + H+I + P V + + DSTI++ + +K TF Sbjct: 242 IIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYQSPPKKSTFRE 301 Query: 223 LYFYRNTCYQWNLHYNAWRK 242 L+FY + ++ + K Sbjct: 302 LFFYSLNAKKRLKAFHKYSK 321 >gi|312116343|ref|YP_004013939.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] gi|311221472|gb|ADP72840.1| glycosyl transferase family 25 [Rhodomicrobium vannielii ATCC 17100] Length = 322 Score = 129 bits (325), Expect = 4e-28, Method: Composition-based stats. Identities = 46/259 (17%), Positives = 84/259 (32%), Gaps = 30/259 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRL 61 IPV++I+L + R + + + + + F AI G P + F+ ++ + R Sbjct: 8 IPVFLINLDRAPHRLARMDEKLRSLGIAYERFAAIDGRALPPDSVETFAPRETWTKPTRR 67 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 E+ C+ SH+ + + I A +LEDD D + +F L K N +K + Sbjct: 68 PIANEVACFQSHLSVLQIIRDRSYERACVLEDDLDLAPDFGTFLDPTLKFPPNTDFLKLE 127 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMK 181 + K K L + R Y + +LL K + +D Sbjct: 128 VTQVKRLKVLPLRPIGARTLGLISRGGDIGAAAYVVTARGVRNLLPNLKPM---VDFY-- 182 Query: 182 HWWEHNI-----PSLVTEPGAVYEAIDTN-----------DSTIEESRLVRKP------- 218 P + + T + + +RKP Sbjct: 183 DNQAFGAATLKPRVYHVIPFPADQEDGSEMGRPQEFVSEKKGTTPKGKRLRKPLHRKMKS 242 Query: 219 -TFSPLYFYRNTCYQWNLH 236 + F+R +H Sbjct: 243 SIKKRMSFHRKVAEYIRVH 261 >gi|305380|gb|AAA65534.1| lipopolysaccharide core [Haemophilus influenzae] Length = 294 Score = 129 bits (325), Expect = 4e-28, Method: Composition-based stats. Identities = 50/252 (19%), Positives = 86/252 (34%), Gaps = 30/252 (11%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-----IYGENNPICNRIFSHQKRQCQF--- 58 VIS+ + RR+ + +L FSFF+A I N N+ + Q Sbjct: 8 VISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQSNSILH 67 Query: 59 ----KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R+L+ E GC ISH LW + I EDD + L Sbjct: 68 NIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTR 127 Query: 115 NIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNV 168 I+ + + K + P N + T GY I + AA +++ Sbjct: 128 FDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEY 187 Query: 169 RKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVR 216 KNI +D +D+ ++ + + P++ + ++ S +E+ R R Sbjct: 188 LKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKERQKR 247 Query: 217 KPTFSPLYFYRN 228 + Sbjct: 248 PKIRKKKTLKQR 259 >gi|157326155|gb|ABV44310.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 129 bits (325), Expect = 4e-28, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 110/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYSFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPQKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|241203508|ref|YP_002974604.1| glycosyl transferase family 25 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240857398|gb|ACS55065.1| glycosyl transferase family 25 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 267 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 57/248 (22%), Positives = 101/248 (40%), Gaps = 13/248 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L + RR + A L F A+ G + + F + R Sbjct: 21 VNTYLINLDRAPLRRFRMERLLASFGLAFERVAAVDGAGLSLPHPGFDEASYLRRHGRRP 80 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + EIGCY+SH+ KR S A A+ILEDD DF D+ ++LL + +++ Sbjct: 81 NPFEIGCYLSHVECAKRFLGSHAEFALILEDDLDFDDDLAELLDAALEHQARWDILRLST 140 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + K + L + + + Y I ++AA + +V + P D+ Sbjct: 141 VNTGRK--HKVEPLTASRSLAIALTREKGSGAYLINRKAAGWIADVLVPMRLPYDLAFDL 198 Query: 183 WWEHNIPSLVTEPGAVYEAIDTNDSTIEES----RLVRKPTFSPLYF-----YRNTCYQW 233 ++ + + +P V + D S I+ RL R+ +S L + R ++ Sbjct: 199 EFDDGLSACFVDPLPVSQRADPC-SQIQAGLSTYRLGRRRPWSVLPYRAAAELRRFAARF 257 Query: 234 NLHYNAWR 241 + AWR Sbjct: 258 S-RLAAWR 264 >gi|33152027|ref|NP_873380.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Haemophilus ducreyi 35000HP] gi|33148249|gb|AAP95769.1| possible lipooligosaccharide N-acetylglucosamine glycosyltransferase [Haemophilus ducreyi 35000HP] Length = 264 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 48/241 (19%), Positives = 88/241 (36%), Gaps = 25/241 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + Y+ISL RRE+F +A F FDA+ + + +++R Sbjct: 2 MKKYLISLAKDQQRREQFFTQADT--ADFQVFDAVNTMQQEWDSLAVKYDITQFAQRYQR 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ +++ I ++ EDDA F+ + L L +++ Sbjct: 60 SVTKGEIGCTLSHLGVYQWIVADPQIGDNDYCLVCEDDALFNQDVQSSLTQLLSQNVHAD 119 Query: 117 LI--------KFDA---LRKKPKKDSYLCTLPGN--FDIHQPRI-LSPRTTGYFIGKEAA 162 ++ FD + P + GN + P T Y I K A Sbjct: 120 IVLVGQSKIFGFDDAMLEIEYPTTFAPQMVAIGNSIYRYAYPYKNYYAGTVAYLIKKSTA 179 Query: 163 IHLLNVRK-NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 L+++ + D + + + +V P E S +E +R + S Sbjct: 180 RKLVSITDLPFWLA-DDFILFEQQFGLDIMVVRPLMAIENPALC-SNLETARASTQQALS 237 Query: 222 P 222 Sbjct: 238 K 238 >gi|157164803|ref|YP_001467299.1| ExsB [Campylobacter concisus 13826] gi|112801958|gb|EAT99302.1| putative glycosyl transferase, family 25 [Campylobacter concisus 13826] Length = 253 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 56/222 (25%), Positives = 84/222 (37%), Gaps = 16/222 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y+ISL RRE + F DAI G + ++LS Sbjct: 4 IYLISLEKDTKRRELLQQKFGSYD-SFKLIDAIDGRELNAREYYKIISPSFKAYGKVLSP 62 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNILI----- 118 E+GC +SH+ ++ S A A+I EDD D + SK N++LI Sbjct: 63 AEVGCSLSHVKAYEAFLASDAKFALIFEDDVIGGDHAIKEAFLAASKMPENSVLICGMQD 122 Query: 119 ----KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 +F A KK DS L H + R Y + K++A +LL + K Sbjct: 123 GLEGRFSAFGKK--VDSSLSKPLWQVSKHSFSSIY-RAGAYVLTKKSAKNLLEIHKRALC 179 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 D+ +++ + A D + S IE RL R Sbjct: 180 TTDVWDYLLGVNDMQMYFCDLFA--HPTDLSGSNIEGERLER 219 >gi|157326149|gb|ABV44307.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 67/331 (20%), Positives = 110/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYFLWKECVKTNQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRK- 170 T GY++ + A + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFTEATEC 239 Query: 171 -NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 I P+DM M + H+I +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + ++ + K P+ T K Sbjct: 300 KNLFYYSINAKKRLKAFHKYSKRYAPLKTPK 330 >gi|118602876|ref|YP_904091.1| glycosyl transferase family protein [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] gi|118567815|gb|ABL02620.1| glycosyl transferase, family 25 [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] Length = 262 Score = 129 bits (324), Expect = 4e-28, Method: Composition-based stats. Identities = 46/198 (23%), Positives = 78/198 (39%), Gaps = 11/198 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRL 61 + ++VI+L S RR + + L F F DAI + + ++ + ++R Sbjct: 1 MNIFVINLENSVERRNFQQEQLFALKLDFEFLDAISTTDVSTTIHKQYCQD-----WQRP 55 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ CY SH W ++ S A+ILEDDA S +LL +LS + L+ + Sbjct: 56 LKATEVACYFSHRCAWNKVIQSNQP-ALILEDDALLSKCVPELLANLS-DKKSIDLVNLE 113 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP-IDMDM 180 +K L N + + GY + A L+ K D + Sbjct: 114 NRSRKKFISKSNVLLECNSKLLRLYQDRTGAAGYILYPNGAKKLIQCEKRKGIALADAHI 173 Query: 181 KHWWEHNIPSLVTEPGAV 198 ++I + EP + Sbjct: 174 TA--CNSIKAYQVEPAPI 189 >gi|312066988|ref|XP_003136531.1| hypothetical protein LOAG_00943 [Loa loa] gi|307768311|gb|EFO27545.1| hypothetical protein LOAG_00943 [Loa loa] Length = 544 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 92/238 (38%), Gaps = 37/238 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG---ENNPICNRI-FSHQKRQCQFK 59 +YVI+L R+ K + ++++++A G ++ P+ I F +K Sbjct: 294 KIYVINLERRKTRKVKMMELLKLMGFEYTWWEATDGHHLDSEPLYREIKFLPGYEDPFYK 353 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF----SDEFSQLLPHLSKCDINN 115 R + E+GC++SH +W+ + II EDD F +D +L+ + I Sbjct: 354 RPMKAGEVGCFLSHYRIWQEVDKKKLDRVIIFEDDLRFVVNSTDLLKELIEDIDSSRIEW 413 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 L+ R + ++++ PG+ + S T GY + A LL + + Sbjct: 414 DLVYLGRKRLEGANENWV---PGHRHLSTV-DYSYWTLGYMLSLNGARKLL-GGNPLRKL 468 Query: 175 -PIDMDMK-HWWEH------------NIPSLVTEPGAVYEA---------IDTNDSTI 209 P+D + + +H ++ + P V DT DS I Sbjct: 469 VPVDEYIPIMFNKHPNTTWTNAFPHRDLIAYTIYPTVVVPERYTYQHGYFSDTEDSDI 526 >gi|85704750|ref|ZP_01035851.1| glycosyl transferase, family 25 [Roseovarius sp. 217] gi|85670568|gb|EAQ25428.1| glycosyl transferase, family 25 [Roseovarius sp. 217] Length = 241 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 87/227 (38%), Gaps = 14/227 (6%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQF 58 M +V+ LP + ARRE A L+ + A+ G + + + Sbjct: 1 MTGRAFVLHLPRASARREAARALLASCGLEGEIWSAVDGSSLSSNELSATVGAGIFAPPY 60 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCDINNIL 117 L ++GC++SH +W I +I EDDA F++ L + Sbjct: 61 PFALKTSQLGCFLSHRQIWAEIVRRDLDYGLIFEDDATIDPVVFARALALARAQIDDLGY 120 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 I+F + + L G + + R+T +G++AA HLL++ + RP+D Sbjct: 121 IQFQTQALRG--PARLIDTNGGCALSLSLVSGLRSTAQMVGRDAAAHLLSLTETFDRPVD 178 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 ++ W + P V +T + RKP + ++ Sbjct: 179 CLIQSHWHTRLRPAGIYPSGVSRNAETVE---------RKPPLTQIW 216 >gi|297281262|ref|XP_002802062.1| PREDICTED: procollagen galactosyltransferase 2-like [Macaca mulatta] Length = 626 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 54/285 (18%), Positives = 109/285 (38%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GADPFGKM 519 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 520 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 579 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ +A + LPP ++ +PS Sbjct: 580 NETVA--TDWDRTHAWKSRKQSRIYSSAKNTEALPPPTSLDTVPS 622 >gi|145638540|ref|ZP_01794149.1| diadenosine tetraphosphatase [Haemophilus influenzae PittII] gi|145272135|gb|EDK12043.1| diadenosine tetraphosphatase [Haemophilus influenzae PittII] Length = 308 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 90/259 (34%), Gaps = 23/259 (8%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQCQF---KRLL 62 VIS+ + RR+ + L FSFF+A Y N N+ + R+L Sbjct: 30 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSNSILHNIEESRIL 89 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 90 TKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFI 149 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI---- 172 I+ + + K + P + T GY I + AA +++ KNI Sbjct: 150 IRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDE 209 Query: 173 YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 +D +D+ ++ + + P++ + V ++ S +E+ R R Sbjct: 210 IVAVDELIFNKLVDVDNYIVYQLNPAICIQELQVNQSKSVLTSGLEKEREKRPKIRKKKT 269 Query: 225 F-YRNTCYQWNLHYNAWRK 242 R T + N+ RK Sbjct: 270 LKQRLTRIKENIIRALNRK 288 >gi|325143639|gb|EGC65958.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M01-240013] gi|325206834|gb|ADZ02287.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M04-240196] Length = 280 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 75/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGAHDIPFQFFDALM-PSEALEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|322515189|ref|ZP_08068188.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Actinobacillus ureae ATCC 25976] gi|322118799|gb|EFX91000.1| lipooligosaccharide N-acetylglucosamine glycosyltransferase [Actinobacillus ureae ATCC 25976] Length = 256 Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats. Identities = 52/246 (21%), Positives = 89/246 (36%), Gaps = 26/246 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + Y+ISL ARRE F + F FDAI + + + K + +++R Sbjct: 1 MKKYLISLAKDSARRELFFSQPDT--ADFQVFDAINTMSLEWQELAQKYDLTKFEQRYQR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + E+GC SH+ ++++I +I ++ EDDA F+ F L L + D+ Sbjct: 59 KATKGEVGCTFSHLGVYQQIINDESITEHEYCLVCEDDALFNQGFQYTLDKLVQQDLRAD 118 Query: 117 LI-------------KFDALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAA 162 +I + K + + P T Y I K +A Sbjct: 119 IILVGQSKILSFEDPLLEIEFPTTFKQQMKPIDNTTYRLAYPYKNYYAGTVAYLIKKSSA 178 Query: 163 IHLL-NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTF 220 L + D + + + LV P E S +E +R ++ TF Sbjct: 179 RKFLTTGALPFWLA-DDFILFEQQWGLDILVVRPLMAIENP-VLTSNLEGARASQQQATF 236 Query: 221 SPLYFY 226 + Y Sbjct: 237 KKILKY 242 >gi|157326123|gb|ABV44294.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 128 bits (323), Expect = 5e-28, Method: Composition-based stats. Identities = 69/331 (20%), Positives = 110/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK S +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKSNQP-VVILEDDAMLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY+I + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYITPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANLTYVPCPVSLSEYSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNTKKRLNAFQQYSKQYAPLKTPK 330 >gi|261823574|ref|YP_003261680.1| glycosyl transferase family 25 [Pectobacterium wasabiae WPP163] gi|261607587|gb|ACX90073.1| glycosyl transferase family 25 [Pectobacterium wasabiae WPP163] Length = 249 Score = 128 bits (323), Expect = 5e-28, Method: Composition-based stats. Identities = 47/248 (18%), Positives = 84/248 (33%), Gaps = 19/248 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +V++L ++ K + L + A+ G K L Sbjct: 1 MKTFVVNLEKDKEKKVKIERECRKAELNYEIISAVDGRKINNNEL---KSKVHPVSLNYL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC +SH ++K+I A+ILEDD D S + + L F Sbjct: 58 SKGEIGCALSHQKVYKKIIDDDIEYALILEDDVDISQDVNNFLKDFLSVKNKTKGDVFLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + N+ ++ S GY I +AA ++ + I D + Sbjct: 118 YPSGLRFFNRKIEISNNYFFYEVY-NSSCAHGYIISNKAAKKIMRINTPIILVADAWLWF 176 Query: 183 WWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR--------------LVRKPTFSPLYFYR 227 + + V ID + STIE R + ++P + + +Y Sbjct: 177 YQISLLKVYALNKELVRAYDIDKSLSTIETERSLLLDEKEKHQMGIIKKQPFYYVIKYYH 236 Query: 228 NTCYQWNL 235 + L Sbjct: 237 KYIRRIFL 244 >gi|293397818|ref|ZP_06642024.1| glycosyl transferase, family 25 [Neisseria gonorrhoeae F62] gi|2497665|sp|Q50950|LGTE_NEIGO RecName: Full=Lacto-N-neotetraose biosynthesis glycosyltransferase lgtE gi|595814|gb|AAA68013.1| glycosyl transferase [Neisseria gonorrhoeae] gi|291611764|gb|EFF40833.1| glycosyl transferase, family 25 [Neisseria gonorrhoeae F62] Length = 280 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 76/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + K N T GY I +EA L+ + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPPERI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|309750044|gb|ADO80028.1| Lipooligosaccharide biosynthesis protein Lic2A [Haemophilus influenzae R2866] Length = 278 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 53/255 (20%), Positives = 91/255 (35%), Gaps = 23/255 (9%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VIS+ + RR+ + L FSFF+A ++ N I + + R+L+ E Sbjct: 8 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSNSILHNIEE----SRILTKGE 63 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFD 121 GC ISH LW + I EDD + L I+ + Sbjct: 64 KGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFIIRLE 123 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + K + P N + T GY I + AA +++ KNI + Sbjct: 124 TFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDEIVAV 183 Query: 177 D-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF-YR 227 D +D+ ++ + + P++ + ++ S +E+ R R R Sbjct: 184 DELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKERQKRSKIRKKKTLKQR 243 Query: 228 NTCYQWNLHYNAWRK 242 T + N+ RK Sbjct: 244 LTRIKENIIRALNRK 258 >gi|26343025|dbj|BAC35169.1| unnamed protein product [Mus musculus] Length = 625 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRGPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMEDIDKAQLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYAISLEGAQKLV-GADPFGKM 518 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 519 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 578 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ A + LPP ++ +PS Sbjct: 579 NETVA--TDWDRTHSWKSRKQGHIRSTAKNTEALPPPTSLDTVPS 621 >gi|254670883|emb|CBA07402.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha153] Length = 280 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 72/241 (29%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLAEDGWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|148707510|gb|EDL39457.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Mus musculus] Length = 625 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRDPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMEDIDKAQLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYAISLEGAQKLV-GADPFGKM 518 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 519 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 578 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ A + LPP ++ +PS Sbjct: 579 NETVA--TDWDRTHSWKSRKQGHIRSTAKNTEALPPPTSLDTVPS 621 >gi|228008340|ref|NP_808424.3| procollagen galactosyltransferase 2 precursor [Mus musculus] gi|160395572|sp|Q6NVG7|GT252_MOUSE RecName: Full=Procollagen galactosyltransferase 2; AltName: Full=Glycosyltransferase 25 family member 2; AltName: Full=Hydroxylysine galactosyltransferase 2; Flags: Precursor Length = 625 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 40/285 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRDPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMEDIDKAQLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 463 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYAISLEGAQKLV-GADPFGKM 518 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 519 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 578 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 + V T + Q ++ A + LPP ++ +PS Sbjct: 579 NETVA--TDWDRTHSWKSRKQGHIRSTAKNTEALPPPTSLDTVPS 621 >gi|254429889|ref|ZP_05043596.1| LPS glycosyltransferase subfamily, putative [Alcanivorax sp. DG881] gi|196196058|gb|EDX91017.1| LPS glycosyltransferase subfamily, putative [Alcanivorax sp. DG881] Length = 271 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 52/248 (20%), Positives = 87/248 (35%), Gaps = 12/248 (4%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKRLLSL 64 VI++ RR + + I L+F F D + + C + + +R L+ Sbjct: 24 VIAMKHEIERRNTIENNLSAIGLRFRFIDGLNYDETTTCKIKLGYRKEMSVKTIQRELTA 83 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 PEIGC++SH W + A +ILE DA E +++ L ++ L+ + + Sbjct: 84 PEIGCFVSHREAWIQ-ASRKNRPTLILESDALLDSESVEIIERLCNQAAHHDLVMLNYHK 142 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD-MKHW 183 P + Y + E A L + Y D + Sbjct: 143 CIPSFWKRSAINSRYSLVKFANRRVHCLASYLVSPEGARKLADQASTFYLTADDYATGGY 202 Query: 184 WEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV-----RKPTFSPLYFYRNTCYQWNLHYN 238 + +I P V + + STIEE+R + P + +R N Sbjct: 203 IDKDIDLFAVFPRCVSMSHLADQSTIEEARQEASKRTKNPDKNKASLFRKIELYLR---N 259 Query: 239 AWRKDLPP 246 R LPP Sbjct: 260 LRRALLPP 267 >gi|257464901|ref|ZP_05629272.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor 202] gi|257450561|gb|EEV24604.1| lipooligosaccharide biosynthesis protein [Actinobacillus minor 202] Length = 263 Score = 128 bits (323), Expect = 6e-28, Method: Composition-based stats. Identities = 49/242 (20%), Positives = 77/242 (31%), Gaps = 19/242 (7%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR + ++ F FFDAI + + F L+ Sbjct: 10 YVISLLTAKERRNHIIQEFGKQNIPFEFFDAITPDLIEEKAKEFGIDISSS----PLTKG 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW I EDD + L + + +IKF+ Sbjct: 66 EIACALSHIALWHLAKEKELDYICIFEDDIYLGENAKCFLTEAY-INNDVDIIKFETFSL 124 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDM-DMK 181 K K N + GY I K+ +L + PID Sbjct: 125 DKQPKWQKKEKHFLNRKFYTLTHRHVGMAGYLITKKGVRFILQELSKNKFIIPIDDILFD 184 Query: 182 HWWEHN-------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKP--TFSPLYFYRNTCYQ 232 ++ IP++ + + + S++E+ R R R + Sbjct: 185 QLLQNKEYKVWQIIPAICIQDFILNKTT-KFKSSLEKERDKRTANTPKKKRNLIRKIYRE 243 Query: 233 WN 234 Sbjct: 244 LT 245 >gi|86145075|ref|ZP_01063407.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Vibrio sp. MED222] gi|85837974|gb|EAQ56086.1| Putative Lex2B protein (lipooligosaccharide 5G8 epitopebiosynthesis-associated protein) [Vibrio sp. MED222] Length = 258 Score = 128 bits (322), Expect = 7e-28, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 80/255 (31%), Gaps = 15/255 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYG---ENNPICNRIFSHQKRQCQF 58 + +I+L R + I L A+ + +P+ + + Sbjct: 1 MKSLIIALERKSERFLSSQKQLLNIKSLDIEKLSAVDAMSSKPHPLMD-FYDETAFYALN 59 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R+ EIGCY SH W+R + +D D F + + SK + Sbjct: 60 GRIAVPGEIGCYGSHYLAWERCIQLNQPVIVFEDDVVVDVDVFEKTVQFASKHIEECGFM 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + + K + + + L + + TT Y I A + + P+D+ Sbjct: 120 RLENYSTKREYNYVVENLDDEQSLVRHIKTPLCTTAYMITPHVAKIFIEKSERFLYPVDV 179 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP-----LYFYRNTCYQW 233 +++ W H P+ P + S I K + L +R+ Sbjct: 180 FIRNVWLHKQPTYGISPAGLTGG--EGGSVIGNRSSRTKKSLRIKTLKFLSKFRDVAMNG 237 Query: 234 NLHYNAW---RKDLP 245 + +K P Sbjct: 238 LFNLTYTVIIKKSRP 252 >gi|240079872|ref|ZP_04724415.1| LgtE [Neisseria gonorrhoeae FA19] gi|268596015|ref|ZP_06130182.1| glycosyl transferase [Neisseria gonorrhoeae FA19] gi|6644287|gb|AAF20992.1| LgtE [Neisseria gonorrhoeae] gi|268549803|gb|EEZ44822.1| glycosyl transferase [Neisseria gonorrhoeae FA19] Length = 280 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 76/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + K N T GY I +EA L+ + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPPERI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|156390789|ref|XP_001635452.1| predicted protein [Nematostella vectensis] gi|156222546|gb|EDO43389.1| predicted protein [Nematostella vectensis] Length = 589 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 72/187 (38%), Gaps = 14/187 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +Y+I+L RR + A + +Q DA+ G+ K + K Sbjct: 332 IYMINLKRRPLRRRRMMASLAELGIQAKPVDAVDGKALTDAEVKALGIKMLPGYYDPYGK 391 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L++ EIGC++SH +WK + +ILEDD F +F + L ++ + Sbjct: 392 RPLTMGEIGCFLSHYKIWKEMIEKGLERVLILEDDVRFEPDFRRKLLAMIADANQLESKY 451 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 R++ K + L + T GY I E A L++ K + Sbjct: 452 NWDMIYVGRRRMKTE--LIGHVEGSQYINWVNYTWWTLGYMIKLEGARKLVS-AKPLTKM 508 Query: 174 RPIDMDM 180 ID + Sbjct: 509 MAIDEFL 515 >gi|48812|emb|CAA40221.1| unnamed protein product [Haemophilus influenzae] Length = 298 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 92/271 (33%), Gaps = 35/271 (12%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA---------IYGENNPICNRIFSHQKRQCQ 57 VIS+ + RR+ + +L FSFF+A I N N+ + Q Sbjct: 8 VISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSN 67 Query: 58 F-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R+L+ E GC ISH LW + I EDD + L Sbjct: 68 SILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEW 127 Query: 111 CDINNIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIH 164 I+ + + K + P N + T GY I + AA + Sbjct: 128 LKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKY 187 Query: 165 LLNVRKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEES 212 ++ KNI +D +D+ ++ + + P++ + ++ S +E+ Sbjct: 188 VIEYLKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKE 247 Query: 213 RLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 R R R T + N+ RK Sbjct: 248 RQKRSKIRKKKTLKQRLTRIKENIIRALNRK 278 >gi|108563025|ref|YP_627341.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] gi|107836798|gb|ABF84667.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] Length = 332 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 63/322 (19%), Positives = 106/322 (32%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK + +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDAMLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVLPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSSKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + + + K Sbjct: 300 KNLFYYSINAKKRLKAFQQYSK 321 >gi|301169267|emb|CBW28864.1| lipooligosaccharide biosynthesis protein lex-1 (ec 2.-.-.-) [Haemophilus influenzae 10810] Length = 298 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 92/271 (33%), Gaps = 35/271 (12%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA---------IYGENNPICNRIFSHQKRQCQ 57 VIS+ + RR+ + +L FSFF+A I N N+ + Q Sbjct: 8 VISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSN 67 Query: 58 F-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R+L+ E GC ISH LW + I EDD + L Sbjct: 68 SILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEW 127 Query: 111 CDINNIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIH 164 I+ + + K + P N + T GY I + AA + Sbjct: 128 LKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKY 187 Query: 165 LLNVRKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEES 212 ++ KNI +D +D+ ++ + + P++ + ++ S +E+ Sbjct: 188 VIEYLKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKE 247 Query: 213 RLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 R R R T + N+ RK Sbjct: 248 RQKRPKIRKKKTLKQRLTRIKENIIRALNRK 278 >gi|307637306|gb|ADN79756.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 908] gi|325995897|gb|ADZ51302.1| Beta-1,4-galactosyltransferase [Helicobacter pylori 2018] gi|325997492|gb|ADZ49700.1| putative glycosyltransferase [Helicobacter pylori 2017] Length = 332 Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 110/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + +++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYNAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDVTLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSSKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|157326157|gb|ABV44311.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 128 bits (321), Expect = 9e-28, Method: Composition-based stats. Identities = 65/330 (19%), Positives = 110/330 (33%), Gaps = 82/330 (24%) Query: 3 IPVYVISLPFSHARRE---KFCHRAARIHLQ--FSFFDAIYGENNP---ICNRIFSHQK- 53 I VY+ISL S R + ++ + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKLKGRCVFQIFDAISPKHQDFEKLLQELYDAQSL 61 Query: 54 -----RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 62 LQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVTLEPHFMQALEDC 120 Query: 109 SKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS--------- 149 K + + + KF L K+ + + N + + + Sbjct: 121 LKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNLS 180 Query: 150 -------------------------------------PRTTGYFIGKEAAIHLLNVRKNI 172 T GY++ + A + + Sbjct: 181 THKVIHYILKKIQKNYYSMHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATERF 240 Query: 173 --YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTFS 221 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 241 KIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPQKSTFK 300 Query: 222 PLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 301 NLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|3983105|gb|AAC83803.1| glycosyltransferase homolog Lob2A [Histophilus somni] Length = 282 Score = 128 bits (321), Expect = 9e-28, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 74/213 (34%), Gaps = 13/213 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ ++ H+ ++FFDA N+ K L+ Sbjct: 4 YVISLKSAVDRRQHIEKEFSQQHIPYTFFDAFQ---LNEENKHILQHYVPALSKSNLANK 60 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NNILIKF 120 C++SH+ LW++ I EDD S ++ L +++ Sbjct: 61 LKSCFMSHVLLWQKCVDENLPFIAIFEDDIFLSSVANKFLSEYDWLKERLAFNQYFILRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN---IYRPID 177 + + K + N ++ GY + + AA LL + KN ID Sbjct: 121 ETIYMKSEYCKTKIPDYQNRKFNRLMSGHFGMGGYILSQVAAKKLLALFKNTDIFIEEID 180 Query: 178 --MDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + + + P EP + + N Sbjct: 181 QIIFKQLLKDEQYPVYQCEPAIIIQDCILNKDN 213 >gi|154245878|ref|YP_001416836.1| glycosyl transferase family protein [Xanthobacter autotrophicus Py2] gi|154159963|gb|ABS67179.1| glycosyl transferase family 25 [Xanthobacter autotrophicus Py2] Length = 318 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 58/180 (32%), Gaps = 13/180 (7%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 P+ ++VI+L R + + L F +A+ G+ + Sbjct: 20 PMLIFVINLARRPDRLAFMRGQLDALGLAFERIEAVDGQREDVGQ------------GTD 67 Query: 62 L-SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L + EI C +SH W+R + +ILEDD + LL + +++ Sbjct: 68 LITPVEIACAMSHRKAWQRFLETGENRCLILEDDVLIAPRAKLLLEDPRHLPADADIVRL 127 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + +K +H + I + A + PID M Sbjct: 128 ETGLQKSLVGRARRCGLAGHRVHALYSRHHGSAACIITRGFAERAVRSLTTFVEPIDDIM 187 >gi|260914134|ref|ZP_05920607.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260631767|gb|EEX49945.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 259 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 87/235 (37%), Gaps = 28/235 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG--ENNPICNRIFSHQKRQCQFKR 60 + ++ISL RRE F + F + AI E +F+ ++ Q ++ R Sbjct: 1 MKKFLISLEKDEYRRELFFQQPDTQ--DFEVYSAINTMNETWEKLTALFNVEQFQQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH+ ++++I ++ EDDA F+ F + L + + Sbjct: 59 AVTKGEIGCTLSHLGVYQKIVDDQAISEDEYCLVCEDDALFNQHFLNNVTALLQQERQVD 118 Query: 117 LI--------KFDALRKK---PKKDSYLCTLPGN---FDIHQPRILSPRTTGYFIGKEAA 162 LI FD + P S+LC N + + T Y I K AA Sbjct: 119 LILVGQSKILHFDDNELEINYPVTFSWLCKKIANTEFCYAYPYKDYFAGTVAYLIKKSAA 178 Query: 163 IHLLNVRK----NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 L + + D + + I + V P E S +E R Sbjct: 179 RSFLIQVQQNPPPFWLA-DDFILFQQQFGIKNKVVRPLLAIENP-ALVSNLESLR 231 >gi|157326145|gb|ABV44305.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 66/331 (19%), Positives = 112/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKK---------------PKKDSYLCTLPGNFDI 142 K + + + KF L K+ +K +P ++ Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNTPILEKFKKFFDVPRFLNL 179 Query: 143 HQPRILS-------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 +++ T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPQKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|254720067|ref|ZP_05181878.1| glycosyl transferase family protein [Brucella sp. 83/13] gi|265985071|ref|ZP_06097806.1| glycosyl transferase [Brucella sp. 83/13] gi|306839381|ref|ZP_07472195.1| glycosyl transferase family protein [Brucella sp. NF 2653] gi|264663663|gb|EEZ33924.1| glycosyl transferase [Brucella sp. 83/13] gi|306405504|gb|EFM61769.1| glycosyl transferase family protein [Brucella sp. NF 2653] Length = 265 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 69/245 (28%), Gaps = 18/245 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L + R + + F A + + R S + + L Sbjct: 1 MKCYLINLDRAPERLARMTAILESFDIPFERVPATDAKQHTFNERAVS--SKTPSGQPAL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 +I C SH+ RI A ILEDD + + L + +IK + Sbjct: 59 VGGDIACGFSHVECLTRIVEGSDQYAAILEDDLHLASDIKSYLNSAEWIPEGSDIIKLET 118 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + +P I Y I ++AA +L ID+ + Sbjct: 119 FDEPTLLGPSIKKMPVGRIIAPLLKGHFGAVLYIISRDAARRILQEFDPEVEYIDVYLFE 178 Query: 183 WWEHNIPSLVTEPGAVYEA----------------IDTNDSTIEESRLVRKPTFSPLYFY 226 P + D D + + R K + Sbjct: 179 TILERYRVYQVSPAPAIQDNIGGFKSATFLTSGIAPDRRDQHVVKYRGWAKTKRETKRVF 238 Query: 227 RNTCY 231 CY Sbjct: 239 SRICY 243 >gi|157326139|gb|ABV44302.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 71/334 (21%), Positives = 107/334 (32%), Gaps = 93/334 (27%) Query: 3 IPVYVISLPFSHARR----------EKFCHRAARIHLQFSFFDAIYGENNPI-------- 44 I VY+ISL S R EKF R A F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCA-----FQIFDAISPKHEDFEKFVQELY 56 Query: 45 -CNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 + L+LPE+GCY+SH LWK +ILEDDA F Q Sbjct: 57 DAQSMLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDAMLESHFMQ 115 Query: 104 LLPHLSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS---- 149 L K + + + KF L K+ + N + + + Sbjct: 116 ALEDCLKSPFDFVRLYGHYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSR 175 Query: 150 ------------------------------------------PRTTGYFIGKEAAIHLLN 167 T GY++ + A + Sbjct: 176 FLNLSTHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIE 235 Query: 168 VRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------R 216 + P+DM M + H+I +L P V + + DSTI++ + + Sbjct: 236 ATERFKIIEPVDMFMDNSVYHDIANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPK 295 Query: 217 KPTFSPLYFYR-NTCYQWNLHYNAWRKDLPPVST 249 K TF L++Y N + N R+ P + Sbjct: 296 KSTFKNLFYYSINAKKRLNAFQQYSRRYTPLKTP 329 >gi|194210402|ref|XP_001489806.2| PREDICTED: glycosyltransferase 25 domain containing 2 [Equus caballus] Length = 578 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 50/284 (17%), Positives = 107/284 (37%), Gaps = 38/284 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 296 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 355 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + + ++ Sbjct: 356 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDDIDRAQLDW 415 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 416 ELIYIG--RKRMQVKEPEKAVPNVVNLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 471 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +++ ++ + EP +Y DT STI + Sbjct: 472 LPVDEFLPIMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDTETSTIWD 531 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKFLPS 255 + V + + +++ + + LP ++ +PS Sbjct: 532 NETV-ATDWDRTHSWKSRKQSRIYRNSKNTEALPSPTSLDTVPS 574 >gi|170719163|ref|YP_001784308.1| glycosyl transferase [Haemophilus somnus 2336] gi|168827292|gb|ACA32663.1| glycosyl transferase family 25 [Haemophilus somnus 2336] Length = 271 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 74/213 (34%), Gaps = 13/213 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ ++ H+ ++FFDA N+ K L+ Sbjct: 4 YVISLKSAVDRRQHIEKEFSQQHIPYTFFDAFQ---LNEENKHILQHYVPALSKSNLANK 60 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NNILIKF 120 C++SH+ LW++ I EDD S ++ L +++ Sbjct: 61 LKSCFMSHVLLWQKCVDENLPFIAIFEDDIFLSSVANKFLSEYDWLKERLAFNQYFILRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN---IYRPID 177 + + K + N ++ GY + + AA LL + KN ID Sbjct: 121 ETIYMKSEYCKTKIPDYQNRKFNRLMSGHFGMGGYILSQVAAKKLLALFKNTDIFIEEID 180 Query: 178 --MDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + + + P EP + + N Sbjct: 181 QIIFKQLLKDEQYPVYQCEPAIIIQDCILNKDN 213 >gi|157326129|gb|ABV44297.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 66/331 (19%), Positives = 105/331 (31%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYNAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK S +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECIKSNQP-VVILEDDIVLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATDR 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|145637408|ref|ZP_01793067.1| diadenosine tetraphosphatase [Haemophilus influenzae PittHH] gi|145269354|gb|EDK09298.1| diadenosine tetraphosphatase [Haemophilus influenzae PittHH] Length = 307 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 54/259 (20%), Positives = 90/259 (34%), Gaps = 23/259 (8%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQCQ---FKRLL 62 VIS+ + RR+ + L FSFF+A Y N N+ + R+L Sbjct: 29 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSNSILHNIEKSRIL 88 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 89 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFI 148 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI---- 172 I+ + + K + P N + T GY I + AA +++ KNI Sbjct: 149 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDE 208 Query: 173 YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 +D +D+ ++ + + P++ + ++ S +E+ R R Sbjct: 209 IVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKERQKRPKIRKKKT 268 Query: 225 F-YRNTCYQWNLHYNAWRK 242 R T + N+ RK Sbjct: 269 LKQRLTRIKENIIRALNRK 287 >gi|157959946|ref|YP_001499980.1| glycosyl transferase family protein [Shewanella pealeana ATCC 700345] gi|157844946|gb|ABV85445.1| glycosyl transferase family 25 [Shewanella pealeana ATCC 700345] Length = 247 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 83/242 (34%), Gaps = 17/242 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL RR H + + F FFDAI P+ N +++ + L Sbjct: 1 MRCFVISLVTEDRRRSHISHEFSSQAIPFEFFDAI----TPVQNEAEANRLQILTQGTTL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E C +SH+ + ++I A I EDD S + L + + L+K + Sbjct: 57 SKGETSCLLSHVAILQKIVDESIPYAAIFEDDIHLSKDAYHYLCSENWIPDSVELVKLEM 116 Query: 123 LRKKPK-KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPID-- 177 K + L ++ + + GY + A LLN + ID Sbjct: 117 FNHIAKGQFLGTHKLESGKELFRLKGRHLGAAGYIVTCNMASQLLNEVRRCEPVEAIDKI 176 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDT------NDSTIEESRLVRKPTFSPLYFYRNTCY 231 + + + + +P + S++ R++ P L Sbjct: 177 IFEELITSNKVDVYQLQPAICAQDDIIDRKNSVLHSSLNAERIL--PAKKKLSLAMKAKR 234 Query: 232 QW 233 + Sbjct: 235 EL 236 >gi|298736312|ref|YP_003728838.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|298355502|emb|CBI66374.1| jhp0562-like glycosyltransferase, family 25 [Helicobacter pylori B8] Length = 332 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 63/331 (19%), Positives = 105/331 (31%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK S +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKSNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM + + H++ + P V + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFINNPTYHDVANFTYLPCPVSLNEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + ++ + K P+ T K Sbjct: 300 KNLFYYSFNAKKRLNAFHKYSKQYAPLKTPK 330 >gi|157326131|gb|ABV44298.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 109/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + +++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYNAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|315454038|ref|YP_004074308.1| glycosyltransferase [Helicobacter felis ATCC 49179] gi|315133090|emb|CBY83718.1| Jhp0562-like glycosyltransferase [Helicobacter felis ATCC 49179] Length = 290 Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats. Identities = 53/246 (21%), Positives = 86/246 (34%), Gaps = 41/246 (16%) Query: 1 MPIPVYVISLPFSHARREKFCHRAAR-------IHLQFSFFDAIYGEN---------NPI 44 M +PVY ISL H RR+ + L+F FDAI + + + Sbjct: 1 MVLPVYFISLK-DHPRRKHCQSIIDNPPAQEGDLKLEFHLFDAIDKHSCYFKKDLGYSSL 59 Query: 45 CNRIFSHQK------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 +S + + R L L E+GCY SH LW R +ILEDD Sbjct: 60 IENSYSSEWLKESEWFRGAHGRELLLEELGCYASHYMLWLRCIELN-CPIVILEDDVVLK 118 Query: 99 DEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI---------------H 143 F + L H + ++ A + K + G + + Sbjct: 119 PHFYESLVHCMQSP--FDFVRLFATFWEIKTKKSVIAHTGGTQVLPTDSRIEVILKEHFY 176 Query: 144 QPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAID 203 + Y++ +AA ++ + P+D + H IP+L P +V + Sbjct: 177 LSALEVFAAAAYYLTPKAAKAFVSASLHFTEPVDNLLTLVHTHRIPNLTYIPLSVGFSKH 236 Query: 204 TNDSTI 209 + S I Sbjct: 237 HSISAI 242 >gi|157326141|gb|ABV44303.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 63/322 (19%), Positives = 105/322 (32%), Gaps = 87/322 (27%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDIVLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLKHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ + P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + NA+ K Sbjct: 300 KNLFYYNLNAKK---RLNAFHK 318 >gi|261492320|ref|ZP_05988882.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261312003|gb|EEY13144.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 268 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 57/263 (21%), Positives = 90/263 (34%), Gaps = 35/263 (13%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFK 59 + Y+ISL RRE F + F F AI + N F + ++ Sbjct: 7 KMKKYLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYG 64 Query: 60 RLLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 R + E+GC +SH+ ++++IA ++ EDDA F+D+F + L L ++ Sbjct: 65 RKTTKGEVGCTLSHLAVYQKIAEDRTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKV 124 Query: 116 ILIK--------FDALRKKPKKDSYLCTLPGNFDIHQPRI------LSPRTTGYFIGKEA 161 +I F+ + S L N Q + T Y I K Sbjct: 125 DIILVGQSKIPTFNDSELELNYPSTFGFLQKNIPNSQYKYAYSYLNYFAGTVAYLIKKST 184 Query: 162 AIHLLNVR--KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE-------- 211 A L KN Y D + L+ P E S +E+ Sbjct: 185 ACQFLAQCAGKNPYWLADDFILFEKNFKFDVLIVRPLMAIENP-VLISNLEDLRGSLNNN 243 Query: 212 --SRLVRKPTFSPLYFYRNTCYQ 232 + V+ P L RN + Sbjct: 244 LFKKAVKYPLKKLLAIKRNFTQR 266 >gi|157326135|gb|ABV44300.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 105/331 (31%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHEDFEKLVQELCDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDAMLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T Y++ + A + + Sbjct: 180 STHKVIHYIFKKIQKSYYSTHEKEALFLEHFYLTSVYVASTASYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|325133385|gb|EGC56049.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M13399] Length = 280 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 74/241 (30%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLSTAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|217032011|ref|ZP_03437512.1| hypothetical protein HPB128_187g38 [Helicobacter pylori B128] gi|216946321|gb|EEC24927.1| hypothetical protein HPB128_187g38 [Helicobacter pylori B128] Length = 332 Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats. Identities = 63/331 (19%), Positives = 105/331 (31%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK S +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGTGLTLPELGCYLSHYLLWKECVKSNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM + + H++ + P V + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFINNPTYHDVANFTYLPCPVSLNEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + ++ + K P+ T K Sbjct: 300 KNLFYYSFNAKKRLNAFHKYSKQYAPLKTPK 330 >gi|110006055|gb|ABG48530.1| nonfunctional LpsA [Haemophilus influenzae] gi|110006059|gb|ABG48532.1| nonfunctional LpsA [Haemophilus influenzae] Length = 237 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 86/217 (39%), Gaps = 32/217 (14%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ + K LS Sbjct: 9 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNITLDRSP-KAKLSDG 67 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC +SHI LW + I EDD H+ K + N + K Sbjct: 68 EIGCALSHIVLWDLALENNLNYINIFEDDI-----------HVLKLEANGKM-----FFK 111 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHW 183 +PK ++ + +++ + GY + + A +LL + K + +D + Sbjct: 112 QPK------SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSLVFED 165 Query: 184 WEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + H + + PG V + +S+++E R Sbjct: 166 FLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 202 >gi|254804257|ref|YP_003082478.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha14] gi|254667799|emb|CBA03766.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha14] gi|254672588|emb|CBA06271.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha275] Length = 279 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 73/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQVMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I ++A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|59802453|ref|YP_209165.1| LgtE [Neisseria gonorrhoeae FA 1090] gi|240015736|ref|ZP_04722276.1| LgtE [Neisseria gonorrhoeae FA6140] gi|59719348|gb|AAW90753.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria gonorrhoeae FA 1090] Length = 280 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 76/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + K N T GY I +EA L+ + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPPERI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|325201435|gb|ADY96889.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis M01-240149] Length = 275 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 46/251 (18%), Positives = 78/251 (31%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|14586735|gb|AAK70340.1|AF313394_4 glycosyl transferase LgtE [Neisseria gonorrhoeae] Length = 280 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 41/242 (16%), Positives = 75/242 (30%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + G T GY I +EA L+ + I Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLIQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|209696428|ref|YP_002264359.1| putative glycosyltransferase [Aliivibrio salmonicida LFI1238] gi|208010382|emb|CAQ80725.1| putative glycosyltransferase [Aliivibrio salmonicida LFI1238] Length = 231 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 54/232 (23%), Positives = 90/232 (38%), Gaps = 20/232 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICN--RIFSHQKRQCQFK 59 I V V+SL S RR R + + F F+DA I + ++F ++ + + Sbjct: 2 IKVIVLSLKDS-PRRAIITERLKKRGITDFEFYDAFDARAMEISDLEKLFDVKRFRDTYG 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF---------SQLLPHLSK 110 R + EIGC +SH+ +WKRI+ S ++LEDDA F L L Sbjct: 61 REPARGEIGCTLSHLGIWKRISESDCENWMVLEDDAILMPWFGILFREGNYPSNLTILGH 120 Query: 111 CDINNILIKFDALRKKPKKDSYLCT--LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN- 167 ++ +++ K + + + G + T G+ + K +AI L Sbjct: 121 SKLSVWKGFLSNIKRIVIKPNLIASVRKFGYLLGEKSEYHWYGTVGFALPKVSAIKLDKE 180 Query: 168 -VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 + D + + P VYE +STIE R++ K Sbjct: 181 LGTYPFFLADDFLIYSKYVT---INHAHPYFVYEDFSGLESTIEHERVMNKK 229 >gi|157326147|gb|ABV44306.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 66/331 (19%), Positives = 108/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVALESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K N + + KF L K+ + N + + + Sbjct: 120 CLKSPFNFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPQKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNARKRLNAFQQYSKQYAPLKTPK 330 >gi|221115475|ref|XP_002156385.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 589 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 77/193 (39%), Gaps = 12/193 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR----IFSHQKRQCQF-K 59 ++VI+L R +K + +F F+A+ G IF + + + + Sbjct: 325 IFVINLIRRQDRFKKMSFLLKELGFKFRHFEAVDGRELSAKKVKEMGIFPLEGFKDPYLE 384 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R ++L EIGC++SH +W + ++LEDD F F++ L L + Sbjct: 385 RPMTLGEIGCFLSHWRIWNEVIDRSLDMVLVLEDDVRFESGFNKKLHQLLVSADEIQKNQ 444 Query: 120 ---FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--R 174 F L +K + G+ I + S T GY I A L+ + Sbjct: 445 PWDFMYLGRKRMSSQMEYYVEGSDKIVWAKY-SYWTLGYIIRLSGAKKLI-AGDPLSKML 502 Query: 175 PIDMDMKHWWEHN 187 P+D + ++++ Sbjct: 503 PVDEYIPIMFDNH 515 >gi|240127369|ref|ZP_04740030.1| LgtE [Neisseria gonorrhoeae SK-93-1035] gi|268685740|ref|ZP_06152602.1| glycosyl transferase [Neisseria gonorrhoeae SK-93-1035] gi|268626024|gb|EEZ58424.1| glycosyl transferase [Neisseria gonorrhoeae SK-93-1035] Length = 280 Score = 126 bits (318), Expect = 3e-27, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 77/242 (31%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|325133389|gb|EGC56053.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis M13399] Length = 279 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 76/243 (31%), Gaps = 28/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHVVLWKQALDEGVPYIAVFEDDVLLGEGVEQFLAEDTWLEERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPT 219 P+D M +P P + + S IE RL+ + Sbjct: 179 HPVDWMMFGNPDDRE---GMPVYQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQ 235 Query: 220 FSP 222 Sbjct: 236 QRR 238 >gi|6502581|gb|AAF14363.1|AF121135_5 glycosyltransferase [Neisseria gonorrhoeae] Length = 280 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 78/242 (32%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD F + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGVPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|240124854|ref|ZP_04737740.1| glycosyltransferase [Neisseria gonorrhoeae SK-92-679] gi|268683434|ref|ZP_06150296.1| glycosyl transferase [Neisseria gonorrhoeae SK-92-679] gi|268623718|gb|EEZ56118.1| glycosyl transferase [Neisseria gonorrhoeae SK-92-679] Length = 280 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 78/242 (32%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD F + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|317014039|gb|ADU81475.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Gambia94/24] Length = 332 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 68/331 (20%), Positives = 112/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARR--EKFC----HRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R EKF + + F FDAI ++ + +++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKFVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYNAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKETFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPQKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|254360553|ref|ZP_04976702.1| possible glycosyltransferase [Mannheimia haemolytica PHL213] gi|153091093|gb|EDN73098.1| possible glycosyltransferase [Mannheimia haemolytica PHL213] Length = 261 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 57/262 (21%), Positives = 90/262 (34%), Gaps = 35/262 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + N F + ++ R Sbjct: 1 MKKYLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + E+GC +SH+ ++++IA ++ EDDA F+D+F + L L ++ Sbjct: 59 KTTKGEVGCTLSHLAVYQKIAEDSTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKVD 118 Query: 117 LIK--------FDALRKKPKKDSYLCTLPGNFDIHQPRI------LSPRTTGYFIGKEAA 162 +I F+ + S L N Q + T Y I K A Sbjct: 119 IILVGQSKIPTFNDSELELNYPSTFGFLQKNIPNSQYKYAYSYLNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVR--KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE--------- 211 L KN Y D + L+ P E S +E+ Sbjct: 179 CQFLAQCAGKNPYWLADDFILFEKNFKFDVLIVRPLMAIENP-VLISNLEDLRGSLNNNL 237 Query: 212 -SRLVRKPTFSPLYFYRNTCYQ 232 + V+ P L RN + Sbjct: 238 FKKAVKYPLKKLLAIKRNFTQR 259 >gi|240114826|ref|ZP_04728888.1| glycosyltransferase [Neisseria gonorrhoeae PID18] gi|260441358|ref|ZP_05795174.1| glycosyltransferase [Neisseria gonorrhoeae DGI2] gi|268600482|ref|ZP_06134649.1| glycosyl transferase [Neisseria gonorrhoeae PID18] gi|291044718|ref|ZP_06570427.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria gonorrhoeae DGI2] gi|268584613|gb|EEZ49289.1| glycosyl transferase [Neisseria gonorrhoeae PID18] gi|291011612|gb|EFE03608.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria gonorrhoeae DGI2] Length = 280 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 41/242 (16%), Positives = 77/242 (31%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|15611629|ref|NP_223280.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155103|gb|AAD06135.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 332 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 66/331 (19%), Positives = 111/331 (33%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + +++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYNAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDHSFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRK- 170 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATES 239 Query: 171 -NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 I P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPQKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|240013290|ref|ZP_04720203.1| LgtE [Neisseria gonorrhoeae DGI18] gi|240120362|ref|ZP_04733324.1| LgtE [Neisseria gonorrhoeae PID24-1] Length = 280 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 76/241 (31%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + K N T GY I +EA L+ + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPPERI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|240122666|ref|ZP_04735622.1| LgtE [Neisseria gonorrhoeae PID332] gi|268681263|ref|ZP_06148125.1| glycosyl transferase [Neisseria gonorrhoeae PID332] gi|268621547|gb|EEZ53947.1| glycosyl transferase [Neisseria gonorrhoeae PID332] Length = 280 Score = 126 bits (316), Expect = 3e-27, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 77/242 (31%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|157326121|gb|ABV44293.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 64/331 (19%), Positives = 105/331 (31%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLVQELYNAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-VVILEDDITLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPKKLTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKRYAPLKTPK 330 >gi|261496106|ref|ZP_05992514.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308208|gb|EEY09503.1| putative glycosyltransferase [Mannheimia haemolytica serotype A2 str. OVINE] Length = 261 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 57/262 (21%), Positives = 90/262 (34%), Gaps = 35/262 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + Y+ISL RRE F + F F AI + N F + ++ R Sbjct: 1 MKKYLISLEKDVKRRELFFSQPNTE--DFEVFSAINTMSSDWEALNNHFDLALFEQRYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + E+GC +SH+ ++++IA ++ EDDA F+D+F + L L ++ Sbjct: 59 KTTKGEVGCTLSHLAVYQKIAEDRTIAEDDYCLVCEDDALFADDFQKHLDALLAQNLKVD 118 Query: 117 LIK--------FDALRKKPKKDSYLCTLPGNFDIHQPRI------LSPRTTGYFIGKEAA 162 +I F+ + S L N Q + T Y I K A Sbjct: 119 IILVGQSKIPTFNDSELELNYPSTFGFLQKNIPNSQYKYAYSYLNYFAGTVAYLIKKSTA 178 Query: 163 IHLLNVR--KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE--------- 211 L KN Y D + L+ P E S +E+ Sbjct: 179 CQFLAQCAGKNPYWLADDFILFEKNFKFDVLIVRPLMAIENP-VLISNLEDLRGSLNNNL 237 Query: 212 -SRLVRKPTFSPLYFYRNTCYQ 232 + V+ P L RN + Sbjct: 238 FKKAVKYPLKKLLAIKRNFTQR 259 >gi|119605027|gb|EAW84621.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Homo sapiens] Length = 645 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 366 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 425 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 426 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 485 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 486 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 541 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 542 LPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 588 >gi|22760716|dbj|BAC11307.1| unnamed protein product [Homo sapiens] Length = 622 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 343 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 463 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 518 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 519 LPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 565 >gi|325135520|gb|EGC58138.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis M0579] Length = 280 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 74/241 (30%), Gaps = 22/241 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASTKERRAHIADTFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWI 178 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQGRRHRRS 238 Query: 223 L 223 L Sbjct: 239 L 239 >gi|31377697|ref|NP_078932.2| procollagen galactosyltransferase 1 precursor [Homo sapiens] gi|74715064|sp|Q8NBJ5|GT251_HUMAN RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|22761754|dbj|BAC11684.1| unnamed protein product [Homo sapiens] gi|80478641|gb|AAI08309.1| Glycosyltransferase 25 domain containing 1 [Homo sapiens] gi|119605028|gb|EAW84622.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Homo sapiens] Length = 622 Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 343 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 463 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 518 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 519 LPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 565 >gi|194100120|ref|YP_002003260.1| glycosyltransferase [Neisseria gonorrhoeae NCCP11945] gi|239998110|ref|ZP_04718034.1| glycosyltransferase [Neisseria gonorrhoeae 35/02] gi|254492890|ref|ZP_05106061.1| glycosyl transferase [Neisseria gonorrhoeae 1291] gi|268593961|ref|ZP_06128128.1| glycosyltransferase [Neisseria gonorrhoeae 35/02] gi|193935410|gb|ACF31234.1| glycosyltransferase [Neisseria gonorrhoeae NCCP11945] gi|226511930|gb|EEH61275.1| glycosyl transferase [Neisseria gonorrhoeae 1291] gi|268547350|gb|EEZ42768.1| glycosyltransferase [Neisseria gonorrhoeae 35/02] Length = 280 Score = 125 bits (315), Expect = 4e-27, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 78/242 (32%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD F + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGVPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L+ + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D+ M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDLMMFTYFFDKEGMPVYQVSPALCTQELHYAKFLSQNSMLGSDLEKDREQGRRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|74217150|dbj|BAE43293.1| unnamed protein product [Mus musculus] Length = 617 Score = 125 bits (315), Expect = 4e-27, Method: Composition-based stats. Identities = 54/275 (19%), Positives = 97/275 (35%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFK 59 V++I+L RRE+ + +A+ G+ R Sbjct: 338 VFMINLKRRRDRRERMLRALHEQEIDCQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 397 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 457 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL K + Sbjct: 458 DLI--YVGRKRMQVEHPEKAVPHVRNLVEA-DYSYWTLAYVISLQGAQKLL-AAKPLAKM 513 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ N+ + EP +Y DT S + Sbjct: 514 LPVDEFLPVMFDKHPMSEYKSHFSPRNLRAFSVEPLLIYPTHYTGDDGYVSDTETSVVWN 573 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V+ T + Q L A D+ Sbjct: 574 NEQVK--TDWDRAKSQKMREQQALSREAKNSDVLQ 606 >gi|170784829|ref|NP_666323.2| procollagen galactosyltransferase 1 precursor [Mus musculus] gi|160395574|sp|Q8K297|GT251_MOUSE RecName: Full=Procollagen galactosyltransferase 1; AltName: Full=Glycosyltransferase 25 family member 1; AltName: Full=Hydroxylysine galactosyltransferase 1; Flags: Precursor gi|34785210|gb|AAH56951.1| Glycosyltransferase 25 domain containing 1 [Mus musculus] gi|148697002|gb|EDL28949.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Mus musculus] Length = 617 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 54/275 (19%), Positives = 97/275 (35%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFK 59 V++I+L RRE+ + +A+ G+ R Sbjct: 338 VFMINLKRRRDRRERMLRALHEQEIDCQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 397 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 457 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL K + Sbjct: 458 DLI--YVGRKRMQVEHPEKAVPHVRNLVEA-DYSYWTLAYVISLQGAQKLL-AAKPLAKM 513 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ N+ + EP +Y DT S + Sbjct: 514 LPVDEFLPVMFDKHPMSEYKSHFSPRNLRAFSVEPLLIYPTHYTGDDGYVSDTETSVVWN 573 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V+ T + Q L A D+ Sbjct: 574 NEQVK--TDWDRAKSQKMREQQALSREAKNSDVLQ 606 >gi|157326133|gb|ABV44299.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 108/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCI-FQIFDAISPKHQDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECIKLNQP-VVILEDDITLEPHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKDAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFQQYSKQYAPLKTPK 330 >gi|157326125|gb|ABV44295.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 62/322 (19%), Positives = 102/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLVQELCDAPS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLDQP-IVILEDDATLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYQETKFHVLPKEFVFPPFDYHFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLKHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + + + K Sbjct: 300 KNLFYYSINAKKRLKAFQQYSK 321 >gi|145641046|ref|ZP_01796627.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae R3021] gi|145274207|gb|EDK14072.1| putative UDP-glcNAc--lipooligosaccharide N-acetylglucosaminyl glycosyltransferase [Haemophilus influenzae 22.4-21] Length = 221 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 45/193 (23%), Positives = 76/193 (39%), Gaps = 21/193 (10%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDA 95 ++ IF+ ++ + + R ++ EIGC +SH+ ++++I I ++ EDDA Sbjct: 3 KDWNELAAIFNIEQFKAHYGRNVTKGEIGCTLSHLSVYQKIIEDNDIAENSYTLVCEDDA 62 Query: 96 DFSDEFSQLLPHLSKCDINNILI-----------KFDALRKKPKKDSYLCTLPGNFDIHQ 144 F +F + L L + + +I D P S+LC GN + Sbjct: 63 LFHPDFQKNLTALLAEKLESEIILVGQSKINNFNDTDLEINYPTTFSFLCKKTGNVNYAF 122 Query: 145 PRI-LSPRTTGYFIGKEAAIHLLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE 200 P T GY I K AA + + + D + + NI + V P V E Sbjct: 123 PYKSYFAGTVGYLIKKSAARRFIQQISQNEPFWLA-DDFLLFEQDFNIRNKVVRPLMVIE 181 Query: 201 AIDTNDSTIEESR 213 S +E R Sbjct: 182 NP-VLISNLESIR 193 >gi|21595163|gb|AAH32165.1| Glycosyltransferase 25 domain containing 1 [Mus musculus] Length = 617 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 54/275 (19%), Positives = 97/275 (35%), Gaps = 39/275 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFK 59 V++I+L RRE+ + +A+ G+ R Sbjct: 338 VFMINLKRRRDRRERMLRALHEQEIDCQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 397 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 398 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 457 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL K + Sbjct: 458 DLI--YVGRKRMQVEHPEKAVPHVRNLVEA-DYSYWTLAYVISLQGAQKLL-AAKPLAKM 513 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + H+ N+ + EP +Y DT S + Sbjct: 514 LPVDEFLPVMFDKHPMSEYKSHFSPRNLRAFSVEPLLIYPTHYTGDDGYVSDTETSVVWN 573 Query: 212 SRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 + V+ T + Q L A D+ Sbjct: 574 NEQVK--TDWDRAKSQKMREQQALSREAKNSDVLQ 606 >gi|332253465|ref|XP_003275861.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1-like [Nomascus leucogenys] Length = 626 Score = 125 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 87/227 (38%), Gaps = 28/227 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 347 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 406 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 407 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 466 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 467 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 522 Query: 174 RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 P+D + H+ N+ + EP +Y T D Sbjct: 523 LPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 569 >gi|157326143|gb|ABV44304.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 125 bits (315), Expect = 6e-27, Method: Composition-based stats. Identities = 64/331 (19%), Positives = 109/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENN---PICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEYFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L + Sbjct: 61 LLQSDWYHSYVGTGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALEN 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFMEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P + + + DSTI+ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPISLSEHSLDSTIQSPQKKSLKSYPFPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + ++ + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLNAFHQYSKQYAPLKTPK 330 >gi|315586698|gb|ADU41079.1| glycosyltransferase [Helicobacter pylori 35A] Length = 332 Score = 125 bits (315), Expect = 6e-27, Method: Composition-based stats. Identities = 60/322 (18%), Positives = 100/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---------NR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDVQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKTDQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H I + P + + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHGIANFTYVPCPISLSEHSLDSTIQKPQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + + + K Sbjct: 300 KNLFYYSLNAKKRLKAFYKYSK 321 >gi|59802451|ref|YP_209163.1| LgtB [Neisseria gonorrhoeae FA 1090] gi|240013285|ref|ZP_04720198.1| LgtB [Neisseria gonorrhoeae DGI18] gi|240120358|ref|ZP_04733320.1| LgtB [Neisseria gonorrhoeae PID24-1] gi|59719346|gb|AAW90751.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria gonorrhoeae FA 1090] Length = 279 Score = 125 bits (315), Expect = 6e-27, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 74/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|21069193|gb|AAM33867.1|AF470661_1 LgtB [Neisseria meningitidis] Length = 275 Score = 125 bits (315), Expect = 6e-27, Method: Composition-based stats. Identities = 41/251 (16%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I ++A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|317181932|dbj|BAJ59716.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F57] Length = 332 Score = 125 bits (314), Expect = 6e-27, Method: Composition-based stats. Identities = 61/322 (18%), Positives = 101/322 (31%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPELGCYLSHYFLWKECVKTDQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM + H I + P V + + DSTI++S+ +K TF Sbjct: 240 FKIIEPVDMFINTPTYHGIANFTYVPCPVSLSEHSLDSTIQKSQKKSLKSYPPPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + ++ + K Sbjct: 300 KNLFYYSLNAKKRLKAFHKYSK 321 >gi|292618105|ref|XP_684212.3| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Danio rerio] Length = 591 Score = 125 bits (314), Expect = 6e-27, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 88/245 (35%), Gaps = 38/245 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 +Y+I+L RR++ + + DAI G + Sbjct: 324 IYLINLRRRQDRRDRMLWSLYELEIDAKVVDAIDGAALNSSDIKILGVDLLPGYYDPFSG 383 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ E+GC++SH ++WK + A+I EDD F F +L+ + + +++ Sbjct: 384 RTLTKGEVGCFLSHYYIWKEMVDMQLDKALIFEDDVRFQANFKRRMMRLMEEVEQVELDW 443 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 +I + P ++ + N S T Y I + A LLN + + Sbjct: 444 DIIYLGRKKVNPGEEVPV----ENVRNLVFADYSYWTLSYAISLQGAQKLLN-AEPVSKM 498 Query: 174 RPIDMDMKHWWE-------------HNIPSLVTEPGAVYE---------AIDTNDSTIEE 211 P+D + ++ N+ + T P V DT ST+ + Sbjct: 499 LPVDEFLPIMYDKHPNEDYKSHFPNRNLMAYSTHPLLVQPCHYAGDAEWVSDTETSTLWD 558 Query: 212 SRLVR 216 VR Sbjct: 559 DDSVR 563 >gi|68249147|ref|YP_248259.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 86-028NP] gi|68057346|gb|AAX87599.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 86-028NP] Length = 312 Score = 125 bits (314), Expect = 6e-27, Method: Composition-based stats. Identities = 53/263 (20%), Positives = 89/263 (33%), Gaps = 27/263 (10%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-----IYGENNPICNRIFSHQKRQCQF--- 58 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 30 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSISILHNIEE 89 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL- 117 R+L+ E GC ISH LW + I EDD + L Sbjct: 90 SRILTKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFN 149 Query: 118 ----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI 172 I+ + + K + P + T GY I + AA +++ KNI Sbjct: 150 DIFIIRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI 209 Query: 173 ----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 +D +D+ ++ + + P++ + ++ S +E+ R R Sbjct: 210 PSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKEREKRPKIR 269 Query: 221 SPLYF-YRNTCYQWNLHYNAWRK 242 R T + N+ RK Sbjct: 270 KKKTLKQRLTRIKENIIRALNRK 292 >gi|148825173|ref|YP_001289926.1| diadenosine tetraphosphatase [Haemophilus influenzae PittEE] gi|148715333|gb|ABQ97543.1| diadenosine tetraphosphatase [Haemophilus influenzae PittEE] Length = 302 Score = 125 bits (314), Expect = 6e-27, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 39/275 (14%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-------------IYGENNPICNRIFSHQK 53 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 8 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSI 67 Query: 54 RQCQF-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 Q R+L+ E GC ISH LW + I EDD + L Sbjct: 68 NQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLN 127 Query: 107 HLSKCDINNIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKE 160 I+ + + K + P N + T GY I + Sbjct: 128 QNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQG 187 Query: 161 AAIHLLNVRKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDST 208 AA +++ KNI +D +D+ ++ + + P++ + ++ S Sbjct: 188 AAKYVIEYLKNIPSDEIVAVDQLIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSG 247 Query: 209 IEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 +E+ R R R T + N+ RK Sbjct: 248 LEKERQKRPKIRKKKTLKQRLTRIKENIIRALNRK 282 >gi|229845713|ref|ZP_04465835.1| diadenosine tetraphosphatase [Haemophilus influenzae 6P18H1] gi|229811355|gb|EEP47062.1| diadenosine tetraphosphatase [Haemophilus influenzae 6P18H1] Length = 302 Score = 125 bits (314), Expect = 7e-27, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 39/275 (14%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-------------IYGENNPICNRIFSHQK 53 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 8 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSI 67 Query: 54 RQCQF-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 Q R+L+ E GC ISH LW + I EDD + L Sbjct: 68 NQSNSILHNIEKSRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLN 127 Query: 107 HLSKCDINNIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKE 160 I+ + + K + P N + T GY I + Sbjct: 128 QNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQG 187 Query: 161 AAIHLLNVRKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDST 208 AA +++ KNI +D +D+ ++ + + P++ + ++ S Sbjct: 188 AAKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSG 247 Query: 209 IEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 +E+ R R R T + N+ RK Sbjct: 248 LEKERQKRPKIRKKKTLKQRLTRIKENIIRALNRK 282 >gi|312376728|gb|EFR23731.1| hypothetical protein AND_12341 [Anopheles darlingi] Length = 323 Score = 125 bits (314), Expect = 7e-27, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 79/212 (37%), Gaps = 37/212 (17%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI-----------------CNRIFS 50 I+L RR K + LQ F A+ G+ ++ ++ Sbjct: 2 INLERRTERRSKMLAHFDLLGLQVEHFPAVDGKLLTDKKVQDLGIRFLPGYADPFHKRYA 61 Query: 51 HQKRQ----------CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 + + +++ EIGC++SH ++W+++ ++LEDD F Sbjct: 62 FEPAKMIPQVLKYLFSFSFSPMTMGEIGCFLSHYYIWEKMVRLGLAEVLVLEDDIRFEPF 121 Query: 101 FS----QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 F ++L + LI F R + + + ++ G+ + + S T GY Sbjct: 122 FRRRAHRVLDDARRIG-GWDLIYFGRKRLQEEDEKWI---TGSEYLVRA-GYSYWTLGYV 176 Query: 157 IGKEAAIHLL-NVRKNIYRPIDMDMKHWWEHN 187 I E A LL P+D + ++++ Sbjct: 177 ITLEGAKKLLRERPLEKLLPVDEYLPIMFDNH 208 >gi|332673582|gb|AEE70399.1| jhp0562 family glycosyltransferase [Helicobacter pylori 83] Length = 332 Score = 125 bits (314), Expect = 7e-27, Method: Composition-based stats. Identities = 61/322 (18%), Positives = 104/322 (32%), Gaps = 84/322 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 ICVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLVQELYDVQN 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGAGLTLPELGCYLSHYLLWKECVKINQP-VVILEDDVALEFNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKIYYSTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I + P + + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDIANFTYVPCPISLSEHSLDSTIQKPQKKSLKFYPPPSKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRK 242 L++Y + ++ + K Sbjct: 300 KNLFYYSLNAKKRLKAFHKYSK 321 >gi|157326151|gb|ABV44308.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 107/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDAMLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T Y++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTASYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H+I +L P V + + DSTI++ + +K F Sbjct: 240 FKIIEPVDMFMDNSAYHDIANLTYVPCPVSLSEHSLDSTIQKPQKKSLKPYPLLPKKSIF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSINAKKRLKAFYKYSKQYAPLKTLK 330 >gi|325579209|ref|ZP_08149165.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus parainfluenzae ATCC 33392] gi|325159444|gb|EGC71578.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Haemophilus parainfluenzae ATCC 33392] Length = 282 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. Identities = 46/259 (17%), Positives = 86/259 (33%), Gaps = 27/259 (10%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VIS+ + RR + + + F FFDA + + + L+ E Sbjct: 10 VISISTADKRRNHIIDQFGQKKIPFEFFDAFTPSDR--LDVHLQRYLPNVEATSKLTAGE 67 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFD 121 GC +SH LWK+ I EDD + + L + + ++ + Sbjct: 68 KGCLMSHFMLWKKCVDDNLAYISIFEDDILLGENAEKFLANDEWLKVRFNFQEIFVLRLE 127 Query: 122 ALRKKPKKDSYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + + LP +I + T GY I AA +L+N+ + + + I Sbjct: 128 TFLMPVQLEKQQQILPFQERNIDILKSKHFGTAGYIISNGAAKYLINLFEKLAIEEVKAI 187 Query: 177 DMDMKHWWEH--NIPSLVTEPGAVYEAIDTND------STIEESRLV------RKPTFSP 222 D M + + P + + N S +++ R +K Sbjct: 188 DEIMFNEQINVGAYQVYQLNPAICVQELQLNQENSLLVSDLQQERKKNTAVHTKKTLKQR 247 Query: 223 LYFYRNTCYQWNLHYNAWR 241 L + + L+ W+ Sbjct: 248 LTRIKENIIR-ALNKEKWK 265 >gi|1857122|gb|AAB48387.1| glycosyl transferase [Neisseria meningitidis] Length = 280 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 76/242 (31%), Gaps = 24/242 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGAHDIPFQFFDALM-PSEALEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + K N T GY I +EA L + I Sbjct: 119 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAERI 178 Query: 173 YRPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFS 221 + +D M ++ + +P P + + S +E+ R + Sbjct: 179 -KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRR 237 Query: 222 PL 223 L Sbjct: 238 SL 239 >gi|21069197|gb|AAM33870.1|AF470662_2 LgtB [Neisseria meningitidis] gi|21069200|gb|AAM33872.1|AF470663_1 LgtB [Neisseria meningitidis] Length = 279 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D M + +P +P + + S IE RL+ + Sbjct: 179 HPVDWMMFSDFFDREGMPVCQLDPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|90418037|ref|ZP_01225949.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337709|gb|EAS51360.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 286 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. Identities = 54/253 (21%), Positives = 96/253 (37%), Gaps = 28/253 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDA--IYGENNPICNRIFSHQKRQCQFKRLL 62 ++ I+L + RR +A I L +A + G ++ S+ + ++R + Sbjct: 7 IWFINLADARERRALIEVQAEAIGLDVHRQEACTVAGIDDA------SYARLSTSWERPM 60 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + PE+ ++SH LW+R A S G I+LEDD F F + + + + N L+ F+ Sbjct: 61 TRPELAAFLSHRRLWERAAASE-DGLIVLEDDTVFGTHFPRAVANAAA--SNFDLVNFET 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + ++ T G + YF+ + A LL + P+D M Sbjct: 118 VGRRKFFRKRGGTTVGGVRFTELVREKSGAGAYFVSRNGARALLAKAETQAAPVDAFM-- 175 Query: 183 WWEHNIPSLVTEPGAV-------YEAIDTNDSTIEESRLVRKPTFSPL------YFYRNT 229 + + + EP A D ST ++ + + PL Y R Sbjct: 176 FGVCRLRTAQVEPAATMQVHLLEERGFDVGLST--QTSIHQPRQRLPLSWANLPYTGRRI 233 Query: 230 CYQWNLHYNAWRK 242 Q L R+ Sbjct: 234 GTQLRLALVQARR 246 >gi|240127365|ref|ZP_04740026.1| LgtB [Neisseria gonorrhoeae SK-93-1035] gi|268685737|ref|ZP_06152599.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae SK-93-1035] gi|268626021|gb|EEZ58421.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae SK-93-1035] Length = 279 Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|261378165|ref|ZP_05982738.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria cinerea ATCC 14685] gi|269145629|gb|EEZ72047.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria cinerea ATCC 14685] Length = 279 Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 73/227 (32%), Gaps = 22/227 (9%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 +ISL + RR R + F FFDA+ + + + + LLS E Sbjct: 5 IISLASAKERRAHIADTFGRHDIPFQFFDALM-PSEELEQAMGELVPGLVRHN-LLSSVE 62 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----LIKFD 121 C++SH+ LWK+ + EDD + + L +++ + Sbjct: 63 KACFMSHVVLWKQALDEGVSYIAVFEDDVLLGEGAEKFLAKDGWLKERFDPDSAFIVRLE 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIYRPID 177 + K T GY I +EA L + +PID Sbjct: 123 TMFIKVIARPCGVPDYEGRHFPLLESEHWGTAGYIISREAMRFFLQHFAMLQPERIKPID 182 Query: 178 MDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESR 213 + M + + +P P + + S +E+ R Sbjct: 183 LMMFSDFFDKEGMPVYQVSPALCAQELHYTKFHNKSSVLGSLLEDER 229 >gi|294669020|ref|ZP_06734106.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291309012|gb|EFE50255.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 275 Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 80/249 (32%), Gaps = 24/249 (9%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VISL + ARR+ + F FFDA+ NR + LLS E Sbjct: 5 VISLSSARARRKHIADTFGARGIPFRFFDALMPS--EELNRAMEELVPGLAKQHLLSEVE 62 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN-----ILIKFD 121 C++SH+ LWK+ + EDD + + L + +++ + Sbjct: 63 KACFMSHVVLWKQALDEGLPYVAVFEDDVLLGENAEKFLAEDGWLEERFAVGTPFIVRLE 122 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID 177 + + N GY I EA L + +D Sbjct: 123 TIFSRVIISPSRVPNYQNRAFAVMESEHMGMAGYIISCEAMRFFLTQFAVLKTGRLEAVD 182 Query: 178 MDM---KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLV-RKPTFSPLY 224 + + E N+P+ P + + S++E R V R+ + + Sbjct: 183 HMLFKYNLFGERNVPAYQIVPALCAQELHYTTFLSQDSRLGSSLESDRKVARRQERALIE 242 Query: 225 FYRNTCYQW 233 ++ + Sbjct: 243 RFKRIFTKL 251 >gi|157326153|gb|ABV44309.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 332 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 107/331 (32%), Gaps = 84/331 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDA ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAFSPKHEDFEKLLQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDDA F Q L Sbjct: 61 LLQSDWYHSYVGTGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDAMLESNFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKIFTEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPTF 220 P+DM M + H++ +L P V + + DSTI++ + +K TF Sbjct: 240 FKIIEPVDMFMDNSAYHDVANLTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPPKKSTF 299 Query: 221 SPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 L++Y + + + K P+ T K Sbjct: 300 KNLFYYSLNAKKRLKAFQQYSKQYAPLKTPK 330 >gi|16272494|ref|NP_438708.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] gi|260580997|ref|ZP_05848820.1| diadenosine tetraphosphatase [Haemophilus influenzae RdAW] gi|1170778|sp|Q03974|LIC2A_HAEIN RecName: Full=Lipooligosaccharide biosynthesis protein lex-1 gi|1573535|gb|AAC22208.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae Rd KW20] gi|260092356|gb|EEW76296.1| diadenosine tetraphosphatase [Haemophilus influenzae RdAW] Length = 302 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 39/275 (14%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-------------IYGENNPICNRIFSHQK 53 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 8 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSI 67 Query: 54 RQCQF-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 Q R+L+ E GC ISH LW + I EDD + L Sbjct: 68 NQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENFEYLKIFEDDVILGENAEVFLN 127 Query: 107 HLSKCDINNIL-----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKE 160 I+ + + K + P N + T GY I + Sbjct: 128 QNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQG 187 Query: 161 AAIHLLNVRKNI----YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDST 208 AA +++ KNI +D +D+ ++ + + P++ + ++ S Sbjct: 188 AAKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSG 247 Query: 209 IEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 +E+ R R R T + N+ RK Sbjct: 248 LEKERQKRSKIRKKKTLKQRLTRIKENIIRALNRK 282 >gi|167626107|ref|YP_001676401.1| glycosyl transferase family protein [Shewanella halifaxensis HAW-EB4] gi|167356129|gb|ABZ78742.1| glycosyl transferase family 25 [Shewanella halifaxensis HAW-EB4] Length = 247 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 80/241 (33%), Gaps = 17/241 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL RR + + F FFDA P N + + + L Sbjct: 1 MKCFVISLLTEKKRRAHINQEFSSQAIPFEFFDA----TTPDLNEAETTRLQIATQDTTL 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S E+ C +SH+ L ++I I EDD + + L S LIK + Sbjct: 57 SKGELSCLLSHVALLQQIVDESIPYTAIFEDDIHLGKDAGRYLCSESWIPKTAELIKVEM 116 Query: 123 LRKKPK-KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMD 179 K L ++ + R T GY I A LL+ + ID Sbjct: 117 FNHIAKGTFLGNHKLALGAELFKLRGKHLGTAGYIISCRMAGLLLSEIRQYQPIEAIDKI 176 Query: 180 M--KHWWEHNIPSLVTEPGAVYEAIDTN------DSTIEESRLVRKPTFSPLYFYRNTCY 231 M + + + +P + N S++ + R++ P L Sbjct: 177 MFEEMITSKKVVAYQLQPAICAQDDIINRKNSLLHSSLNDERIL--PAKKKLSLGLKVKR 234 Query: 232 Q 232 + Sbjct: 235 E 235 >gi|50119115|ref|YP_048282.1| putative beta1,4-galactosyltransferase [Pectobacterium atrosepticum SCRI1043] gi|49609641|emb|CAG73074.1| putative beta1,4-galactosyltransferase [Pectobacterium atrosepticum SCRI1043] Length = 248 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. Identities = 49/249 (19%), Positives = 88/249 (35%), Gaps = 19/249 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V+V++L ++EK + L + A+ G K L Sbjct: 1 MKVFVVNLDKDKDKKEKIKNECRNAELDYEIISAVDGRELSYNEL---KSKVHPVSLNYL 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S EIGC +SH ++KRI A+ILEDD + S + L N F Sbjct: 58 SKGEIGCVLSHQRIYKRILDDDIDYALILEDDVELSQDIKVFLKEFLSVKDKNKGDVFLL 117 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + + ++ ++ S GY I +AA L+ + I D + Sbjct: 118 YPSGLRFLNRRINVSHDYFFYEAY-NSSCAHGYIISNKAAKKLIRINTPIILVADAWLWF 176 Query: 183 WWEHNIPSLVTEPGAVYE-AIDTNDSTIEESR--------------LVRKPTFSPLYFYR 227 + + V V +D + STIE R + ++P + + +Y Sbjct: 177 YQISLLKVYVLNKELVRAYDVDKSLSTIETERSLLLDEKEKHQMQIIKKQPLYYLIKYYH 236 Query: 228 NTCYQWNLH 236 + ++ Sbjct: 237 KYIRRLFIN 245 >gi|21654773|gb|AAK85148.1| hypothetical protein [Neisseria meningitidis] Length = 268 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 80/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 P+D+ M + +P P ++ +T S ++ R + SP + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|114704987|ref|ZP_01437895.1| glycosyl transferase, family 25 [Fulvimarina pelagi HTCC2506] gi|114539772|gb|EAU42892.1| glycosyl transferase, family 25 [Fulvimarina pelagi HTCC2506] Length = 261 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 57/272 (20%), Positives = 95/272 (34%), Gaps = 27/272 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +P Y I+L + RR+ +A R+ L+ S F A+ E+ ++R + Sbjct: 2 VPAYYINLDRARERRDFMERQAERLGLRLSRFSALSAEDVDETR----FTALAGLWERPM 57 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E+ +SH LW+++A A A I EDDA S + L LI + Sbjct: 58 TRVEVAVLLSHASLWQKVADDDAPLA-IFEDDAVLSPRLPEFLRE---PLPAYDLINLEY 113 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM-- 180 ++ + + + Y + + A L +N P D + Sbjct: 114 FARR----KFFRRYASDGRMSDIARDKAGAAAYLLSPDGARKALAAIENHAAPADAFLYA 169 Query: 181 -KHWWEHNIPSLVTEPGAVYEAIDTND-----STIEESRLVRKPTFSP-LYFYRNTCYQW 233 + + V A + + I E R PT +Y +R T Q Sbjct: 170 SGRLEMTQVEPALAVQAEVLSAKGIDPGVATVTQIHEPRGRLAPTSDNWVYGWRRTLTQL 229 Query: 234 NLH-YNAWRKDL-----PPVSTTKFLPSSSSS 259 L + R P V ++F P SSS Sbjct: 230 RLVPLHVGRGTFYEFREPRVDLSEFAPDQSSS 261 >gi|259415052|ref|ZP_05738974.1| glycosyl transferase, family 25 [Silicibacter sp. TrichCH4B] gi|259348962|gb|EEW60716.1| glycosyl transferase, family 25 [Silicibacter sp. TrichCH4B] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 47/263 (17%), Positives = 83/263 (31%), Gaps = 14/263 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M + Y+ISL + R L+ F A+ Q + Sbjct: 1 MKLATYMISLDRATGRHAAMQVALRDAGLEAEFVSAVD--LAETSEAELLQQCKSFGPWG 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + + + C +SH +W++ S A A+I EDD S E Q + LS L+ Sbjct: 59 VFAHGNMACTLSHAKVWEQFLASDADVALIFEDDVFISTELRQWIADLSWWPQGCDLLSL 118 Query: 121 DALRKKPKK--DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 + R + K D+ P GY + + A LL + +D Sbjct: 119 EFWRSETMKVMLGTKPYRHLRRDVAPMLSRYPGAAGYMLTRRGAEVLLAQA-PFDQTVDS 177 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-----VRKPTFSPLYFYRNTCYQW 233 + + P+ +P + A+ T +T + RKP + + Sbjct: 178 LLFNPMVSG-PARSLKPHQITPALVTQGNTPPDEGTFLGHAQRKPQGALYRKQKRLRGMA 236 Query: 234 NLH---YNAWRKDLPPVSTTKFL 253 L + R + + Sbjct: 237 ELRALPLHLLRLISGRARLVRVI 259 >gi|325129464|gb|EGC52294.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis OX99.30304] Length = 275 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMVELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|62002234|gb|AAX58717.1| LgtB [Neisseria meningitidis] gi|62002238|gb|AAX58720.1| LgtB [Neisseria meningitidis] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 >gi|325199019|gb|ADY94475.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis G2136] Length = 275 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 73/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLDEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|145628932|ref|ZP_01784732.1| dimethyladenosine transferase [Haemophilus influenzae 22.1-21] gi|144979402|gb|EDJ89088.1| dimethyladenosine transferase [Haemophilus influenzae 22.1-21] Length = 223 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 44/185 (23%), Positives = 64/185 (34%), Gaps = 16/185 (8%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA-----IYGENNPICNRIFSHQKRQCQF--- 58 VIS+ + RR+ + L FSFF+A I N N+ + Sbjct: 29 VISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSNSILHNIEE 88 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL- 117 R+L+ E GC ISH LW + I EDD + L Sbjct: 89 SRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFN 148 Query: 118 ----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI 172 I+ + + K + P N + T GY I + AA +++ KNI Sbjct: 149 DIFIIRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI 208 Query: 173 YRPID 177 P D Sbjct: 209 --PSD 211 >gi|121635585|ref|YP_975830.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis FAM18] gi|15991375|gb|AAL12841.1|AF355193_5 LgtB [Neisseria meningitidis] gi|120867291|emb|CAM11062.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis FAM18] gi|261393258|emb|CAX50884.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis 8013] gi|308390024|gb|ADO32344.1| lacto-N-neotetraose biosynthesis glycosyl transferase [Neisseria meningitidis alpha710] gi|325207415|gb|ADZ02867.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis NZ-05/33] Length = 275 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|240079868|ref|ZP_04724411.1| LgtB [Neisseria gonorrhoeae FA19] gi|268596012|ref|ZP_06130179.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae FA19] gi|268549800|gb|EEZ44819.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae FA19] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|240124850|ref|ZP_04737736.1| LgtB [Neisseria gonorrhoeae SK-92-679] gi|268683431|ref|ZP_06150293.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae SK-92-679] gi|268623715|gb|EEZ56115.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae SK-92-679] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 72/243 (29%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|325131651|gb|EGC54357.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis M6190] gi|325137514|gb|EGC60097.1| Lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis ES14902] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 >gi|21069190|gb|AAM33865.1|AF470660_2 LgtB [Neisseria meningitidis] Length = 275 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGAHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGCAFPLLESEHWGTVGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P +P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLDPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|260441363|ref|ZP_05795179.1| LgtB [Neisseria gonorrhoeae DGI2] gi|291044721|ref|ZP_06570430.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae DGI2] gi|291011615|gb|EFE03611.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae DGI2] Length = 279 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|21069174|gb|AAM33853.1|AF470656_2 LgtB [Neisseria meningitidis] gi|21069178|gb|AAM33856.1|AF470657_2 LgtB [Neisseria meningitidis] Length = 275 Score = 123 bits (310), Expect = 2e-26, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|293397820|ref|ZP_06642026.1| glycosyl transferase, family 25 [Neisseria gonorrhoeae F62] gi|2497663|sp|Q50947|LGTB_NEIGO RecName: Full=Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB gi|595811|gb|AAA68010.1| glycosyl transferase [Neisseria gonorrhoeae] gi|291611766|gb|EFF40835.1| glycosyl transferase, family 25 [Neisseria gonorrhoeae F62] Length = 279 Score = 123 bits (309), Expect = 2e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|317165548|gb|ADV09089.1| LgtB [Neisseria gonorrhoeae TCDC-NG08107] Length = 279 Score = 123 bits (309), Expect = 2e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|14586734|gb|AAK70339.1|AF313394_2 glycosyl transferase LgtB [Neisseria gonorrhoeae] Length = 279 Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 74/244 (30%), Gaps = 31/244 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNI 172 ++ + + G T GY I +EA L+ + I Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISREAMRFFLDRFAVLPAERI 178 Query: 173 YRPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRK 217 + +D+ M +P P + + S IE + RL RK Sbjct: 179 -KAVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRK 234 Query: 218 PTFS 221 + Sbjct: 235 QQWR 238 >gi|6502578|gb|AAF14360.1|AF121135_2 glycosyltransferase [Neisseria gonorrhoeae] Length = 248 Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|240122662|ref|ZP_04735618.1| LgtB [Neisseria gonorrhoeae PID332] gi|268681260|ref|ZP_06148122.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae PID332] gi|268621544|gb|EEZ53944.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae PID332] Length = 279 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 72/243 (29%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHLY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|240114823|ref|ZP_04728885.1| LgtB [Neisseria gonorrhoeae PID18] gi|268600479|ref|ZP_06134646.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae PID18] gi|268584610|gb|EEZ49286.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae PID18] Length = 279 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 72/243 (29%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGVPYIAVFEDDVLLGEGAGQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|239998105|ref|ZP_04718029.1| LgtB [Neisseria gonorrhoeae 35/02] gi|254492887|ref|ZP_05106058.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae 1291] gi|268593957|ref|ZP_06128124.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae 35/02] gi|226511927|gb|EEH61272.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae 1291] gi|268547346|gb|EEZ42764.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria gonorrhoeae 35/02] Length = 279 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 73/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGVPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 + Sbjct: 236 QWR 238 >gi|91078804|ref|XP_970300.1| PREDICTED: similar to Glycosyltransferase 25 family member [Tribolium castaneum] gi|270003725|gb|EFA00173.1| hypothetical protein TcasGA2_TC002995 [Tribolium castaneum] Length = 559 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 52/245 (21%), Positives = 92/245 (37%), Gaps = 35/245 (14%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQCQFKR 60 +++I+L RR++ + LQ + DA+ G + + KR Sbjct: 317 KIFMINLLRRPERRQRMRFCFDELGLQVTIVDAVDGRALNASILQQISPLPEYADPYHKR 376 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCD--INNIL 117 ++L EIGC++SH ++WK I + ++LEDD F F ++L + + L Sbjct: 377 PMTLGEIGCFLSHYNIWKDIVRNGYETTLVLEDDIRFESFFRTKVLNVMDEVKRVSGWDL 436 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--P 175 + R + + ++ + S T GY + A LLN + R P Sbjct: 437 VYLGRKRLQEDDEPWV----EGSSLLVEAGYSYWTLGYVLSLNGAKKLLN-ADPLSRLVP 491 Query: 176 IDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESR 213 +D + ++ + N+ +L P VY DT DS I + Sbjct: 492 VDEYLPILFDKHPQTSWKGYFPKRNLVALSAAPLLVYPTHYTGEQGYISDTEDSVIIKEN 551 Query: 214 LVRKP 218 R Sbjct: 552 GPRAR 556 >gi|157326127|gb|ABV44296.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 330 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 61/309 (19%), Positives = 97/309 (31%), Gaps = 81/309 (26%) Query: 3 IPVYVISLPFSHARR--EKFC----HRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R EK + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESKRRLNTEKLILESNEKFKGRCV-FQIFDAISPKHEDFEKLVQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE+GCY+SH LWK +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPELGCYLSHYLLWKECVKLNQP-VVILEDDVTLESHFMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + N + + + Sbjct: 120 CLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLNL 179 Query: 150 --------------------------------------PRTTGYFIGKEAAIHLLNVRKN 171 T GY++ + A + + Sbjct: 180 STHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVATTAGYYLTPKGAKTFIEATER 239 Query: 172 I--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY-- 224 P+DM M + H++ + P V +STI+ ++ + KP + Sbjct: 240 FKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNSTIQNAKKPDISLKPPRKSYFDN 299 Query: 225 -FYRNTCYQ 232 FY Q Sbjct: 300 LFYHKFNAQ 308 >gi|169234704|ref|NP_001108473.1| chromosome associated protein D3 [Bombyx mori] gi|18700451|dbj|BAB85193.1| hypothetical protein [Bombyx mori] gi|22474509|dbj|BAC10614.1| hypothetical protein [Bombyx mori] Length = 407 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 74/189 (39%), Gaps = 14/189 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFS----HQKRQCQFK 59 +Y+I+L RR + + F AI G N + + R +S K Sbjct: 211 IYMINLERRQERRFLMELSFKELGMSVQHFSAIDGRNLDMNDLREYSITLMPNYEDPYHK 270 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R + E+GC++SH ++W+ I + A++LEDD F F QL+ +++ Sbjct: 271 RPMKAGEVGCFLSHYYIWQDIVKNHHRVALLLEDDIHFVPYFRHRFIQLMDEITQSLSAL 330 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDI--HQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + I ++ + L + + S T GY I + A LL + Sbjct: 331 LYIIYNMFYFRYIGRKILLDGEEEYATPHTTRPLYSYWTLGYLISERGAKKLLR-ADPLS 389 Query: 174 --RPIDMDM 180 P+D + Sbjct: 390 KMLPVDEFL 398 >gi|325127443|gb|EGC50373.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis N1568] Length = 275 Score = 123 bits (308), Expect = 4e-26, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGAHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P +P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLDPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|83944103|ref|ZP_00956559.1| putative glycosyltransferase [Sulfitobacter sp. EE-36] gi|83844970|gb|EAP82851.1| putative glycosyltransferase [Sulfitobacter sp. EE-36] Length = 281 Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats. Identities = 49/272 (18%), Positives = 92/272 (33%), Gaps = 39/272 (14%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + Y+I+L R + + A + L F F A+ G ++ + K+ Sbjct: 1 MDTYLINLERRPDRLSEVSRKMAELGLPFKLFRAVDGHDDDAKLPDLNRHLFVINQKKQP 60 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-----------QLLPHLSKC 111 EI C SHI +W++ + A+ILEDD D + + L S Sbjct: 61 VRGEIACAASHIAVWRQFLETDEEYALILEDDIDIAPDLEMVQKQMPGLGLDFLNLSSNA 120 Query: 112 DINNIL-------IKFDALRKKPKKDS-------------------YLCTLPGNFDIHQP 145 L + DA+R +P +L LP + Sbjct: 121 PYTYTLDEETLSQLANDAVRTRPTVFERSRRRLWRRLEWRRRWRIFHLHPLPNGHVACEC 180 Query: 146 RILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTN 205 +GY + + AA L ++++ PID+ +H + P D++ Sbjct: 181 DPAPLLGSGYIVSRRAAEAFLQTTEHLFFPIDLVWRHAPGMLRQGFLARPIVTQTDKDSD 240 Query: 206 D-STIEESRLVRK-PTFSPLYFYRNTCYQWNL 235 ++ ++ K P R ++++ Sbjct: 241 IPGRFDQGKMALKYRLLRPFLKSRRLRRRFDV 272 >gi|317165551|gb|ADV09092.1| LgtE [Neisseria gonorrhoeae TCDC-NG08107] Length = 280 Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 71/244 (29%), Gaps = 28/244 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LW++ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWEQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPT 219 P+D+ M +P P + + S +E+ R + Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFLSQNSMLGSDLEKDREQGRRH 235 Query: 220 FSPL 223 L Sbjct: 236 RRSL 239 >gi|21654778|gb|AAK85139.1| beta 1,4 galactosyltransferase [Neisseria meningitidis] gi|325141522|gb|EGC63992.1| Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria meningitidis 961-5945] Length = 275 Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|15677758|ref|NP_274922.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis MC58] gi|9911085|sp|Q51116|LGTB_NEIMB RecName: Full=Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB gi|7227185|gb|AAF42257.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis MC58] gi|316985550|gb|EFV64497.1| lacto-N-neotetraose biosynthesis glycosyltransferase lgtB [Neisseria meningitidis H44/76] gi|325139581|gb|EGC62121.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis CU385] gi|325200979|gb|ADY96434.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis H44/76] Length = 275 Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYITVFEDDVLLGEGAEKFLAEDAWLQERFDPDTAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|21069182|gb|AAM33859.1|AF470658_2 LgtB [Neisseria meningitidis] Length = 275 Score = 122 bits (307), Expect = 5e-26, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 74/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGAHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + + T GY I ++A L+ + Sbjct: 119 VRLETMFMQVLTLPSGVAEYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P +P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLDPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|973186|gb|AAC44085.1| glycosyl tranferase [Neisseria meningitidis] gi|1586802|prf||2204376B lgtB gene Length = 275 Score = 122 bits (307), Expect = 5e-26, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 72/240 (30%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYITVFEDDVLLGEGEEKFLAEDAWLQERFDPDTAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|240015733|ref|ZP_04722273.1| LgtB [Neisseria gonorrhoeae FA6140] Length = 279 Score = 122 bits (306), Expect = 5e-26, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 72/243 (29%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGVPYIAVFEDDVLLGEGAGQFLAEDTWLQERFDPDSAFV 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|325204880|gb|ADZ00334.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis M01-240355] Length = 275 Score = 122 bits (306), Expect = 5e-26, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 73/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S IE RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|9963921|gb|AAG09765.1|AF240672_2 galactosyl transferase [Neisseria subflava] Length = 275 Score = 122 bits (306), Expect = 5e-26, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 74/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQVMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYITVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G + T GY I K+A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRALPLLESEHWGTAGYIISKKAIRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKLHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|15602374|ref|NP_245446.1| hypothetical protein PM0509 [Pasteurella multocida subsp. multocida str. Pm70] gi|12720769|gb|AAK02593.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 261 Score = 122 bits (306), Expect = 6e-26, Method: Composition-based stats. Identities = 53/262 (20%), Positives = 93/262 (35%), Gaps = 32/262 (12%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY--GENNPICNRIFSHQKRQCQFKR 60 + ++ISL HARRE F + F+ F A+ E+ +F + Q + R Sbjct: 1 MQKWLISLEKDHARREHFFAQPDT--ADFTVFSAMNTMQESWENLTALFDLNRFQQYYGR 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ E+GC +SH+ ++++I ++ EDD F+ F + L L Sbjct: 59 AVTKGEVGCTLSHLAVYQQIVGDATISENDYCLVCEDDVLFNQGFQRHLETLLTQKPQAD 118 Query: 117 LI-----KFDALRKKPKKDSY---------LCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 +I K D + +Y + + + T Y I K AA Sbjct: 119 VILVGQSKIDHFEASELEINYPTTFACCAKKIAHSSFYYAYPYKNYFAGTVAYLIKKSAA 178 Query: 163 IHLLNVRKNIYRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR------ 213 + + RP D + + + +V P E S +E SR Sbjct: 179 RVFIEQSQQGQRPFWLADDFILFGRDFCLDIMVVRPLLAIENP-ALVSNLEASRGSVSHH 237 Query: 214 LVRKPTFSPLYFYRNTCYQWNL 235 L +K PL + + + Sbjct: 238 LWQKWLKYPLKKWLAIQRNFQI 259 >gi|21069170|gb|AAM33850.1|AF470655_2 LgtB [Neisseria meningitidis] Length = 279 Score = 121 bits (305), Expect = 7e-26, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 73/243 (30%), Gaps = 28/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + Q L + + + Sbjct: 59 SGVEKACFMSHVVLWKQALDEGLPYIAVFEDDVLLGEGAEQFLAEDTWLEERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + + T GY I K+A L+ + Sbjct: 119 VRLETMFMQVLTSPSGVAEYCGRAFPLLESEHWGTAGYIISKKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPT 219 P+D M +P P + + S IE RL+ + Sbjct: 179 HPVDWMMFGNPDDRE---RMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQ 235 Query: 220 FSP 222 Sbjct: 236 QRR 238 >gi|194100116|ref|YP_002003256.1| LgtB [Neisseria gonorrhoeae NCCP11945] gi|193935406|gb|ACF31230.1| LgtB [Neisseria gonorrhoeae NCCP11945] Length = 279 Score = 121 bits (305), Expect = 7e-26, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 74/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ R + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD F + + L + + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGVPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHCGTAGYIISRKAMRFFLDRFAVLPPERL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIE-ESRLVRKP 218 P+D+ M +P P + + S IE + RL RK Sbjct: 179 HPVDLMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRRLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|313229149|emb|CBY23734.1| unnamed protein product [Oikopleura dioica] Length = 576 Score = 121 bits (305), Expect = 7e-26, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 75/199 (37%), Gaps = 14/199 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 ++VI+L R+E+ ++++ A+ G+ + ++ F Sbjct: 333 IFVINLERRQDRKERMDFCMRLQGIKYTLVPAVDGKALTQDDIDGLGIRQLPGFSDPHKP 392 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL--SKCDINNIL 117 R ++ E+GC++SH +LW+++ P ++LEDD F +F L DI L Sbjct: 393 RTITRGEVGCFLSHYYLWQKMV--PGETYLVLEDDVRFGPDFVGDLRRTLREAKDIEWDL 450 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPI 176 I R + + + T Y I +AA L+ P+ Sbjct: 451 IYVGRKRGPGADNDVEVMVTESISNVTYSFW---TLAYLITGDAAKVLVDEKPLEKLIPV 507 Query: 177 DMDMK-HWWEHNIPSLVTE 194 D + + H I S E Sbjct: 508 DEYLPIMYGAHKIDSYTKE 526 >gi|313215923|emb|CBY37331.1| unnamed protein product [Oikopleura dioica] Length = 579 Score = 121 bits (305), Expect = 8e-26, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 75/199 (37%), Gaps = 14/199 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 ++VI+L R+E+ ++++ A+ G+ + ++ F Sbjct: 333 IFVINLERRQDRKERMDFCMRLQGIKYTLVPAVDGKALTQDDIDGLGIRQLPGFSDPHKP 392 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL--SKCDINNIL 117 R ++ E+GC++SH +LW+++ P ++LEDD F +F L DI L Sbjct: 393 RTITRGEVGCFLSHYYLWQKMV--PGETYLVLEDDVRFGPDFVGDLRRTLREAKDIEWDL 450 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPI 176 I R + + + T Y I +AA L+ P+ Sbjct: 451 IYVGRKRGPGADNDVEVMVTESISNVTYSFW---TLAYLITGDAAKVLVDEKPLEKLIPV 507 Query: 177 DMDMK-HWWEHNIPSLVTE 194 D + + H I S E Sbjct: 508 DEYLPIMYGAHKIDSYTKE 526 >gi|332184819|gb|AEE27073.1| putative beta1,4-galactosyltransferase [Francisella cf. novicida 3523] Length = 249 Score = 121 bits (304), Expect = 8e-26, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 90/250 (36%), Gaps = 22/250 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENN--PICNRIFSHQKRQCQFK 59 + V+VI+ R+E ++ +SFF+A+ G + R + + Q Sbjct: 2 MNVFVINRKKDIGRKEYIIQHFKEKGIERYSFFEAVEGRDLDSNFIKRCNGYLSKTLQRA 61 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA-DFSDEFSQLLPHLSKCDINNILI 118 + L++ EIGC +SH+ ++K+I GA+I EDD D + +K N + Sbjct: 62 KGLNVGEIGCALSHLEIYKKIVNEGLAGALIFEDDVEILKDNLLGFVEDFNKQLSEN--V 119 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI-YRPID 177 + + + + + T Y+I + A +L Y ID Sbjct: 120 GYICHLGEKTLRKSKVYKKDIYQLKRSHY----TYSYYIDRIGAERMLRAFGEYPYMSID 175 Query: 178 MDMKHWWEHNIPSLV--TEPGA-VYEAIDTNDSTIEES-------RLVRKPTFSP-LYFY 226 + + I EP + + + R + +P S +Y Y Sbjct: 176 SFKQLIYMKVIKLYGKELEPQCKMRHDFASTIDNAVKRQKANFFIRTILRPIRSLVVYLY 235 Query: 227 RNTCYQWNLH 236 ++ + + Sbjct: 236 LPIKFKMSKN 245 >gi|194376002|dbj|BAG57345.1| unnamed protein product [Homo sapiens] Length = 554 Score = 121 bits (304), Expect = 1e-25, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 83/204 (40%), Gaps = 16/204 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 344 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 403 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L ++ + ++ Sbjct: 404 RPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNIDQAQLDW 463 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 464 ELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-GANPFGKM 519 Query: 174 RPIDMDMKHWWEHNIPSLVTEPGA 197 P+D + + + P L++ P Sbjct: 520 LPVDEFLPVMY-NKHPVLLSSPHC 542 >gi|126727463|ref|ZP_01743297.1| glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] gi|126703243|gb|EBA02342.1| glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] Length = 273 Score = 121 bits (304), Expect = 1e-25, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 83/239 (34%), Gaps = 16/239 (6%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 I +YVI+L R E+ + + L++ AI F H Sbjct: 6 NINIYVINLDKRIDRLERLGGQLDDMGLKWQRISAIDATK--ATPDAFEHATGLAGPLGK 63 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 + + + C SH W+ S A + LEDD S + +LL + ++K + Sbjct: 64 MGMGDRACATSHNKAWRAFLESDATFGLFLEDDVYLSSDTPKLLTDDTWIKPGIDVVKLE 123 Query: 122 ALRKKPKK--DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 K K TLP + Y + ++AA + R+NI P+D Sbjct: 124 KYDKGVSKVLLGQGFTLPNGRTLRPMYSRHAGGAAYILSRKAAAAAV-QRRNICVPVDHY 182 Query: 180 MKHWWEHNIPSLV--TEPGAVYEAIDTN-----DSTIEESRLVRKPTFSPLYFYRNTCY 231 W+ N+ S+ +P + + T +S + +P + +R Sbjct: 183 ---WFNANVSSMARRFKPHLINPGMATQREFGFNSDMAFIGRSERPKGWRM-KWRRAKR 237 >gi|322514724|ref|ZP_08067750.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Actinobacillus ureae ATCC 25976] gi|322119319|gb|EFX91435.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Actinobacillus ureae ATCC 25976] Length = 276 Score = 121 bits (303), Expect = 1e-25, Method: Composition-based stats. Identities = 50/256 (19%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M I +VISL ++ RR + ++ ++ F FFDA+ + + Sbjct: 4 MNIN-FVISLVNANVRRSHINNEFSKRNIPFQFFDALQ---PSVLLEQYIATTLPGLKVA 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI-----NN 115 LS E C++SH+ LW+++A S I EDD +L + +N Sbjct: 60 NLSEGEKACFMSHMALWEKLANSKNNYMAIFEDDVILGKNADKLFSSYDWLEERFALNDN 119 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 ++I+ + K+ GY + ++AA+ LL+ K++ Sbjct: 120 LIIRLETSCIAVKRVGSEVENVLERKFSLLDSEEFGMAGYILSRKAAVFLLDYFKSLDGK 179 Query: 174 --RPIDMDM--KHWWEHNIPSLVTEPG-AVYEA-----IDTNDSTIEESRL------VRK 217 + ID + +N+ EP + E T S +E R ++ Sbjct: 180 SIKAIDKYLFGDLIRNNNLHIYQLEPAICIQEDKLNKKKSTLISQLEYDRRRIENNNKKQ 239 Query: 218 PTFSPLYFYRNTCYQW 233 +Y +W Sbjct: 240 ENKGIIYMIYKLLTKW 255 >gi|115920130|ref|XP_783019.2| PREDICTED: similar to Glycosyltransferase 25 domain containing 2 [Strongylocentrotus purpuratus] gi|115954234|ref|XP_001196894.1| PREDICTED: similar to Glycosyltransferase 25 domain containing 2 [Strongylocentrotus purpuratus] Length = 624 Score = 121 bits (303), Expect = 1e-25, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 85/241 (35%), Gaps = 38/241 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-----ICNRIFSHQKRQCQFK 59 +Y+++L RR + + + F A+ G+ + Sbjct: 316 IYLVNLERRPLRRLRMQAALKLLRIDFKLIKAVDGKLLSPDILREMGIDMLPGYADPYWG 375 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R+L+ EIGC++SH ++WK + +I EDD F F ++ ++ + Sbjct: 376 RVLTKGEIGCFLSHYNIWKEVVERNLSRILIFEDDIRFGARFKPRMASVMAEVAARKLEW 435 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 LI + +K L + I P S T Y + E A LL ++ + + Sbjct: 436 DLI---YVGRKILHMKGEKFLSDSTQIVTPSY-SYWTLSYMLTLEGAKKLL-AQEPLGKM 490 Query: 175 -PIDMDM--------KHWWE-----HNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + + W N+ + +P V DT S+I E Sbjct: 491 VPVDEYLPIMFDKHPEDEWLAQFRPRNMRAFSVDPLMVSPTHYTGEKKYFSDTETSSIWE 550 Query: 212 S 212 Sbjct: 551 E 551 >gi|261494331|ref|ZP_05990825.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. OVINE] gi|261309980|gb|EEY11189.1| Lipooligosaccharide biosynthesis protein lpsA [Mannheimia haemolytica serotype A2 str. OVINE] Length = 254 Score = 120 bits (302), Expect = 2e-25, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 80/229 (34%), Gaps = 15/229 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + YVISL + RR+ + ++ F FFDAI + + + Sbjct: 1 MNNYVISLTTAQDRRKHIEAEFGKQNIPFQFFDAITPNSMEVKALELGIDISNSALPKE- 59 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 EI C +SHI LW I EDD + LL + D N ++K + Sbjct: 60 ---EISCALSHISLWHLAKEQNLDYICIFEDDIYLGKNANLLLSNTYIND-NVDIVKIER 115 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-RPIDMDMK 181 ++ + + + ++P T GY + ++ L + K I ID + Sbjct: 116 GLERLNLSLFPEKEYLERKFYILKTVNPGTGGYILTRKGINFLTSKVKEITSIEIDNLIF 175 Query: 182 HWWEHN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 ++ + P++ + +Y T + R R+ Sbjct: 176 KYYLTDKSYTVWQMQPTICIQDCLLYPTGHTLG-IMSGERDERRKKSKR 223 >gi|119605029|gb|EAW84623.1| glycosyltransferase 25 domain containing 1, isoform CRA_c [Homo sapiens] Length = 565 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 76/187 (40%), Gaps = 15/187 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 343 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 462 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI RK+ + + +P ++ + S T Y I + A LL + + Sbjct: 463 DLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKM 518 Query: 174 RPIDMDM 180 P+D + Sbjct: 519 LPVDEFL 525 >gi|325135521|gb|EGC58139.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria meningitidis M0579] Length = 275 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 74/251 (29%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALM-PSEALEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGKDAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV----RKNIY 173 ++ + + T GY I K+A L + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAIRFFLQHFVMLQPEQI 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 +PID+ M + + +P P + + S E RL+ + Sbjct: 179 KPIDLMMFSDFFDKEGMPVYQVSPALCAQELHYAKFHDQNSALGSLNEHDRLLNRKQQRR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|161869287|ref|YP_001598454.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis 053442] gi|161594840|gb|ABX72500.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria meningitidis 053442] Length = 273 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 74/234 (31%), Gaps = 22/234 (9%) Query: 10 LPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGC 69 + + RR R + F FFDA+ NR+ + + LLS E C Sbjct: 1 MASAKERRAHIADTFGRHDIPFQFFDALMPS--EELNRMMAELVPGLAKQHLLSEVEKAC 58 Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----LIKFDALR 124 ++SH LWK+ + EDD + + L + + +++ + + Sbjct: 59 FMSHAVLWKQALDEGLPYVAVFEDDVLLGKDAEKFLAEDTWLEDRFDKDSAFIVRLETMF 118 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIYRPIDMDM 180 K N T GY I +EA L + + +D M Sbjct: 119 AKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPAEWIKAVDWMM 178 Query: 181 KHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSPL 223 ++ + +P P + + S +E+ R + L Sbjct: 179 FTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQGRRHRRSL 232 >gi|157326119|gb|ABV44292.1| jhp0562-like glycosyltransferase [Helicobacter pylori] Length = 333 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 61/332 (18%), Positives = 104/332 (31%), Gaps = 85/332 (25%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCRGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS------- 149 K + + + KF L K+ + N + + + Sbjct: 120 DCLKSPFDFVRLYGCYWYYHETKFHVLPKEFVFPPFDYPFKNNPILEKFKKFFDVSRFLN 179 Query: 150 ---------------------------------------PRTTGYFIGKEAAIHLLNVRK 170 T GY++ + A + + Sbjct: 180 LSTHKVIHYILKKIQKSYYATHEKEAFFLEHFYLTSVYVASTAGYYLAPKGAKTFIEATE 239 Query: 171 NI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV---------RKPT 219 P+DM M + H++ + P V + + DSTI++ + +K T Sbjct: 240 RFKIIEPVDMFMDNSAYHDVANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPFPPKKST 299 Query: 220 FSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 F L++Y + ++ + + P+ T K Sbjct: 300 FKNLFYYSINAKKRLKAFHKYSRQYAPLKTPK 331 >gi|310817025|ref|YP_003964989.1| glycosyl transferase, family 25 [Ketogulonicigenium vulgare Y25] gi|308755760|gb|ADO43689.1| glycosyl transferase, family 25 [Ketogulonicigenium vulgare Y25] Length = 261 Score = 120 bits (300), Expect = 3e-25, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 81/247 (32%), Gaps = 11/247 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE-NNPICNRIFSHQKRQCQFKRL 61 I YVI+L + R F ++ + A+ G +P +S + F R Sbjct: 14 ISYYVINLDDNPERLRAFAAHIQPENIAYERVAAVDGRHVSPASFPSYSAKAAVAFFGRG 73 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 L E+ C +SHI R A AII EDDA +++F+ + + L + Sbjct: 74 LLGGEVACCLSHIDALNRFMAGDAPYAIIFEDDAWVAEDFTAKVERAIAL-FDEKLPGWQ 132 Query: 122 ALRKKPKKDSYLCTL--PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 L P+ + G + + T + A +L IY P D+ Sbjct: 133 LLHLYPRTLRGHSPITAEGGHTLFAAHEFAVGTAAIVYSRAGAKAVLETAVPIYAPWDVH 192 Query: 180 MKHWWEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLVRKP------TFSPLYFYRNTCYQ 232 +K + S P + + + R K + +R + Sbjct: 193 LKEVTARHGKSGAITPALIQTSDAPSEIAAYGARRSQIKGLRGWWNKRIRVVRHRAYAIR 252 Query: 233 WNLHYNA 239 L Y Sbjct: 253 NLLRYRL 259 >gi|33413662|gb|AAN08515.1| LgtB [Neisseria subflava] Length = 279 Score = 120 bits (300), Expect = 3e-25, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 71/240 (29%), Gaps = 22/240 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQRR 238 >gi|317009368|gb|ADU79948.1| family 25 glycosyl transferase [Helicobacter pylori India7] Length = 337 Score = 120 bits (300), Expect = 3e-25, Method: Composition-based stats. Identities = 65/336 (19%), Positives = 109/336 (32%), Gaps = 89/336 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRI--FSHQKR 54 I VY+ISL S R + + + F FDAI ++ + Sbjct: 2 ICVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAPS 60 Query: 55 QCQFKRL-------LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 Q L+LPE+GCY+SH LWK + +ILEDD Q L Sbjct: 61 LLQSDWYHSDISDGLTLPELGCYLSHYFLWKECVKTNQP-VVILEDDVALEFNLMQALED 119 Query: 108 LSKCDINNILI----------KFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------- 149 K + + + KF L K+ + +L + + R +S Sbjct: 120 CLKSPFDFVRLYGCYWYYHGTKFHVLPKEFVFPPFDSSLKNRSILEKCREMSNIFIGKLS 179 Query: 150 -------------------------------------------PRTTGYFIGKEAAIHLL 166 T GY++ + A + Sbjct: 180 SLSVYKCIHFALGFLKKIQRNYYPTHEKEAFFLEHFYLTSVYVASTAGYYLTPKGAKTFI 239 Query: 167 NVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV--------- 215 + P+DM + + H++ + P V + + DSTI++ + Sbjct: 240 EATERFKIIEPVDMFINNPTYHDVANFTYVPCPVSLSEHSLDSTIQKPQKKSLKSYPLPP 299 Query: 216 RKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 +K TF L+FY + ++ + K P+ T K Sbjct: 300 KKSTFRGLFFYSLNAKKRLKAFHKYSKQYAPLKTPK 335 >gi|153003252|ref|YP_001377577.1| glycosyl transferase family protein [Anaeromyxobacter sp. Fw109-5] gi|152026825|gb|ABS24593.1| glycosyl transferase family 25 [Anaeromyxobacter sp. Fw109-5] Length = 253 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 82/223 (36%), Gaps = 34/223 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR----IFSHQKRQC--QF 58 ++V++LP + R+ + R L + FF + + +++ ++ + + Sbjct: 15 IFVVTLPRALERQARVTSRLT--GLDYRFFHGFDRQQLDLDRLAREGVYAPERARRVDRH 72 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPHLSKCDINNIL 117 R L ++GC +SH L+++ + ++LEDD D+ QL LS+ + Sbjct: 73 GRALGAGQVGCSLSHRKLYEQAIENGWSRVLVLEDDVVARDDDLPQLTAALSQLPEAWDV 132 Query: 118 IKFD-------ALRKKPKKDSYLCTLPGNFDIHQPRI-----------------LSPRTT 153 + R + K+ SY+ +PR L T Sbjct: 133 LYLGYTNFETVTRRDRVKQASYVVLSALRLMKWRPREILRFHPRPFSPNLRCAGLHHCTH 192 Query: 154 GYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN-IPSLVTEP 195 Y A LL + + P D + H I + VTEP Sbjct: 193 AYAFTLAGAKKLLAAQTPVAFPADQLLVHLVLRGEIRAFVTEP 235 >gi|33413666|gb|AAN08518.1| LgtB [Neisseria polysaccharea] Length = 279 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 69/243 (28%), Gaps = 28/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + Sbjct: 59 SGVEKACFMSHTVLWKQALDESLPYIAVFEDDVLLGEGAEKFLAEDGWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPT 219 P+D M +P P + + S IE RL+ + Sbjct: 179 HPVDWMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQ 235 Query: 220 FSP 222 Sbjct: 236 QRR 238 >gi|304388481|ref|ZP_07370584.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis ATCC 13091] gi|21069186|gb|AAM33862.1|AF470659_2 LgtB [Neisseria meningitidis] gi|254670882|emb|CBA07399.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha153] gi|304337478|gb|EFM03644.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis ATCC 13091] Length = 275 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 72/251 (28%), Gaps = 22/251 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPPEGL 178 Query: 174 RPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 P+D+ M + +P P + + S IE RL+ + Sbjct: 179 HPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQWR 238 Query: 223 LYFYRNTCYQW 233 + Sbjct: 239 DSPANTFKRRL 249 >gi|311978056|ref|YP_003987176.1| putative glycosyltransferase [Acanthamoeba polyphaga mimivirus] gi|82000081|sp|Q5UQ62|YR655_MIMIV RecName: Full=Putative glycosyltransferase R655 gi|55417266|gb|AAV50916.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308204975|gb|ADO18776.1| putative glycosyltransferase [Acanthamoeba polyphaga mimivirus] Length = 324 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 51/225 (22%), Positives = 83/225 (36%), Gaps = 19/225 (8%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + + VI+L RRE + ++ Q+ FF+A GE + I + Sbjct: 3 NLKIIVINLKRRTDRREIMEKKFQDENITQYEFFEAFDGETLRPEDPIL--GVFKHGVHG 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPA-IGAIILEDDADFSDEFSQLLPHLSKC---DINNI 116 L GC +SH +W++IA + ++LEDD +F F + L + K I Sbjct: 61 LSRKGVAGCALSHYTVWQKIAADTSGTKYLVLEDDINFKPNFKENLSKVMKTIEPSQAMI 120 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIH--QPRILSPRTTGYFIGKEAAIHLLNV-RKN-I 172 LI K + L ++ IH + GY + AA + ++ N I Sbjct: 121 LIGMTVNGDDVTKTRDIYELDTSYTIHPLGRDYYAGGLFGYILDYRAAQYFVDYISYNGI 180 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVY-----EAIDTNDSTIEES 212 ID + P + P VY + DS I+ Sbjct: 181 RIVIDYLT---YRSGFPMYESHPHLVYTQSVQHDGEHVDSDIQHQ 222 >gi|21069198|gb|AAM33871.1|AF470662_3 LgtH [Neisseria meningitidis] Length = 268 Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|38569122|gb|AAR24271.1| glycosyltransferase [Shigella boydii] Length = 261 Score = 119 bits (298), Expect = 4e-25, Method: Composition-based stats. Identities = 51/228 (22%), Positives = 84/228 (36%), Gaps = 23/228 (10%) Query: 5 VYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENN--PICNRIFSHQKRQCQFKRL 61 + ++SL + RR K L FSFF+ IYG+ + I+S + Q R Sbjct: 6 IKILSLADAIERRRKVSREFELKSVLPFSFFNGIYGKTLSKEKLDTIYSSELAQKVLHRQ 65 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL----SKCDINNIL 117 L+ EIG SH +++ ++LEDD F +L + + D I Sbjct: 66 LTAGEIGATYSHYLIFREAYERGDEFVVVLEDDCYIDRNFDTVLTSILDKKNPTDDEIIF 125 Query: 118 IKFDALRKKPKKDSYLCTLPG-NFDIHQPR---ILSPRTTGYFIGKEAAIHLLNVRKNIY 173 I+ S N+ +H+ GY + + A +++ Y Sbjct: 126 IQRHTSENSHIIRSLGSEKINKNYTLHRMLGSAQYFVGAYGYIVTRAAMKKMIDT----Y 181 Query: 174 RPIDMDMKHWW----EHNIPSL-VTEPGAVY---EAIDTNDSTIEESR 213 PI HW+ + I V P V+ E + DS ++ R Sbjct: 182 LPIYCVCDHWYFIKKKCQIEKFSVLSPAIVFTNDEDVRKIDSFVDIER 229 >gi|85059128|ref|YP_454830.1| hypothetical protein SG1150 [Sodalis glossinidius str. 'morsitans'] gi|84779648|dbj|BAE74425.1| conserved hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 248 Score = 119 bits (298), Expect = 4e-25, Method: Composition-based stats. Identities = 52/256 (20%), Positives = 102/256 (39%), Gaps = 17/256 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +++I+L S RR+ ++ R +L + F +A+ G + + + Sbjct: 1 MKIFIINLKESFRRRQCIEYQCRRYNLDYEFTEAVNGHSLSSEEIV----QHTRTINYAT 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 EIGC +SHI++++ I A+ILEDDA + E L+ ++ + N + Sbjct: 57 RPGEIGCALSHIYIYRLICERGLEQALILEDDAKITPEAIPLIQQIALRNDPNRPLLTLL 116 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD--M 180 + L +H+ + T GY I + AA ++ ++ D Sbjct: 117 TTCNQYLKKPIHVLDKTHSLHRVIDAAM-THGYVINRAAARNMAAALYPVWMVADKFSLF 175 Query: 181 KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTC--YQWNL--H 236 + + + + P + +I S I + KPT + L ++N W + Sbjct: 176 QDYSLCQVRA--VVPPVILHSIHAGQSNIAST---PKPTATRLAIWQNLKHHMPWRVKCK 230 Query: 237 YNAWR-KDLPPVSTTK 251 ++ WR P +S K Sbjct: 231 HSLWRLLVRPFLSIVK 246 >gi|254672587|emb|CBA06268.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha275] Length = 268 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGARGIPFQFFDALM-PSEALEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|148244950|ref|YP_001219644.1| glycosyl transferase family protein [Candidatus Vesicomyosocius okutanii HA] gi|146326777|dbj|BAF61920.1| glycosyltransferase, family 25 [Candidatus Vesicomyosocius okutanii HA] Length = 258 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 73/200 (36%), Gaps = 12/200 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + ++VI+L + RR+ + +R+ L + +A + ++QK ++R + Sbjct: 1 MDIFVINLSSAIKRRQFQEQQLSRLKLDYQITNATTISDINNK----TYQKHHYDWQRPM 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E+ CY SH LW +I A+ILEDD S +L L K N ++ + Sbjct: 57 QKTEVACYFSHQTLWSKIVKKN-RMALILEDDVLLSKHTPSILTKLEKYT-NFDMVNLEV 114 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 +K + G+ + Y + A LL ++ Sbjct: 115 FNRKKYVAKKSQPI-GHHQLFYLYQDKAGAAAYVLYPNGAKKLLQQQQKSGIA---LADA 170 Query: 183 WWEH--NIPSLVTEPGAVYE 200 + + S EP + Sbjct: 171 HLHNCLELQSYQIEPACAVQ 190 >gi|322378457|ref|ZP_08052909.1| putative lipopolysaccharide biosynthesis protein [Helicobacter suis HS1] gi|321149101|gb|EFX43549.1| putative lipopolysaccharide biosynthesis protein [Helicobacter suis HS1] Length = 296 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. Identities = 53/277 (19%), Positives = 92/277 (33%), Gaps = 38/277 (13%) Query: 2 PIPVYVISLPFSHARREKFCHR-----AARIHLQFSF--FDAIYGENN---------PIC 45 + +YVISL S R ++F+F FDAI ++ + Sbjct: 6 SLDIYVISLKNS-KRYATCQAVVDNPPLNNHPIKFTFHMFDAIDKHSHYFKKDLGYTGLI 64 Query: 46 NRIFSHQKRQCQF------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD 99 +++ + + R + E+GCY S LW + S ILEDD Sbjct: 65 ENVYNPKWLKGNAWFESYDGREMLPEELGCYASQYMLWIKCIESNKP-ICILEDDFGLQH 123 Query: 100 EFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN------------FDIHQPRI 147 F + L + + + S + T G F H Sbjct: 124 NFYESLIDCLSSPFDLVRLWAGFSYGVDGFTSIISTQHGEQVCLTDPEKEIIFLKHFYFS 183 Query: 148 L--SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTN 205 + + Y+I +AA L+++ + Y P+D H +P ++ P +V Sbjct: 184 VLDTYGNMAYYITPKAAKILVSLSLHWYEPVDQFFNKVHAHKLPCMLYIPFSVLPDTHNT 243 Query: 206 DSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 STI P+Y + W + + R+ Sbjct: 244 QSTIYSQENATLLKNDPIYKKKTKVSYWRILFEKIRR 280 >gi|325199018|gb|ADY94474.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtH [Neisseria meningitidis G2136] Length = 268 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLDEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|71403195|ref|XP_804423.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70867388|gb|EAN82572.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 305 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 82/235 (34%), Gaps = 31/235 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNRI------------F 49 VYVI+L R + + R ++ F A+ G I Sbjct: 56 VYVINLNRRPDRWKFILRQLQRAGFREGEYERFTAVDGRAVDIQKAQACGLVSLLGILRL 115 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-L 108 ++ + + L+ +GC +SH+ LW IA +++EDD+ F +F + + Sbjct: 116 REEESRRIWGMDLNPAAVGCALSHVLLWATIAARRYHRVLVVEDDSLFPHDFKKNYDERM 175 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + L+ + + + S L G + + + T Y + + A LL + Sbjct: 176 RHVPQDWELV-YVSGLDTANQASQLRVAEG---VSRVPQMHRTTNCYVVTHQGARRLLEL 231 Query: 169 RKNIYRPIDMDMKHWW----EHNIP------SLVTEPGAVYEAIDTNDSTIEESR 213 + +D M + +P +P V +A S I+ S Sbjct: 232 CLPMTYQLDTMMTMHAVSDPQSGVPHVTSPVCYTLQPPLVVQAT-RMGSDIQSSN 285 >gi|89053315|ref|YP_508766.1| glycosyl transferase family protein [Jannaschia sp. CCS1] gi|88862864|gb|ABD53741.1| glycosyl transferase family 25 [Jannaschia sp. CCS1] Length = 276 Score = 118 bits (297), Expect = 6e-25, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 71/210 (33%), Gaps = 19/210 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIY-----GENNPICNRIFSHQKRQCQ 57 IP +VI+L + AR+ + +Q F DA+ G F Sbjct: 13 IPAFVINLKRAAARQVIMRETCSAAGVQVHFVDAVDSQTDEGRAAVDRLPDFGPWGVFHG 72 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R C +SH +PA +ILEDD + + + + N + Sbjct: 73 HDR-------ACTLSHYKALVAFLETPATHGLILEDDVFLASDLGEWMEDALWWPANADV 125 Query: 118 IKFDALR--KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP 175 +KF+ R K T G+ + + GY + + AA +L V K P Sbjct: 126 VKFERWRDDKLLVALEKRTTSVGDRMLRVLQSRHSGGAGYAVTRAAAERILAVAKP-DVP 184 Query: 176 IDMDMKHW----WEHNIPSLVTEPGAVYEA 201 ID + + + + P + + Sbjct: 185 IDHLLFNPNVSRLARGLTTYQVMPAMIVQG 214 >gi|21654768|gb|AAK85144.1| hypothetical protein [Neisseria meningitidis] gi|21654779|gb|AAK85140.1| hypothetical protein [Neisseria meningitidis] gi|325141521|gb|EGC63991.1| Lacto-N-neotetraose biosynthesis glycosyltransferase LgtH [Neisseria meningitidis 961-5945] Length = 268 Score = 118 bits (297), Expect = 6e-25, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|21069201|gb|AAM33873.1|AF470663_2 LgtH [Neisseria meningitidis] Length = 268 Score = 118 bits (297), Expect = 6e-25, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALM-PSEALEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|309379537|emb|CBX21903.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 279 Score = 118 bits (296), Expect = 7e-25, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 69/243 (28%), Gaps = 28/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPT 219 P+D M +P P + + S IE RL+ + Sbjct: 179 HPVDWMMFGNPDDRE---GMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQ 235 Query: 220 FSP 222 Sbjct: 236 QRR 238 >gi|254804258|ref|YP_003082479.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha14] gi|254667800|emb|CBA03768.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis alpha14] Length = 268 Score = 118 bits (296), Expect = 7e-25, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQVMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|62002235|gb|AAX58718.1| LgtH [Neisseria meningitidis] gi|325131650|gb|EGC54356.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtH [Neisseria meningitidis M6190] gi|325137513|gb|EGC60096.1| lacto-N-neotetraose biosynthesis glycosyltransferase LgtH [Neisseria meningitidis ES14902] Length = 268 Score = 118 bits (296), Expect = 8e-25, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLDEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|315453550|ref|YP_004073820.1| lipopolysaccharide biosynthesis protein [Helicobacter felis ATCC 49179] gi|315132602|emb|CBY83230.1| lipopolysaccharide biosynthesis protein [Helicobacter felis ATCC 49179] Length = 285 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 53/259 (20%), Positives = 90/259 (34%), Gaps = 41/259 (15%) Query: 4 PVYVISLPFSHAR--------REKFCHRAARIHLQFSFFDAIYGENN------------P 43 P+Y+I L S AR ++A + F A++G+ + P Sbjct: 9 PIYIIHLDNS-ARDITPLLWHLNYLLKKSAHKNFHIEVFKAVHGQKDYHNKGITFKHTHP 67 Query: 44 ICNRIFSHQKRQCQFKRLL-----------SLPEIGCYISHIHLWKRIAYSPAIGAIILE 92 + N ++ Q L+ SL ++GC+ SH LW+ I+LE Sbjct: 68 VLNDFAPQTPKRTQVLELVRLALRMRVNFQSLGQLGCFASHFLLWQECVKLQRP-IIVLE 126 Query: 93 DDADFSDEFSQ--LLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP 150 DD + +F + L + +++ L K+ L +FD Sbjct: 127 DDILPTFDFYEKCALGLEAINTGLAQVVRLYTLY--FKRHQIKSALTHDFDWIFSPYGGM 184 Query: 151 RTTGYFIGKEAAIHLLNVRKN-IYRPIDMDMKHWWEHNIPSLVTEPGA--VYEA-IDTND 206 T GY + + A LL P+D M + H++ +L + A V A Sbjct: 185 GTQGYVLSPQGAQKLLEHCPPQWILPVDTYMDAHYTHHVKTLTLKSHALLVEPAPTQVPH 244 Query: 207 STIEESRLVRKPTFSPLYF 225 + I+ R K L Sbjct: 245 ANIQNFRGKLKALSWLLRL 263 >gi|158302599|ref|XP_561137.5| Anopheles gambiae str. PEST AGAP012933-PA [Anopheles gambiae str. PEST] gi|157021089|gb|EAL42272.3| AGAP012933-PA [Anopheles gambiae str. PEST] Length = 330 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 61/161 (37%), Gaps = 14/161 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 +Y+I+L RR K + L F A+ G+ F K Sbjct: 175 IYMINLERRTERRTKMLKHFDLLGLDVEHFPAVDGKQLSDKKVYDMGIRFLPGYADPFHK 234 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 R +++ EIGC++SH ++W+R+ ++LEDD F F + +L + Sbjct: 235 RPMTMGEIGCFLSHYNIWERMVRLNQQEVLVLEDDIRFEPFFRRRAYGVLADARRIG-GW 293 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 LI R + + + ++ + S T GY Sbjct: 294 DLIYIGRKRLQEEDEKWI----DGSEYLVKAGYSYWTLGYV 330 >gi|315125453|ref|YP_004067456.1| glycosyl transferase family 25 [Pseudoalteromonas sp. SM9913] gi|315013966|gb|ADT67304.1| glycosyl transferase family 25 [Pseudoalteromonas sp. SM9913] Length = 172 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 42/171 (24%), Positives = 66/171 (38%), Gaps = 7/171 (4%) Query: 74 IHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYL 133 + LW+++ ++LEDD ++ K N LIK R P DS Sbjct: 1 MRLWQKMVNENMPFCVVLEDDLFIEASLKDVVDAAIKLK-NWDLIKLSDNRNFPFIDS-- 57 Query: 134 CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVT 193 L N + + T GY I A LL RK +RP+D+DM+ E + + Sbjct: 58 APLENNLTVGNYKKAPNGTQGYIISLSGAKKLL-QRKPFFRPVDVDMQFHTEVGLSMIGI 116 Query: 194 EPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDL 244 +P + E + S I + S F RN ++ ++H K Sbjct: 117 KPYPIAED-RSFISEITSANAGSHSNRS--TFIRNLIHRSSIHMQRRYKTA 164 >gi|309379534|emb|CBX21900.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 268 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 79/263 (30%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGERAEKFLDEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|301630121|ref|XP_002944176.1| PREDICTED: glycosyltransferase 25 family member 3-like [Xenopus (Silurana) tropicalis] Length = 590 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 50/245 (20%), Positives = 86/245 (35%), Gaps = 38/245 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +++I+L RRE+ + DAI G + + Sbjct: 322 IFLINLRRRPERRERMLRSLYEQEISCHVIDAIDGGAMNSSDIKHLGVNLLPGYYDPFSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----QLLPHLSKCDINN 115 R L+ E+GC++SH +WK I A++ EDD F F +L+ + +++ Sbjct: 382 RTLTKGEVGCFLSHFQIWKEITERQLDTAVVFEDDVRFQPFFKRKMIRLMGDIRSAELDW 441 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-- 173 LI + +K + + S T Y I + A LLN + + Sbjct: 442 DLI---YIGRKQVTLDPEQPVENVPHLVVA-DYSYWTLCYIISLQGAQKLLN-AEPLSKM 496 Query: 174 RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEE 211 P+D + +H+ N+ P VY DT ST+ + Sbjct: 497 LPVDEFLPIMSDTHPNKEYKQHFVNRNLRVFSVHPLLVYPTHYTGETSWQSDTETSTLWD 556 Query: 212 SRLVR 216 + VR Sbjct: 557 NDNVR 561 >gi|55960746|emb|CAI14721.1| glycosyltransferase 25 domain containing 2 [Homo sapiens] Length = 234 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 50/232 (21%), Positives = 90/232 (38%), Gaps = 35/232 (15%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHL 108 R R L+ EIGC++SH +WK + +++EDD F +F + L ++ Sbjct: 5 YRDPYSSRPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMDNI 64 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + ++ LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 65 DQAQLDWELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-G 120 Query: 169 RKNIY--RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDT 204 P+D + +++ ++ + EP +Y DT Sbjct: 121 ANPFGKMLPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDT 180 Query: 205 NDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 STI ++ V T + Q ++ NA + LPP ++ +PS Sbjct: 181 ETSTIWDNETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 230 >gi|322381022|ref|ZP_08055058.1| lipopolysaccharide biosynthesis protein [Helicobacter suis HS5] gi|321146544|gb|EFX41408.1| lipopolysaccharide biosynthesis protein [Helicobacter suis HS5] Length = 296 Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats. Identities = 52/277 (18%), Positives = 91/277 (32%), Gaps = 38/277 (13%) Query: 2 PIPVYVISLPFSHARREKFCHR-----AARIHLQFSF--FDAIYGENN---------PIC 45 + +YVISL S R ++F+F FDAI ++ + Sbjct: 6 SLDIYVISLKNS-KRYATCQAVVDNPPLNNHPIKFTFHMFDAIDKHSHYFKKDLGYTGLI 64 Query: 46 NRIFSHQKRQCQF------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD 99 +++ + + R + E+GCY S LW + S ILEDD Sbjct: 65 ENVYNPKWLKGNAWFESYDGREMLAEELGCYASQYMLWIKCIESNKP-ICILEDDFGLQH 123 Query: 100 EFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGN------------FDIHQPRI 147 F + L + + + S + T G F H Sbjct: 124 NFYESLIDCLSSPFDLVRLWAGFSYGVDGFTSIISTQHGEQVCLTDPEKEIIFLKHFYFS 183 Query: 148 L--SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTN 205 + + Y+I +AA L+++ + P+D H +P ++ P +V Sbjct: 184 VLDTYGNMAYYITPKAAKILVSLSLHWCEPVDQFFNKVHAHKLPCMLYIPFSVLPDTHNT 243 Query: 206 DSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 STI P+Y + W + + R+ Sbjct: 244 QSTIYSQENATLLKNDPIYKKKTKVSYWRILFEKIRR 280 >gi|193785321|dbj|BAG54474.1| unnamed protein product [Homo sapiens] Length = 234 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 50/232 (21%), Positives = 90/232 (38%), Gaps = 35/232 (15%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHL 108 R R L+ EIGC++SH +WK + +++EDD F +F + L ++ Sbjct: 5 YRDPYSSRPLTRGEIGCFLSHYSVWKEVIDRELEKTLVIEDDVRFEHQFKKKLMKLMVNI 64 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + ++ LI RK+ + +P ++ + S T GY I E A L+ Sbjct: 65 DQAQLDWELIYIG--RKRMQVKEPEKAVPNVANLVEA-DYSYWTLGYVISLEGAQKLV-G 120 Query: 169 RKNIY--RPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA---------IDT 204 P+D + +++ ++ + EP +Y DT Sbjct: 121 ANPFGKMLPVDEFLPVMYNKHPVAEYKEYYESRDLKAFSAEPLLIYPTHYTGQPGYLSDT 180 Query: 205 NDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR-KDLPPVSTTKFLPS 255 STI ++ V T + Q ++ NA + LPP ++ +PS Sbjct: 181 ETSTIWDNETVA--TDWDRTHAWKSRKQSRIYSNAKNTEALPPPTSLDTVPS 230 >gi|121635583|ref|YP_975828.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis FAM18] gi|120867289|emb|CAM11060.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis FAM18] Length = 268 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|21069175|gb|AAM33854.1|AF470656_3 LgtH [Neisseria meningitidis] gi|21069179|gb|AAM33857.1|AF470657_3 LgtH [Neisseria meningitidis] Length = 268 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|240119061|ref|ZP_04733123.1| glycosyltransferase [Neisseria gonorrhoeae PID1] Length = 173 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 55/167 (32%), Gaps = 7/167 (4%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VISL + RR + F FFDA+ R + LS Sbjct: 3 HVISLASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYLSGV 60 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----LIKF 120 E C++SH LWK+ + EDD F + + L + + +++ Sbjct: 61 EKACFMSHAVLWKQALDEGLPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFIVRL 120 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 + + K N T GY I +EA L+ Sbjct: 121 ETMFAKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLD 167 >gi|261335009|emb|CBH18003.1| glycosyl transferase-like protein, putative [Trypanosoma brucei gambiense DAL972] Length = 307 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 77/233 (33%), Gaps = 31/233 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNR------------IF 49 VYVI+L R E + +R ++ F A+ G N + Sbjct: 47 VYVINLDRRPDRWEFATQQLSRAGFLHEEYVRFPAVDGRNVDLQKAHACGLISRLGLLRL 106 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-L 108 + + + L+ +GC +SH LW +IA S +++EDD+ F +F Sbjct: 107 QEPEHRRIWGMDLNPGAVGCALSHALLWAQIAASRHRSVLVVEDDSLFPQDFHHKYEARA 166 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + L+ + + + L G + Q T Y + A LL+ Sbjct: 167 KQVPSDWELL-YVSGLDTAGQAPQLRVAEGVCRVPQMHRT---TNCYVVSHRGARLLLDA 222 Query: 169 RKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + +D M + +P V +A S I+ Sbjct: 223 CFPVTYQLDTMMTLRTAGDGAGGVPYVTVPNCYTLQPPLVVQAT-RMGSDIQS 274 >gi|74024994|ref|XP_829063.1| glycosyl transferase [Trypanosoma brucei TREU927] gi|70834449|gb|EAN79951.1| glycosyl transferase-like protein [Trypanosoma brucei] Length = 307 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 77/233 (33%), Gaps = 31/233 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNR------------IF 49 VYVI+L R E + +R ++ F A+ G N + Sbjct: 47 VYVINLDRRPDRWEFATQQLSRAGFLHEEYVRFPAVDGRNVDLQKAHACGLISRLGLLRL 106 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-L 108 + + + L+ +GC +SH LW +IA S +++EDD+ F +F Sbjct: 107 QEPEHRRIWGMDLNPGAVGCALSHALLWAQIAASRHRSVLVVEDDSLFPQDFHHKYEARA 166 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + L+ + + + L G + Q T Y + A LL+ Sbjct: 167 KQVPSDWELL-YVSGLDTAGQAPQLRVAEGVCRVPQMHRT---TNCYVVSHRGARLLLDA 222 Query: 169 RKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 + +D M + +P V +A S I+ Sbjct: 223 CFPVTYQLDTMMTLRTAGDGAGGVPYVTVPNCYTLQPPLVVQAT-RMGSDIQS 274 >gi|152978999|ref|YP_001344628.1| glycosyl transferase family protein [Actinobacillus succinogenes 130Z] gi|150840722|gb|ABR74693.1| glycosyl transferase family 25 [Actinobacillus succinogenes 130Z] Length = 257 Score = 116 bits (292), Expect = 3e-24, Method: Composition-based stats. Identities = 49/234 (20%), Positives = 86/234 (36%), Gaps = 26/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKR 60 + Y+ISL RR+ F + F F+AI + F + ++R Sbjct: 1 MKKYLISLEKDVQRRKLFFSQPDTS--DFEVFNAINTMELSQAELEKNFDFAAFKKAYRR 58 Query: 61 LLSLPEIGCYISHIHLWKRI----AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 ++ EIGC +SH++++K++ +I EDDA F+++F++ L L + ++ Sbjct: 59 DVTKGEIGCTLSHLNVYKKVTEDETIEEEDYVLICEDDALFAEKFNENLTALLRQNLTAD 118 Query: 117 LI-----KFDALRKK------PKKDSYLCTLPGN--FDIHQPRI-LSPRTTGYFIGKEAA 162 +I K + K P L G + P T Y I K A Sbjct: 119 IILAGQSKIPSFDDKELAVNYPTTFKCLQKSVGGTPYRYAYPYQNYFAGTVAYLIKKSAC 178 Query: 163 IHLLNVRKNIYRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + ++ D + + LV P E + S +E R Sbjct: 179 RRFGQEVERLHLAHWLADDFILFGKSFGLDILVVRPLMAIENPN-LTSNLEALR 231 >gi|260797405|ref|XP_002593693.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae] gi|229278921|gb|EEN49704.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae] Length = 384 Score = 116 bits (292), Expect = 3e-24, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 51/123 (41%), Gaps = 9/123 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC-----NRIFSHQKRQCQFK 59 ++VI+L RR++ H I L F +A+ G + + Sbjct: 262 IFVINLKRRPERRKRMVHTLKEIGLDFKLMEAVDGLTLNASVLKKMGVTVLPEYKDPWAD 321 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL----LPHLSKCDINN 115 R +++ EIGC++SH +W+ I ++ EDD F F + L + + ++ Sbjct: 322 RSMTMGEIGCFLSHYKIWEEIVEKNLDWVLVFEDDIRFEPFFKRRMYKMLNEIEEIRLDW 381 Query: 116 ILI 118 L+ Sbjct: 382 DLM 384 >gi|254476656|ref|ZP_05090042.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] gi|214030899|gb|EEB71734.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] Length = 256 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 75/187 (40%), Gaps = 7/187 (3%) Query: 20 FCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWK 78 + A L + A+ G R+ ++ + + + L+ E+GC++SH WK Sbjct: 1 MKTQLAAAGLYNVTRISAVDGRT---VGRVPEYRPHEWRSRWELTPGELGCFLSHRTAWK 57 Query: 79 RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 ++ S + +ILEDD +F + + ++ L+K D L P + L Sbjct: 58 QVVASQEV-TVILEDDILLCSQFGETIDAVAGRLGGKDLVKLDWL-DMPIRRGALPANDS 115 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 ++ + P + Y + EAA LL+ + +D + +H + E Sbjct: 116 GAELSRLLQPVPSSAAYVLTPEAAEVLLHRSQAFCDTVDDFIFSQQKH-LSLFQLERPIC 174 Query: 199 YEAIDTN 205 +A ++ Sbjct: 175 MQAQNSG 181 >gi|21069203|gb|AAM33874.1|AF470664_1 LgtH [Neisseria meningitidis] gi|21069207|gb|AAM33877.1|AF470665_3 LgtH [Neisseria meningitidis] Length = 268 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + G GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYGGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|62002239|gb|AAX58721.1| LgtH [Neisseria meningitidis] Length = 268 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SSVEKACFMSHVVLWKQALDEGVPYVAVFEDDVLLGEGAEKFLDEDAWLQERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|261400604|ref|ZP_05986729.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] gi|33413659|gb|AAN08513.1| LgtH [Neisseria lactamica] gi|269209687|gb|EEZ76142.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] Length = 268 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLSTAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEQFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|171911493|ref|ZP_02926963.1| hypothetical protein VspiD_09975 [Verrucomicrobium spinosum DSM 4136] Length = 279 Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 69/198 (34%), Gaps = 33/198 (16%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ----- 57 +P +VI+L + R +F + + + F+AI + + +H+ + Sbjct: 41 LPTFVINLARATERLAEFEVGSQDHGWEVAIFNAIDRRDLKVAESSPTHKVVLHRDDLGI 100 Query: 58 ---------FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 K L L I C +SH+ LW+++ A I EDDA + Sbjct: 101 RLCNIPDRAGKLSLHLGHIACALSHMTLWQKVVAEELPHACIFEDDAQIVRPWWH----- 155 Query: 109 SKCDINNILIKFD---------ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGK 159 I ++ K D++L P + P T Y + + Sbjct: 156 QPWPATADFIFLSNRVPGLIPQHIQSTDKLDAHLAAHP-----YIPLAPGCGTEAYIVTQ 210 Query: 160 EAAIHLLNVRKNIYRPID 177 + A L + +Y P+D Sbjct: 211 QGARKALAIMAEMYLPVD 228 >gi|256081803|ref|XP_002577157.1| cerebral cell adhesion molecule related [Schistosoma mansoni] gi|238662453|emb|CAZ33394.1| cerebral cell adhesion molecule related [Schistosoma mansoni] Length = 680 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 68/164 (41%), Gaps = 6/164 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 +Y I+L RR+K + ++ + + AI G++ + + K+ + Sbjct: 445 IYFINLLRRPDRRQKMEYMLHQLGINAKHYAAIDGKDLTMKHFDELEIKQLPGYTDPYHN 504 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L EIGC++SH ++W + + +ILEDD F+ F + L + +N+ Sbjct: 505 RSLKFGEIGCFLSHYNIWIEMINNGYNRILILEDDLRFAPAFVRNLNKVINEADDNV-AN 563 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAI 163 +D L K+ S N + T GY + + A Sbjct: 564 WDLLYIGRKRMSKSEKRVPNTTSLTYPDYTYWTLGYILNRNGAE 607 >gi|71652750|ref|XP_815025.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70880049|gb|EAN93174.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 359 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 81/235 (34%), Gaps = 31/235 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNRI------------F 49 VYVI+L R + + R ++ F A+ G I Sbjct: 110 VYVINLNRRPDRWKFILRQLQRAGFREGEYERFTAVDGRAVDIQKAQACGLVSLLGILRL 169 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHL 108 ++ + + L+ +GC +SH+ LW IA +++EDD+ F +F + + Sbjct: 170 REEESRRIWGMDLNPAAVGCALSHVLLWATIAARRYHRVLVVEDDSLFPHDFQKKYDERM 229 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + L+ + + + S L G + + + T Y + A LL + Sbjct: 230 RHVPQDWELV-YVSGLDTADQTSQLRVAEG---VSRVPQMHRTTNCYVVTHHGARRLLEL 285 Query: 169 RKNIYRPIDMDMKHWW----EHNIP------SLVTEPGAVYEAIDTNDSTIEESR 213 + +D M + +P +P V +A S I+ S Sbjct: 286 CLPMTYQLDTMMTMHAVSDPQSGVPHVTSPVCYTLQPPLVVQAT-RMGSDIQSSN 339 >gi|33413663|gb|AAN08516.1| LgtH [Neisseria subflava] Length = 268 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAKERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN-----IL 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDTWLQERFNPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|296313942|ref|ZP_06863883.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria polysaccharea ATCC 43768] gi|296839482|gb|EFH23420.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria polysaccharea ATCC 43768] Length = 279 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 46/243 (18%), Positives = 70/243 (28%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLSSAAERRAHIAATFGAHGIPFQFFDALMPSER-LEQAMAELVLGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD S+ Q L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGTPYIAVFEDDVLLSEGAEQFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I K+A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAMRFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESR-LVRKP 218 PID M +P P + + S IE R L RK Sbjct: 179 HPIDWMMFGNPDDRE---RMPVFQLNPALCAQELHYAKFHDQNSALGSLIEHDRCLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|21240991|ref|NP_640573.1| glycosyltransferase [Xanthomonas axonopodis pv. citri str. 306] gi|21106278|gb|AAM35109.1| glycosyltransferase [Xanthomonas axonopodis pv. citri str. 306] Length = 269 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 51/269 (18%), Positives = 88/269 (32%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F A+ G F ++ Sbjct: 2 IKSYLINMQRSGDRLQAMSARLDALGLPFERIAAVDGATLTDEQIADFVRERPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLRSTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPG-NFDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K D+ + + G + + +P I ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLDAKVADVAGRHLRLVRPSRYQHCWPVCAGALIISRDAARLLLAAK 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMADIFLFGWSESPVAHQLSTYQLSPAACIQDKFFHAAAENIVFTSTIDAPLTHAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + R T + + P Sbjct: 242 KQRSTHWKARATA-LLRVAQGYRKVVFAP 269 >gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial [Saccoglossus kowalevskii] Length = 576 Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 58/140 (41%), Gaps = 9/140 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC-----QFK 59 +Y+I+L RRE+ + + + DA+ G+ + K Sbjct: 339 IYMINLLRRPDRRERMVAAFDLLGIDATLVDAVDGQELTDEKVLELGIKMLPGFADPYHG 398 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH ++W+ + ++LEDD F +F + L L + IN Sbjct: 399 RELTKGEIGCFLSHYNIWQEVVSKNYSKVMVLEDDVRFKFKFLERLNAMMAELKELHINW 458 Query: 116 ILIKFDALRKKPKKDSYLCT 135 ++ K+ +K + Sbjct: 459 DIVAILEETKEAQKFRKRQS 478 >gi|260433945|ref|ZP_05787916.1| putative glycosyl transferase [Silicibacter lacuscaerulensis ITI-1157] gi|260417773|gb|EEX11032.1| putative glycosyl transferase [Silicibacter lacuscaerulensis ITI-1157] Length = 280 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 63/194 (32%), Gaps = 6/194 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 V+ I+L RR + AR + A ++ L+ Sbjct: 12 VFYINLDRVPERRAFMEDQFARTGLIGAQRICATDASVPGALESTGYVPGSGTRWG--LT 69 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 EI C+ SH +W+ + S A I EDD + S + ++ L +K D Sbjct: 70 PSEIACFESHRTVWQHVVDSGLPAAAIFEDDVEMSVQAGSVIAELFLNTAAFDCVKLD-- 127 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 + T G+ + ++P Y + + A LL ++ +D + Sbjct: 128 YSPRSRRFGAQTHIGSVSVRPMMEMAPSAAAYVVSQSACRKLLEWSQSYSDHLDDFLTLP 187 Query: 184 WEHNIPSLVTEPGA 197 +N P Sbjct: 188 -RNNWKLFQVFPAV 200 >gi|33413667|gb|AAN08519.1| LgtH [Neisseria polysaccharea] Length = 268 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + + Sbjct: 59 SGVEKACFMSHTVLWKQALDESLPYIAVFEDDVLLGEGAEKFLAEDTWLEERFDKDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|261400608|ref|ZP_05986733.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] gi|33413657|gb|AAN08511.1| LgtB [Neisseria lactamica] gi|269209691|gb|EEZ76146.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria lactamica ATCC 23970] Length = 279 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 46/243 (18%), Positives = 75/243 (30%), Gaps = 29/243 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRAHIADTFGRHDIPFQFFDALMPS--EELNRVMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH+ LWK+ + EDD + + L + + Sbjct: 59 SEVEKACFMSHVVLWKQALDEGLPYIAVFEDDVLLGKDAEKFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMWFFLDRFAALPPEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESR-LVRKP 218 P+D M +P P + + S IE R L RK Sbjct: 179 HPVDWMMFGNPDDRE---RMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRCLNRKQ 235 Query: 219 TFS 221 Sbjct: 236 QRR 238 >gi|325268040|ref|ZP_08134686.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] gi|324980425|gb|EGC16091.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Kingella denitrificans ATCC 33394] Length = 271 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 87/256 (33%), Gaps = 22/256 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR R + F+FFDA+ + N + + I ++ + L Sbjct: 4 MNNFVISLVSAEERRRHIEREFGRNAIPFTFFDAVTPQTN-LNDCIRAYLPNLARVP-NL 61 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD----INNILI 118 + E C++S LWK+ + EDD S L +I Sbjct: 62 TDGEKACFMSQFLLWKKCVQEKMPYISVFEDDVYLSRHAGTFLASDEWLKRIGLSGRFII 121 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIYR 174 K + + P + C + ++ R Y + +AA L+ + Sbjct: 122 KLETV-NTPCRTEPFCRIDDGHTLNFLRSDHYGGGAYMLSYQAAEQLVRIVEGLSWKEIE 180 Query: 175 PIDMDM---------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF 225 P+D + ++ + P++ + D S +E +R + F Sbjct: 181 PVDHFLFDAGVYRFADFIYQLS-PAICIQEIIKSPDSDYMVSFLEPARKKKNSVKIKRTF 239 Query: 226 YRNTCYQWNLHYNAWR 241 + +W + R Sbjct: 240 WEKLGREWA-RWQKKR 254 >gi|149195374|ref|ZP_01872460.1| hypothetical protein CMTB2_04337 [Caminibacter mediatlanticus TB-2] gi|149134506|gb|EDM22996.1| hypothetical protein CMTB2_04337 [Caminibacter mediatlanticus TB-2] Length = 257 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 87/221 (39%), Gaps = 19/221 (8%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP----ICNRIFSHQKRQCQF 58 + +VI+L S RR ++ + ++++ FF A G N S +K ++ Sbjct: 1 MKTFVINLERSKDRRAYAKNQLDKFNVEYEFFKATDGSTLSDEWIENNLDESFKKEYEKY 60 Query: 59 K-RLLSLPEIGCYISHIHLWKRIAYS--PAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 K R ++ +GC SH LWK I + +ILEDD + +++ ++ + Sbjct: 61 KNRFITKGALGCADSHRRLWKYILENEINEEIFLILEDDFFLLKDTMKIINEIASVMKQH 120 Query: 116 IL----IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLN--V 168 L + + +L + + +++ P + + Y + K+ LL+ Sbjct: 121 DLEFVNLYYTSLEILELDKKNSIKVNNKYSLYKYPNQKTSNSLAYLVNKKGLKKLLSSQS 180 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 K D D + ++ + + P + + STI Sbjct: 181 SKITRMADDWDFEEI---SLNAWLVYPLPIVTSG--FTSTI 216 >gi|313669147|ref|YP_004049431.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria lactamica ST-640] gi|313006609|emb|CBN88074.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria lactamica 020-06] Length = 268 Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIAATFGARGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I K+A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|311977606|ref|YP_003986726.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga mimivirus] gi|82000136|sp|Q5UQC3|PLOD_MIMIV RecName: Full=Probable procollagen-lysine,2-oxoglutarate 5-dioxygenase; AltName: Full=Lysyl hydroxylase; Short=LH gi|55416853|gb|AAV50503.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga mimivirus] gi|308204267|gb|ADO18068.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga mimivirus] Length = 895 Score = 115 bits (289), Expect = 6e-24, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 79/237 (33%), Gaps = 35/237 (14%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIH-----LQFSFFDAIYGENNPICNRIFSHQKRQ 55 M YVI+L ++++ ++ L F+A+ G N +R Sbjct: 1 MISRTYVINLARRPDKKDRILAEFLKLKEKGVELNCVIFEAVDGNNPEHLSRFNFKIPNW 60 Query: 56 --CQFKRLLSLPEIGCYISHIHLWKRIAY-------SPAIGAIILEDDADFSDEFSQLL- 105 + ++ E+GC +SH +WK + ++LEDD F D F + Sbjct: 61 TDLNSGKPMTNGEVGCALSHWSVWKDVVDCVENGTLDKDCRILVLEDDVVFLDNFMERYQ 120 Query: 106 PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + S+ N L+ P ++ + T I +P S Y I + A Sbjct: 121 TYTSEITYNCDLLYLHRKPLNPYTETKIST-----HIVKPNK-SYWACAYVITYQCAKKF 174 Query: 166 LNVRK-NIYRPIDMDM-----------KHWWEH--NIPSLVTEPGAVYEAIDTNDST 208 +N P D + + + + I +P V + + + Sbjct: 175 MNANYLENLIPSDEFIPIMHGCNVYGFEKLFSNCEKIDCYAVQPSLVKLTSNAFNDS 231 >gi|218767502|ref|YP_002342014.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis Z2491] gi|15991379|gb|AAL12844.1|AF355194_2 LgtH [Neisseria meningitidis] gi|121051510|emb|CAM07804.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis Z2491] gi|319409764|emb|CBY90070.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB2 [Neisseria meningitidis WUE 2594] Length = 268 Score = 115 bits (288), Expect = 7e-24, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + T GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCVQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|145637011|ref|ZP_01792675.1| hypothetical protein CGSHiHH_04895 [Haemophilus influenzae PittHH] gi|145269869|gb|EDK09808.1| hypothetical protein CGSHiHH_04895 [Haemophilus influenzae PittHH] Length = 260 Score = 115 bits (288), Expect = 7e-24, Method: Composition-based stats. Identities = 47/249 (18%), Positives = 80/249 (32%), Gaps = 30/249 (12%) Query: 22 HRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-KRLLSLPEIGCYISHIHLWKRI 80 + ++ H+ F FFDA+ +P F Q+ L+ E GC ISH+ LW + Sbjct: 2 NEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNGTSLTGGEKGCLISHVALWHKC 57 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKFDALRKKPKKDSYLCT 135 I EDD + L + +I+ + ++ ++ Sbjct: 58 IQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDDIFIIRLETFLQETICEALPNP 117 Query: 136 LP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPIDMDMKHWWEHN--- 187 + D + T GY I AA +LL + KN+ PID + + + Sbjct: 118 VSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNMESNNIFPIDHLIFNRFLAGEEL 177 Query: 188 -----IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 P+L + + E DS +E R + + Sbjct: 178 MVYQLSPALCIQELQLNENESLLDSQLESERKN-------YRLAEKARKKKTWREKVYHI 230 Query: 243 DLPPVSTTK 251 P K Sbjct: 231 FTKPQRMLK 239 >gi|145638324|ref|ZP_01793934.1| hypothetical protein CGSHiII_06119 [Haemophilus influenzae PittII] gi|145272653|gb|EDK12560.1| hypothetical protein CGSHiII_06119 [Haemophilus influenzae PittII] Length = 260 Score = 115 bits (288), Expect = 7e-24, Method: Composition-based stats. Identities = 46/249 (18%), Positives = 78/249 (31%), Gaps = 30/249 (12%) Query: 22 HRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-KRLLSLPEIGCYISHIHLWKRI 80 + ++ H+ F FFDA+ +P F Q+ L+ E GC ISH+ LW + Sbjct: 2 NEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNGTSLTDGEKGCLISHVALWHKC 57 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKFDALRKKPKKDSYLCT 135 I EDD + L + +I+ + ++ + Sbjct: 58 IQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDDIFIIRLETFLQETICEELPNP 117 Query: 136 LP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPIDMDMKHWWEHN--I 188 + D + T GY I AA +LL + KN+ PID + + + + Sbjct: 118 VSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNMESNNIFPIDHLIFNRFLAGEEL 177 Query: 189 PSLVTEPGAVYEAIDTN------DSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 P + + N DS +E R + + Sbjct: 178 MVYQLSPALCIQEVQLNENESLLDSQLESERKN-------YRLAEKARKKKTWREKVYHI 230 Query: 243 DLPPVSTTK 251 P K Sbjct: 231 FTKPQRMLK 239 >gi|145642435|ref|ZP_01797990.1| hypothetical protein CGSHiR3021_02603 [Haemophilus influenzae R3021] gi|145272873|gb|EDK12764.1| hypothetical protein CGSHiR3021_02603 [Haemophilus influenzae 22.4-21] Length = 260 Score = 115 bits (288), Expect = 8e-24, Method: Composition-based stats. Identities = 47/249 (18%), Positives = 80/249 (32%), Gaps = 30/249 (12%) Query: 22 HRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-KRLLSLPEIGCYISHIHLWKRI 80 + ++ H+ F FFDA+ +P F Q+ L+ E GC ISH+ LW + Sbjct: 2 NEFSKHHVPFQFFDAV----SPSSQLDFLIQRLVPNLNGTSLTGGEKGCLISHVALWHKC 57 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKFDALRKKPKKDSYLCT 135 I EDD + L + +I+ + ++ ++ Sbjct: 58 IQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDDIFIIRLETFLQETICEALPNP 117 Query: 136 LP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPIDMDMKHWWEHN--- 187 + D + T GY I AA +LL + KN+ PID + + + Sbjct: 118 VSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNMESNNIFPIDHLIFNRFLAGEEL 177 Query: 188 -----IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 P+L + + E DS +E R + + Sbjct: 178 MVYQLSPALCIQELQLNENESLLDSQLESERKN-------YQLAEKARKKKTWREKVYHI 230 Query: 243 DLPPVSTTK 251 P K Sbjct: 231 FTKPQRMLK 239 >gi|297685475|ref|XP_002820323.1| PREDICTED: glycosyltransferase 25 family member 3-like [Pongo abelii] Length = 528 Score = 115 bits (287), Expect = 9e-24, Method: Composition-based stats. Identities = 29/137 (21%), Positives = 53/137 (38%), Gaps = 6/137 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-----RIFSHQKRQCQFK 59 V+VISL RRE+ + + DA+ G + Sbjct: 322 VFVISLARRPDRRERMLTSLWEMEISGQVVDAVDGRMLNSSAIRNLGVDLLPGYQDPYSG 381 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 R L+ E+GC++SH +W+ + ++ EDD F F L L + D+ + Sbjct: 382 RTLTKGEVGCFLSHHSIWEEVVARGLARVLVFEDDVRFESNFRGRLERLME-DVEAEKLS 440 Query: 120 FDALRKKPKKDSYLCTL 136 +++ ++ K + L Sbjct: 441 WESGIEQYKAHFWPRDL 457 >gi|166710181|ref|ZP_02241388.1| glycosyltransferase [Xanthomonas oryzae pv. oryzicola BLS256] Length = 269 Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 86/269 (31%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F A+ G F ++ Sbjct: 2 IKSYLINMQRSSDRLQAMSARFDALGLPFERIAAVDGATLTDEQIADFVRERPLEGSGDT 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRKWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDTLPALLASTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K + + + G + +P + ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLHAKVADVAGRQLRLVRPSTYQHCWPVCAGALIVSRDAARLLLAAQ 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMADIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHATAEDIVFTSTIDAPLTQAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KRRSTQWKAKATA-LLRVAQGYRKVVFAP 269 >gi|296313944|ref|ZP_06863885.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria polysaccharea ATCC 43768] gi|296839484|gb|EFH23422.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria polysaccharea ATCC 43768] Length = 268 Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 77/263 (29%), Gaps = 15/263 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHIADTFSSRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + Q L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGKDAEQFLAEDAWLKERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----Y 173 ++ + + GY I ++A L+ + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGHAFPLLESEHFGMAGYIISRKAMRFFLDRFAVLPSERL 178 Query: 174 RPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL-VRKPTFSPLYFYRNTC 230 + +D + + + P ++ +T S ++ R + SP + Sbjct: 179 KAVDWMLFSSFLDKGGMTVCQLTPALCIQS-ETLPSQLKNGRQESYRNRRSPKVLLKRAL 237 Query: 231 YQWNLHYNAWRKDLPPVSTTKFL 253 + R+ K L Sbjct: 238 GKIGREIERARERKRQKKLEKHL 260 >gi|294627346|ref|ZP_06705931.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|294667590|ref|ZP_06732804.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292598301|gb|EFF42453.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292602585|gb|EFF46022.1| glycosyltransferase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 269 Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 87/269 (32%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F AI G F ++ Sbjct: 2 IKSYLINMQRSGDRLQAMSARLDALGLPFERIAAIDGATLTDEQIADFVRERPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLRSTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K D+ + + G + +P I ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLDAKVADVAGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAK 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMADIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHAAAENIVFTSTIDAPLTHAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + R T + + P Sbjct: 242 KQRSTHWKTRATA-LLRVAQGYRKVVFAP 269 >gi|145630270|ref|ZP_01786051.1| dimethyladenosine transferase [Haemophilus influenzae R3021] gi|144984005|gb|EDJ91442.1| dimethyladenosine transferase [Haemophilus influenzae R3021] Length = 157 Score = 114 bits (285), Expect = 1e-23, Method: Composition-based stats. Identities = 37/146 (25%), Positives = 57/146 (39%), Gaps = 11/146 (7%) Query: 74 IHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIKFDALRKKPKKDSY 132 I ++ I + AIILEDDA S +++ L+K N LI D K + + Sbjct: 9 IRAYEHIVENKIESAIILEDDAIVSHFLKEIVQDALNKVKKNYDLIFLD--HGKARSYPF 66 Query: 133 LCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE 185 L + + R S T Y I A LLN I P D + Sbjct: 67 KKKLYEGYQLAHYRAPSKNSKRCIIYATAYLITLSGAKKLLNYAYPIRLPADYLTGLIQK 126 Query: 186 HNIPSLVTEPGAVYEAIDTNDSTIEE 211 + + EP V+ +++ DS I++ Sbjct: 127 TRVDTYGIEPPCVFRGLNS-DSEIDK 151 >gi|71149108|gb|AAZ29056.1| Lgt2 [Moraxella catarrhalis] Length = 207 Score = 113 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 72/200 (36%), Gaps = 21/200 (10%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+ E C++SH+ LW+++ I EDD ++ + + + +IK Sbjct: 10 RLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDAACFIKN-DWLYFEFDIIKL 68 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPI 176 + + + GN ++ + T GY I + A LL K+I Y I Sbjct: 69 ETQHELVHIGKSIH-HHGNRTLNPLKSTHVGTAGYIISQSGAKRLLEFIKSIDEYEYYAI 127 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEA---IDTNDSTIEESR------------LVRKPTFS 221 D M + L P ++ I +S +E+ R L K Sbjct: 128 DHVMFGAYLSKGKVLQLCPALCQQSDSQIKNLESQLEQDRKNHQYIYHANESLYTKLNKI 187 Query: 222 PLYFYRNTCYQWNLHYNAWR 241 FYR+ ++ A+R Sbjct: 188 VKRFYRSFGKRFFYITVAFR 207 >gi|289669872|ref|ZP_06490947.1| glycosyltransferase [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 269 Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 87/269 (32%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F A+ G F ++ Sbjct: 2 IKSYLINMQRSSDRLQAMSARFDMLGLPFERIAAVDGATLTAEQIADFIRERPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRTWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDALPTLLRSTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K D+ + + G + +P I ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLDAKVADVAGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAQ 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMTDIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHTAAEDIVFTSTIDAPLTHAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KQRSTHWKAKATA-LLRVAQGYRKVVFAP 269 >gi|159185030|ref|NP_355025.2| hypothetical protein Atu2058 [Agrobacterium tumefaciens str. C58] gi|159140304|gb|AAK87810.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 206 Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 55/164 (33%), Gaps = 3/164 (1%) Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL 117 R S PEIGCY+SH+ + A A+ILEDD F D+F + + + Sbjct: 2 HGRRTSPPEIGCYLSHVACANKFMTGDADIALILEDDVVFEDDFLDAIDEAVLNGNDWDI 61 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 ++ + K L + + Y + + A ++ + D Sbjct: 62 LRLTTVSNGRK--FAFRALSNGRSLAVALTREKGSGAYLVNRRAGKW-ISKLIPMRLAYD 118 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 + + + + P + D R+ R P + Sbjct: 119 IAFDLEYLSGLKAAFIYPLCATQDADGESQIQNNLRIYRLPRWR 162 >gi|973187|gb|AAC44086.1| glycosyl tranferase [Neisseria meningitidis] gi|1586803|prf||2204376C lgtE gene Length = 276 Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 77/241 (31%), Gaps = 26/241 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ NR+ + + LL Sbjct: 1 MQNHVISLASAAERRGHIAATFGVRGIPFQFFDALMPS--EELNRMMAELVPGLAKQHLL 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + E + + L + + + Sbjct: 59 SEVEKACFMSHAVLWKQALDEGLPYVAVFE----LGKDAEKFLAEDTWLEERFDKDSAFI 114 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN----VRKNIY 173 ++ + + K N T GY I +EA L + + Sbjct: 115 VRLETMFAKVIVRPDKVLNYENRSFPLLESEHWGTAGYIISREAMRFFLERFAVLPARVD 174 Query: 174 RPIDMDMKHWW--EHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTFSP 222 + +D M ++ + +P P + + S +E+ R + Sbjct: 175 KAVDWMMFTYFFDKEGMPVYQVNPALCTQELHYAKFLSKNSMLGSDLEKDREQERRHRRS 234 Query: 223 L 223 L Sbjct: 235 L 235 >gi|325925713|ref|ZP_08187089.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325926867|ref|ZP_08188149.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325542768|gb|EGD14228.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] gi|325543882|gb|EGD15289.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas perforans 91-118] Length = 271 Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats. Identities = 48/270 (17%), Positives = 84/270 (31%), Gaps = 26/270 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F AI G F + Sbjct: 2 IKSYLINMQRSGDRLQAMSARFDALGLPFERIAAIDGATLTDEQIADFVRDRPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSAGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLGSTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 +++ + + K D+ + + G + +P I ++AA LL + Sbjct: 122 RSIVRLEPSYNRIKLDAKVADVAGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAK 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + + Sbjct: 182 ARWHTMADIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHAAAENIVFTSTIDAPLTLTQ 241 Query: 218 PTFSPLYFYR-NTCYQWNLHYNAWRKDLPP 246 ++ + + P Sbjct: 242 AQKQRSTHWKAKATALLRVAQGYRKVVFAP 271 >gi|91777876|ref|YP_553084.1| hypothetical protein Bxe_B2251a [Burkholderia xenovorans LB400] gi|91690536|gb|ABE33734.1| Hypothetical protein Bxe_B2251a [Burkholderia xenovorans LB400] Length = 241 Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats. Identities = 48/226 (21%), Positives = 93/226 (41%), Gaps = 30/226 (13%) Query: 35 DAIYGENN--PICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILE 92 DA+ G + + ++ + ++ R ++ E+ C++SH +W++I + A++LE Sbjct: 3 DAVDGRSLAQSELDAVYDDAAARRRYGRSMTRAEVACFMSHRSVWRKIVETG-RAAVVLE 61 Query: 93 DDADFSDEFSQLLPHLSKCDI----NNILIKFDALRKKPKKDSYL------CTLPGNFDI 142 DDA F + + ++ ++ + +L+ LR+ +Y G I Sbjct: 62 DDAMLEPAFFERVLRANESELSAVADIVLLGRSKLRRTASAWTYFNEPLRHVIGVGGLRI 121 Query: 143 HQP-RILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW-W---EHNIPSLVTEPGA 197 P + + + GY+I AA L + P+ + W W E + P A Sbjct: 122 GVPFKQWTSGSVGYWISAHAACAALAYSEG---PLGALLDDWPWHRDEGGARVVELRPYA 178 Query: 198 VYEAIDTNDSTIEESRLVRKPT--------FSPLYFYRNTCYQWNL 235 V+E D S+IE+ R + PL R T +W++ Sbjct: 179 VWEDFDRLPSSIEQERRASIRSRSSWHDVALWPLRLAR-TAARWSV 223 >gi|289664998|ref|ZP_06486579.1| glycosyltransferase [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 269 Score = 112 bits (280), Expect = 6e-23, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 87/269 (32%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F A+ G F ++ Sbjct: 2 IKSYLINMQRSSDRLQAMSARFDMLGLPFERIAAVDGATLTAEQIVDFIRERPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRTWTASNIGCFLSHQAAWRIAADSADAYTAIFEDDMHLSDALPTLLRSTHWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K D+ + + G + +P I ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLDAKVADVAGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAQ 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMTDIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHTAAEDIVFTSTIDAPLTHAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KQRSTHWKAKATA-LLRVAQGYRKVVFAP 269 >gi|284007091|emb|CBA72366.1| conserved hypothetical protein [Arsenophonus nasoniae] Length = 270 Score = 112 bits (280), Expect = 6e-23, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 77/225 (34%), Gaps = 13/225 (5%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKR 60 P+++ISL RR + + + L F+ DAI G + + + + Sbjct: 8 SCPIFIISLKKDTDRRTQISYALSSQALPFTIVDAIEGSKLTQNDIAGLKYPYYDKAYGK 67 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 EI C +SH +++ I + A+ILEDD + S L+ L + + Sbjct: 68 PNGKNEIACSLSHQSVYQHIIKNDIEWALILEDDVIIDKKLSPLIKALEQGKSDLLKKDY 127 Query: 115 -NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP-----RTTGYFIGKEAAIHLLNV 168 +L + L + + ++ + RT Y I + ++ Sbjct: 128 VYLLGGQEGLNSRKRISLSFFNKITIANVTFRLLTYSSDKIFRTCCYLIHRSTCERIIAE 187 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + D + ++ + + + +S +E R Sbjct: 188 FARGFFVADSWGLLYKKNAVKGYYLAEIIKHPIVTAENSHLEADR 232 >gi|58583712|ref|YP_202728.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae KACC10331] gi|84625520|ref|YP_452892.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188575017|ref|YP_001911946.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae PXO99A] gi|58428306|gb|AAW77343.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae KACC10331] gi|84369460|dbj|BAE70618.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188519469|gb|ACD57414.1| glycosyltransferase [Xanthomonas oryzae pv. oryzae PXO99A] Length = 269 Score = 111 bits (279), Expect = 7e-23, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 85/269 (31%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F A+ G F ++ Sbjct: 2 IKSYLINMQRSSDRLQAMSARFDALGLPFERIAAVDGATLTDEQIADFVRERPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ S I EDD SD LL Sbjct: 62 FSTGPRKWTASNIGCFLSHQAAWRIATDSADAYTAIFEDDMHLSDTLPALLASTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K D+ + + G +P + ++AA LL + Sbjct: 122 NSIVRLEPSYNRIKLDAKVADVAGRQLRPVRPSTYQHCWPVCAGALIVSRDAARLLLAAQ 181 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 182 ARWHTMADIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHATAEDIVFTSTIDAPLTQAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KRRSTQWKAKATA-LLRVAQGYRKVVFAP 269 >gi|329122713|ref|ZP_08251291.1| lipooligosaccharide biosynthesis protein [Haemophilus aegyptius ATCC 11116] gi|327472587|gb|EGF18017.1| lipooligosaccharide biosynthesis protein [Haemophilus aegyptius ATCC 11116] Length = 161 Score = 111 bits (279), Expect = 8e-23, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 51/120 (42%), Gaps = 2/120 (1%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL RR+ + ++ F FFDAI + + F+ K L Sbjct: 31 YVISLTTEQKRRKHITEEFGKQNIPFEFFDAITPDIIEETAKKFNIT-FDRSSKATLCDA 89 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EI C +SHI LW + + I EDD + +LL ++ + ++K +A K Sbjct: 90 EIACALSHIVLWNFVLENNLDYINIFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK 148 >gi|78045758|ref|YP_361933.1| family 25 glycosyltransferase [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78034188|emb|CAJ21833.1| putative family 25 glycosyltransferase [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 318 Score = 111 bits (279), Expect = 8e-23, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 86/269 (31%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFK-- 59 I Y+I++ S R + R + L F AI G F ++ Sbjct: 51 IKSYLINMQRSGDRLQAMSARFDALGLPFERIAAIDGATLTDEQIAHFVRERPLEGSGDA 110 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 111 FSAGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLGSTDWLPGG 170 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 +++ + + K D+ + + G + +P I ++AA LL + Sbjct: 171 RSIVRLEPSYNRIKLDAKVADVAGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAK 230 Query: 170 KNIYRPIDMDMKHWWE----HNIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E H + + P A + STI+ + Sbjct: 231 ARWHTMADIFLFGWNESPVAHQLSTYQLSPAACIQDKFFHAAAENIVFTSTIDAPLTHAQ 290 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 291 KQRSTHWKAKATA-LLRVAQGYRKVVFAP 318 >gi|1944154|dbj|BAA19628.1| unnamed protein product [Aggregatibacter actinomycetemcomitans] Length = 221 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 71/196 (36%), Gaps = 21/196 (10%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDA 95 ++ + + +F+ K + + R ++ EIGC +SH+ ++++I A++ EDDA Sbjct: 3 KDWDVLSAVFNPVKFEQHYGRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDA 62 Query: 96 DFSDEFSQLLPHLSKCDINNILI-----------KFDALRKKPKKDSYLCTLPGNFDIHQ 144 F+ + L L +I + P ++L G Sbjct: 63 LFNKNLPENLTALLAQHCTADIILIGQSKIAGFDDMELEINYPTTFAFLRANVGAATYAY 122 Query: 145 PRI-LSPRTTGYFIGKEAAIHLLNVR---KNIYRPIDMDMKHWWEHNIPSLVTEPGAVYE 200 P T Y I K AA L + K + D + + + + V P V E Sbjct: 123 PYKSYFAGTVAYLIKKSAARTFLAILEQEKPFWLA-DDFLLFETQFKLNNNVVRPLMVIE 181 Query: 201 AIDTNDSTIEESRLVR 216 S +E R + Sbjct: 182 NPQLV-SNLEAVRGSK 196 >gi|5545312|dbj|BAA82527.1| ORF1 [Actinobacillus actinomycetemcomitans] Length = 221 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 71/196 (36%), Gaps = 21/196 (10%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS----PAIGAIILEDDA 95 ++ + +F+ K + + R ++ EIGC +SH+ ++++I A++ EDDA Sbjct: 3 KDWDALSAVFNPVKFEQHYGRKVTKGEIGCTLSHLEVYRQIVADEKVAEKDYALVCEDDA 62 Query: 96 DFSDEFSQLLPHLSKCDINNILIKF-----------DALRKKPKKDSYLCTLPGNFDIHQ 144 F+ + L L +I + P +L G+ Sbjct: 63 LFNKNLPENLTALLAQHCTADIILIGQSKIAGFDDAELEINYPTTFMFLRKKLGDVTYAY 122 Query: 145 PRI-LSPRTTGYFIGKEAAIHLLNVRK---NIYRPIDMDMKHWWEHNIPSLVTEPGAVYE 200 P T Y I K AA L + + + D + + + + V P V E Sbjct: 123 PYKSYFSGTVAYLIKKSAARTFLAILEQEPPFWLA-DDFLLFETQFKLNNNVVRPLMVIE 181 Query: 201 AIDTNDSTIEESRLVR 216 S +E +R + Sbjct: 182 NPQLV-SNLEAARGSK 196 >gi|21069206|gb|AAM33876.1|AF470665_2 LgtB [Neisseria meningitidis] Length = 279 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 65/235 (27%), Gaps = 28/235 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA----IHLLNVRKNIY 173 ++ + + T GY I ++A + Sbjct: 119 VRLETMFMHVLTSPSGVAEYCGRAFPLLESEHWGTAGYIISRKAMWFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRL 214 P+D M +P P + + S IE RL Sbjct: 179 HPVDWMMFGNPDDRE---RMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRL 230 >gi|325919939|ref|ZP_08181924.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325920508|ref|ZP_08182433.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325548991|gb|EGD19920.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] gi|325549555|gb|EGD20424.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas gardneri ATCC 19865] Length = 269 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 85/269 (31%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFK-- 59 I Y+I++ S R + R + + F A G P F ++ Sbjct: 2 IKSYLINMQRSSDRLQAMSARFQALGMPFERIPAFDGATLTPEQIADFVRERPLEGSGDG 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 62 FSTGPRKWTASNIGCFLSHQAAWRIAADSDDAYTAIFEDDMHLSDALPALLRSTEWLPAG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 N +++ + + K DS + + G + +P I ++AA LL Sbjct: 122 NSIVRLEPSYNRIKLDSKVADIGGRQLRLVRPSTYQHCWPVCAGALIISRDAARLLLAAD 181 Query: 170 KNIYRPIDMDMKHWWEH----NIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E + + P A + STI+ + Sbjct: 182 ARWHTMADIFLFGWNESPVAQQLSTYQLSPAACIQDKFFHNAPEQIVFTSTIDAPLTQAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KQRSAHWKAKATA-LLRMAQGYRKVVFAP 269 >gi|290989828|ref|XP_002677539.1| predicted protein [Naegleria gruberi] gi|284091147|gb|EFC44795.1| predicted protein [Naegleria gruberi] Length = 665 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 86/221 (38%), Gaps = 18/221 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNRIFSH----QKRQCQ 57 +YVI+L RR+K R + ++FF+ ++GE + IF+H + Sbjct: 387 IYVINLKIREDRRQKSIRRLNTLGGIFSNYTFFEPVFGEALTDQD-IFNHVSITTYKAIL 445 Query: 58 FKRLL-----SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 R L + +GCY++H ++WK + +I EDD + + + +++ + S Sbjct: 446 EGRSLDYQLSTKGGVGCYLTHTNIWKDMIEKGYERILIFEDDFEITSPYDEIMTYFSHLP 505 Query: 113 INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI 172 + F P + ++ + Y I + AA LL I Sbjct: 506 KKFD-VAFMYFSLFPGSGEFERY--NDYWMTTTATRVYGAASYIINRGAAEKLLEKAFPI 562 Query: 173 YRPIDMDMKHW--WEHNIPSLVTEPGAVYEAIDTNDSTIEE 211 +D + H+ + + ++ +E + + +++ Sbjct: 563 DLQLDAFINHYVTFTGGLRLYSSKLLFGHEITELFKTNVQD 603 >gi|316933392|ref|YP_004108374.1| glycosyl transferase family 25 [Rhodopseudomonas palustris DX-1] gi|315601106|gb|ADU43641.1| glycosyl transferase family 25 [Rhodopseudomonas palustris DX-1] Length = 247 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. Identities = 48/203 (23%), Positives = 67/203 (33%), Gaps = 11/203 (5%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 L + +FDA P + + R F R LS PEI SH + Sbjct: 1 MTAMLGNTSLDWQYFDAHTSLMCPGLHCDNAELPRH--FGRKLSPPEIAVCSSHAAILSD 58 Query: 80 IAYSP-AIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 S A ++LEDD F +F L +N L K K +L Sbjct: 59 FVESNQADYILVLEDDVIFDTDFP--LDRFCAFCASNGLEYVRLFGKHYAKAVHLGFFYE 116 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAV 198 I + T Y + + AA L ++I ID+ M +W +P P V Sbjct: 117 R-SIVRFTTSPAGTQAYLMSRMAARRFLESYRSIRATIDLAMDSFWITRLPIYSIFPYPV 175 Query: 199 YEAIDTND-----STIEESRLVR 216 E T+E L R Sbjct: 176 IERFSPTSIPMPADTVEVRGLDR 198 >gi|124008087|ref|ZP_01692786.1| LPS glycosyltransferase subfamily, putative [Microscilla marina ATCC 23134] gi|123986501|gb|EAY26307.1| LPS glycosyltransferase subfamily, putative [Microscilla marina ATCC 23134] Length = 269 Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 77/193 (39%), Gaps = 8/193 (4%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS-HQKRQCQFKRL 61 + +V+SL + R A L + DA+ G + + ++ +++ Sbjct: 1 MKTFVVSLKRAKNRYAYIRQHVAGRKLDYKIIDAVDGSLLTEKEIEENCNMEQVDKYRWW 60 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 LS IGC +SH++ ++ + A I+EDD + + LL + + +I Sbjct: 61 LSNGAIGCALSHLNAYEEFLATGDKAAFIIEDDVLLPETITDLLLEIENEIKPSEIILLY 120 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRT---TGYFIGKEAAIHLLNVRKNIYRPIDM 178 P K S + + I +T Y IG+ AA LN+++NI RPI++ Sbjct: 121 YASFAPAKFSTIGERKLSNSGLYYPIDPKQTITAAAYVIGRTAA---LNLKENI-RPIEV 176 Query: 179 DMKHWWEHNIPSL 191 W + Sbjct: 177 TADSWEHYYAKGF 189 >gi|254466303|ref|ZP_05079714.1| glycosyl transferase, family 25 [Rhodobacterales bacterium Y4I] gi|206687211|gb|EDZ47693.1| glycosyl transferase, family 25 [Rhodobacterales bacterium Y4I] Length = 273 Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 68/200 (34%), Gaps = 9/200 (4%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 ISL + RR A ++ F A+ + Q + + + Sbjct: 2 ISLERAAGRRAAMQAALASAGIEAQFVPAVD--LHETSEAELLRQCKSFGPWGAFAPGNM 59 Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 C +SH +W+R S A A++ EDD S E L LS L+ + R Sbjct: 60 ACTLSHARVWERFLDSGADAALVFEDDVFISAELRLWLEDLSWWPEGCGLVNIEFWRSDS 119 Query: 128 KK--DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE 185 + T + ++ +P + GY + ++AA LL + ID + + Sbjct: 120 LRVLLGTDGTQHLDRELAPMLSRNPGSAGYLVTRDAAEQLL-AARPFDMSIDQLLFNPLV 178 Query: 186 HN----IPSLVTEPGAVYEA 201 + P V + Sbjct: 179 SRLAVTLNPYQLAPALVVQG 198 >gi|218767501|ref|YP_002342013.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis Z2491] gi|9977587|sp|P57033|LGTB_NEIMA RecName: Full=Lacto-N-neotetraose biosynthesis glycosyltransferase lgtB gi|121051509|emb|CAM07802.1| lacto-N-neotetraose biosynthesis glycosyl tranferase [Neisseria meningitidis Z2491] gi|319409763|emb|CBY90069.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB1 [Neisseria meningitidis WUE 2594] Length = 279 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 64/234 (27%), Gaps = 28/234 (11%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +VISL + RR + F FFDA+ + + L Sbjct: 1 MQNHVISLASAAERRAHITDTFGVRGIPFQFFDALMPSER-LEQAMAELVPGLSAHPY-L 58 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----L 117 S E C++SH LWK+ + EDD + + L + + Sbjct: 59 SGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGEGAEKFLAEDAWLQERFDPDSAFI 118 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA----IHLLNVRKNIY 173 ++ + + T GY I ++A + Sbjct: 119 VRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISRKAMWFFLDRFAALPSEGL 178 Query: 174 RPIDMDM-----KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESR 213 P+D M +P P + + S IE R Sbjct: 179 HPVDWMMFGNPDDRE---RMPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDR 229 >gi|325916198|ref|ZP_08178482.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas vesicatoria ATCC 35937] gi|325537618|gb|EGD09330.1| glycosyltransferase involved in LPS biosynthesis [Xanthomonas vesicatoria ATCC 35937] Length = 282 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 47/270 (17%), Positives = 85/270 (31%), Gaps = 28/270 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFK-- 59 I Y+I++ S R + R + + F A+ G P F ++ Sbjct: 15 IKSYLINMQRSGDRLQAMSARFQALGILFERLAAVDGATLTPAQIADFVRERPLENSGDT 74 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S I EDD SD LL Sbjct: 75 FSTGPRKWTASNIGCFLSHQAAWRIAAASEDAYTAIFEDDMHLSDALPALLRSTDWLPPG 134 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 + +++ + + K D+ + + G + +P I +EAA LL Sbjct: 135 SSVVRLEPSYNRIKLDTKVADIAGRQLRLVRPSTYQHCWPVCAGALIISREAARRLLAAD 194 Query: 170 KNIYRPIDMDMKHWWEH----NIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ + W E + + P A + STI+ + + Sbjct: 195 ARWHTMADIFLFGWNESPVAQQLSTYQVSPAACIQDKFFHAAPEQIVFTSTIDA--PLTR 252 Query: 218 PTFSPLYFYR-NTCYQWNLHYNAWRKDLPP 246 ++ + + P Sbjct: 253 AQKQRSAHWKAKATTLLRVAQGYRKVVFAP 282 >gi|83955196|ref|ZP_00963852.1| glycosyl transferase, family 25 [Sulfitobacter sp. NAS-14.1] gi|83840525|gb|EAP79698.1| glycosyl transferase, family 25 [Sulfitobacter sp. NAS-14.1] Length = 252 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 74/232 (31%), Gaps = 19/232 (8%) Query: 18 EKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 + + L + +A+ P ++R L E C++SH+ W Sbjct: 2 AFMAQQMDALGLPYQRLEAV----TPDAQSGSHSAAYWASWQRPLKETEKACFLSHVAAW 57 Query: 78 KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLP 137 +R+A A+ILEDD S +Q+L + ++ + + +++ L Sbjct: 58 ERVAALEEP-ALILEDDVVLS-RATQVLLKSCRVLVDVDHLTLEVRQRRKIVAKDAQPLT 115 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGA 197 + + + Y + A LL K D + +E + S EP Sbjct: 116 DDHVMRRLYQDRSGAAAYVLWPSGAAKLLQRAKTQAALADAMICGAYE--LKSYQVEPAC 173 Query: 198 VYEAIDTN----------DSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 + S I R+PT + R Q + A Sbjct: 174 AIQIDQAEAYGVAVGLKTHSLIGAGVPTRQPTTARFKL-RRIAAQLRMGMQA 224 >gi|71149112|gb|AAZ29058.1| Lgt2 [Moraxella catarrhalis] Length = 204 Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 19/190 (10%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILI 118 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +I Sbjct: 7 RLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVI 66 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR---- 174 K + +K + L + + + T GY I ++ A +L+ + Sbjct: 67 KLETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFF 125 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL- 223 PID + + L P V +A DT S IE R R+ T + Sbjct: 126 PIDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYG 185 Query: 224 -YFYRNTCYQ 232 +YR+ + Sbjct: 186 KKYYRSIGKR 195 >gi|71149106|gb|AAZ29055.1| Lgt2 [Moraxella catarrhalis] Length = 206 Score = 110 bits (274), Expect = 3e-22, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 19/190 (10%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILI 118 L+ E C++SH+ LW+++ I EDD ++ Q L L+ N +I Sbjct: 9 RLTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVI 68 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR---- 174 K + +K + L + + + T GY I ++ A +L+ + Sbjct: 69 KLETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFF 127 Query: 175 PIDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL- 223 PID + + L P V +A DT S IE R R+ T + Sbjct: 128 PIDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLADYG 187 Query: 224 -YFYRNTCYQ 232 +YR+ + Sbjct: 188 KKYYRSIGKR 197 >gi|83944105|ref|ZP_00956561.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] gi|83844972|gb|EAP82853.1| glycosyltransferase, family protein 25 [Sulfitobacter sp. EE-36] Length = 252 Score = 110 bits (274), Expect = 3e-22, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 75/232 (32%), Gaps = 19/232 (8%) Query: 18 EKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 + + L + +A+ P + ++R L E C++SH+ W Sbjct: 2 AFMAQQMDALGLSYERLEAV----TPDAQSGSHNAAYWASWQRPLKETEKACFLSHVAAW 57 Query: 78 KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLP 137 +R+A A+ILEDD S +Q+L + ++ + + +++ L Sbjct: 58 ERVAALGEP-ALILEDDVVLS-RATQVLLKSCRGLVDVDHLTLEVRQRRKIVAKDAQPLT 115 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGA 197 + + + Y + A LL K D + +E + S EP Sbjct: 116 DDHVMRRLYQDRSGAAAYVLWPSGAAKLLQRAKTQAALADAMICGAYE--LKSYQVEPAC 173 Query: 198 VYEAIDTN----------DSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNA 239 + S I R+PT + R Q + A Sbjct: 174 AIQIDQAEAYGVAVGLKTHSLIGAGVPTRQPTTARFKL-RRIAAQLRMGMQA 224 >gi|255090100|ref|XP_002506971.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] gi|226522245|gb|ACO68229.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] Length = 272 Score = 109 bits (273), Expect = 4e-22, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 86/241 (35%), Gaps = 44/241 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN----RIFSHQKRQCQFKR 60 V+VISL S RR ++ + F A+ G ++ + ++ +K++ R Sbjct: 37 VHVISLARSFERRRACVEALQIQNISYEIFSAVDGLDDGAFHEREIARYAGKKKRAALHR 96 Query: 61 --LLSLPEI--------------------------GCYISHIHLWKRIAYSPAIGAIILE 92 ++ E+ GC +SH+ LW R+ + +ILE Sbjct: 97 TSMMRRDELESARSHYDTASLPEEVRNDLHERLRFGCSMSHVRLWMRLLSTNLPFFVILE 156 Query: 93 DDADFSDEF-SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR 151 DDA F +++ L + + L F A + YL F Sbjct: 157 DDARVVPNFQRRIVDALRQLPKDWDL--FYAFACDIRPGEYLRGNIRQFR------GGAC 208 Query: 152 TTGYFIGKEAAIHLL-NVRKNIYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAID-TNDST 208 T GY I ++ A HLL + P+D M + + +P V + ST Sbjct: 209 TLGYAISRKGAEHLLFDAAIGSELPVDKMMNVPVSRGRMLAFYADPFLVERVDNAHEQST 268 Query: 209 I 209 + Sbjct: 269 L 269 >gi|226225975|ref|YP_002760081.1| putative glycosyltransferase [Gemmatimonas aurantiaca T-27] gi|226089166|dbj|BAH37611.1| putative glycosyltransferase [Gemmatimonas aurantiaca T-27] Length = 251 Score = 109 bits (272), Expect = 5e-22, Method: Composition-based stats. Identities = 41/165 (24%), Positives = 71/165 (43%), Gaps = 15/165 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNP----ICNRIFSHQKRQCQ 57 IPVY +SL + RR + +I +Q F F DA+ + +++ + Q Sbjct: 27 IPVYCVSLRGATERRSLIQSQVEKIGIQHFQFVDAVQASDLTHEKAASENLYAPEICQHF 86 Query: 58 FKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINN 115 R L L EI C +SH +++ IA S A ++I+EDDA F ++ L L+ + Sbjct: 87 HGRDLRLTEIACSLSHARIYRMIANSTAPWSLIIEDDALFR---ARRLAKLTWKQLPTDA 143 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 ++ A + + D++ + T Y I +E Sbjct: 144 QIVFLSAFLHERPPKQKIA-----RDVYSGESYAGSTAAYLINRE 183 >gi|10438672|dbj|BAB15308.1| unnamed protein product [Homo sapiens] Length = 243 Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 69/174 (39%), Gaps = 23/174 (13%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC- 111 R R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 17 YRDPYHGRPLTKGELGCFLSHYNIWKEVVDRGPQKSLVFEDDLRFEIFFKRRLMNLMRDV 76 Query: 112 ---DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ LI RK+ + + +P ++ + S T Y I + A LL Sbjct: 77 EREGLDWDLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-A 132 Query: 169 RKNIY--RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 + + P+D + H+ N+ + EP +Y T D Sbjct: 133 AEPLSKMLPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 186 >gi|170581569|ref|XP_001895737.1| LPS glycosyltransferase family protein [Brugia malayi] gi|158597202|gb|EDP35413.1| LPS glycosyltransferase family protein [Brugia malayi] Length = 429 Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 58/138 (42%), Gaps = 8/138 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPIC----NRIFSHQKRQCQFK 59 +YVI+L + R+ K + +++++ A G + + F +K Sbjct: 291 KIYVINLERRNERKAKMMELLKLMGFEYTWWKATDGHHLDLEPLYKEVKFLPGYEDPYYK 350 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF----SDEFSQLLPHLSKCDINN 115 R + E+GC++SH +W+ + II EDD F + +L+ + +I Sbjct: 351 RPMKAGEVGCFLSHYRIWQEVDEKKLDRVIIFEDDLRFVVNSTILLKELIEDIDSIEIEW 410 Query: 116 ILIKFDALRKKPKKDSYL 133 LI R + ++++ Sbjct: 411 DLIYLGRKRLEGADENWV 428 >gi|322489523|emb|CBZ24781.1| glycosyltransferase family-like protein [Leishmania mexicana MHOM/GT/2001/U1103] Length = 275 Score = 108 bits (271), Expect = 7e-22, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 83/248 (33%), Gaps = 39/248 (15%) Query: 6 YVISLPFSHARREKFCHR--------AARIHLQFSFFDAIYG---------ENNPICNRI 48 YV++L R + + ++ + I G + + ++ Sbjct: 16 YVLNLDRRPDRWAHVQQQVKRARLHKFLKPGVEVTRLSGIDGNGLDVAGLHRDGVLTDQG 75 Query: 49 FSHQK---RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QL 104 + + Q + L+ IGC +SH +W+R+ A+ILEDD +F +FS Q Sbjct: 76 YQRYQLPTEQKLYGMDLTKGAIGCALSHRAVWQRVVAEHRACALILEDDLEFHHQFSRQF 135 Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 S+ + ++ L + + T Y + +A Sbjct: 136 AERWSRVPADWGIVHMGGLDLLASGKPPRPYIADGIRL--AYHGHRELTAYVVHAASAQR 193 Query: 165 LLNVRKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 L++ + +D + + + + V +P V + + + Sbjct: 194 CLDLSLPMTWQVDTHISSCLADDAAAQDKYIADPKMYVFQPSLVIQLMSFTTD------V 247 Query: 215 VRKPTFSP 222 +KPT SP Sbjct: 248 QKKPTDSP 255 >gi|154333924|ref|XP_001563217.1| glycosyl transferase [Leishmania braziliensis MHOM/BR/75/M2904] gi|134060229|emb|CAM45637.1| glycosyltransferase family-like protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 275 Score = 108 bits (271), Expect = 7e-22, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 81/248 (32%), Gaps = 39/248 (15%) Query: 6 YVISLPFSHARREKFCHR--------AARIHLQFSFFDAIYGENNP------------IC 45 YV++L R + + ++ + F I G + + Sbjct: 16 YVLNLDRRPDRWAHVQQQVKRAKLQKFLKPGVEVTRFSGIDGNDLDVAGLHNDGVLTDLG 75 Query: 46 NRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QL 104 + + Q + L+ IGC + H +W+R+ A+ILEDD +F +F+ Q Sbjct: 76 YQRYQLPTEQKLYGMDLTKGAIGCALGHRAVWRRVVAEHRECALILEDDLEFHHQFARQF 135 Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 S+ + ++ L S + + T Y + +A Sbjct: 136 AERWSRVPADWGIVHMGGLDLLASDKSPRPYIADGIRL--AYHGHRELTAYVVHAASAQR 193 Query: 165 LLNVRKNI----------YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 L++ + D + + + + V +P V + + + Sbjct: 194 CLDLTLPMTWQVDTHISSCLADDTAAQDQYIADPKTYVFQPSLVIQLMSFTTD------V 247 Query: 215 VRKPTFSP 222 +KPT +P Sbjct: 248 QKKPTDNP 255 >gi|187880609|gb|ACD37116.1| WffP [Escherichia coli] Length = 250 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 82/218 (37%), Gaps = 21/218 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +V+SL S RR+ F HR + FFDAI + F+ + + R L Sbjct: 1 MKCFVVSLKNS-DRRKDFDHRFKI--FNYQFFDAI----TECTSNQFNSAIAKSIYGRCL 53 Query: 63 SLPEIGCYISHIHLWKRIAYS-PAIGAIILEDDADFSDEFSQLLPHLS-KCDINNILIKF 120 EIGC +SH ++ + A + +I+EDDA +F + K N + Sbjct: 54 RKGEIGCSLSHFNIIRDFALNSKENWLLIMEDDAVPESQFIHFINEFENKHVTTNAEVIL 113 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQ----------PRILSPRTTGYFIGKEAAIHLLNVRK 170 K +K ++ L ++ + T Y I K+AA + + Sbjct: 114 LGHSKTSRKHIFVQRLKQPLRNYKNINGISFGENKNVTLCGTVAYLINKQAANIISGNSQ 173 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + D + + I + VYE + + ST Sbjct: 174 PYWLADDWFL--FQNMGIRVYHPQIPLVYEDLSYDSST 209 >gi|261334340|emb|CBH17334.1| glycosyltransferase family-like protein [Trypanosoma brucei gambiense DAL972] Length = 357 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 75/247 (30%), Gaps = 35/247 (14%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENNPICN----------- 46 YV++L R + +R L+ + + G+ + Sbjct: 98 YVLNLDRRQDRWAHVQQQLSRAKLETFLRPPAKVTRVSGVDGQALDVEALHRNGLVTDVG 157 Query: 47 -RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + F + F L+ IGC + H +W+ + A+ILEDD +F +F +LL Sbjct: 158 YQRFLLPLEEKLFGMDLTPGAIGCALGHRKIWETVVEKRHQCALILEDDVEFHHKFPRLL 217 Query: 106 PHL-SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 + + + ++ L + + + T Y + AA Sbjct: 218 REVWPRVPSDWGIVHLGGLDLLASGKPPRPFV--DVGVRHAYSGHRELTAYVLHHAAAKR 275 Query: 165 LLNVRKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 L + +D + + + + + V +P + + R Sbjct: 276 CLEHTLPMTWQVDTHICSVVTEDPAAQDSYISDPMTYVFQPSLAIQITSFGTDV--QKRP 333 Query: 215 VRKPTFS 221 P Sbjct: 334 SDNPPLE 340 >gi|283956549|ref|ZP_06374029.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. jejuni 1336] gi|283792269|gb|EFC31058.1| putative lipooligosaccharide biosynthesis glycosyltransferase [Campylobacter jejuni subsp. jejuni 1336] Length = 183 Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 68/183 (37%), Gaps = 11/183 (6%) Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILIKFDALRKKP 127 ++SH LW+ IILEDD +FSDEF I+ L K Sbjct: 3 FVSHYKLWQECVKLDEP-IIILEDDVEFSDEFLNNGEEYIDGLLKSEYEYIRLCYLFDKR 61 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN 187 G + + T GY + AA +++ + IY +D M ++H Sbjct: 62 LYFLSESYGGGGYYLSF--KNLSGTQGYVLKPSAAKKIISKAQFIYMSVDDYMDKIYKHK 119 Query: 188 IPSLVTEPGAVYEAIDTNDSTIEESR------LVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 + +++ +P + +++ + + R + RK FY + ++ Y R Sbjct: 120 VLNIIKKPLLLKHSLELQSNISQLGRNSQKIKIYRKIIREIFRFYSSILRLSSIIYIKNR 179 Query: 242 KDL 244 L Sbjct: 180 LKL 182 >gi|218659146|ref|ZP_03515076.1| glycosyl transferase family 25 [Rhizobium etli IE4771] Length = 143 Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats. Identities = 32/118 (27%), Positives = 55/118 (46%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I Y+I+L + RR + + I + F A+ G + R F Q + R Sbjct: 20 INTYLINLDRAELRRFRMERLLSEIGISFERVAAVDGVGISLPRRDFDGQSYLRRHGRTP 79 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 + EIGCY+SH+ + A+ILEDD DF D+F+++L + + +++ Sbjct: 80 NPFEIGCYLSHVECARMFLAGNGEFALILEDDLDFDDDFAEVLAAALEQHASWDILRL 137 >gi|222149298|ref|YP_002550255.1| glycosyl transferase [Agrobacterium vitis S4] gi|221736282|gb|ACM37245.1| glycosyl transferase [Agrobacterium vitis S4] Length = 275 Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 81/247 (32%), Gaps = 25/247 (10%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V +I+L + R + + L F+ A+ ++ P + ++R + Sbjct: 1 MQVLIINLDRARERLAFQQAQMRALGLSFTRIAALDADHLPDLGEAY-----WNGWERPI 55 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E C +SH W+ IA A ++LEDDA SD+ LL L + Sbjct: 56 RPAERACLLSHRQTWETIAA-GADPVLVLEDDAVLSDKVPALLEALRHRTD----LDHVT 110 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 L + +K + + + Y + A LL D + Sbjct: 111 LETRSRKKLVARASLSDLPLRRLYQDRSGAAAYILWPTGAAKLLQRSARHGAIADAVICA 170 Query: 183 WWEHNIPSLVTE-----------PGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 +E + S + P + + I T S ++P+ + + R Sbjct: 171 AYE--LASFQADPALAVQLDRCAPYGLPKPIATQSSISPTQSEKQRPSLA--FRLRRIKA 226 Query: 232 QWNLHYN 238 Q + Sbjct: 227 QLRMGLR 233 >gi|332298505|ref|YP_004440427.1| glycosyl transferase family 25 [Treponema brennaborense DSM 12168] gi|332181608|gb|AEE17296.1| glycosyl transferase family 25 [Treponema brennaborense DSM 12168] Length = 244 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 67/204 (32%), Gaps = 14/204 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICN--RIFSHQKRQCQFK 59 + YVI+LP R+ + F +A+YG++ R+F ++ Sbjct: 1 MNTYVINLPRDAERKRYMQKILQEQNFKNVVFVEAVYGKSLSAQEKERLFDSAAFTKKYA 60 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK---CDINNI 116 +L + +IGC +SH ++ S +ILEDD E ++ + + Sbjct: 61 KLPNDAQIGCTLSHRKCYEEFLQSGEKSCLILEDDIAPKSEMMPVVKKIQQFLESKEEPA 120 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 ++ KK+ + G+ Y + A LL + Y Sbjct: 121 IVLLSGWFWYTKKEFFAGNALGSL------YSGFLAHSYMLNARGAELLL--AQKPYYVA 172 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYE 200 D + I + + Sbjct: 173 DDWYEFRKNFGIKMYGLLSHIINQ 196 >gi|18088114|gb|AAH20492.1| GLT25D1 protein [Homo sapiens] Length = 231 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 69/174 (39%), Gaps = 23/174 (13%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC- 111 R R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 5 YRDPYHGRPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDV 64 Query: 112 ---DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ LI RK+ + + +P ++ + S T Y I + A LL Sbjct: 65 EREGLDWDLI--YVGRKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-A 120 Query: 169 RKNIY--RPIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 + + P+D + H+ N+ + EP +Y T D Sbjct: 121 AEPLSKMLPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 174 >gi|307257026|ref|ZP_07538802.1| Dimethyladenosine transferase [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864510|gb|EFM96417.1| Dimethyladenosine transferase [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 139 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 35/139 (25%), Positives = 52/139 (37%), Gaps = 10/139 (7%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALRKKPKKDSYLCTLPGNF 140 + AIILEDDA S F +++ L K +I FD K K L + Sbjct: 2 DNNIKEAIILEDDAIVSHYFRRIVADVLRKVPSRKQIIFFD--HGKAKYWPITRPLKEGY 59 Query: 141 DIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVT 193 + + R S TGY++ E A LLN + P D I + Sbjct: 60 KLVRYRYPSKNSKRIIMIATGYYLTLEGAKLLLNHAYPLRMPADFLTGAIQMTGIKAYGV 119 Query: 194 EPGAVYEAIDTNDSTIEES 212 EP ++ + + IE+ Sbjct: 120 EPPCIFISPGSEIDEIEKR 138 >gi|188989616|ref|YP_001901626.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. B100] gi|167731376|emb|CAP49551.1| glycosyltransferase [Xanthomonas campestris pv. campestris] Length = 269 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 47/269 (17%), Positives = 83/269 (30%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFK-- 59 I Y+I++ R R + + F A+ G P F+ Q+ Sbjct: 2 IKSYLINMQRCDDRLRAMSARFQALGVPFERIAAVDGRTLTPEQIADFARQRPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S + EDD SD LL Sbjct: 62 FSTGPRTWTASNIGCFLSHQAAWRIAANSDDAYTAVFEDDMHLSDALPALLRTTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 + +++ + + K + + + G +P I ++AA LL Sbjct: 122 SSIVRLEPSYNRIKLEGKVTAVAGRQLRRVRPSTYQHCWPVCAGALIISRDAARLLLAAE 181 Query: 170 KNIYRPIDMDMKHWWEH----NIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ M W E + + P A + STI+ + Sbjct: 182 ARWHTMADIFMFGWNESPVAQQLSTYQLSPAACIQDKFFHTAPEQIVFTSTIDAPLTQAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KQRSAQWKAKATA-LLRVAQGYRKVTFVP 269 >gi|268604775|ref|ZP_06138942.1| glycosyltransferase [Neisseria gonorrhoeae PID1] gi|268588906|gb|EEZ53582.1| glycosyltransferase [Neisseria gonorrhoeae PID1] Length = 167 Score = 106 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 51/163 (31%), Gaps = 7/163 (4%) Query: 10 LPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGC 69 + + RR + F FFDA+ R + LS E C Sbjct: 1 MASAAERRAHIADTFGSRGIPFQFFDALMPS--ERLERAMAELVPGLSAHPYLSGVEKAC 58 Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI-----LIKFDALR 124 ++SH LWK+ + EDD F + + L + + +++ + + Sbjct: 59 FMSHAVLWKQALDEGLPYVAVFEDDVLFGKDAEKFLAEDTWLEERFDKDSAFIVRLETMF 118 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 K N T GY I +EA L+ Sbjct: 119 AKVIVRPDKVLNYENRSFPLLESEHCGTAGYIISREAMRFFLD 161 >gi|71755109|ref|XP_828469.1| hypothetical protein [Trypanosoma brucei TREU927] gi|70833855|gb|EAN79357.1| glycosyltransferase family-like protein, putative [Trypanosoma brucei] Length = 357 Score = 106 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 75/247 (30%), Gaps = 35/247 (14%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENNPICN----------- 46 YV++L R + +R L+ + + G+ + Sbjct: 98 YVLNLDRRQDRWAHVQQQLSRAKLETFLRPPAKVTRVSGVDGQALDVEALHRNGLVTDVG 157 Query: 47 -RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + F + F L+ IGC + H +W+ + A+ILEDD +F +F +LL Sbjct: 158 YQRFLLPLEEKLFGMDLTPGAIGCALGHRKIWETVVEKRHQCALILEDDVEFHHKFPRLL 217 Query: 106 PHL-SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 + + + ++ L + + + T Y + AA Sbjct: 218 REVWPRVPSDWGIVHLGGLDLLASGKPPRPFV--DVGVRHAYSGHRELTAYVLHHVAAKR 275 Query: 165 LLNVRKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 L + +D + + + + + V +P + + R Sbjct: 276 CLEHTLPMTWQVDTHICSVVTEDPAAQDSYISDPMTYVFQPSLAIQITSFGTDV--QKRP 333 Query: 215 VRKPTFS 221 P Sbjct: 334 SDNPPLE 340 >gi|293410401|ref|ZP_06653977.1| predicted protein [Escherichia coli B354] gi|187880602|gb|ACD37110.1| WffP [Shigella dysenteriae] gi|291470869|gb|EFF13353.1| predicted protein [Escherichia coli B354] Length = 250 Score = 106 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 82/218 (37%), Gaps = 21/218 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +V+SL S RR+ F HR + FFDAI + F+ + + R L Sbjct: 1 MKCFVVSLKNS-GRRKDFDHRFKI--FNYQFFDAI----TECTSNQFNSAIAKSIYGRCL 53 Query: 63 SLPEIGCYISHIHLWKRIAYS-PAIGAIILEDDADFSDEFSQLLPHLS-KCDINNILIKF 120 EIGC +SH ++ + A + +I+EDDA +F + K N + Sbjct: 54 RKGEIGCSLSHFNIIRDFALNSKENWLLIMEDDAVPESQFIHFINEFENKHVTTNAEVIL 113 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQ----------PRILSPRTTGYFIGKEAAIHLLNVRK 170 K +K ++ L ++ + T Y I K+AA + + Sbjct: 114 LGHSKTSRKHIFVQRLKQPLRNYKNINGISFGENKNVTLCGTVAYLINKQAANIISGNSQ 173 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + D + + I + VYE + + ST Sbjct: 174 PYWLADDWFL--FQNMGIRVYHPQIPLVYEDLSYDSST 209 >gi|47216930|emb|CAG04872.1| unnamed protein product [Tetraodon nigroviridis] Length = 615 Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 78/251 (31%), Gaps = 63/251 (25%) Query: 24 AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----KRLLSLPEIGCYISHIHLWK 78 + + DA+ G + + R L+ E+GC++SH +WK Sbjct: 366 LNELEIDVKVVDAVDGNALNSSDIKLLGVDLLPGYHDPFSGRSLTKGEVGCFLSHFFIWK 425 Query: 79 -------------------------RIAYSPAIGAIILEDDADFSDEFSQLL----PHLS 109 ++ A+I EDD F F + L + Sbjct: 426 EVRTSPLDGDASPSSWCVDRPRPLPQMVDQQYDTALIFEDDVRFQANFKRRLLRLMEEVQ 485 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + +++ +I + P + + S T Y + + A LLN Sbjct: 486 QVELDWDIIYLGRKQVNPSAEEPV----EKVRNLVVADYSYWTLSYAVSLQGAQKLLN-A 540 Query: 170 KNIY--RPIDMDM---------KHWWEH----NIPSLVTEPGAVYE---------AIDTN 205 + + P+D + + + H N+ + T+P V DT Sbjct: 541 EPLSKMLPVDEFLPVMYDKHPNQDYKSHFPNRNLQAFSTQPLLVQPCHYAGDPQWVSDTE 600 Query: 206 DSTIEESRLVR 216 ST+ + VR Sbjct: 601 TSTLWDDDSVR 611 >gi|21229677|ref|NP_635594.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66766554|ref|YP_241316.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. 8004] gi|21111160|gb|AAM39518.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66571886|gb|AAY47296.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. 8004] Length = 269 Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats. Identities = 47/269 (17%), Positives = 82/269 (30%), Gaps = 26/269 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN-PICNRIFSHQKRQCQFK-- 59 I Y I++ R R + + F A+ G P F+ Q+ Sbjct: 2 IKSYFINMQRCDDRLRAMSARFQALGVPFERIAAVDGRTLTPEQIADFARQRPLEGSGDA 61 Query: 60 -----RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN 114 R + IGC++SH W+ A S + EDD SD LL Sbjct: 62 FSTGPRTWTASNIGCFLSHQAAWRIAANSDDAYTAVFEDDMHLSDALPALLRTTDWLPGG 121 Query: 115 NILIKFDALRKKPKKDSYLCTLPGN-FDIHQPRI----LSPRTTGYFIGKEAAIHLLNVR 169 + +++ + + K + + + G +P I ++AA LL Sbjct: 122 SSIVRLEPSYNRIKLEGKVTAVAGRQLRRVRPSTYQHCWPVCAGALIISRDAARLLLAAE 181 Query: 170 KNIYRPIDMDMKHWWEH----NIPSLVTEPGAVYEA--------IDTNDSTIEESRLVRK 217 + D+ M W E + + P A + STI+ + Sbjct: 182 ARWHTMADIFMFGWNESPVAQQLSTYQLSPAACIQDKFFHTAPEQIVFTSTIDAPLTQAQ 241 Query: 218 PTFSPLYFYRNTCYQWNLHYNAWRKDLPP 246 S + + T + + P Sbjct: 242 KQRSAQWKAKATA-LLRVAQGYRKVTFVP 269 >gi|323452464|gb|EGB08338.1| hypothetical protein AURANDRAFT_64247 [Aureococcus anophagefferens] Length = 351 Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 69/181 (38%), Gaps = 9/181 (4%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQ-CQFKRLL 62 V+V++L AR R + FDA+ G + + + + + + Sbjct: 127 NVFVVTLARDDARVRHTETYTRRRLPEAEVFDAVDGASPALAAFLRARAVTLAPAWAKAC 186 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS-KCDINNILIKFD 121 ++ ++GC SH+ LW+++ A++LEDD +D F + ++ + Sbjct: 187 TVGQLGCMCSHMALWRKVVDDGLDHAVVLEDDVLVADGFRTEVANILGELPALWDHCYLF 246 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PIDMD 179 + K+D LPG + + T Y + + A LL + ID Sbjct: 247 H-HPQCKRD---LPLPGAKHVQRAFETW-GTVAYVVSRRGAEKLLAKTEGASCAKAIDET 301 Query: 180 M 180 M Sbjct: 302 M 302 >gi|157866262|ref|XP_001681837.1| glycosyl transferase [Leishmania major strain Friedlin] gi|68125136|emb|CAJ02854.1| glycosyltransferase family-like protein [Leishmania major strain Friedlin] Length = 275 Score = 105 bits (263), Expect = 6e-21, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 23/196 (11%) Query: 6 YVISLPFSHARREKFCHR--------AARIHLQFSFFDAIYGENNP---------ICNRI 48 YV++L R + + ++ + I G + +R Sbjct: 16 YVLNLDRRPDRWAHVQQQVKRARLHKFLKPGVEVTRVSGIDGRELDVAGLHRDGVLTDRG 75 Query: 49 FSHQK---RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QL 104 + + Q + L+ IGC +SH +W+R+ A+ILEDD +F +FS Q Sbjct: 76 YQRYQLPTEQKLYGMDLTKGAIGCALSHRAVWQRVVAEHRECALILEDDLEFHHQFSRQF 135 Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 S+ + ++ L + + + T Y + +A Sbjct: 136 AARWSRVPADWGIVHMGGLDLLASGKAPRPYIADGIRL--AYDGHRELTAYVVHAASAQR 193 Query: 165 LLNVRKNIYRPIDMDM 180 L++ + +D + Sbjct: 194 CLDLSLPMTWQVDTHI 209 >gi|71149114|gb|AAZ29059.1| Lgt2 [Moraxella catarrhalis] Length = 205 Score = 105 bits (262), Expect = 7e-21, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 71/187 (37%), Gaps = 19/187 (10%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN--ILIKFD 121 + E C++SH+ LW+++ I EDD ++ Q L L+ N +IK + Sbjct: 11 MGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGNDSQQFLQELTIWLQQNAVDVIKLE 70 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR----PID 177 +K + L + + + T GY I ++ A +L+ + PID Sbjct: 71 TWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAFEFFPID 129 Query: 178 MDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFSPL--YF 225 + + L P V +A DT S +E R R+ T + + Sbjct: 130 HVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLLETQRKQNVNQHRRRTLADYGKKY 189 Query: 226 YRNTCYQ 232 YR+ + Sbjct: 190 YRSIGKR 196 >gi|118115561|ref|XP_423349.2| PREDICTED: hypothetical protein, partial [Gallus gallus] Length = 155 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 59/133 (44%), Gaps = 8/133 (6%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD 112 + R L+ E+GC++SH +W+ I +++ EDD F F + L +L D Sbjct: 17 YKDPYHGRPLTKGELGCFLSHYRVWEEIVERGLGKSVVFEDDLRFEIFFKRRLMNLM-YD 75 Query: 113 INNILIKFDAL---RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + + +D + RK+ + + ++P ++ + S T Y + + A LL Sbjct: 76 LEEEGVGWDLIYIGRKRMQVEQPERSVPHVRNLVEA-DYSYWTLAYVVSLQGARKLL-AA 133 Query: 170 KNIY--RPIDMDM 180 + + P+D + Sbjct: 134 QPLSKMLPVDEFL 146 >gi|71650980|ref|XP_814177.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70879127|gb|EAN92326.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 275 Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 84/243 (34%), Gaps = 29/243 (11%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENN--PICNRI------- 48 YV++L R + AR+ L+ + + G + +R Sbjct: 16 YVLNLDRRRDRWAHVQRQIARVGLEKFIQSPAKVTRVSGVDGNSLDVEALHRDGVITDLG 75 Query: 49 ---FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 F + F L+ IGC + H +W+ I A++LEDD +F +F +LL Sbjct: 76 YTRFLLPTEEKLFGMDLTRGAIGCALGHRKIWEMIVAERRTRALVLEDDVEFHHKFGRLL 135 Query: 106 PHL-SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 L + + ++ L + + + T Y I +A Sbjct: 136 GPLWKRVPADWGIVHLGGLDLLAAEKPPRPFIDDGIRL--AYQGHRELTAYVIHHISAKR 193 Query: 165 LLNVRKNIYRPIDMDMKHWWEHN---IPSLVTEP--GAVYEAIDTNDSTIEESRLVRKPT 219 L + + +D + + E + + +P ++ +++ + ++P+ Sbjct: 194 CLELSTPMTWQVDTHISNIVESDPEAQDKYIVDPKMYVFQPSLAIQITSLGTD-VQKRPS 252 Query: 220 FSP 222 +P Sbjct: 253 ENP 255 >gi|29347059|ref|NP_810562.1| hypothetical protein BT_1649 [Bacteroides thetaiotaomicron VPI-5482] gi|29338957|gb|AAO76756.1| glycoside transferase family 25 [Bacteroides thetaiotaomicron VPI-5482] Length = 225 Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 70/210 (33%), Gaps = 14/210 (6%) Query: 20 FCHRAARIH-LQFSFFDAIYGENNPICNR--IFSHQKRQCQFKRLLSLPEIGCYISHIHL 76 + ++ L F +A+ + F + ++ + + EIGC +SH Sbjct: 1 MQEQLEKMFFLSAEFVEAVDARGMTEREKNVFFDTELFCKRYVKEVRPGEIGCTLSHQKC 60 Query: 77 WKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC--DINNILIKFDALRKKPKKDSYLC 134 ++++ S A+ILEDD L+P + K +I Sbjct: 61 YRKLVESRDKYALILEDDIVIRHNIDTLVPEIEKLLNTDEPRVILLSGWY----WYLGTK 116 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTE 194 T+ ++ + + T Y I +EAA L+ + D + + + Sbjct: 117 TIKQHYCLARVYDAFL-THAYIINREAASLLIEQ-RPFITADDWF--YIRKKGVKLYAVL 172 Query: 195 PGAVYEAIDTN-DSTIEESRLVRKPTFSPL 223 P + + ++I + R P Sbjct: 173 PHLLDQDWSGEYPTSINQEEKKRCPGLWKR 202 >gi|71415375|ref|XP_809756.1| glycosyl transferase-like protein [Trypanosoma cruzi strain CL Brener] gi|70874187|gb|EAN87905.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] gi|322829519|gb|EFZ32876.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 275 Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 72/214 (33%), Gaps = 26/214 (12%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENN--PICNRI------- 48 YV++L R + AR+ L+ + + G + +R Sbjct: 16 YVLNLDRRRDRWAHVQRQIARVGLEKFIQPPAKVTRVSGVDGNSLDVEALHRDGVITDLG 75 Query: 49 ---FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 F + F L+ IGC + H +W+ I A++LEDD +F +F +LL Sbjct: 76 YTRFLLPTEEKLFGMDLTRGAIGCALGHRKIWEMIVAERRTRALVLEDDVEFHHKFGRLL 135 Query: 106 PHL-SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 L + + ++ L + + + T Y I +A Sbjct: 136 GPLWKRVPADWGIVHLGGLDLLAAEKPPRPFIDDGIRL--AYQGHRELTAYVIHHISAKR 193 Query: 165 LLNVRKNIYRPIDMDMKHWWEHN---IPSLVTEP 195 L + + +D + + E + + +P Sbjct: 194 CLELSTPMTWQVDTHISNIVESDPEAQDKYIVDP 227 >gi|332025628|gb|EGI65790.1| Glycosyltransferase 25 family member [Acromyrmex echinatior] Length = 197 Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats. Identities = 40/178 (22%), Positives = 75/178 (42%), Gaps = 19/178 (10%) Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 KR +++ EIGC++SH +W+++ ++LEDD F F Q + ++ +++N Sbjct: 7 PYHKRPMTMGEIGCFLSHYIVWQKVLEHGYKSVMVLEDDVRFEPFFRQKVDYVLA-ELSN 65 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK-NIYR 174 + +++D + KK L S T GY + + A L++ Sbjct: 66 LKLEWDLIYMGRKKLVKSENLVEGSKFLVRAGYSYWTLGYILSESGAKKLIDAMPLEKLI 125 Query: 175 PIDMDM--------KHWWE-----HNIPSLVTEPGAVYE----AIDTNDSTIEESRLV 215 P+D + + W ++ L T P +Y D + S E+S+LV Sbjct: 126 PVDEFLPILSDTHPEKQWAAQFPTRDLIILSTNPLLIYPSHYTGEDGHISDTEDSKLV 183 >gi|146080914|ref|XP_001464112.1| glycosyl transferase [Leishmania infantum JPCM5] gi|134068202|emb|CAM66488.1| glycosyltransferase family-like protein [Leishmania infantum JPCM5] gi|322497507|emb|CBZ32581.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 275 Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 82/248 (33%), Gaps = 39/248 (15%) Query: 6 YVISLPFSHARREKFCHR--------AARIHLQFSFFDAIYG---------ENNPICNRI 48 YV++L R + + ++ + I G + + +R Sbjct: 16 YVLNLDRRPDRWAHVQQQVKRAKLHKFLKPGVEVTRVSGIDGHELDVASLHRDGVLTDRG 75 Query: 49 FSHQK---RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QL 104 + + Q + L+ IGC +SH +W+R+ +ILEDD +F +FS Q Sbjct: 76 YQRYQLPTEQKLYGMDLTKGAIGCALSHRAVWQRVVAEHRECVLILEDDLEFHHQFSRQF 135 Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 S+ + ++ L + + + T Y +A Sbjct: 136 AERWSRVPADWGIVHMGGLDLLASGKAPRPYIADGIRL--AYEGHRELTAYVAHAASAQR 193 Query: 165 LLNVRKNIYRPIDMDM----------KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 L++ + +D + + + + V +P V + + + Sbjct: 194 CLDLSLPMTWQVDTHISSCLADDAAAQDKYIADPKMYVFQPSLVIQLMSFTTD------V 247 Query: 215 VRKPTFSP 222 +KPT +P Sbjct: 248 QKKPTDNP 255 >gi|83955198|ref|ZP_00963854.1| hypothetical protein NAS141_03796 [Sulfitobacter sp. NAS-14.1] gi|83840527|gb|EAP79700.1| hypothetical protein NAS141_03796 [Sulfitobacter sp. NAS-14.1] Length = 260 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 83/251 (33%), Gaps = 39/251 (15%) Query: 24 AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 A + L F F A+ G ++ + K+ EI C SHI +W++ + Sbjct: 1 MAELGLPFKIFRAVDGHDDDAKLPDLNRHLFVINQKKQPVRGEIACAASHIAVWRQFLET 60 Query: 84 PAIGAIILEDDADFSDEFS-----------QLLPHLSKCDINNIL-------IKFDALRK 125 A+ILEDD D + + L S L + DA+R Sbjct: 61 DEEYALILEDDIDIAPDLEMVQKQMPGLGLDFLNLSSNAPYTYTLDEETLSQLANDAVRT 120 Query: 126 KPKKDS-------------------YLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 +P +L LP + +GY + + AA L Sbjct: 121 RPTVFERSRRRLWRRLEWRRRWRIFHLHPLPNGHVACECDPAPLLGSGYIVSRRAAEAFL 180 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND-STIEESRLVRK-PTFSPLY 224 ++++ PID+ +H + P D++ ++ ++ K P Sbjct: 181 QTTEHLFFPIDLVWRHAPGMLRQGFLARPIVTQTDKDSDIPGRFDQGKMALKYRLLRPFL 240 Query: 225 FYRNTCYQWNL 235 R ++++ Sbjct: 241 KSRRLRRRFDV 251 >gi|328721259|ref|XP_001944685.2| PREDICTED: glycosyltransferase 25 family member-like [Acyrthosiphon pisum] Length = 223 Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats. Identities = 40/178 (22%), Positives = 69/178 (38%), Gaps = 22/178 (12%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 + + +KR ++ EIGC++SH +W + Y ++LEDDA F Sbjct: 13 DLEFMGVRLMPGYKDPYYKRPITKGEIGCFMSHYRIWAKTTYEGLDEVLVLEDDARFEPY 72 Query: 101 FS----QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYF 156 F +L L + ++ L+ + +K D+ L + + +P S T GY Sbjct: 73 FRFKLQMVLDELRRLKVSWDLV---YIGRKSLNDNNESRLENSKLLVRP-GYSYWTLGYL 128 Query: 157 IGKEAAIHLLNV-RKNIYRPIDMDM--------KHWW-----EHNIPSLVTEPGAVYE 200 + A LLN P+D + + W E N+ +L P +Y Sbjct: 129 LSGRGAKKLLNANPLKKLLPVDEFLPIMFNQHPQAEWSEQFDERNLIALSASPLLIYP 186 >gi|170749745|ref|YP_001756005.1| glycosyl transferase family protein [Methylobacterium radiotolerans JCM 2831] gi|170656267|gb|ACB25322.1| glycosyl transferase family 25 [Methylobacterium radiotolerans JCM 2831] Length = 250 Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats. Identities = 47/242 (19%), Positives = 81/242 (33%), Gaps = 24/242 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 + ISLP R A A++G + I F+ L+ Sbjct: 4 FGISLPRFRERHAYLTRHLA----------AVWGRHCEITGYEGPVPDPNAPFQPDLTAG 53 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 +IGC +SH+ ++R+ A+I+EDD + Q++ + ++ + Sbjct: 54 QIGCALSHMSAYERMIALDLPHALIVEDDVVLPPQIHQIVAGIEAALRPGDVVLLFNWAE 113 Query: 126 KPKKDSYLCTLP-GNFDIHQPRILSP--RTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 S + + G + P +S Y I + AA +L + + D + Sbjct: 114 TVGAFSSVGAVTVGEHRLCLPMDMSSLGTAAAYVITRAAAEGILRANRPVAVTSDN-WSY 172 Query: 183 WWEHN--IPSLVTEPGAVYEAIDTNDSTIEESR------LVRKPTFSPLYFYRNTCYQWN 234 ++E P AV +STI SR L PL+ R Sbjct: 173 FFERGALTRGRALVPNAVSR--QPFESTIFTSRSRIAAFLKGSALLRPLFTARRRILAAR 230 Query: 235 LH 236 L Sbjct: 231 LA 232 >gi|283458618|ref|YP_003363253.1| LPS biosynthesis glycosyltransferase [Rothia mucilaginosa DY-18] gi|283134668|dbj|BAI65433.1| glycosyltransferase involved in LPS biosynthesis [Rothia mucilaginosa DY-18] Length = 333 Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 67/185 (36%), Gaps = 18/185 (9%) Query: 30 QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAI 89 F + A+ ++ + + F ++ ++ + E GC +SH H+WK S A A+ Sbjct: 89 DFQRYSAVDNRDDHVTDAQFDSKQFLARYGKEPRPAEKGCALSHYHMWKDFLASDADWAL 148 Query: 90 ILEDDADFSDEFSQLLPHLSKCDINNILIKF------DALRKKPKKDSYLCTLPGNFDIH 143 EDD S + ++ + + + ++ +A + P+ D +L F Sbjct: 149 FAEDDVLISPDLQPVVDRIIEKYPHVQMVNLGDIYASEAGKLTPQVDYPRLSLLSPFVYG 208 Query: 144 QPRILSP-------RTTGYFIGKEAAIHLLNV---RKNIYRPIDMDMKHWWEHNIPSLVT 193 + R+ + Y + + A L+ D + W + + Sbjct: 209 KYRMGNAYGSKPLYGAALYLLSRSGAERLVECYGETVPGVVADDYSLYREW--GVDVYLV 266 Query: 194 EPGAV 198 +PG Sbjct: 267 QPGLC 271 >gi|78188719|ref|YP_379057.1| glycosyl transferase family protein [Chlorobium chlorochromatii CaD3] gi|78170918|gb|ABB28014.1| glycosyl transferase, family 25 [Chlorobium chlorochromatii CaD3] Length = 244 Score = 103 bits (256), Expect = 3e-20, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 84/250 (33%), Gaps = 47/250 (18%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V+VISL RR+ F H+ + FF+A+ G P ++ S+ + Sbjct: 1 MKVHVISLKRCTERRKAFMDM--NPHVDYLFFNAVDGSTIPE--KVLSN-PLLFEKGLPY 55 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCDINNILIK-- 119 + GC +SH+ LW + I EDDA F +F + L S + +I Sbjct: 56 TKGAYGCALSHLLLWNKAIKEN-CVLTIAEDDAIFRKDFHVMQNKLLSSISSDWDIILWG 114 Query: 120 ---------------------FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG 158 F + + ++++ + + Y I Sbjct: 115 WNFDSILSLNVLPDVSPTVMVFSQEKLRESINTFIEKVTYPSSLFFLDKCF-GIPAYTIT 173 Query: 159 KEAAIHL------LNVRKNIY------RP---IDMDMKHWWEHNIPSLVTEPGAVYEAID 203 + AI L + P ID+ M + S V+ P V + Sbjct: 174 PQGAIKFKSLCFPLKFFSLWFPLLNRKLPNNGIDIAMNKIYSS-TNSYVSFPPLVVTKNE 232 Query: 204 TNDSTIEESR 213 STI+ +R Sbjct: 233 HAISTIQTNR 242 >gi|317131936|ref|YP_004091250.1| glycosyl transferase family 25 [Ethanoligenens harbinense YUAN-3] gi|315469915|gb|ADU26519.1| glycosyl transferase family 25 [Ethanoligenens harbinense YUAN-3] Length = 294 Score = 103 bits (256), Expect = 4e-20, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 86/241 (35%), Gaps = 44/241 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-----FS---FFDAIYGE--NNPICNRIFSHQ-- 52 +Y I+L R + ++ RI + F F AI + I +++ Sbjct: 44 IYAINLDRKPKRWSQVSNKLKRIKSRSDTSLFELSRRFSAIDARYLDEKIDSKMLCPYFT 103 Query: 53 ----------------KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD 96 + +K ++ EI +SHI LWK+IA +ILEDD Sbjct: 104 LADQLQVEPNPLIQIDESSKAYKIKMTPQEIAIALSHIELWKKIASDNIPYTLILEDDVY 163 Query: 97 FSDEFSQLLPHL---------SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRI 147 F F+ L + + + F + KPK + + +H+P Sbjct: 164 FERGFANNLDAAWTDIINKSSQPFTFDILFLSFQEVGIKPKTR-----IHEDGLVHKPNC 218 Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 + +GY + K A LLN Y P+D+ + + + + + + +D + Sbjct: 219 GIWQASGYVLSKVGAQKLLN-MLPAYGPVDLWL-NLQFDKLDVYLINKPIIEQRVDVPST 276 Query: 208 T 208 Sbjct: 277 N 277 >gi|242007885|ref|XP_002424748.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508251|gb|EEB12010.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 259 Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 76/205 (37%), Gaps = 26/205 (12%) Query: 42 NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 F + + KR ++ EIGC++SH +W+++ + ++LEDD F F Sbjct: 58 LEELGIEFMPEYKDPYHKRAMTHGEIGCFLSHYFIWEKVLENKYDLIMVLEDDVQFEPYF 117 Query: 102 SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEA 161 + + L K ++ ++ + +D + K+ S T GY + E Sbjct: 118 REKITALLK-EVEDLKLPWDLIYIGRKRLVEHEDWVKGSKSLVHVAYSYWTLGYLLTNEG 176 Query: 162 AIHLLNVRK-NIYRPIDMDM-------------KHWWEHNIPSLVTEPGAVYEA------ 201 A LL + P+D + KH+ + ++ + P +Y Sbjct: 177 ARKLLEQKPLKKLIPVDEYIPILFDKHPEKEWKKHFPKRDLLAYSAAPLLLYPTHYTGEE 236 Query: 202 ---IDTNDSTIEESRLVRKPTFSPL 223 DT +S I + +K L Sbjct: 237 GYISDTENSVIVKES--QKEIREDL 259 >gi|326576961|gb|EGE26867.1| galactosyltransferase Lgt2B/C [Moraxella catarrhalis O35E] Length = 95 Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 42/98 (42%), Gaps = 5/98 (5%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RL 61 I +VIS+ + RRE + + F FFDA+ P ++ + + Sbjct: 2 IQNFVISIKTATKRREHIMCEFGKQGIAFEFFDAV----TPTDISKYAQKLSIPIINNQR 57 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD 99 L+ E C++SH+ LW+++ I EDD + Sbjct: 58 LTDGEKACFLSHVALWQQMIDENLDYMAIFEDDVYLGN 95 >gi|307107366|gb|EFN55609.1| hypothetical protein CHLNCDRAFT_133736 [Chlorella variabilis] Length = 565 Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 67/229 (29%), Gaps = 28/229 (12%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNRIFSHQKRQ-- 55 +P VYVI++ + AR F L F F A+ G + + + Sbjct: 331 LPFDVYVINMDGAAARLASFQRTFQASDLKAKNFVRFPAVNGSALDASRLVSAKALAEIE 390 Query: 56 -------CQFKRLLSLPEIGCYISHIHLWKRIAYS-PAIGAIILEDDADFS-DEFSQLLP 106 L+ +GCY+SH+ L++ I A A + EDDA L Sbjct: 391 AAEAAGYRTKHYQLTRGSVGCYMSHLQLYQHILRETDAQFAFVFEDDAVIEQPGLLAALV 450 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + ++ + C ++ T Y + + + Sbjct: 451 EAKPFPEDWDIVMLGH-------YCHSCPAVAGAPRYRAAHSFFGTHSYVVHRRGLQKI- 502 Query: 167 NVRKNIYRP----IDMDM-KHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 P ID + + + V + STI+ Sbjct: 503 -FAFPRLMPIEKQIDAVLGDMCQQGLLNVYALAQPLVEQNSKEFRSTIQ 550 >gi|307213488|gb|EFN88897.1| Glycosyltransferase 25 family member [Harpegnathos saltator] Length = 195 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 44/188 (23%), Positives = 76/188 (40%), Gaps = 26/188 (13%) Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLP 106 + KR ++ EIGC++SH +W+++ ++LEDD F F Q +L Sbjct: 2 PEYADPYHKRPMTTGEIGCFLSHYVIWQKVLEHGYKDVMVLEDDVRFEPFFRQKVRYVLT 61 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 LS DI L+ L +K +S + G+ + + S T GY + A L+ Sbjct: 62 ELSDLDIKWDLV---YLGRKRLMESAESRIQGSKFLVRATY-SYWTLGYILSNSGARKLV 117 Query: 167 NVRK-NIYRPIDMDM--------KHWW-----EHNIPSLVTEPGAVYE----AIDTNDST 208 + P+D + K W ++ L T P ++ + S Sbjct: 118 DAMPLGKLVPVDEYLPILSDAHPKKQWAAQFPIRDLIILSTNPLLIHPTHYTGENGYISD 177 Query: 209 IEESRLVR 216 E+S++VR Sbjct: 178 TEDSKIVR 185 >gi|253997090|ref|YP_003049154.1| glycosyl transferase family 25 [Methylotenera mobilis JLW8] gi|253983769|gb|ACT48627.1| glycosyl transferase family 25 [Methylotenera mobilis JLW8] Length = 250 Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 89/261 (34%), Gaps = 34/261 (13%) Query: 1 MPIP----VYVISLPFSHARREKFCHRAARIHL-QFSFFDAIY--------------GEN 41 M +P + I+L R E + ++++ F A+ + Sbjct: 1 MNLPFFKYICCINLDSRPDRWEGMQTQFNKVNIVDAQRFSAVSFEKLQNNPPPEGFKAKI 60 Query: 42 NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 R + + Q K + GC SH+ + K +ILEDD +F Sbjct: 61 MAALQRKNEAKNAEHQIKAMW-----GCLSSHVAVIKHAKAQNWPYVLILEDDCEFEPYT 115 Query: 102 SQLLPHLSK--CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGK 159 + ++ + + D++ ++ +KK ++ ++ T Y + Sbjct: 116 NTVMQRVMEQVSDLDWDMLYLGGNQKKYGLRQRKSK-----NLVAVTGITL-THAYMVRA 169 Query: 160 EAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPT 219 ++N ID ++++ +L+ EP Y+A D + +R RK Sbjct: 170 SIYDKIINEAPQANMTIDDFYAKQLQNHVKTLLVEPPVAYQAPDEVSDISQVAR--RKKY 227 Query: 220 FSPLYFYRNTCYQWNLHYNAW 240 R+ + N+ Y A Sbjct: 228 NWKSLLRRSKRWLANVRYGAQ 248 >gi|255327326|ref|ZP_05368400.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] gi|255295606|gb|EET74949.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] Length = 272 Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 79/226 (34%), Gaps = 26/226 (11%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFF---DAIYGENNPICNRIFSHQKRQCQF 58 I YV+ L ++ + + F A+ ++ + + F + + ++ Sbjct: 3 KILKYVLGLDGANR-----FESFYSVGVPAKDFVPYSAVDNRDDHVTDEEFDVEVFRKRW 57 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 R S E GC +SH ++WK S A A++ EDD S + ++ + + + ++ Sbjct: 58 HRDPSPAEKGCMLSHYNMWKDFLASDADWALLAEDDVLISPDLQPVVERIIEKYPHVQMV 117 Query: 119 KF------DALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHL 165 +A + P+ D +L F + R+ S Y + + A L Sbjct: 118 NLGDIYASEAGKLNPQVDYPRLSLLSPFVYGKYRMGSAYGSKPLYGAALYLLSRSGAERL 177 Query: 166 LNV---RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 + D + W + + +PG + T Sbjct: 178 VECFGETVPGVVADDYSLYREW--GVDVYLVQPGLCGWEGSSLIQT 221 >gi|307166662|gb|EFN60659.1| Glycosyltransferase 25 family member [Camponotus floridanus] Length = 198 Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats. Identities = 45/186 (24%), Positives = 74/186 (39%), Gaps = 27/186 (14%) Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLP 106 + KR +++ EIGC++SH +W+++ ++LEDD F F Q +L Sbjct: 2 PEYFDPYHKRPMTMGEIGCFLSHYLIWQKVLEHGYKNVMVLEDDVRFEPFFRQKVNYVLE 61 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 LS I LI + +K K +S + F +H T GY + + A L+ Sbjct: 62 ELSALGIEWDLI-YVGRKKLVKSESPVE--GSKFLLHAAYSY--WTLGYILSENGARKLI 116 Query: 167 NVRK-NIYRPIDMDM--------KHWW-----EHNIPSLVTEPGAVYE----AIDTNDST 208 P+D + K W ++ +L T P +Y D S Sbjct: 117 GAMPLGKLVPVDEYLPILSDTHPKEQWAAQFPIRDLITLSTNPLLIYPTHYTGEDGYISD 176 Query: 209 IEESRL 214 E+S+L Sbjct: 177 TEDSKL 182 >gi|48243707|gb|AAT40822.1| putative Lex2B [Haemophilus influenzae] Length = 134 Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats. Identities = 26/108 (24%), Positives = 46/108 (42%), Gaps = 4/108 (3%) Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K ++ D L+ K K L + GN I+Q TGY++ +AA L Sbjct: 5 DKNTFEFFWLQPDRLKNKRK----LISNFGNLSIYQFSKGFAGATGYYLTPQAARKFLTQ 60 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 K Y +D+ M ++E+ +P V P + + + + E+ + R Sbjct: 61 SKEWYLTVDVTMDRFFENKVPPYVIVPFCLEDDGEIESTIYEKQKKQR 108 >gi|114769884|ref|ZP_01447494.1| putative beta1,4-galactosyltransferase [alpha proteobacterium HTCC2255] gi|114549589|gb|EAU52471.1| putative beta1,4-galactosyltransferase [alpha proteobacterium HTCC2255] Length = 263 Score = 100 bits (248), Expect = 3e-19, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 78/214 (36%), Gaps = 13/214 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +YVISL R++ +A L AI N I + + R + Sbjct: 1 MKIYVISLENYRDRQKFQQEQAMAFGLDIEIVSAIDA--NSISEKKLQDAAN--NWSRPI 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN-NILIKFD 121 + ++ C++SH W+ I + A+I+EDD F+ + ++L H+S + + Sbjct: 57 NAKDVACFLSHKKTWE-IIHDKGERALIIEDDIVFNQQIVEVLNHISLNTTKLGEIYDLE 115 Query: 122 ALRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID-M 178 + +K K + + Y + AA LL + N +D Sbjct: 116 YVPRKHILAKFPKWESNTSKISATKIYQNKNGLGCYCLDHIAASKLLKEQVNYAM-VDAY 174 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 W + + EP + I N +I + Sbjct: 175 VWTRKWANYLQ---IEPAPAIQMIYLNGDSIADQ 205 >gi|317408199|gb|ADV17633.1| WclP [Salmonella enterica] Length = 227 Score = 99.6 bits (247), Expect = 3e-19, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 27/225 (12%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRLLSL 64 +ISL RRE+ + A+ +Q +AI G +F + + L+ Sbjct: 10 IISLVQHAERRERLINEMAKYKVQCRVSNAIDGRKLLADEYFSLFKANSSKLFGRGFLTP 69 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI--KFDA 122 E+GC++SH S ++LEDD +D +LL + + ++I + D Sbjct: 70 SELGCFLSHRKALSEFIMSGKQWLVMLEDDVTPNDNVRELLGVIHSFNESSIYVLGGQDG 129 Query: 123 LRKKPKKDSYLCTLPGNFD--IHQPRILSPRTTGYFIGKEAAI---HLLNVRKNIYRPID 177 L+ + + RT Y I ++ A L+N ++ P D Sbjct: 130 LKSFSRVVMGRERDKEKVRKVLLGTYRWMYRTCCYCIDRKGAKNIMKLMNEQRFF--PDD 187 Query: 178 M-------DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 ++K+++ N T P +D + S+IE R+ Sbjct: 188 WSFIMKNANLKNFFYGN---YFTHP------VDLSSSSIEAERIF 223 >gi|316973139|gb|EFV56766.1| glycosyltransferase 25 family member 1 [Trichinella spiralis] Length = 372 Score = 99.6 bits (247), Expect = 3e-19, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 55/150 (36%), Gaps = 12/150 (8%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFKRLLSLPEIGCYISHI 74 A + L+ +A+ G + R KR ++L E+GC++SH Sbjct: 1 MLACFAVLGLEVRLVEAVDGRMLEPADLNAIGVKQMPDYRDPYHKRPMTLGEVGCFLSHY 60 Query: 75 HLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINNILIKFDALRKKPKKD 130 ++W+ + A+I EDD F+ F + ++ L + L+ R P +D Sbjct: 61 NVWRDMLDRGYRRAVIFEDDLRFTRSFRRQVGVVMAELDANVPDWDLVYLGRKRLNPDQD 120 Query: 131 SYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L N S T Y + + Sbjct: 121 GPLV---ENCSFVSHVGYSYWTLAYALSRS 147 >gi|323450258|gb|EGB06140.1| hypothetical protein AURANDRAFT_65851 [Aureococcus anophagefferens] Length = 361 Score = 99.2 bits (246), Expect = 5e-19, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 68/180 (37%), Gaps = 23/180 (12%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN--------NPICNRIFS-------H 51 +I+L + R + A +++ F A+ G C + Sbjct: 2 LINLDRAPDRLAFMEKQFAEHGVRYERFPAVDGAAVAGDPAAVVEGCAVEYCAPADFGIE 61 Query: 52 QKRQCQFKRLLSLPEIGCYISHIHLWKRI-AYSPAIGAIILEDDA--DFSDEFSQLLPHL 108 ++R C+ + L + EIGC +SH+ KR+ + +++EDD +F ++++ L + Sbjct: 62 RERWCREGKKLRVGEIGCVLSHLACIKRVYEANEHEAVVVVEDDISLEFVPKWTRSLGDI 121 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 ++ D ++K P L + + S Y I + +L+ Sbjct: 122 ARAD-GWGIVKISCNH--PAGVRNLLKMDEGLVPFKRHYWSTGM--YIINRRGMKLVLDK 176 >gi|326783923|ref|YP_004324317.1| glycosyltransferase family 25 [Synechococcus phage S-SSM7] gi|310003935|gb|ADO98330.1| glycosyltransferase family 25 [Synechococcus phage S-SSM7] Length = 269 Score = 98.9 bits (245), Expect = 6e-19, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 62/169 (36%), Gaps = 18/169 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 P+Y+I+L R + + ++ ++ A G P S ++ + Sbjct: 14 PLYIINLDEQPERMQWMEEQLKEWEIENYTRISAYDGR--PSTGDDLSDIL-VGKYPDSI 70 Query: 63 SLPEIGCYISHIHLWKRIAYS-PAIGAIILEDDADFS-DEFSQLL--PHLSKCDINNILI 118 + EIGC SH+ K AII+EDD D S +F +S+ + + Sbjct: 71 TPGEIGCVTSHLKAIKHFVEETDEPYAIIMEDDCDISIAQFWTFTWRQFISRMPYDWDTV 130 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 + + +C + +H+ I T Y I + A LL Sbjct: 131 Q----------VAVICPGELHVQVHRRFINDFSTACYVITRHHAEKLLK 169 >gi|322786020|gb|EFZ12636.1| hypothetical protein SINV_80063 [Solenopsis invicta] Length = 186 Score = 98.9 bits (245), Expect = 7e-19, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 72/172 (41%), Gaps = 19/172 (11%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 +++ EIGC++SH +W+++ ++LEDD F F Q + ++ ++ ++ IK+D Sbjct: 1 MTMGEIGCFLSHYVVWQKVLKHGYKSVMVLEDDVRFEPFFRQKVNYVLA-ELTDLGIKWD 59 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK-NIYRPIDMDM 180 + K+ +L S T GY + + A L+ P+D + Sbjct: 60 LVYMGRKRLVKSESLVEGSKFLVWAGYSYWTLGYILSQTGAEKLIGAMPLGKLIPVDEFL 119 Query: 181 ---------KHWWE----HNIPSLVTEPGAVYE----AIDTNDSTIEESRLV 215 + W ++ L T P VY D + S E+S+LV Sbjct: 120 PILSNTHPEERWAVQFPTRDLIILSTNPLLVYPTHYTGEDGHISDTEDSKLV 171 >gi|300743831|ref|ZP_07072851.1| Lsg locus putative protein 4 [Rothia dentocariosa M567] gi|300380192|gb|EFJ76755.1| Lsg locus putative protein 4 [Rothia dentocariosa M567] Length = 274 Score = 98.5 bits (244), Expect = 8e-19, Method: Composition-based stats. Identities = 51/271 (18%), Positives = 91/271 (33%), Gaps = 37/271 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 + YV++L R E F + F F I G P + F + + R Sbjct: 1 MKKYVVALK-DENRLEHFFNAPGSAG--FDIFWGIDGRALPAPGESPEFDAVYFEKRAGR 57 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L E+GC +SH ++W+ S A++ EDDA +++ + + + ++ F Sbjct: 58 LARPGEVGCALSHTYVWRDFLESGEEWALVAEDDALIHASIDEIVSRVIEKSRSIGVVNF 117 Query: 121 ------DALRKKPKKDSYLCTLPGNFDIHQPRILSP------RTTGYFIGKEAAIHLLNV 168 R P+ +L F + R+ T Y I + AA ++ Sbjct: 118 ADGWSTQMGRMNPRTPYPRLSLFSPFIWGRHRLGYCASDWTACTGLYLISRSAAQKIIEQ 177 Query: 169 -----RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES----------R 213 + D + + V +PG T +S IEE + Sbjct: 178 IDARGSQYWL--ADDWPLFAADFGVDIKVVQPGLC---GWTGNSLIEEDGTHFVYHKANQ 232 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYNAWRKDL 244 ++P + L Y + N + L Sbjct: 233 TKKQPLLNKLRIYAAPKARLKNAQNIVKSTL 263 >gi|315639018|ref|ZP_07894188.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] gi|315480930|gb|EFU71564.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis JV21] Length = 134 Score = 98.5 bits (244), Expect = 9e-19, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 17/113 (15%) Query: 2 PIPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRIFSHQ 52 + +++I+L S R++ + R + F FF+AI + + S+ Sbjct: 11 SMKIFIINLKRSLERKKLMQKQIERFFENYPNLKDEINFEFFEAIDAKIKENMEKFASYF 70 Query: 53 KRQCQF-------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 + + E+ C+ SH+ LW++ +ILEDD F Sbjct: 71 PKFRSLAFCGRGGGCGILDTELACFASHLSLWQKCVELNE-AILILEDDFYFE 122 >gi|254474380|ref|ZP_05087766.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] gi|214028623|gb|EEB69458.1| LPS glycosyltransferase subfamily, putative [Ruegeria sp. R11] Length = 283 Score = 98.1 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 81/208 (38%), Gaps = 13/208 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARI--HLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + V++I+L + RR +R+ + + + R + Sbjct: 1 MNVFLINLDRAPDRRAHMLSELSRLLPGTEVQRALCVDIKAPDWTPPAEFTPGRWMSDRW 60 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L +I + SH+ WKRIA S ++LEDD FSD+F ++ L +++ Sbjct: 61 DLPASDIEIFRSHMDCWKRIAASGQP-GLVLEDDLLFSDDFGRIASELMANAPK-GIVRL 118 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 DA+ + + T G + ++Q + L+P Y + + A ++ I R +D + Sbjct: 119 DAVSTPMLRRAGTATDCG-YQLNQIKTLAPSAAAYVVDPDTAANM-AATARIERTVDDYL 176 Query: 181 KHW-------WEHNIPSLVTEPGAVYEA 201 H P EP A +A Sbjct: 177 FDPDPAARGARGHTHPIYQIEPAAAVQA 204 >gi|283458617|ref|YP_003363252.1| LPS biosynthesis glycosyltransferase [Rothia mucilaginosa DY-18] gi|283134667|dbj|BAI65432.1| glycosyltransferase involved in LPS biosynthesis [Rothia mucilaginosa DY-18] Length = 273 Score = 98.1 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 62/153 (40%), Gaps = 13/153 (8%) Query: 31 FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 F + A+ +N + + F + + ++ R S E GC +SH+ +W+ S A A+I Sbjct: 30 FIRYSAVDNRDNHVADEEFDVEFFRSRWHRDPSPAEKGCMLSHVKMWRDFVASDADWALI 89 Query: 91 LEDDADFSDEFSQLLPHLSKCDINNILIKF------DALRKKPKKDSYLCTL-----PGN 139 EDD S + ++ + L+ +A + P+ + +L G Sbjct: 90 AEDDILLSPDAEAVVNAIITKYPQVHLVNLSDSFANEAGKLNPQVEYIRLSLLSPFVHGK 149 Query: 140 FDIHQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 + + +P P T Y I + A LL + Sbjct: 150 YRMGKPYGGWPLICTGLYLISRSGAELLLKQFE 182 >gi|260582744|ref|ZP_05850531.1| LOW QUALITY PROTEIN: diadenosine tetraphosphatase [Haemophilus influenzae NT127] gi|260094194|gb|EEW78095.1| LOW QUALITY PROTEIN: diadenosine tetraphosphatase [Haemophilus influenzae NT127] Length = 269 Score = 98.1 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 69/202 (34%), Gaps = 19/202 (9%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-- 117 R+L+ E GC ISH LW + I EDD + L Sbjct: 48 RILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFND 107 Query: 118 ---IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI- 172 I+ + + K + P N + T GY I + AA +++ KNI Sbjct: 108 IFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHRGTAGYIISQGAAKYVIEYLKNIP 167 Query: 173 ---YRPID-------MDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 +D +D+ ++ + + P++ + ++ S +E+ R R Sbjct: 168 SDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQSKSVLTSGLEKERQKRPKIRK 227 Query: 222 PLYF-YRNTCYQWNLHYNAWRK 242 R T + N+ RK Sbjct: 228 KKTLKQRLTRIKENIIRALNRK 249 >gi|207092487|ref|ZP_03240274.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori HPKX_438_AG0C1] Length = 194 Score = 98.1 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 62/201 (30%), Gaps = 54/201 (26%) Query: 3 IPVYVISLPFSHARREKFCHRA--------------ARIHLQFSFFDAIYGEN-----NP 43 + V+VISL +K C + H Q FDAIY + +P Sbjct: 1 MRVFVISLN------QKMCDQFGLVFRDTTTLLNSINATHHQVQIFDAIYSKTFEGGLHP 54 Query: 44 ICNRIF-----------------------SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 + + + +SL E+GCY SH LW++ Sbjct: 55 LVKKHLHPYFITQNIKDMGITTNLISGVSKFYYALKYHAKFMSLGELGCYASHYSLWQKC 114 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT----- 135 ILEDD ++F + L L K ++ L P S + Sbjct: 115 IELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRLMHLLYDPNVKSEPLSHKNHE 173 Query: 136 LPGNFDIHQPRILSPRTTGYF 156 + I + T GY Sbjct: 174 IQERVGIIKAYSEGVGTQGYV 194 >gi|220933970|ref|YP_002512869.1| putative glycosyltransferase involved in LPS biosynthesis [Thioalkalivibrio sp. HL-EbGR7] gi|219995280|gb|ACL71882.1| putative glycosyltransferase involved in LPS biosynthesis [Thioalkalivibrio sp. HL-EbGR7] Length = 265 Score = 97.7 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 48/225 (21%), Positives = 81/225 (36%), Gaps = 9/225 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + VISL RR R R F I + ++ L Sbjct: 11 ILVISLETDLERRNLLRERFRRYSTHFKLIKGIDMRDATQPISHTVPPPCPRLTRKPLKP 70 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-SKCDINNILIKFDAL 123 EIGC +SH+ K+ A S A +ILEDD DE + + + ++ L+ Sbjct: 71 SEIGCALSHMLALKQAAISNARTILILEDDVLGKDEDIDTVHRISEQLPPHHFLLCGGQE 130 Query: 124 RKKPKKDSYLCTLPGNFDI-HQPRILSPRTTGYFIGKEAAIHLLNVRK-NIYRPIDMDMK 181 + K Y G F + R R Y + A+ +L ++ ++R D Sbjct: 131 GLRGNKFLYGSEEFGCFRLPSMVRRFVTRACCYALSPSMAVAILERQQACLHRSDDWHTL 190 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV--RKPTFSPLY 224 EHN+ + +D + S IE R ++ ++ ++ Sbjct: 191 LRHEHNV----FFSNLLGHPMDLSASHIEIERTQLPQRGSWKRIW 231 >gi|149201031|ref|ZP_01878006.1| glycosyl transferase, family 25 [Roseovarius sp. TM1035] gi|149145364|gb|EDM33390.1| glycosyl transferase, family 25 [Roseovarius sp. TM1035] Length = 240 Score = 97.3 bits (241), Expect = 2e-18, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 80/201 (39%), Gaps = 5/201 (2%) Query: 11 PFSHARREKFCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQCQFKRLLSLPEIG 68 RR + L+ + + ++ + + + L+ PEI Sbjct: 11 ARDATRRNRARDTLNTCGLEGDLWPEVEASVLSSTVLSATVGAGLFAPLYPFTLTTPEIA 70 Query: 69 CYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP- 127 ++SH +W I +I ++DA + L++ I++ + + A + + Sbjct: 71 RFLSHRQIWAEIVRQGLDYGLIFDEDAALDPQIFARARDLARDHIDD--LGYIAFQPQAV 128 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN 187 + + + G + P + +PR+ +G++AA HLL++ + RP++ ++ W + Sbjct: 129 RGPARVIDTNGGCVLCLPVVNAPRSPVQMVGQDAAAHLLHLTEIFDRPVEFLIQSHWHTH 188 Query: 188 IPSLVTEPGAVYEAIDTNDST 208 + + P V + + T Sbjct: 189 LRTGAVYPSGVSRIMQGSGHT 209 >gi|311113075|ref|YP_003984297.1| LPS glycosyltransferase subfamily [Rothia dentocariosa ATCC 17931] gi|310944569|gb|ADP40863.1| LPS glycosyltransferase subfamily [Rothia dentocariosa ATCC 17931] Length = 274 Score = 96.9 bits (240), Expect = 2e-18, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 92/274 (33%), Gaps = 37/274 (13%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPI--CNRIFSHQKRQCQFKR 60 + YV++L R E F F F + G PI + F + + + R Sbjct: 1 MKKYVVALK-DENRLEYFFSAPGGAG--FDIFWGVDGRALPIPGESPEFDNIYFEKRAGR 57 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L E+GC +SH ++W+ S A++ EDDA +++ + + ++ ++ F Sbjct: 58 LARPGEVGCALSHTYVWRDFLESGEEWALVAEDDALIHASIDEIVSRVIEKSLSIGVVNF 117 Query: 121 ------DALRKKPKKDSYLCTLPGNFDIHQPRILSP------RTTGYFIGKEAAIHLLNV 168 R + +L F + R+ T Y I + AA ++ Sbjct: 118 ADGWSTQMGRMNARTPYPRLSLFSPFIWGRHRLGYCANDWTACTGLYLISRSAAQKIIEQ 177 Query: 169 -----RKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEES----------R 213 + D + + V +PG T +S IEE + Sbjct: 178 IDARGSQYWL--ADDWPLFAADFGVDIKVVQPGLC---GWTGNSLIEEDGTHFVYHKANQ 232 Query: 214 LVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPV 247 ++P + L Y + N + V Sbjct: 233 TKKQPLLNKLRIYAAPKARLKNAQNIVKSTFWQV 266 >gi|332229692|ref|XP_003264022.1| PREDICTED: glycosyltransferase 25 family member 3 [Nomascus leucogenys] Length = 506 Score = 96.6 bits (239), Expect = 3e-18, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 63/215 (29%), Gaps = 63/215 (29%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V+VISL RRE+ + + DA+ G Sbjct: 277 VFVISLARRPDRRERMLTSLWEMEISGRVVDAVDGR------------------------ 312 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNILIKF 120 + ++ EDDA F F L L + ++ LI Sbjct: 313 ---------------VVARGLARVLVFEDDARFESNFRGRLERLMEDVEAEKLSWDLIYL 357 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PIDM 178 + P+K++ + LPG S T Y + A LL + + R P+D Sbjct: 358 GRKQVNPEKEAAVEGLPG----LVVAGYSYWTLAYALSLAGARKLL-ASQPLRRMLPVDE 412 Query: 179 DMK-------------HWWEHNIPSLVTEPGAVYE 200 + H+W ++ + P Sbjct: 413 FLPIMFDQHPNEQYKAHFWPRDLVAFSAWPLLAAP 447 Score = 48.4 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 9/48 (18%), Positives = 18/48 (37%) Query: 32 SFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 +F + + R L+ E+GC++SH +W+ Sbjct: 159 EYFPMLNSSAIRSLGVDLLPGYQDPYSGRTLTKGEVGCFLSHYSIWEE 206 >gi|86160466|ref|YP_467251.1| glycosyl transferase family protein [Anaeromyxobacter dehalogenans 2CP-C] gi|85776977|gb|ABC83814.1| glycosyl transferase, family 25 [Anaeromyxobacter dehalogenans 2CP-C] Length = 283 Score = 96.6 bits (239), Expect = 3e-18, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 75/235 (31%), Gaps = 37/235 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQ-----FK 59 ++V+++ + R+E+ R A L + F + Sbjct: 43 IFVVTVERAVERQERVRARLA--GLDYRFHQGMDKRLLDPARLAAEGYDAAADRRAARHS 100 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF-SDEFSQLLPHLSKCDINNILI 118 + +S +I C +SH+ +++ + ++ EDD + + L L + + L Sbjct: 101 KPMSQGQIACAVSHLQVYRAAVENGWQRVLVFEDDVVPRGPDLALLPEALRQLPEDWELA 160 Query: 119 KFD-------ALRKKPKKDSYLCTLPGNFDIHQPRI-----------------LSPRTTG 154 LR + K+ +YL P L T Sbjct: 161 YLGWSNFERVTLRHRAKQAAYLVGASLRLMKWTPAQILRFHPRPFSEHLRAAGLHHCTHA 220 Query: 155 YFIGKEAAIHLLNVRKNIYRPID-MDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 Y AA L+ ++ + R D + + + + VT P + D D T Sbjct: 221 YAFTLGAARKLVEAQRPLARNADQLLIHMVLSGKLRAFVTVP----KFFDQEDGT 271 >gi|53728775|ref|ZP_00135232.2| COG3306: Glycosyltransferase involved in LPS biosynthesis [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|165976545|ref|YP_001652138.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|190150446|ref|YP_001968971.1| glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|303250132|ref|ZP_07336334.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307257172|ref|ZP_07538944.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|165876646|gb|ABY69694.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|189915577|gb|ACE61829.1| Glycosyltransferase [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|302651195|gb|EFL81349.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306864334|gb|EFM96245.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 100 Score = 96.6 bits (239), Expect = 3e-18, Method: Composition-based stats. Identities = 29/100 (29%), Positives = 39/100 (39%), Gaps = 4/100 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M + YVISL + RR A+ H+ F FFDAI + + F Sbjct: 4 MELRNYVISLKNNSQRRNHMMSEFAKQHIPFVFFDAITPDLIERKAKEFGIDITTS---- 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 L+ EI C +SHI LW+ I E D + Sbjct: 60 PLTKGEIACALSHIALWRLAQEQGLDYIAIFEGDIYLGEN 99 >gi|197124547|ref|YP_002136498.1| glycosyl transferase family 25 [Anaeromyxobacter sp. K] gi|196174396|gb|ACG75369.1| glycosyl transferase family 25 [Anaeromyxobacter sp. K] Length = 271 Score = 95.8 bits (237), Expect = 6e-18, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 75/222 (33%), Gaps = 33/222 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI---FSHQKRQCQFKR- 60 ++VI++ + R+E+ R L + F + R + + +R Sbjct: 31 IFVITIERATERQERVRARLT--GLDYRFHLGMDKRLLDPARRAAEGYDEAADRAAARRS 88 Query: 61 -LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNI-- 116 ++ E+ C ISH+ +++ ++ EDD + + L L + + Sbjct: 89 RTMTPGELACAISHLQIYRAAVEHGWERVLVFEDDVLPRYQDLALLPATLEQLPDDWELA 148 Query: 117 LIKFDALRK-----KPKKDSYLCT--------LPGNFDIHQPRI---------LSPRTTG 154 + + K + K+ +YL P PR L Sbjct: 149 YLGYTNFEKVTPYHRAKQATYLVAAALGLMKWTPAEIRRFHPRRFSENLKVAGLHHCAHA 208 Query: 155 YFIGKEAAIHLLNVRKNIYRPID-MDMKHWWEHNIPSLVTEP 195 Y + AA L+ + + R D + + + + VTEP Sbjct: 209 YAFTQAAARKLVAAQTPLARNADQLFVHMVLSGKLRAFVTEP 250 >gi|57242449|ref|ZP_00370387.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] gi|57016734|gb|EAL53517.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Campylobacter upsaliensis RM3195] Length = 125 Score = 95.8 bits (237), Expect = 6e-18, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 43/112 (38%), Gaps = 17/112 (15%) Query: 3 IPVYVISLPFSHARREKFCHRAARI---------HLQFSFFDAIYGENNPICNRI---FS 50 + +++I+L S R++ + R + F FF+AI + + F Sbjct: 1 MKIFIINLKRSLERKKLMQKQIERFFENYPNLKDEINFEFFEAIDAKIKENMEKFTSYFP 60 Query: 51 HQKRQCQFKRLLSLP----EIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 + R E+ C+ SH+ LW++ +ILEDD F Sbjct: 61 KFRSLAFCGRGGGCGILDTELACFASHLSLWQKCVELNE-AVLILEDDFYFE 111 >gi|197334126|ref|YP_002154914.1| glycosyl transferase, family 25 [Vibrio fischeri MJ11] gi|197315616|gb|ACH65063.1| glycosyl transferase, family 25 [Vibrio fischeri MJ11] Length = 151 Score = 95.4 bits (236), Expect = 6e-18, Method: Composition-based stats. Identities = 34/152 (22%), Positives = 58/152 (38%), Gaps = 9/152 (5%) Query: 91 LEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP 150 +ED+ D Q++ + IK A K P + L F + + + Sbjct: 1 MEDNVDPIANAKQIIELSFEHIQTYDYIKLSATHKSP--FHRILDLTPEFQLGGYKKKTC 58 Query: 151 RTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 TT Y I +AA ++ + P+D M+ W H + + P A T STI Sbjct: 59 GTTAYIISPKAAQRFIDNAQAFLEPVDDYMEKPWRHQVQTYSVSPDLFTRADIT--STIG 116 Query: 211 ESRLVRKP--TFSPLY--FYRNTCYQW-NLHY 237 +R + F+ +Y +R +LH+ Sbjct: 117 STRKDKSGISPFNKIYIELFRTYESLMKSLHW 148 >gi|303291206|ref|XP_003064889.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] gi|226453560|gb|EEH50869.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] Length = 293 Score = 95.4 bits (236), Expect = 7e-18, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 80/240 (33%), Gaps = 41/240 (17%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG------------------ENNP 43 + V+VISL S RR + ++ ++ A+ G Sbjct: 58 DLNVHVISLQRSSQRRAACVEALSLENMSYTLSLAVDGLSPFTEGEFNRYAGWKRRALMS 117 Query: 44 ICNRIFSHQKRQCQFKRLLSLPE---------IGCYISHIHLWKRIAYSPAIGAIILEDD 94 + S + + R LS E GC +SHI +W ++ S + I+LEDD Sbjct: 118 LTAEELSDLQEKFASSRRLSNREQRVLHERLRFGCSLSHIRIWMKLLSSTSDFFIVLEDD 177 Query: 95 ADFSDEFSQLL-PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNF--DIHQPRILSPR 151 A EF + L + L + LPG + + Sbjct: 178 AIVVKEFERKTHEALKSLPDDWELFYIGYVC---------SALPGGYLSRNIRQLRGGSC 228 Query: 152 TTGYFIGKEAAIHLL-NVRKNIYRPIDMDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTI 209 T GY + + A L+ P+D +K + P+ +P + N ST+ Sbjct: 229 TKGYAVSRRGAERLVFRSAVGSNLPVDNMIKVDVAAGLAPAFYADPPLLAVTDTRNLSTL 288 >gi|326437233|gb|EGD82803.1| hypothetical protein PTSG_03453 [Salpingoeca sp. ATCC 50818] Length = 289 Score = 95.4 bits (236), Expect = 7e-18, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 64/210 (30%), Gaps = 32/210 (15%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQK--RQCQF 58 I VI+L S R E+ + + L F + + K R + Sbjct: 83 NIGFLVINLDRSPKRLERMRKQFDDLELPQFERVPGVE----------YDPTKEYRVARG 132 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 + L + G ++H W+ S +I+EDDA+ S L K + L+ Sbjct: 133 SKYLKPADYGTALAHYEAWRHAYRSKHRWNVIMEDDAELLPNVS-LTGPWPKVPKDADLV 191 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPI 176 + + S +C Y + + A ++ + PI Sbjct: 192 LL--RKSSMFETSPICKDTSVLK----ADWGFGMVAYLVSRAGAGRMIREAERVGFRSPI 245 Query: 177 DMDMKHWWEHNIPSLV------TEPGAVYE 200 D H W H+ V T P Sbjct: 246 D---GHIWYHS-RVYVTGEDWITHPPCASP 271 >gi|167516424|ref|XP_001742553.1| hypothetical protein [Monosiga brevicollis MX1] gi|163779177|gb|EDQ92791.1| predicted protein [Monosiga brevicollis MX1] Length = 264 Score = 95.0 bits (235), Expect = 8e-18, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 59/183 (32%), Gaps = 20/183 (10%) Query: 7 VISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 VI+L S R + +L F + + + R L Sbjct: 62 VINLDRSPQRLARMRQSFKEHNLPPFERVRGVEASPD-------GGPYDVPRLVRGLKWA 114 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 + G ++H+ WK A S II+EDDA S L L ++ LI F Sbjct: 115 DYGTSLAHLRAWKAAARSNYPWNIIIEDDATLLPNASYL--ELPPIPVDCDLILF----- 167 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDMKHWW 184 +P + + Y + +E A ++ + PID H W Sbjct: 168 RPSTIFEREDICEETPVQDAY-WGYGMVAYLVSREGAQRMIQKASRGFGGPID---GHIW 223 Query: 185 EHN 187 HN Sbjct: 224 YHN 226 >gi|157877189|ref|XP_001686925.1| glycosyl transferase [Leishmania major strain Friedlin] gi|68130000|emb|CAJ09308.1| glycosyl transferase-like protein [Leishmania major strain Friedlin] Length = 285 Score = 94.6 bits (234), Expect = 1e-17, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 71/222 (31%), Gaps = 50/222 (22%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQF---SFFDAIYGENNPICN------------RIF 49 V+VI+L R AR L A+ G + R Sbjct: 21 VFVINLDRRPDRWAAIQAVCARAGLPAERTERVPAVEGSRLDVNAVHHCGFVSALGLRRL 80 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY--------SPAIGA-------IILEDD 94 + L+ +GC +SHIHLW RIA + A A ++LEDD Sbjct: 81 KEPPEHHIWGMDLNKAALGCALSHIHLWARIAALGKVSNISAEAPAATLPKQCFLVLEDD 140 Query: 95 ADFSDE----------------FSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + F+D QL ++ + L+ + + K+ ++ G Sbjct: 141 STFADSDDNGSDSPAASPSLPFLDQLQRRMNSVPPDWELV-YVSGLDTAKQCLHMQVAKG 199 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + T Y + + A LL + +D M Sbjct: 200 ---VARVPQYHRTTNAYLVTPQGARRLLATCVPLTFQLDTAM 238 >gi|313892173|ref|ZP_07825766.1| LPS glycosyltransferase [Dialister microaerophilus UPII 345-E] gi|313119311|gb|EFR42510.1| LPS glycosyltransferase [Dialister microaerophilus UPII 345-E] Length = 250 Score = 94.2 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 65/176 (36%), Gaps = 17/176 (9%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + +YV++L R++ +I + F+FF + S + ++ + Sbjct: 1 MKIYVLTLENETERQKNVIKEFQKIDVHFNFFKGMDARKLS------SDELKKLCVNEAM 54 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-----DEFSQLLPHLSKCDINNIL 117 EIGC SH+ + + + I EDD FS + + + + +++ Sbjct: 55 LAGEIGCAESHLQIMRDFLKTEENSVFIFEDDVQFSKKVNKNVLKEFKKFIDSKNTPSVI 114 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 I +K K+ GN +++ + Y I ++ A L+ I Sbjct: 115 I-----LRKRKRLGKEVYRIGNIPVYECYR-ATGAISYLINRKCAEFLVKNNTPIR 164 >gi|255327325|ref|ZP_05368399.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] gi|255295605|gb|EET74948.1| LPS glycosyltransferase [Rothia mucilaginosa ATCC 25296] Length = 273 Score = 93.9 bits (232), Expect = 2e-17, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 62/154 (40%), Gaps = 13/154 (8%) Query: 30 QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAI 89 F + A+ +N + + F + + ++ S E GC +SH+++W+ S A A+ Sbjct: 29 DFVRYPAVDNRDNHVADEEFDVEFFRSRWHCDPSPAEKGCMLSHVNMWRDFVASDADWAL 88 Query: 90 ILEDDADFSDEFSQLLPHLSKCDINNILIKF------DALRKKPKKDSYLCTL-----PG 138 I EDD S + ++ + L+ DA + P+ + +L G Sbjct: 89 IAEDDILISPDAEAVVNAIITKYPQVQLVNLSDSFANDAGKLNPQVEYIRLSLLSPFVHG 148 Query: 139 NFDIHQPRILSP--RTTGYFIGKEAAIHLLNVRK 170 + + +P P T Y I + A LL + Sbjct: 149 KYRMGKPYGSWPLVCTGLYLISRSGAELLLKQFE 182 >gi|220919272|ref|YP_002494576.1| glycosyl transferase family 25 [Anaeromyxobacter dehalogenans 2CP-1] gi|219957126|gb|ACL67510.1| glycosyl transferase family 25 [Anaeromyxobacter dehalogenans 2CP-1] Length = 271 Score = 93.9 bits (232), Expect = 2e-17, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 76/222 (34%), Gaps = 33/222 (14%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI---FSHQKRQCQFKR- 60 ++VI++ + R+E+ R L + F + R + + +R Sbjct: 31 IFVITIERAAERQERVRTRLT--GLDYRFHLGMDKRLLDPARRAAEGYDEAADRAAARRS 88 Query: 61 -LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNI-- 116 ++ E+ C ISH+ +++ ++ EDD ++ + L L + + Sbjct: 89 RTMTPGELACAISHLQIYRAAVEHGWERVLVFEDDVLPRYEDLALLPQTLEQLPDDWELA 148 Query: 117 LIKFDALRK-----KPKKDSYLCT--------LPGNFDIHQPRI---------LSPRTTG 154 + + K + K+ +YL P PR L Sbjct: 149 YLGYTNFEKVTPYHRAKQATYLVAAALGLMKWTPAEIRRFHPRRFSENLKVAGLHHCAHA 208 Query: 155 YFIGKEAAIHLLNVRKNIYRPID-MDMKHWWEHNIPSLVTEP 195 Y + AA L+ + + R D + + + + VTEP Sbjct: 209 YAFTQAAARKLVAAQTPLARNADQLFVHMVLSGKLRAFVTEP 250 >gi|61806142|ref|YP_214502.1| glycosyltransferase family 25 [Prochlorococcus phage P-SSM2] gi|61374651|gb|AAX44648.1| glycosyltransferase family 25 [Prochlorococcus phage P-SSM2] gi|265525354|gb|ACY76151.1| glycosyltransferase family 25 [Prochlorococcus phage P-SSM2] Length = 263 Score = 93.9 bits (232), Expect = 2e-17, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 60/166 (36%), Gaps = 20/166 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 VY I+L R E ++ ++ A G + + + + + ++ ++ Sbjct: 16 VYCINLDGEPHRWESMKTMLEYWEVENYTRISAYDGREDDLSDIL------KGRYPDSMT 69 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS---QLLPHLSKCDINNILIKF 120 E+GC SH+ K + AI++EDD D S SK + +I+ Sbjct: 70 GGEVGCTTSHLKALKEFLKTDQPCAIVMEDDCDLSPVAHWGFTWKDFFSKIPYDYDVIQL 129 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + + + +H+ + T Y I + A L+ Sbjct: 130 ----------AIINPAQVHLQMHRRFVNDFSTACYLITRHHAEKLV 165 >gi|159490780|ref|XP_001703351.1| predicted protein [Chlamydomonas reinhardtii] gi|158280275|gb|EDP06033.1| predicted protein [Chlamydomonas reinhardtii] Length = 388 Score = 93.5 bits (231), Expect = 2e-17, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 83/228 (36%), Gaps = 28/228 (12%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN---NPICNRIFSHQKR---- 54 +PV+VI +P R R A + + + + G P+ + + R Sbjct: 112 TLPVHVIVIPRLTQRLADARQRLAEVGVAYELLNGTDGRQPIPAPLLDMYLGPRLRDLVL 171 Query: 55 --QCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----FSQLLPHL 108 Q +++ +S++ + R+ A+ILEDDA ++++ L Sbjct: 172 RQHDQHWVMMTAN----LLSNVQAFHRVVAREQQAAVILEDDACPRPNNASWMAEVIKAL 227 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + ++ A + + + + T Y + + A+ + + Sbjct: 228 QQLPEDWDILYLWAQGGQRGRFVG--------PNVRVVRKAAGTVCYVLSLKGALQAIRL 279 Query: 169 RKNIYRPID-MDMKHWWEHNIPSLVTEPGAVYEAI--DTNDSTIEESR 213 ++ ID + H W H++ + VTEP D ST+E R Sbjct: 280 AEHANEGIDKVLFYHEWRHHLQAYVTEPPLCSHGPSGDGAGSTLEYDR 327 >gi|303253306|ref|ZP_07339455.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307248142|ref|ZP_07530170.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|302647988|gb|EFL78195.1| lipooligosaccharide biosynthesis protein [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306855319|gb|EFM87494.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 100 Score = 93.1 bits (230), Expect = 3e-17, Method: Composition-based stats. Identities = 29/100 (29%), Positives = 38/100 (38%), Gaps = 4/100 (4%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 M + YVISL + RR A+ H+ F FFDAI + + F Sbjct: 4 MELRNYVISLKNNSQRRNHMMSEFAKQHIPFVFFDAITPDLIERKAKEFGIDITTS---- 59 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 L EI C +SHI LW+ I E D + Sbjct: 60 PLIKGEIACALSHIALWRLAQEQGLDYIAIFEGDIYLGEN 99 >gi|47223918|emb|CAG06095.1| unnamed protein product [Tetraodon nigroviridis] Length = 660 Score = 93.1 bits (230), Expect = 3e-17, Method: Composition-based stats. Identities = 54/298 (18%), Positives = 96/298 (32%), Gaps = 54/298 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-----FSHQKRQCQFK 59 V++I+L RRE+ + L A+ G+ + + Sbjct: 361 VFMINLVRRSDRRERMLRTLYELELSCQVVAAVDGKALNKTDVESLGIHMLPGYKDPYHG 420 Query: 60 RLLSLPEIGCY---ISHIH----------LWKRIAYSPAIGAIILEDD--ADFSDEFSQL 104 R L+ E+GC+ + H+ + + +P A D A S++ Sbjct: 421 RPLTKGELGCFSLSLQHLEGDLPGDRGRPPLRGVLQTPPADAAGGGDGPRAGLGSHVSEV 480 Query: 105 LP-HLSKCDINNILIKFDA-----LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG 158 N + F RK+ + D ++P ++ Q S T GY + Sbjct: 481 TAGSAPPNGTNANSLAFPHPNRYIGRKRMQTDRPETSVPNIHNLVQA-DYSYWTLGYLLS 539 Query: 159 KEAAIHLLNVRKNIY--RPID-------------MDMKHWWEHNIPSLVTEPGAVYEA-- 201 A LL + + P+D M+ + + ++ + EP VY Sbjct: 540 LRGARKLLK-AEPLSKMLPVDEFLPVMYNKHPILEYMEPFKKRDLRAFSAEPLLVYPTHY 598 Query: 202 -------IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTKF 252 DT S + ++ V+ T R T Q L + A D+ KF Sbjct: 599 TGEPGYVSDTETSVVWDNETVK--TDWDRAKSRKTQEQEELSFEAQNSDVLQSELEKF 654 >gi|48734956|gb|AAH71684.1| GLT25D1 protein [Homo sapiens] Length = 222 Score = 93.1 bits (230), Expect = 3e-17, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 62/154 (40%), Gaps = 11/154 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 60 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 119 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 120 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 179 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS 149 LI RK+ + + +P ++ + Sbjct: 180 DLI--YVGRKRMQVEHPEKAVPRVRNLVEADYSY 211 >gi|322490934|emb|CBZ26198.1| glycosyl transferase-like protein [Leishmania mexicana MHOM/GT/2001/U1103] Length = 285 Score = 93.1 bits (230), Expect = 4e-17, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 67/222 (30%), Gaps = 50/222 (22%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQF---SFFDAIYGENNPICN------------RIF 49 ++VI+L R AR A+ G + R Sbjct: 21 IFVINLDRRPDRWTAIQAVCARAGFPAERTERVPAVEGSRLDVNAVHRCRFVSALGLRRL 80 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRI--------AYSPAIGA-------IILEDD 94 + L+ +GC +SHIHLW RI + A A ++LEDD Sbjct: 81 KEPPEHHIWGMDLNKAALGCALSHIHLWARIAALGKVSNVSAEAPAAALPKQCFLVLEDD 140 Query: 95 ADF----------------SDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + +QL ++ + L+ + + ++ ++ G Sbjct: 141 STLAESDDSGSSSPAASPSHPFLNQLQRRMNSVPPDWELV-YVSGLDTARQCPHMQVAKG 199 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + T Y + + A LL + +D M Sbjct: 200 ---VARVPQYHRTTNAYLVTPQGARRLLATCVPLTFQLDTVM 238 >gi|237507721|ref|ZP_04520436.1| putative lipopolysaccharide biosynthesis protein [Burkholderia pseudomallei MSHR346] gi|234999926|gb|EEP49350.1| putative lipopolysaccharide biosynthesis protein [Burkholderia pseudomallei MSHR346] Length = 172 Score = 93.1 bits (230), Expect = 4e-17, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 56/170 (32%), Gaps = 16/170 (9%) Query: 88 AIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRI 147 ++EDD D F L + +++ + ++ + S LP + Sbjct: 1 MCVMEDDITLLDGFKAATLELYDARQHWDMVRLMWINER--QQSEYARLPSGTRLM-WME 57 Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEA------ 201 Y I + AA +L+ I ID+ WEH VT P V + Sbjct: 58 NPVGLQCYMITRTAAQRMLDYTAKITHAIDIAFDRNWEHGQRMYVTSPQFVADTGAPTTI 117 Query: 202 IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 D DS RL K +YR + + +N + P+ Sbjct: 118 TDRPDSRTLMQRLKAK-------YYRKVERRTSRRFNEEHRPKRPIQIEI 160 >gi|145630510|ref|ZP_01786290.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae R3021] gi|144983900|gb|EDJ91342.1| lipooligosaccharide glycosyl transferase [Haemophilus influenzae R3021] Length = 193 Score = 92.7 bits (229), Expect = 5e-17, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 66/161 (40%), Gaps = 12/161 (7%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 +S EIGC +SHI LW + I EDD + +LL + + ++K + Sbjct: 1 MSDGEIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-IDYISDDIHVLKLE 59 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMD 179 A K K ++ + +++ + GY + + A +LL + K + +D Sbjct: 60 ANGKMFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSL 117 Query: 180 MKHWWEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + H + + PG V + +S+++E R Sbjct: 118 VFEDFLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 158 >gi|328860114|gb|EGG09221.1| hypothetical protein MELLADRAFT_115844 [Melampsora larici-populina 98AG31] Length = 347 Score = 92.7 bits (229), Expect = 5e-17, Method: Composition-based stats. Identities = 46/209 (22%), Positives = 71/209 (33%), Gaps = 20/209 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAI---YGENNPICNRIFSHQKRQCQFKRL 61 + V+SL RR + + L+ F DAI N I NR+ + K K++ Sbjct: 96 IRVVSLKSRTDRRNHMKKLNSFLDLKIKFSDAILYNDSRVNEIVNRVGNQTKAD---KKM 152 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKF 120 + + C +SH + +ILEDD D F L L + +I F Sbjct: 153 AEVGHVACRMSHRMAIQAAEDDHDEFTLILEDDVDIESAFKYLSGTILRDVPKDWDMIFF 212 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-----NVRKNIYR- 174 + NF I++ I GY + + L+ N R +Y Sbjct: 213 GHTDFSDESRHGRDPRTSNFYIYK-SIEPQGGHGYALSPKG-RKLILDLLSNKRPELYET 270 Query: 175 ----PID-MDMKHWWEHNIPSLVTEPGAV 198 PID + M H P + Sbjct: 271 DEGQPIDEIFMYLARLHKANLFSIIPDLI 299 >gi|86742004|ref|YP_482404.1| glycosyl transferase family protein [Frankia sp. CcI3] gi|86568866|gb|ABD12675.1| glycosyl transferase, family 25 [Frankia sp. CcI3] Length = 258 Score = 92.3 bits (228), Expect = 6e-17, Method: Composition-based stats. Identities = 47/247 (19%), Positives = 79/247 (31%), Gaps = 29/247 (11%) Query: 2 PIPVYVISLPFSHARREKFCHRAA-RIHLQFS-----FFDAIYGENNPICNRIF-----S 50 + YVI+L RR + H + F+ FD + + + + Sbjct: 6 DLRTYVINLLRRPDRRARMTHVLPPELKATFTSDWRGLFDGLNLTRSQLDAAGYTLFPWQ 65 Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII-LEDDADFSDEFSQLLPHLS 109 + + R L EIGC ++H+ W+ A + I+ LEDD D F L Sbjct: 66 IESDNPWWARPLKYGEIGCTLAHLACWRHAAQTGDEPFIVFLEDDLVIPDAFLDQLLAGL 125 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NV 168 + L + P + + + F S T GY + + +L Sbjct: 126 HQLTRHGPFDLLYLGRFPLEPDHDQPVLDGF---VSPGYSHCTFGYLLTRPGLGLVLAAH 182 Query: 169 RKNIYRPIDMDMKHWW------------EHNIPSLVTEPGAVYEAI-DTNDSTIEESRLV 215 + P+D + + + +L EP V + D S E S V Sbjct: 183 IEEAVVPVDEFLPALYIDHPRPDLRARFRRRLTALAFEPPLVSQRPKDEAGSDTERSAFV 242 Query: 216 RKPTFSP 222 P Sbjct: 243 EPRDRQP 249 >gi|317012433|gb|ADU83041.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Lithuania75] Length = 183 Score = 91.9 bits (227), Expect = 8e-17, Method: Composition-based stats. Identities = 35/131 (26%), Positives = 51/131 (38%), Gaps = 17/131 (12%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKLVQELYDAQS 60 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 L+LPE GCY+SH LWK + +ILEDD F Q L Sbjct: 61 LLQSDWYHSYVGAGLTLPEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI 118 K + + + Sbjct: 120 CLKSPFDFVRL 130 >gi|13470787|ref|NP_102356.1| hypothetical protein mll0582 [Mesorhizobium loti MAFF303099] gi|14021530|dbj|BAB48142.1| mll0582 [Mesorhizobium loti MAFF303099] Length = 931 Score = 91.9 bits (227), Expect = 8e-17, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 69/239 (28%), Gaps = 38/239 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +++I+L S R K+ R + A+ G + Sbjct: 688 IHLINLDSSVERWRKYQDRNPHLAANTIRVSAVDGASLDRSTLQALVDDGVIAEDCGYLP 747 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCDINNILIKFDA- 122 +GC +SHI+LW ++A S I EDD S F + LS N ++K+ Sbjct: 748 GALGCALSHINLW-KLAVSEKRPITIFEDDVYSSFNFLEESTRILSMAPENWDMVKWGFN 806 Query: 123 ------------------LRKKPKKDSYLCTLPGNFDIHQPRILSP-RTTGYFIGKEAAI 163 + K + R++ Y + A Sbjct: 807 FDPLFLWLNFDFSKAKLEFYNRQHKQIPAQFQSETYPRSLIRLVHSFGAMAYSVTPRGAQ 866 Query: 164 HLLNVRKNIYR---P------------IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 LL + + P ID M ++ + + V P V + Sbjct: 867 VLLEKCLPLRKRLIPFPGTGVVLEDEGIDCAMSAVYDS-MHAFVCMPPLVIHDDEQASD 924 >gi|72384745|gb|AAZ67664.1| LOS biosynthesis enzyme putative [Haemophilus parasuis 29755] Length = 125 Score = 91.5 bits (226), Expect = 9e-17, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 44/125 (35%), Gaps = 10/125 (8%) Query: 96 DFSDEFSQLL-PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP-------RI 147 S F +++ L+K ++ D K K + L + + + + Sbjct: 1 IVSLYFKEIISDALNKISPKYEILFLD--HGKAKIYPFPKNLVERYRLARYISPSKNSKR 58 Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 RTTGY I + A LL I P D +I + EP V+ + + Sbjct: 59 TIIRTTGYIITQAGAKRLLEYAYPIRMPSDFLTGSLQMTHIRAYGIEPSCVFGGSHSEIN 118 Query: 208 TIEES 212 +E+ Sbjct: 119 EMEDR 123 >gi|145628349|ref|ZP_01784150.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 22.1-21] gi|144980124|gb|EDJ89783.1| membrane component of ABC transporter FtsEX [Haemophilus influenzae 22.1-21] Length = 193 Score = 91.5 bits (226), Expect = 1e-16, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 66/161 (40%), Gaps = 12/161 (7%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 +S EIGC +SHI LW + I EDD + +LL + + ++K + Sbjct: 1 MSDGEIGCALSHIVLWDLALENNLNYINIFEDDIHLGENAKELLE-VDYISDDIHVLKLE 59 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMD 179 A K K ++ + +++ + GY + + A +LL + K + +D Sbjct: 60 ANGKMFFKQPK--SVKCDRNVYPMTVKQSGCAGYTVTAKGAKYLLELVKNKPLDVAVDSL 117 Query: 180 MKHWWEH--NIPSLVTEPGA-----VYEAIDTNDSTIEESR 213 + + H + + PG V + +S+++E R Sbjct: 118 VFEDFLHFKDYKIVQLSPGICVQDFVLHPDNPFESSLQEGR 158 >gi|203285045|gb|ACH97161.1| WclP [Escherichia coli] Length = 226 Score = 91.2 bits (225), Expect = 1e-16, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 82/228 (35%), Gaps = 34/228 (14%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRLLSL 64 +ISLP + RR+K + A+ ++ AI G +F + + + L+ Sbjct: 10 IISLPQHNERRDKLKNEMAKYDIECRVSHAIDGRKLLAEKYFSLFKIRSSKMFGRGFLTP 69 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 E+GC++SH S ++LEDD + L ++ +++ I L Sbjct: 70 SELGCFLSHKKALTEFLASGRKWLVVLEDDVLPKENVKYLDEMINSFCSSSVYI----LG 125 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVRKN--IYRP 175 + S+ + G I R + T Y + + A +L + + + Sbjct: 126 GQDGLKSFSRVIMGRKSICGVRKVILGTHRWLYRTCCYCVDIKGAERILRLMEENSFFC- 184 Query: 176 IDMD--------MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 D + + + + P ++ N S+IE RL Sbjct: 185 -DDWSYIVRNAKLDNVFYG---QYFSHP------VNLNSSSIEAERLF 222 >gi|332853929|ref|XP_512497.3| PREDICTED: procollagen galactosyltransferase 1, partial [Pan troglodytes] Length = 478 Score = 90.8 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 48/112 (42%), Gaps = 5/112 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQF-----K 59 V++I+L RRE+ ++ +A+ G+ + + Sbjct: 343 VFMINLRRRQDRRERMLRALQAQEIECRLVEAVDGKAMNTSQVEALGIQMLPGYRDPYHG 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC 111 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + Sbjct: 403 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRD 454 >gi|254229481|ref|ZP_04922896.1| Glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] gi|151938052|gb|EDN56895.1| Glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] Length = 249 Score = 90.8 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 53/242 (21%), Positives = 89/242 (36%), Gaps = 23/242 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFF--DAIYGENNPICNRIFSHQKRQCQFK 59 P+ V+VI+ + R + + + F F D+ + + + + + K Sbjct: 25 PLKVFVIT-TGNEERVNRIQNYLK--GIDFEFVYSDSYD--DLLKLEKEYESYSHKFRQK 79 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILI 118 ++ EIGC+ +H W+ I S AI+LED+ +F E S+LL + + L+ Sbjct: 80 AIM-AGEIGCFKTHSQAWELIVASGTP-AIVLEDNIEFIGEASRLLADDVLSYIQSCGLV 137 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYR--P 175 F Y+ F I + T Y I A LL+ + P Sbjct: 138 NFTNF-------CYILDPEKPFKISDVKEKKPFPTVCYGITPLRARDLLSSMRKTAYAVP 190 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL 235 ID + P V + D S I R +K TF+PL + + Sbjct: 191 IDKWLAIPKLSGCYGY-ISPIVVAKREDNLTS-IANKRKGKK-TFNPLNMIQRVINKIKY 247 Query: 236 HY 237 +Y Sbjct: 248 NY 249 >gi|317180390|dbj|BAJ58176.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 184 Score = 90.4 bits (223), Expect = 2e-16, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 49/131 (37%), Gaps = 17/131 (12%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE+GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPELGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI 118 K + + + Sbjct: 120 CLKSPFDFVRL 130 >gi|262392590|ref|YP_003284444.1| LPS biosynthesis glycosyltransferase [Vibrio sp. Ex25] gi|262336184|gb|ACY49979.1| glycosyltransferase involved in LPS biosynthesis [Vibrio sp. Ex25] Length = 253 Score = 90.4 bits (223), Expect = 2e-16, Method: Composition-based stats. Identities = 53/242 (21%), Positives = 89/242 (36%), Gaps = 23/242 (9%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFF--DAIYGENNPICNRIFSHQKRQCQFK 59 P+ V+VI+ + R + + + F F D+ + + + + + K Sbjct: 29 PLKVFVIT-TGNEERVNRIQNYLK--GIDFEFVYSDSYD--DLLKLEKEYESYSHKFRQK 83 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILI 118 ++ EIGC+ +H W+ I S AI+LED+ +F E S+LL + + L+ Sbjct: 84 AIM-AGEIGCFKTHSQAWELIVASGTP-AIVLEDNIEFIGEASRLLADDVLSYIQSCGLV 141 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYR--P 175 F Y+ F I + T Y I A LL+ + P Sbjct: 142 NFTNF-------CYILDPEKPFKISDVKEKKPFPTVCYGITPLRARDLLSSMRKTAYAVP 194 Query: 176 IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNL 235 ID + P V + D S I R +K TF+PL + + Sbjct: 195 IDKWLAIPKLSGCYGY-ISPIVVAKREDNLTS-IANKRKGKK-TFNPLNMIQRVINKIKY 251 Query: 236 HY 237 +Y Sbjct: 252 NY 253 >gi|322510494|gb|ADX05808.1| putative glycosyl transferase [Organic Lake phycodnavirus 1] Length = 223 Score = 90.4 bits (223), Expect = 2e-16, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 72/205 (35%), Gaps = 13/205 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGE-------NNPICNRIFSHQKRQC 56 ++I+L R++K + H+ ++ FF AI N I K Sbjct: 6 TFIINLKHRKDRKKKMTKQLQNAHIHKYEFFKAIQPSPQDIQKWNTKFLEPIPDWFKLTG 65 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 + + +GC +SHI + K +ILEDD +F HL+ +IN++ Sbjct: 66 GDEMKYKIGSLGCMLSHIEVIKLSLERNYDRVLILEDDTEFELGDKHGFTHLN-DEINDL 124 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 L + + + T Y I K A ++++ + R + Sbjct: 125 SFGLFYLAGNHRGSKIEKVKHNVLRV----QGTYTTGSYVIDKSAMLYIVQAIQGFTREV 180 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEA 201 D+ + + P P +A Sbjct: 181 DVFYANVIQKMFPCYCIYPHMTKQA 205 >gi|288799713|ref|ZP_06405172.1| lipooligosaccharide biosynthesis protein LpsA [Prevotella sp. oral taxon 299 str. F0039] gi|288332961|gb|EFC71440.1| lipooligosaccharide biosynthesis protein LpsA [Prevotella sp. oral taxon 299 str. F0039] Length = 242 Score = 90.0 bits (222), Expect = 3e-16, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 79/228 (34%), Gaps = 20/228 (8%) Query: 15 ARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHI 74 R + ++++ FS+ + G+ + I K + C H+ Sbjct: 21 DREQHIKAMLQKMNIPFSWI--LDGDIADLTPEILDKYFAGDMHKAGATAS---CAYKHV 75 Query: 75 HLWKRIAYSPAIGAIILEDDADFSDE-----FSQLLPHLSKCDINNILIKFDALRKKPKK 129 +K I GA+ILEDD F++ L L+ + ++ ++ R + Sbjct: 76 LAYKEIIAKNLDGALILEDDIQLDPRLFIGIFNKSLTELNSEEQEPSIVSYEDTRLR--- 132 Query: 130 DSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PIDMD-MKHWWEH 186 G ++ P T Y+I K+ A ++N + P+D K E+ Sbjct: 133 FVPKSKRVGGKVLY-PGNRDRMTGAYYINKKGAELIINELEKQAMDIPVDWYNAKLLHEN 191 Query: 187 NIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPL-YFYRNTCYQW 233 + +P + + + V+ S L + ++ +W Sbjct: 192 KLKYYWCQPTIATQGSHLGI--FKSAITVKDNFLSTLKWRFKRFYKKW 237 >gi|326435094|gb|EGD80664.1| hypothetical protein PTSG_01254 [Salpingoeca sp. ATCC 50818] Length = 775 Score = 90.0 bits (222), Expect = 3e-16, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 72/208 (34%), Gaps = 20/208 (9%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + +PV VI+LP SH R +F A+ L F ++A G + +R+ R Sbjct: 567 LTMPVAVINLPSSHDRLARFAAVASAAGLPFFVYEATDGRDLSY-DRLLDRSLISDVVPR 625 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-CDINNILIK 119 L + +SH+ L +++ + II EDDA F+ L L + LI Sbjct: 626 E-RLGTVAQALSHLLLMEKLVKTTFEAGIIFEDDASFNGNLPLALSQLLAMAPEDWDLIS 684 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAA----IHLLNVRKNI 172 D C R + GY + ++ A L+ V I Sbjct: 685 LAC-------DPDTCAHGEKVSDWFFRPTPECEVGSVGYAVKRKTAMEMIQDLIPVSARI 737 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYE 200 D + N EP V + Sbjct: 738 S---DSFSSPLFLANYNVYCLEPRVVVQ 762 Score = 87.7 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 76/232 (32%), Gaps = 44/232 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKRL 61 +YVI+LP R +F + + + A G + R+F++ K + + Sbjct: 130 IYVINLPRRRDRLARFLNASPFETGDYHLVTAFDGRELTWDSHLQRLFANNKFKWR---- 185 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC--DINNILIK 119 +G +SH +W+ IA S +ILEDD F+ + ++ +I Sbjct: 186 --RGLVGQALSHYLIWRHIAESRDEFHLILEDDVVFTPGMVERWNSKMHHAFPLDAQVIF 243 Query: 120 FDALRK------------KPKKDSYLCTLPGNFDIHQP-------------RILSPRTTG 154 L + K + LP + H + + Sbjct: 244 LGGLLESNRDAYLSGDVVKRVSPYFCEHLPNTYFTHDFDPSIEPGKKDEPTKAFHYKPVA 303 Query: 155 YFIGKEAAIHLLNVRK--NIYRPIDMDMKHW---WEHNIPSLVTEPGAVYEA 201 Y + + A L+ K +P+ + + WEH T P E Sbjct: 304 YILSSKGARDLVAFVKQHGFVQPVGVMLMKLMRQWEH---VYATFPLIAAED 352 >gi|30061034|gb|AAP19891.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 89.6 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|170736608|ref|YP_001777868.1| glycosyl transferase family protein [Burkholderia cenocepacia MC0-3] gi|169818796|gb|ACA93378.1| glycosyl transferase family 25 [Burkholderia cenocepacia MC0-3] Length = 256 Score = 89.6 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 58/243 (23%), Positives = 81/243 (33%), Gaps = 44/243 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V VISL SHARR F RA HL F F DAI G + S + Sbjct: 20 VQVISLRRSHARRHAF--RANNPHLDFEFVDAIDGRALSDADLAASG---LFAPGLPYTR 74 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILI----K 119 G ++ LW IA S + I EDDA F +F L L + + Sbjct: 75 GAFGIAMTTHRLWTDIA-SDSELVTIAEDDAIFRPDFHDATLQFLRDNLGCHDFVAWGYN 133 Query: 120 FDALRK-----------KPKKDSYLCTLPGNFDIHQPRIL------SPRTTGYFIGKEAA 162 FD++ + ++ L G+F + +L Y I A Sbjct: 134 FDSILRGSIFNSRTPVTMRFDEAALTDAIGDFLNDRGPVLVMNLLEFYGICAYTISPAGA 193 Query: 163 IHLLNVRKNI------------YRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 LL+ + P ID+ M ++ + S T P S Sbjct: 194 RFLLDRCFPLQPEMLYSHGLGRTMPNCGIDVVMNKFY-GAMRSAATFPPLAVTTNARETS 252 Query: 208 TIE 210 TI+ Sbjct: 253 TIQ 255 >gi|308814033|ref|XP_003084322.1| unnamed protein product [Ostreococcus tauri] gi|116056206|emb|CAL58387.1| unnamed protein product [Ostreococcus tauri] Length = 616 Score = 89.6 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 81/225 (36%), Gaps = 24/225 (10%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFS----FFDAIYGENNPICNRIFS------- 50 P+ +Y I+L S R + +F A+ + + +FS Sbjct: 46 PLAIYWINLQSSADRHHLIKSHLDALAAKFEIRSERIVAVTEQEAVQNSDVFSGWFRFTK 105 Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLS 109 H + + + S E+ C SH+ +R + AII+EDD ++F ++ Sbjct: 106 HVSYEHHTQNIYSFKELACLQSHLRALERAFVNGDDIAIIMEDDVLLREDFMHNFNRYVQ 165 Query: 110 KCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR 169 + + +++ L +K + + + P S T Y I ++ +L Sbjct: 166 RAPSDWNILQLYTLNEKVIRY-FRKLRQTYYIKWFPEHWS--TGAYIINRQGLSKVLRAA 222 Query: 170 KNIYRPIDMDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESR 213 + + +P + LV + ++ +T ST E R Sbjct: 223 REMLKPS-------QRSGMDRVLVADEF-LFIHAETYTSTFEAVR 259 >gi|254249297|ref|ZP_04942617.1| Glycosyl transferase [Burkholderia cenocepacia PC184] gi|124875798|gb|EAY65788.1| Glycosyl transferase [Burkholderia cenocepacia PC184] Length = 327 Score = 89.6 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 58/243 (23%), Positives = 81/243 (33%), Gaps = 44/243 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V VISL SHARR F RA HL F F DAI G + S + Sbjct: 91 VQVISLRRSHARRHAF--RANNPHLDFEFVDAIDGRALSDADLAASG---LFAPGLPYTR 145 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDINNILI----K 119 G ++ LW IA S + I EDDA F +F L L + + Sbjct: 146 GAFGIAMTTHRLWTDIA-SDSELVTIAEDDAIFRPDFHDATLQFLRDNLGCHDFVAWGYN 204 Query: 120 FDALRK-----------KPKKDSYLCTLPGNFDIHQPRIL------SPRTTGYFIGKEAA 162 FD++ + ++ L G+F + +L Y I A Sbjct: 205 FDSILRGSIFNSRTPVTMRFDEAALTDAIGDFLNDRGPVLVMNLLEFYGICAYTISPAGA 264 Query: 163 IHLLNVRKNI------------YRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 LL+ + P ID+ M ++ + S T P S Sbjct: 265 RFLLDRCFPLQPEMLYSHGLGRTMPNCGIDVVMNKFY-GAMRSAATFPPLAVTTNARETS 323 Query: 208 TIE 210 TI+ Sbjct: 324 TIQ 326 >gi|30060992|gb|AAP19870.1| glycosyl transferase [Haemophilus influenzae] gi|30060994|gb|AAP19871.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 89.6 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 39/121 (32%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHLGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|308800464|ref|XP_003075013.1| Lysyl hydrolase/glycosyltransferase family 25 (ISS) [Ostreococcus tauri] gi|116061566|emb|CAL52284.1| Lysyl hydrolase/glycosyltransferase family 25 (ISS) [Ostreococcus tauri] Length = 464 Score = 89.2 bits (220), Expect = 5e-16, Method: Composition-based stats. Identities = 50/230 (21%), Positives = 80/230 (34%), Gaps = 46/230 (20%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENN------------------ 42 M P+ L R + + +S+ DA+ G + Sbjct: 236 MNCPL----LARLTRRSLESARSLDEQGITWSYHDAVDGLDELDVKSLNKYAGRKKRTRV 291 Query: 43 ---------PICNRIFSHQKRQCQFKRLLSLPE---IGCYISHIHLWKRIAYSPAIGAII 90 + R + KR R SL E GCY+SH+ LW+R+ + A AI+ Sbjct: 292 DATRHIPRSTLVQRAYDDTKRPSHAMRQ-SLHERLRFGCYLSHVSLWRRVIRAEAPLAIV 350 Query: 91 LEDDADFSDEFSQLL-PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS 149 LEDDA S +F++ L L K I+ ++ + KK P D + Sbjct: 351 LEDDAVVSSKFARALVSRLDKLPIDWDVLFLNGCFKK--------FGPALADGIRQSRGG 402 Query: 150 PRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDMK-HWWEHNIPSLVTEPGA 197 T Y I + A L+ +PID + + + +P Sbjct: 403 LCTYSYVISAKGAFTLMRRATLRSEKPIDHVLDGEILAGRLLAFHADPPL 452 >gi|30061026|gb|AAP19887.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 89.2 bits (220), Expect = 5e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|261838139|gb|ACX97905.1| LPS biosynthesis protein [Helicobacter pylori 51] Length = 436 Score = 89.2 bits (220), Expect = 6e-16, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 48/131 (36%), Gaps = 17/131 (12%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---------CNR 47 I VY+ISL S R + + + F FDAI ++ Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 48 IFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 + L+LPE GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSYVGDGLTLPEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 119 Query: 108 LSKCDINNILI 118 K + + + Sbjct: 120 CLKSPFDFVRL 130 Score = 68.4 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 54/140 (38%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY++ + A + + P+DM Sbjct: 297 ETFWERRADSEKEAFFLEHFYLTSVYVAS--TAGYYLTPKGAKTFIEATERFKIIEPVDM 354 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFYRNTCY 231 M + H++ + P V +STI+ ++ + KP L++++ Sbjct: 355 FMNNPTYHDVANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYFDNLFYHKFNAK 414 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ +RK P+ T K Sbjct: 415 KCLRAFHKYRKQYAPLKTPK 434 >gi|261250530|ref|ZP_05943105.1| 3-deoxy-D-manno-octulosonic-acid transferase [Vibrio orientalis CIP 102891] gi|260939099|gb|EEX95086.1| 3-deoxy-D-manno-octulosonic-acid transferase [Vibrio orientalis CIP 102891] Length = 252 Score = 89.2 bits (220), Expect = 6e-16, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 74/217 (34%), Gaps = 20/217 (9%) Query: 5 VYVISL-PFSHARREKFCHRAARIHLQFSFFDAI-YGENNPICNRIFSHQKRQCQFKRLL 62 V+VI + RRE L F+ + G+ + + + R Sbjct: 8 VFVIHVSEGYDERREHIDKHLPESGL--ERFEYMLDGDIKDLTEEVLENLFR----GSPQ 61 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILIKF 120 S E C+ H +K I A++LEDDA + +FS+ L + + +N L+ Sbjct: 62 SPAEQSCFYKHYLTYKAIVEREVPFALVLEDDAYLTYDFSKKLEKIIEELKGKSNYLVNI 121 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPIDM 178 ++ C + + T GY I AA L + + I+ P+D+ Sbjct: 122 ESAYLSVPFKFRKCNQHVYLANY-----TKMTGGYIIDYAAAKKLYDYLQTHPIHLPVDV 176 Query: 179 DM-KHWWEHNIPSLVTEPGAVYEAIDTND--STIEES 212 + ++P V + S I Sbjct: 177 YQSEMRDVIGFNIYWSQPALVQQGSKNGTFASAINAE 213 >gi|307825450|ref|ZP_07655669.1| glycosyl transferase family 25 [Methylobacter tundripaludum SV96] gi|307733625|gb|EFO04483.1| glycosyl transferase family 25 [Methylobacter tundripaludum SV96] Length = 241 Score = 88.8 bits (219), Expect = 6e-16, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 81/241 (33%), Gaps = 16/241 (6%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVI++ H RR + R + F + G+ + + I + + LS Sbjct: 12 YVINVKAFHERRAFMEAQLLRKDMTAEFV--LDGDADELTAEIIN----KYFIGDNLSPN 65 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 ++ C + HI + I + A++LEDDA FS F++ L + + K + Sbjct: 66 QMSCALKHIMTLESIVANKQQQALVLEDDAIFSPRFNEGLHYALEESPRFDGPKVIFIGC 125 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDMKHW 183 + ++ T Y I A L+ R + PID + Sbjct: 126 GGNFYTPKSLRKPGQHLYLGSR-GRFTDSYIIDCNTAQKRLDWISRNKVSLPIDNQFEKI 184 Query: 184 -WEHNIPSLVTEPGAVYEA--IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 E I L E V + S +E+ P + + + Y W Sbjct: 185 DRELGISMLWLEEPVVEQGSKAGLFMSAVEK----APPVWLQRLLFNWEKLKRKHIYQLW 240 Query: 241 R 241 R Sbjct: 241 R 241 >gi|148697003|gb|EDL28950.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Mus musculus] Length = 478 Score = 88.8 bits (219), Expect = 6e-16, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 50/123 (40%), Gaps = 9/123 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIF-----SHQKRQCQFK 59 V++I+L RRE+ + +A+ G+ R Sbjct: 348 VFMINLKRRRDRRERMLRALHEQEIDCQLVEAVDGKAMNTSQVEAMGIQMLPGYRDPYHG 407 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINN 115 R L+ E+GC++SH ++WK + +++ EDD F F + L +L + ++ Sbjct: 408 RPLTKGELGCFLSHYNIWKEVVDRGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEREGLDW 467 Query: 116 ILI 118 LI Sbjct: 468 DLI 470 >gi|322380207|ref|ZP_08054438.1| putative lipopolysaccharide biosynthesis protein [Helicobacter suis HS5] gi|321147373|gb|EFX42042.1| putative lipopolysaccharide biosynthesis protein [Helicobacter suis HS5] Length = 289 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 84/262 (32%), Gaps = 39/262 (14%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFD-----------AIYGENN------- 42 +PIP+ +I L + R+ ++ FD AI+G + Sbjct: 9 LPIPICIIHLLKTPQERD-ITPLLWHLNYLLKRFDQRDIFSIAIFKAIHGHQDFLQQGIT 67 Query: 43 -PICNRIFSHQKRQCQFKRL---------------LSLPEIGCYISHIHLWKRIAYSPAI 86 + + + + LS ++G + SH LW+ Sbjct: 68 FEHTHPVLNDFSPHLPSFKECLQLASLALKTPLDFLSFKQLGRFASHYLLWQECVRLK-R 126 Query: 87 GAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPR 146 IILEDD +++F HLS ++ + L K+ ++ NFD H Sbjct: 127 PMIILEDDVLPTNDFFGKC-HLSLEALHADKAQIIRLLVHSKRRCAKTSINYNFD-HLFS 184 Query: 147 ILSPRTTGYFIGKEAAIHLLNVRKN-IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTN 205 GY + A LL +D + ++ H+ + + + + Sbjct: 185 PQGLGAHGYMLNPNGAHKLLKACPPLWTLKVDTYIDSYYNHHALTFTLKSAILLINPLDS 244 Query: 206 DSTIEESRLVRKPTFSPLYFYR 227 +S + + LY R Sbjct: 245 ESPPTSTDYFKGKLKWLLYLLR 266 >gi|299768796|ref|YP_003730822.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter sp. DR1] gi|298698884|gb|ADI89449.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter sp. DR1] Length = 251 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 81/226 (35%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPFSH-ARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ ++ R +K + + +F F + G+N + Q+ K Sbjct: 1 MKKYLISIESANSERLKKLYSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L H+S ++ Sbjct: 57 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKHISSLNLGPCF 116 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 117 FLSLGGIQMKICNRVRGKFLPEELYNQKILKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 176 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D + V + E D S +E+ R Sbjct: 177 QPIKI-YDHWQDLVGRGDFSFYVSFLFDHPIIENNDAL-SYLEKER 220 >gi|325123424|gb|ADY82947.1| hypothetical protein BDGL_002361 [Acinetobacter calcoaceticus PHEA-2] Length = 254 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPFSH-ARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ ++ R +K + + +F F + G+N + Q+ K Sbjct: 4 MKKYLISIESANSERLKKLYSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L ++ ++ Sbjct: 60 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEQYIHSLNLGPCF 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 120 FLSLGGIQMKICNRVRGKFLPEELYNQKVLKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D + V + E D S +E+ R Sbjct: 180 QPIKI-YDHWQDLVGRGDFSFYVSFLFDHPIIENNDAL-SYLEKER 223 >gi|30061006|gb|AAP19877.1| glycosyl transferase [Haemophilus influenzae] gi|30061008|gb|AAP19878.1| glycosyl transferase [Haemophilus influenzae] gi|30061020|gb|AAP19884.1| glycosyl transferase [Haemophilus influenzae] gi|30061024|gb|AAP19886.1| glycosyl transferase [Haemophilus influenzae] gi|30061028|gb|AAP19888.1| glycosyl transferase [Haemophilus influenzae] gi|30061032|gb|AAP19890.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|30060990|gb|AAP19869.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|30060988|gb|AAP19868.1| glycosyl transferase [Haemophilus influenzae] gi|30061000|gb|AAP19874.1| glycosyl transferase [Haemophilus influenzae] gi|30061002|gb|AAP19875.1| glycosyl transferase [Haemophilus influenzae] gi|30061004|gb|AAP19876.1| glycosyl transferase [Haemophilus influenzae] gi|30061010|gb|AAP19879.1| glycosyl transferase [Haemophilus influenzae] gi|30061014|gb|AAP19881.1| glycosyl transferase [Haemophilus influenzae] gi|30061030|gb|AAP19889.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.8 bits (219), Expect = 7e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|50083839|ref|YP_045349.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter sp. ADP1] gi|49529815|emb|CAG67527.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter sp. ADP1] Length = 252 Score = 88.8 bits (219), Expect = 8e-16, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 91/260 (35%), Gaps = 34/260 (13%) Query: 3 IPVYVISLP-FSHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + +YVIS+ R ++F + + ++++ + G + + F + + Sbjct: 1 MQIYVISIEENDSIRLKQFFSQPFFSSLNVRPKRM-GVKGADL-LAKEYFEQAVKGREI- 57 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 L+ E+GC +SH+ K + A+ILEDDA + F L++ D I ++ Sbjct: 58 -PLTPGELGCTLSHLQALKDFLSTEDSHALILEDDAIIPEYF-----DLNQLDQEIIKLR 111 Query: 120 F------------DALRKKPKKDSYLCTLPGNFDIHQPRILSPR---TTGYFIGKEAAIH 164 +K + Y L + + R Y I ++ A Sbjct: 112 LPPKSLLSLGGIQMKESRKVRGKPYSAMLLKHKILEVIPDYYHRVNYAMAYVIDRDMAQL 171 Query: 165 LLNVRKNIYRPIDMDMKHWWEHNIPSLVT----EPGAVYEAIDTNDSTIEESRLVRKPTF 220 LL + R D + ++ + + P + S IE R+ ++ Sbjct: 172 LLQYHHPLRRADDWSCLYDFDATVHLYMVHLIDHPVIQKGETNQALSAIEAERVDQQGLR 231 Query: 221 SPLY---FYRNTCYQWNLHY 237 Y RN ++ Y Sbjct: 232 KSKYGNSLRRNLAKLFSERY 251 >gi|30060996|gb|AAP19872.1| glycosyl transferase [Haemophilus influenzae] gi|30060998|gb|AAP19873.1| glycosyl transferase [Haemophilus influenzae] gi|30061016|gb|AAP19882.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.5 bits (218), Expect = 8e-16, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLRIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|154346100|ref|XP_001568987.1| glycosyl transferase [Leishmania braziliensis MHOM/BR/75/M2904] gi|134066329|emb|CAM44120.1| glycosyl transferase-like protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 285 Score = 88.5 bits (218), Expect = 9e-16, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 72/249 (28%), Gaps = 60/249 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQF---SFFDAIYGEN------------NPICNRIF 49 ++VI+L R AR L AI G + + R Sbjct: 21 IFVINLDRRPDRWVAIQAVCARAGLPVERTERVPAIEGSLVDVNVAYRCGLVSALGLRRL 80 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIG---------------AIILEDD 94 + L+ +GC +SHIHLW RIA + ++LEDD Sbjct: 81 KEPSEHHVWGMDLNKAALGCALSHIHLWARIAALGKVSNVSAEASAVVLPKQCFLVLEDD 140 Query: 95 ADFSDE----------------FSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + +D QL +S + L+ + + ++ ++ G Sbjct: 141 STLADAKGSDSESPAASPSPPFLDQLQRRMSSVPPDWELV-YVSGLDTARQCPHMQVAEG 199 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM--------KHWWEHNIP- 189 + T Y + + A LL + +D M + Sbjct: 200 ---VAHVPQYHRTTNAYLVTPQGARRLLATCVPLTFQLDTVMTMNVGYPPGMVGVAGVKT 256 Query: 190 -SLVTEPGA 197 V +P Sbjct: 257 LPYVLDPVC 265 >gi|30061022|gb|AAP19885.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.5 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENTEVFLNQNEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P N + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|303285286|ref|XP_003061933.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] gi|226456344|gb|EEH53645.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] Length = 354 Score = 88.1 bits (217), Expect = 1e-15, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 70/235 (29%), Gaps = 33/235 (14%) Query: 2 PIPVYVISLPFS---HARREKFCHRAARIHLQFSFFDAIYGENNPI-----CNRIFSHQK 53 P+ ++V+S+ AR + + + F A+ P R ++ Sbjct: 36 PVKIFVVSVLDDPVGQARLARLRSSLS--SVAFEVVPAVDVRKTPERELRDIRRRVDGRR 93 Query: 54 RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-FSQLLPHLSKCD 112 + +R +SL E+GC +SH+ ++ + S AI LEDDA + Sbjct: 94 FEEINRRRVSLAEVGCSLSHLRAYETMLESNHDAAIFLEDDAIPMNGALDDFTAWYETLP 153 Query: 113 INNILIKFDALRK-----------------KPKKDSYLCTLPGNFDIHQPRILSP----R 151 + +++ D + KP S Sbjct: 154 KDWAVVRLDHRARHWACHSPGAGEACENALKPVVVDPSSPRAHARSFSYVNDTSMSQIYG 213 Query: 152 TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND 206 + Y +E H L + D ++ L+ P + D Sbjct: 214 SYAYLASRETVEHWLWHAYPVAVTSDRLIQRTPPGK-KQLLVHPPPFGHDDEQKD 267 >gi|30061012|gb|AAP19880.1| glycosyl transferase [Haemophilus influenzae] gi|30061018|gb|AAP19883.1| glycosyl transferase [Haemophilus influenzae] gi|30061036|gb|AAP19892.1| glycosyl transferase [Haemophilus influenzae] Length = 129 Score = 88.1 bits (217), Expect = 1e-15, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 39/121 (32%), Gaps = 8/121 (6%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL----- 117 + E GC ISH LW + I EDD + L Sbjct: 1 TKGEKGCLISHFLLWNKCVNENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFI 60 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 I+ + + K + P + T GY I + AA +++ KNI P Sbjct: 61 IRLETFLQPVKLEKQTKIPPFYSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNI--PS 118 Query: 177 D 177 D Sbjct: 119 D 119 >gi|260557125|ref|ZP_05829341.1| glycosyltransferase family 25 family protein [Acinetobacter baumannii ATCC 19606] gi|260409231|gb|EEX02533.1| glycosyltransferase family 25 family protein [Acinetobacter baumannii ATCC 19606] Length = 254 Score = 87.7 bits (216), Expect = 1e-15, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 77/226 (34%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 4 MKKYLISIESTNSERLKKLFSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L H+ + Sbjct: 60 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKHIRSLNFGPCF 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y I ++ A L++ Sbjct: 120 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVIDRKMAQMLVDYH 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 180 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 223 >gi|148707509|gb|EDL39456.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Mus musculus] Length = 469 Score = 87.7 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 53/123 (43%), Gaps = 9/123 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 343 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKAWNIEMLPGYRDPYSS 402 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINN 115 R L+ EIGC++SH +WK + +++EDD F +F + L + K ++ Sbjct: 403 RPLTRGEIGCFLSHFSVWKEVIDRELEKTLVIEDDVRFEHQFKRKLMKLMEDIDKAQLDW 462 Query: 116 ILI 118 LI Sbjct: 463 ELI 465 >gi|169632439|ref|YP_001706175.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter baumannii SDF] gi|169151231|emb|CAO99916.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter baumannii] Length = 254 Score = 87.3 bits (215), Expect = 2e-15, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 76/226 (33%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R + + + +F F + G+N + Q+ K Sbjct: 4 MKKYLISIESTNSERLKNLFSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L H+ + Sbjct: 60 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKHIRSLNFGPCF 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y I ++ A L++ Sbjct: 120 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVIDRKMAQMLVDYH 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 180 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 223 >gi|149209425|ref|XP_001522087.1| hypothetical protein MGCH7_ch7g204 [Magnaporthe oryzae 70-15] gi|86196159|gb|EAQ70797.1| hypothetical protein MGCH7_ch7g204 [Magnaporthe oryzae 70-15] Length = 393 Score = 87.3 bits (215), Expect = 2e-15, Method: Composition-based stats. Identities = 47/314 (14%), Positives = 94/314 (29%), Gaps = 69/314 (21%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +++ +LP H R + + L+ + F A+ G + L Sbjct: 89 IFL-NLPHRHDRYDAMAIQVHLSKLKITRFPAVDGSKLTSQG-----MPPMENDQFNLKD 142 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS----KCDINNILIKF 120 E GC+ +H ++W+ + A++ E DA + F ++ L+ K Sbjct: 143 GEKGCFRAHANIWQHMLEKDMAAALVFESDAGWDINFRPIMGRLNRGFRKLLQEEYPDNR 202 Query: 121 DALRKKP-------------------------------------------KKDSYLCTLP 137 D + P + D L Sbjct: 203 DVFEENPDDPWLARSGTWDILSIGHCVDARNGGKYVVYDDPHAPNTNYQVEVDGGQAVLN 262 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMK--------HWWEHNI 188 +++ + L TTGY + A +L P+D+ M + + Sbjct: 263 HKRIVYRAQTLFC-TTGYLVSLSGAARMLVRMNWKFDAPVDVIMNEMIGNGQLRAYSMHQ 321 Query: 189 PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVS 248 +V T++S I+ KP + + +++ +R + Sbjct: 322 RPVVQWTYIPGVGKGTSNSDIQGD---GKPEQGTPEGWDRARKERSVY--GYRDGYSDAA 376 Query: 249 TTKF-LPSSSSSLI 261 F LPS+ +I Sbjct: 377 IVNFALPSAWHEVI 390 >gi|294669018|ref|ZP_06734104.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291309010|gb|EFE50253.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtE [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 204 Score = 86.9 bits (214), Expect = 3e-15, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 62/201 (30%), Gaps = 22/201 (10%) Query: 42 NPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF 101 + NR + LLS E C++SH+ LWK+ + EDD + Sbjct: 3 SEELNRAMEELVPGLAKQHLLSEVEKACFMSHVVLWKQALDEGLPYVAVFEDDVLLGENA 62 Query: 102 SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL------SPRTTGY 155 + L + + + + + + T G + Q R+ T GY Sbjct: 63 EKFLAEDAWLEERFA-VGTPFIVRLETMFMPIKTETGGIKVCQERVFDLLCSEHWGTAGY 121 Query: 156 FIGKEAAIHLLNVRK----NIYRPIDMDM--KHWWEHNIPSLVTEPGAVYEAID------ 203 + EA L RPID M + + +P P + + Sbjct: 122 IVSCEAMQFFLERFACLPTEKIRPIDWMMFSSSFDKEGMPVYQLNPAVCVQELHYAKFHN 181 Query: 204 ---TNDSTIEESRLVRKPTFS 221 T S +E R K Sbjct: 182 QDSTLGSLLESERCEPKKRMK 202 >gi|255090032|ref|XP_002506937.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] gi|226522211|gb|ACO68195.1| glycosyltransferase family 25 protein [Micromonas sp. RCC299] Length = 290 Score = 86.2 bits (212), Expect = 4e-15, Method: Composition-based stats. Identities = 38/144 (26%), Positives = 58/144 (40%), Gaps = 21/144 (14%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V+VI+L + ARR + H + + F A+ G P+ + RL Sbjct: 106 LNVFVITLQRTPARRAQLIHNLTQAGVAHKIFFAVDG-LLPLNGDDILKYAGLRKRSRLT 164 Query: 63 SL---------PEI----------GCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 SL PE+ GCYISH+ LW+ + S A+ILEDD + F Sbjct: 165 SLDVLRAHSPSPELDLMIHERLRFGCYISHVRLWEHLVNSELPFAVILEDDVIILERFDY 224 Query: 104 LLPH-LSKCDINNILIKFDALRKK 126 L H + N + ++ K Sbjct: 225 SLRHLIRNLPDNWDIFYLNSCFSK 248 >gi|169794762|ref|YP_001712555.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter baumannii AYE] gi|169147689|emb|CAM85552.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter baumannii AYE] Length = 254 Score = 86.2 bits (212), Expect = 5e-15, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 77/226 (34%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 4 MKKYLISIESTNSERLKKLYSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L H+ + Sbjct: 60 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKHIRSLNFGPCF 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 120 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 180 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 223 >gi|126724378|ref|ZP_01740221.1| aspartate-semialdehyde dehydrogenase [Rhodobacterales bacterium HTCC2150] gi|126705542|gb|EBA04632.1| aspartate-semialdehyde dehydrogenase [Rhodobacterales bacterium HTCC2150] Length = 254 Score = 85.8 bits (211), Expect = 5e-15, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 79/240 (32%), Gaps = 16/240 (6%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V VI+ + + + L + G + K ++ R++ Sbjct: 1 MKVLVINQDADADKLALQSRQMDALGLNWERI----GAATLDTITPPTDHKFWRRWHRIM 56 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 + E + SH W+R+ +S G II ED + + L +++ + + Sbjct: 57 TSSEKVQFASHFSAWQRVLHSGQPGLII-EDGILLASGIHEFLNQIAEMTVPQHI----T 111 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN-VRKNIYRPIDMDMK 181 ++ K TL + + + + Y + A +++ K + PID + Sbjct: 112 IQASDDKKLVSKTLDADVPMRRIYQDYTGSAAYVLFPLGASKMISRAAKVVVAPIDAAIS 171 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWR 241 ++ S +P D++ ++R T R ++ + Sbjct: 172 D--ARDLVSFQADPALATHMDDSDT----KNRFALPETVKVNSNNRLVFRCRRINAQLAK 225 >gi|215482306|ref|YP_002324488.1| Glycosyltransferase family 25 (LPS biosynthesis protein) family protein [Acinetobacter baumannii AB307-0294] gi|301510440|ref|ZP_07235677.1| Glycosyltransferase family 25 (LPS biosynthesis protein) family protein [Acinetobacter baumannii AB058] gi|213987973|gb|ACJ58272.1| Glycosyltransferase family 25 (LPS biosynthesis protein) family protein [Acinetobacter baumannii AB307-0294] Length = 251 Score = 85.8 bits (211), Expect = 5e-15, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 77/226 (34%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 1 MKKYLISIESTNSERLKKLYSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L H+ + Sbjct: 57 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKHIRSLNFGPCF 116 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 117 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 176 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 177 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 220 >gi|317010839|gb|ADU84586.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SouthAfrica7] Length = 200 Score = 85.4 bits (210), Expect = 8e-15, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 47/130 (36%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGE--NNPICNRIFSHQKRQC 56 VY+ISL S R + + + F FDAI + + + F Sbjct: 4 VYIISLKESKRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLKEFYDAPSLL 62 Query: 57 QF--------KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + + L E+GCY+SH LWK IILEDD F Q L Sbjct: 63 KSDWFHSDYCYKELLPQELGCYLSHYLLWKECVKLNQP-VIILEDDVTLESNFIQALEDC 121 Query: 109 SKCDINNILI 118 K + + + Sbjct: 122 LKSPFDFVRL 131 >gi|323519405|gb|ADX93786.1| putative glycosyltransferase involved in LPS biosynthesis [Acinetobacter baumannii TCDC-AB0715] Length = 254 Score = 85.4 bits (210), Expect = 8e-15, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 76/226 (33%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 4 MKKYLISIESTNSERLKKLFSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 59 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L + + Sbjct: 60 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKLIRSLNFGPCF 119 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 120 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 179 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 180 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 223 >gi|184159468|ref|YP_001847807.1| putative glycosyltransferase [Acinetobacter baumannii ACICU] gi|332876309|ref|ZP_08444084.1| LPS glycosyltransferase [Acinetobacter baumannii 6014059] gi|183211062|gb|ACC58460.1| putative glycosyltransferase [Acinetobacter baumannii ACICU] gi|322509381|gb|ADX04835.1| putative glycosyltransferase [Acinetobacter baumannii 1656-2] gi|332735581|gb|EGJ66633.1| LPS glycosyltransferase [Acinetobacter baumannii 6014059] Length = 251 Score = 85.0 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 76/226 (33%), Gaps = 21/226 (9%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 1 MKKYLISIESTNSERLKKLFSQATFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDI-NNI 116 + ++ E+GC +SHI K S AII EDD F +F L + + Sbjct: 57 KPMTPGELGCAMSHIAALKDFLSSDEEYAIIFEDDVIERFEIDFQDLEKLIRSLNFGPCF 116 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS-------PRTTGYFIGKEAAIHLLNVR 169 + ++ K LP + + Y + ++ A L++ Sbjct: 117 FLSLGGIQMKICNRVRGKLLPEELYNQKILKVDYDYLEKLAYAYAYVVDRKMAQMLVDYH 176 Query: 170 KNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESR 213 + I D V + E D S +E+ R Sbjct: 177 QPIKI-YDHWQDLVGRGEFSFYVSFLFDHPIIENNDAL-SYLEKER 220 >gi|76156078|gb|AAX27314.2| SJCHGC08517 protein [Schistosoma japonicum] Length = 170 Score = 85.0 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 45/169 (26%), Positives = 67/169 (39%), Gaps = 28/169 (16%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK 125 EIGC++SH ++W + + +ILEDD F+ F + L + K N+ I +D L Sbjct: 1 EIGCFLSHYNIWNEMITNGYNRILILEDDLRFAPAFIRNLNKIIKEADENV-INWDLLYI 59 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--------YRPI- 176 K+ S N + T GY + + AI L+N K + Y PI Sbjct: 60 GRKRMSKSEKRVPNTTSLAYPDYTYWTLGYVLNRSGAIKLINQ-KPLQKLIAVDEYLPIM 118 Query: 177 -------DMDMKHWWEHNIPSLVTEPGAVYE---------AIDTNDSTI 209 D +K ++ +L EP V DT DS I Sbjct: 119 FNKHPRKDWLIKFE-PRDLIALSAEPLLVEPQRYTGEELYISDTEDSDI 166 >gi|109947390|ref|YP_664618.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] gi|109714611|emb|CAJ99619.1| putative lipopolysaccharide biosynthesis protein [Helicobacter acinonychis str. Sheeba] Length = 349 Score = 85.0 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 57/297 (19%), Positives = 87/297 (29%), Gaps = 80/297 (26%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F F+AI ++ + ++ Sbjct: 9 ISVYIISLKESKRRLDTEKLVLGSNEKFKGRCV-FQIFEAISPKHQDFEKLVQELYDAPS 67 Query: 54 ------RQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP- 106 L+LPE+GCY+SH LWK A IILEDD F Q L Sbjct: 68 LLQSDWFHSDISDGLTLPELGCYLSHYLLWKECARLNQP-VIILEDDVALESNFIQALED 126 Query: 107 ---------------------HLSKCDINNILIKFDALRKKP----KKDSYLCTLPGNFD 141 +L+ + FD K K L F Sbjct: 127 CLKSPFDFVRLYGCYWYYHDSNLNVLPKEFVFAPFDYSLKSRSFLGKCKEILSISIDKFS 186 Query: 142 IHQPRILSPRTTG-----------------------YFIG---KEAAIHLL--NVRKNIY 173 G Y A + L K Sbjct: 187 NLSTYKCIHFALGFLKKMQRSHYATHEKEALFLEHFYLTSVYVASTAGYYLTPKGAKTFT 246 Query: 174 RPI---------DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS 221 + D+ M + HN+ +L P A+ +++STI++ + +K Sbjct: 247 EAVNRFKIIEPVDIFMNNSSYHNMANLTYTPCALSLNEHSSNSTIQKPKDCKKKERE 303 >gi|208434539|ref|YP_002266205.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] gi|208432468|gb|ACI27339.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] Length = 186 Score = 84.6 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 >gi|308184404|ref|YP_003928537.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] gi|308060324|gb|ADO02220.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] Length = 201 Score = 84.6 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 36/132 (27%), Positives = 51/132 (38%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARRE------KFCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 IPVY+ISL S R + K + ++ F FDAI ++ ++ Q Sbjct: 18 IPVYIISLKESQRRLDTEKLVLKSNEKFKGRYV-FQIFDAISPKHQDFEKFVQELYDAQS 76 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK +ILEDDA F Q L Sbjct: 77 MLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDAMLESNFMQALE 135 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 136 DCLKSPFDFVRL 147 >gi|322829107|gb|EFZ32648.1| glycosyl transferase-like protein, putative [Trypanosoma cruzi] Length = 184 Score = 84.6 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 45/114 (39%), Gaps = 15/114 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNRI------------F 49 VYVI+L R + + R ++ F A+ G I Sbjct: 54 VYVINLNRRPDRWKFILRQLQRAGFREGEYERFTAVDGPAVDIQKAQACGLVSLLGILRL 113 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 ++ + + L+ +GC +SH+ LW IA +++EDD+ F +F + Sbjct: 114 REEESRRIWGMDLNPAAVGCALSHVLLWATIAARRYHRVLVVEDDSLFPHDFQK 167 >gi|317179016|dbj|BAJ56804.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F30] Length = 452 Score = 84.6 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 47/132 (35%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ + Q Sbjct: 2 IQVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCCGELLPQEFGCYLSHYLLWKECVKTNQP-IVILEDDVALEPNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSLFDFVRL 131 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 52/140 (37%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 313 ETFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 370 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFYRNTCY 231 + + H I + P V +STI+ ++ + KP L++++ Sbjct: 371 FINNPTYHGIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYFDNLFYHQLNTR 430 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + K P+ T K Sbjct: 431 KCLRAFHKYSKQYAPLKTPK 450 >gi|157367403|gb|ABV45558.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 437 Score = 84.2 bits (207), Expect = 1e-14, Method: Composition-based stats. Identities = 36/132 (27%), Positives = 47/132 (35%), Gaps = 16/132 (12%) Query: 3 IPVYVISLPFSHARR--EKFCHRAARIHLQ---FSFFDAIYGENNPICNRIFSHQKRQCQ 57 I VY+ISL S R EK + F FDAI ++ + Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKDRCVFQIFDAISPKHEDFEKFVQELYDAQSM 61 Query: 58 FK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 K L E GCY+SH LWK + +ILEDD F Q L Sbjct: 62 LKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALED 120 Query: 108 LSKCDINNILIK 119 K + + + Sbjct: 121 CLKSPFDFVRLF 132 Score = 58.4 bits (140), Expect = 9e-07, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 45/124 (36%), Gaps = 7/124 (5%) Query: 91 LEDDAD--FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL 148 L DD + ++++ L IK + ++ +F + + Sbjct: 267 LTDDLNKNIAEKYDDLTTKYESLLAKEANIK-ETFWERRADSEKEAFFLEHFYLTSVYVA 325 Query: 149 SPRTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTND 206 S T GY+I + A + + P+DM + + H++ +L P V Sbjct: 326 S--TAGYYITPKGAKTFIEATERFKIIEPVDMFINNPTYHDVATLTYLPLPVSLNKHCKI 383 Query: 207 STIE 210 STI+ Sbjct: 384 STIQ 387 >gi|157367401|gb|ABV45557.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 425 Score = 84.2 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 33/133 (24%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCRGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 67/168 (39%), Gaps = 17/168 (10%) Query: 93 DDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 DD +D+ + L ++N +++ K+ +L +F + + + T Sbjct: 264 DD--LTDDLTTKYESLLAKEVNIKETFWESRADSEKEALFL----EHFYLTSVYVAT--T 315 Query: 153 TGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 GY++ + A + + P+DM M + HN+ + P + +STI+ Sbjct: 316 AGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHNVANFTYLPCPISLNKHAFNSTIQ 375 Query: 211 ESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 ++ + KP L++++ + ++ + K P+ T K Sbjct: 376 NAKKPDISLKPPRKSYFDNLFYHKFNARKCLKAFHKYSKQYAPLKTPK 423 >gi|157367399|gb|ABV45556.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 442 Score = 84.2 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 35/133 (26%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ + Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK S +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKSNQP-VVILEDDVTLESHFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 Score = 56.9 bits (136), Expect = 3e-06, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 4/92 (4%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY+I + A + + P+DM Sbjct: 303 ETFWERRADSEEEAFFLEHFYLTSVYVAS--TAGYYITPKGAKTFIEATERFKIIEPVDM 360 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 + + H+I +L P V STI+ Sbjct: 361 FINNPTYHDIATLTYLPLPVSLNKHCKISTIQ 392 >gi|157367406|gb|ABV45559.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 435 Score = 84.2 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 49/132 (37%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ + ++ Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDRSS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDDA F Q L Sbjct: 61 LLKSDWFHSDYCYKELLPQEFGCYLSHYFLWKECVKTNQP-IVILEDDAMLESHFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 55/140 (39%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ + +F + + + T GY++ + A + + P+DM Sbjct: 296 ETFWERRADNEKEALFLEHFYLTSVYVAT--TAGYYLTPKGAKTFIEATERFKIIEPVDM 353 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFYRNTCY 231 M + H++ +L P V +STI+ ++ + KP L++++ Sbjct: 354 FMNNPTYHDMANLTYLPCPVSLNKHAFNSTIQNAKKPDISLKPPRKSYFDNLFYHKFNAK 413 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + + P+ T K Sbjct: 414 KCLKAFHKYSRRYAPLKTPK 433 >gi|145610440|ref|XP_001410325.1| hypothetical protein MGG_12686 [Magnaporthe oryzae 70-15] gi|145017723|gb|EDK02086.1| hypothetical protein MGG_12686 [Magnaporthe oryzae 70-15] Length = 393 Score = 84.2 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 93/314 (29%), Gaps = 69/314 (21%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +++ +LP H R + + L+ + F A+ G + L Sbjct: 89 IFL-NLPHRHDRYDAMAIQVHLSKLKITRFPAVDGSKLTSQG-----MPPMENDQFNLKD 142 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDD-----------ADFSDEFSQLLP------- 106 E GC+ +H ++W+ + A++ E D + F +LL Sbjct: 143 GEKGCFRAHANIWQHMLEKDMAAALVFESDPPPHTNARPIMGRLNRGFRKLLQEEYPDNR 202 Query: 107 ------------------------HLSKCDINNILIKFDALRK-----KPKKDSYLCTLP 137 H + +D + + D L Sbjct: 203 DVFEENPDDPWLARSGTWDILSIGHCVDARNGGKYVVYDDPHAPNTNYQVEVDGGQAVLN 262 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDMK--------HWWEHNI 188 +++ + L TTGY + A +L P+D+ M + + Sbjct: 263 HKRIVYRAQTLFC-TTGYLVSLSGAARMLVRMNWKFDAPVDVIMNEMIGNGQLRAYSMHQ 321 Query: 189 PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPVS 248 +V T++S I+ KP + + +++ +R + Sbjct: 322 RPVVQWTYIPGVGKGTSNSDIQGD---GKPEQGTPEGWDRARKERSVY--GYRDGYSDAA 376 Query: 249 TTKF-LPSSSSSLI 261 F LPS+ +I Sbjct: 377 IVNFALPSAWHEVI 390 >gi|7465084|pir||C64597 lipopolysacharide biosynthesis glycosyl transferase - Helicobacter pylori (strain 26695) Length = 404 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 34/133 (25%), Positives = 49/133 (36%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDDA F Q L Sbjct: 61 MLKSDWFHSDWCRGELLPQEFGCYLSHYFLWKECVKTNQP-VVILEDDAMLESNFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 >gi|188527532|ref|YP_001910219.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143772|gb|ACD48189.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 216 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 18 IRVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 76 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + A ILEDD F Q L Sbjct: 77 MLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQPVA-ILEDDVALEPNFMQALE 135 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 136 DCLKSPFDFVRL 147 >gi|308063591|gb|ADO05478.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 202 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 18 IQVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 76 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 77 MLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALE 135 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 136 DCLKSPFDFVRL 147 >gi|331227147|ref|XP_003326242.1| hypothetical protein PGTG_08072 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309305232|gb|EFP81823.1| hypothetical protein PGTG_08072 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 371 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 70/214 (32%), Gaps = 25/214 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-FSFFDAI---YGENNPICNRIFSHQKRQCQFKR 60 V V+SL RR A + L F DA+ I RI ++ K + Sbjct: 94 VRVVSLNSRTDRRMHMQTLADFLQLDNLHFVDAVLYSDPRVLEIVQRIGNNTKADKKIAE 153 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 + + C +SH + +ILEDD D F L L + +I Sbjct: 154 ---VGHVACRMSHRLAIEAADTDDDEITLILEDDVDIEAAFKYLAGTILRDVPKDWDIIF 210 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-----NVRKNIYR 174 F + + NF I++ + GY + ++ L+ N R +Y Sbjct: 211 FGHTDFSNEARNGPDPSTNNFYIYK-SVEPQGGHGYALSRKG-RKLVLDLMDNKRPEVYE 268 Query: 175 -----PID---MDMKHWWEHNIPSLVTEPGAVYE 200 P+D + + H P + + Sbjct: 269 TDEGQPVDEIFVYLARL--HRAALYSIIPDLIVQ 300 >gi|157367411|gb|ABV45561.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 463 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 60/184 (32%), Gaps = 25/184 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---CNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 22 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDAQSML 80 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK +ILEDD F Q L Sbjct: 81 KSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVTLESNFMQALEDC 139 Query: 109 SKCDINNILI--KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 K + + + + K + T N ++ P ++ LL Sbjct: 140 LKSPFDFVRLYGHYWGGHKTNLCALPIYTENENEEVEAPIENHAEAEA---SRK--KRLL 194 Query: 167 NVRK 170 + K Sbjct: 195 KIMK 198 Score = 64.6 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 60/168 (35%), Gaps = 14/168 (8%) Query: 93 DDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 DD + ++ L IK + ++ +F + + + T Sbjct: 299 DD--LTKKYDDLTKKYESLLAKEANIK-ETFWERRADSEKEAFFLEHFYLTSVYVAT--T 353 Query: 153 TGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 GY++ + A + + P+DM M + H++ + P + +STI+ Sbjct: 354 AGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLNKHAFNSTIQ 413 Query: 211 ESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + + KP L++++ + +N + K P+ T K Sbjct: 414 GVKKPDISLKPPRKSYFDNLFYHKFNARKCLKAFNKYSKQYAPLKTPK 461 >gi|323456462|gb|EGB12329.1| hypothetical protein AURANDRAFT_61250 [Aureococcus anophagefferens] Length = 898 Score = 83.8 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 64/198 (32%), Gaps = 22/198 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHL--QFSFFDAIYGENNPICNRI-----------FSH 51 V +++L R + A DA+ G + + + Sbjct: 290 VLILNLARRPDRWRAMRDKLAAAGFGGDVRRVDAVDGAALDVSDAAVASIFNLTRWRYGA 349 Query: 52 QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGA--IILEDDADFSDEFSQLLPHL- 108 R IGC +SH+ WK +A + A ++LEDDA F D+F+ L Sbjct: 350 GARNAHQDHGYRPRVIGCALSHLAAWKAVAAAADDHAMHLVLEDDAVFVDDFAAKWASLA 409 Query: 109 --SKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQP--RILSPRTTGYFIGKEAAIH 164 D + L+ L + D G R Y + A Sbjct: 410 PHLAADYSWDLLHLGVLDDRDLYDDAPLAEYGGVKRFSARQRSYGAGAFAYVLRPRTARW 469 Query: 165 LLNVRKN--IYRPIDMDM 180 LL+ ++ I + +D + Sbjct: 470 LLDAARDRGIQQAVDWWI 487 >gi|157367393|gb|ABV45553.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 436 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 35/132 (26%), Positives = 46/132 (34%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ I Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVKL 131 Score = 65.4 bits (158), Expect = 8e-09, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 45/109 (41%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P V +STI Sbjct: 326 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNSTI 385 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 386 QNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKQYAPLKTPK 434 >gi|157367408|gb|ABV45560.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 409 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 35/133 (26%), Positives = 46/133 (34%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ I Q Sbjct: 2 ICVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 Score = 56.9 bits (136), Expect = 3e-06, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 34/92 (36%), Gaps = 4/92 (4%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY+I + A + + P+DM Sbjct: 270 ETFWERRADSEEEAFFLEHFYLTSVYVAS--TAGYYITPKGAKTFIEATERFKIIEPVDM 327 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 + + H++ +L P V STI+ Sbjct: 328 FINNPTYHDVATLTYLPLPVSLNKHCKISTIQ 359 >gi|157367391|gb|ABV45552.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 416 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 35/133 (26%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ I Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALKSNFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 Score = 57.6 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 44/117 (37%), Gaps = 5/117 (4%) Query: 96 DFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY 155 + + ++ +L+ + IK + ++ +F + + S T GY Sbjct: 253 ELTGKYDELMGKYESLSAKEVNIK-ETFWERRADSEKEAFFLEHFYLTSVYVAS--TAGY 309 Query: 156 FIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 +I + A + + P+DM + + H++ +L P V STI+ Sbjct: 310 YITPKGAKTFIEATERFKIIEPVDMFINNPTYHDVATLTYLPLPVSLNKHCKISTIQ 366 >gi|161869285|ref|YP_001598452.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis 053442] gi|161594838|gb|ABX72498.1| lacto-N-neotetraose biosynthesis glycosyl transferase LgtB [Neisseria meningitidis 053442] Length = 231 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 54/193 (27%), Gaps = 20/193 (10%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI---- 116 LS E C++SH LWK+ + EDD + + L + Sbjct: 13 YLSGVEKACFMSHAVLWKQALDEGLPYIAVFEDDVLLGESAEKFLAEDAWLQERFDPDSA 72 Query: 117 -LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--- 172 +++ + + T GY I K+A L+ + Sbjct: 73 FIVRLETMFMHVLTSPSGVADYCGRAFPLLESEHWGTAGYIISKKAMRFFLDRFAALPPE 132 Query: 173 -YRPIDMDM--KHWWEHNIPSLVTEPGAVYEAID---------TNDSTIEESRLVRKPTF 220 P+D+ M + +P P + + S IE RL+ + Sbjct: 133 GLHPVDLMMFSDFFDREGVPVCQLNPALCAQELHYAKFHDQNSALGSLIEHDRLLNRKQQ 192 Query: 221 SPLYFYRNTCYQW 233 + Sbjct: 193 WRDSPANTFKRRL 205 >gi|317181933|dbj|BAJ59717.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F57] Length = 187 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQP-IVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 >gi|206562084|ref|YP_002232847.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] gi|198038124|emb|CAR54072.1| putative glycosyltransferase [Burkholderia cenocepacia J2315] Length = 266 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 49/242 (20%), Positives = 77/242 (31%), Gaps = 44/242 (18%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VISL RR+ F RA L F F DAI G + + + Sbjct: 32 VISLRGERGRRDAF--RANNPALAFEFVDAIDGRALSDDDI---ARSGLFAPALPYTRGA 86 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ-LLPHLSKCDINNILI----KFD 121 IG ++ LW IA + I+ EDDA F +F + + L+ + FD Sbjct: 87 IGIAMTTHRLWTDIAAGNELVTIV-EDDAIFRPDFDETAMQFLAAHANRYDFVAWGYNFD 145 Query: 122 ALRKKPK-----------KDSYLCTLPGNFDIHQPRIL------SPRTTGYFIGKEAAIH 164 ++ + ++ L F + +L Y I A Sbjct: 146 SILRGAIFNGRTPVTMRFDEAALAQAVDEFRTDRGPVLVMDLLEFYGICAYTISPAGARF 205 Query: 165 LLNVRKNI------------YRP---IDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 LL+ + P ID+ M ++ + S+ P STI Sbjct: 206 LLDHCFPLQPETLFSHGLGRALPNYGIDVAMNKFY-GAMRSVAAFPPLAVTMNARATSTI 264 Query: 210 EE 211 + Sbjct: 265 QR 266 >gi|313768161|ref|YP_004061592.1| hypothetical protein BpV1_162c [Bathycoccus sp. RCC1105 virus BpV1] gi|312599768|gb|ADQ91789.1| hypothetical protein BpV1_162c [Bathycoccus sp. RCC1105 virus BpV1] Length = 271 Score = 83.5 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 79/238 (33%), Gaps = 24/238 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-------FSHQKRQC 56 ++++L S R EKF + + + + + + Sbjct: 29 KCFLLTLETSADRHEKFINHYDE-SIPLEIIYGTDTRKLENAKKYQKVIDPTYYREALKL 87 Query: 57 QFKRLLS--------LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLP 106 + + L IGCY+ H+ +KR A+I ED+ D+ + ++ Sbjct: 88 HYNANKTRPDITYFNLGAIGCYMGHMDFYKRCFEQNLKYAVIFEDNVVIKDKRIYKEIQS 147 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 ++K N + F L + P ++ P + + + +S T Y I E Sbjct: 148 VINKKGDNFEMCFFHCLSRYPDEEK-----PDKSGLERVKWISS-TKCYLINVENMKKYY 201 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 I +DM + + +++STI S ++ FS Y Sbjct: 202 KHFFPIDNHVDMKHEDIIARGARVYYKDLRHCLHIDRSHNSTIGHSNWGKRKFFSKRY 259 >gi|227498916|ref|ZP_03929055.1| conserved hypothetical protein [Acidaminococcus sp. D21] gi|226904367|gb|EEH90285.1| conserved hypothetical protein [Acidaminococcus sp. D21] Length = 202 Score = 83.1 bits (204), Expect = 3e-14, Method: Composition-based stats. Identities = 36/158 (22%), Positives = 61/158 (38%), Gaps = 1/158 (0%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI 118 LL++ E+GC +SH+ + K + A I EDD S+EF P + + I Sbjct: 17 NGLLTIGEVGCVLSHLKVCKDFLSTNAPYWTIFEDDIHLSNEFLDAYPRIESFMNSQIEP 76 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 LR+ K + L GN + + GY I ++AA +L+ + D Sbjct: 77 SVLLLRRNNGKGKVVYPLGGNHHVLHMLAGTM-ACGYIINRKAAENLVKALIPVRIEYDA 135 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 + I T+ V + ++I + R Sbjct: 136 WAIYQQLGYIKLYSTDFNNVELNPTMSSNSIIDQMEKR 173 >gi|332673581|gb|AEE70398.1| probable lipopolysaccharide biosynthesis protein [Helicobacter pylori 83] Length = 188 Score = 82.7 bits (203), Expect = 5e-14, Method: Composition-based stats. Identities = 39/136 (28%), Positives = 51/136 (37%), Gaps = 26/136 (19%) Query: 3 IPVYVISLPFSHARR----------EKFCHRAARIHLQFSFFDAIYGENNPIC---NRIF 49 I VY+ISL S R EKF R A F FDAI ++ ++ Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCA-----FQIFDAISPKHQDFEKFVQELY 56 Query: 50 SHQKRQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 Q + L L E GCY+SH LWK + +ILEDD F Sbjct: 57 DAQSMLKSDWFHSDYCYQELLLREFGCYLSHYLLWKECVKTDQP-VVILEDDVVLESNFM 115 Query: 103 QLLPHLSKCDINNILI 118 Q L K + + + Sbjct: 116 QALEDCLKSPFDFVRL 131 >gi|157367397|gb|ABV45555.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 442 Score = 82.3 bits (202), Expect = 6e-14, Method: Composition-based stats. Identities = 35/133 (26%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ + Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDVQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK +ILEDDA F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDAMLESHFMQALE 119 Query: 107 HLSKCDINNILIK 119 K + + + Sbjct: 120 DCLKSPFDFVKLF 132 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 34/92 (36%), Gaps = 4/92 (4%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY+I + A + + P+DM Sbjct: 303 ETFWERRADSEKEAFFLEHFYLTSVYVAS--TAGYYITPKGAKTFIEATERFKIIEPVDM 360 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 + + H++ +L P V STI+ Sbjct: 361 FINNPTYHDVATLTYLPLPVSLNKHCKISTIQ 392 >gi|261839553|gb|ACX99318.1| hypothetical protein HPKB_0724 [Helicobacter pylori 52] Length = 184 Score = 82.3 bits (202), Expect = 6e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 >gi|217032010|ref|ZP_03437511.1| hypothetical protein HPB128_187g37 [Helicobacter pylori B128] gi|216946320|gb|EEC24926.1| hypothetical protein HPB128_187g37 [Helicobacter pylori B128] Length = 223 Score = 82.3 bits (202), Expect = 6e-14, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---CNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ ++ Sbjct: 17 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDSSSLL 75 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 76 KSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALEDC 134 Query: 109 SKCDINNILI 118 K + + + Sbjct: 135 LKSPFDFVRL 144 >gi|308182779|ref|YP_003926906.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] gi|308064964|gb|ADO06856.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] Length = 200 Score = 81.9 bits (201), Expect = 8e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTDQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 >gi|298736313|ref|YP_003728839.1| family 25 glycosyl transferase [Helicobacter pylori B8] gi|298355503|emb|CBI66375.1| jhp0563-like glycosyltransferase, family 25 [Helicobacter pylori B8] Length = 459 Score = 81.9 bits (201), Expect = 8e-14, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPI---CNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ ++ Sbjct: 17 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFIQELYDSSSLL 75 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 76 KSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALEDC 134 Query: 109 SKCDINNILI 118 K + + + Sbjct: 135 LKSPFDFVRL 144 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P + +STI Sbjct: 349 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLNKHAFNSTI 408 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 409 QNAKKPDISLKPPKKSYFDNLFYHKFNARKCLKAFNKYSKQYAPLKTPK 457 >gi|39971229|ref|XP_367005.1| hypothetical protein MGG_03081 [Magnaporthe oryzae 70-15] gi|149210823|ref|XP_001522786.1| hypothetical protein MGCH7_ch7g884 [Magnaporthe oryzae 70-15] gi|86196839|gb|EAQ71477.1| hypothetical protein MGCH7_ch7g884 [Magnaporthe oryzae 70-15] gi|145017065|gb|EDK01428.1| hypothetical protein MGG_03081 [Magnaporthe oryzae 70-15] Length = 386 Score = 81.9 bits (201), Expect = 8e-14, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 63/218 (28%), Gaps = 47/218 (21%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + ++LP H R + +A ++ + F A+ + Q L Sbjct: 85 IVFLNLPHRHDRYDAMAIQAHLSGIEVTRFPAV-----AAADVQNDQGMPPTQKPGKLKD 139 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLP------- 106 E GC+ +H ++W+ + ++LE DA + F +LL Sbjct: 140 GEKGCWRAHANVWQHMLEKQIPAVLVLESDAGWDVNLRPIMGRLNGGFRRLLQMDNPDVA 199 Query: 107 HLSKCDINNI---------------------LIKFDALRKKPK--KDSYLCTLPGNFDIH 143 D + + +D + + Sbjct: 200 FADDPDDPWLARSGVWDLLSLGHCIDTSRGNFVVYDDPHAPSSDFTWGPVEEPLNQKRVV 259 Query: 144 QPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDM 180 TTGY + A LL N+ P+D+ M Sbjct: 260 FRAKSIVCTTGYLVSLTGAARLLVRSSFNLDDPVDVMM 297 >gi|297379818|gb|ADI34705.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori v225d] Length = 200 Score = 81.9 bits (201), Expect = 8e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 ISVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 >gi|328855374|gb|EGG04501.1| hypothetical protein MELLADRAFT_88821 [Melampsora larici-populina 98AG31] Length = 428 Score = 81.9 bits (201), Expect = 8e-14, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 60/171 (35%), Gaps = 11/171 (6%) Query: 52 QKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSK 110 ++R + +LS I C+ SH+ + ++ S A+ILEDD D + L LS+ Sbjct: 241 EERLRHWWNVLSDAAIACWHSHLSVIRKFVDSKQRMALILEDDVDLEVDIEARLEATLSE 300 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 + ++ + + + +P Y I K A L+ + Sbjct: 301 VPRDWDILLLGHCWSQESQFPPIFPG----SALRPAFTPKCNHAYAISKRGARKLIRHLR 356 Query: 171 N----IYRPIDMDMKHWWEH-NIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 + RP D + H I + P V + D S I R Sbjct: 357 SPAFAYSRPFDKAILHLILTERIKAFSFYPSIVVQTKD-LISDIAGGNGSR 406 Score = 42.2 bits (98), Expect = 0.063, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 24/59 (40%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +YV+ L R + + L F++F A ++ I N F Q+ + +S Sbjct: 32 IYVVHLERRVDRLSRMKALENLLGLSFTYFPATSADDQLIKNIRFHVQQERQLEGWPIS 90 >gi|323451437|gb|EGB07314.1| hypothetical protein AURANDRAFT_65015 [Aureococcus anophagefferens] Length = 854 Score = 81.9 bits (201), Expect = 9e-14, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 51/163 (31%), Gaps = 13/163 (7%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK--- 59 I +Y I+L + RRE + AR D + + R + Sbjct: 14 IKIYYINLDRATLRREAMEAQLARCD------DVVRVAAVDGAADAAAPAARLQAWGLED 67 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 S E+ C +SH+ R +++LEDDA + L ++ +++ Sbjct: 68 DDRSPAELACVLSHVEAAARARDGGGDLSLVLEDDACLAHVAVPLADIAARAPDGWTVLQ 127 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 D+ +L R Y + + A Sbjct: 128 LACNNA----DAQRASLANGAPFAPWRRDHWGAGAYLLSRAGA 166 >gi|15611630|ref|NP_223281.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] gi|4155113|gb|AAD06145.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori J99] Length = 444 Score = 81.9 bits (201), Expect = 9e-14, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 47/132 (35%), Gaps = 20/132 (15%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ + ++ Sbjct: 17 VYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKHQDFEKLLQELYDSSSLL 75 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK S +ILEDD F Q L Sbjct: 76 KSDWFHSDYCYQELLPQEFGCYLSHYFLWKECVKSNQP-VVILEDDVTLESHFMQALEDC 134 Query: 109 SKCDINNILIKF 120 K ++F Sbjct: 135 LKSP--FDFVRF 144 Score = 64.6 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 64/170 (37%), Gaps = 14/170 (8%) Query: 93 DDAD--FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP 150 DD + ++++ L+ IK + + + +F + + + Sbjct: 276 DDLNKNIAEKYDDLMGKYEALLAKETNIK-ETFWESRADNEKEALFLEHFYLTSVYVAT- 333 Query: 151 RTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 T GY++ + A + + P+DM M + H++ + P V +ST Sbjct: 334 -TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNST 392 Query: 209 IEESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 I+ ++ + KP L++++ + +N + K P+ T K Sbjct: 393 IQNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKRYAPLKTPK 442 >gi|291336902|gb|ADD96431.1| LosA [uncultured organism MedDCM-OCT-S09-C426] Length = 209 Score = 81.5 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 56/148 (37%), Gaps = 4/148 (2%) Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILI 118 R L+L EI C I HI K IA +ILEDD F+++F L K + I Sbjct: 57 RELNLAEISCTIKHILGIKSIAEKCNSYGLILEDDVVFAEDFKSKFNSFLEKTPEDWDAI 116 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 Y+ + ++ P + Y + KEAA + K D Sbjct: 117 FMGCCGGLRIPQPYIKEDKVAYQVNHPASRGGDS--YLLKKEAAQKIYETIKPFNTVSDW 174 Query: 179 DMKH-WWEHNIPSLVTEPGAVYEAIDTN 205 ++ + ++ H++ EP V + Sbjct: 175 ELSYQFYLHDMKVYWWEPPLVIQGSQNG 202 >gi|315504929|ref|YP_004083816.1| glycoside hydrolase family 16 [Micromonospora sp. L5] gi|315411548|gb|ADU09665.1| glycoside hydrolase family 16 [Micromonospora sp. L5] Length = 704 Score = 81.5 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 66/237 (27%), Gaps = 38/237 (16%) Query: 5 VYVISLPFSHARREKFCHRAARI----HLQFS----FFDAIYGENNPICNRIFSHQKRQC 56 + VI+L R + R + + A+ + Sbjct: 43 IIVINLDRQPRRWRRVTKELGRFRTSEGIPLTSITRRLAAVDARDGRAVAATVDVDVMYR 102 Query: 57 QFKR-------------------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 + +S E+ SH+ +WK +A ++LEDD F Sbjct: 103 IGDQLHVQPDARLAECFAEDEPVRMSRQEVAVARSHVEVWKAVANGTEDYVLVLEDDVWF 162 Query: 98 SDEFSQLL-----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI-HQPRILSPR 151 + L +C L DI +P Sbjct: 163 KPGAPAAIDRGWRAALDRCTAEGGP---KLLYLSYSDAGGTAARDEACDILFRPSRGLWF 219 Query: 152 TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 +GY + ++ A LL + P+D+ M + + + +L A+ + D Sbjct: 220 LSGYVLSRKGAAALLRAM-PVVGPVDLWMNYRFA-ELGALALTSPAIAQRHDEASDN 274 >gi|184199930|ref|YP_001854137.1| hypothetical protein KRH_02840 [Kocuria rhizophila DC2201] gi|183580160|dbj|BAG28631.1| hypothetical protein [Kocuria rhizophila DC2201] Length = 177 Score = 81.5 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 62/178 (34%), Gaps = 23/178 (12%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 V+SL S RR F + + F FDA +G F ++ + R E Sbjct: 2 VVSLADS-DRRPGFFAQ--PLGEIFEVFDAFHGATQDWTP-YFDAERFAGNYLRPPDPAE 57 Query: 67 IGCYISHIHLWKRIAY---SPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILI----- 118 IGC ISH + + A A ++ EDDA F+ + L +++ + + ++ Sbjct: 58 IGCAISHAQVIRAFAAEPGDDADLLLVAEDDARFTADLPCALRAVTEGPLPHDVVVLTDG 117 Query: 119 ----------KFDALRKKPKKDSYLCTLPGNF-DIHQPRILSPRTTGYFIGKEAAIHL 165 +F + S + P I + + Y + + A Sbjct: 118 LSLDPALHRRRFLTSISQLSLLSRTVSGPERRHRIGRFAGQGDCSGLYLMTRGGARKF 175 >gi|317014040|gb|ADU81476.1| family 25 glycosyl transferase [Helicobacter pylori Gambia94/24] Length = 445 Score = 81.1 bits (199), Expect = 1e-13, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ ++ Sbjct: 4 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDSSSLL 62 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 63 KSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVTLESNFMQALEDC 121 Query: 109 SKCDINNILI 118 K + + + Sbjct: 122 LKSPFDFVRL 131 Score = 64.2 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 14/170 (8%) Query: 93 DDAD--FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP 150 DD + ++++ L+ IK + + +F + + + Sbjct: 277 DDLNKNIAEKYDDLMGKYEALLAKEANIK-ETFWESRADSEKEALFLEHFYLTSVYVAT- 334 Query: 151 RTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 T GY++ + A + P+DM M + H++ + P V +ST Sbjct: 335 -TAGYYLTPKGAKTFIEATDRFKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNST 393 Query: 209 IEESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 I+ ++ + KP L++++ + +N + K P+ T K Sbjct: 394 IQNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKQYAPLKTPK 443 >gi|260548798|ref|ZP_05823020.1| glycosyltransferase family 25 family protein [Acinetobacter sp. RUH2624] gi|260407966|gb|EEX01437.1| glycosyltransferase family 25 family protein [Acinetobacter sp. RUH2624] Length = 258 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 50/263 (19%), Positives = 89/263 (33%), Gaps = 31/263 (11%) Query: 3 IPVYVISLPFSH-ARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R + F + F I G+ + + + K Sbjct: 1 MKKYLISIEADNSPRLQNFFSQSNFQPYKGDFKKI-GIKGK--ELTVKEY-FDLAVATQK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA--DFSDEFSQLLPHLSKCDINN-I 116 + LS E+GC +SH+ + S A++LEDDA + +Q + ++ N Sbjct: 57 QPLSPGELGCTLSHMQALRDFITSNEKYALVLEDDAIQIMDIDLNQFKSEVVNLNLQNCF 116 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP-------RTTGYFIGKEAAIHLLN-V 168 + ++ K Y P L P T Y + KE A LL+ Sbjct: 117 FMSLGGIQLKVNDRIYGKIQPKKIYNRNILKLHPYSIADLSYTYAYVVDKEMAKVLLDYH 176 Query: 169 RKNIYRPIDMDMKHWWEHNIPSLVT----EPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 K ++ ++ I T P ++ + S+IE R + Sbjct: 177 HKPHVCDHWDELYKFYP-KINFYATFLFDHPEIIFHT--HSTSSIELERKLLSEKNKVER 233 Query: 225 -----FYRNTCYQW-NLHYNAWR 241 R+ +W L Y +R Sbjct: 234 NIFEKLNRSFKKRWLKLFYEQYR 256 >gi|157367395|gb|ABV45554.1| jhp0563-like glycosyltransferase [Helicobacter pylori] Length = 412 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 34/125 (27%), Positives = 43/125 (34%), Gaps = 18/125 (14%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 I VY+ISL S R + + + F FDAI ++ + Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQEFYDAQS 60 Query: 57 QFK----------RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 K L E GCY+SH LWK +ILEDD F Q L Sbjct: 61 MLKSDWFHSDWCCGELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKC 111 K Sbjct: 120 DCLKS 124 Score = 56.1 bits (134), Expect = 5e-06, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 51/140 (36%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY++ + A + + P+DM Sbjct: 273 ETFWERRADSEEEAFFLEHFYLTSVYVAS--TAGYYLTPKGAKTFIEATERFKIIEPVDM 330 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLV-------RKPTFSPLYFYRNTCY 231 + + H++ +L P V STI+ + +K F L++ + Sbjct: 331 FINNPTYHDVATLTYLPLPVSLNKHCKISTIQNLKKSDISLIGPKKSYFDNLFYDQLNTR 390 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + K P+ T K Sbjct: 391 KCLKAFHKYSKQYAPLKTPK 410 >gi|108563026|ref|YP_627342.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] gi|107836799|gb|ABF84668.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] Length = 193 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 45/130 (34%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQKRQ 55 VY+ISL S R + + + F FDAI ++ ++ Sbjct: 11 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDSSSLL 69 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK +ILEDD F Q L Sbjct: 70 KSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKLNQP-VVILEDDITLESHFMQALEDC 128 Query: 109 SKCDINNILI 118 K + + + Sbjct: 129 LKSPFDFVRL 138 >gi|171318033|ref|ZP_02907204.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] gi|171096767|gb|EDT41645.1| glycosyl transferase family 25 [Burkholderia ambifaria MEX-5] Length = 218 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 43/120 (35%), Gaps = 13/120 (10%) Query: 7 VISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 I+L R E + A ++ A+ + + R + Sbjct: 16 CINLDRRPDRWEAMQRKFAEQNILTVERLSAVDARQVTVPESL-----------RHMRPQ 64 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALR 124 + GC +SH+ K+ + A +I EDDA F +F+ +++ + ++ A Sbjct: 65 DYGCTMSHLAAVKQAKAAGASEVLIFEDDAFFDPDFAARFPEFIAQLPNDWHMLFLGAYH 124 >gi|170701947|ref|ZP_02892871.1| glycosyl transferase family 25 [Burkholderia ambifaria IOP40-10] gi|170133154|gb|EDT01558.1| glycosyl transferase family 25 [Burkholderia ambifaria IOP40-10] Length = 218 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 43/120 (35%), Gaps = 13/120 (10%) Query: 7 VISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 I+L R E + A ++ A+ + + R + Sbjct: 16 CINLDRRPDRWEAMQRKFAEQNILTVERLSAVDARQVTVPESL-----------RHMRPQ 64 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALR 124 + GC +SH+ K+ + A +I EDDA F +F+ +++ + ++ A Sbjct: 65 DYGCTMSHLAAVKQAKAAGASEVLIFEDDAFFDPDFAARFPEFIAQVPNDWHMLFLGAYH 124 >gi|323457338|gb|EGB13204.1| hypothetical protein AURANDRAFT_60431 [Aureococcus anophagefferens] Length = 2376 Score = 80.8 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 66/190 (34%), Gaps = 9/190 (4%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RLLSL 64 + I L + R E F R A + F A+ G + ++ Sbjct: 375 HCIVLASAARRVENF-ERLAAAAPEIKRFKAVEGGAPDAVGQAIKDLGITLATDVWQATV 433 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADF---SDEFSQLLPHLSKCDINNILIKFD 121 ++ C SH LW+R + P ++LEDDA F +D F + + + + + + Sbjct: 434 GQVACLCSHASLWRRFSACPKPHFVVLEDDARFPAGADAFRRAVDDVVRGEP-FDVCYLY 492 Query: 122 ALRKKPKKDSYLCTLPGNFDIH-QPRILSPRTTGYFIGKEAAIHL--LNVRKNIYRPIDM 178 + P F + P + Y + K A L L + +Y PID Sbjct: 493 VYPDHWPSPPRVADDPNRFRVSTTPGFRTWCLLAYVVSKRGAEKLRRLIETEEVYAPIDC 552 Query: 179 DMKHWWEHNI 188 + W + Sbjct: 553 MVADWGSRGL 562 >gi|317177549|dbj|BAJ55338.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 188 Score = 80.4 bits (197), Expect = 2e-13, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 47/132 (35%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IQVYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHQDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPQEFGCYLSHYLLWKECVKTNQP-VVILEDDVALEFNFMQALE 119 Query: 107 HLSKCDINNILI 118 + + + Sbjct: 120 DCLNSPFDFVRL 131 >gi|323448909|gb|EGB04802.1| hypothetical protein AURANDRAFT_72438 [Aureococcus anophagefferens] Length = 1145 Score = 80.4 bits (197), Expect = 2e-13, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 60/206 (29%), Gaps = 28/206 (13%) Query: 7 VISLPFSHARREKFCH---RAARIHLQFSFFDAIYG----ENNPICNRIF---------- 49 +I+L R E F +A + +A G N +F Sbjct: 138 LINLKRRTDRLEAFLEDAPKALGDGVAVDVVEAFDGFQLHRGGSFLNGMFEPLWDASNLA 197 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSP--AIGAIILEDDADFSDEFSQLLPH 107 + R S E GC +SH +W+RIA +I EDD F L P Sbjct: 198 DREPSVEPGCRFASFGEQGCALSHASIWRRIAPRGTKDSWHLICEDDVKLHPRF-ALWPR 256 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 L + + + F ++ G + + Y + AA L Sbjct: 257 LWHARCDVVYLGFHDYGQRR-------YAEGTGNCCFRPEYTLGMHCYALRNTAAAKLAG 309 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLVT 193 + P+D + + Sbjct: 310 HL-PVVGPVDAWLADNQWFGLDVFCL 334 >gi|310792759|gb|EFQ28220.1| glycosyltransferase family 25 [Glomerella graminicola M1.001] Length = 410 Score = 80.0 bits (196), Expect = 3e-13, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 69/241 (28%), Gaps = 51/241 (21%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 I+LP + + +A + + + + G + + LS E Sbjct: 107 INLPIRQDKADAATVQAFLSGIDINIMEGVNGSRLGE---VGMPPSSIPDTEPGLSGGEK 163 Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS------------------ 109 GC+ SH + W+ + + +I+EDD + +++ S Sbjct: 164 GCWRSHANAWRSMLRANRGTVLIMEDDVTWDINIKEVMRLASSNLAVLLNGTRMFDIGQS 223 Query: 110 -----------KCDINNIL----------------IKFDALRKKPKKDSYL-CTLPGNFD 141 L IK+D K+ Y L G Sbjct: 224 QSKVRNALFDPWYSDRWDLITFGQCGDGDGWKSERIKYDDPYTNGKQSEYFGVGLKGGRR 283 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRK-NIYRPIDMDM-KHWWEHNIPSLVTEPGAVY 199 + + T Y + A LL ++ P+D+ M + + P V Sbjct: 284 MVRRAGGMTCTGAYAVSARGAAKLLLRSNFDLNLPVDVIMNDMIGNGQLHAYSIWPPPVV 343 Query: 200 E 200 + Sbjct: 344 Q 344 >gi|39946558|ref|XP_362816.1| hypothetical protein MGG_08232 [Magnaporthe oryzae 70-15] gi|145020131|gb|EDK04359.1| hypothetical protein MGG_08232 [Magnaporthe oryzae 70-15] Length = 393 Score = 80.0 bits (196), Expect = 3e-13, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 71/224 (31%), Gaps = 55/224 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +++ +LP H R + +A L+ S F A+ + L Sbjct: 89 IFL-NLPHRHDRFDAMAIQAHLSKLKISRFPAVDPSTLTSQG-----MPPMENDQANLKE 142 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLSKCDI 113 E GC+ +H ++W+ + A++LE DA + F +LL + Sbjct: 143 GEKGCFRAHANVWQHMLEKGIDSALVLESDAGWDINFRPIMGRLNGGFRKLLQEEHPDNR 202 Query: 114 N----------------NILIK---------------FDALRKKPKKDSYLCTLPGNFDI 142 + ++ +D P + + G + Sbjct: 203 DVFEENPDDPWLARSGTWDILSIGHCVDARGGGRYVIYDDPHA-PSTNYQVEVDEGQAVL 261 Query: 143 HQPRILS-----PRTTGYFIGKEAAIHLLNVR-KNIYRPIDMDM 180 ++ RI+ TTGY + A +L P+D+ M Sbjct: 262 NRKRIVYRAQTLFCTTGYLVSLSGAARMLVRMNWKFDAPVDVIM 305 >gi|167538615|ref|XP_001750970.1| hypothetical protein [Monosiga brevicollis MX1] gi|163770558|gb|EDQ84246.1| predicted protein [Monosiga brevicollis MX1] Length = 663 Score = 79.6 bits (195), Expect = 4e-13, Method: Composition-based stats. Identities = 46/231 (19%), Positives = 73/231 (31%), Gaps = 44/231 (19%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE---NNPICNRIFSHQKRQCQFKRL 61 VYVI+L R F R+ F A + R+F+ + + Sbjct: 123 VYVINLRRRPDRLHGFLQRSNVDPDSIHIFSAFDKRELAWSQSLRRLFAGNRFHSRRAV- 181 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC--DINNILIK 119 I +SH LW+ IA S +ILEDD + +F + + ++ Sbjct: 182 -----IAKALSHFTLWRHIATSEQQYHLILEDDVQLAHDFVARWNAQIQHAFPSDADVVF 236 Query: 120 FDAL----RKKPKKDSYLCTLPGNFDIHQP---------------------RILSPRTTG 154 L R + S L + +F++H+P R R Sbjct: 237 LGGLSSGNRAYYSQASTLRRVSRHFNLHEPTTLFHADWDPELDQGDATTPTRAFHYRPCA 296 Query: 155 YFIGKEAAIHLLNV-RKNIYRPIDMDM----KHWWEHNIPSLVTEPGAVYE 200 Y + AA L+ + K +R M W+ P A E Sbjct: 297 YILSSRAAQSLVQLVSKKGFRYSPDIMLLKAMRQWD---QVYALHPLAANE 344 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 51/177 (28%), Gaps = 19/177 (10%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 I L RR F + + + G + + + R ++ Sbjct: 453 IGLQSDEERRAAFEANLRQAGISMFWHVGTDGSSLMMDKLFERNVVAVPLTSRQ--RGDV 510 Query: 68 GCYISHIHLWKRI-----AYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILIKF 120 G +SH+ L +++ + + + EDDA F LL + + L F Sbjct: 511 GRTLSHMLLHEKLHKLPASNGEIVAHAVFEDDAILPPNFPTLLERALERLGRSDWDLFSF 570 Query: 121 DALR---KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR 174 K P +Y P P Y + +A L I R Sbjct: 571 HCAAACTKPPTSSAYPVPAPRAECNISP-------MAYLVNSKAVGRLFQYLLPIRR 620 >gi|296121303|ref|YP_003629081.1| glycosyl transferase family 2 [Planctomyces limnophilus DSM 3776] gi|296013643|gb|ADG66882.1| glycosyl transferase family 2 [Planctomyces limnophilus DSM 3776] Length = 1122 Score = 79.6 bits (195), Expect = 4e-13, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 35/94 (37%), Gaps = 4/94 (4%) Query: 7 VISLPFSHARREKFC-HRAARIHLQFSFFDAIYGENNPICNRI--FSHQKRQCQFKRLLS 63 ++LP + RR+K + FF A + + R L+ Sbjct: 909 CVNLPRATERRKKITCEWIDGLGFPIKFFPAFDRRELEKGRSFFQYEDSAAINKIGRPLT 968 Query: 64 LPEIGCYISHIHLW-KRIAYSPAIGAIILEDDAD 96 EI C SH + + + ++ G IILEDD Sbjct: 969 AGEIACASSHALVIREEMEFTGPEGVIILEDDVT 1002 >gi|325995898|gb|ADZ51303.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 2018] Length = 200 Score = 79.6 bits (195), Expect = 4e-13, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI + + ++ Sbjct: 4 VYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKYQDFEKLVQELYDSSSLL 62 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 63 KSDWFHSDYCYQELLPQEFGCYLSHYFLWKECVKTNQP-VVILEDDVALESHFMQALEDC 121 Query: 109 SKCDINNILI 118 K + + + Sbjct: 122 LKSPFDFVRL 131 >gi|325997493|gb|ADZ49701.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 2017] Length = 404 Score = 79.6 bits (195), Expect = 4e-13, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI + + ++ Sbjct: 4 VYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKYQDFEKLVQELYDSSSLL 62 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 63 KSDWFHSDYCYQELLPQEFGCYLSHYFLWKECVKTNQP-VVILEDDVALESHFMQALEDC 121 Query: 109 SKCDINNILI 118 K + + + Sbjct: 122 LKSPFDFVRL 131 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 46/109 (42%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P + + +STI Sbjct: 294 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLSKHAFNSTI 353 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 354 QNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKRYAPLKTPK 402 >gi|145606536|ref|XP_365567.2| hypothetical protein MGG_02269 [Magnaporthe oryzae 70-15] gi|145014167|gb|EDJ98808.1| hypothetical protein MGG_02269 [Magnaporthe oryzae 70-15] Length = 377 Score = 79.6 bits (195), Expect = 4e-13, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 73/236 (30%), Gaps = 55/236 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ I+LP H R + ++ +Q + F A+ ++Q K Sbjct: 77 IFFINLPHRHDRFDAMAIQSHIADIQVTRFAAVD-------TSTLTNQGMPPMQKTDFLP 129 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC------------- 111 E GC+ +H ++W+ + + +++LE DA F + ++ L+ Sbjct: 130 SEKGCFRAHANVWQHMLENKIPASLVLESDAGFDAKLRPIMGRLNTAFRELLKKDNPGVA 189 Query: 112 -------------DINNILIKF-DALRKKPKKDSYLC------------------TLPGN 139 + L + K SY+ +LP Sbjct: 190 FDETNTDDPWLARSDTWDYLSIGHCLDTRLKNGSYVVYHDPDSSAKGFDVDDIEPSLPRA 249 Query: 140 FDIHQPRILS-PRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDM-KHWWEHNIPSLV 192 R S GY + A LL N+ PID+ M + + Sbjct: 250 HKRVVYRADSIICLAGYILSLSGAARLLVRSSLNLDAPIDVLMADMIRAGQLRTYS 305 >gi|331213105|ref|XP_003319234.1| hypothetical protein PGTG_01408 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309298224|gb|EFP74815.1| hypothetical protein PGTG_01408 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 362 Score = 79.2 bits (194), Expect = 5e-13, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 70/209 (33%), Gaps = 19/209 (9%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-FSFFDAI---YGENNPICNRIFSHQKRQCQFKR 60 + V+SL RR + + L+ F DA+ I R+ ++ K + Sbjct: 92 IRVVSLKSRTDRRTHMQTLTSFLRLENLHFTDAVLYTDPRVLEIVQRVGNNTKADKKIAE 151 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIK 119 + + C +SH + +ILEDD D F L L + +I Sbjct: 152 ---VGHVACRMSHRLAIEAADADDDQITLILEDDVDMEAAFKYLAGTVLRDAPKDWDIIF 208 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY------ 173 F + T NF I++ I GY + K+ +L++ N Sbjct: 209 FGHTDFSDEARHGRDTSTSNFYIYK-SIEPQGGHGYALSKKGRKLILDLLDNKRPEVYET 267 Query: 174 ---RPID-MDMKHWWEHNIPSLVTEPGAV 198 +PID + M H P + Sbjct: 268 DEGQPIDEIFMYLARLHRAQLYSIIPDMI 296 >gi|307637307|gb|ADN79757.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 908] Length = 167 Score = 78.8 bits (193), Expect = 6e-13, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 45/130 (34%), Gaps = 18/130 (13%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQ 55 VY+ISL S R + + + F FDAI + + ++ Sbjct: 4 VYIISLKESQRRLDTEKLILESNEKFKGRCV-FQIFDAISPKYQDFEKLVQELYDSSSLL 62 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 + L E GCY+SH LWK +ILEDD F Q L Sbjct: 63 KSDWFHSDYCYQELLPQEFGCYLSHYFLWKECVKLNQP-VVILEDDVALESHFMQALEDC 121 Query: 109 SKCDINNILI 118 K + + + Sbjct: 122 LKSPFDFVRL 131 >gi|261250525|ref|ZP_05943100.1| glycosyltransferase [Vibrio orientalis CIP 102891] gi|260939094|gb|EEX95081.1| glycosyltransferase [Vibrio orientalis CIP 102891] Length = 246 Score = 78.4 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 84/247 (34%), Gaps = 20/247 (8%) Query: 3 IPVYVISLPFS-HARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 I ++I + RR+ + ++ + + G+ + + + + R F Sbjct: 2 IKTFIIHVSKGYEERRKHIDSHLPSRGINEYEYM--LRGDIDDLSDAV-----RHEFFGS 54 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 LSL E C+ H + K ++LEDDA D F+Q + + Sbjct: 55 KLSLAEKSCFYKHYLVMKEAVERNIEQVLVLEDDALLVDSFTQDFSLILSELEGRS--NY 112 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDM 178 A ++ L N ++ ++ GY A ++ + I PID Sbjct: 113 FANIEEASNSVPLSVRKSNQCLYPCKVNKLC-GGYVFDLVFAKKMVEYIESQQIDTPIDG 171 Query: 179 DM-KHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK--PTFSPLYFYRNTCYQWNL 235 + E P V + + ES+L + ++ + + Y+ ++ Sbjct: 172 LIGDKIDELQFNLYWVHPPVVKQGSKVG---MFESQLSGRTSGIYATVRSWFKEGYRTHI 228 Query: 236 HYNAWRK 242 + +K Sbjct: 229 RSHVSKK 235 >gi|260548797|ref|ZP_05823019.1| glycosyltransferase family 25 family protein [Acinetobacter sp. RUH2624] gi|260407965|gb|EEX01436.1| glycosyltransferase family 25 family protein [Acinetobacter sp. RUH2624] Length = 255 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 85/229 (37%), Gaps = 26/229 (11%) Query: 3 IPVYVISLPF-SHARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ + R +K + + +F F + G+N + Q+ K Sbjct: 1 MKKYLISIESTNSERLKKLYSQPTFYKYRDEFKQF-GVIGKNLSVSEY---FQQGVAGKK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLPHLSKCDINNIL 117 + ++ E+GC +SHI K S AII EDD F +F++L + + + L Sbjct: 57 KPMTPGELGCTMSHIAALKDFLSSDDEYAIIFEDDVIERFEVDFNELEEQVKRIGLTTPL 116 Query: 118 ------IKFDALRKKPKKDSYLCTLPGNFDIHQPRILS--PRTTGYFIGKEAAIHLLNVR 169 I+ RK K L + P Y + +E A L++ Sbjct: 117 FLSLGGIQMKICRKVKGKYQPNFLLDQHVLKVDPFFYENLAYAYAYIVDREMAQILIDFH 176 Query: 170 KNIYRPIDMDMKH-WWEHNIPSLVTEPGAVYEAIDTND----STIEESR 213 ++ D ++ +I V +++ + + S +E R Sbjct: 177 RSPRV-YDQWADLIIFKQSINIYV---SYIFDHPEIENEKDKSYLENER 221 >gi|210134822|ref|YP_002301261.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] gi|210132790|gb|ACJ07781.1| lipopolysaccharide biosynthesis protein [Helicobacter pylori P12] Length = 405 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 47/132 (35%), Gaps = 18/132 (13%) Query: 3 IPVYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQK 53 I VY+ISL S R + + + F FDAI ++ ++ Q Sbjct: 2 IGVYIISLKESQRRLDTEKLVSESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQS 60 Query: 54 RQC-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 + L E GCY+ H LWK + +ILEDD F Q L Sbjct: 61 MLKSDWFHSDYCYQELLPREFGCYLGHYFLWKECVKTNQP-VVILEDDVALESNFMQALE 119 Query: 107 HLSKCDINNILI 118 K + + + Sbjct: 120 DCLKSPFDFVRL 131 Score = 61.5 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 37/96 (38%), Gaps = 4/96 (4%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ + +F + + + T GY++ + A + + P+DM Sbjct: 266 ETFWERRADNEKEALFLEHFYLTSVYVAT--TAGYYLTPKGAKTFIEATERFKIIEPVDM 323 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 M + H++ + P V +STI+ ++ Sbjct: 324 FMNNPTYHDVANFTYLPCPVSLNKHAFNSTIQNAKK 359 >gi|209545546|ref|YP_002277775.1| glycosyl transferase family 25 [Gluconacetobacter diazotrophicus PAl 5] gi|209533223|gb|ACI53160.1| glycosyl transferase family 25 [Gluconacetobacter diazotrophicus PAl 5] Length = 239 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 71/236 (30%), Gaps = 44/236 (18%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 + ISL + R E+F A + F + GE I + + Sbjct: 4 FFISLDRTPERAERFLAANAHVP-GFERSPGVDGETMDIDA---MKEMGLVSDACTFTRA 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-CDINNILIKFDALR 124 +G ++H+ LW +A S + A I EDDA F Q + ++ +I + Sbjct: 60 ALGAGLAHVALWGNVAESG-VPAYIFEDDAYLCRNFEQESARIIAGLPVDWEIILWGNNH 118 Query: 125 KKPKKDSYLCTLPGNFDIH-------------------QPRILS--PRTTGYFIGKEAAI 163 + L + F I P L+ T GY I A Sbjct: 119 RGMTLFELLPGITQCFAIVPQASVREGVDAFREMDVTTLPFRLTETFGTCGYAISPAGAR 178 Query: 164 HLLNVRKNIYR---P-------------IDMDMKHWWEHNIPSLVTEPGAVYEAID 203 ++ + P ID M + + S V+ P + Sbjct: 179 KMIKRCLPLTTVKVPHVCLGGRILQATGIDTLMNRHYA-QLKSYVSFPPLCLTDDE 233 >gi|302680883|ref|XP_003030123.1| hypothetical protein SCHCODRAFT_235978 [Schizophyllum commune H4-8] gi|300103814|gb|EFI95220.1| hypothetical protein SCHCODRAFT_235978 [Schizophyllum commune H4-8] Length = 334 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 56/161 (34%), Gaps = 14/161 (8%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAI-GAIILEDDADFSDEFSQLLPHLSK-CDINNILIK 119 L+ P + C+ SHI + +RI IILEDD D + ++ L L + ++ Sbjct: 157 LTPPRVACWQSHISVIRRIVDEGRDGPFIILEDDVDMEKDTARRLRDLWPLLPSDWDVVF 216 Query: 120 FDA-----LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--- 171 P + T P P + T Y + + A LL ++ Sbjct: 217 LGHCWSNEAHHPPLNPDHSPTFPLRQTRLHPSMAPKCTHAYALSRSGARRLLLHLRHAPF 276 Query: 172 -IYRPIDMDMKHWWEHN-IPSLVTEPGAVYE--AIDTNDST 208 R ID I P V + A +++ +T Sbjct: 277 AYSRAIDQAFAWLVTTKRIKGFSVVPSVVIQRKADNSDINT 317 >gi|317009367|gb|ADU79947.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori India7] Length = 179 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 46/129 (35%), Gaps = 18/129 (13%) Query: 6 YVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQKRQC 56 Y+ISL S R + + + F FDAI ++ ++ Q Sbjct: 5 YIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFDAISPKHEDFEKFVQELYDAQSMLK 63 Query: 57 -------QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS 109 + L E GCY+SH LWK + +ILEDD F Q L Sbjct: 64 SDWFHSDYCYQELLPREFGCYLSHYFLWKECVKTNQP-VVILEDDVALEFNFMQALEDCL 122 Query: 110 KCDINNILI 118 K + + + Sbjct: 123 KSPFDFVRL 131 >gi|322705080|gb|EFY96669.1| hypothetical protein MAA_07952 [Metarhizium anisopliae ARSEF 23] Length = 415 Score = 78.1 bits (191), Expect = 1e-12, Method: Composition-based stats. Identities = 35/251 (13%), Positives = 77/251 (30%), Gaps = 61/251 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I LP + R + ++ + + + A+ P + + L Sbjct: 96 IYFIYLPSRYDRLDAMSLQSYLSGVDLTEYPAV----GPQLIKDVGMPPTRK--PGKLRT 149 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLS---- 109 E GC+ +H ++W + +I+E DA + ++ F Q L ++ Sbjct: 150 SEQGCWRAHANIWSTMLRQKLPPLLIVESDATWDINIRKIMPNLNNHFRQFLREINSTQL 209 Query: 110 -----------------------KCDINNILIKFDALRK-------KPKKDSYLCTLPGN 139 + ++ F + + + L + Sbjct: 210 HNPAWPAEAVARNESTRASDDDPWLSSHWDILSFGQCHETDENNDIRLIYNDPFVPLGKD 269 Query: 140 F--------DIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDMK-HWWEHNIP 189 + + + TT Y + + A LL N+ P+DM M+ ++ Sbjct: 270 YQGRALTHERVIRRSGGITCTTAYAVSQTGAAKLLLRTAVNLDMPVDMVMQEMIVSGDLV 329 Query: 190 SLVTEPGAVYE 200 S P + + Sbjct: 330 SYSVMPPIMAQ 340 >gi|15892983|ref|NP_360697.1| putative lipopolysaccharide biosynthesis protein [Rickettsia conorii str. Malish 7] gi|15620180|gb|AAL03598.1| lipopolysaccharide biosynthesis protein-like protein [Rickettsia conorii str. Malish 7] Length = 105 Score = 77.7 bits (190), Expect = 1e-12, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 37/101 (36%), Gaps = 2/101 (1%) Query: 80 IAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + A+I+EDDA +D+F + L L + LI K K + P Sbjct: 1 MVKDNIPYALIIEDDAILNDDFRNKFLTMLKHLPTDWDLIYLSLSHSKNKIFYNIYNNP- 59 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 TTGY I +AA LL KN ID Sbjct: 60 YLKKIGHSGYFNTTTGYLIHLKAAQKLLEYSKNFTLEIDNV 100 >gi|312599314|gb|ADQ91337.1| hypothetical protein BpV2_170c [Bathycoccus sp. RCC1105 virus BpV2] Length = 271 Score = 77.7 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 74/238 (31%), Gaps = 24/238 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-------FSHQKRQC 56 ++++L S RREKF + + + + + + Sbjct: 29 KCFLLTLETSADRREKFINHYDE-SVPLEIIYGTDTRKLENAKKYQKIIEPNYYREALKL 87 Query: 57 QFKRLLS--------LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLP 106 + + L IGCY+ H+ ++R A+I ED+ D+ + ++ Sbjct: 88 HYNANKTRPDITYFNLGAIGCYMGHMDFYRRCFEQNLKYAVIFEDNVVIKDKRVYREIQD 147 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 ++K + + F L + P ++ T Y I + Sbjct: 148 VINKKGDDFEMCFFHCLSRYPDEEHKEKGDIERV------KWISSTKCYLIHVDNMKRYY 201 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLY 224 I +DM + + +++STI S ++ FS Y Sbjct: 202 KHFFPIDNHVDMKHEDIIARGARVYYKDLRHCLHIDRSHNSTIGHSNWGKRKFFSKRY 259 >gi|115361212|ref|YP_778349.1| glycosyl transferase family protein [Burkholderia ambifaria AMMD] gi|115286540|gb|ABI92015.1| glycosyl transferase, family 25 [Burkholderia ambifaria AMMD] Length = 218 Score = 77.7 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 43/120 (35%), Gaps = 13/120 (10%) Query: 7 VISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 I+L R E + A ++ A+ + + R + Sbjct: 16 CINLDRRPDRWEAMQRKFAEQNILTVERLSAVDARQVTVPESL-----------RHMRAQ 64 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALR 124 + GC +SH+ K+ + A +I EDDA F +F+ +++ + ++ A Sbjct: 65 DYGCTMSHLAAVKQAKAAGASEVLIFEDDAFFDRDFAARFPEFIAQVPNDWHMLFLGAYH 124 >gi|34581322|ref|ZP_00142802.1| hypothetical lipopolysaccharide biosynthesis protein [Rickettsia sibirica 246] gi|28262707|gb|EAA26211.1| hypothetical lipopolysaccharide biosynthesis protein [Rickettsia sibirica 246] Length = 105 Score = 77.7 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 37/101 (36%), Gaps = 2/101 (1%) Query: 80 IAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + A+I+EDDA +D+F + L L + LI K K + P Sbjct: 1 MVKDNIPYALIIEDDAILNDDFRNKFLTMLKHLPTDWDLIYLSLSHSKNKIFYNIYNNP- 59 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 TTGY I +AA LL KN ID Sbjct: 60 YLKKIGHGGYFNTTTGYLIHLKAAQKLLEYSKNFTLEIDNV 100 >gi|323449453|gb|EGB05341.1| hypothetical protein AURANDRAFT_66529 [Aureococcus anophagefferens] Length = 1871 Score = 77.3 bits (189), Expect = 2e-12, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 64/211 (30%), Gaps = 28/211 (13%) Query: 19 KFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW 77 + + + F A G+ ++ + + S EIGC +SH+ Sbjct: 569 RMEKQFEEAGISNFKRVPAFDGKKLFGDPKLLASTVELPSEMKQ-SSGEIGCSLSHLRAA 627 Query: 78 KRIAYSPAIGAIILEDDADFS--DEFSQLLPHL-SKCDINNILIKFDALRKKPKKDSYLC 134 + +++EDD + + + + S+ + +++ + + Sbjct: 628 EMALELDDDYVMVMEDDIHLTFYGAWRTTVKEIVSQAPPDWQVLQLTINNVRVLRT---- 683 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-------NVRKNIYRPI------DMDMK 181 L R T Y I + + N P D+ M Sbjct: 684 LLGLGSSFVPWRKNHWSTGAYLINRRGCQRAVDEFTRGPNSSPRYRLPANVQLVSDVLM- 742 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEES 212 +N P T +++ + +STI + Sbjct: 743 ----YNGPGAYTHTRPLFDH-EIKESTIHQG 768 >gi|163803274|ref|ZP_02197153.1| hypothetical protein 1103602000575_AND4_17921 [Vibrio sp. AND4] gi|159172911|gb|EDP57749.1| hypothetical protein AND4_17921 [Vibrio sp. AND4] Length = 250 Score = 77.3 bits (189), Expect = 2e-12, Method: Composition-based stats. Identities = 46/248 (18%), Positives = 92/248 (37%), Gaps = 22/248 (8%) Query: 5 VYVISL-PFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 V+VI + RR+ L + + + G+ + + + + F+ Sbjct: 8 VFVIHVSEGYEERRQHIDKHLPEKGLHNYEYM--LRGDIKDLTSEV-----KAKYFQEND 60 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 LPE+ C HI +++ + A++LEDDA + + L L + ++ + + Sbjct: 61 KLPELSCAYKHILVYEEMVKRGIECALVLEDDAFLATDALTKL-RLVEQEMRDEVNFIIN 119 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDMD- 179 + + + PG + + R GY I E A + PID Sbjct: 120 IEHSNRSVPFKVKQPG--KLCYLASHTKRCGGYIIHLEVAKKFVAFFDSNQTGLPIDAFQ 177 Query: 180 --MKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF-SPLYFYRNTCYQWNLH 236 M+ +N + +P V++ + + E RK ++ + L + YQ + Sbjct: 178 THMRDILNYN--TFWMDPPVVFQG--SKNGAFESELSQRKKSYLTRLTSFFRDNYQRYVV 233 Query: 237 YNAWRKDL 244 N +K L Sbjct: 234 TNLSKKRL 241 >gi|146103895|ref|XP_001469671.1| glycosyl transferase [Leishmania infantum JPCM5] gi|134074041|emb|CAM72782.1| glycosyl transferase-like protein [Leishmania infantum JPCM5] gi|322503713|emb|CBZ38799.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 285 Score = 77.3 bits (189), Expect = 2e-12, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 65/222 (29%), Gaps = 50/222 (22%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQF---SFFDAIYGENNPICN------------RIF 49 ++ I+L R AR L A+ G + R Sbjct: 21 IFAINLDRRPDRWAAVQAVCARAGLPAERTERVPAVEGSRLDVNAAHRCGFVSALGLRRL 80 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIA---------YSPAIGA------IILEDD 94 + L+ +GC +SHIHLW RIA A ++LEDD Sbjct: 81 KEPPEHHIWGMDLNKAALGCALSHIHLWARIAALGTVSNFSAETPAAALPKQCFLVLEDD 140 Query: 95 ADF----------------SDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + QL ++ + L+ + + ++ ++ G Sbjct: 141 STLTDSDDSGSGSPAASPSLPFLDQLQRRMNSVPPDWELV-YVSGLDTARQCPHMQVAKG 199 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 + + T Y + + A LL + +D M Sbjct: 200 ---VARVPQYHRTTNAYLVTPQGARRLLATCVPLTFQLDTAM 238 >gi|229587049|ref|YP_002845550.1| LPS biosynthesis glycosyltransferase [Rickettsia africae ESF-5] gi|228022099|gb|ACP53807.1| Glycosyltransferase involved in LPS biosynthesis [Rickettsia africae ESF-5] Length = 105 Score = 76.9 bits (188), Expect = 2e-12, Method: Composition-based stats. Identities = 29/101 (28%), Positives = 39/101 (38%), Gaps = 2/101 (1%) Query: 80 IAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 + A+I+EDDA +D+F + L L + LI K K + P Sbjct: 1 MVKDNIPYALIIEDDAILNDDFRNKFLTILKHLPTDWDLIYLSLSHSKNKIFYNIYNNPY 60 Query: 139 NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMD 179 I + TTGY I +AA LL KN ID Sbjct: 61 LKKIGHGGYFNT-TTGYLIHLKAAQKLLEYSKNFTLEIDNV 100 >gi|284053776|ref|ZP_06383986.1| glycosyl transferase family protein [Arthrospira platensis str. Paraca] Length = 284 Score = 76.1 bits (186), Expect = 4e-12, Method: Composition-based stats. Identities = 37/153 (24%), Positives = 61/153 (39%), Gaps = 23/153 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +YVI+LP RR + I L F N +IF K + S Sbjct: 16 KIYVINLPERVDRRREMEKEIKSIGLNF----------NSEKVKIFPALKPTEKLAFP-S 64 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNILIK 119 + +GCY+SH+ + K +++EDD S F +QLL LS+ + + + + Sbjct: 65 IGVLGCYLSHLEIIKIAKRDKLSHILVMEDDLAISSRFCSVQTQLLDELSQVNWDLLFLG 124 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + A K L ++ + P I + T Sbjct: 125 YFAYHK--------LKLSDYYNFYDPEISTTWT 149 >gi|322712270|gb|EFZ03843.1| hypothetical protein MAA_00917 [Metarhizium anisopliae ARSEF 23] Length = 388 Score = 75.8 bits (185), Expect = 6e-12, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 72/252 (28%), Gaps = 64/252 (25%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I++ + R + +A + + A+ E + L Sbjct: 70 IYFINMKKRYDRLDAVSLQAYLSGVDVMPYSAVEPEMINDVG--------MPPTRVSLKE 121 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLS---- 109 E GC+ +H ++W I +ILE DA + + F Q L ++S Sbjct: 122 GEKGCWRAHANIWSAILRGKLPPVLILESDAAWDINLRKIMPNLNKHFRQFLFNISSARI 181 Query: 110 ------------------------KCDINNILIKFDALRKK---------------PKKD 130 + L+ + P Sbjct: 182 PNPGWNANFDESISWERKEATDDPWLSDHWDLLSLGQCHETSENSNVSSIYHDPTVPPGK 241 Query: 131 SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDM-KHWWEHNI 188 Y + G + + TT Y I + A LLN + P+D+ + + + Sbjct: 242 DYFGRVLGRERVIRQAGGIVCTTAYAISQTGAAKLLLNTAVKLDAPVDLVIREMILSGEL 301 Query: 189 PSLVTEPGAVYE 200 + P + + Sbjct: 302 VAYSVMPPIIAQ 313 >gi|32477031|ref|NP_870025.1| hypothetical protein RB11533 [Rhodopirellula baltica SH 1] gi|32447579|emb|CAD79178.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 445 Score = 75.8 bits (185), Expect = 7e-12, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 81/249 (32%), Gaps = 33/249 (13%) Query: 9 SLPFSHARREKFCHRAARIHLQF---SFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +L R +++ + F F AI G R+ + + + Sbjct: 3 NLDRRDDRMQEWMRQLPD-PWPFPEPERFAAIDGR------RVATPPQWRAGNGAW---- 51 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALR 124 GCY SH+ + ++ ++ EDDA F D+F + L +++ + + Sbjct: 52 --GCYRSHLLILEKCLLEHIDSYVVFEDDAGFGDDFCERLQEFIAELPADWGMAYLGGQH 109 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 K+ + + G+EA L D KH Sbjct: 110 LYAGKNPPHKVSEHVYRPYNVNRTHAFM---VRGREAMKTLYRHL-TWN---DWHTKHHI 162 Query: 185 EHNIPSLVTEPG-AVYEAIDTNDSTIEESRLVR-----KPTFSPL--YFYRNTCYQWNLH 236 +H++ L+ A+ + + +I R PT S + + T + +N Sbjct: 163 DHHLGRLIQRRYQALVQGKNIQKESIAVYTPDRWLVGQLPTKSNICGRKWSQTRF-FNDA 221 Query: 237 YNAWRKDLP 245 NA D P Sbjct: 222 KNADHSDAP 230 >gi|167588960|ref|ZP_02381348.1| glycosyl transferase, family 25 [Burkholderia ubonensis Bu] Length = 218 Score = 75.4 bits (184), Expect = 7e-12, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 71/214 (33%), Gaps = 24/214 (11%) Query: 7 VISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 I+L R E + A ++ A+ + + + Sbjct: 16 CINLDRRPDRWEAMQRKFAEQNILTVERLSAVDAKLVSVPE-----------HLSHMRPQ 64 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALR 124 + GC +SH+ K+ + A +I EDDA F +F+ +++ + ++ A Sbjct: 65 DYGCTMSHLAAVKQAKAAGASEVLIFEDDAFFDADFTARFPEFIAQVPDDWHMLFLGAYH 124 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWW 184 + +P +I + + + Y + + + +N ID + Sbjct: 125 -------FTQPIPVAPNIVK-TVETLTAHAYVVRDSLYDAFIEINENPPAIIDRNNLVLQ 176 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 + EP V ++ S I E + KP Sbjct: 177 QT-FNCYCFEPNLV--GQESGYSDIMEEVMPEKP 207 >gi|162147436|ref|YP_001601897.1| glycosyl transferase [Gluconacetobacter diazotrophicus PAl 5] gi|161786013|emb|CAP55595.1| Glycosyl transferase [Gluconacetobacter diazotrophicus PAl 5] Length = 239 Score = 75.4 bits (184), Expect = 7e-12, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 68/236 (28%), Gaps = 44/236 (18%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 + ISL + R E+F A + F + GE I + + Sbjct: 4 FFISLDRTPERAERFLAANAHVP-GFERSPGVDGETVDIEA---MREMGLVSDACEFTRA 59 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALR 124 IG ++H+ W +A S I EDD F Q ++ ++ +I + + Sbjct: 60 AIGAGLAHVAFWGNVAESGVPAH-IFEDDTYLCRNFEQESARVIADLPVDWEIILWGSNH 118 Query: 125 KKPKKDSYLCTLPGNFDIHQPRIL---------------------SPRTTGYFIGKEAAI 163 + L + F I + + T GY I A Sbjct: 119 RGMALFELLPGITQCFTIVPQTSVREGIDAFGMMDVTTLPFRLTQTFGTCGYAISPAGAR 178 Query: 164 HLLNVRKNI-------------YRP---IDMDMKHWWEHNIPSLVTEPGAVYEAID 203 ++ + P ID M + + S ++ P + Sbjct: 179 KMIKRCLPLTTVKVPHACLGGRILPATGIDTLMNRHYA-EMKSYISFPPLCLTDDE 233 >gi|291567094|dbj|BAI89366.1| putative glycosyl transferase [Arthrospira platensis NIES-39] Length = 283 Score = 75.4 bits (184), Expect = 8e-12, Method: Composition-based stats. Identities = 41/166 (24%), Positives = 63/166 (37%), Gaps = 38/166 (22%) Query: 1 MPI-----PVYVISLPFSHARREKFCHRAARIHLQFS-----FFDAIYGENNPICNRIFS 50 M I +YVI+LP RR + I L F F AI P FS Sbjct: 7 MKISDFFDKIYVINLPERVDRRREMEKEIKSIGLNFDSEKVNIFPAIR----PTEKLAFS 62 Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLP 106 S+ +GCY+SH+ + K +++EDD S F +QLL Sbjct: 63 ------------SIGMLGCYLSHLEIIKIAKRDQLSNILVMEDDLAISSRFCSVQTQLLD 110 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 LS+ + + + + + A K L ++ + P I + T Sbjct: 111 ELSQVNWDLLFLGYFAYHK--------LKLSDYYNFYDPEISTTWT 148 >gi|145605136|ref|XP_001405883.1| hypothetical protein MGG_13711 [Magnaporthe oryzae 70-15] gi|145013002|gb|EDJ97643.1| hypothetical protein MGG_13711 [Magnaporthe oryzae 70-15] Length = 395 Score = 75.0 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 73/220 (33%), Gaps = 52/220 (23%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 + I+LP H R + ++ +Q + F A+ + + + + + L Sbjct: 86 KIAFINLPHRHDRFDAIALQSHLSGIQITHFPAVD--VSTLRSSGLPPTENPTR----LK 139 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE-----------FSQLL------P 106 E GC+ +H ++W+ + ++LE DA F F +LL Sbjct: 140 DTERGCWRAHANVWQHMLEHQIPAVLVLESDAGFDVNLRPIMGRLNGAFRELLRKENPRA 199 Query: 107 HLSKCDIN--------NILIKF-------------------DALRKKPKKDSYLCTLPGN 139 L + + L+ DA + + L G Sbjct: 200 ALGEDPDDPWLARSGTWDLLSIGHCADGRGSDGAYVVYDDPDAPHNNWRYEPLSRVLNGQ 259 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDM 178 +++ + TTGY + A +L N+ P+D+ Sbjct: 260 RIVYRANNVVC-TTGYLVSLSGAARMLVRTAWNLEEPVDI 298 >gi|149912515|ref|ZP_01901049.1| hypothetical protein RAZWK3B_00965 [Roseobacter sp. AzwK-3b] gi|149812921|gb|EDM72747.1| hypothetical protein RAZWK3B_00965 [Roseobacter sp. AzwK-3b] Length = 251 Score = 74.6 bits (182), Expect = 1e-11, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 68/247 (27%), Gaps = 5/247 (2%) Query: 11 PFSHARREKFCHRAARIHLQF--SFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIG 68 S + + + +F I G + LLS E+G Sbjct: 4 ARSIDIYKITSDALSEVETEFGLHRIQPIEGAKLDANSYFGKILDNYRDTGELLSPSELG 63 Query: 69 CYISHIHLWKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILIKFDALRKKP 127 C +SHI +++R+ + AIILE D + S D++ + + + Sbjct: 64 CTLSHISVYQRVVEANRC-AIILEADISPHGMHLEDAIAFCSGTDLDFVHLGWHPQISYN 122 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN 187 +P Y++ A LL+ + R D + + Sbjct: 123 VYFVGRFDTRKQAFRIEPSRNFYGAFAYYVSPTVAFELLDFHQATLRKADYWSEFFETSK 182 Query: 188 IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRKDLPPV 247 P + +S V+ P R + + P+ Sbjct: 183 TKPY-FRPFFEHPLERGELDRQRKSAFVKTKRMPPKRLKRILKSRLRTFFCRTFSGYRPI 241 Query: 248 STTKFLP 254 + P Sbjct: 242 KPGEAQP 248 >gi|326781961|ref|YP_004322363.1| glycosyltransferase family 25 [Synechococcus phage S-SM2] gi|310003151|gb|ADO97549.1| glycosyltransferase family 25 [Synechococcus phage S-SM2] Length = 287 Score = 74.6 bits (182), Expect = 1e-11, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 74/219 (33%), Gaps = 34/219 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 VY ++L RR+ + A ++ + G + + + + ++ ++ Sbjct: 24 VYWLNLDADVERRKYMEDQFAYWEVENHTRISGYDGREDDVSCHL------KGKYPDHMN 77 Query: 64 LPEIGCYISHIHLWKRI-AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E+GC +SH+ K + +I+EDDA E ++ + + +D Sbjct: 78 QQEVGCCMSHLKAIKHFYEETDDDYCMIMEDDAVL--EIAKFWNFTWQEFFSYAPYDWDC 135 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGK-EAAIHLLNVRKNIYRPIDMDMK 181 L+ + +H + Y I + AA L N + Sbjct: 136 LQLTTITTGDI-----YVKLHLKFVNDFSAAVYLITRHHAAKVLRNHMRG---------D 181 Query: 182 HWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 W N +P AV + D+ +E + P F Sbjct: 182 KWKLDN----NVKPRAV-----SEDTIMESGKTYSIPIF 211 >gi|209525525|ref|ZP_03274064.1| glycosyl transferase family 25 [Arthrospira maxima CS-328] gi|209494024|gb|EDZ94340.1| glycosyl transferase family 25 [Arthrospira maxima CS-328] Length = 287 Score = 74.2 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 40/178 (22%), Positives = 67/178 (37%), Gaps = 19/178 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +YVI+LP RR + I L F N +IF K + S Sbjct: 19 KIYVINLPERVDRRREMEKEIKSIGLNF----------NSEKVKIFPAIKPTEKLAFP-S 67 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNILIK 119 + +GCY+SH+ + K +++EDD S F +QLL LS+ + + + + Sbjct: 68 IGVLGCYLSHLEIIKIAKTDKLSHILVMEDDLAISSRFCSVQTQLLDELSQVNWDLLFLG 127 Query: 120 FDALRKKPKKDSYLCTLPG----NFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY 173 + A K D Y P + + + T Y + A L+ + + Sbjct: 128 YLAYNKLKLSDYYNFYDPDISTTWTHLKKANYPTLGTHFYAVNHTAYDSLILFLEELL 185 >gi|83313620|ref|YP_423884.1| hypothetical protein amb4521 [Magnetospirillum magneticum AMB-1] gi|82948461|dbj|BAE53325.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 304 Score = 73.8 bits (180), Expect = 2e-11, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 66/236 (27%), Gaps = 36/236 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENNPICNRIFSHQKRQC 56 + +++L AR + R + A+ + Sbjct: 43 ILILNLDRQPARWRRVLRELGRFRTSDGDHLTSITTRLAAVDARDGRAVAATADVDPNYR 102 Query: 57 QFKR-------------------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 + ++ E+ SHI +WK +A ++LEDD F Sbjct: 103 LGNQLHVQPDARLEECFSADEPVRMTRQEVAVARSHIEVWKAVATGKHEHVLVLEDDVWF 162 Query: 98 SDEFSQLL-----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + L +C + + +P Sbjct: 163 RPGAAAAIDRGWCAALERCRTEGGPRLLYLSYADAGGTANRADPCE--ALFRPVRGLWFL 220 Query: 153 TGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 +GY + +E A LL + P+DM + + ++ + +L + + D Sbjct: 221 SGYVLSREGAAALLRAM-PVIGPVDMWINYRFQ-ELGALALSSPVILQRQDCGSDN 274 >gi|218708253|ref|YP_002415874.1| putative glycosyltransferase [Vibrio splendidus LGP32] gi|218321272|emb|CAV17222.1| putative glycosyltransferase [Vibrio splendidus LGP32] Length = 246 Score = 73.4 bits (179), Expect = 3e-11, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 70/210 (33%), Gaps = 19/210 (9%) Query: 3 IPVYVISLPFS-HARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 I +VI + RR+ + + + + A ++ S R F Sbjct: 2 IKTFVIHVSKGYEERRKHIDKHLPEVGITDYEYMLAGDIDD-------LSDDVRNGFFDD 54 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L LPE C+ H + K++ ++LEDDA + +F + L K+ Sbjct: 55 SLRLPEKSCFYKHYLVMKKVVEEQIPQVLVLEDDAFLNSDFIEQLEAFQSELAGEE--KY 112 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDM 178 ++ N ++ + T G A ++ ++ PID Sbjct: 113 LVNIEEASSLVPTSVRVPNQHLYLCKTNKL-TGGLLYDLSFARAMVEHMESNITDAPIDG 171 Query: 179 DMKHW---WEHNIPSLVTEPGAVYEAIDTN 205 + + E+NI T P V + T Sbjct: 172 YIGNERTTLEYNI--FWTHPPLVRQGSKTG 199 >gi|86147278|ref|ZP_01065593.1| 3-deoxy-D-manno-octulosonic-acid transferase [Vibrio sp. MED222] gi|85834993|gb|EAQ53136.1| 3-deoxy-D-manno-octulosonic-acid transferase [Vibrio sp. MED222] Length = 246 Score = 73.4 bits (179), Expect = 3e-11, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 71/210 (33%), Gaps = 19/210 (9%) Query: 3 IPVYVISLPFS-HARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 I +VI + RR+ + + + + A ++ S + R F Sbjct: 2 IKTFVIHVSKGYEERRKHIDKHLPEVGITDYEYMLAGDIDD-------LSDEVRNGFFDD 54 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L LPE C+ H + K++ ++LEDDA + +F + L K+ Sbjct: 55 SLRLPEKSCFYKHYLVMKKVVEEQIPQVLVLEDDAFLNSDFIEQLEAFQNELRGEE--KY 112 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDM 178 ++ N ++ + T G A ++ ++ PID Sbjct: 113 LVNIEEASSLVPASVRVPNQHLYLCKTNKL-TGGLLYDLSFARAMVEHMESNITDAPIDG 171 Query: 179 DMKHW---WEHNIPSLVTEPGAVYEAIDTN 205 + + E+NI T P V + T Sbjct: 172 YIGNERTTLEYNI--FWTHPPLVRQGSKTG 199 >gi|145247574|ref|XP_001396036.1| hypothetical protein ANI_1_2260104 [Aspergillus niger CBS 513.88] gi|134080775|emb|CAL00889.1| unnamed protein product [Aspergillus niger] Length = 412 Score = 73.1 bits (178), Expect = 4e-11, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 74/247 (29%), Gaps = 58/247 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLS 63 ++ I+LP +R+ QF D I G + + + + + + R+L Sbjct: 77 IFCINLPSRPDKRDAIT--LGSSVTQF-RVDWIDGVSSEDMSPKAYPPRYDEPDRPRML- 132 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP---------------HL 108 EIG + +H++ +RI A+ILEDD D+ L Sbjct: 133 AGEIGSWRAHLNAMQRIVSERISSALILEDDVDWDVTLKNQLQGFALGTLALQAESHPKT 192 Query: 109 SKCDINNILIKFDALRKKPKKDSYLCTLPGN---------------------------FD 141 + ++ K +K+ L + Sbjct: 193 TPYGDEWDILWLGHCGTKCQKNIPFYILKNDPTSIPVYGLPQYWAGPAVHELVDNIKHNR 252 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRP------IDMDMKHWWEHN-IPS 190 I L+ ++ Y + AA +L + P D+ + E + Sbjct: 253 IICKTSLTVCSSAYAVSFNAAQKILAALSVLPDDESMPPGQSVVYDVMLGRLCESGYLRC 312 Query: 191 LVTEPGA 197 + + P Sbjct: 313 ISSHPSL 319 >gi|330952480|gb|EGH52740.1| glycoside hydrolase family 16 [Pseudomonas syringae Cit 7] Length = 704 Score = 73.1 bits (178), Expect = 4e-11, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 65/236 (27%), Gaps = 36/236 (15%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ--------FSFFDAIYGENNPICNRIFS------ 50 + +++L R + R A+ + Sbjct: 43 ILIVNLDRQPRRWRRVVRELRRFRTADDVSLASIARRLAAVDARDGRAVAATADVDSTYR 102 Query: 51 -------------HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF 97 + ++ EI SH+ WK IA ++LEDD F Sbjct: 103 IGDQLYVQPDARLEECFPINEPIKMTRQEIAVARSHVEGWKTIACGADNYVLVLEDDVWF 162 Query: 98 SDEFSQLL-----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 + + + +C + + L + +P Sbjct: 163 KTGAAAAISCGWRAAIQRCCAEGGPRLLYLSYEDAGGTAERVDLCD--ALFRPVRGLWFL 220 Query: 153 TGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 +GY + +E A LL + P+DM + + + + +L + + D+ Sbjct: 221 SGYILSREGAAALLRAM-PVIGPVDMWLNYRF-GELGALALSSPVILQRQDSGSDN 274 >gi|303271423|ref|XP_003055073.1| predicted protein [Micromonas pusilla CCMP1545] gi|226463047|gb|EEH60325.1| predicted protein [Micromonas pusilla CCMP1545] Length = 720 Score = 72.7 bits (177), Expect = 5e-11, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 57/197 (28%), Gaps = 21/197 (10%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +V+ LP R +F A+ ++ + + + + + ++ Sbjct: 13 FVVHLPRLAGRLARFQAAASTANVTAYELIEGVDANDLDATELRRQRDNGELTSWAEVTP 72 Query: 65 PEIGCYISHIHLWKRIAYS-------PAIGAIILEDDADFSDEF-------SQLLPHLSK 110 E+ SH W+ A A+ILEDD + F Sbjct: 73 TEVAIARSHHRAWRTFLDRDDDGGGGGAPYALILEDDVELRAGFVNALNALHAAATQKRS 132 Query: 111 CDINNILIKFDA-----LRKKPKKDSYLCTLPGNFDIH-QPRILSPRTTGYFIGKEAAIH 164 ++ R + + LP + +P T Y + + A Sbjct: 133 DGDGYDVLLLHNQDVLGRRARRTPLAPSPPLPRGLALAIEPTPYVAGTCAYVMSRRFARA 192 Query: 165 LLNVRKNIYRPIDMDMK 181 L R + PID + Sbjct: 193 ALARRYPVTLPIDFFVG 209 >gi|89902422|ref|YP_524893.1| glycoside hydrolase family protein [Rhodoferax ferrireducens T118] gi|89347159|gb|ABD71362.1| glycoside hydrolase, family 16 [Rhodoferax ferrireducens T118] Length = 593 Score = 72.3 bits (176), Expect = 7e-11, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 55/152 (36%), Gaps = 9/152 (5%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL-----SKCDINNI 116 ++ EI SHI +W+ S A++LEDD F F ++L I Sbjct: 1 MTQAEIAVARSHIGVWRAFVASSESYALVLEDDVWFERGFGRILEQAWREMEDAHPAVPI 60 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 + + + L N + +P +GY + K A LL++ P+ Sbjct: 61 FDVLYVSYNEVRYGAPKEILSKN--VFRPERGLWYLSGYVLSKRGAQALLDLL-PCRGPV 117 Query: 177 DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 D+ + + H + + + D + + Sbjct: 118 DLWINQKF-HELKVRALRRSVINQRRDLHSTN 148 >gi|242212166|ref|XP_002471918.1| predicted protein [Postia placenta Mad-698-R] gi|220729016|gb|EED82898.1| predicted protein [Postia placenta Mad-698-R] Length = 629 Score = 71.9 bits (175), Expect = 8e-11, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 58/189 (30%), Gaps = 39/189 (20%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAI----------------------GAIILEDDADFSD 99 L+ ++ C++SH+ + +RIA S A I+LEDD D Sbjct: 428 LTPAKVACWMSHLEVIRRIANSVAPDTLASAPSGNAVVTSDNPLSRGVGIVLEDDIDMER 487 Query: 100 EFSQLLPHL-SKCDINNILIKF-----DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 + L + + ++ D S T P T Sbjct: 488 DIQARLQDVWTALPSEWDVVFLGHCWSDESYYPALPPSPSATFKKTSTNLHPSFAPKCTH 547 Query: 154 GYFIGKEAAIHLLNVRKN----IYRPIDMDMKHW-WEHNIPSLVTEPGAVYE------AI 202 Y + + A LL ++ R ID M + + P + + I Sbjct: 548 AYALTRAGARRLLLHLRHPPFAYSRAIDQAMSWLIQSGRLRAFSAVPSLIVQRKVAESDI 607 Query: 203 DTNDSTIEE 211 D+ +S I Sbjct: 608 DSGNSGIGS 616 >gi|149912506|ref|ZP_01901040.1| glycosyl transferase, family 25 [Roseobacter sp. AzwK-3b] gi|149812912|gb|EDM72738.1| glycosyl transferase, family 25 [Roseobacter sp. AzwK-3b] Length = 250 Score = 71.9 bits (175), Expect = 8e-11, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 54/143 (37%), Gaps = 25/143 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQ-----FSFFDAIYGENNPICNRIFSHQKRQCQFK 59 VY+I+LP RR + + AR+ L SFF A+ + + Sbjct: 22 VYIINLPERADRRREIEGQLARLGLSSAHPSISFFKAV----------------KPSEAG 65 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----LLPHLSKCDINN 115 S+ GC++SH+ + + S +ILEDD D+S F +L L + Sbjct: 66 GWPSVGAHGCFMSHLGVLEEALESRFDNVLILEDDMDWSPRFIDGGMRILDELGSKTWHF 125 Query: 116 ILIKFDALRKKPKKDSYLCTLPG 138 I + + K + P Sbjct: 126 IHGGLRSPKAKSAEPVLRALSPE 148 >gi|145602931|ref|XP_362246.2| hypothetical protein MGG_13779 [Magnaporthe oryzae 70-15] gi|145011279|gb|EDJ95935.1| hypothetical protein MGG_13779 [Magnaporthe oryzae 70-15] Length = 687 Score = 71.9 bits (175), Expect = 8e-11, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 51/152 (33%), Gaps = 11/152 (7%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 + VI+LP RR+ AA +L + I G + R+ R K+ Sbjct: 344 KILVINLPSRTDRRDAISLAAALTNLTVEYVPGIDGS--EVQERVLPADSRNKSIKK--- 398 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 +G + +HI+ + I +ILEDD D+ L L Sbjct: 399 -GNVGSWRAHINALRTIVEQDLTTVLILEDDVDWDVRLKSQLQLFGAASRAY----LQPL 453 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGY 155 + P + L P L P T+ Y Sbjct: 454 KTNPSQTLSQAALTVPLS-SAPETLPPSTSPY 484 >gi|307246034|ref|ZP_07528116.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307250375|ref|ZP_07532323.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|307252757|ref|ZP_07534648.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307255016|ref|ZP_07536834.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307259452|ref|ZP_07541177.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|307261601|ref|ZP_07543269.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|307263790|ref|ZP_07545396.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|306852969|gb|EFM85192.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306857585|gb|EFM89693.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306859789|gb|EFM91811.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306861889|gb|EFM93865.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306866388|gb|EFM98251.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306868724|gb|EFN00533.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306870911|gb|EFN02649.1| Lipooligosaccharide biosynthesis protein lpsA [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 78 Score = 71.9 bits (175), Expect = 9e-11, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 4/81 (4%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 A+ H+ F FFDAI + + F L+ EI C +SHI LW+ Sbjct: 1 MMSEFAKQHIPFVFFDAITPDLIERKAKEFGIDITTS----PLTKGEIACALSHIALWRL 56 Query: 80 IAYSPAIGAIILEDDADFSDE 100 I E D + Sbjct: 57 AQEQGLDYIAIFEGDIYLGEN 77 >gi|78060655|ref|YP_367230.1| hypothetical protein Bcep18194_C7542 [Burkholderia sp. 383] gi|77965205|gb|ABB06586.1| hypothetical protein Bcep18194_C7542 [Burkholderia sp. 383] Length = 296 Score = 71.5 bits (174), Expect = 1e-10, Method: Composition-based stats. Identities = 46/268 (17%), Positives = 82/268 (30%), Gaps = 55/268 (20%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQ--FSFFDAIYGENNPICNRIFSHQKRQCQF 58 M + Y I+L S ARR + + + L+ F A+ G F + + Sbjct: 1 MSLDGYYINLDRSVARRIRIEKQIHALGLESMIRRFAAVDG----GTAGPFDAALKNRTW 56 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGA--IILEDDADFSDEFSQLLPH--------- 107 C SH ++I + IILEDD + S FS ++ Sbjct: 57 ---------ACRQSH----EKIIDGGDAKSTTIILEDDIEISQAFSDIVTDEVMRKFTDD 103 Query: 108 -------LSKCDINNILIKFDALRKKPKKDSYLCTL----PGNFDIHQPR------ILSP 150 +C + + + L G +++ I S Sbjct: 104 EPTVDIVFLECGFQWRFLPLLLAKADSRMTWRLSPSGNDDSGGYNLSTVDLLDAKGIYSW 163 Query: 151 RTTGYFIGKEAAIHLLNV-RKNIYRP---IDMDMKHWWEHN-IPSLVTEPGAVYEAIDTN 205 + Y + L + +P ID+ K W + + + P +T Sbjct: 164 SSAAYIVTPVGKQTLRRLFSAQRDQPGVSIDVLYKRWIDAGELKGKICIPFLATPHSET- 222 Query: 206 DSTIEE--SRLVRKPTFSPLYFYRNTCY 231 STI+ S L+ + +R + Sbjct: 223 SSTIDHAPSELLDPVPTEWVSLFRRFLF 250 >gi|15603006|ref|NP_246078.1| hypothetical protein PM1141 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721488|gb|AAK03225.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 208 Score = 71.1 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 55/194 (28%), Gaps = 25/194 (12%) Query: 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILIKFDALRKKPKKD 130 +W + I EDD + L + +I+ + ++ + Sbjct: 1 MWHKCIQDNLPYIAIFEDDILLGRDARTFLAEDEWLFSRFNCDDIFIIRLETFLQETICE 60 Query: 131 SYLCTLP-GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPIDMDMKHWWE 185 + + D + T GY I AA +LL + KN+ PID + + + Sbjct: 61 ALPNPVSYCGRDFLALKDEHLGTAGYIISLGAAKYLLEIFKNMESNNIFPIDHLIFNRFL 120 Query: 186 HN--------IPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHY 237 P+L + + E DS +E R + Sbjct: 121 AGEELMVYQLSPALCIQELQLNENESLLDSQLESERKN-------YRLAEKARKKKTWRE 173 Query: 238 NAWRKDLPPVSTTK 251 + P K Sbjct: 174 KVYHIFTKPQRMLK 187 >gi|327540120|gb|EGF26714.1| beta-1,4-galactosyltransferase waaX [Rhodopirellula baltica WH47] Length = 229 Score = 70.7 bits (172), Expect = 2e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 61/211 (28%), Gaps = 16/211 (7%) Query: 4 PVYVIS-LPF-SHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 P +VIS L S R + R A + +R Q +++ Sbjct: 9 PTFVISTLDEQSRDRVNQVEQHLRRAGFRNSQIIQAKTPQTEDFEHRGLPDVL-QGRWRT 67 Query: 61 LL--SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS-KCDINNIL 117 L C +SHI + R + IILEDD +F + L + +I L Sbjct: 68 DLQHLWGTAACTLSHIQFYDR--AEHQLPIIILEDDVTIHPDFFRFLDDVDIPDEIEWDL 125 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + L L + P Y + + L+ +D Sbjct: 126 CHLSYTNPQFSSQANLNRLVAPHLLMCPPDEITGAYSYIVNRS----FLDRFLPSVEEVD 181 Query: 178 MDMKHWWEHNIPSLVTE--PGAVYEAIDTND 206 + H I S V E P Sbjct: 182 WQLAH-QTDEIASYVIEHDPPLTAPDYRLES 211 >gi|327282249|ref|XP_003225856.1| PREDICTED: procollagen galactosyltransferase 1-like [Anolis carolinensis] Length = 527 Score = 70.7 bits (172), Expect = 2e-10, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 70/195 (35%), Gaps = 32/195 (16%) Query: 79 RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL---RKKPKKDSYLCT 135 I +++ EDD F F + + +L D+ + +D + RK+ + + Sbjct: 327 EIVDRGLEKSVVFEDDLRFEIFFKRRIMNLM-YDLEEEGLDWDLIYIGRKRMQVEHSEKA 385 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPID-------------MDM 180 +P ++ + S T GY I + A LL+ + + P+D M Sbjct: 386 VPHVRNLVEA-DYSYWTLGYIISLQGAKKLLD-AQPLSKMLPVDEFLPVMFDKHPVSDYM 443 Query: 181 KHWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESRLVRKPTFSPLYFYRNTCY 231 +H+ N+ + EP VY DT S + ++ ++ T + Sbjct: 444 EHFENRNLLAFSVEPLLVYPTHYTGDDGYISDTETSVVWDNENIK--TDWDRAKSQKMKE 501 Query: 232 QWNLHYNAWRKDLPP 246 Q L A D+ Sbjct: 502 QQELSSEARNTDVLQ 516 >gi|115389118|ref|XP_001212064.1| predicted protein [Aspergillus terreus NIH2624] gi|114194460|gb|EAU36160.1| predicted protein [Aspergillus terreus NIH2624] Length = 386 Score = 70.7 bits (172), Expect = 2e-10, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 78/272 (28%), Gaps = 68/272 (25%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLS 63 +Y I L +R+ A + F D + G + + + + K Sbjct: 67 IYAIGLKERTDKRDFLT--LASLEAGFK-VDWLDGVRVDSLDPKSLPNGANFSWIK---- 119 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK------------- 110 + C+ +H++ +++ + A+ILEDDAD+ L ++ Sbjct: 120 PTAVACWRAHMNAMQKVLQNSYSTALILEDDADWDISIKHQLREFARGVRQLNGEEHAPK 179 Query: 111 -----------------------------------CDINNILIKFDALRKKPKKDSYLCT 135 L + +K K+ + Sbjct: 180 ETPYGLDWDILWIGGCASGPSANETTFFAIPDDPTVPSKEHLDTWGGPLEKWKEQYPELS 239 Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV--RKNIYRPIDMDM-----KHWWEHNI 188 I+Q + T GY + + A +L + P+D M + Sbjct: 240 DDSTRFIYQA-SMGCCTYGYAVTAKGAKKILAALSLDRLDCPVDNAMSDLCAGTNGRRQV 298 Query: 189 PSLVTEPGAV----YEAIDTNDSTIEESRLVR 216 P + + ++ DS IE+ + Sbjct: 299 KCFAPWPNLIGTYKHAGPESRDSDIEDRSEDK 330 >gi|226946496|ref|YP_002801569.1| hypothetical protein Avin_44770 [Azotobacter vinelandii DJ] gi|226721423|gb|ACO80594.1| hypothetical protein Avin_44770 [Azotobacter vinelandii DJ] Length = 218 Score = 70.7 bits (172), Expect = 2e-10, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 77/224 (34%), Gaps = 19/224 (8%) Query: 20 FCHRAARIHLQFSFFDAIY----GENNP---ICNRIFSHQKRQCQFKRLLSLPEIGCYIS 72 + ++ F A+ G++ P +C + F+R L L C S Sbjct: 1 MEDSFRDMGVRAERFSAVSLHELGDDQPSPALCEFLLRVDGEHPGFERKL-LGTWACMRS 59 Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILIKFDALRKKPKKD 130 H+ + + +I+EDD +F +L ++ ++ L+ KK + Sbjct: 60 HLGVIAHARDNGWPAVLIMEDDCEFEPYTLAVLERVASQLQGLDWDLLYLGGTFKKGGEK 119 Query: 131 SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPS 190 + P + + R+ T Y + +L P+D + Sbjct: 120 RKVA--PNLLSVTRMRL----THAYMVRAALYERILAEAPLSGLPLDWYYSEVLLPQVRG 173 Query: 191 LVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWN 234 L+ +P + + + S IE+ ++RKP F F + Sbjct: 174 LMVKPTLARQRL-MDPSDIEQ--VMRKPRFKSRQFLERLFARLR 214 >gi|308812614|ref|XP_003083614.1| unnamed protein product [Ostreococcus tauri] gi|116055495|emb|CAL58163.1| unnamed protein product [Ostreococcus tauri] Length = 445 Score = 70.4 bits (171), Expect = 2e-10, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 66/204 (32%), Gaps = 32/204 (15%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 I Y I+ + RR I A R F + R C+ K Sbjct: 19 IRAYYINARGAAERRLAMEDGFRAIGTTLERVAATDATRASELVRRF--ETRTCKRKGWA 76 Query: 63 SLPE-------------IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL---- 105 + + +G +SH+ A + A A++LEDDA S E Sbjct: 77 TSGDDGVEKFSGTYLERLGRTLSHLSAIYEAADAGAETALVLEDDA--SPELMATWMSGL 134 Query: 106 -PHLSKCDINNILIKFDALRKKPKK-------DSYLCTLPGNFDIHQPR--ILSPRTTGY 155 ++S+ + L++ AL + T PG + P+ T Y Sbjct: 135 EEYVSRLPEDWTLVQLSALGESSAIMKLFYDWQRERKTAPGRYLSSLPKGMRRLHGTQAY 194 Query: 156 FIGKEAAIHLLN-VRKNIYRPIDM 178 I K L+ R + + +D+ Sbjct: 195 LISKRGMDRLVKAYRAPVTKEVDV 218 >gi|303282179|ref|XP_003060381.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] gi|226457852|gb|EEH55150.1| glycosyltransferase family 25 protein [Micromonas pusilla CCMP1545] Length = 304 Score = 70.4 bits (171), Expect = 2e-10, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 41/165 (24%), Gaps = 23/165 (13%) Query: 9 SLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 +L RR R ++ EI Sbjct: 61 NLDADADRRGHMERMFDRWNVTNHVRVRGHDARRV--------DVTTLLHGGAAAHPGEI 112 Query: 68 GCYISHIHLWKRIA-YSPAIGAIILEDDADFSDEFSQLL---PHLSKCDINNILIKFDAL 123 GC +SH+ + + A+I+EDDAD ++ ++ + Sbjct: 113 GCTVSHLKALRYFVTRTDEDVALIMEDDADIEQASHWSFAWEEFFDALPVDYDTVQLSLI 172 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 + + +H Y + + A LL Sbjct: 173 NTQRV----------HVSLHPRFSNDYGAAAYLVTRHHATKLLRQ 207 >gi|32473465|ref|NP_866459.1| beta-1,4-galactosyltransferase waaX [Rhodopirellula baltica SH 1] gi|32398145|emb|CAD78240.1| similar to beta-1,4-galactosyltransferase waaX [Rhodopirellula baltica SH 1] Length = 235 Score = 70.4 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 61/211 (28%), Gaps = 16/211 (7%) Query: 4 PVYVIS-LPF-SHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 P +VIS L S R + R A + +R Q +++ Sbjct: 15 PTFVISTLDEQSRDRVNQVEQHLRRAGFRNSQIIQAKTPQTEDFEHRGLPDVL-QGRWRT 73 Query: 61 LL--SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS-KCDINNIL 117 L C +SHI + R + IILEDD +F + L + +I L Sbjct: 74 DLQHLWGTAACTLSHIQFYDR--AEHQLPIIILEDDVTIHPDFFRFLDDVDIPDEIEWDL 131 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + L L + P Y + + L+ +D Sbjct: 132 CHLSYNNPQFSSQANLNRLVAPHLLMCPPDEITGAYSYIVNRS----FLDRFLPSVEEVD 187 Query: 178 MDMKHWWEHNIPSLVTE--PGAVYEAIDTND 206 + H I S V E P Sbjct: 188 WQLAH-QTDEIASYVIEHDPPLTAPDYRLES 217 >gi|262281079|ref|ZP_06058861.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] gi|262257310|gb|EEY76046.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] Length = 257 Score = 70.4 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 68/181 (37%), Gaps = 17/181 (9%) Query: 3 IPVYVISLPFS-HARREKFCHR--AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 + Y+IS+ AR F + + H +F+ + G P + K Sbjct: 1 MKKYLISIEKEGSARLTSFFSQPTFGKYHSEFTKM-GVIGALLPTAEY---FKLAVAGRK 56 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD--EFSQLLPHLSKCDINN-I 116 + LS E+GC +SH++ +K S A I EDDA + + + L +S ++N Sbjct: 57 KALSPAELGCTLSHVNAFKDFLASDERYACIFEDDAICLNDIDLNDLEAQVSGLNLNECF 116 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR-------TTGYFIGKEAAIHLLNVR 169 + ++ + K L + L P T Y I ++ A L+ Sbjct: 117 FLSMGGIQLRSSKTVRGTFLENKINQTPVLKLHPVYFGRLFYTYAYIIDRKMAELLIRYH 176 Query: 170 K 170 + Sbjct: 177 E 177 >gi|225872805|ref|YP_002754262.1| LPS glycosyltransferase family protein [Acidobacterium capsulatum ATCC 51196] gi|225793194|gb|ACO33284.1| LPS glycosyltransferase family protein [Acidobacterium capsulatum ATCC 51196] Length = 216 Score = 70.0 bits (170), Expect = 3e-10, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 38/133 (28%), Gaps = 20/133 (15%) Query: 1 MPIPV-------YVISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQ 52 MP+ + I+L R + + + A+ I + Sbjct: 1 MPVTIDTFFAKKICINLDRRPDRWQAVQQKFTEHRISNVERLSAVDARTAVIPD------ 54 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKC 111 L + C +SH+ K+ +I EDD + L P ++++ Sbjct: 55 -----HLSHLRPQDYACTMSHLAAVKQAKREGCENVLIFEDDVTLDPALNDLFPGYMAEL 109 Query: 112 DINNILIKFDALR 124 + + Sbjct: 110 PEDWHMFFLGCYH 122 >gi|322709644|gb|EFZ01220.1| hypothetical protein MAA_03816 [Metarhizium anisopliae ARSEF 23] Length = 393 Score = 69.6 bits (169), Expect = 4e-10, Method: Composition-based stats. Identities = 29/250 (11%), Positives = 73/250 (29%), Gaps = 60/250 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I++ + R + + ++ A+ + + + + + Sbjct: 72 IYFINMKKRYDRLDALALQCFLSGVEVKEVPAV------EPDMMSDAGMPPSKSPSSVKV 125 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-----------DEFSQLLPHLS---- 109 E G + +H ++W + + IILE DA + F + L +++ Sbjct: 126 GERGAWRAHANIWSTVLRNNLPPVIILESDATWDVNIRSIMLNMNKHFHKFLTNINSTRL 185 Query: 110 ----------------------KCDINNILIKFDALRKKPKKDS---------------Y 132 + ++ + + Sbjct: 186 HNPRWRDDAGESTRRTEAADDPWQSDHWDILSLGHCHDSAANRNISLIYDDPYVPPGKEF 245 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPID-MDMKHWWEHNIPS 190 T+ G + + TTGY + + A LL +N+ P+D + + + + Sbjct: 246 GDTVLGRQRVIRKAGGIVCTTGYAVSQTGAAKLLVKTAQNLDAPLDLIVRGMVEDGELVA 305 Query: 191 LVTEPGAVYE 200 P + + Sbjct: 306 YSVMPPIMAQ 315 >gi|322709100|gb|EFZ00676.1| hypothetical protein MAA_03272 [Metarhizium anisopliae ARSEF 23] Length = 403 Score = 69.6 bits (169), Expect = 4e-10, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 66/245 (26%), Gaps = 63/245 (25%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 I+L R + +A L I + I ++ E Sbjct: 78 INLKHRFDRLDAATLQAYLSGLD------ISEAAGVQSDDIHDAGMPPTHRIGVIRDGEK 131 Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLS------- 109 GC+ +H ++W R+ + +I+E DA + F+ LL L Sbjct: 132 GCWRAHANIWSRMLRDKSPAVLIIESDAAWDINIRDIMSTLNPHFTDLLGRLDSKPIHDA 191 Query: 110 ----------------------KCDINNILIKF-DALRKKPKKDS--------------Y 132 + ++ F D Y Sbjct: 192 AWNAGRSHNTSHDSLQLNPNDPWHSRHWDMLSFGQCFESSVNSDVSLSYPDKHVTEGKDY 251 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDM-KHWWEHNIPS 190 G+ + + TT Y + + A LL ++ P+D+ M + ++ Sbjct: 252 FGQTLGHDRVVRLSGGIVCTTAYAVSRTGAAKLLLRSSVDLDNPVDLLMRRMTLSGDLIV 311 Query: 191 LVTEP 195 P Sbjct: 312 YSVMP 316 >gi|326935711|ref|XP_003213911.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Meleagris gallopavo] Length = 200 Score = 69.6 bits (169), Expect = 4e-10, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 68/194 (35%), Gaps = 32/194 (16%) Query: 80 IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL---RKKPKKDSYLCTL 136 I +++ EDD F F + L +L D+ + +D + RK+ + + ++ Sbjct: 1 IVERELEKSVVFEDDLRFEIFFKRRLMNLM-YDLEEEGLDWDLIYIGRKRMQVEHPEKSV 59 Query: 137 PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPID-------------MDMK 181 P ++ + S T Y I A LL + + P+D MK Sbjct: 60 PRVRNLVEA-DYSYWTLAYVISLRGARKLL-AAEPLSKMLPVDEFLPVMFNKHPVSDYMK 117 Query: 182 HWWEHNIPSLVTEPGAVYEA---------IDTNDSTIEESRLVRKPTFSPLYFYRNTCYQ 232 H+ N+ + EP VY DT S + ++ ++ T + Q Sbjct: 118 HFKNRNLLAFSVEPLLVYPTHYTGDDGYISDTETSVVWDNEKIK--TDWDRAKSQKMKEQ 175 Query: 233 WNLHYNAWRKDLPP 246 L A D+ Sbjct: 176 QELRREAKNSDVLQ 189 >gi|207109624|ref|ZP_03243786.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori HPKX_438_CA4C1] Length = 94 Score = 69.2 bits (168), Expect = 5e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 32/76 (42%), Gaps = 4/76 (5%) Query: 160 EAAIHLLNVR-KNIYRPIDMDMKHWWEHNIPSLVTEPGAVY-EAIDTNDSTIEESRLVRK 217 +AA LL K PID M + H + + V E A+ + ++ +S E+ R + Sbjct: 1 KAAQKLLKYSAKEWVMPIDCVMDRHYWHGVKNYVLEEFAITCDEMNAQNSNTEKQRPKKL 60 Query: 218 PTFSPL--YFYRNTCY 231 P + +++ Sbjct: 61 PLSIRIGRSLHKSAVK 76 >gi|119490255|ref|ZP_01622768.1| glycosyl transferase, family 25 [Lyngbya sp. PCC 8106] gi|119454141|gb|EAW35294.1| glycosyl transferase, family 25 [Lyngbya sp. PCC 8106] Length = 236 Score = 69.2 bits (168), Expect = 5e-10, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 44/133 (33%), Gaps = 23/133 (17%) Query: 6 YVISLPFSHARREKFCHRAAR-----IHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 YVI+LP RR + Q FF I ++ I Sbjct: 11 YVINLPERTDRRRMIIRELEKPDSPLAPDQVEFFPGIRPDDPGEFKNI------------ 58 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILI 118 GC++SH+ + K+ +ILEDD FS +F L N L+ Sbjct: 59 ----GIKGCFLSHLAILKKAKEDNLPNILILEDDLCFSRQFKPHQDALIDQLSQSNWGLV 114 Query: 119 KFDALRKKPKKDS 131 F K D+ Sbjct: 115 YFGHRLKLEATDT 127 >gi|207108723|ref|ZP_03242885.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori HPKX_438_CA4C1] Length = 119 Score = 69.2 bits (168), Expect = 5e-10, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 5/103 (4%) Query: 136 LPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEP 195 + I + T GY I + A + P+D M + H + +LV +P Sbjct: 20 IQERVGIIKAYSEGVGTQGYVITPKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQP 79 Query: 196 GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYN 238 + A D STI P + R +++ ++ Sbjct: 80 FVI--ADDEQISTIARKEEPYNPK---IALMRELHFKYLKYWQ 117 >gi|297704068|ref|XP_002828944.1| PREDICTED: procollagen galactosyltransferase 1-like [Pongo abelii] Length = 258 Score = 69.2 bits (168), Expect = 6e-10, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 56/154 (36%), Gaps = 27/154 (17%) Query: 77 WK----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNILIKFDALRKKPK 128 W+ ++ +++ EDD F F + L +L + ++ LI RK+ + Sbjct: 52 WQGFLPQVVDQGLQKSLVFEDDLRFEIFFKRRLMNLMRDVEQEGLDWDLI--YVGRKRMQ 109 Query: 129 KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDMDMK----- 181 + +P ++ + S T Y I + A LL + + P+D + Sbjct: 110 VEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKMLPVDEFLPVMFDK 167 Query: 182 --------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 H+ N+ + EP +Y T D Sbjct: 168 HPVSEYKAHFSLRNLRAFSVEPLLIYPTHYTGDD 201 >gi|170717810|ref|YP_001784872.1| hypothetical protein HSM_1552 [Haemophilus somnus 2336] gi|168825939|gb|ACA31310.1| conserved hypothetical protein [Haemophilus somnus 2336] Length = 196 Score = 68.4 bits (166), Expect = 8e-10, Method: Composition-based stats. Identities = 38/154 (24%), Positives = 64/154 (41%), Gaps = 19/154 (12%) Query: 78 KRIAYSPAIGAIILEDDADFSDEF-----SQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 KR+ S I EDD FSD F +++ +S +IL+ F+++ KK K Sbjct: 13 KRLLESNQEYIFIFEDDILFSDNFTPEVINEITNFISSLQNPSILVLFNSIYKKKK---- 68 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLV 192 + L I L GY I ++AA ++LN++ I ID ++W Sbjct: 69 VKDLNSAISIFSAHNLFY-ACGYVINRQAAENILNIQTPIKFEIDAFKFYYWLGACDLYC 127 Query: 193 TEPGAVYEAIDTND-STIE--------ESRLVRK 217 V A + ++ S I+ + R ++K Sbjct: 128 LNTDLVKPAPEISNLSEIDCDNPRKYTKQRTIKK 161 >gi|300728322|ref|ZP_07061688.1| LPS glycosyltransferase family [Prevotella bryantii B14] gi|299774434|gb|EFI71060.1| LPS glycosyltransferase family [Prevotella bryantii B14] Length = 240 Score = 68.4 bits (166), Expect = 9e-10, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 74/253 (29%), Gaps = 36/253 (14%) Query: 4 PVYVISLPFS-HARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + V+ + R++ F + + G+ + + N F HQ Sbjct: 5 KILVLHVKKGYEDRQKHIEKMMKSWGYSFEYI--LDGDMDDL-NLEFVHQYFT------- 54 Query: 63 SLPEIGC-------YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDIN 114 E+ C H+ + I GA+ILEDD F ++ LS+ Sbjct: 55 --GEMDCISPATSVATKHLKAARYIVDHQLDGALILEDDMLLYPNFEKVFNQCLSEIQNR 112 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NI 172 ++ + ++ + + +EAA +L + Sbjct: 113 HLATALISFEDSSLHFVPGSQRKKGQHLYIAKRD-RFAGALYCTREAAELILQHVRENRC 171 Query: 173 YRPIDMD-MKHWWEHNIPSLVTEPGAVYEAIDTN--DSTIEESRLVRKPTFSPLYFYRNT 229 PID+ + P + T S+I + +K T+ L Sbjct: 172 DMPIDLYHTSLIQRAGLNYYWCHPCIATQGSHTGLFASSIN-EKSAKKKTYRRLT----- 225 Query: 230 CYQWNLHYNAWRK 242 WNL + Sbjct: 226 ---WNLKLAYKKL 235 >gi|134075727|emb|CAK96619.1| unnamed protein product [Aspergillus niger] Length = 374 Score = 68.4 bits (166), Expect = 9e-10, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 75/226 (33%), Gaps = 23/226 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL-L 62 V I+LP +R+ ++ + + + E+ + + + R + Sbjct: 80 NVLCINLPSRTDKRDAITLGSSVTDFHVDWIEGVSAEDMDLKA----YPPHYGEPGRPIM 135 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 E+G + +H++ +RI A+I+EDD+D+ L + + + Sbjct: 136 IPGEVGNWRAHLNAMRRIVSDRLTTALIIEDDSDWDVTLKIQLHEFALASQSLQKARKLT 195 Query: 123 LRKKPKKDSYLCTLPG---NFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNIYR 174 ++ ++ N I T Y + AA ++ I+ Sbjct: 196 ASRRTGPAVHVHDNVDKAPNTRIICQTRYVICTGAYAVSYNAAQKIIAALSVLPSDEIFP 255 Query: 175 P-----IDMDMKHWWEHN-IPSLVTEPGAV----YEAIDTNDSTIE 210 P D+ + E+ + + P + + +S I+ Sbjct: 256 PGTSSIFDVSLGRLCENGYLTCYSSFPSLIGNWRPAGSPSRNSDIQ 301 >gi|70983225|ref|XP_747140.1| LPS glycosyltransferase [Aspergillus fumigatus Af293] gi|66844765|gb|EAL85102.1| LPS glycosyltransferase, putative [Aspergillus fumigatus Af293] gi|159124024|gb|EDP49143.1| LPS glycosyltransferase, putative [Aspergillus fumigatus A1163] Length = 387 Score = 68.4 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 39/264 (14%), Positives = 74/264 (28%), Gaps = 66/264 (25%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFKRLLS 63 +Y I L +R+ A + F D I G + + R + L Sbjct: 68 IYAIGLKERTDKRDFLT--LASLAAGFK-VDWIDGVRPDELDPRSLPNGLNLT----DLK 120 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK------------- 110 + C+ +H++ + + + A+ILEDDAD+ L ++ Sbjct: 121 PTAVACWRAHMNALRMVLQNSYTTALILEDDADWDVALKMQLREFARGVRLLNGEENAPK 180 Query: 111 ---CDINNILIKF--------------------DALRKKPKKDSYLCTLP---------- 137 + ++ + K+D + L Sbjct: 181 TAPYGTDWDILWIGGCASGPGANETTFFAIPDDPTVPSPDKRDGWGGPLETWKQQYPNLP 240 Query: 138 -GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDM-----KHWWEHNIP 189 G + T GY + +E A +L + +D M + Sbjct: 241 IGTTRFLYRAEMGCCTYGYAVTREGAKKILAALSVDRLDCAVDNAMSDLCAGTNGRRQLK 300 Query: 190 SLVTEPGAV----YEAIDTNDSTI 209 T P + + + DS I Sbjct: 301 CFATFPNLIGTYRHAGPASRDSDI 324 >gi|315044417|ref|XP_003171584.1| hypothetical protein MGYG_06124 [Arthroderma gypseum CBS 118893] gi|311343927|gb|EFR03130.1| hypothetical protein MGYG_06124 [Arthroderma gypseum CBS 118893] Length = 402 Score = 68.4 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 43/103 (41%), Gaps = 6/103 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I GE + + + Sbjct: 69 IYVVNLPSRTDRRDALVLMAALSGIKLHWIDGIMGET------VLDRALPPPASHKTMKN 122 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + +HI+ + + + A+ILEDDAD+ L Sbjct: 123 ANIGSWRAHINALQDMVENNISSALILEDDADWDIRIKSQLKD 165 Score = 37.6 bits (86), Expect = 1.8, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 28/63 (44%), Gaps = 2/63 (3%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDM 178 + L PK D + + P + + + + + GY + ++ A LL K RP D+ Sbjct: 262 EHLVLSPKNDEFTSSFPPHTRVVHHAMTAVCSLGYGVSQKTARRLLYEFGVKKFDRPYDL 321 Query: 179 DMK 181 ++ Sbjct: 322 MLR 324 >gi|317028498|ref|XP_001390184.2| LPS glycosyltransferase [Aspergillus niger CBS 513.88] Length = 376 Score = 68.0 bits (165), Expect = 1e-09, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 81/263 (30%), Gaps = 64/263 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I L +R+ A+ + + D ++ E+ + + LL+ Sbjct: 64 IYAIGLKERTDKRDFLNLAASIAGFRVEWIDGVHPEDMSEKALLNDN---------LLAP 114 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------------- 110 EIGC+ +H++ + + A+ILEDDAD+ Q L ++ Sbjct: 115 SEIGCWRAHMNALSNMVQNSYSTALILEDDADWDVNIRQQLREFARGVRALTRNANTTKS 174 Query: 111 --CDINNILI--------------KFDALRKKPKK-------------DSYLCTLPGNFD 141 N ++ +F A+ P DS+ P Sbjct: 175 APYGTNWDILWVGGCASSAAPNETQFYAIPHDPTAPSVDHRGTWGGPLDSWKENYPETST 234 Query: 142 IHQPR-ILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDM-----KHWWEHNIPSLVT 193 R + T GY + K A +L + +D M I Sbjct: 235 RFIYRADMGCCTYGYAVTKRGAERILAALAVDRLVAAVDNSMADMCGGKDGRSQIECYAP 294 Query: 194 EPGAV----YEAIDTNDSTIEES 212 P + + + DS I E Sbjct: 295 FPNIIGTYKAAGLASKDSDIREG 317 >gi|207092759|ref|ZP_03240546.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPKX_438_AG0C1] Length = 270 Score = 68.0 bits (165), Expect = 1e-09, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 14/168 (8%) Query: 93 DDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 DD + ++ L IK + ++ +F + + S T Sbjct: 106 DD--LTKKYDDLTEKYESLLAKETNIK-ETFWERRADSEKEALFLEHFYLTSVYVAS--T 160 Query: 153 TGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 GY+I + A + + P+DM + + H+I + P V +STI+ Sbjct: 161 AGYYITPKGAKTFIEATERFKIIEPVDMFINNPTYHDIANFTLVPCPVSLNKHAFNSTIQ 220 Query: 211 ESR---LVRKPTFSPLY----FYRNTCYQWNLHYNAWRKDLPPVSTTK 251 ++ + KP Y +++ + ++ + K P+ T K Sbjct: 221 NAKKPDISLKPPKKSYYDNLFYHKFNARKCLKAFHKYSKQYAPLKTPK 268 >gi|197334202|ref|YP_002154915.1| glycosyl transferase, family 25 [Vibrio fischeri MJ11] gi|197315692|gb|ACH65139.1| glycosyl transferase, family 25 [Vibrio fischeri MJ11] Length = 81 Score = 67.7 bits (164), Expect = 1e-09, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 23/59 (38%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 + V+V+SL S RRE+ + + F FFDAI G + K Sbjct: 1 MKVFVVSLKRSVDRRERIQKHLDALDISFEFFDAIDGSIEHFQYSDHARPVTTKNVKAT 59 >gi|134057862|emb|CAK38229.1| unnamed protein product [Aspergillus niger] Length = 688 Score = 67.7 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 81/263 (30%), Gaps = 64/263 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I L +R+ A+ + + D ++ E+ + + LL+ Sbjct: 376 IYAIGLKERTDKRDFLNLAASIAGFRVEWIDGVHPEDMSEKALLNDN---------LLAP 426 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------------- 110 EIGC+ +H++ + + A+ILEDDAD+ Q L ++ Sbjct: 427 SEIGCWRAHMNALSNMVQNSYSTALILEDDADWDVNIRQQLREFARGVRALTRNANTTKS 486 Query: 111 --CDINNILI--------------KFDALRKKPKK-------------DSYLCTLPGNFD 141 N ++ +F A+ P DS+ P Sbjct: 487 APYGTNWDILWVGGCASSAAPNETQFYAIPHDPTAPSVDHRGTWGGPLDSWKENYPETST 546 Query: 142 IHQPR-ILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDM-----KHWWEHNIPSLVT 193 R + T GY + K A +L + +D M I Sbjct: 547 RFIYRADMGCCTYGYAVTKRGAERILAALAVDRLVAAVDNSMADMCGGKDGRSQIECYAP 606 Query: 194 EPGAV----YEAIDTNDSTIEES 212 P + + + DS I E Sbjct: 607 FPNIIGTYKAAGLASKDSDIREG 629 >gi|113461210|ref|YP_719279.1| hypothetical protein HS_1067 [Haemophilus somnus 129PT] gi|112823253|gb|ABI25342.1| conserved hypothetical protein [Haemophilus somnus 129PT] Length = 194 Score = 67.3 bits (163), Expect = 2e-09, Method: Composition-based stats. Identities = 34/134 (25%), Positives = 56/134 (41%), Gaps = 10/134 (7%) Query: 78 KRIAYSPAIGAIILEDDADFSDEF-----SQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 KR+ S I EDD FSD F +++ +S +IL+ F+++ KK K Sbjct: 11 KRLLESNQEYIFIFEDDILFSDNFTPEVINEITNFISSLQNPSILVLFNSIYKKKK---- 66 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLV 192 + L I L GY I ++AA ++LN++ I ID ++W Sbjct: 67 VKDLNSAISIFSAHNLFY-ACGYVINRQAAENILNIQTPIKFEIDAFKFYYWLGACDLYC 125 Query: 193 TEPGAVYEAIDTND 206 V A + ++ Sbjct: 126 LNTDLVKPAPEISN 139 >gi|217034824|ref|ZP_03440205.1| hypothetical protein HP9810_306g1 [Helicobacter pylori 98-10] gi|216942641|gb|EEC22225.1| hypothetical protein HP9810_306g1 [Helicobacter pylori 98-10] Length = 118 Score = 67.3 bits (163), Expect = 2e-09, Method: Composition-based stats. Identities = 22/104 (21%), Positives = 43/104 (41%), Gaps = 11/104 (10%) Query: 150 PRTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDS 207 T GY++ + A + + P+DM M + H+I + P V + + DS Sbjct: 4 ASTAGYYLTPKGAKTFIEATERFKIIEPVDMFMDNSAYHDIANFTYVPCPVSLSEHSLDS 63 Query: 208 TIEESRLV---------RKPTFSPLYFYRNTCYQWNLHYNAWRK 242 TI++ + +K TF L++Y + ++ + K Sbjct: 64 TIQKPQKKSLKSYPSPPKKSTFKNLFYYSLNAKKRLKAFHKYSK 107 >gi|208434540|ref|YP_002266206.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] gi|208432469|gb|ACI27340.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori G27] Length = 210 Score = 67.3 bits (163), Expect = 2e-09, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 62/168 (36%), Gaps = 14/168 (8%) Query: 93 DDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 DD + ++ L + IK + + + +F + + + T Sbjct: 46 DD--LTTKYDDLTTKYESLLAKEVNIK-ETFWESRADNEKEALFLEHFYLTSVYVAT--T 100 Query: 153 TGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIE 210 GY+I + A + + P+DM M + H++ + P V +STI+ Sbjct: 101 AGYYITPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNSTIQ 160 Query: 211 ESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 ++ + KP L++++ + ++ + K P+ T K Sbjct: 161 NAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFHKYSKQYAPLKTPK 208 >gi|322701411|gb|EFY93161.1| hypothetical protein MAC_00944 [Metarhizium acridum CQMa 102] Length = 391 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 71/250 (28%), Gaps = 60/250 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I++ + R + + ++ A+ + + + + + Sbjct: 72 IYFINMKKRYDRLDALALQCFLSGVEVKQVPAV------EPDMMSDAGMPPSKSPGSVKV 125 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-----------DEFSQLLPHLS---- 109 E G + +H ++W + + IILE DA + F + + ++ Sbjct: 126 GERGAWRAHANIWSAMLRNNLPPVIILESDAAWDVNIRKIMLNMNKHFRKFVASINSTRL 185 Query: 110 ----------------------KCDINNILIKFDALR---------------KKPKKDSY 132 + ++ P + Sbjct: 186 HNPRWRDDADESIRGTEAADDPWQSDHWDILSLGHCHDTEANRNISLIYDDPAVPPGKEF 245 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIYRPIDMDM-KHWWEHNIPS 190 T+ G + + TT Y + + A LL +N+ P+D+ + + + Sbjct: 246 GGTVLGRQRVIRKAGNIVCTTAYAVSQTGAAKLLVKTSQNLDAPLDLIVGGMAEAGELVA 305 Query: 191 LVTEPGAVYE 200 P + + Sbjct: 306 YSVMPPIMAQ 315 >gi|307295520|ref|ZP_07575356.1| hypothetical protein SphchDRAFT_2951 [Sphingobium chlorophenolicum L-1] gi|306878559|gb|EFN09779.1| hypothetical protein SphchDRAFT_2951 [Sphingobium chlorophenolicum L-1] Length = 251 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 83/245 (33%), Gaps = 30/245 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +I+L RR + A+I L+ + G +FS + Sbjct: 20 IRIINLASRADRRREITREFAKIGLEITA----DGPVRFHEAALFSDP------GPFPLI 69 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADF----SDEFSQLLPHLSKCDINNILIKF 120 GC+ SH+ + + A +I EDD DF D+ L L++ + Sbjct: 70 GARGCWHSHVEILRE-ALDGQDNILIFEDDCDFVAGIEDKLPNALASLAQKPWSIFYGGH 128 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA---IHLLNVRKNIYRPID 177 D + K + D ++ G + + + + + A +LL++ P Sbjct: 129 DLIDKPERADPFIHIDHGVQWLRGAHFVGFHRS--ILEQLVAELDKYLLDLTAGKDAPKG 186 Query: 178 MDMKHWWEH----NIPSLVTEPGA----VYEAIDTNDSTIEESRLVRKPTFSPLYFYRNT 229 +D + W ++ + P + + S + +RL+ S + ++ Sbjct: 187 IDGGYSWFRNRYPDLQVYLAWPKLGFQRPSHSDISGSSAM--TRLMPPSLLSVIRAFKRH 244 Query: 230 CYQWN 234 + Sbjct: 245 VKRAR 249 >gi|113460782|ref|YP_718849.1| lipooligosaccharide biosynthesis protein [Haemophilus somnus 129PT] gi|112822825|gb|ABI24914.1| lipooligosaccharide biosynthesis protein [Haemophilus somnus 129PT] Length = 158 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 3/67 (4%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YVISL + RR+ ++ H+ ++FFDA N+ K L+ Sbjct: 4 YVISLKSAVDRRQHIEKEFSQQHIPYTFFDAFQ---LNEENKHILQHYVPSLNKTDLTTG 60 Query: 66 EIGCYIS 72 E GC++S Sbjct: 61 EKGCFMS 67 >gi|116208356|ref|XP_001229987.1| hypothetical protein CHGG_03471 [Chaetomium globosum CBS 148.51] gi|88184068|gb|EAQ91536.1| hypothetical protein CHGG_03471 [Chaetomium globosum CBS 148.51] Length = 380 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 6/104 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++V+ LP RR+ +AA ++ F D + G + P + ++ L Sbjct: 38 KIFVVGLPSRTDRRDGMVLQAALSNMDIEFIDGVAGADVP------DKAIPMNKGQKRLR 91 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + +H++ + + + A+ILEDD D+ L Sbjct: 92 DASIGSWRAHMNAIREVVHRNLSSALILEDDVDWDVRVRSQLAD 135 >gi|297704066|ref|XP_002828943.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Pongo abelii] Length = 429 Score = 66.9 bits (162), Expect = 3e-09, Method: Composition-based stats. Identities = 15/51 (29%), Positives = 26/51 (50%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 R R L+ E+GC++SH ++WK + + ++ EDD F F + Sbjct: 379 YRDPYHGRPLTKGELGCFLSHYNIWKEVCPEGCLKWLVFEDDLRFEIFFKR 429 >gi|323450433|gb|EGB06314.1| hypothetical protein AURANDRAFT_65694 [Aureococcus anophagefferens] Length = 760 Score = 66.5 bits (161), Expect = 3e-09, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 44/173 (25%), Gaps = 10/173 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARI--HLQFSFFDAIYGENNPICN-----RIFSHQKRQC 56 V+ I+ RR + R + + R Sbjct: 55 KVFYINCDADLDRRASIEAQLDRALPGVPRERVACTDTASAKFAYESGGVRGLDLSTETA 114 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 F + + Y+SH + + IA + LEDDA F+D F + L Sbjct: 115 HFGADEAYTTLAIYLSHTSIQETIAAMDDGLYLALEDDAVFADGFLEKFRALD-LPEAFD 173 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL--SPRTTGYFIGKEAAIHLLN 167 L++ ++ +D + A L Sbjct: 174 LVRLGCWTEESPEDRATKDAFVAAAPARASATTRYLGGHAELWTPRGAATFLE 226 >gi|169843283|ref|XP_001828371.1| hypothetical protein CC1G_04342 [Coprinopsis cinerea okayama7#130] gi|116510468|gb|EAU93363.1| hypothetical protein CC1G_04342 [Coprinopsis cinerea okayama7#130] Length = 643 Score = 66.5 bits (161), Expect = 4e-09, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 50/164 (30%), Gaps = 26/164 (15%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPA---IGAIILEDDADFSDEFSQLLPHLSK-CDINNIL 117 L+ I C+ SH L IA + +I+LEDD D + + L HL K + + Sbjct: 451 LTPSRIACWHSHTKLIHSIANGKSSASEVSIVLEDDVDMEQDIQEQLLHLWKFLPPDWDI 510 Query: 118 IKFDA---------------LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 + KK + + P T Y + A Sbjct: 511 VFLGHCWSNEKLHPAISIPTGYKKSILRLLSRSRSLFWSSLHPSNGPKCTHAYALSYRGA 570 Query: 163 IHLLNVRK--NI--YRPIDMDMKHWWEH--NIPSLVTEPGAVYE 200 LL + R ID + W + S P V + Sbjct: 571 RRLLLHLRYPPFAYSRAIDQAI-SWLITSGRLKSYSVVPSLVVQ 613 Score = 44.2 bits (103), Expect = 0.017, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 27/93 (29%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +YVISLP RR ++ L +++ A N PI + I Sbjct: 300 KIYVISLPSRKDRRADMEVLRNKLGLDWTYIKAT-ASNEPIVSSILD------------- 345 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD 96 W RI + ED+ Sbjct: 346 -------------WVRIVREGPPTVVTSEDEVT 365 >gi|317158172|ref|XP_001826879.2| hypothetical protein AOR_1_498034 [Aspergillus oryzae RIB40] Length = 381 Score = 66.5 bits (161), Expect = 4e-09, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 74/244 (30%), Gaps = 63/244 (25%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS-------FFDAIYGENNPICNRIFSHQKRQC 56 V+ I++P +R+ A LQF F D I + P + H K Sbjct: 64 KVFFINMPNRPDKRDYIT--LASSILQFHPEPVNGVFVDDIDKKAYP---SNWDHGK--- 115 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI--- 113 E+G + +H+++ +RI + A +LEDDAD+ + L + Sbjct: 116 ------LPAEMGAWRAHMNVMQRIVHERISTAFVLEDDADWDVNLKKQLQRFASASQLVQ 169 Query: 114 ------------NNILIKFDALRKKPKK--------DSYLCTLP--------------GN 139 L+ + K D LP Sbjct: 170 GDTGPSHSPYGDLWDLLWIGHCGIQYKTGPIHVTTDDITTVPLPELPRYWHGFPAGADNG 229 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVR----KNIYRPIDMDMKHWWEHN-IPSLVTE 194 + + GY I A LL+ K P D+ + + ++ + + Sbjct: 230 TRLVARMHDGVCSLGYAITYLGAQKLLSALSLTPKGDGAPFDVAIGRFCQNGWLRCIAPF 289 Query: 195 PGAV 198 P + Sbjct: 290 PSLI 293 >gi|108563027|ref|YP_627343.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] gi|107836800|gb|ABF84669.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPAG1] Length = 268 Score = 66.1 bits (160), Expect = 5e-09, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P V +STI Sbjct: 158 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPVSLNKHAFNSTI 217 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + ++ + K P+ T K Sbjct: 218 QNAKKPDISLKPPRKSYFDNLFYHKFNARKCLKAFHKYSKQYAPLKTPK 266 >gi|312212365|emb|CBX92448.1| hypothetical protein [Leptosphaeria maculans] Length = 459 Score = 65.4 bits (158), Expect = 7e-09, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 45/103 (43%), Gaps = 6/103 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 + V++LPF RR+ AA +++ F + + G + I + + + L Sbjct: 84 KILVLNLPFRTDRRDAISLSAALSNIKLEFVNGVTG--DSIHEKAYPPPDENIK----LL 137 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 G + +H++ +R+ A+ILEDD D+ Q L Sbjct: 138 PGIRGSWRTHMNALQRVVEQNLTTALILEDDVDWDIRIRQSLQ 180 >gi|308184405|ref|YP_003928538.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] gi|308060325|gb|ADO02221.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SJM180] Length = 268 Score = 65.4 bits (158), Expect = 8e-09, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 14/170 (8%) Query: 93 DDAD--FSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP 150 DD + ++++ L IK + ++ +F + + + Sbjct: 100 DDLNKNIAEKYDDLTTKYESLLAKETNIK-ETFWERRADSEKEALFLEHFYLTSVYVAT- 157 Query: 151 RTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 T GY++ + A + P+DM M + HN+ + P V +ST Sbjct: 158 -TAGYYLTPKGAKTFIEATDRFKIIEPVDMFMNNPTYHNVANFTYLPCPVSLNKHAFNST 216 Query: 209 IEESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 I+ ++ + KP L++++ + ++ + K P+ T K Sbjct: 217 IQNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFHKYSKQYAPLKTPK 266 >gi|116201385|ref|XP_001226504.1| hypothetical protein CHGG_08577 [Chaetomium globosum CBS 148.51] gi|88177095|gb|EAQ84563.1| hypothetical protein CHGG_08577 [Chaetomium globosum CBS 148.51] Length = 493 Score = 65.4 bits (158), Expect = 9e-09, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 47/111 (42%), Gaps = 2/111 (1%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ I+LP RR+ A L ++ D + G++ + ++ Q + Sbjct: 47 IFAINLPSRTDRRDALALAGALSELDITWVDGVLGKD--VLDKTLPGDPNSRQGNGGFTT 104 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 G + +H+ + +RI + A+ILEDDAD+ + + ++ Sbjct: 105 GNKGSWRAHMDVLQRIVHENVTSALILEDDADWDLRLKRQMQVFAQASRAF 155 Score = 37.6 bits (86), Expect = 1.7, Method: Composition-based stats. Identities = 10/65 (15%), Positives = 21/65 (32%), Gaps = 4/65 (6%) Query: 106 PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNF----DIHQPRILSPRTTGYFIGKEA 161 + I + + KP + L ++ + P + T Y + ++ Sbjct: 211 TAALRVTIPDDMTVPSPNHLKPHPFALQDALAEDYPPHTRVVHPSRGTVCTQAYAVSQQG 270 Query: 162 AIHLL 166 A LL Sbjct: 271 ARKLL 275 >gi|164422665|ref|XP_958531.2| hypothetical protein NCU09832 [Neurospora crassa OR74A] gi|157069769|gb|EAA29295.2| predicted protein [Neurospora crassa OR74A] Length = 470 Score = 64.6 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 67/257 (26%), Gaps = 70/257 (27%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 I++ + R + +A L + + R L + E Sbjct: 133 INMKARYDREDAMALQAYMSGLDIE-----DAPAVEADEIDPAGMPPTHRPGR-LRVGEK 186 Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLSKC----- 111 GC+ +H ++W ++ ++LE DA + + F L ++ Sbjct: 187 GCWRAHANIWSQMTRHRLPPILVLESDAAWDLNIRSIMSNLNTHFIDFLNQINSTAVHDP 246 Query: 112 --------------------------DINNILIKFDAL--------------------RK 125 D + +D Sbjct: 247 SYQSPNNHNVHGSPSYSDNGPIKPNPDDPWLSEHWDLFSIGQCFEYSQDREIKLVYDDES 306 Query: 126 KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDMKHWW 184 P Y G + + TT Y I A LL ++ P+D+ ++ Sbjct: 307 VPAGKEYWGKKMGKERVIRKSGGITCTTAYAISHTGAAKLLLRGAMDLDNPVDLLIRRMV 366 Query: 185 E-HNIPSLVTEPGAVYE 200 ++ + P + + Sbjct: 367 MSRDLVAYSLFPPVMAQ 383 >gi|255941456|ref|XP_002561497.1| Pc16g11970 [Penicillium chrysogenum Wisconsin 54-1255] gi|211586120|emb|CAP93867.1| Pc16g11970 [Penicillium chrysogenum Wisconsin 54-1255] Length = 385 Score = 64.6 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 41/285 (14%), Positives = 85/285 (29%), Gaps = 68/285 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I L +R+ A+ + + D + + + + + + Sbjct: 70 IYAIGLKERTDKRDFLTVSASVTGFEVDWVDGV--HSQELQQKALPNGYNLSTIQ----P 123 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI----------- 113 +GC+ +H+++ + + A++LEDDAD+ L ++ Sbjct: 124 NAVGCWRAHMNVLNSVIANSYSSALVLEDDADWDVNIKTQLREFARGLHSLKGDKKVSKQ 183 Query: 114 -----NNILI--------------KFDALRKKPKK-------------DSYLCTLPGNFD 141 + L+ +F A+ + P DS+ P + Sbjct: 184 APYGTDWDLLWIGGCSSRADKNETQFYAIPEDPTVPSVERRHTWFGPLDSWKEVFPEDST 243 Query: 142 IHQPR-ILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDM-----KHWWEHNIPSLVT 193 R T GY + A +L +I P+D+ M I Sbjct: 244 RFIYRAEEGCCTYGYAVTARGAKKILTALAIDHIEAPVDLAMSELCGGTGDRERIECFAP 303 Query: 194 EPGAV-----------YEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 P + I++ D T+ E + + +R Sbjct: 304 FPNLIGTYRPAGPAFKDSDINSGDQTVHEEQAYNLVYSTRRNIHR 348 >gi|299470270|emb|CBN79574.1| hypothetical protein Esi_0011_0166 [Ectocarpus siliculosus] Length = 1100 Score = 64.2 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 44/146 (30%), Gaps = 31/146 (21%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP-------------------- 43 P +VI+L R + A R + +A+ + Sbjct: 737 PAHVINLARRPFRWKAVSDSARRQGVWLHRHEAVDWKAVEASSMSAKGWISDKEVALTWA 796 Query: 44 -ICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR---------IAYSPAIGAIILED 93 N I+ + + + ++ E C SH+ LW+R + +ILED Sbjct: 797 TELNTIYD-KDCEKETTHVMHPSERACAASHLDLWRRHRDSKLAPFMPEVHPDAVLILED 855 Query: 94 DADFSDEFSQLLPHLSKCDINNILIK 119 DA F + ++ Sbjct: 856 DAKLEPGFLEAAKATMLQAAAMDTVR 881 >gi|322510496|gb|ADX05810.1| putative glycosyltransferase [Organic Lake phycodnavirus 1] Length = 204 Score = 63.8 bits (154), Expect = 2e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 49/159 (30%), Gaps = 32/159 (20%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 + + I+L R + H + FF + + Q KR Sbjct: 1 MKGFYINLDQRKDRMDHIEHLKQ----TYPFFSNV-------------ERMNAFQHKR-- 41 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD--EFSQLLPHLSKCD-INNILIK 119 +IGC +SHI K + +ILEDD + F++ K +N +I Sbjct: 42 --GDIGCGLSHIKCLKTLKKENEPYYMILEDDFQILNPVNFAEFETQFDKIKNLNWDVIV 99 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG 158 K + ++ +GY + Sbjct: 100 LTPRGNKTQN--------NYYEHFHRINNHQTASGYIVK 130 >gi|289616357|emb|CBI56884.1| unnamed protein product [Sordaria macrospora] Length = 473 Score = 63.8 bits (154), Expect = 2e-08, Method: Composition-based stats. Identities = 36/258 (13%), Positives = 71/258 (27%), Gaps = 72/258 (27%) Query: 8 ISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK-RLLSLPE 66 I++ + R + +A L+ + A+ + + + L E Sbjct: 136 INMKARYDREDAMALQAYLSGLKIDDYPAVEAD-------MIDPVGMPPTHRPGKLKNGE 188 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQLLPHLSKCDIN- 114 GC+ +H ++W S +ILE DA + F L ++S ++ Sbjct: 189 KGCWRAHANIWSDQTRSRLPPILILESDAAWDLQIRPIMSLLNAHFITFLNNISSTPVHD 248 Query: 115 ------------------------------NILIKFDAL--------------------R 124 + +D Sbjct: 249 PSFNSPNNHKVHGKPSYNPSNPILPNPDDPWLSEHWDLFSIGQCFEYSQDHDIKLVYDDE 308 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDMKHW 183 P+ Y G + + TT Y I A LL ++ P+D+ M+ Sbjct: 309 SVPEGKEYWGQKMGKERVIRKSGGITCTTAYAISHTGAAKLLLRSAMDLDNPVDLLMRRL 368 Query: 184 WE-HNIPSLVTEPGAVYE 200 ++ + P V + Sbjct: 369 TMSRDLVAYSLFPPVVAQ 386 >gi|238753114|ref|ZP_04614563.1| LPS glycosyltransferase family protein [Yersinia rohdei ATCC 43380] gi|238708664|gb|EEQ00933.1| LPS glycosyltransferase family protein [Yersinia rohdei ATCC 43380] Length = 232 Score = 63.8 bits (154), Expect = 2e-08, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 52/196 (26%), Gaps = 44/196 (22%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS---FFDAIYGENNPICNRIFSHQKRQCQFKR 60 V I+L R + + ++ F+AI + Sbjct: 16 KVVYINLKTRSDREDNIKELLQKFNIANEKVIRFEAIDEK-------------------- 55 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNI 116 IGC SH + K +ILEDD F+D+ + + Sbjct: 56 ---PGYIGCAKSHEAVLKMARDHEWDNVLILEDDIVFNDDIESVALANRFLSMLKTTDWD 112 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP- 175 + A ++ + + + + Y + K LL+ R Sbjct: 113 VAMLSA------NYYHVMPFINDNHFLRLNMAHC-SCAYLVNKNYYQTLLDNVSEAVRKL 165 Query: 176 ------IDMDMKHWWE 185 ++ + W Sbjct: 166 EAGGEQVNCAIDSHWL 181 >gi|317180391|dbj|BAJ58177.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F32] Length = 284 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 53/140 (37%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY++ + A + + P+DM Sbjct: 145 ETFWERRADSEKEALFLEHFYLTSVYVAS--TAGYYLTPKGAKTFIEATERFKIIEPVDM 202 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRL-------VRKPTFSPLYFYRNTCY 231 + + H+I + P V +STI+ ++ +K F L++++ Sbjct: 203 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLNPPKKSYFDNLFYHKFNAK 262 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + K P+ T K Sbjct: 263 KCLRAFHKYSKQYAPLKTPK 282 >gi|226192527|gb|ACO37543.1| truncated Lex2B [Haemophilus influenzae] Length = 99 Score = 63.4 bits (153), Expect = 3e-08, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 7/70 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYGENNPI--CNRIFSHQKRQC 56 P+++I+L S R+ + ++ +L + FFDAIYG++NP + ++ KR Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYGKSNPNHPLFQRYNENKRLN 64 Query: 57 QFKRLLSLPE 66 L+L + Sbjct: 65 AKGYPLTLGQ 74 >gi|317012434|gb|ADU83042.1| family 25 glycosyl transferase [Helicobacter pylori Lithuania75] Length = 265 Score = 63.0 bits (152), Expect = 4e-08, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 53/140 (37%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + + + +F + + + T GY++ + A + + P+DM Sbjct: 126 ETFGESRADNEKEALFLEHFYLTSVYVAT--TAGYYLTPKGAKTFIEATERFKIIEPVDM 183 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFYRNTCY 231 M + H++ + P + +STI+ ++ + KP L++++ Sbjct: 184 FMNNPTYHDVANFTYLPCPISLNKHAFNSTIQNAKKPDISLKPPRKSYFDNLFYHKFNAQ 243 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + + P+ T K Sbjct: 244 KCLKAFHKYSRQYAPLKTPK 263 >gi|326432644|gb|EGD78214.1| hypothetical protein PTSG_12837 [Salpingoeca sp. ATCC 50818] Length = 456 Score = 63.0 bits (152), Expect = 4e-08, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 65/219 (29%), Gaps = 30/219 (13%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQ--------- 55 V IS R+ + A + AI G + + + +Q Sbjct: 164 VLYISRDVDVVRQGQMEEGVAGL-WGAKRIPAIDGTHQENVLQRLAGTPQQAQATLQNLV 222 Query: 56 --CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA--DFSDEFSQLLPH-LSK 110 + + L E ISH+ +IA S ++LEDD D ++Q L + K Sbjct: 223 TDKRKPAKVHLKEAAVSISHLDALHQIAVSDRPYGLVLEDDVTIDLLPFWTQTLQAFVQK 282 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL----- 165 + L++ K+ + G Y + + A + Sbjct: 283 MPADWKLVRLAMSAKESEWKKLFKQWRGELAFAPNSTTVYGAIAYLVPRTFARDVMRMPF 342 Query: 166 --LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAI 202 L+ RP+ W +P P V + Sbjct: 343 VQLDAAAPTQRPL-----KVWLQKLPRF---PCIVADEC 373 >gi|307637308|gb|ADN79758.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 908] Length = 242 Score = 63.0 bits (152), Expect = 4e-08, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 46/109 (42%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P + + +STI Sbjct: 132 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLSKHAFNSTI 191 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 192 QNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKRYAPLKTPK 240 >gi|325995899|gb|ADZ51304.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori 2018] Length = 221 Score = 63.0 bits (152), Expect = 4e-08, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 46/109 (42%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P + + +STI Sbjct: 111 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLSKHAFNSTI 170 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 171 QNAKKPDISLKPPRKSYFDNLFYHKFNAQKCLKAFNKYSKRYAPLKTPK 219 >gi|257455407|ref|ZP_05620642.1| lipooligosaccharide biosynthesis protein lex-1 [Enhydrobacter aerosaccus SK60] gi|257447369|gb|EEV22377.1| lipooligosaccharide biosynthesis protein lex-1 [Enhydrobacter aerosaccus SK60] Length = 176 Score = 63.0 bits (152), Expect = 4e-08, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 50/161 (31%), Gaps = 12/161 (7%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN-ILIKFDALRKKPKKDSYLCTLPGNF 140 I EDD +F QL+ + ++K + + + + Sbjct: 2 DENLDYIGIYEDDIFLGQQFEQLIRQIDVLQSQAIDILKLEKVSPRVHLAHPIVLPQSKR 61 Query: 141 DIHQPRILSPRTTGYFIGKEAAIHLLNVRK--NIYRPIDMDMKHW--WEHNIPSLVTEPG 196 I++ + GY + A +LL + +D+ + + +P P Sbjct: 62 TIYELKSRHLGGAGYLLSHRACQYLLQYIRNLPQLEAVDVMLFDVGKYPKALPVYQLVPA 121 Query: 197 AVYEAIDTND-----STIEESRLVRKPTFSPLYFYRNTCYQ 232 V + S+++ +R + PL + + Sbjct: 122 IVIQEHHLLPQVALASSLQATRS--REIKRPLTGLQKIKRE 160 >gi|78187240|ref|YP_375283.1| glycosyl transferase family protein [Chlorobium luteolum DSM 273] gi|78167142|gb|ABB24240.1| glycosyl transferase, family 25 [Chlorobium luteolum DSM 273] Length = 241 Score = 62.7 bits (151), Expect = 5e-08, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 47/141 (33%), Gaps = 26/141 (18%) Query: 1 MPIP-----VYVISLPFSHARREKFCHRAARIHLQ-----FSFFDAIYGENNPICNRIFS 50 M P + VI+LP R+ + +R SFF AI Sbjct: 3 MNFPDHFDRISVINLPDRTDRKRDTLNEFSRAGWDPNDKMISFFPAI------------- 49 Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 R S+ GCY SH+ + +R + ++LEDD F E +Q+ + + Sbjct: 50 ---RPETAGGFPSVGVRGCYTSHMEVLRRAKENNCANILVLEDDISFIREINQIGNAVME 106 Query: 111 CDINNILIKFDALRKKPKKDS 131 +P D Sbjct: 107 QLEGKKWGFIYFGHDRPAGDK 127 >gi|317181934|dbj|BAJ59718.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F57] Length = 237 Score = 62.7 bits (151), Expect = 5e-08, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 52/140 (37%), Gaps = 11/140 (7%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 98 EIFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 155 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFYRNTCY 231 + + H I + P V +STI+ ++ + KP L++++ Sbjct: 156 FINNPTYHGIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYFDNLFYHKFNAK 215 Query: 232 QWNLHYNAWRKDLPPVSTTK 251 + ++ + K P+ T K Sbjct: 216 KCLRAFHKYSKQYAPLKTPK 235 >gi|296811282|ref|XP_002845979.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238843367|gb|EEQ33029.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 405 Score = 62.3 bits (150), Expect = 6e-08, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 6/103 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YVI+LP RR+ AA ++ + D I G+ + + Sbjct: 69 IYVINLPARTDRRDALVLMAAVSGIRLHWIDGIAGDT------VLDKAIPPPASHKTKKD 122 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + +H++ + I + A+ILEDDAD+ L Sbjct: 123 ANIGSWRAHLNALQDIVENNISSALILEDDADWDIRIKSQLKD 165 >gi|289613796|emb|CBI59371.1| unnamed protein product [Sordaria macrospora] Length = 402 Score = 61.9 bits (149), Expect = 8e-08, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 6/103 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++VI+LP RR+ AA +LQ ++ + G + P +++ + Sbjct: 57 KIFVINLPERTDRRDAMTLAAALTNLQVTWAKGVSGTDVP------DKVVPGEDWEKSIL 110 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 G + +H+++ I A+ILEDDAD+ L Sbjct: 111 RGNKGSWRAHMNVLNTIVNQNLTSALILEDDADWDIRLKSQLQ 153 Score = 36.8 bits (84), Expect = 2.9, Method: Composition-based stats. Identities = 6/26 (23%), Positives = 11/26 (42%) Query: 141 DIHQPRILSPRTTGYFIGKEAAIHLL 166 + P + T Y + ++ A LL Sbjct: 266 RVVHPSSGTICTQAYAVSQQGARKLL 291 >gi|217032009|ref|ZP_03437510.1| hypothetical protein HPB128_187g36 [Helicobacter pylori B128] gi|216946319|gb|EEC24925.1| hypothetical protein HPB128_187g36 [Helicobacter pylori B128] Length = 226 Score = 61.9 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 9/109 (8%) Query: 152 TTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI 209 T GY++ + A + + P+DM M + H++ + P + +STI Sbjct: 116 TAGYYLTPKGAKTFIEATERFKIIEPVDMFMNNPTYHDVANFTYLPCPISLNKHAFNSTI 175 Query: 210 EESR---LVRKPTFSP----LYFYRNTCYQWNLHYNAWRKDLPPVSTTK 251 + ++ + KP L++++ + +N + K P+ T K Sbjct: 176 QNAKKPDISLKPPKKSYFDNLFYHKFNARKCLKAFNKYSKQYAPLKTPK 224 >gi|145640523|ref|ZP_01796107.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae R3021] gi|145275109|gb|EDK14971.1| lipooligosaccharide biosynthesis protein [Haemophilus influenzae 22.4-21] Length = 201 Score = 61.5 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 56/180 (31%), Gaps = 19/180 (10%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFDALRKKPKKDSYLCTL 136 I EDD + L I+ + + K + Sbjct: 2 NENLEYLTIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIP 61 Query: 137 PGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID-------MDMKHWW 184 P N + T GY I + AA +++ KNI +D +D+ ++ Sbjct: 62 PFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYI 121 Query: 185 EHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 + + P++ + ++ S +E+ R R R T + N+ RK Sbjct: 122 VYQLNPAICIQELQANQSKSVLTSGLEKERQKRPKIRKKKTLKQRLTRIKENIIRALNRK 181 >gi|331006186|ref|ZP_08329509.1| 3-deoxy-D-manno-octulosonic-acid transferase [gamma proteobacterium IMCC1989] gi|330419984|gb|EGG94327.1| 3-deoxy-D-manno-octulosonic-acid transferase [gamma proteobacterium IMCC1989] Length = 248 Score = 61.5 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 52/245 (21%), Positives = 81/245 (33%), Gaps = 26/245 (10%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN-RIFSHQKRQCQFKRLL 62 VYV+++ R + + ++ F F + IFS + K L+ Sbjct: 15 KVYVLTVKAFDERIKHITTELGKYNIDFEFIFDFDIPEITEKDLSIFSPSNMEMAHKSLV 74 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 I +WK++A A+ILEDDA + F L + K Sbjct: 75 LKN--------IAVWKKMAAESIQRALILEDDAILDESFRTDLEDIIKSADTLPPGHLIF 126 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTT---GYFIGKEAAIHLLNVRK--NIYRPID 177 L K LP F H ++ R T G+ I L K I P D Sbjct: 127 LGGSDTK------LPKGFLNHPFPLVPRRMTTADGFVIDSTLIFKRLEWLKHNKINLPAD 180 Query: 178 MDMKHWWEHNIPSLVTEP--GAVYEAIDTN--DSTIEESRLVRKPTFSPLYF-YRNTCYQ 232 M + +I + P V +A T +T++ SR + L F + Q Sbjct: 181 GLMCKI-DADINAPHFWPKKSIVRQASCTGKFQTTLDGSRSKHSLLYIKLRFHWHKFKNQ 239 Query: 233 WNLHY 237 + Sbjct: 240 SVKRW 244 >gi|238488291|ref|XP_002375383.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|220697771|gb|EED54111.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 344 Score = 61.5 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 55/186 (29%), Gaps = 37/186 (19%) Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL------- 105 K + + C++ H+++W+ + +++EDDAD+ Q L Sbjct: 104 KGVPGRAGHIGRGQARCWLGHLNIWREMIAQNWATVLVMEDDADWDIAVKQQLLRASPLI 163 Query: 106 ----------------------------PHLSKCDINNILIKFDALRKKPKKDSYLCTLP 137 ++ + + + + D L P Sbjct: 164 RQVSGDKGQVDWSPYGEAWDLLWLGHCGDYIPESPASFMDETLPESSLYRENDGRLTNFP 223 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK-NIYRPIDMDMKHW-WEHNIPSLVTEP 195 + + + T Y + K AA L + + + R I ++ W + L P Sbjct: 224 SHLRMVHKSVAPICTYAYALTKPAASKLYELSRQGMDRIITDYLREWCQSGTLRCLTVNP 283 Query: 196 GAVYEA 201 + Sbjct: 284 EVFHHH 289 >gi|171677827|ref|XP_001903864.1| hypothetical protein [Podospora anserina S mat+] gi|170936982|emb|CAP61640.1| unnamed protein product [Podospora anserina S mat+] Length = 389 Score = 61.1 bits (147), Expect = 1e-07, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 46/111 (41%), Gaps = 14/111 (12%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++VI+LP RR+ A L ++ D I G++ P + + +S Sbjct: 58 IFVINLPERSDRRDAMTLAATLTRLDVNWIDGIDGKDVPE------RVLPGDSWDKKISK 111 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 G + +H++ +R A+ILEDDAD+ + + ++ Sbjct: 112 GNKGSWRAHMNALQR--------ALILEDDADWDIRLKEQMQVFAQAARAF 154 >gi|85080981|ref|XP_956636.1| hypothetical protein NCU05138 [Neurospora crassa OR74A] gi|28881428|emb|CAD70545.1| hypothetical protein [Neurospora crassa] gi|28917708|gb|EAA27400.1| predicted protein [Neurospora crassa OR74A] Length = 389 Score = 61.1 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 46/112 (41%), Gaps = 6/112 (5%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++VI+LP RR+ AA +LQ ++ + G + P +++ + Sbjct: 57 KIFVINLPERTDRRDAMTLTAALTNLQITWAKGVPGTDVP------DKVIPGEDWEKSIL 110 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINN 115 G + +H+++ I A+ILEDDAD+ L ++ Sbjct: 111 RGNKGSWRAHMNVLNTIIDHNLTSALILEDDADWDIRLKSQLQLFARAAQPF 162 >gi|290343647|ref|YP_003495014.1| hypothetical protein OTV1_175 [Ostreococcus tauri virus 1] gi|260161062|emb|CAY39763.1| hypothetical protein OTV1_175 [Ostreococcus tauri virus 1] Length = 268 Score = 61.1 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 61/194 (31%), Gaps = 13/194 (6%) Query: 32 SFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIIL 91 F D I E + + + +L IGC++ H+ +KR A+I Sbjct: 71 EFEDHIDPEYFEKAVEMHYNPYVKRPDITYFNLGAIGCFMGHMDFYKRCFDQGLKYAVIF 130 Query: 92 EDDADFSDE--FSQLLPHLSKCDINNILIKFDALRKKP-KKDSYLCTLPGNFDIHQPRIL 148 ED+ + + ++ + + F L + P K+D L + Sbjct: 131 EDNVIVNSNKLYDEIQKIIDDRGDEFEMCFFHCLSRLPDKRDGKLEKV----------KW 180 Query: 149 SPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 T Y I E + IDM + + + T+ ST Sbjct: 181 ISSTKCYLINVENMKKYTKHFLPMDNHIDMKHEDLIAKGARIYYKDMRQYMKIDRTHSST 240 Query: 209 IEESRLVRKPTFSP 222 I S+ R FS Sbjct: 241 IGHSKHGRPQFFSR 254 >gi|22126669|ref|NP_670092.1| hypothetical protein y2791 [Yersinia pestis KIM 10] gi|45441038|ref|NP_992577.1| hypothetical protein YP_1211 [Yersinia pestis biovar Microtus str. 91001] gi|51595747|ref|YP_069938.1| hypothetical protein YPTB1407 [Yersinia pseudotuberculosis IP 32953] gi|108806670|ref|YP_650586.1| hypothetical protein YPA_0673 [Yersinia pestis Antiqua] gi|108812756|ref|YP_648523.1| hypothetical protein YPN_2595 [Yersinia pestis Nepal516] gi|145599585|ref|YP_001163661.1| hypothetical protein YPDSF_2313 [Yersinia pestis Pestoides F] gi|149366639|ref|ZP_01888673.1| hypothetical protein YPE_1874 [Yersinia pestis CA88-4125] gi|153947445|ref|YP_001401555.1| LPS glycosyltransferase family protein [Yersinia pseudotuberculosis IP 31758] gi|162421726|ref|YP_001606122.1| hypothetical protein YpAngola_A1619 [Yersinia pestis Angola] gi|165924371|ref|ZP_02220203.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165938976|ref|ZP_02227529.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009924|ref|ZP_02230822.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211808|ref|ZP_02237843.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167399713|ref|ZP_02305231.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167419588|ref|ZP_02311341.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424169|ref|ZP_02315922.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167470140|ref|ZP_02334844.1| LPS glycosyltransferase family protein [Yersinia pestis FV-1] gi|170024903|ref|YP_001721408.1| glycosyl transferase family protein [Yersinia pseudotuberculosis YPIII] gi|186894825|ref|YP_001871937.1| glycosyl transferase family protein [Yersinia pseudotuberculosis PB1/+] gi|218928530|ref|YP_002346405.1| hypothetical protein YPO1382 [Yersinia pestis CO92] gi|229841351|ref|ZP_04461510.1| hypothetical protein YPH_3733 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229843456|ref|ZP_04463602.1| hypothetical protein YPF_1841 [Yersinia pestis biovar Orientalis str. India 195] gi|229895829|ref|ZP_04510999.1| hypothetical protein YPS_3630 [Yersinia pestis Pestoides A] gi|229903165|ref|ZP_04518278.1| hypothetical protein YP516_2923 [Yersinia pestis Nepal516] gi|270486960|ref|ZP_06204034.1| LPS glycosyltransferase [Yersinia pestis KIM D27] gi|294503372|ref|YP_003567434.1| hypothetical protein YPZ3_1262 [Yersinia pestis Z176003] gi|21959684|gb|AAM86343.1|AE013882_4 hypothetical [Yersinia pestis KIM 10] gi|45435897|gb|AAS61454.1| hypothetical protein YP_1211 [Yersinia pestis biovar Microtus str. 91001] gi|51589029|emb|CAH20647.1| hypothetical protein YPTB1407 [Yersinia pseudotuberculosis IP 32953] gi|108776404|gb|ABG18923.1| hypothetical protein YPN_2595 [Yersinia pestis Nepal516] gi|108778583|gb|ABG12641.1| hypothetical protein YPA_0673 [Yersinia pestis Antiqua] gi|115347141|emb|CAL20034.1| hypothetical protein YPO1382 [Yersinia pestis CO92] gi|145211281|gb|ABP40688.1| hypothetical protein YPDSF_2313 [Yersinia pestis Pestoides F] gi|149291013|gb|EDM41088.1| hypothetical protein YPE_1874 [Yersinia pestis CA88-4125] gi|152958940|gb|ABS46401.1| LPS glycosyltransferase family protein [Yersinia pseudotuberculosis IP 31758] gi|162354541|gb|ABX88489.1| LPS glycosyltransferase family protein [Yersinia pestis Angola] gi|165913123|gb|EDR31747.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. IP275] gi|165923431|gb|EDR40563.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165991320|gb|EDR43621.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166207579|gb|EDR52059.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166962329|gb|EDR58350.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167050421|gb|EDR61829.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167057018|gb|EDR66781.1| LPS glycosyltransferase family protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169751437|gb|ACA68955.1| glycosyl transferase family 25 [Yersinia pseudotuberculosis YPIII] gi|186697851|gb|ACC88480.1| glycosyl transferase family 25 [Yersinia pseudotuberculosis PB1/+] gi|229678935|gb|EEO75038.1| hypothetical protein YP516_2923 [Yersinia pestis Nepal516] gi|229689803|gb|EEO81864.1| hypothetical protein YPF_1841 [Yersinia pestis biovar Orientalis str. India 195] gi|229697717|gb|EEO87764.1| hypothetical protein YPH_3733 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229700752|gb|EEO88781.1| hypothetical protein YPS_3630 [Yersinia pestis Pestoides A] gi|262361412|gb|ACY58133.1| hypothetical protein YPD4_1225 [Yersinia pestis D106004] gi|262365051|gb|ACY61608.1| hypothetical protein YPD8_0920 [Yersinia pestis D182038] gi|270335464|gb|EFA46241.1| LPS glycosyltransferase [Yersinia pestis KIM D27] gi|294353831|gb|ADE64172.1| hypothetical protein YPZ3_1262 [Yersinia pestis Z176003] gi|320015758|gb|ADV99329.1| hypothetical protein YPC_2790 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 224 Score = 60.7 bits (146), Expect = 2e-07, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 69/233 (29%), Gaps = 45/233 (19%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFS---FFDAIYGENNPICNRIFSHQKRQCQFKR 60 V I+L R E ++ F+AI Sbjct: 12 KVVYINLKERTDRNEHIKKELEKVCFPPEKIIRFEAIRA--------------------- 50 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 IGC SH+ + K + +ILEDD F D+ ++ NN L K Sbjct: 51 --GSGFIGCAKSHLAVLKMAQENNWRNILILEDDMVFEDDDETIIR------TNNFLSKL 102 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRI-LSPRTTGYFIGKEAAIHLL-NVRKNIYRP--- 175 + + S + D + ++ + Y + L+ N +++ R Sbjct: 103 NNIHWDAAFLSASYYIVNAIDDNFFKVNFAYLANSYLVNNHYYEKLINNYTESVQRLTNG 162 Query: 176 ---IDMDMKHWWEHNIPS---LVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 + + W + P Y+ D S IE + R F+ Sbjct: 163 ESSSEYGLDSNWLKIMKIDNWYGIYPVIGYQRTD--ISDIEYKEIDRTHQFTR 213 >gi|188527531|ref|YP_001910218.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] gi|188143771|gb|ACD48188.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Shi470] Length = 251 Score = 60.3 bits (145), Expect = 2e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 112 ETFWERRADSEKEALFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 169 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +STI+ ++ + KP + Sbjct: 170 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYF 218 >gi|260581579|ref|ZP_05849376.1| lipopolysaccharide core biosynthesis glycosyltransferase lpsC [Haemophilus influenzae NT127] gi|260095172|gb|EEW79063.1| lipopolysaccharide core biosynthesis glycosyltransferase lpsC [Haemophilus influenzae NT127] Length = 168 Score = 60.3 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 90 ILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS 149 I EDD + +LL ++ + ++K +A K + N +I + + Sbjct: 4 IFEDDIYLGENAKELL-NIDYIPEDTDILKLEAHGK--IIYGKCEQIKCNRNISRLKFKH 60 Query: 150 PRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDM--KHWWEHNIPSLVTEPGA-----VYE 200 GY I + A +LLN +N +Y ID + + + + + P + Sbjct: 61 TGMAGYSITAKGARYLLNNIRNKQLYLAIDTLIFDELLGKKDYKVMQLSPAICAQSFILH 120 Query: 201 AIDTNDSTIEESR 213 + +S++ R Sbjct: 121 DENYFESSLHNGR 133 >gi|156045870|ref|XP_001589490.1| predicted protein [Sclerotinia sclerotiorum 1980] gi|154693607|gb|EDN93345.1| predicted protein [Sclerotinia sclerotiorum 1980 UF-70] Length = 354 Score = 60.3 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 68/249 (27%), Gaps = 61/249 (24%) Query: 14 HARREKF--CHRAARI-HLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCY 70 R ++ + + + D + +K + L+ +IGC+ Sbjct: 74 PDRTDQIDTASLLSSYSDIDLTIIDGVRS----------VSEKTRPPGGAQLTKGQIGCW 123 Query: 71 ISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKD 130 +H+++ + + A+ILE D D+ L L K + L Sbjct: 124 RAHMNMMRYVVDHRLETALILEADVDWDIRIKYQLEELGKHMPGASKTRPYGLEWDMLYT 183 Query: 131 SYLCTLPGNFDIHQPRILSPRTTG------------------------------------ 154 LP + ++ T G Sbjct: 184 GPALHLPEDANLGPVIRYHDNTVGTLTGSGYDKIEADWVARDLAAYQAQDHQRIVHRAYT 243 Query: 155 ------YFIGKEAAIHLLNV--RKNIYRPID-MDMKHWWEHNIPSLVTEPGAVYE---AI 202 Y + E A +L PID ++ +I L+ P + + Sbjct: 244 TYANTSYMVTLEGAKRILYQLGLHQQTAPIDVDFLEALHRRDINGLIVVPPLMQQWKTGN 303 Query: 203 DTNDSTIEE 211 DS IE+ Sbjct: 304 AAKDSDIED 312 >gi|9631442|ref|NP_048277.1| ORF MSV206 putative glycosyl transferase, similar to Haemophilus somnus GB:U20824 [Melanoplus sanguinipes entomopoxvirus] gi|4049795|gb|AAC97755.1| ORF MSV206 putative glycosyl transferase, similar to Haemophilus somnus GB:U20824 [Melanoplus sanguinipes entomopoxvirus] Length = 287 Score = 60.3 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 58/169 (34%), Gaps = 13/169 (7%) Query: 1 MPIPVYVISL--PFSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQKRQCQ 57 M I ++I L + R++ + +SFF I + N+ + Sbjct: 1 MNIRYHIIVLTIKRNKQRQKDIINYMNAQKYNNYSFFYGIDYK----QNKDNVKYVARHI 56 Query: 58 FKRLLSLPEIGCYISHIHLWKRI--AYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDIN 114 FK++ + C SHI LWK I S I+LEDD + D+ ++ + ++ Sbjct: 57 FKQISPYGVMACAASHILLWKFIASLESKYDFVIVLEDDTIINIDKLNKYIDYIYNLTKT 116 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAI 163 + + I + +++ Y + A Sbjct: 117 GFTFLYSDYCLG---WDSKIIIDKECTIIKNPMINMSLGAYALTSNIAK 162 >gi|308063590|gb|ADO05477.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori Sat464] Length = 251 Score = 60.3 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 39/109 (35%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + S T GY++ + A + + P+DM Sbjct: 112 ETFWERRADSEKEALFLEHFYLTSVYVAS--TAGYYLTPKGAKTFIEATERFKIIEPVDM 169 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + HNI + P V +STI+ + + KP + Sbjct: 170 FINNPTYHNIANFTYVPCPVSLNKHAFNSTIQNVKKPDISLKPPKKSYF 218 >gi|9964562|ref|NP_065030.1| putative glycosyl transferase [Amsacta moorei entomopoxvirus 'L'] gi|9944771|gb|AAG02954.1|AF250284_248 AMV248 [Amsacta moorei entomopoxvirus 'L'] Length = 289 Score = 60.3 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 60/168 (35%), Gaps = 12/168 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHL----QFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + ++++ + R +K + + +L +S F YG + N+ + FK Sbjct: 6 IIILTIKRNSDRLQKLENILSCQNLLYNKDYSVF---YGIDYKNINKNNLKNICKKGFKN 62 Query: 61 LLSLPEIGCYISHIHLWKRI--AYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNIL 117 + C SHI LWK I IILEDD + E+++ + + NN + Sbjct: 63 TCPYSTLACASSHILLWKYISKLKDKYKYIIILEDDTYINVSEYNKHTNTVEELLKNNSI 122 Query: 118 IKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + + + + + + I Y I A L Sbjct: 123 VFLYS--DCYIMGTTIKSTNNDTKITYNPKFHVSMGCYCITPITATKL 168 >gi|126724913|ref|ZP_01740756.1| glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] gi|126706077|gb|EBA05167.1| glycosyl transferase, family 25 [Rhodobacterales bacterium HTCC2150] Length = 251 Score = 60.0 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 21/103 (20%) Query: 5 VYVISLPFSHARREKFCHRAARIHL-----QFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 +Y+++LP RR + + I L Q + F AI R + Sbjct: 19 IYIVNLPSRTDRRREMEEQLRGIGLSLDHEQITLFPAI----------------RPIEKG 62 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS 102 S+ GC+ SH+ ++ + +I+EDD +F F+ Sbjct: 63 EFPSIGCRGCFESHLTIFTDAITAGHETILIIEDDTNFRTNFN 105 >gi|217034635|ref|ZP_03440041.1| hypothetical protein HP9810_875g1 [Helicobacter pylori 98-10] gi|216942883|gb|EEC22372.1| hypothetical protein HP9810_875g1 [Helicobacter pylori 98-10] Length = 240 Score = 60.0 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 101 ETFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 158 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P +V +STI+ ++ + KP + Sbjct: 159 FINNPTYHDIANFTYVPCSVSLNKHAFNSTIQNAKKPDISLKPPKKSYF 207 >gi|297379819|gb|ADI34706.1| Hypothetical protein HPV225_0627 [Helicobacter pylori v225d] Length = 148 Score = 60.0 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 9 ETFWERRADSEKEALFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 66 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +STI+ ++ + KP + Sbjct: 67 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYF 115 >gi|331221213|ref|XP_003323281.1| hypothetical protein PGTG_04818 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309302271|gb|EFP78862.1| hypothetical protein PGTG_04818 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 460 Score = 60.0 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 51/149 (34%), Gaps = 11/149 (7%) Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS-KCDINNILIKFDALRKKPK 128 Y +H+ + + I S A+ILEDD D + + + + + Sbjct: 256 YYNHLRVLRTIRESGEASALILEDDVDMEWDLERRWRSIESHLPSDWDTVFLGHCWG--- 312 Query: 129 KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN-VRKNIY---RPIDMDMKHWW 184 + G+ ++H+ GY + + LL P+D + + Sbjct: 313 -HELIQPQFGHPNLHKSTE-PLCLHGYAVSALGSRKLLELYNDPWTSFQTPVDTCIPTFI 370 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEESR 213 + + S EP + ++ S I+ + Sbjct: 371 KLGLNSFSIEPTIINQS-KVLQSDIQSGK 398 Score = 50.7 bits (120), Expect = 2e-04, Method: Composition-based stats. Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 1/59 (1%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +Y ISLP RRE AA + ++ +F DAIY +++P+ I + K L Sbjct: 99 IYCISLPNRQDRRETMTKIAAALGVRITFVDAIY-KDHPVIGWIGERASEVRKKKLELM 156 >gi|78183672|ref|YP_376106.1| hypothetical protein Syncc9902_0088 [Synechococcus sp. CC9902] gi|78167966|gb|ABB25063.1| hypothetical protein Syncc9902_0088 [Synechococcus sp. CC9902] Length = 290 Score = 60.0 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 52/146 (35%), Gaps = 10/146 (6%) Query: 73 HIHLWKRIAYSPAIGAIILEDDADFS---DEFSQLLPHLSKCDINNILIKFDALRKKPKK 129 H +++ S ++LEDD+ + ++ L + + NN+ + + Sbjct: 141 HYAIFEAFLRSNCNYCLVLEDDSYINTGISHINESLYQVVEHAENNLYNEPFFIDLSQSL 200 Query: 130 DSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW-WEHN 187 P + I Q S T+ Y I + + L K PID + ++N Sbjct: 201 GIKPLNKPPDTYIWQATPGFSQCTSAYMINRNCSQKFLKESKTKVLPIDWHISRIMKKNN 260 Query: 188 IPSLVTEPGAVYEAIDTNDSTIEESR 213 I V+E++ S E R Sbjct: 261 ILCYS-----VWESLFLQGSMTEVYR 281 >gi|308182780|ref|YP_003926907.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] gi|308064965|gb|ADO06857.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori PeCan4] Length = 148 Score = 60.0 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 9 ETFWERRADSEKEALFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 66 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +STI+ ++ + KP + Sbjct: 67 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYF 115 >gi|310831127|ref|YP_003969770.1| putative glycosyltransferase [Cafeteria roenbergensis virus BV-PW1] gi|309386311|gb|ADO67171.1| putative glycosyltransferase [Cafeteria roenbergensis virus BV-PW1] Length = 263 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 46/160 (28%), Gaps = 7/160 (4%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFSHQKRQCQF 58 + I+L R+ +L F+A N + + Sbjct: 33 KTKIIYINLEERQDRKNIIEKFYKNENLIIERFNAERINLNLFIQKYPLMNLSTLLSKST 92 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPA-IGAIILEDDADFSDE-FSQLLPHLSKCDINNI 116 K +GCY SH + + + I +ILEDD S + L + K ++ Sbjct: 93 KINWVNGTLGCYDSHYSILHKYQNNDENIFLVILEDDCIISKSTLNTALKFIYKNPSIDL 152 Query: 117 --LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG 154 + + P S L I+ T Sbjct: 153 LRINCWTPTTSNPFNFSLPTKLSKFSSINNKYYYDGGTHC 192 >gi|229846635|ref|ZP_04466743.1| diadenosine tetraphosphatase [Haemophilus influenzae 7P49H1] gi|229810728|gb|EEP46446.1| diadenosine tetraphosphatase [Haemophilus influenzae 7P49H1] Length = 201 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 56/180 (31%), Gaps = 19/180 (10%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFDALRKKPKKDSYLCTL 136 I EDD + L I+ + + K + Sbjct: 2 NENLEYLKIFEDDVILGENTEVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIP 61 Query: 137 PGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID-------MDMKHWW 184 P N + T GY I + AA +++ KNI +D +D+ ++ Sbjct: 62 PFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDEIVAVDQLIFNKLVDVDNYI 121 Query: 185 EHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 + + P++ + ++ S +E+ R R R T + N+ RK Sbjct: 122 VYQLNPAICIQELQANQSKSVLTSGLEKERQKRPKIRKKKTLKQRLTRIKENIIRALNRK 181 >gi|326483769|gb|EGE07779.1| LPS glycosyltransferase [Trichophyton equinum CBS 127.97] Length = 403 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 7/103 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I G + + ++ Sbjct: 70 IYVVNLPSRTDRRDALVLMAAVSGIKLHWVDGIMG--DTVVDKALPPPATHKFRSAN--- 124 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + H++ + I + A+I EDDAD+ L Sbjct: 125 --IGSWRGHLNALQDIVENNINSALIFEDDADWDIRIKPQLRD 165 >gi|332673580|gb|AEE70397.1| jhp0563 family glycosyltransferase [Helicobacter pylori 83] Length = 265 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 126 ETFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 183 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +STI+ ++ + KP + Sbjct: 184 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNAKKPDISLKPPKKSYF 232 >gi|326475608|gb|EGD99617.1| hypothetical protein TESG_06964 [Trichophyton tonsurans CBS 112818] Length = 403 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 7/103 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I G+ + ++ Sbjct: 70 IYVVNLPSRTDRRDALVLMAAVSGIKLHWVDGIMGDA--VVDKALPPPATHKFRSAN--- 124 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + H++ + I + A+I EDDAD+ L Sbjct: 125 --IGSWRGHLNALQDIVENNINSALIFEDDADWDIRIKPQLRD 165 >gi|90081244|dbj|BAE90102.1| unnamed protein product [Macaca fascicularis] Length = 292 Score = 59.6 bits (143), Expect = 5e-07, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 38/89 (42%), Gaps = 5/89 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK----- 59 +++I+L RR++ ++ +A+ G+ + + ++ Sbjct: 153 IFMINLKRRKDRRDRMLRTLYEQEIEVKIVEAVDGKALNTSQLKALNIEMLPGYRDPYSS 212 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGA 88 R L+ EIGC++SH ++K ++ A Sbjct: 213 RPLTRGEIGCFLSHYSVYKFLSPEKPSQA 241 >gi|322702960|gb|EFY94578.1| hypothetical protein MAA_09949 [Metarhizium anisopliae ARSEF 23] Length = 332 Score = 59.6 bits (143), Expect = 5e-07, Method: Composition-based stats. Identities = 45/275 (16%), Positives = 77/275 (28%), Gaps = 96/275 (34%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +Y ISLP R++ AA +++ + + G N + Sbjct: 72 KIYYISLPHRTDRQDMMTLLAASTNIKLALQPGVNGSNVNDKAK---------------- 115 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD----------- 112 G I+LEDDAD+ E + LS Sbjct: 116 ---------------------PKGTIVLEDDADWDVEIHDIFQTLSVQMRKGELRKRKAT 154 Query: 113 ----------INNILIKF------------------------------DALRKKPKKDSY 132 ++ L+ + RK+ + Sbjct: 155 KYEINHAPYGLDWDLLYIGTCWDIPNGENRPKHQVYNDRFSPNSAEMTSSFRKELEDWGV 214 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN---VRKNIYRPIDMDM-KHWWEHNI 188 + P S T GY + A LL R+++ PID+ M + + Sbjct: 215 DLANNTRVRVLAPSWYSVCTIGYALTNRGAQKLLYTIGNRESMGAPIDLAMIDKVQKGYL 274 Query: 189 PSLVTEPGAV--YEAIDTNDSTIE--ESRLVRKPT 219 +L P V ++ +DS I+ R P+ Sbjct: 275 DALTIIPPLVTPWKTGRVSDSDIDNLAERKGELPS 309 >gi|322692943|gb|EFY84825.1| hypothetical protein MAC_09147 [Metarhizium acridum CQMa 102] Length = 305 Score = 59.6 bits (143), Expect = 5e-07, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 59/200 (29%), Gaps = 55/200 (27%) Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDA-----------DFSDEFSQL 104 Q L E G + +H ++W + +I+E DA ++ F + Sbjct: 31 SQKPGKLHTGEQGGWRAHANIWSKTLRQKLPPILIVESDATWDINIRKIMPHLNNHFRKF 90 Query: 105 LPHLS---------------------------KCDINNILIKFDALRK-------KPKKD 130 L ++ + ++ F + K + Sbjct: 91 LRKINSTKLHSPAWPAEAVFRNESTRASDDDPWLSRHWDILSFGPCHETDENNDIKLIYN 150 Query: 131 SYLCTLPGNF--------DIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNIYRPIDMDMK 181 L ++ + + TT Y + + A+ LL N+ P+DM M+ Sbjct: 151 DPFVPLGKDYQGRALKHERVIRRSGGITCTTAYAVSQTGAVKLLLRTAVNLDMPVDMVMQ 210 Query: 182 -HWWEHNIPSLVTEPGAVYE 200 ++ S P + + Sbjct: 211 EMIVSGDLISYSVMPPIMAQ 230 >gi|144898527|emb|CAM75391.1| hypothetical protein MGR_3502 [Magnetospirillum gryphiswaldense MSR-1] Length = 245 Score = 59.2 bits (142), Expect = 6e-07, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 55/191 (28%), Gaps = 29/191 (15%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 +VI+L R+ +F + + + DA+ + +R S Sbjct: 5 HVINLRRRADRKARFLEWNGGHAVHWVWHDAVEPDQVDRADRAVVGADIDA------SDA 58 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE----------FSQLLPHLSKCDINN 115 EI +SH W S +I EDD S ++ D Sbjct: 59 EIATALSHRRQWLACVDSGQPR-LIFEDDCCLRHGAEATLAAALAESDMVFLGCNADAAV 117 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFD--IHQPRILSP----------RTTGYFIGKEAAI 163 +L D L + PG FD HQ + P Y + A Sbjct: 118 VLELPDRLFAHVTYGNTAHGQPGYFDHFAHQAPPVPPPHLHRTYLIWGLLAYALSPAGAA 177 Query: 164 HLLNVRKNIYR 174 LL + R Sbjct: 178 KLLAECFPLRR 188 >gi|317177548|dbj|BAJ55337.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori F16] Length = 238 Score = 59.2 bits (142), Expect = 6e-07, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 39/109 (35%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 99 ETFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIEATERFKIIEPVDM 156 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +S I+ ++ + KP + Sbjct: 157 FINNPTYHDIANFTYMPCPVSLNKHAFNSIIQNAKKPDISLKPPKKSYF 205 >gi|145633498|ref|ZP_01789227.1| diadenosine tetraphosphatase [Haemophilus influenzae 3655] gi|145635367|ref|ZP_01791069.1| diadenosine tetraphosphatase [Haemophilus influenzae PittAA] gi|144985867|gb|EDJ92475.1| diadenosine tetraphosphatase [Haemophilus influenzae 3655] gi|145267373|gb|EDK07375.1| diadenosine tetraphosphatase [Haemophilus influenzae PittAA] Length = 201 Score = 59.2 bits (142), Expect = 6e-07, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 55/180 (30%), Gaps = 19/180 (10%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFDALRKKPKKDSYLCTL 136 I EDD + L I+ + + K + Sbjct: 2 NENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIP 61 Query: 137 PGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID-------MDMKHWW 184 P + T GY I + AA +++ KNI +D +D+ ++ Sbjct: 62 PFYSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYI 121 Query: 185 EHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 + + P++ + ++ S +E+ R R R T + N+ RK Sbjct: 122 VYQLNPAICIQELQANQSKSVLTSGLEKEREKRPKIRKKKTLKQRLTRIKENIIRALNRK 181 >gi|301058284|ref|ZP_07199324.1| LPS glycosyltransferase [delta proteobacterium NaphS2] gi|300447618|gb|EFK11343.1| LPS glycosyltransferase [delta proteobacterium NaphS2] Length = 233 Score = 59.2 bits (142), Expect = 6e-07, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 53/170 (31%), Gaps = 31/170 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y ISL RR + + + L P+ + + Sbjct: 9 IYCISLKERPDRRMEARSQFHAVGL------------LPLVEFVIVE--------KHPVD 48 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL---SKCDINNILIKFD 121 E G + SH+ K+ S A +I EDD F +L K D + ++ Sbjct: 49 CEQGIFESHLLCMKKGLRSGARNIMIFEDDICFERFAPHVLESAVKFMKTDAHWKILFLG 108 Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 + K +K N + + + S Y + A HL+ Sbjct: 109 CMVKGSRK-------TENRSVLKVKFRSLC-HAYVANADFAQHLVAKTWR 150 >gi|313768382|ref|YP_004062062.1| hypothetical protein MpV1_179c [Micromonas sp. RCC1109 virus MpV1] gi|312599078|gb|ADQ91102.1| hypothetical protein MpV1_179c [Micromonas sp. RCC1109 virus MpV1] Length = 269 Score = 58.8 bits (141), Expect = 7e-07, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 68/238 (28%), Gaps = 29/238 (12%) Query: 4 PVYVISLPFSHARREKFCHRAA--------------RIHLQFSFFDAIYGENNPICNRIF 49 +++++ R+E+F ++ + + D I E + Sbjct: 30 KCFLLTVKDQKERQERFFKSHKDDIPVEIIYGPDTRKVKIAREYEDQIDPEYFEKAIEMH 89 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLPH 107 + + +L IGC++ H+ +KR A+I ED+ + Q+ Sbjct: 90 YNSDVKRPDITYFNLGAIGCFVGHMEFYKRCFRQGLKYAVIFEDNVIIKSNQLYEQIQSV 149 Query: 108 LSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 + + + + F L + P T Y I Sbjct: 150 IDEKGDDFEMCFFHCLSRLPDTKEGTLEKV---------KWISSTKCYLINVNNMKKYQK 200 Query: 168 VRKNIYRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 + +DM + + + + S I + R+ FS + Sbjct: 201 YFYPMDNHVDMKHEDLIDKGARVYYKDLREHMLIDRTHQ--SMIGHNEHGRREFFSRI 256 >gi|261839552|gb|ACX99317.1| hypothetical protein HPKB_0723 [Helicobacter pylori 52] Length = 228 Score = 58.8 bits (141), Expect = 7e-07, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 7/109 (6%) Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDM 178 + ++ +F + + + T GY++ + A + + P+DM Sbjct: 89 ETFWERRADSEKEAFFLEHFYLTSVYVST--TAGYYLTPKGAKTFIGATERFKIIEPVDM 146 Query: 179 DMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSPLY 224 + + H+I + P V +STI+ ++ + KP + Sbjct: 147 FINNPTYHDIANFTYVPCPVSLNKHAFNSTIQNTKKPSISLKPPKKSYF 195 >gi|327297154|ref|XP_003233271.1| hypothetical protein TERG_06265 [Trichophyton rubrum CBS 118892] gi|326464577|gb|EGD90030.1| hypothetical protein TERG_06265 [Trichophyton rubrum CBS 118892] Length = 403 Score = 58.4 bits (140), Expect = 9e-07, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 7/103 (6%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I G + + ++ Sbjct: 70 IYVVNLPSRTDRRDALVLMAAVSDIKLHWVDGIMG--DTVVDKALPPPATHKFKSAN--- 124 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + H++ + I + A+ILEDDAD+ L Sbjct: 125 --IGSWRGHLNALQDIVENNINSALILEDDADWDIRIKPQLRD 165 >gi|314055258|ref|YP_004063596.1| Glycosyl transferase, family 25 [Ostreococcus tauri virus 2] gi|313575149|emb|CBI70162.1| Glycosyl transferase, family 25 [Ostreococcus tauri virus 2] Length = 268 Score = 58.4 bits (140), Expect = 9e-07, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 74/236 (31%), Gaps = 27/236 (11%) Query: 4 PVYVISLPFSHARREKFCHRAAR--------------IHLQFSFFDAIYGENNPICNRIF 49 ++++LP R+E F + + + F + I E + Sbjct: 29 KSFLLTLPKETIRQETFMRNHNKEIPIEVIYGPDTRNVKVAKEFENHIDPEYYEKALEMH 88 Query: 50 SHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLPH 107 + + ++ IGC++ H+ + R A+I ED+ + ++ Sbjct: 89 YNPSVKRPDITYFNMGAIGCFMGHMDFYDRCFKQGLKYAVIFEDNVIVKSNKLYDEIQKI 148 Query: 108 LSKCDINNILIKFDALRKKP-KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + + + F L + P K+D L + T Y I + Sbjct: 149 IDERGEEFEMCFFHCLSRLPDKQDGKLEKV----------KWISSTKCYLINVQNMKRYT 198 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 + IDM + + + T++STI S+ R FS Sbjct: 199 KYFLPMDNHIDMKHEDLIAKGARIYYKDMRRYMKIDRTHNSTIGHSQHGRPKYFSR 254 >gi|291084945|ref|YP_003495228.1| gp87 [Mycobacterium phage Ardmore] gi|262262781|gb|ACY39969.1| gp87 [Mycobacterium phage Ardmore] Length = 264 Score = 58.4 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 52/142 (36%), Gaps = 4/142 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA D F L + ++ R Sbjct: 91 GALGCEANHRKVWQHLTRHNTDWLVVLEDDAIPCDNFRDQLDAALAVAPS-PVVSLYLGR 149 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 150 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPHMLNNL-PNGKPIDEAISAW 208 Query: 184 WEHNIPSLV-TEPGAVYEAIDT 204 H ++ T P + A +T Sbjct: 209 ARHQSHTIAYTWPSLIDHADET 230 >gi|317010840|gb|ADU84587.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori SouthAfrica7] Length = 148 Score = 58.0 bits (139), Expect = 1e-06, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 42/117 (35%), Gaps = 10/117 (8%) Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHNIPSLVTEPGAVYE 200 + + T GY++ ++A + + P+DM M + H+I + P V Sbjct: 29 YLTSVYVASTAGYYLTPKSAKTFIEATERFKIIEPVDMFMNNPTYHDIANFTYLPCPVSL 88 Query: 201 AIDTNDSTIEESRLVR---KPTFSPLY---FYRNTCYQ--WNLHYNAWRKDLPPVST 249 + STI+ + + KP Y FY + Y ++ P + Sbjct: 89 SKHCYVSTIQNVKKIDISLKPPKKSYYDNLFYHKFNARKCLKAFYKYTKQYAPLKTP 145 >gi|270585839|ref|ZP_06221409.1| conserved hypothetical protein [Haemophilus influenzae HK1212] gi|270318507|gb|EFA29596.1| conserved hypothetical protein [Haemophilus influenzae HK1212] Length = 78 Score = 58.0 bits (139), Expect = 1e-06, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 22/52 (42%) Query: 165 LLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 L K Y +D+ M ++E+ +P P + + + + E+ + R Sbjct: 1 FLTQSKEWYLTVDVTMDRFFENKVPPYAIVPFCLEDDGEIESTIYEKQKKQR 52 >gi|169780134|ref|XP_001824531.1| LPS glycosyltransferase [Aspergillus oryzae RIB40] gi|238505805|ref|XP_002384107.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|83773271|dbj|BAE63398.1| unnamed protein product [Aspergillus oryzae] gi|220690221|gb|EED46571.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 396 Score = 58.0 bits (139), Expect = 1e-06, Method: Composition-based stats. Identities = 44/275 (16%), Positives = 84/275 (30%), Gaps = 82/275 (29%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I + +R+ A+ + ++ + D +Y +N + L L Sbjct: 71 IYAIGMKERTDKRDYLTLAASFLGIKVDWRDGVYPDNVSEKS-------------YPLKL 117 Query: 65 PE-------IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK------- 110 E IGC+ +H++ I + A+ILEDDAD+ Q L ++ Sbjct: 118 GESGVKPAAIGCWRAHMNTLIDIVENGYTTALILEDDADWDVSLRQQLAEFARGVRTLTN 177 Query: 111 ---------CDINNILI--------------KFDALRKKP-------------------- 127 N ++ F + P Sbjct: 178 NQHESKKAPYGTNWDILWVGGCASGAHQNETDFYVIPNDPTVPNTTIRGPWESPAGPSIQ 237 Query: 128 -KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL-NVRKNIY-RPIDMDMKHWW 184 +++ + I++ + T GY + E A +L + N P+D M Sbjct: 238 WRQEHPEWPVDSTRYIYRAN-MGCCTFGYAVTLEGARRILAELSINYLNLPVDNAMSDLC 296 Query: 185 EH----NIPSLVTEPGAV----YEAIDTNDSTIEE 211 + P + E + DS I++ Sbjct: 297 AGSNRPQLRCYAPFPNLIGTFRSEGYVSRDSDIDQ 331 >gi|328852550|gb|EGG01695.1| hypothetical protein MELLADRAFT_117746 [Melampsora larici-populina 98AG31] Length = 478 Score = 57.6 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 11/152 (7%) Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLS-KCDINNILIKFDALRKKPK 128 + +H+ + I S A++LEDD D + + + + + + K + Sbjct: 265 FYNHLKTIRMIKESGDKSALVLEDDVDLEWDLERRWRTIERRLPKDWETVFLGHCWGK-E 323 Query: 129 KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR-KNIY---RPIDMDMKHWW 184 P P L GY + A LL++ PID + + Sbjct: 324 IFEPQVLHPNLHKSTSPLCLH----GYAVSSLGATKLLSLFNDPWISFQTPIDTCIPTFI 379 Query: 185 EHNIPSLVTEPGAVYEAIDTNDSTIEESRLVR 216 + + S EP + ++ S I+ + + Sbjct: 380 KLGLKSYSIEPPIINQS-KILKSDIQIGKGSK 410 Score = 45.3 bits (106), Expect = 0.009, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 57/139 (41%), Gaps = 16/139 (11%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP---ICNRIFSHQKRQCQFKRL 61 +Y ISLP RR A + ++ +F +A ++N I + + +K + + + Sbjct: 106 IYCISLPERIDRRITMKKIANALGIEITFINATSKDSNIIKWISEKAYEIRKEKIRHIKK 165 Query: 62 ---LSLPEIGCYISHIHLW-----KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + +IG + +W +++ ++ I L D D F + L ++ + Sbjct: 166 HTGMKSSQIG-GMGVESIWLTVNEGKLSNQDSLKDIHLPD----LDLFDKELNGMNWLEY 220 Query: 114 NNILIKFDALRKKPKKDSY 132 + D + KPK ++ Sbjct: 221 LWSIKSEDHSKLKPKNPNF 239 >gi|302690276|ref|XP_003034817.1| hypothetical protein SCHCODRAFT_255847 [Schizophyllum commune H4-8] gi|300108513|gb|EFI99914.1| hypothetical protein SCHCODRAFT_255847 [Schizophyllum commune H4-8] Length = 417 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 45/122 (36%), Gaps = 8/122 (6%) Query: 68 GCYISHIHLWKRIAYS---PAIGAIILEDDADFSDEFSQLLPHLSK-CDINNILIKFDAL 123 CY +H L + A + P +ILEDD D +F + L H + + ++ Sbjct: 237 ACYHAHYSLLRTFAAASGAPDDTLLILEDDVDIEFDFERTLRHAMQGLPADWDVLLPGYC 296 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 K + PG + + GY + + A L+++ ++ + Sbjct: 297 DSTETKHKPV---PGYPRLRRSDD-PLCNHGYIVSRRGARRLVSLLRSPSFAYSRALDQA 352 Query: 184 WE 185 ++ Sbjct: 353 YK 354 Score = 41.5 bits (96), Expect = 0.13, Method: Composition-based stats. Identities = 12/32 (37%), Positives = 15/32 (46%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDA 36 + VISLP RR + A + F FFD Sbjct: 80 IMVISLPHRTDRRARMSLLANALDADFGFFDG 111 >gi|109392284|ref|YP_655094.1| gp98 [Mycobacterium phage Llij] gi|88910385|gb|ABD58314.1| gp98 [Mycobacterium phage Llij] Length = 207 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 56/160 (35%), Gaps = 8/160 (5%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L + ++ R Sbjct: 34 GTLGCEANHRKVWQHLTRHNTDWLVVLEDDAIPCNNFRDQLDAALAVAPS-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAI--HLLNVRKNIYRPIDMDMK 181 ++P++ + H G I AA+ ++LN +PID + Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAI--HAALVPNMLNNL-PNGKPIDEAIS 149 Query: 182 HWWEHNIPSLV-TEPGAVYEAIDTNDSTIEESRLVRKPTF 220 W H ++ T P + A +T R P Sbjct: 150 AWARHQSHTIAYTWPSLIDHADETPMIATRNDNQPRPPGR 189 >gi|104779683|ref|YP_606181.1| hypothetical protein PSEEN0409 [Pseudomonas entomophila L48] gi|95108670|emb|CAK13364.1| hypothetical protein PSEEN0409 [Pseudomonas entomophila L48] Length = 185 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 48/180 (26%), Gaps = 23/180 (12%) Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDA 122 S P+ GCY SH L + +I EDD + + +++ + Sbjct: 15 SDPKRGCYESHQALAQLALDQGLERILIFEDDVKPYPLKASRIRWINRFMRTHRFEALHL 74 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKH 182 + + Y + +E +L P+D+ K Sbjct: 75 GYSMGRTWLTWFPFIARGKVVALH-------AYVLSREG-CKILAQTPYDGTPVDVMFKR 126 Query: 183 WWEHNIPSLVTEPGAVY-EAIDTNDSTIEE---------SRLVRKPTFSPLY-FYRNTCY 231 I P A S +E R RK SPL +R Sbjct: 127 ----RIRQHCVFPMMFRQHAAAVTGSDLELVVRNEDEWWERNWRKHWRSPLKNLWRTVLR 182 >gi|317029807|ref|XP_001391261.2| hypothetical protein ANI_1_1538064 [Aspergillus niger CBS 513.88] Length = 420 Score = 56.9 bits (136), Expect = 3e-06, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 79/272 (29%), Gaps = 69/272 (25%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL-L 62 V I+LP +R+ ++ + + + E+ + + + R + Sbjct: 80 NVLCINLPSRTDKRDAITLGSSVTDFHVDWIEGVSAEDMDLKA----YPPHYGEPGRPIM 135 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-------DEF---SQLLPHLSKCD 112 E+G + +H++ +RI A+I+EDD+D+ EF SQ L K Sbjct: 136 IPGEVGNWRAHLNAMRRIVSDRLTTALIIEDDSDWDVTLKIQLHEFALASQSLQKARKVK 195 Query: 113 I-------------------NNILIKFDALRKKPKKDSYLCTLPG--------------- 138 ++ ++D+ Sbjct: 196 APKIFRHRTPNTAPNSPYGDEWGILWLGHCGMTCRQDAPYYKRSNDSTALPVDSLPPYWS 255 Query: 139 -----NFDIHQPRILSPRTTGYFIGKEAAIHLLNV-----RKNIYRP-----IDMDMKHW 183 N I T Y + AA ++ I+ P D+ + Sbjct: 256 GRASPNTRIICQTRYVICTGAYAVSYNAAQKIIAALSVLPSDEIFPPGTSSIFDVSLGRL 315 Query: 184 WEHN-IPSLVTEPGAV----YEAIDTNDSTIE 210 E+ + + P + + +S I+ Sbjct: 316 CENGYLTCYSSFPSLIGNWRPAGSPSRNSDIQ 347 >gi|159474124|ref|XP_001695179.1| predicted protein [Chlamydomonas reinhardtii] gi|158276113|gb|EDP01887.1| predicted protein [Chlamydomonas reinhardtii] Length = 338 Score = 56.9 bits (136), Expect = 3e-06, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 55/171 (32%), Gaps = 15/171 (8%) Query: 12 FSHARREKFCHRAARIHLQ-FSFFDAIYGENNPICNRIFSHQK--RQCQFKRLLSLPEIG 68 S+AR AR L F Y + + + + +FK S +I Sbjct: 79 KSNARVASLQRNFARYKLPPVEFL---YAKAPGYLSYTWLAWRGVFHPKFKDHHSDAQIS 135 Query: 69 CYISHIHLWKRIAYSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILIKFDALRKKP 127 ++ ++R+A + EDDA + L L + + + Sbjct: 136 LALAMYEAFERVARGSDSWVYVFEDDARVVNVPLGGDLVTLRDVPADAEFLAVSRQHNRV 195 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM 178 G+F +P I + +E A L+ K I+ P D+ Sbjct: 196 --------HNGSFAPAEPVIGGGLMQALLVSREGARKLVQALKPIWAPNDV 238 >gi|313844146|ref|YP_004061809.1| hypothetical protein OlV1_177c [Ostreococcus lucimarinus virus OlV1] gi|312599531|gb|ADQ91553.1| hypothetical protein OlV1_177c [Ostreococcus lucimarinus virus OlV1] Length = 268 Score = 56.5 bits (135), Expect = 4e-06, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 61/197 (30%), Gaps = 33/197 (16%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-------NPICNRIFSHQKRQC 56 ++++LP RR +F + N + + + + Sbjct: 29 KCFLLTLPKEDLRRRRFMKYHNP-EVPVEVIYGTDTRNIKNARKFEHQIDAEYFEKAVEM 87 Query: 57 QF----KRL----LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLP 106 + KR ++ IGC++ H+ +KR A+I ED+ + ++ Sbjct: 88 HYNPNVKRPDITYFNMGAIGCFMGHMDFYKRCFDQGLRYAVIFEDNVIVKSNKLYDEIQK 147 Query: 107 HLSKCDINNILIKFDALRKKP-KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + + + F L + P K D L + T Y I + Sbjct: 148 VIDERGDEFEMCFFHCLSRLPDKIDGKLEKV----------KWISSTKCYLINVQNMKKY 197 Query: 166 LNVRKNIYRPIDMDMKH 182 + + P+D + Sbjct: 198 THH----FLPMDNHIDM 210 >gi|207092251|ref|ZP_03240038.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPKX_438_AG0C1] Length = 101 Score = 56.1 bits (134), Expect = 5e-06, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 34/98 (34%), Gaps = 17/98 (17%) Query: 5 VYVISLPFSHARREK------FCHRAARIHLQFSFFDAIYGENNPIC---NRIFSHQKRQ 55 VY+ISL S R + + + F F+AI ++ ++ Q Sbjct: 4 VYIISLKESQRRLDTEKLVLESNEKFKGRCV-FQIFNAISPKHEDFEKFVQELYDAQSML 62 Query: 56 C-------QFKRLLSLPEIGCYISHIHLWKRIAYSPAI 86 + L E GCY+SH LWK + Sbjct: 63 KSDWFHSDYCYQELLPREFGCYLSHYLLWKECVKTNQP 100 >gi|154296527|ref|XP_001548694.1| hypothetical protein BC1G_12838 [Botryotinia fuckeliana B05.10] gi|150843361|gb|EDN18554.1| hypothetical protein BC1G_12838 [Botryotinia fuckeliana B05.10] Length = 376 Score = 55.7 bits (133), Expect = 6e-06, Method: Composition-based stats. Identities = 42/297 (14%), Positives = 79/297 (26%), Gaps = 72/297 (24%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 V VI L + + A+ + F D I G+ K + + Sbjct: 83 KVVVIGLRERSDKWDAMVLSASLTGFEVEFADGIKGDEV--------VDKAKPPPGHFSN 134 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-------DEFSQLLPHLSKCDI--- 113 +G + H+++ + I A+I+EDD D+ EF++ + LS+ Sbjct: 135 GGAMGPWRGHMNVIRSIIERKLASALIMEDDVDWDIRLRTQLPEFAKGIRTLSRMPHSQR 194 Query: 114 -------NNILIKFDA--------LRKKPKKDSYLCTLPG-------------------N 139 + ++ K K + Y G N Sbjct: 195 QHSPYGDDWDILWPGHCGDELPSVFHKGEKDELYRIDNDGTVVPKSSQSGLKTLANYPAN 254 Query: 140 FDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI---YRPIDMDM----KHWWEHNIPSLV 192 I T GY + A +L D + + + ++ Sbjct: 255 TRIVHKAGAPKCTLGYAVSYRGAQKILMALSVKGSNDLTGDNALASLCRDGYL-DMKCFS 313 Query: 193 TEP-------GAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAWRK 242 EP AV + D +++ + W+ N Sbjct: 314 VEPQLFQHHKPAVTKNKDGGINSVGQKEAQGAGEKEI-----KNMIVWSARLNLEHL 365 >gi|145612563|ref|XP_367289.2| hypothetical protein MGG_07214 [Magnaporthe oryzae 70-15] gi|145019706|gb|EDK03934.1| hypothetical protein MGG_07214 [Magnaporthe oryzae 70-15] Length = 459 Score = 55.7 bits (133), Expect = 6e-06, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 41/106 (38%), Gaps = 6/106 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + ++LP H R + ++ + +F A+ + I L Sbjct: 94 IQFLNLPHRHDRFDAAAIQSLLSDIDVEYFPAV------VDTDIRDQGLPPTSKPNTLKK 147 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 EIGC+ +H +LW+ +ILE DA + Q + L+K Sbjct: 148 GEIGCWRAHANLWQYQVQKRIPALLILESDAAWDVYIRQQMGVLNK 193 Score = 39.2 bits (90), Expect = 0.69, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 19/47 (40%), Gaps = 2/47 (4%) Query: 136 LPGNFDIHQPRILSP-RTTGYFIGKEAAIHLLNVRK-NIYRPIDMDM 180 P N R + TTGY I A LL N+ P+D+ M Sbjct: 313 KPLNRQRVVFRAMGMVCTTGYIISLRGAAKLLVRTNFNLDMPVDLIM 359 >gi|171910378|ref|ZP_02925848.1| glycosyl transferase, family 25 [Verrucomicrobium spinosum DSM 4136] Length = 388 Score = 55.7 bits (133), Expect = 6e-06, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 57/198 (28%), Gaps = 27/198 (13%) Query: 1 MPIPVYVISLPFSHARREKFC----HRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQC 56 M + ++L RR A + F A+ G P + + Sbjct: 1 MKVDYVCLNLLRRPDRRFLVESTLPQEMAGL---LRFVPAVDGGMIPSSRV---RRWKIG 54 Query: 57 QFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINN 115 ++ +S + + + EDD F +F + L+ + Sbjct: 55 NTGYAVN-------LSKRLVLREFLRGDSDVLFFFEDDVKFERDFWEKYQWMLANAPSDW 107 Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIY 173 ++ + + + + + H L I ++AA +L RK Sbjct: 108 AMLFLGGDHSQAPQSTSVSGIVKCVRTHLNHAL-------LIKRDAAREILREINRKPFR 160 Query: 174 RPIDMDMKHWWEHNIPSL 191 + W + +P+ Sbjct: 161 HCLSDQTIAWLQDKLPTY 178 >gi|145354663|ref|XP_001421598.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144581836|gb|ABO99891.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 457 Score = 55.7 bits (133), Expect = 6e-06, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 56/188 (29%), Gaps = 26/188 (13%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFS-------------HQ 52 Y ++ + +R + + + + + + + H Sbjct: 38 YYVNARGATTKRLEMERAVVALDARARRIERVDAKRRSTLLMTYETATCGRGRFAPSAHA 97 Query: 53 KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDAD--FSDEFSQLLP-HLS 109 +R+ +F +G +SH+ A A+I+EDDA +S L + Sbjct: 98 ERRERFGEEYLNA-LGTSLSHLEAVYAAYDDGASVALIMEDDARWDLLPTWSSDLEKFVE 156 Query: 110 KCDINNILIKFDALRKKPKKDSYLC-------TLPGNFDIHQPR--ILSPRTTGYFIGKE 160 + + + + AL + PG P+ T Y I ++ Sbjct: 157 RLPEDWTVTQLSALGDYAETTKLFYDWQRERKNAPGQSLTTLPKGSRRLSGTQAYLISRK 216 Query: 161 AAIHLLNV 168 L+ Sbjct: 217 GMSRLVKA 224 >gi|206599982|ref|YP_002241787.1| gp102 [Mycobacterium phage Fruitloop] gi|206287070|gb|ACI12416.1| gp102 [Mycobacterium phage Fruitloop] Length = 207 Score = 55.3 bits (132), Expect = 7e-06, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 52/142 (36%), Gaps = 4/142 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA D F L + ++ R Sbjct: 34 GTLGCETNHRKVWQHLTRHNTDWLVVLEDDAIPCDNFRDQLDAALAVAPS-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + ++LN +PID + W Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPNMLNNL-PNGKPIDEAISAW 151 Query: 184 WEHNIPSLV-TEPGAVYEAIDT 204 H ++ T P + A +T Sbjct: 152 ARHQSHTIAYTWPSLIDHADET 173 >gi|319776047|ref|YP_004138535.1| hypothetical protein HICON_13950 [Haemophilus influenzae F3047] gi|317450638|emb|CBY86858.1| Putative uncharacterised protein [Haemophilus influenzae F3047] Length = 518 Score = 55.3 bits (132), Expect = 8e-06, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 55/175 (31%), Gaps = 10/175 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M + YV+ P A +F + ++F+ + E +F +K F Sbjct: 1 MNLKKYVLYSPSQPAELAEFLQQEN--GVEFTPIKGVQVEEINALAANHLFDVEKFYRYF 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDIN 114 R EI SH+ LW+ IA + A+I E FS F + L Sbjct: 59 NRTPFNDEIAHTYSHLQLWQIIAKDNELDENGYALIAESAVRFSSLFQHFVE-LYIEKYP 117 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL-SPRTTGYFIGKEAAIHLLNV 168 +IK + G H P L Y I K LL Sbjct: 118 YDIIKLQRNGVAKATLNPELPELGAVIYHNPAELDGYGCAFYLIKKSVIRQLLAQ 172 >gi|319896855|ref|YP_004135050.1| uncharacterised protein [Haemophilus influenzae F3031] gi|329123207|ref|ZP_08251776.1| hypothetical protein HMPREF9095_0994 [Haemophilus aegyptius ATCC 11116] gi|317432359|emb|CBY80714.1| putative uncharacterised protein [Haemophilus influenzae F3031] gi|327471552|gb|EGF16996.1| hypothetical protein HMPREF9095_0994 [Haemophilus aegyptius ATCC 11116] Length = 518 Score = 55.3 bits (132), Expect = 8e-06, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 55/175 (31%), Gaps = 10/175 (5%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR--IFSHQKRQCQF 58 M + YV+ P A +F + ++F+ + E +F +K F Sbjct: 1 MNLKKYVLYSPSQPAELAEFLQQEN--GVEFTPIKGVQVEEINALAANHLFDVEKFYRYF 58 Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAI----GAIILEDDADFSDEFSQLLPHLSKCDIN 114 R EI SH+ LW+ IA + A+I E FS F + L Sbjct: 59 NRTPFNDEIAHTYSHLQLWQIIAKDNELDENGYALIAESAVRFSSLFQHFVE-LYIEKYP 117 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRIL-SPRTTGYFIGKEAAIHLLNV 168 +IK + G H P L Y I K LL Sbjct: 118 YDIIKLQRNGVAKATLNPELPELGAVIYHNPAELDGYGCAFYLIKKSVIRQLLAQ 172 >gi|302655913|ref|XP_003025827.1| LPS glycosyltransferase, putative [Trichophyton verrucosum HKI 0517] gi|291183481|gb|EFE39092.1| LPS glycosyltransferase, putative [Trichophyton verrucosum HKI 0517] Length = 407 Score = 55.3 bits (132), Expect = 8e-06, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 42/106 (39%), Gaps = 9/106 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV++LP RR+ AA ++ + D I GEN + ++ FK Sbjct: 70 IYVVNLPSRTDRRDAIVLLAAVSGIKLHWVDGIRGEN--VVDKALPPPATHQYFKA---- 123 Query: 65 PEIGCYISHIHLWKR---IAYSPAIGAIILEDDADFSDEFSQLLPH 107 IG + H+ I + A+I EDDAD+ L Sbjct: 124 ANIGSWRGHLDALSDEPSIVENNINTALIFEDDADWDIRIKSQLRD 169 >gi|288929226|ref|ZP_06423071.1| glycosyltransferase [Prevotella sp. oral taxon 317 str. F0108] gi|288329328|gb|EFC67914.1| glycosyltransferase [Prevotella sp. oral taxon 317 str. F0108] Length = 225 Score = 55.3 bits (132), Expect = 8e-06, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 39/102 (38%), Gaps = 24/102 (23%) Query: 3 IPVYVISLPFSHARREKFCHRA-ARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 +PVYVI+L RR+ + R + ++ A+ +PI Sbjct: 10 LPVYVINLEERTERRQHIEEQFSDRTEFELTWIKAV---KHPI----------------- 49 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 +G + S + ++ + +I EDD F+ +++ Sbjct: 50 ---GAVGLWKSMVQAIRKAEENEDDIIVICEDDHTFTPAYNR 88 >gi|207111759|ref|ZP_03245921.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPKX_438_CA4C1] Length = 38 Score = 55.3 bits (132), Expect = 9e-06, Method: Composition-based stats. Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 1/35 (2%) Query: 59 KRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILED 93 L+LPE+GCY+SH LWK +ILED Sbjct: 5 GAGLTLPELGCYLSHYLLWKECVKLNQP-VVILED 38 >gi|83775626|dbj|BAE65746.1| unnamed protein product [Aspergillus oryzae] Length = 358 Score = 55.0 bits (131), Expect = 9e-06, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 70/238 (29%), Gaps = 63/238 (26%) Query: 10 LPFSHARREKFCHRAARIHLQFS-------FFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +P +R+ A LQF F D I + P + H K Sbjct: 1 MPNRPDKRDYIT--LASSILQFHPEPVNGVFVDDIDKKAYP---SNWDHGK--------- 46 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI--------- 113 E+G + +H+++ +RI + A +LEDDAD+ + L + Sbjct: 47 LPAEMGAWRAHMNVMQRIVHERISTAFVLEDDADWDVNLKKQLQRFASASQLVQGDTGPS 106 Query: 114 ------NNILIKFDALRKKPKK--------DSYLCTLP--------------GNFDIHQP 145 L+ + K D LP + Sbjct: 107 HSPYGDLWDLLWIGHCGIQYKTGPIHVTTDDITTVPLPELPRYWHGFPAGADNGTRLVAR 166 Query: 146 RILSPRTTGYFIGKEAAIHLLNVR----KNIYRPIDMDMKHWWEHN-IPSLVTEPGAV 198 + GY I A LL+ K P D+ + + ++ + + P + Sbjct: 167 MHDGVCSLGYAITYLGAQKLLSALSLTPKGDGAPFDVAIGRFCQNGWLRCIAPFPSLI 224 >gi|169628871|ref|YP_001702520.1| bacteriophage protein [Mycobacterium abscessus ATCC 19977] gi|169240838|emb|CAM61866.1| Bacteriophage protein [Mycobacterium abscessus] Length = 210 Score = 55.0 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 44/183 (24%), Positives = 67/183 (36%), Gaps = 22/183 (12%) Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP-HLSKCDINNILIKF 120 L GC +H H+ + ++ SP ++LEDDA D+F L L+ + + + Sbjct: 31 LDNGSKGCNGNHRHVLEWLSTSPTEWVVVLEDDAQPIDDFRTQLDKALTAAPCDIVSLYL 90 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILS---PRTTGYFIGKEAAIHLLNVRKNIYRPID 177 + + P ++S GY I LL+ PID Sbjct: 91 GTNYPRLWQRGIQRATTQADQTDSPWLVSEHLLHAVGYCIRTTLVPDLLDALPE--MPID 148 Query: 178 MDMKHW---WEHNIPSLVTEPGAV-YEAIDTNDSTIEESRLVRKPTF-SPLYFYR-NTCY 231 + W EH I T P V +E DT S ++PT +P +R T Sbjct: 149 DAITTWARDQEHRIA--YTWPSLVDHEDADTLIS--------KRPTRNAPRKAHRTGTRT 198 Query: 232 QWN 234 QW Sbjct: 199 QWA 201 >gi|163955173|ref|YP_001648277.1| hypothetical protein OsV5_201r [Ostreococcus virus OsV5] gi|163638622|gb|ABY27981.1| hypothetical protein OsV5_201r [Ostreococcus virus OsV5] Length = 265 Score = 55.0 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 66/236 (27%), Gaps = 30/236 (12%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR--- 60 Y+++LP S R+ KF + + +N H + K Sbjct: 29 KCYLLTLPESTERQRKFFKSHNS-SIPIEVVYGVNTKNIKTAREFEDHIDPEYFEKAVEM 87 Query: 61 ------------LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLP 106 +L IG Y+ H + K+ A++ ED+ + ++ Sbjct: 88 HYNPQVKRPDITYFNLGAIGAYMGHTGIMKKC---EHKYALVFEDNVIIKGNRLYDEVQK 144 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 + + N + F L + P + + + Y + + Sbjct: 145 VIDELGDNFEMCFFHCLSRIPTEVNKDLERV---------VWVSSMKCYLVNIHNMKKYM 195 Query: 167 NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSP 222 I +D + + + + STI S RK FS Sbjct: 196 KYYFPIDNHVDNKTEDIIARGARVYYKDLRKYMKIDRSGPSTIGHSDHGRKEYFSR 251 >gi|145630267|ref|ZP_01786048.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae R3021] gi|144984002|gb|EDJ91439.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae R3021] Length = 201 Score = 55.0 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 54/180 (30%), Gaps = 19/180 (10%) Query: 82 YSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNIL-----IKFDALRKKPKKDSYLCTL 136 I EDD + L I+ + + K + Sbjct: 2 NENLEYLTIFEDDVILGENAEVFLAQDEWLKTRFDFNDIFIIRLETFLQPVKLEKQTKIP 61 Query: 137 PGNFDIHQ-PRILSPRTTGYFIGKEAAIHLLNVRKNI----YRPID-------MDMKHWW 184 P + T GY I + A +++ KNI +D +D+ ++ Sbjct: 62 PFYSRNFDILKSTHWGTAGYIISQGAVKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYI 121 Query: 185 EHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYF-YRNTCYQWNLHYNAWRK 242 + + P++ + ++ S +E+ R R R T + N+ RK Sbjct: 122 VYQLNPAICIQELQANQSKSVLTSGLEKEREKRPKIRKKKTLKQRLTRIKENIIRALNRK 181 >gi|154290384|ref|XP_001545788.1| hypothetical protein BC1G_15746 [Botryotinia fuckeliana B05.10] gi|150847768|gb|EDN22961.1| hypothetical protein BC1G_15746 [Botryotinia fuckeliana B05.10] Length = 373 Score = 55.0 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 76/309 (24%), Gaps = 90/309 (29%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++ +SLP +R+ ++ + + D + GE + + + Sbjct: 60 KIFAVSLPERSDKRDALTLISSLTGFKIEWVDGVKGEAVSDKAVPYGADRVKLW------ 113 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS-QLLPHLSKCDI--------- 113 +G + H++ +RI A+I EDD D+ QLL + Sbjct: 114 ESNLGSWRGHMNAIRRIVEQNLTSALIFEDDMDWDTRLKSQLLTFAAGSKFIQYPSTTPS 173 Query: 114 --------NNILIKFDA-------------------------LRKKPKKDSYLCTLPGNF 140 N L+ RK P +F Sbjct: 174 NLVSPYGDNWDLLWLGHCGEVFPETLDEYKSLSPSSPEIAHLSRKYIITSDPTVVPPTHF 233 Query: 141 DIHQ------------PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW----- 183 Q T Y + ++ A +L +D + Sbjct: 234 KGFQNATQNPYTRWVHISGGPICTFAYALSQQGARKVL-----WDLSVDKLAGPFDNALA 288 Query: 184 ----WEH-----NIPSLVTEPGAVYEAIDT----NDSTIEESRLVRKPTFSP------LY 224 W + + PG DS I++ + ++ Sbjct: 289 GLCRWGRDSKRLGMRCISVTPGLFQHHKARGWIGGDSDIQKVGSAKGEVREKGVTENVVW 348 Query: 225 FYRNTCYQW 233 R + Sbjct: 349 SARMNVREM 357 >gi|157311297|ref|YP_001469341.1| gp108 [Mycobacterium phage Tweety] gi|148540926|gb|ABQ86177.1| gp108 [Mycobacterium phage Tweety] Length = 208 Score = 54.6 bits (130), Expect = 1e-05, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 54/158 (34%), Gaps = 4/158 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L + ++ R Sbjct: 35 GALGCETNHRKVWQHLTRHNTDWLVVLEDDAIPCNNFRDQLHAALAVAPS-PVVSLYLGR 93 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 94 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPHMLNNL-PNGKPIDEAISAW 152 Query: 184 WEHNIPSLV-TEPGAVYEAIDTNDSTIEESRLVRKPTF 220 H ++ T P + A +T R P Sbjct: 153 ARHQSHTIAYTWPSLIDHADETPMIATRNDNQPRPPGR 190 >gi|206600179|ref|YP_002241684.1| gp100 [Mycobacterium phage Pacc40] gi|206287267|gb|ACI12611.1| gp100 [Mycobacterium phage Pacc40] Length = 207 Score = 54.6 bits (130), Expect = 1e-05, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 50/142 (35%), Gaps = 4/142 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L + ++ R Sbjct: 34 GTLGCETNHRKVWQHLTKFNTDWLVVLEDDAIPCNNFRDQLDAALAVAPS-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPHMLNQL-PNGKPIDEAISAW 151 Query: 184 WEH-NIPSLVTEPGAVYEAIDT 204 H T P + A +T Sbjct: 152 ARHQGHTIAYTWPSLIDHADET 173 >gi|53711301|ref|YP_097293.1| putative glycosyltransferase [Bacteroides fragilis YCH46] gi|52214166|dbj|BAD46759.1| putative glycosyltransferase [Bacteroides fragilis YCH46] Length = 222 Score = 54.6 bits (130), Expect = 1e-05, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 24/102 (23%) Query: 3 IPVYVISLPFSHARREKFCHRAA-RIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 +PVYVI+L RR+ + ++ + +AI Sbjct: 10 LPVYVINLKERTERRQHIEEQFQGKVEFAIHWIEAI-----------------------E 46 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 S+ +G + S + + II EDD F+ +++ Sbjct: 47 HSIGAVGLWQSMLKAVQTAIDKRDDIMIICEDDHIFTPAYNK 88 >gi|311246551|ref|XP_001925614.2| PREDICTED: glycosyltransferase 25 family member 3-like [Sus scrofa] Length = 349 Score = 54.6 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 2/58 (3%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 V+VISL RRE+ + + DA+ G F + + R L Sbjct: 184 VFVISLARRPDRRERMLSSLWEMEIAGRVVDAVDGR--EQYKAHFWPRDLRAFSARPL 239 >gi|60679606|ref|YP_209750.1| putative glycosyltransferase [Bacteroides fragilis NCTC 9343] gi|60491040|emb|CAH05788.1| putative glycosyltransferase [Bacteroides fragilis NCTC 9343] Length = 222 Score = 54.6 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 24/102 (23%) Query: 3 IPVYVISLPFSHARREKFCHRAA-RIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 +PVYVI+L RR+ + ++ + +AI Sbjct: 10 LPVYVINLKERTERRQHIEEQFQGKVEFALHWIEAI-----------------------E 46 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 S+ +G + S + + II EDD F+ +++ Sbjct: 47 HSIGAVGLWQSMLKAVQTAIDKRDDIMIICEDDHIFTPAYNK 88 >gi|242007889|ref|XP_002424750.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508253|gb|EEB12012.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 327 Score = 54.2 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 17/37 (45%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGE 40 +Y+I+L RR + + L + FDA+ G Sbjct: 291 KIYMINLKRRPERRIRMIETFNELELDVTVFDAVDGR 327 >gi|206600289|ref|YP_002241895.1| gp108 [Mycobacterium phage Ramsey] gi|206287377|gb|ACI12720.1| gp108 [Mycobacterium phage Ramsey] Length = 203 Score = 54.2 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 48/154 (31%), Gaps = 32/154 (20%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L + ++ R Sbjct: 34 GALGCENNHRKVWQHLTRHNTDWLVVLEDDAIPCNNFRDQLHAALAVAPS-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--------------- 169 ++P++ + TT +++ LL+ Sbjct: 93 ERPREYQQRIAK------------AAGTTAHWLT---CRRLLHAVGVAIHADLVPHMLNN 137 Query: 170 KNIYRPIDMDMKHWWEH-NIPSLVTEPGAVYEAI 202 +PID + W H T P + A Sbjct: 138 LPNGKPIDEAISAWARHQGHTIAYTWPSLIDHAD 171 >gi|219128501|ref|XP_002184450.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404251|gb|EEC44199.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 296 Score = 53.8 bits (128), Expect = 2e-05, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 70/232 (30%), Gaps = 56/232 (24%) Query: 37 IYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIA--------------- 81 + E + N +S R ++L+ E GC SH+ LW+ + Sbjct: 54 VQWEWDATNNARYSA--RVIPGPKILTPGERGCARSHVALWRELVHDPTHRCSDNGDARH 111 Query: 82 -----------YSPAIGAIILEDDADFSDE------------FSQLLPHLSKCDINNILI 118 ++LEDD F+ + LP + + + Sbjct: 112 SSGGSSSSTHDSENGPTMLVLEDDVSFTSHRGQSRFEKCWARAWEQLPRGNDPTWGILYL 171 Query: 119 KFDALRKK------------PKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHL 165 F + P P + +H R T Y I + A L Sbjct: 172 GFSDRGARVLLERDGARGDCPPHRRRTSHDPRHPTVHLYRPEYGFHTHAYIITQAA-AAL 230 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRK 217 L + + PID+ + IP+ E +D T E + LVR+ Sbjct: 231 LLDQLPVVGPIDVWLADNQWFGIPTFCAV--IANEGWRRDDGTYEGAPLVRQ 280 >gi|253564591|ref|ZP_04842048.1| glycoside hydrolase [Bacteroides sp. 3_2_5] gi|265764748|ref|ZP_06093023.1| glycosyltransferase [Bacteroides sp. 2_1_16] gi|251948367|gb|EES88649.1| glycoside hydrolase [Bacteroides sp. 3_2_5] gi|263254132|gb|EEZ25566.1| glycosyltransferase [Bacteroides sp. 2_1_16] gi|301161088|emb|CBW20625.1| putative glycosyltransferase [Bacteroides fragilis 638R] Length = 222 Score = 53.8 bits (128), Expect = 2e-05, Method: Composition-based stats. Identities = 20/102 (19%), Positives = 36/102 (35%), Gaps = 24/102 (23%) Query: 3 IPVYVISLPFSHARREKFCHRAA-RIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 +PVYVI+L RR+ + R+ + +AI Sbjct: 10 LPVYVINLKERTERRQHIEEQFQGRVEFALHWIEAI-----------------------E 46 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 S+ +G + S + + II EDD F+ +++ Sbjct: 47 HSIGAVGLWQSMLKAVQTAIDKRDDIMIICEDDHIFTPAYNK 88 >gi|53804240|ref|YP_113880.1| glycosyl transferase family protein [Methylococcus capsulatus str. Bath] gi|53758001|gb|AAU92292.1| glycosyl transferase, family 25 [Methylococcus capsulatus str. Bath] Length = 239 Score = 53.4 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 47/139 (33%), Gaps = 23/139 (16%) Query: 6 YVISLPFSHARREKFCHRAARIHLQ-----FSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 Y+I+LP RR + R + FF AI R Sbjct: 8 YIINLPERRDRRREMEKELERAGMPPMPGRVEFFPAI----------------RPESAGP 51 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL--SKCDINNILI 118 S+ GC++SH+ + + +ILEDD F+ +F ++ L + Sbjct: 52 FPSIGAHGCFLSHLGILRAAKAQGLRNVLILEDDLTFAAQFPEVEEKLVGELRETPWDFA 111 Query: 119 KFDALRKKPKKDSYLCTLP 137 F + + P+ L Sbjct: 112 FFGHVLEFPEVPPDRAHLE 130 >gi|328697541|ref|XP_001943906.2| PREDICTED: glycosyltransferase 25 family member-like [Acyrthosiphon pisum] Length = 374 Score = 53.4 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 10/48 (20%), Positives = 20/48 (41%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSH 51 ++VI+L RR + + + L +F +A G I ++ Sbjct: 324 NIFVINLARRTERRNRMHYCLNELGLNATFIEATDGRLVIYFQNILNN 371 >gi|115523122|ref|YP_780033.1| glycosyl transferase family protein [Rhodopseudomonas palustris BisA53] gi|115517069|gb|ABJ05053.1| glycosyl transferase, family 25 [Rhodopseudomonas palustris BisA53] Length = 242 Score = 53.4 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 45/125 (36%), Gaps = 12/125 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 Y+I+LP RR+ R+ + + + S R + S+ Sbjct: 12 YLINLPDRVDRRKGAEQEFVRLGMG----------DLLHSHIEISPGVRFEEPGEFYSVG 61 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC--DINNILIKFDAL 123 G + +H ++ KR ++ EDD + + L + + N +I F + Sbjct: 62 ARGSFNAHRNVLKRAQKLNLENVLVFEDDILLKSAPAGFIEKLLQEIPETNWDVIYFGYI 121 Query: 124 RKKPK 128 + P+ Sbjct: 122 KPIPE 126 >gi|38505579|ref|NP_942200.1| hypothetical protein sll5044 [Synechocystis sp. PCC 6803] gi|38423603|dbj|BAD01814.1| unknown protein [Synechocystis sp. PCC 6803] Length = 213 Score = 53.0 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 36/106 (33%), Gaps = 20/106 (18%) Query: 21 CHRAARIHLQ--FSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWK 78 R + L +FF A+ + SL GC++SH+ + K Sbjct: 5 FQRFKILGLPKNAAFFTAVKPSDQ----------------GNFPSLGARGCFLSHLGVLK 48 Query: 79 RIAYSPAIGAIILEDDADFSDEFSQLLPHLSK--CDINNILIKFDA 122 ++LEDDA F+ + Q L + + + F Sbjct: 49 VAKAQNLEQVLLLEDDATFTKDLVQNQTQLLQELQTTSWDFVYFGH 94 >gi|318065894|ref|YP_004123930.1| gp108 [Mycobacterium phage Wee] gi|315420977|gb|ADU15978.1| gp108 [Mycobacterium phage Wee] Length = 207 Score = 53.0 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 53/158 (33%), Gaps = 4/158 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + +LEDDA + F L + ++ R Sbjct: 34 GTLGCETNHRKVWQHLTRHNTDWLTVLEDDAIPCNNFRDQLDAALAVAPS-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPHMLNNL-PNGKPIDEAISAW 151 Query: 184 WEHNIPSLV-TEPGAVYEAIDTNDSTIEESRLVRKPTF 220 H ++ T P + A +T R P Sbjct: 152 ARHQSHTIAYTWPSLIDHADETPMIATRNDNQPRPPGR 189 >gi|317158702|ref|XP_001827184.2| LPS glycosyltransferase [Aspergillus oryzae RIB40] Length = 349 Score = 53.0 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 31/239 (12%), Positives = 79/239 (33%), Gaps = 38/239 (15%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKRL 61 ++ I + + A+ L+ +F+ + N P +++ +K + + Sbjct: 59 KIFAIGFKERTDKHDAIALAASYTGLEVDWFEGVRAANIPPKAYPAVWTEEKHRDK---- 114 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----------QLLPHLSKC 111 E+G + +H++ + I + A+++EDD+D+ + L Sbjct: 115 --PAELGSWRAHMNALRHIVENKISSAVLMEDDSDWDVNIKAQMVEFARGTRALQGTEGE 172 Query: 112 DINNILIKFDALRKKPKKDSYLC------TLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 ++ + +P ++ ++ + + + + E A Sbjct: 173 PKYYVIPNDLTVTPRPHQNEFVRPPLAEDPNFESHRLIFNADNAICSWAMAFTYEGACKA 232 Query: 166 LNVRK--NIYRPIDMDMKHWWE-HNI-----PSLVTEPGAV----YEAIDTNDSTIEES 212 L I P+D + + + NI L + P + + DS I + Sbjct: 233 LTALSYVGIDEPVD--LGYNFLCTNILHVPYKCLSSHPSIMGTWAQRGPASRDSDITDG 289 >gi|255948004|ref|XP_002564769.1| Pc22g07480 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591786|emb|CAP98036.1| Pc22g07480 [Penicillium chrysogenum Wisconsin 54-1255] Length = 389 Score = 53.0 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 79/267 (29%), Gaps = 64/267 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y IS+ +R+ A+ ++ + D + + + ++ + K Sbjct: 69 IYAISMKERTDKRDFLTLAASISGIKVEWLDGV--RPHDLHSKAMPDGLDPSRVK----P 122 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-------DEFSQLLPHLSKCDI---- 113 + C+ +H++ + + A+ILEDDAD+ EF++ L L D Sbjct: 123 TVVACWRAHMNALVNVISNDYSTALILEDDADWDVSLKSQLGEFARGLHALKGTDHVSKE 182 Query: 114 -----NNILIKFDALRKKPK-------------------------------KDSYLCTLP 137 + L+ P K Y Sbjct: 183 APYGTDWDLLWIGGCASGPNANETSFYAIPMDPTVPRVHHRATWGGPTEKWKQQYPELAE 242 Query: 138 GNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDM-----KHWWEHNIPS 190 + + GY + A ++L ++ +P+D + H I Sbjct: 243 DSTRFIYRAEMGCCMFGYAVTTRGARNILAALSIDHLDKPVDDALSELCAGANGRHKIEC 302 Query: 191 LVTEPGAV----YEAIDTNDSTIEESR 213 P + + DS IE + Sbjct: 303 WDPFPNLIGTYRKAGSASRDSDIENNN 329 >gi|194303052|ref|YP_002014321.1| gp105 [Mycobacterium phage Boomer] gi|194153100|gb|ACF34167.1| gp105 [Mycobacterium phage Boomer] Length = 203 Score = 52.6 bits (125), Expect = 6e-05, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 4/140 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L ++ R Sbjct: 34 GTLGCETNHRKVWQHLTRHNTNWLVVLEDDAIPCNNFRDQLDAALAVAPT-PVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHADLVPHMLNQL-PNGKPIDEAISAW 151 Query: 184 WEH-NIPSLVTEPGAVYEAI 202 H T P + A Sbjct: 152 ARHQGHTIAYTWPSLIDHAD 171 >gi|224012881|ref|XP_002295093.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969532|gb|EED87873.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1278 Score = 52.6 bits (125), Expect = 6e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 54/195 (27%), Gaps = 55/195 (28%) Query: 35 DAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW----------------- 77 D + + P R F + R ++ E C +SHI W Sbjct: 1014 DFVKAKWRPGDLRAFDREARNDSELVRTNVTEKACSLSHIASWMGVEKTLSEANSSNANE 1073 Query: 78 ----------KRIAY--------------------SPAIGAIILEDDADFSDEF-SQLLP 106 +++ P +ILEDDA D F +L Sbjct: 1074 VGDDVASWYQQKLIRMFKIAGFARGAALSRENADMEPVPVCVILEDDAVLVDRFVERLQL 1133 Query: 107 HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 L + + R K + L I TGY + + A HL+ Sbjct: 1134 LLEELPRDFHFCSLGYSRPKS---APLIEYSSQLGIPSCLWY---LTGYILSLDGARHLI 1187 Query: 167 NVRKNIYRPIDMDMK 181 + P+D + Sbjct: 1188 K-SLPVVGPVDSWIG 1201 >gi|260464226|ref|ZP_05812419.1| hypothetical protein MesopDRAFT_5066 [Mesorhizobium opportunistum WSM2075] gi|259030029|gb|EEW31312.1| hypothetical protein MesopDRAFT_5066 [Mesorhizobium opportunistum WSM2075] Length = 247 Score = 52.6 bits (125), Expect = 6e-05, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 59/198 (29%), Gaps = 18/198 (9%) Query: 15 ARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHI 74 RR R + +Y + + + + P + + H+ Sbjct: 25 DRRRHIVREFERRGVP------VYFYTEWDRPDVTPEIRAELVAPGFVHPPAVSLALKHV 78 Query: 75 HLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKD 130 +W+ + ++ EDD DF +F Q L L D ++ L Sbjct: 79 GIWRDFLETDLPYCLVFEDDVFLARDFVAKFRQGLAELGSPDRKAVI----YLGNGSNYY 134 Query: 131 SYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDM-KHWWEHN 187 + L ++ P + + Y I + AA R I ID + E Sbjct: 135 TPSWKLRKGQRLY-PALHARCADSYLITRPAAEARCAWIARNKISTVIDHLAEQMDEELG 193 Query: 188 IPSLVTEPGAVYEAIDTN 205 + L E V + + Sbjct: 194 VEMLWFERPIVEQGSENG 211 >gi|47182078|emb|CAG14200.1| unnamed protein product [Tetraodon nigroviridis] Length = 91 Score = 52.3 bits (124), Expect = 7e-05, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 29/90 (32%), Gaps = 15/90 (16%) Query: 36 AIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR---------------I 80 A+ + + R L+ E+ ++SH +WK + Sbjct: 1 ALNSSDIKLLGVDLLPGYHDPFSGRSLTKGEVFFFLSHFFIWKEVRTSPLDRRRFPKLLM 60 Query: 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 A+I EDD F F + L L + Sbjct: 61 VDQQYDTALIFEDDVRFQANFKRRLLRLME 90 >gi|207110630|ref|ZP_03244792.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPKX_438_CA4C1] Length = 93 Score = 52.3 bits (124), Expect = 7e-05, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 27/72 (37%), Gaps = 11/72 (15%) Query: 31 FSFFDAIYGENNPIC---NRIFSHQKRQC-------QFKRLLSLPEIGCYISHIHLWKRI 80 F FDAI ++ ++ Q + L E GCY+SH LWK Sbjct: 23 FQIFDAISPKHEDFEKFVQELYDAQSMLKSDWFHSDYCYQELLPREFGCYLSHYLLWKEC 82 Query: 81 AYSPAIGAIILE 92 + +ILE Sbjct: 83 VKTNQP-IVILE 93 >gi|29565875|ref|NP_817448.1| gp110 [Mycobacterium phage Che8] gi|29424600|gb|AAN12507.1| gp110 [Mycobacterium phage Che8] Length = 211 Score = 51.9 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 53/158 (33%), Gaps = 4/158 (2%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L H + + ++ R Sbjct: 38 GALGCENNHRKVWQHLTRHNTNWLVVLEDDAIPCNNFRDQL-HAALTAAPSPVVSLYLGR 96 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 97 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGTAIHTDLVPHMLNNL-PNGKPIDEAITTW 155 Query: 184 WEH-NIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTF 220 H T P V A T R P Sbjct: 156 ARHQGHTIAYTWPSLVDHADTPPVITTRHDNQPRPPGR 193 >gi|109522187|ref|YP_655864.1| gp103 [Mycobacterium phage PMC] gi|91980887|gb|ABE67604.1| gp103 [Mycobacterium phage PMC] Length = 207 Score = 51.9 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 62/165 (37%), Gaps = 7/165 (4%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 +GC +H +W+ + ++LEDDA + F L H + + ++ R Sbjct: 34 GALGCENNHRKVWQHLTRHNTDWLVVLEDDAIPCNNFRDQL-HAALTAAPSPVVSLYLGR 92 Query: 125 KKPKKDSYLCTLPGNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 ++P++ + H G I + H+LN +PID + W Sbjct: 93 ERPREYQQRIAKAADTTAHWLTCRRLLHAVGIAIHTDLVPHMLNNL-PNGKPIDEAITTW 151 Query: 184 WEHNIPSLV-TEPGAVYEAIDT-NDSTIEESRLVRKPTFSPLYFY 226 H ++ T P + A +T T +++ +P + + Sbjct: 152 ARHQSHTIAYTWPSLIDHADETPMIDTRNDNQP--RPPGRVAWQH 194 >gi|115385098|ref|XP_001209096.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114196788|gb|EAU38488.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 375 Score = 51.5 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 74/264 (28%), Gaps = 76/264 (28%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 +Y I L +R+ A+ ++ + D S K Sbjct: 64 KIYAIGLKERTDKRDFLTLAASVHGMKVKWLDG---------KFDISFWKPTVA------ 108 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-------DEFSQLLPHLSKCD---- 112 C+ +H++ + + + A+I+EDDAD+ EF++ + LS Sbjct: 109 ----ACWRAHMNALRDVVENGYGTALIMEDDADWDVSVREQLREFARGVRTLSDNPNPSK 164 Query: 113 -----INNILI--------------KFDALRKKPKKDSYLCT----------------LP 137 N ++ A+ P S LP Sbjct: 165 DAPYGTNWDMLWVGGCASGAADNETDIYAIPNDPTTPSVKHRGTWGGPSDGWKAKHPELP 224 Query: 138 GNFDIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWEHN----IPS 190 + + R + GY + E A +L + P+D M + Sbjct: 225 EDSTRYIYRANMGCCLYGYAVTLEGARRILAALSVDYLDMPVDNAMSDLCAGKNRPHLRC 284 Query: 191 LVTEPGAV----YEAIDTNDSTIE 210 P + + DS IE Sbjct: 285 YAPFPNLIGTYKPAGGSSRDSDIE 308 >gi|308814105|ref|XP_003084358.1| unnamed protein product [Ostreococcus tauri] gi|116056242|emb|CAL58423.1| unnamed protein product [Ostreococcus tauri] Length = 395 Score = 51.5 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 22/42 (52%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR 47 + I+L RR++ + + ++ F+ +A+ G N + ++ Sbjct: 233 FYINLHHRSDRRKRIEYELQQANISFTRIEAVDGRVNSLSSK 274 >gi|148555843|ref|YP_001263425.1| hypothetical protein Swit_2935 [Sphingomonas wittichii RW1] gi|148501033|gb|ABQ69287.1| hypothetical protein Swit_2935 [Sphingomonas wittichii RW1] Length = 253 Score = 51.5 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 42/119 (35%), Gaps = 11/119 (9%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 YV++LP RR + AR+ F ++P + + + R S+ Sbjct: 20 YVVNLPDRADRRREMAGELARLGAGF---------DDPRVRLFAAERPAEAGPFR--SIG 68 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 G ++S + + + ++LEDD DF + LP L F Sbjct: 69 ARGAFLSQLAVLREARERGDRRILMLEDDCDFVRAIGRRLPPLLDRLDAEGFGLFYGGH 127 >gi|326433099|gb|EGD78669.1| hypothetical protein PTSG_01648 [Salpingoeca sp. ATCC 50818] Length = 648 Score = 51.5 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 67/246 (27%), Gaps = 40/246 (16%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y+ S P S R + F + G + + LS Sbjct: 407 IYLPSRPTSVMRATNLTRWLDVFGFEARMFAGVDGPDTLADAAKLTS---------PLSA 457 Query: 65 PEIGCYISHIHLWKRIAYSPAIGA-IILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 EIG S+ L I G + LEDDA S F++ Sbjct: 458 AEIGVRESYKSLLADILRGNGDGLHLFLEDDAQLSPNFAE---------------GLSRR 502 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHW 183 + P+ YL PG + + T L + ID +MK Sbjct: 503 LRSPRCGGYLRVAPGGALLLGGTVWRNGTY---------PKLSTHTSGWKQ-IDDEMKRH 552 Query: 184 WEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW--- 240 N A E + RLV +P Y R W A Sbjct: 553 GGDNKLCYNV--AAATEGTFAWLADTATLRLVYSWLVNPSYADRPASQMWTQLAEAGVSV 610 Query: 241 RKDLPP 246 R LPP Sbjct: 611 RALLPP 616 >gi|331251740|ref|XP_003338459.1| hypothetical protein PGTG_20034 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309317449|gb|EFP94040.1| hypothetical protein PGTG_20034 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 391 Score = 51.1 bits (121), Expect = 1e-04, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 41/135 (30%), Gaps = 11/135 (8%) Query: 23 RAARIHLQFSFFDAI---YGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 + +H F DAI RI ++ K + + C +SH Sbjct: 117 QLDNLH----FVDAILYSDPRVLDTVQRIGNNTKADKKVAE---GGHVACRMSHRLAIGA 169 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDALRKKPKKDSYLCTLPG 138 +IL+DD D F L L + +I F K + Sbjct: 170 ADADDNKITLILKDDVDIEAAFKYLAGTILRDVPKDWDIIFFGHTDFSNKARNGSDPSTH 229 Query: 139 NFDIHQPRILSPRTT 153 NF I++ T+ Sbjct: 230 NFYIYKSVKPQVSTS 244 >gi|302511231|ref|XP_003017567.1| LPS glycosyltransferase, putative [Arthroderma benhamiae CBS 112371] gi|291181138|gb|EFE36922.1| LPS glycosyltransferase, putative [Arthroderma benhamiae CBS 112371] Length = 426 Score = 51.1 bits (121), Expect = 1e-04, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 44/122 (36%), Gaps = 22/122 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN-------NPICNRIFSHQKRQCQ 57 +YV++LP RR+ AA ++ + D I GEN P ++ F Sbjct: 70 IYVVNLPSRTDRRDAIVLLAAVTGIKLHWVDGIRGENVVDKALPPPATHQYFKAANIGSW 129 Query: 58 FKRLLSLPEIG----------CYISHIHLWK--RIAYSPAIGAIILEDDADFSDEFSQLL 105 L +L E C +++ I + A+I EDDAD+ L Sbjct: 130 RGHLDALSEYSPAPPPPPPSNCTT---LIFEISSIVENNINTALIFEDDADWDIRIKSQL 186 Query: 106 PH 107 Sbjct: 187 RD 188 >gi|259482300|tpe|CBF76650.1| TPA: LPS glycosyltransferase, putative (AFU_orthologue; AFUA_8G00650) [Aspergillus nidulans FGSC A4] Length = 389 Score = 51.1 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/317 (12%), Positives = 86/317 (27%), Gaps = 73/317 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I L + + A+ + + + + + + + + L Sbjct: 66 IYAIGLKERTDKHDSLTLAASATGMNVEWLEGV--RPGDVSEKALPQGMDNPEIPYAVVL 123 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL---------------- 108 C+ +H++ +R+ + A+I+EDDAD+ L + Sbjct: 124 ----CWRAHMNALRRVVENKYTTALIMEDDADWDVTIRPQLREIARGVRELSSNQKAPRK 179 Query: 109 SKCDINNILIKFD---ALRKKPKKDSYLCTL-------------------PGNFDIHQPR 146 N L+ L+ K + D Y+ ++ P Sbjct: 180 EPYGTNWDLLWIGGCATLQDKNETDLYMIPDDPTTPSLGHRGPWQGPIGPHEDWRNQHPE 239 Query: 147 IL------------SPRTTGYFIGKEAAIHLLNV-RKNIYRPIDMDMKHWWEHN-----I 188 + GY + + A +L +D + I Sbjct: 240 VSMDSTRFVYRAGHGCCLYGYAVTYQGARKILAALSLERSAEVDNSLGDLCGGKNGRQQI 299 Query: 189 PSLVTEPGAV----YEAIDTNDSTIEESRL---VRKPTFSPLYFYRNTCYQW----NLHY 237 P + + + S I+ V +++ +Y R + Y Sbjct: 300 RCFGVYPNVIGTFKPAGLSSRHSDIQNYSTTDWVNAESWNMVYSTRMNLQRLLAGEETVY 359 Query: 238 NAWRKDLPPVSTTKFLP 254 + W P S + P Sbjct: 360 SQWNTSFVPWSKGEVKP 376 >gi|67537348|ref|XP_662448.1| hypothetical protein AN4844.2 [Aspergillus nidulans FGSC A4] gi|40740889|gb|EAA60079.1| hypothetical protein AN4844.2 [Aspergillus nidulans FGSC A4] Length = 803 Score = 51.1 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/317 (12%), Positives = 86/317 (27%), Gaps = 73/317 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +Y I L + + A+ + + + + + + + + L Sbjct: 66 IYAIGLKERTDKHDSLTLAASATGMNVEWLEGV--RPGDVSEKALPQGMDNPEIPYAVVL 123 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL---------------- 108 C+ +H++ +R+ + A+I+EDDAD+ L + Sbjct: 124 ----CWRAHMNALRRVVENKYTTALIMEDDADWDVTIRPQLREIARGVRELSSNQKAPRK 179 Query: 109 SKCDINNILIKFD---ALRKKPKKDSYLCTL-------------------PGNFDIHQPR 146 N L+ L+ K + D Y+ ++ P Sbjct: 180 EPYGTNWDLLWIGGCATLQDKNETDLYMIPDDPTTPSLGHRGPWQGPIGPHEDWRNQHPE 239 Query: 147 IL------------SPRTTGYFIGKEAAIHLLNV-RKNIYRPIDMDMKHWWEHN-----I 188 + GY + + A +L +D + I Sbjct: 240 VSMDSTRFVYRAGHGCCLYGYAVTYQGARKILAALSLERSAEVDNSLGDLCGGKNGRQQI 299 Query: 189 PSLVTEPGAV----YEAIDTNDSTIEESRL---VRKPTFSPLYFYRNTCYQW----NLHY 237 P + + + S I+ V +++ +Y R + Y Sbjct: 300 RCFGVYPNVIGTFKPAGLSSRHSDIQNYSTTDWVNAESWNMVYSTRMNLQRLLAGEETVY 359 Query: 238 NAWRKDLPPVSTTKFLP 254 + W P S + P Sbjct: 360 SQWNTSFVPWSKGEVKP 376 >gi|322702306|gb|EFY93984.1| hypothetical protein MAA_10554 [Metarhizium anisopliae ARSEF 23] Length = 800 Score = 51.1 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/127 (21%), Positives = 50/127 (39%), Gaps = 9/127 (7%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + V+ +P RR+ AA L+ +F D + GE+ + L Sbjct: 606 ILVVGMPSRTDRRDGMILGAALSQLKINFVDGVRGEDVNEKAIPVPEDRNN-----HLKG 660 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 P +G + H++ + A+I+EDD D+ + L + I LR Sbjct: 661 PVLGSWRGHMNAIHEVVRRNLSSALIMEDDVDWDVRIREQLHDFAVS----IRALTQPLR 716 Query: 125 KKPKKDS 131 ++P+K + Sbjct: 717 RQPRKYA 723 >gi|121717213|ref|XP_001276042.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1] gi|119404199|gb|EAW14616.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1] Length = 391 Score = 50.7 bits (120), Expect = 2e-04, Method: Composition-based stats. Identities = 40/266 (15%), Positives = 69/266 (25%), Gaps = 66/266 (24%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 VY I L +R+ A+ + Q + + + + + I + K Sbjct: 65 VYAIGLKERTDKRDFLALAASLVGFQIDWLEGV--KPDEIHPKALPEGMNLTIVK----P 118 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL------------------- 105 C+ +H++ + + + A+I+EDDAD+ Q L Sbjct: 119 TAAACWRAHMNALRTVVENRHTTALIMEDDADWDISLKQQLREFAHGLQTLSRTRTASKR 178 Query: 106 ----------------PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRI-- 147 + I D K SY G D + R Sbjct: 179 APYGTDWDLLWVGGCSTAAGFNETEFFAIPNDPTVPSMKHRSYWVDRRGPLDEWKERYPQ 238 Query: 148 -------------LSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMKHWWEHN----I 188 GY + E A +L ++ +D + I Sbjct: 239 LPEESTRYVYRANTGCCLYGYAVTYEGARKILASLSVDHLDIAVDNALGDMCGGKNRPQI 298 Query: 189 PSLVTEPGAV----YEAIDTNDSTIE 210 P + DS IE Sbjct: 299 QCYAPHPNLIGTHRRAGSSFKDSDIE 324 >gi|317025892|ref|XP_001388543.2| hypothetical protein ANI_1_2178014 [Aspergillus niger CBS 513.88] Length = 387 Score = 50.7 bits (120), Expect = 2e-04, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 65/245 (26%), Gaps = 63/245 (25%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++ IS RR+ ++ F D + + + S R Sbjct: 71 KIFAISPAQRLDRRDAIVLASSATGFHVDFIDGLILDEDSAKELNDSKDSETVPVSRP-- 128 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS----------QLLPHLSKCDI 113 H + +RI A+ILEDD D+ Q +P ++ + Sbjct: 129 ---------HANAVQRIVADGIASALILEDDVDWDINLKMQLKGLALGSQQIPKINAPER 179 Query: 114 N----------NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP------------- 150 N ++ K S + +P + R L P Sbjct: 180 NATTDSPYGDSWDVLWIGHCGMKCNSSSPVQVMPHDITAMPSRYLPPYLYDAPAGTGNNV 239 Query: 151 ----------RTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE-----HN--IPSLVT 193 + Y I A+ +L + D ++ HN + + Sbjct: 240 RMSCIIEKAACSAAYAITYGASQKILAALTRLT--DDDNLDFPNLLSELCHNGSLECYSS 297 Query: 194 EPGAV 198 P V Sbjct: 298 YPSLV 302 >gi|322706318|gb|EFY97899.1| hypothetical protein MAA_06682 [Metarhizium anisopliae ARSEF 23] Length = 442 Score = 50.7 bits (120), Expect = 2e-04, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 41/101 (40%), Gaps = 5/101 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + V+ +P RR+ AA L+ +F D + GE+ + L Sbjct: 108 ILVVGMPSRTDRRDGMILGAALSGLKINFIDGVKGEDVYEKAIPVPEDRNN-----HLKG 162 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 P +G + +H++ + A+I+EDD D+ + L Sbjct: 163 PGLGSWRAHMNAIHDVVRRNLSSALIMEDDVDWDVRIREQL 203 Score = 35.7 bits (81), Expect = 6.4, Method: Composition-based stats. Identities = 11/81 (13%), Positives = 22/81 (27%), Gaps = 7/81 (8%) Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDMKHWWE 185 K S + P + + + Y + + A +L K D+ ++ + E Sbjct: 307 KPFSLVEEYPAHTRVVHHAQEGVCSLAYAVSQRGAQKMLREIALKPATDAFDILLRFYCE 366 Query: 186 -----HNIPSLVTEPGAVYEA 201 L P Sbjct: 367 GTHDRTKQECLSVNPSLFSHH 387 >gi|326432628|gb|EGD78198.1| hypothetical protein PTSG_09075 [Salpingoeca sp. ATCC 50818] Length = 711 Score = 50.3 bits (119), Expect = 2e-04, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 72/261 (27%), Gaps = 61/261 (23%) Query: 6 YVISLPF---SHARREKFCHRAARIHLQFSFFDAIYG-ENNPICNRIFSHQKRQCQFK-- 59 +V+ L S AR ++ S F+ + G + ++ F+ Sbjct: 423 FVLHLAHDQGSVARAMAMAKELRAHSIKASLFNGVDGAHLDTATHQAFTSSSYLASSSAS 482 Query: 60 ----------------------------RLLSLPEIGCYISHIHLWKRI---AYSPAIGA 88 LS ++G ++ L R+ Sbjct: 483 SSAHADKGKGASSNSTSSSSSSSPSSAVHALSARDVGRRETYRLLLHRVFSATDDNDALY 542 Query: 89 IILEDDADFSDEFSQLLPHLSKCD--------INNILIKFDALRKKPKKDSYLCTLPGNF 140 +++EDD S F+Q L L + + + A + + G + Sbjct: 543 LVVEDDVVLSPVFAQRLDTLLQDPRCGSYLSSGSGGALLLGATIARNGTYPRVDAHTGGW 602 Query: 141 DIH-QPRILSPRTTGY------------FIGKEAAIHLLNVRKNIYR---PIDMDMKHWW 184 + + +P T Y + + A LL N P D ++ Sbjct: 603 AMADEEMRRTPGTMCYNFNSGVENTFAFVLERRAMQLLLTWLSNPSHANQPFDHAWRYLC 662 Query: 185 EHNIPSLVTEPGAVYEAIDTN 205 + +P V P V D Sbjct: 663 DRGMPVRVAFPYIVIPHRDAG 683 >gi|203454799|ref|YP_002225115.1| gp238 [Mycobacterium phage Myrna] gi|197311847|gb|ACH62205.1| gp238 [Mycobacterium phage Myrna] Length = 232 Score = 50.3 bits (119), Expect = 2e-04, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 57/199 (28%), Gaps = 32/199 (16%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDA 122 +G +H+ W+ +A ++LEDDA D + L + + Sbjct: 32 DGRLGAGRNHLRAWRWLARQEEEWGVVLEDDAVPVDGLPDQIHSALYVAPTPVVSLYLGR 91 Query: 123 LRKKPKKDSYLCTL-------PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNI--- 172 R + + TL + P + G I + L+ Sbjct: 92 GRPPGIQHAVQRTLGMISVDGVDPCWLVHPGLAHC--VGVAIRTDQIESLIEYM-PWHLR 148 Query: 173 -YRPIDMDMKHWWEHNIPSLV--TEPGAVYEAIDTNDSTI-------------EESRLVR 216 P D + W + LV T P V D +TI E+ R Sbjct: 149 SGMPPDEIISRWAQRGPKCLVSYTWPSLVDHN-DELPTTIVDRRSMILDEIWGEQDRREE 207 Query: 217 KPTFSPLYFYRNTCY-QWN 234 P +R +WN Sbjct: 208 DPPKLVRKAWRAERREEWN 226 >gi|328698345|ref|XP_003240618.1| PREDICTED: hypothetical protein LOC100570584, partial [Acyrthosiphon pisum] Length = 434 Score = 50.3 bits (119), Expect = 2e-04, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 31/84 (36%), Gaps = 13/84 (15%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLW----------KRIAYSPAIGAII 90 + + +KR ++ EIGC++SH +W K I+ I + Sbjct: 6 DLEFMGVRLMPGYKDPYYKRPITKGEIGCFMSHYRIWAKGSSYIPLPKHISNKKGIINVK 65 Query: 91 LEDDADFSDEFSQLLPHLSKCDIN 114 EDD F +L L + + Sbjct: 66 NEDDKCF---LWAILSALHPVEKD 86 >gi|71149110|gb|AAZ29057.1| Lgt2 [Moraxella catarrhalis] Length = 152 Score = 50.3 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 47/133 (35%), Gaps = 17/133 (12%) Query: 116 ILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR- 174 +IK + +K + L + + + T GY I ++ A +L+ + Sbjct: 12 DVIKLETWVEKIHIKKAVTVL-NHRQLCPLKTFHTGTAGYVISQQGAKIILDYLSTLDAF 70 Query: 175 ---PIDMDMKHWWEHNIPSLVTEPGAVYEA-----IDTNDSTIEESRLV-----RKPTFS 221 PID + + L P V +A DT S IE R R+ T + Sbjct: 71 EFFPIDHVIFDAIISKMSVLQVNPAMVIQAHLVSEDDTFKSLIETQRKQNVNQHRRRTLA 130 Query: 222 PL--YFYRNTCYQ 232 +YR+ + Sbjct: 131 DYGKKYYRSIGKR 143 >gi|328861529|gb|EGG10632.1| hypothetical protein MELLADRAFT_103329 [Melampsora larici-populina 98AG31] Length = 507 Score = 49.9 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 74/189 (39%), Gaps = 10/189 (5%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 +YV+SLP + RR++ + L F+F DAI +N+ + + I S K KR + Sbjct: 146 IYVLSLPNNFKRRQRIQKIGNALELNFTFIDAIE-KNSSVIDWIGSKVKEIRNKKRKILS 204 Query: 65 PEIGCYISHI---HLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFD 121 + G + S I + II E ++ LP LSK + + D Sbjct: 205 TQFGKHQSEIGGMKVGSDWLLRNEEEVIIKE----SEEKIKIKLPALSKDWVKELHQHQD 260 Query: 122 ALRK-KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGK-EAAIHLLNVRKNIYRPIDMD 179 L+ K + L + PR LS T + G+ + +L + ++ D Sbjct: 261 DLQSLYQPKFNIQSQLYDPLESKAPRQLSSGTISVWYGQTKVMRKMLENQDRSALVLEDD 320 Query: 180 MKHWWEHNI 188 + W+ Sbjct: 321 VDLEWDLGQ 329 Score = 47.2 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 21/184 (11%) Query: 41 NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE 100 I ++++ + + R LS I + + +++ + A++LEDD D + Sbjct: 269 KFNIQSQLYDPLESKA--PRQLSSGTISVWYGQTKVMRKMLENQDRSALVLEDDVDLEWD 326 Query: 101 FSQL----LPHLSKCDIN----NILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRT 152 QL L L K N ++ + G+ ++H+ Sbjct: 327 LGQLWSSVLRRLPKDHKNRQEGWEIVYLGHCWGRENSH----PQYGHPNLHRSSE-PRCL 381 Query: 153 TGYFIGKEAAIHLLNV-RKNIY---RPIDMDMKHWWEHNI--PSLVTEPGAVYEAIDTND 206 Y + +L ID+ + + + EP + + + Sbjct: 382 HAYALSHTGVQKILKYLSDPWIAYQTAIDIALPTLIKSGFLKNAFSIEPAWIIQDKTEFE 441 Query: 207 STIE 210 S ++ Sbjct: 442 SDVQ 445 >gi|220898665|gb|ACL81526.1| putative glycosyl transferase [Burkholderia contaminans] Length = 218 Score = 49.9 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 27/85 (31%), Gaps = 12/85 (14%) Query: 7 VISLPFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 I+L R E + A ++ A+ + + + Sbjct: 16 CINLDRRPDRWEAMQRKFAEQNILTVERLPAVDARLVSVPESL-----------SHMRAQ 64 Query: 66 EIGCYISHIHLWKRIAYSPAIGAII 90 + GC +SH+ K+ + A +I Sbjct: 65 DYGCTMSHLAAVKQAKAAGAREVLI 89 >gi|310795529|gb|EFQ30990.1| hypothetical protein GLRG_06134 [Glomerella graminicola M1.001] Length = 309 Score = 49.9 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 6/86 (6%) Query: 20 FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 +AA + FD + G+ + ++ R IGC+ SH++ Sbjct: 1 MLLQAALSDMDIELFDGVPGDTVSE-----KAIPKTSEYNRQ-DDGVIGCWRSHMNAIGE 54 Query: 80 IAYSPAIGAIILEDDADFSDEFSQLL 105 I +ILEDD D+ L Sbjct: 55 IVRRNLSSVLILEDDVDWDIRIRSQL 80 Score = 42.6 bits (99), Expect = 0.050, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 38/112 (33%), Gaps = 12/112 (10%) Query: 137 PGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDMKHWWEHNI------ 188 P + T GY + ++ A LL K++ P+D+ ++++ E + Sbjct: 185 PEHTRAVHHVREGVCTLGYAVSQQGARKLLQEVALKDVDDPVDLLLRYFCE-GVKGRKPH 243 Query: 189 PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYRNTCYQWNLHYNAW 240 L ++P + + K Y +W++ NA Sbjct: 244 KCLTSQPAFFHHHRAAGPMSSHSDIRNYKGFRETGMTY---MVRWSVRLNAD 292 >gi|159130165|gb|EDP55279.1| conserved hypothetical protein [Aspergillus fumigatus A1163] Length = 398 Score = 49.9 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 68/259 (26%), Gaps = 62/259 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 VY I L +R+ A+ + Q + D + + + + K Sbjct: 71 VYAIGLKERTDKRDFLALAASIVGFQVDWLDGV--RPDEVHPKSLPQGLNMTIIK----P 124 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------------- 110 C+ +H++ + + A+I+EDDAD+ Q L ++ Sbjct: 125 TAAACWRAHMNALRTVVEKRHATALIMEDDADWDVSLKQQLREFARGLRTLSPTTTATKQ 184 Query: 111 --CDINNILI--------------KFDALRKKPK--------------------KDSYLC 134 + L+ +F A+ P K + Sbjct: 185 APYGTDWDLLWVGGCSTAAGANETQFYAIPNDPTVPGMNRRSFWVDRRGPVDQWKQRFPQ 244 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMK--HWWEHN--I 188 + GY + A +L ++ P+D + + I Sbjct: 245 LREESTRYVYRADTGCCLYGYAVTYNGARKILASLSVDHLEVPVDNALGDMCGGRNRPQI 304 Query: 189 PSLVTEPGAVYEAIDTNDS 207 P + S Sbjct: 305 QCYAPHPNLIATHRRAGSS 323 >gi|146323100|ref|XP_756112.2| conserved hypothetical protein [Aspergillus fumigatus Af293] gi|129558616|gb|EAL94074.2| conserved hypothetical protein [Aspergillus fumigatus Af293] Length = 398 Score = 49.9 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 68/259 (26%), Gaps = 62/259 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 VY I L +R+ A+ + Q + D + + + + K Sbjct: 71 VYAIGLKERTDKRDFLALAASIVGFQVDWLDGV--RPDEVHPKSLPQGLNMTIIK----P 124 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------------- 110 C+ +H++ + + A+I+EDDAD+ Q L ++ Sbjct: 125 TAAACWRAHMNALRTVVEKRHATALIMEDDADWDVSLKQQLREFARGLRTLSPTTTATKQ 184 Query: 111 --CDINNILI--------------KFDALRKKPK--------------------KDSYLC 134 + L+ +F A+ P K + Sbjct: 185 APYGTDWDLLWVGGCSTAAGANETQFYAIPNDPTVPGMNRRSFWVDRRGPVDQWKQRFPQ 244 Query: 135 TLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMK--HWWEHN--I 188 + GY + A +L ++ P+D + + I Sbjct: 245 LREESTRYVYRADTGCCLYGYAVTYNGARKILASLSVDHLEVPVDNALGDMCGGRNRPQI 304 Query: 189 PSLVTEPGAVYEAIDTNDS 207 P + S Sbjct: 305 QCYAPHPNLIATHRRAGSS 323 >gi|29566189|ref|NP_817760.1| gp83 [Mycobacterium phage Che9c] gi|29424914|gb|AAN12641.1| gp83 [Mycobacterium phage Che9c] Length = 208 Score = 49.6 bits (117), Expect = 4e-04, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 60/170 (35%), Gaps = 9/170 (5%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 +GC +H +W+ + AI+LEDDA ++F H + ++ F Sbjct: 32 DGTLGCEANHRRVWRWHRNHASEWAIVLEDDALPINDFRSH-AHAALATAPTRIVSFYLG 90 Query: 124 RKKPKKDSYLC--TLPGNFDIHQPRILSPRT---TGYFIGKEAAIHLLNVRKNIYRPIDM 178 R+ P++ L +H I +PRT Y + LL + ID Sbjct: 91 RQYPRRWQPDIENALQRAGHVHASWITAPRTLHAVAYAMHTTVLDGLLTHDS--TKAIDS 148 Query: 179 DMKHWWEH-NIPSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFSPLYFYR 227 + W + + + T P V A R+ + + +R Sbjct: 149 AITAWQQRAGVDTSYTVPSLVDHADGPTVINRRSRRMPGRTAWQVGAPHR 198 >gi|209527783|ref|ZP_03276276.1| conserved hypothetical protein [Arthrospira maxima CS-328] gi|209491770|gb|EDZ92132.1| conserved hypothetical protein [Arthrospira maxima CS-328] Length = 270 Score = 49.2 bits (116), Expect = 6e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 42/136 (30%), Gaps = 25/136 (18%) Query: 69 CYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPK 128 C ++H W+ IA + ++I+E D EF QL + L Sbjct: 54 CMLNHRQAWE-IATQQSQPSLIVEADFVPVLEFGQL-----PLPFDKELPNLGITWIYTC 107 Query: 129 KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRP------------I 176 P + S Y + +AA L+ + I I Sbjct: 108 SPQVYWVSPEGY----AEGFSTGMVAYIVSPKAAELLIKFSEKITETTQGKTYSTWDSEI 163 Query: 177 DMDMKHWWEHNIPSLV 192 D ++ HN+ + + Sbjct: 164 DTFLRS---HNLQNFI 176 >gi|83775932|dbj|BAE66051.1| unnamed protein product [Aspergillus oryzae] Length = 353 Score = 49.2 bits (116), Expect = 6e-04, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 83/263 (31%), Gaps = 62/263 (23%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNP--ICNRIFSHQKRQCQFKRL 61 ++ I + + A+ L+ +F+ + N P +++ +K + + Sbjct: 39 KIFAIGFKERTDKHDAIALAASYTGLEVDWFEGVRAANIPPKAYPAVWTEEKHRDK---- 94 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD--------- 112 E+G + +H++ + I + A+++EDD+D+ + ++ Sbjct: 95 --PAELGSWRAHMNALRHIVENKISSAVLMEDDSDWDVNIKAQMVEFARGTRALQGTEGV 152 Query: 113 ------INNILIKFDA----LRKKPK-----KDSYLCTLPGNFDIHQPR----------- 146 N ++ LR +PK D + P + +P Sbjct: 153 PFSPYGDNWDMLWLGHCGINLRGEPKYYVIPNDLTVTPRPHQNEFVRPPLAEDPNFESHR 212 Query: 147 -----ILSPRTTGYFIGKEAAIHLLNVRK--NIYRPIDMDMKHWWE-HNI-----PSLVT 193 + + E A L I P+D + + + NI L + Sbjct: 213 LIFNADNAICSWAMAFTYEGACKALTALSYVGIDEPVD--LGYNFLCTNILHVPYKCLSS 270 Query: 194 EPGAV----YEAIDTNDSTIEES 212 P + + DS I + Sbjct: 271 HPSIMGTWAQRGPASRDSDITDG 293 >gi|317032680|ref|XP_001394212.2| hypothetical protein ANI_1_1700094 [Aspergillus niger CBS 513.88] Length = 386 Score = 49.2 bits (116), Expect = 6e-04, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 55/205 (26%), Gaps = 47/205 (22%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++ I+LP +R+ ++ D + G I + + Sbjct: 60 KIFAINLPSRVDKRDNLVLGSSVSGFN---IDWVNG----ITPDEVDPRTYPYNWNYDHK 112 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH---------------- 107 E G +H++ +RI + A+I+EDD D+ L Sbjct: 113 PTEYGARRAHVNAMQRIVHERIGSAMIMEDDVDWDVNLKTQLQSFALGVRALQGADKKVT 172 Query: 108 LSKCDINNILIKF------------------DALRKKPKKDSYLCTLPGNFDIHQPRILS 149 S + +I D P P F I L+ Sbjct: 173 ASPYGDDWDIIWLGHCGMECRTNEPYFVSSNDTTVLPPHHFLPYWRDPPPFGIPDYTRLT 232 Query: 150 P------RTTGYFIGKEAAIHLLNV 168 + Y + A +L+ Sbjct: 233 CAVNDAVCSIVYAVSYHGAQKILSA 257 >gi|119482445|ref|XP_001261251.1| hypothetical protein NFIA_093170 [Neosartorya fischeri NRRL 181] gi|119409405|gb|EAW19354.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181] Length = 398 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 75/259 (28%), Gaps = 62/259 (23%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 VY I L +R+ A+ + Q + D + + + + + K ++ Sbjct: 71 VYAIGLKERTDKRDFLALAASIVGFQVDWLDGV--KPDEVHPKSLPQGMNMTIIKPTVA- 127 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-------------- 110 C+ +H++ + + A+I+EDDAD+ Q L ++ Sbjct: 128 ---ACWRAHMNALRTVVEKRHATALIMEDDADWDVSLKQQLREFARGLRTLSLTKAASKQ 184 Query: 111 --CDINNILI--------------KFDALRKKPK-----KDSYLCTLPGNFDIHQPRI-- 147 + L+ +F A+ P + S+ G D + R Sbjct: 185 APYGTDWDLLWVGGCSTAAGVNETQFYAIPNDPTVPGMNRRSFWVDRRGPLDRWKHRFPQ 244 Query: 148 -------------LSPRTTGYFIGKEAAIHLLNVR--KNIYRPIDMDMK--HWWEHN--I 188 GY + A +L ++ P+D + + I Sbjct: 245 LPEESTRYVYRADTGCCLYGYAVTYNGARKILASLSVDHLENPVDNALGDMCGGRNRPQI 304 Query: 189 PSLVTEPGAVYEAIDTNDS 207 P + S Sbjct: 305 QCYAPHPNLIATHRRAGSS 323 >gi|322711754|gb|EFZ03327.1| hypothetical protein MAA_00401 [Metarhizium anisopliae ARSEF 23] Length = 368 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. Identities = 21/101 (20%), Positives = 40/101 (39%), Gaps = 5/101 (4%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + V+ +P RR+ AA L+ +F D + G++ + L Sbjct: 61 ILVVGMPSRTDRRDGMILGAALSELKINFVDGVRGDDVNEKAIPVPKDRNN-----HLKG 115 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 P +G + H++ + A+I+EDD D+ + L Sbjct: 116 PVLGSWRGHMNAIHEVVRRNLSSALIMEDDVDWDVRIREQL 156 >gi|207093459|ref|ZP_03241246.1| lipooligosaccharide 5G8 epitope biosynthesis-associated protein (lex2B) [Helicobacter pylori HPKX_438_AG0C1] Length = 77 Score = 48.8 bits (115), Expect = 9e-04, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 29/80 (36%), Gaps = 5/80 (6%) Query: 159 KEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESRLVRKP 218 + A + P+D M + H + +LV +P + A D STI P Sbjct: 1 PKIAKVFKKCSRKWVVPVDTIMDATFIHGVKNLVLQPFVI--ADDEQISTIARKEEPYNP 58 Query: 219 TFSPLYFYRNTCYQWNLHYN 238 + R +++ ++ Sbjct: 59 K---IALMRELHFKYLKYWQ 75 >gi|29566322|ref|NP_817889.1| gp38 [Mycobacterium phage Corndog] gi|29425047|gb|AAN01970.1| gp38 [Mycobacterium phage Corndog] Length = 206 Score = 48.4 bits (114), Expect = 9e-04, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 2/49 (4%) Query: 65 PEIGCYISHIHLWKRIAY--SPAIGAIILEDDADFSDEFSQLLPHLSKC 111 E+GC +H +W+ +A ++LEDDA D F L + Sbjct: 33 GELGCAQNHAKVWRTLAALSEGHSHCVVLEDDAVPVDGFRDQLDAALEA 81 >gi|322694421|gb|EFY86251.1| hypothetical protein MAC_07705 [Metarhizium acridum CQMa 102] Length = 330 Score = 48.4 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 21/96 (21%), Positives = 38/96 (39%), Gaps = 5/96 (5%) Query: 10 LPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGC 69 +P RR+ AA L+ +F D + G + + L+ P +G Sbjct: 1 MPSRTDRRDGMMLGAALSGLEINFIDGVRGGDVHEKAIPVPEDRNN-----HLNGPGLGS 55 Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 + +H++ + A+I+EDD D+ Q L Sbjct: 56 WRAHMNAIHDVVRRNLSSALIMEDDVDWDVRIRQQL 91 Score = 36.5 bits (83), Expect = 4.3, Method: Composition-based stats. Identities = 11/81 (13%), Positives = 23/81 (28%), Gaps = 7/81 (8%) Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--VRKNIYRPIDMDMKHWWE 185 K S + P + + + Y + + A +L K + D+ ++ + E Sbjct: 195 KPFSLVEEYPAHTRVVHHAQEGVCSLAYAVSQRGAQKMLREIALKPVTDAFDILLRFYCE 254 Query: 186 -----HNIPSLVTEPGAVYEA 201 L P Sbjct: 255 GTHGRTKQECLSINPSLFSHH 275 >gi|302851334|ref|XP_002957191.1| hypothetical protein VOLCADRAFT_107575 [Volvox carteri f. nagariensis] gi|300257441|gb|EFJ41689.1| hypothetical protein VOLCADRAFT_107575 [Volvox carteri f. nagariensis] Length = 424 Score = 48.4 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 30/180 (16%) Query: 43 PICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWK----RIAYSPAIGA-------IIL 91 +F +++ +G SH+ + + A ++ Sbjct: 246 SAAAPLFDDEEKVINLSY------LGTAASHLLAVQNWTMEMVKQGVPPAERTNRHLLLF 299 Query: 92 EDDADFSDEFSQLLP-HLSKCDINNILIKFDALRKKPKKDSYLCTLPGNF--------DI 142 EDDA + E L L D ++ D+ S+ L F + Sbjct: 300 EDDAVVTPETVTALQGVLQDLDPCYDVVGLDSTDNFCTFSSFTTALLDTFLLRKSPSSRL 359 Query: 143 HQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN--IPSLVTEPGAVYE 200 R+ R TG + + A++LL+ + R ID+ + + + V P V Sbjct: 360 VPARMSYSRNTGLVMSYKGAMNLLSGL-PVTREIDLWFRDL-MTDQVLKIFVICPRIVGP 417 Score = 40.3 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 16/111 (14%) Query: 88 AIILEDDADFSDEFSQLLP-HLSKCDINNILIKFDALRKKPKKDSYL--------CTLPG 138 II EDDA + + L L K D + ++ D+ L PG Sbjct: 44 LIIFEDDAVVTTDTVAALQSALPKLDPSFDILALDSAGSFCSLSCRLDWVMRMLGLHTPG 103 Query: 139 NF-DIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNI 188 + + + RI RTTG + + A+ L I R D WE ++ Sbjct: 104 SLPRLIRARISFSRTTGVVLSYKGAVQLFRYL-PITRVSD-----LWERDL 148 >gi|48243726|gb|AAT40835.1| putative Lex2B [Haemophilus influenzae] Length = 45 Score = 48.0 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 5/41 (12%) Query: 4 PVYVISLPFSHARREKFCHRAARI---HL--QFSFFDAIYG 39 P+++I+L S R+ + ++ +L + FFDAIYG Sbjct: 5 PIFIINLEKSTDRKAYMQAQFEQLFSNNLIQEIHFFDAIYG 45 >gi|29566510|ref|NP_818076.1| gp111 [Mycobacterium phage Che9d] gi|29425235|gb|AAN08021.1| gp111 [Mycobacterium phage Che9d] Length = 200 Score = 48.0 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 27/71 (38%), Gaps = 1/71 (1%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDAL 123 +GC +H +W +A A++LEDDA + F L ++ Sbjct: 33 DGTLGCEGNHRKVWNALAQQHTDWAVVLEDDAVPCNNFRAQLTQALAVAPT-PIVSLYLG 91 Query: 124 RKKPKKDSYLC 134 R++P + Sbjct: 92 RQRPYAYQHRI 102 >gi|29349688|ref|NP_813191.1| putative glycosyltransferase [Bacteroides thetaiotaomicron VPI-5482] gi|29341598|gb|AAO79385.1| glycoside transferase family 4 [Bacteroides thetaiotaomicron VPI-5482] Length = 577 Score = 48.0 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 22/103 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +PVYVI++ R++ +F + + +PI Sbjct: 374 LPVYVINMRERVERKQHITKEFDNKE-EFE-LNWVEASAHPI------------------ 413 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 +G + S I + K +I EDD F++ +S L Sbjct: 414 --GAVGLWNSMIKIIKMAKEKGDDIIVICEDDHYFTENYSPKL 454 >gi|253569945|ref|ZP_04847354.1| glycoside transferase family 4 [Bacteroides sp. 1_1_6] gi|251840326|gb|EES68408.1| glycoside transferase family 4 [Bacteroides sp. 1_1_6] Length = 544 Score = 48.0 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 22/103 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +PVYVI++ R++ +F + + +PI Sbjct: 341 LPVYVINMRERVERKQHITKEFDNKE-EFE-LNWVEASAHPI------------------ 380 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 +G + S I + K +I EDD F++ +S L Sbjct: 381 --GAVGLWNSMIKIIKMAKEKGDDIIVICEDDHYFTENYSPKL 421 >gi|298383950|ref|ZP_06993511.1| glycosyltransferase [Bacteroides sp. 1_1_14] gi|298263554|gb|EFI06417.1| glycosyltransferase [Bacteroides sp. 1_1_14] Length = 544 Score = 47.6 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 22/103 (21%) Query: 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +PVYVI++ R++ +F + + +PI Sbjct: 341 LPVYVINMRERVERKQHITKEFDNKE-EFE-LNWVEASAHPI------------------ 380 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 +G + S I + K +I EDD F++ +S L Sbjct: 381 --GAVGLWNSMIKIIKMAKEKGDDIIVICEDDHYFTENYSPKL 421 >gi|34863501|gb|AAQ82931.1| putative N-acetyl glucosaminyl transferase [Raoultella terrigena] Length = 1106 Score = 46.9 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 47/121 (38%), Gaps = 30/121 (24%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 IP +SL + RR+ F + FSFFD I + + Sbjct: 891 DIPRLCLSLSETPVRRKAFFKSPSH---GFSFFDGI-----------------RYRIGW- 929 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADF----SDEFSQLLPHLSKCDINNIL 117 IGC +S+ ++ R+ S A II EDD F + ++++ HL +I + Sbjct: 930 -----IGCGMSYKYMLSRMLASRAKMGIICEDDVIFPLDYDQKLNKIIDHLQATEIKWHI 984 Query: 118 I 118 Sbjct: 985 F 985 >gi|284054217|ref|ZP_06384427.1| hypothetical protein AplaP_22453 [Arthrospira platensis str. Paraca] gi|291569785|dbj|BAI92057.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 270 Score = 46.9 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 41/136 (30%), Gaps = 25/136 (18%) Query: 69 CYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPK 128 C ++H W+ IA + ++I+E D EF QL + L Sbjct: 54 CMLNHRQAWE-IATQQSQPSLIVEADFVPVLEFGQL-----PLPFDKELPNLGITWIYTC 107 Query: 129 KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN------------IYRPI 176 P + S Y + +AA L+ + I Sbjct: 108 SPQVYWVSPEGY----AEGFSTGMVAYIVSPKAAEILIKFSEKITEKTQGKTYSTWDSEI 163 Query: 177 DMDMKHWWEHNIPSLV 192 D ++ HN+ + + Sbjct: 164 DTFLRS---HNLQNFI 176 >gi|323344098|ref|ZP_08084324.1| glycosyltransferase [Prevotella oralis ATCC 33269] gi|323094827|gb|EFZ37402.1| glycosyltransferase [Prevotella oralis ATCC 33269] Length = 224 Score = 46.9 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 40/103 (38%), Gaps = 22/103 (21%) Query: 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKR 60 + IPV++I+L R + ++ +F + +C K Sbjct: 5 ISIPVFIINLKEHSDRLKHVLNQFEGKK-EFDI-----------------YIVERCNHK- 45 Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ 103 ++G + S + + K + S II EDD F++ +++ Sbjct: 46 ---SGDMGLWQSIVKIIKEVYNSDDDLIIICEDDHTFTEHYNR 85 >gi|119605030|gb|EAW84624.1| glycosyltransferase 25 domain containing 1, isoform CRA_d [Homo sapiens] Length = 189 Score = 46.5 bits (109), Expect = 0.004, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 17/99 (17%) Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDMDMK 181 RK+ + + +P ++ + S T Y I + A LL + + P+D + Sbjct: 36 RKRMQVEHPEKAVPRVRNLVEA-DYSYWTLAYVISLQGARKLL-AAEPLSKMLPVDEFLP 93 Query: 182 -------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 H+ N+ + EP +Y T D Sbjct: 94 VMFDKHPVSEYKAHFSLRNLHAFSVEPLLIYPTHYTGDD 132 >gi|145345832|ref|XP_001417403.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144577630|gb|ABO95696.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 488 Score = 46.5 bits (109), Expect = 0.004, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 41/139 (29%), Gaps = 24/139 (17%) Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL----LPHLSKCDINNILIKFDAL 123 GC ++H+ +W R+ S ++ E D + L L D + + +K D Sbjct: 185 GCLLTHMAVWNRVIDSDDDTFVLWESDGATLNAVHPLDYPKLREHVPDDFDIVWLKPDDH 244 Query: 124 RKKPKKDSYLCT------------LPGNFDIHQPRI----LSPRTTGYFIGKEAAI---H 164 + L D++ + T Y + ++ A Sbjct: 245 SNGQLIKRFKSAAKGLWDPTMDSRLINGNDVYLYKFDKACNWAGTPSYMMSRKGAKKIMK 304 Query: 165 LLNVRKNIYRPIDMDMKHW 183 + + ID + Sbjct: 305 FIKEAEFADM-IDAWLSRH 322 >gi|169608143|ref|XP_001797491.1| hypothetical protein SNOG_07138 [Phaeosphaeria nodorum SN15] gi|160701573|gb|EAT85789.2| hypothetical protein SNOG_07138 [Phaeosphaeria nodorum SN15] Length = 404 Score = 46.1 bits (108), Expect = 0.005, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 56/153 (36%), Gaps = 31/153 (20%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++V++LPF RR+ AA +++ F D + GE+ +Q + Sbjct: 37 KIFVLNLPFRTDRRDAMVLSAAISNIKVDFVDGVTGESI-----------KQSAYP---- 81 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC-----DINNILI 118 PE + + P I A ILEDD D+ Q L + N++ Sbjct: 82 PPE-----------ENLKLLPGIRAFILEDDVDWDIRVRQNLQRFALASRVLSSSQNVIT 130 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPR 151 L+ + + + T F I + L R Sbjct: 131 TSSNLQSRVEYRANTETEESAFQIIDTKNLPKR 163 >gi|255035474|ref|YP_003086095.1| hypothetical protein Dfer_1689 [Dyadobacter fermentans DSM 18053] gi|254948230|gb|ACT92930.1| hypothetical protein Dfer_1689 [Dyadobacter fermentans DSM 18053] Length = 459 Score = 46.1 bits (108), Expect = 0.005, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 39/106 (36%), Gaps = 26/106 (24%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQF--SFFDAIYGENNPICNRIFSHQKRQCQFK 59 I YV++L RR + + +F AI + Sbjct: 5 KIVTYVVNLRNRVDRRAHILNEFNKRG-EFAVEIVPAIEQK------------------- 44 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL 105 L +G +++ ++ + A + A +I EDD F+ E+S+ L Sbjct: 45 ----LGSVGLWMTIRYIVENAAKTDAEYILICEDDHQFTGEYSRSL 86 >gi|323446150|gb|EGB02429.1| hypothetical protein AURANDRAFT_68890 [Aureococcus anophagefferens] Length = 520 Score = 46.1 bits (108), Expect = 0.005, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 52/170 (30%), Gaps = 30/170 (17%) Query: 35 DAIYG---ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHI-----HLWKRIAYSPAI 86 DA+ + N + + + R + E+ C SH+ + + Sbjct: 44 DALDALVDAWHAGGNAAVARSALERYYGRGVLPAEVACARSHVAAWRRAAARFADDAALE 103 Query: 87 GAIILEDDA--------------DFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 ++LEDDA + ++ L+ L+K + + + R + D Sbjct: 104 KVLVLEDDAVPCLGPVRAADWAGVYGAAWAALVARLAKTTLPSDVDLLYVGRHRLGPDGG 163 Query: 133 LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR---PIDMD 179 + P S Y + + A L + N R P D Sbjct: 164 PAGA-----VCYPAGFSSCLHAYALTRRGAARLDAISGNFLRRVVPADDF 208 >gi|115531767|ref|YP_784293.1| IMV envelope protein [Crocodilepox virus] gi|115521120|gb|ABJ08994.1| IMV envelope protein [Crocodilepox virus] Length = 278 Score = 45.7 bits (107), Expect = 0.007, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 49/155 (31%), Gaps = 21/155 (13%) Query: 73 HIHLWKRIAYSPA-----IGAIILEDDADFSDEFSQLLP----HLSKCDINNILIKFDAL 123 H+ L++RI ++LED +F++ L + I+ + + D Sbjct: 63 HLGLYERILNGKPQQVADPAFVVLEDLVTPRPDFAKTLQPICDAMKANSIDIVYLTTDYF 122 Query: 124 RKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH----LLNVRKNIYRPIDMD 179 + + + +++ + GY I L +D+D Sbjct: 123 -SRVNQPPIAMAAAPAYKLYRAGS-ALSLAGYVITLAGMKKVRDHFLANPVTTSLNVDLD 180 Query: 180 MKHWWEHNIPSLVTEPGAVY-----EAIDTNDSTI 209 + H I V +P V I+ S + Sbjct: 181 VNAARAH-IARAVLDPALVSIDRVAPDIENKGSFL 214 >gi|145347423|ref|XP_001418166.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144578395|gb|ABO96459.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 591 Score = 45.7 bits (107), Expect = 0.007, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 37/116 (31%), Gaps = 10/116 (8%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDA----DFSDEFSQLLPHLSKCDINNILIKFDA 122 IGC H+ W+ A + A +LE DA F + +I FD Sbjct: 417 IGCLFGHMFQWQLAADANDKHAFMLESDAFDSSIVGMPFKDMQTLADNAPKEYDIIFFDQ 476 Query: 123 LRKKPKKDSYLCTLP--GNFDIHQPRILSPRTTG----YFIGKEAAIHLLNVRKNI 172 R ++ G +I+ R+ S T G + + L + Sbjct: 477 PRDMTFDEAPEHKFKDAGGSEIYLFRLKSKNTQGGLSASLVSRSFYPKLFKHLSHF 532 >gi|242212164|ref|XP_002471917.1| predicted protein [Postia placenta Mad-698-R] gi|220729015|gb|EED82897.1| predicted protein [Postia placenta Mad-698-R] Length = 723 Score = 45.3 bits (106), Expect = 0.008, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 23/59 (38%), Gaps = 1/59 (1%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +YV+SLP RR + L +++ DA+ +P + +R L Sbjct: 551 NIYVVSLPHRTDRRAAMERLKQALELDWTYVDAV-PSGDPSIVDVLERVRRTRFPPGEL 608 >gi|308805178|ref|XP_003079901.1| unnamed protein product [Ostreococcus tauri] gi|116058358|emb|CAL53547.1| unnamed protein product [Ostreococcus tauri] Length = 510 Score = 44.9 bits (105), Expect = 0.010, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 40/146 (27%), Gaps = 15/146 (10%) Query: 49 FSHQKRQCQFKRLLSLP------EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF- 101 F + + L IGC SH+ +W+ + II E D ++ Sbjct: 230 FDNLEVAKSHGGTLPRGLGFLSHHIGCLFSHMRMWRLHQKNLNKWTIIYESDGWWNQHVN 289 Query: 102 -SQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG---YFI 157 + L + + +I + + G Y I Sbjct: 290 GASLQSVVDNAPEHADVIFLHPNSPASGQFVKQWPSGKGMNYMYQYNKVLGAAGLQAYMI 349 Query: 158 GKEAAIH---LLNVRKNIYRPIDMDM 180 G + L++ + +D + Sbjct: 350 GPQFTEKILDLIHYLRGADM-VDAWL 374 >gi|207110975|ref|ZP_03245137.1| putative lipopolysaccharide biosynthesis protein [Helicobacter pylori HPKX_438_CA4C1] Length = 44 Score = 44.9 bits (105), Expect = 0.012, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 1/45 (2%) Query: 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 +W++ ILEDD ++F + L L K ++ Sbjct: 1 MWQKCIELNE-AICILEDDITLKEDFKEGLDFLEKHIQELGYVRL 44 >gi|227537840|ref|ZP_03967889.1| possible glycosyltransferase [Sphingobacterium spiritivorum ATCC 33300] gi|227242454|gb|EEI92469.1| possible glycosyltransferase [Sphingobacterium spiritivorum ATCC 33300] Length = 216 Score = 44.9 bits (105), Expect = 0.013, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 39/111 (35%), Gaps = 24/111 (21%) Query: 2 PIPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 IPVY+I+L R+ + +F F + I Sbjct: 5 KIPVYIINLKERIDRQTHILNEFKDKP-EFEIF---------LFEAIRDA---------- 44 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSD--EFSQLLPHLSK 110 + +G + S + K II EDD F++ ++QLL ++ + Sbjct: 45 --IGAVGLWKSITAVVKLAKEKGEDLIIICEDDHQFTENYNYNQLLKNIER 93 >gi|116193081|ref|XP_001222353.1| hypothetical protein CHGG_06258 [Chaetomium globosum CBS 148.51] gi|88182171|gb|EAQ89639.1| hypothetical protein CHGG_06258 [Chaetomium globosum CBS 148.51] Length = 310 Score = 44.9 bits (105), Expect = 0.013, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 31/76 (40%), Gaps = 6/76 (7%) Query: 30 QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAI 89 Q F D + G++ P + K L IG + +H++ + I + A+ Sbjct: 2 QLEFIDGVMGKDVP------DKAVPMAKGKERLRDASIGSWRAHMNAVREIVHRNLSSAL 55 Query: 90 ILEDDADFSDEFSQLL 105 I+EDD D+ L Sbjct: 56 IMEDDVDWDVRIKDQL 71 >gi|255949826|ref|XP_002565680.1| Pc22g17710 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592697|emb|CAP99059.1| Pc22g17710 [Penicillium chrysogenum Wisconsin 54-1255] Length = 387 Score = 44.6 bits (104), Expect = 0.013, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 58/210 (27%), Gaps = 52/210 (24%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 V+ I+L +R+ ++ + + D + + + + + + + Sbjct: 63 KVFAINLISRPDKRDNIILGSSLSNFHVEWIDGVTP--DEVDAKSYPYNWNHGHKQA--- 117 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS-------DEFSQLLPHLSKCDI--- 113 E +H++ +RI AII+EDD D+ F+ + L D Sbjct: 118 --EYAARRAHLNGLQRIVQERLSSAIIMEDDTDWDVSIKAQLQSFAHAVRALQPQDSRNV 175 Query: 114 --------NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP--------------- 150 + ++ + D P + I P P Sbjct: 176 PTQSPYGDDWDILWLGHCGVSCRTDMPFYLTPNDPTIPMPHHFLPLWRDPPTFEGYERPD 235 Query: 151 -----RTTG-------YFIGKEAAIHLLNV 168 T Y + A +L Sbjct: 236 HSRLACTASDAVCSMLYAVSYSGAQRILAA 265 >gi|170717809|ref|YP_001784871.1| hypothetical protein HSM_1551 [Haemophilus somnus 2336] gi|168825938|gb|ACA31309.1| hypothetical protein HSM_1551 [Haemophilus somnus 2336] Length = 71 Score = 44.6 bits (104), Expect = 0.014, Method: Composition-based stats. Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 6/71 (8%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 ++ IS + RR + ++ + +F+DAI + L+ Sbjct: 4 IF-ISDKNNDIRRHHIENETKKLGITPNFYDAIMARDLSKEELSTLTIPNTF-----LTP 57 Query: 65 PEIGCYISHIH 75 EI C SH+ Sbjct: 58 GEICCAKSHLE 68 >gi|331666538|ref|ZP_08367412.1| glycosyl transferase, group 2 family [Escherichia coli TA271] gi|331065762|gb|EGI37646.1| glycosyl transferase, group 2 family [Escherichia coli TA271] Length = 838 Score = 44.6 bits (104), Expect = 0.015, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 52/167 (31%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ R+ + + Q R L Sbjct: 289 VVVVNLDKRPDRLQQIREELALLHIP-----------PEKITRLAACEDENGQRGRRL-- 335 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 336 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 387 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A L Sbjct: 388 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQL 430 >gi|148238426|ref|YP_001223813.1| glycosyltransferase [Synechococcus sp. WH 7803] gi|147846965|emb|CAK22516.1| Possible glycosyltransferase [Synechococcus sp. WH 7803] Length = 279 Score = 44.6 bits (104), Expect = 0.015, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 42/124 (33%), Gaps = 5/124 (4%) Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 H+ K + +ILEDDA S + L +++I + ++ Sbjct: 129 HLSAMKDFISTDHQYCLILEDDACVSSHCDKPLADRIASCVDSISHLGEGFFDVSDSLNF 188 Query: 133 LCTLPGNFDIHQPRILSPR----TTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE-HN 187 + + + Y + ++ A ++N ++ PID + + + Sbjct: 189 RPQVENKNQASNFIEMGEGQTRCASSYIVTRKVAETIINDSSSLLLPIDWHLSYLLSANK 248 Query: 188 IPSL 191 I + Sbjct: 249 IRTF 252 >gi|159483797|ref|XP_001699947.1| hypothetical protein CHLREDRAFT_186807 [Chlamydomonas reinhardtii] gi|158281889|gb|EDP07643.1| predicted protein [Chlamydomonas reinhardtii] Length = 537 Score = 44.6 bits (104), Expect = 0.015, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 56/163 (34%), Gaps = 29/163 (17%) Query: 67 IGCYISHIHL---WK-RIAYSPAIG-------AIILEDDADFS-DEFSQLLPHLSKCDIN 114 +G SH+ W +A +++EDDA + + + L L + D Sbjct: 147 LGIAASHVQAVSNWTVDMARRGVPPSERADRHLLLIEDDAVMTTEGIAALGRVLKQLDSC 206 Query: 115 NILIKFDALRKKPKKDSYLCTLPGNF---------DIHQPRILSPRTTGYFIGKEAAIHL 165 ++ D+ + L F + + ++ RTTG + A+ L Sbjct: 207 YDMVALDSKHNYCALQRWQDRLTSWFLPAAWRTSPHLVRVKLAFSRTTGLLFSYKGAVRL 266 Query: 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 L+ + R +D W ++ + V+ I S+ Sbjct: 267 LSNA-PVTREVD-----LWYRDLATEGL--FDVFRHIVIETSS 301 >gi|145635366|ref|ZP_01791068.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae PittAA] gi|145267372|gb|EDK07374.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae PittAA] Length = 101 Score = 44.2 bits (103), Expect = 0.017, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 16/30 (53%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA 36 VIS+ + RR+ + L FSFF+A Sbjct: 30 VISMENATERRKHITKQFESKKLSFSFFNA 59 >gi|145640524|ref|ZP_01796108.1| dimethyladenosine transferase [Haemophilus influenzae R3021] gi|145275110|gb|EDK14972.1| dimethyladenosine transferase [Haemophilus influenzae 22.4-21] Length = 97 Score = 44.2 bits (103), Expect = 0.017, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 16/30 (53%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA 36 VIS+ + RR+ + L FSFF+A Sbjct: 30 VISMENATERRKHITKQFESKKLSFSFFNA 59 >gi|307313527|ref|ZP_07593148.1| glycosyl transferase family 2 [Escherichia coli W] gi|306906695|gb|EFN37206.1| glycosyl transferase family 2 [Escherichia coli W] gi|315059534|gb|ADT73861.1| glycosyl transferase family 2 [Escherichia coli W] gi|323379907|gb|ADX52175.1| glycosyl transferase family 2 [Escherichia coli KO11] Length = 814 Score = 44.2 bits (103), Expect = 0.018, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 52/167 (31%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ R+ + + Q R L Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIP-----------PEKITRLAACEDENGQRGRRL-- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A L Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQL 406 >gi|145633497|ref|ZP_01789226.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 3655] gi|144985866|gb|EDJ92474.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae 3655] Length = 81 Score = 44.2 bits (103), Expect = 0.019, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 16/30 (53%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA 36 VIS+ + RR+ + L FSFF+A Sbjct: 29 VISMENATERRKHITKQFESKKLSFSFFNA 58 >gi|308806053|ref|XP_003080338.1| unnamed protein product [Ostreococcus tauri] gi|116058798|emb|CAL54505.1| unnamed protein product [Ostreococcus tauri] Length = 229 Score = 43.8 bits (102), Expect = 0.022, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 41/134 (30%), Gaps = 10/134 (7%) Query: 85 AIGAIILEDDADFSDEFSQLL-----PHLSKCDINNILIKFD---ALRKKPKKDSYLCTL 136 ++ EDDA+ L + ++ D ++P Sbjct: 95 DDSILLFEDDANVVSPSPDALRSELMALIQTLPTGWDILSLDPHPNFCREPFWKHPSAPT 154 Query: 137 PGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE-HNIPSLVTEP 195 +H+ RTT + + A+ +L IDM + + + Sbjct: 155 SLKRRVHRTYSTFSRTTAIVVSRRGALKILRNL-PTNIVIDMFIARILRQGGLRIYLACD 213 Query: 196 GAVYEAIDTNDSTI 209 + +D++ +++ Sbjct: 214 NGIVRQVDSSVTSL 227 >gi|242819198|ref|XP_002487268.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500] gi|218713733|gb|EED13157.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500] Length = 333 Score = 43.8 bits (102), Expect = 0.023, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 8/75 (10%) Query: 24 AARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYS 83 + + L + I G++ + +R IG + SH++ + + + Sbjct: 4 MSDLKLDY-----IEGKSGDEVSDRALPPPASHNTQRT---GNIGSWRSHLNAIRAVVEN 55 Query: 84 PAIGAIILEDDADFS 98 A+ILEDDAD+ Sbjct: 56 NLDSALILEDDADWD 70 >gi|300898045|ref|ZP_07116415.1| glycosyltransferase, group 2 family protein [Escherichia coli MS 198-1] gi|300358243|gb|EFJ74113.1| glycosyltransferase, group 2 family protein [Escherichia coli MS 198-1] Length = 860 Score = 43.8 bits (102), Expect = 0.025, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ S + + +R Sbjct: 302 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDQNGQRGRRQ--- 348 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 349 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 400 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L + + + L +LPG R + Y + + Sbjct: 401 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNRR 434 >gi|218703517|ref|YP_002411036.1| putative glysosyltransferase [Escherichia coli UMN026] gi|218430614|emb|CAR11480.1| putative glysosyltransferase [Escherichia coli UMN026] Length = 847 Score = 43.8 bits (102), Expect = 0.028, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ S + + +R Sbjct: 289 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDQNGQRGRRQ--- 335 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 336 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 387 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L + + + L +LPG R + Y + + Sbjct: 388 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNRR 421 >gi|75909392|ref|YP_323688.1| hypothetical protein Ava_3185 [Anabaena variabilis ATCC 29413] gi|75703117|gb|ABA22793.1| hypothetical protein Ava_3185 [Anabaena variabilis ATCC 29413] Length = 282 Score = 43.4 bits (101), Expect = 0.029, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 77/273 (28%), Gaps = 59/273 (21%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V +I+ S E+ + L + + + R Sbjct: 18 VLIIAHKEST---EQLEATLTKEGLNYEVL-------RQESQPEYQNFSRSYL------- 60 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALR 124 C ++H W++ A +I+E D F +L + F+ + Sbjct: 61 ----CLLNHRSAWEK-ARQTDKLTLIVEADFVPVIGFGKL------------PLPFEPHQ 103 Query: 125 KKPKKDSYLCTLPGNFDIH---QPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI----- 176 K P + + + S Y + ++A +LL + I I Sbjct: 104 KDVGISWLYTCAPQVYHVSPEGYAQGFSTSAVAYIVNSQSATYLLELADEITEKIGATKY 163 Query: 177 ---DMDMKHWWE-HNIPSLVTE---------PGAVYEAIDTNDSTIEESRLVRKPTFSPL 223 D ++ + + + + P + + L K F+PL Sbjct: 164 STWDSEIDKFLLQRKLKNYIPWRNYGEHGGLPNP-EHRQNNLGNVHHADVLYGKLAFTPL 222 Query: 224 YFYRNTCYQW---NLHYNAWRKDLPPVSTTKFL 253 Y W + A K + ++ +FL Sbjct: 223 YALGEQGGTWKFLTVRLQARLKGIARLAVGRFL 255 >gi|149039148|gb|EDL93368.1| rCG45647, isoform CRA_b [Rattus norvegicus] Length = 165 Score = 43.4 bits (101), Expect = 0.030, Method: Composition-based stats. Identities = 19/118 (16%), Positives = 39/118 (33%), Gaps = 20/118 (16%) Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH 164 + + ++ LI + P+++ + LPG S T Y + A Sbjct: 1 MEDVLTQKLSWDLIYLGRKQVNPEEEVAVEGLPG----LVVAGYSYWTLAYTLSLAGARK 56 Query: 165 LLNVRKNIYR--PIDMDMK-------------HWWEHNIPSLVTEPGAVYEAIDTNDS 207 LL + ++R P+D + H+W ++ + P D+ Sbjct: 57 LL-ASQPLHRMLPVDEFLPVMFDRHPNDQYKAHFWPRDLQAFSARPLLASPTHYAGDT 113 >gi|293403347|ref|ZP_06647438.1| conserved hypothetical protein [Escherichia coli FVEC1412] gi|298378959|ref|ZP_06988840.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|291429200|gb|EFF02220.1| conserved hypothetical protein [Escherichia coli FVEC1412] gi|298280072|gb|EFI21576.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 823 Score = 43.4 bits (101), Expect = 0.032, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ S + + +R Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDQNGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L + + + L +LPG R + Y + + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNRR 397 >gi|145631771|ref|ZP_01787532.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae R3021] gi|144982632|gb|EDJ90178.1| UDP-Gal--lipooligosaccharide galactosyltransferase [Haemophilus influenzae R3021] Length = 58 Score = 43.4 bits (101), Expect = 0.033, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 16/30 (53%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDA 36 VIS+ + RR+ + L FSFF+A Sbjct: 29 VISMENATERRKHITKQFESKKLSFSFFNA 58 >gi|155371614|ref|YP_001427148.1| hypothetical protein ATCV1_Z667L [Acanthocystis turfacea Chlorella virus 1] gi|155124934|gb|ABT16801.1| hypothetical protein ATCV1_Z667L [Acanthocystis turfacea Chlorella virus 1] Length = 545 Score = 43.0 bits (100), Expect = 0.041, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 42/157 (26%), Gaps = 47/157 (29%) Query: 6 YVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 + I+ R K + I G N ++ ++ Sbjct: 415 FYINRDEDIDNRMKMEKQK------------INGMNILRIKGDYNEKE------------ 450 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF-SQLLPHLSKCDINNILIKFDALR 124 SH KR AI+ DD F +F ++LL +SK ++ ++ Sbjct: 451 ------SHSKAIKRAFEENFTFAILCRDDVIFPSDFMTKLLHIISKLPLSWQKLQLRFKS 504 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEA 161 + + + Y + K Sbjct: 505 NETITEGNIENP----------------ACYILSKSG 525 >gi|114676041|ref|XP_001173632.1| PREDICTED: procollagen galactosyltransferase 1-like [Pan troglodytes] gi|193787049|dbj|BAG51872.1| unnamed protein product [Homo sapiens] Length = 151 Score = 43.0 bits (100), Expect = 0.041, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 25/75 (33%), Gaps = 16/75 (21%) Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDMDMK-------------HWWEHNIPSLV 192 S T Y I + A LL + + P+D + H+ N+ + Sbjct: 21 YSYWTLAYVISLQGARKLL-AAEPLSKMLPVDEFLPVMFDKHPVSEYKAHFSLRNLHAFS 79 Query: 193 TEPGAVYEAIDTNDS 207 EP +Y T D Sbjct: 80 VEPLLIYPTHYTGDD 94 >gi|212536927|ref|XP_002148619.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] gi|210068361|gb|EEA22452.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] Length = 236 Score = 43.0 bits (100), Expect = 0.042, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 40/143 (27%), Gaps = 36/143 (25%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN------ 114 + + + C++ H+++ + I + A++ EDD D+ + L +S Sbjct: 7 NIGIGQTQCWLGHLNVLQEIVHRSFATALVFEDDVDWDVDIKYELALVSPLIRQVWSCDG 66 Query: 115 -----------NILIKFDALRKKPKKDSY-------------------LCTLPGNFDIHQ 144 ++ D+ L P + Sbjct: 67 GDDDKTLFGSCWDILWLGHCGDIIASDALSVWDYTSLPSGFYQENDDQLTPFPARTRMVH 126 Query: 145 PRILSPRTTGYFIGKEAAIHLLN 167 T Y + ++A L+ Sbjct: 127 VSQGPICTYAYAVTNKSAAKLVQ 149 >gi|308802442|ref|XP_003078534.1| unnamed protein product [Ostreococcus tauri] gi|116056987|emb|CAL51414.1| unnamed protein product [Ostreococcus tauri] Length = 378 Score = 43.0 bits (100), Expect = 0.046, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 42/137 (30%), Gaps = 25/137 (18%) Query: 68 GCYISHIHLWKRIAYSPAIGAIILEDD----ADFSDEFSQLLPHLSKCDINNILIKFDAL 123 GC ++H+ +W R+ S A ++ E D S L D + + +++D Sbjct: 208 GCLLTHMAVWNRVLESDATHFVLWESDGSEHLSVSPLDYDQLYDELPHDADMVWMRYDVE 267 Query: 124 RKKPKKDSYLCTLPGNFDIHQP-----------------RILSPRTTGYFIGKEAAIH-- 164 + G + + P R T Y + ++ Sbjct: 268 SSGQLLKRFKSRATGTWGGNLPSIRFDQEPDVYLYKFDKRCNWAGTPAYMMTRKGVEKIM 327 Query: 165 -LLNVRKNIYRPIDMDM 180 +N + ID + Sbjct: 328 NFINDAEKADM-IDAWL 343 >gi|84618234|emb|CAJ21222.1| FPV140 protein [Avipoxvirus CVL] gi|84618286|emb|CAJ43783.1| FPV140 protein [Avipoxvirus CVL] Length = 333 Score = 42.6 bits (99), Expect = 0.060, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 10/101 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + II+EDD + + F L ++ K D N +++ K Sbjct: 110 HISLWRYIMDNAEKLPNYIIIIEDDNTITGESFITNLDNIIKILNDNNVDVLQLVTHTKL 169 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 K +K +L LP +S + Y I +EA L Sbjct: 170 LKDRKSQHLMLLPDLEAFKGSFDVSL--SAYIIRQEAVRKL 208 >gi|323344144|ref|ZP_08084370.1| glycosyltransferase [Prevotella oralis ATCC 33269] gi|323094873|gb|EFZ37448.1| glycosyltransferase [Prevotella oralis ATCC 33269] Length = 219 Score = 42.6 bits (99), Expect = 0.061, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 38/103 (36%), Gaps = 25/103 (24%) Query: 3 IPVYVISLPFSHARREKFCHRA-ARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 IP YVI+LP RR+ + + +AI +PI Sbjct: 8 IPTYVINLPERIDRRKSVEAEFKDKTEFELHIVEAI---KHPI----------------- 47 Query: 62 LSLPEIGCYISHIHLWKRIAY-SPAIGAIILEDDADFSDEFSQ 103 IG + S + K++A II ED+ F++ + + Sbjct: 48 ---GAIGLWKSITSIIKKVAQKGEDDVIIICEDEHVFTENYQR 87 >gi|293408359|ref|ZP_06652198.1| conserved hypothetical protein [Escherichia coli B354] gi|291471537|gb|EFF14020.1| conserved hypothetical protein [Escherichia coli B354] Length = 823 Score = 42.2 bits (98), Expect = 0.069, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 80/253 (31%), Gaps = 46/253 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 265 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDQNGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHLLN---VRKNIYRP 175 L + + + L +LPG R + Y + + A + N + ++P Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNCRYYPQLAQQMSNDEHSLEECWQP 419 Query: 176 I---DMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI----EESRLVRKPTFSPLYFYRN 228 + D + + PSL + + + I + +V KP+ P+ Sbjct: 420 VLRADKWLACY-----PSLCYQRPGFSDIENKTTDNIGHYFNKLPVVTKPSALPIADTIG 474 Query: 229 TCYQWNLHYNAWR 241 + + HY +R Sbjct: 475 FFMETSFHYALYR 487 >gi|331645408|ref|ZP_08346512.1| glycosyl transferase, group 2 family [Escherichia coli M605] gi|331045570|gb|EGI17696.1| glycosyl transferase, group 2 family [Escherichia coli M605] Length = 812 Score = 42.2 bits (98), Expect = 0.072, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V +I+L R ++ +H+ + + + +R Q Sbjct: 263 VVLINLDKRPDRLQQIREELTLLHIPPE-------KITRLAASENENGQRGRQQ------ 309 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 310 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNALLASLAKIPW-QVMI 361 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 362 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 404 >gi|9634810|ref|NP_039103.1| Virion envelope protein, p35 [Fowlpox virus] gi|18203079|sp|Q9J590|VP35_FOWPN RecName: Full=Immunodominant envelope protein p35; AltName: Full=Ag35; AltName: Full=Virion envelope protein p35 gi|7271638|gb|AAF44484.1|AF198100_131 ORF FPV140 Virion envelope protein, p35 [Fowlpox virus] gi|41023425|emb|CAE52679.1| H3L IMV membrane protein p35 orthologue [Fowlpox virus isolate HP-438/Munich] gi|84618250|emb|CAJ21210.1| FPV140 protein [Fowlpox virus] gi|84618254|emb|CAJ21213.1| FPV140 protein [Fowlpox virus] gi|84618262|emb|CAJ21219.1| FPV140 protein [Fowlpox virus] gi|311350260|gb|ADP92335.1| envelope protein [Fowlpox virus] Length = 327 Score = 42.2 bits (98), Expect = 0.078, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 43/104 (41%), Gaps = 10/104 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + +I+EDD + + F L +++K D N +++ K Sbjct: 104 HISLWRYIMDNAEKLPNYVVIMEDDNTITGEGFITNLDNITKVLNDNNVDILQLVTHTKL 163 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K + +L LP +S + Y I +EA L + Sbjct: 164 LKDRNSQHLMLLPDLEAFKGSFDVSL--SAYIIRQEAVRKLYSY 205 >gi|84618258|emb|CAJ21216.1| FPv140 protein [Fowlpox virus] Length = 327 Score = 41.9 bits (97), Expect = 0.082, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 43/104 (41%), Gaps = 10/104 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + +I+EDD + + F L +++K D N +++ K Sbjct: 104 HISLWRYIMDNAEKLPNYVVIMEDDNTITGEGFITNLDNITKVLNDNNVDILQLVTHTKL 163 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K + +L LP +S + Y I +EA L + Sbjct: 164 LKDRNSQHLMLLPDLEAFKGSFDVSL--SAYIIRQEAVRKLYSY 205 >gi|260574346|ref|ZP_05842350.1| hypothetical protein Rsw2DRAFT_0337 [Rhodobacter sp. SW2] gi|259023242|gb|EEW26534.1| hypothetical protein Rsw2DRAFT_0337 [Rhodobacter sp. SW2] Length = 269 Score = 41.9 bits (97), Expect = 0.087, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 66/217 (30%), Gaps = 43/217 (19%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 VI++P RR + + SF D + ++ + Sbjct: 27 VINMPKQRKRRRHMERLLTGLGIAHSFSDGVD----STEFKVLNAT-------------- 68 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 ++H +A + + +ILEDD +F S LP L D + + + + Sbjct: 69 ----LAHQGA---MAKAKGLPFMILEDDLEFCAG-STALPGL-PADADIVYLGLNTAGCF 119 Query: 127 PKKD----------SYLCTLPGNFDIHQPRILSPRTTGYFI-----GKEAAIHLLNVRKN 171 P+ L D R+ S + + G+ A L + N Sbjct: 120 PRIREIHDQIGHSAVPDFVLARAHDARFARLFSMVSAHAILFVSARGRAAYAEALQLAAN 179 Query: 172 IYRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDST 208 P+D+ + + + EA+ T Sbjct: 180 RRMPLDVSYAYV-MGGLNTYAVLQPMFREAMALQIRT 215 >gi|170021368|ref|YP_001726322.1| glycosyl transferase family protein [Escherichia coli ATCC 8739] gi|169756296|gb|ACA78995.1| glycosyl transferase family 2 [Escherichia coli ATCC 8739] Length = 814 Score = 41.9 bits (97), Expect = 0.094, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ P C + R+ Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIPPEKIT-----RLPACEDENGQRGRRQ-------- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVML 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 406 >gi|188492971|ref|ZP_03000241.1| glycosyl transferase domain protein, group 2 family [Escherichia coli 53638] gi|188488170|gb|EDU63273.1| glycosyl transferase domain protein, group 2 family [Escherichia coli 53638] Length = 814 Score = 41.9 bits (97), Expect = 0.095, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ P C + R+ Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIPPEKIT-----RLPACEDENGQRGRRQ-------- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVML 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 406 >gi|145348073|ref|XP_001418481.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144578710|gb|ABO96774.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 515 Score = 41.9 bits (97), Expect = 0.098, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 33/106 (31%), Gaps = 8/106 (7%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDAD---FSDEFSQLLPHLSKCDINNILIKFDAL 123 +GC SH+ +W+ +I E D L + N +I Sbjct: 265 VGCLFSHMRMWRLHQKLKNKWTVIFESDGLWSGLHIPPKGLQSVIDNAPANADVIFLKTR 324 Query: 124 RKKPKKDSYLCTLPGN----FDIHQPRILSPRTTGYFIGKEAAIHL 165 + + + + +++ + + GY +G + L Sbjct: 325 DEPTGQFVKQWPVEDEMIYMYSMNRVQ-AACGLAGYIVGPKFTEKL 369 >gi|291228148|ref|XP_002734046.1| PREDICTED: phospholipase C beta type-like [Saccoglossus kowalevskii] Length = 1318 Score = 41.9 bits (97), Expect = 0.10, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 53/167 (31%), Gaps = 29/167 (17%) Query: 2 PIPV-YVIS-LPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 +PV ++I L S R+K H I L G+N+ I F+ + + Sbjct: 137 KVPVKHIIKKLANSKDDRKKVEHALQAIGLP-------SGKNDVIPTERFTLEDFFIFYI 189 Query: 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIK 119 RL + EI +++ I +L +F L + + N ++ Sbjct: 190 RLTNRGEI------DKVFQEIGAKNKPYLTVL--------QFKDFLNSVQRDPRLNEVLF 235 Query: 120 FDALRKKPK------KDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 KK + + + G+ + Y I + Sbjct: 236 PLCTEKKTQELIDDYEPNKSFAKKGHLSMEGFTKYLMSFDNYIITPD 282 >gi|84618246|emb|CAJ21207.1| FPV140 protein [Avipoxvirus CVL] gi|238801172|gb|ACR56304.1| immunodominant envelope protein [Penguinpox virus] Length = 333 Score = 41.9 bits (97), Expect = 0.10, Method: Composition-based stats. Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 10/104 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + II+EDD + + F L ++ K D N +++ K Sbjct: 110 HISLWRYIMDNAEKLPNYIIIIEDDNTITGESFITNLDNIIKILSDNNVDVLQLVTHTKL 169 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K +K +L LP +S + Y I ++A L + Sbjct: 170 LKDRKSQHLMLLPDLEAFKGSFDVSL--SAYIIRQDAVRKLYSY 211 >gi|84618242|emb|CAJ21204.1| FPV140 protein [Avipoxvirus CVL] Length = 333 Score = 41.9 bits (97), Expect = 0.10, Method: Composition-based stats. Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 10/104 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + II+EDD + + F L ++ K D N +++ K Sbjct: 110 HISLWRYIMDNAEKLPNYIIIIEDDNTITGESFITNLDNIIKILSDNNVDVLQLVTHTKL 169 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K +K +L LP +S + Y I ++A L + Sbjct: 170 LKDRKSQHLMLLPDLEAFKGSFDVSL--SAYIIRQDAVRKLYSY 211 >gi|323158033|gb|EFZ44133.1| glycosyl transferase family 2 family protein [Escherichia coli EPECa14] Length = 830 Score = 41.9 bits (97), Expect = 0.11, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ S + + +R Sbjct: 281 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 327 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 328 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 379 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 380 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 422 >gi|323975938|gb|EGB71032.1| glycosyl transferase 2 [Escherichia coli TW10509] Length = 823 Score = 41.5 bits (96), Expect = 0.11, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 51/167 (30%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + V++L R + A +H+ R+ + + Q R L Sbjct: 265 IVVVNLDKRPDRLRQIREELALLHIP-----------PEKITRLAACEDENGQRGRRL-- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHSWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 406 >gi|260866405|ref|YP_003232807.1| putative glycosyltransferase [Escherichia coli O111:H- str. 11128] gi|257762761|dbj|BAI34256.1| predicted glycosyltransferase [Escherichia coli O111:H- str. 11128] gi|323178502|gb|EFZ64080.1| glycosyl transferase family 2 family protein [Escherichia coli 1180] Length = 814 Score = 41.5 bits (96), Expect = 0.11, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ S + + +R Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 406 >gi|283834463|ref|ZP_06354204.1| glycosyl transferase, group 2 family [Citrobacter youngae ATCC 29220] gi|291070010|gb|EFE08119.1| glycosyl transferase, group 2 family [Citrobacter youngae ATCC 29220] Length = 853 Score = 41.5 bits (96), Expect = 0.12, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 43/156 (27%), Gaps = 27/156 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V +++L R ++ K + + Sbjct: 273 VVIVNLDSRPDRLAHIQQELTFLNFPAE--------------------KVTRLAASVATN 312 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 +IG SH+ + ++LEDD+ + ++L L + Sbjct: 313 GQIGRSQSHMRALQLAQLKGWKNYLLLEDDSVILKQEKHIRVLNSLLSALPSFPWE-VVI 371 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG 158 L + K+ S L +L G + Y + Sbjct: 372 LGGEIKQGSELKSLQGMIHARDCNKV----CAYLVN 403 >gi|212530252|ref|XP_002145283.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] gi|210074681|gb|EEA28768.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] Length = 334 Score = 41.5 bits (96), Expect = 0.12, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 20/34 (58%) Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFS 98 IG + SH++ + I + A+ILEDDAD+ Sbjct: 37 GNIGSWRSHLNAIRAIVENNLDSALILEDDADWD 70 >gi|182509204|ref|NP_001116813.1| fringe [Bombyx mori] gi|169643683|dbj|BAG12565.1| fringe [Bombyx mori] Length = 327 Score = 41.5 bits (96), Expect = 0.12, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 58/189 (30%), Gaps = 37/189 (19%) Query: 69 CYISHIHL---------WKRIAYSPAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILI 118 C SH + R S +DD + +L + + Sbjct: 123 CSASHQRKHLCCKMSVEYDRFLESGKKWFCHFDDDNYVNVPRLVSVLQTYKHQEDWYL-- 180 Query: 119 KFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL---------NVR 169 R + + P N + + G+ I + A+ +L ++ Sbjct: 181 ----GRTSVYEPVKIYKKPTN-KLMFSFWFATGGAGFCISRSLALKMLPVASGGRFISIC 235 Query: 170 KNIYRPIDMDMKHWWEHNI-PSLVTEPGAVYEAIDTNDSTIEESRLVRKPTFS--PLYFY 226 + I P D+ + EH + +L P S +E+ +L+ TF + Y Sbjct: 236 EGIRLPDDVSVGFIIEHLMKKNLTLVP--------EFHSHLEQMKLLPPETFRDQISFSY 287 Query: 227 RNTCYQWNL 235 +WN+ Sbjct: 288 AKAKDEWNV 296 >gi|239948402|ref|ZP_04700155.1| glycosyltransferase [Rickettsia endosymbiont of Ixodes scapularis] gi|239922678|gb|EER22702.1| glycosyltransferase [Rickettsia endosymbiont of Ixodes scapularis] Length = 51 Score = 41.5 bits (96), Expect = 0.13, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 1/49 (2%) Query: 156 FIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHN-IPSLVTEPGAVYEAID 203 I +AA LL KN+ ID + + HN + + VT P ++ + Sbjct: 1 MIHLKAAQKLLEYSKNLTLEIDNVIYQAFMHNEVQAYVTSPFLIHATFN 49 >gi|300817280|ref|ZP_07097498.1| conserved hypothetical protein [Escherichia coli MS 107-1] gi|300530256|gb|EFK51318.1| conserved hypothetical protein [Escherichia coli MS 107-1] Length = 550 Score = 41.5 bits (96), Expect = 0.14, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 49/165 (29%), Gaps = 31/165 (18%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPE 66 V++L R ++ A +H+ S + + +R Sbjct: 3 VVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ----- 47 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDALR 124 SH+ + ++LEDDA + Q+L L + L Sbjct: 48 -----SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMLLG 101 Query: 125 KKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 + + + L +LPG R + Y + + A + Sbjct: 102 GEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 142 >gi|84618238|emb|CAJ21201.1| FPV140 protein [Avipoxvirus Peekham] Length = 333 Score = 41.1 bits (95), Expect = 0.14, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 10/101 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + II+EDD + + F L ++ K D N +++ K Sbjct: 110 HISLWRYIMDNTEKLPNYVIIIEDDNTITGESFITNLDNIIKILNDNNVDVLQLVTHTKL 169 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 K +K +L LP +S + Y I ++A L Sbjct: 170 LKDRKSQHLMLLPDLEAFKGSFDVSL--SAYIIRQDAVRKL 208 >gi|300936116|ref|ZP_07151056.1| glycosyltransferase, group 2 family protein [Escherichia coli MS 21-1] gi|300458736|gb|EFK22229.1| glycosyltransferase, group 2 family protein [Escherichia coli MS 21-1] Length = 823 Score = 41.1 bits (95), Expect = 0.15, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ +H+ + + + +R Q Sbjct: 265 IVLLNLDKRPDRLQQIREELTLLHIPPE-------KITRLAASENENGQRGRQQ------ 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQRGWQNYLLLEDDAVILKQEKHIQVLNALLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L + + + L +LPG R + Y + + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNRR 397 >gi|256389709|ref|YP_003111273.1| hypothetical protein Caci_0480 [Catenulispora acidiphila DSM 44928] gi|256355935|gb|ACU69432.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 381 Score = 41.1 bits (95), Expect = 0.16, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 41/144 (28%), Gaps = 5/144 (3%) Query: 74 IHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYL 133 + W + A ++L+DD + F L+ + + + F + L Sbjct: 53 LLAWS-MVEPGATHHLVLQDDVVLCEGFVDLMVAAVRTHPDAAVSMFAEWGARAGTLVRL 111 Query: 134 CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDM-DMKHWWEHNIPSLV 192 L G P T + A + P D+ + H +V Sbjct: 112 ALLRGQGYAAVADPYIP-TQALVLPAAQAADFAEKGRTETEPDDIALLNHLRRTGTRGVV 170 Query: 193 TEPGAVYE--AIDTNDSTIEESRL 214 P V +T + RL Sbjct: 171 VAPNLVQHQDRPSLTGNTFQGPRL 194 >gi|73995172|ref|XP_543380.2| PREDICTED: similar to beta-1,3-N-acetylglucosaminyltransferase bGnT-4 [Canis familiaris] Length = 536 Score = 41.1 bits (95), Expect = 0.16, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 55/187 (29%), Gaps = 24/187 (12%) Query: 11 PFSHARREKFCHRAARIH-------LQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL-- 61 P RR R+ L+ F + G P + ++ + Sbjct: 282 PGHVERRAAIRSTWGRVGGWARGRRLKLVFLLGVAGPAPPAQLLAYESREFDDILQWDFA 341 Query: 62 -----LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNI 116 L+L E+ H+ W +A S + +DD +L L D Sbjct: 342 EDFFNLTLKEL-----HLQRWVAVACSQTRFMLKGDDDVFV--HVPNVLEFLDGWDPAQD 394 Query: 117 LIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSP--RTTGYFIGKEAAIHLLNVRKNIYR 174 L+ D +R+ + +++ R P GY + + L + Sbjct: 395 LLVGDVIRQALPNRNTKVKYFIPPSMYRARHYPPYAGGGGYVMSRATVQRLQAAVEEAEL 454 Query: 175 -PIDMDM 180 PID Sbjct: 455 FPIDDVF 461 >gi|170680272|ref|YP_001742365.1| glycosyl transferase, group 2 family protein [Escherichia coli SMS-3-5] gi|170517990|gb|ACB16168.1| glycosyl transferase, group 2 family protein [Escherichia coli SMS-3-5] Length = 814 Score = 41.1 bits (95), Expect = 0.16, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ +H+ + + + +R Q Sbjct: 265 IVLLNLDKRPDRLQQIREELTLLHIPPE-------KITRLAASENENGQRGRQQ------ 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQRGWQNYLLLEDDAVILKQEKHIQVLNALLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKE 160 L + + + L +LPG R + Y + + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNRR 397 >gi|145348071|ref|XP_001418480.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144578709|gb|ABO96773.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 514 Score = 40.7 bits (94), Expect = 0.19, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 29/100 (29%), Gaps = 6/100 (6%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDAD---FSDEFSQLLPHLSKCDINNILIKFDAL 123 I C SH+ +W+ + +I E DA + L + N +I A Sbjct: 259 ISCLFSHMRMWRLHSKFKNKWTVIFESDALWGGIRMKPYALQSVIDNAPANADVILLKAR 318 Query: 124 RKKPKKDSYLCTLPGNFDIHQPR---ILSPRTTGYFIGKE 160 + + + GY IG + Sbjct: 319 DEPTGQFVKQWPVEDEMVYMYSANHIKAGCGLAGYVIGPK 358 >gi|331216091|ref|XP_003320725.1| hypothetical protein PGTG_02747 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309299715|gb|EFP76306.1| hypothetical protein PGTG_02747 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 462 Score = 40.7 bits (94), Expect = 0.21, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 28/97 (28%), Gaps = 7/97 (7%) Query: 30 QFSFFDAI---YGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAI 86 F D I RI ++ K + + C +SH Sbjct: 220 NLHFVDTILYSDPRVLDTVQRIGNNTKADKKVAE---GGHVACQMSHRLAIGAADADDDE 276 Query: 87 GAIILEDDADFSDEFSQLL-PHLSKCDINNILIKFDA 122 +IL+DD D F L L + +I F Sbjct: 277 ITLILKDDVDIEAVFKYLAGTILRDVPKDWDIIFFGH 313 >gi|9629016|ref|NP_044035.1| MC084L [Molluscum contagiosum virus subtype 1] gi|1492027|gb|AAC55212.1| MC084L [Molluscum contagiosum virus subtype 1] gi|3041767|dbj|BAA25418.1| 34-kDa protein [Molluscum contagiosum virus] Length = 298 Score = 40.7 bits (94), Expect = 0.24, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 39/106 (36%), Gaps = 13/106 (12%) Query: 71 ISHIHLWKRI--------AYSPAIGAIILEDDADFSDEFSQLLPHL---SKCDINNILIK 119 ++H+ LWK + A + + AI++EDD Q L + ++L Sbjct: 69 LNHLSLWKELGSESAPKAAGAESEYAIVVEDDNTVQPLLLQSAAALVGGMRAQQVHVLQL 128 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 + L + + L P + P L Y I K +A L Sbjct: 129 REPLHAGVRAQTPLSGNPSAY--VYPARLHASLGAYIIHKPSAGRL 172 >gi|323171054|gb|EFZ56703.1| glycosyl transferase family 2 family protein [Escherichia coli LT-68] Length = 436 Score = 40.3 bits (93), Expect = 0.25, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 49/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 V V++L R ++ A +H+ + + +R Sbjct: 265 VVVVNLDKRPDRLQQIREELALLHIPPEKI----------TRLAACEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVML 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 406 >gi|301646803|ref|ZP_07246656.1| conserved hypothetical protein [Escherichia coli MS 146-1] gi|301075009|gb|EFK89815.1| conserved hypothetical protein [Escherichia coli MS 146-1] Length = 720 Score = 40.3 bits (93), Expect = 0.27, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 73/253 (28%), Gaps = 46/253 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 171 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 217 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 218 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 269 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL------LNVRKNI 172 L + + + L +LPG R + Y + + A + L Sbjct: 270 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQMSNDEHSLEACWQP 325 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI----EESRLVRKPTFSPLYFYRN 228 D + + PSL + + I + + KP+ P+ Sbjct: 326 LLRADKWLACY-----PSLCYQRPGFSDIEKKTTDNIAHYFNKLPVATKPSTLPIADTIG 380 Query: 229 TCYQWNLHYNAWR 241 + + HY +R Sbjct: 381 FFMETSFHYTLYR 393 >gi|67524931|ref|XP_660527.1| hypothetical protein AN2923.2 [Aspergillus nidulans FGSC A4] gi|40744318|gb|EAA63494.1| hypothetical protein AN2923.2 [Aspergillus nidulans FGSC A4] gi|259486139|tpe|CBF83741.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4] Length = 339 Score = 40.3 bits (93), Expect = 0.28, Method: Composition-based stats. Identities = 24/102 (23%), Positives = 37/102 (36%), Gaps = 19/102 (18%) Query: 11 PFSHARREKFCHRAARIHLQFSFFDA-----IYGENNPICNRIFSHQKRQCQFKRLLSLP 65 P S R+ H+ DA I + P N + H+ + +R Sbjct: 20 PRSDQERDIGGSSVRGFHI--EIIDATTPEEIDAKTYPY-NWNYDHRPVEYAARR----- 71 Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH 107 SH+++ +RI AII+EDDAD+ L Sbjct: 72 ------SHLNVMQRILKERLASAIIMEDDADWDVSIKSQLQS 107 >gi|322696428|gb|EFY88220.1| LPS glycosyltransferase [Metarhizium acridum CQMa 102] Length = 263 Score = 40.3 bits (93), Expect = 0.29, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 3/89 (3%) Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH-LLNVRKNI 172 NNI + ++ P D Y L G + + TTGY + A LL ++ Sbjct: 90 NNITLIYNDPHVPPGMDYYGREL-GRERVIRKAGHQGCTTGYAVSLNGAAKLLLRSAADL 148 Query: 173 YRPIDMDMKHW-WEHNIPSLVTEPGAVYE 200 + +DM M ++ S P + + Sbjct: 149 DQAVDMGMSSLIQSGHLTSYSVYPPILAQ 177 >gi|331640705|ref|ZP_08341852.1| glycosyl transferase, group 2 family [Escherichia coli H736] gi|331040079|gb|EGI12287.1| glycosyl transferase, group 2 family [Escherichia coli H736] Length = 838 Score = 40.3 bits (93), Expect = 0.30, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 73/253 (28%), Gaps = 46/253 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 289 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 335 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 336 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 387 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL------LNVRKNI 172 L + + + L +LPG R + Y + + A + L Sbjct: 388 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQMSNDEHSLEACWQP 443 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI----EESRLVRKPTFSPLYFYRN 228 D + + PSL + + I + + KP+ P+ Sbjct: 444 LLRADKWLACY-----PSLCYQRPGFSDIEKKTTDNIAHYFNKLPVATKPSTLPIADTIG 498 Query: 229 TCYQWNLHYNAWR 241 + + HY +R Sbjct: 499 FFMETSFHYTLYR 511 >gi|307213490|gb|EFN88899.1| Glycosyltransferase 25 family member [Harpegnathos saltator] Length = 347 Score = 40.3 bits (93), Expect = 0.30, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 17/37 (45%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGEN 41 +Y+I+L RR + + +Q DA+ G N Sbjct: 303 IYMINLLRRPERRRRMQKLFKELGIQAEIIDAVDGRN 339 >gi|283786364|ref|YP_003366229.1| Acetate kinase [Citrobacter rodentium ICC168] gi|282949818|emb|CBG89441.1| Acetate kinase [Citrobacter rodentium ICC168] Length = 431 Score = 40.3 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 52/195 (26%) Query: 28 HLQFSFFDAIYGEN-NPICNRIFSHQKRQCQFKRLLSLPE--IGCYISHIHLWKRIAY-- 82 L+F+ DA+ GE F + + ++K S E +G +H I Sbjct: 46 SLKFAIIDAVNGEEYLSGLAECFHLPEARIKWKMDGSKQEAALGAGAAHSEALNFIVNTI 105 Query: 83 ----------------------SPAIGAIILEDDAD--------FSD--------EFSQL 104 +++++D F+ ++ Sbjct: 106 LAQKPELSAQLTAIGHRIVHGGEKYTSSVVIDDSVIQGIKDAAPFAPLHNPAHLIGITEA 165 Query: 105 LPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQ--PRILSPRTTGYFIGKEAA 162 L N+ + A + ++SYL LP N R T+ Y++ +EAA Sbjct: 166 LKSFPALKDKNVAVFDTAFHQTMPEESYLYALPYNLYKEHGVRRYGFHGTSHYYVTQEAA 225 Query: 163 IHLLNVRKNIYRPID 177 +LN P+D Sbjct: 226 -KMLNK------PVD 233 >gi|133843258|gb|ABO38878.1| FPV140 protein [Avipoxvirus CVL] Length = 330 Score = 39.9 bits (92), Expect = 0.34, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 42/104 (40%), Gaps = 10/104 (9%) Query: 73 HIHLWKRIAYSP---AIGAIILEDDADFS-DEFSQLLPHLSKC--DINNILIKFDALRK- 125 HI LW+ I + +I+EDD + + F L ++ K D N +++ K Sbjct: 107 HISLWRYIMDNAEKLPNYVVIIEDDNTITGESFITNLDNIIKILSDNNVDILQLVTHTKL 166 Query: 126 -KPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K + +L LP +S + Y I ++A L + Sbjct: 167 LKDRNSQHLMLLPDLEAFKGSFDVSL--SAYIIRQDAVRKLYSY 208 >gi|323191528|gb|EFZ76789.1| glycosyl transferase family 2 family protein [Escherichia coli RN587/1] Length = 812 Score = 39.9 bits (92), Expect = 0.35, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ +H+ + + + +R Q Sbjct: 263 IVLLNLDKRPDRLQQIREELTLLHIPPE-------KITRLAASENENGQRGRQQ------ 309 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 310 -------SHLQALRLAQQRGWQNYLLLEDDAVILKQEKHIQVLNALLASLAKIPW-QVMI 361 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 362 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 404 >gi|256020220|ref|ZP_05434085.1| glycosyl transferase family protein [Shigella sp. D9] Length = 830 Score = 39.9 bits (92), Expect = 0.36, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 281 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 327 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 328 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 379 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 380 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 422 >gi|260797407|ref|XP_002593694.1| hypothetical protein BRAFLDRAFT_107674 [Branchiostoma floridae] gi|229278922|gb|EEN49705.1| hypothetical protein BRAFLDRAFT_107674 [Branchiostoma floridae] Length = 144 Score = 39.9 bits (92), Expect = 0.39, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 16/68 (23%) Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYR--PID-------------MDMKHWWEHNIPSLV 192 S T GY I A LL+ K + + P+D M + N+ + Sbjct: 20 YSYWTLGYMITWRGAKKLLD-VKPLTKMIPVDEFLPMMFDDHPDKHYMYAFEPRNLKAFS 78 Query: 193 TEPGAVYE 200 EP +Y Sbjct: 79 AEPLMIYP 86 >gi|307136850|ref|ZP_07496206.1| putative glycosyltransferase [Escherichia coli H736] gi|315616835|gb|EFU97452.1| glycosyl transferase family 2 family protein [Escherichia coli 3431] Length = 814 Score = 39.9 bits (92), Expect = 0.39, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 73/253 (28%), Gaps = 46/253 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 265 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL------LNVRKNI 172 L + + + L +LPG R + Y + + A + L Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQMSNDEHSLEACWQP 419 Query: 173 YRPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTI----EESRLVRKPTFSPLYFYRN 228 D + + PSL + + I + + KP+ P+ Sbjct: 420 LLRADKWLACY-----PSLCYQRPGFSDIEKKTTDNIAHYFNKLPVATKPSTLPIADTIG 474 Query: 229 TCYQWNLHYNAWR 241 + + HY +R Sbjct: 475 FFMETSFHYTLYR 487 >gi|332281382|ref|ZP_08393795.1| glycosyl transferase group 2 family protein [Shigella sp. D9] gi|332103734|gb|EGJ07080.1| glycosyl transferase group 2 family protein [Shigella sp. D9] Length = 814 Score = 39.9 bits (92), Expect = 0.40, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 265 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 406 >gi|167522387|ref|XP_001745531.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775880|gb|EDQ89502.1| predicted protein [Monosiga brevicollis MX1] Length = 247 Score = 39.9 bits (92), Expect = 0.41, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 31/83 (37%), Gaps = 1/83 (1%) Query: 40 ENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR-IAYSPAIGAIILEDDADFS 98 + NP+ R + + + L+ E G + L++ + ++ +DDA Sbjct: 14 KGNPVLGRKWQGKLVFPRQPGYLTPGEQGYLATMRQLFEEAMQDQSISTLLVFDDDALLH 73 Query: 99 DEFSQLLPHLSKCDINNILIKFD 121 F++ L L + L+ Sbjct: 74 CNFTEALSELLSEERCGSLVDIH 96 >gi|323181538|gb|EFZ66958.1| glycosyl transferase family 2 family protein [Escherichia coli 1357] Length = 814 Score = 39.5 bits (91), Expect = 0.41, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 265 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRCYPQLAQQM 406 >gi|331671794|ref|ZP_08372590.1| glycosyl transferase, group 2 family [Escherichia coli TA280] gi|331070783|gb|EGI42142.1| glycosyl transferase, group 2 family [Escherichia coli TA280] Length = 847 Score = 39.5 bits (91), Expect = 0.42, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +H+ S + + +R Sbjct: 289 IVLLNLDKRPDRLQQIREELALLHIPPEKI----------TRLAASEDENGQRGRRQ--- 335 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 336 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 387 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 388 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 430 >gi|296213171|ref|XP_002753172.1| PREDICTED: UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 4-like [Callithrix jacchus] Length = 483 Score = 39.5 bits (91), Expect = 0.43, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 53/183 (28%), Gaps = 20/183 (10%) Query: 11 PFSHARREKFCH---RAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL------ 61 P RR RA L+ F + G P + ++ + Sbjct: 237 PGHVERRAAIRSTWGRARGQQLKLVFLLGVAGPTPPAQLLAYESREFDDILQWDFTEDFF 296 Query: 62 -LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 L+L E+ H+ W A A + +DD +L L D L+ Sbjct: 297 NLTLKEL-----HLQRWVAAACPQARFTLKGDDDVFV--HVPNVLEFLHGWDPAQDLLVG 349 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQPRILSP--RTTGYFIGKEAAIHLLNVRKNIYR-PID 177 D +R+ + +++ P GY + + L + PID Sbjct: 350 DIIRQALPNRNTKVKYFIPPSMYRASHYPPYAGGGGYVMSRATVQRLQAAVEEAELFPID 409 Query: 178 MDM 180 Sbjct: 410 DVF 412 >gi|37048702|gb|AAQ88213.1| envelope protein P32 [Goatpox virus] Length = 322 Score = 39.5 bits (91), Expect = 0.43, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 30/100 (30%), Gaps = 5/100 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIDMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 NF + Y I +A+ ++N Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSAMKIIN 198 >gi|281416143|ref|YP_003347879.1| gp231 [Mycobacterium phage ET08] gi|255927822|gb|ACU41443.1| gp231 [Mycobacterium phage ET08] Length = 220 Score = 39.5 bits (91), Expect = 0.44, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 48/154 (31%), Gaps = 22/154 (14%) Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 H W+ + S +++LEDDA F L + K + ++ R +P Sbjct: 41 HDVAWRWLDESGGTWSVVLEDDAIPVSNFRDQLDAVLKVAPS-PIVSLYLGRSRPPHWQP 99 Query: 133 ----LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH------LLNVRKNIYRPIDMDMKH 182 + + +F + + AI +L R P+D + Sbjct: 100 SIMQVISSREHFLLGSTLLHHV---------AVAIRTELIFDMLAQRDR-ELPVDESIGT 149 Query: 183 WW-EHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 W IP T P V + +E + Sbjct: 150 WARLREIPVAYTNPSIVNHEHRLPTTIVERTSGY 183 >gi|29566714|ref|NP_818281.1| gp231 [Mycobacterium phage Bxz1] gi|109393413|ref|YP_656212.1| gp234 [Mycobacterium phage Catera] gi|203457551|ref|YP_002224897.1| gp234 [Mycobacterium phage Rizal] gi|203459078|ref|YP_002224234.1| gp237 [Mycobacterium phage ScottMcG] gi|203460722|ref|YP_002224456.1| gp237 [Mycobacterium phage Spud] gi|204305860|ref|YP_002224677.1| gp235 [Mycobacterium phage Cali] gi|29425439|gb|AAN16863.1| gp231 [Mycobacterium phage Bxz1] gi|91981235|gb|ABE67950.1| gp234 [Mycobacterium phage Catera] gi|197312079|gb|ACH62436.1| gp234 [Mycobacterium phage Rizal] gi|197312303|gb|ACH62659.1| gp237 [Mycobacterium phage Spud] gi|197312525|gb|ACH62880.1| gp237 [Mycobacterium phage ScottMcG] gi|197312833|gb|ACH63186.1| gp235 [Mycobacterium phage Cali] gi|255928111|gb|ACU41730.1| gp237 [Mycobacterium phage LRRHood] Length = 220 Score = 39.5 bits (91), Expect = 0.44, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 48/154 (31%), Gaps = 22/154 (14%) Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSY 132 H W+ + S +++LEDDA F L + K + ++ R +P Sbjct: 41 HDVAWRWLDESGGTWSVVLEDDAIPVSNFRDQLDAVLKVAPS-PIVSLYLGRSRPPHWQP 99 Query: 133 ----LCTLPGNFDIHQPRILSPRTTGYFIGKEAAIH------LLNVRKNIYRPIDMDMKH 182 + + +F + + AI +L R P+D + Sbjct: 100 SIMQVISSREHFLLGSTLLHHV---------AVAIRTELIFDMLAQRDR-ELPVDESIGT 149 Query: 183 WW-EHNIPSLVTEPGAVYEAIDTNDSTIEESRLV 215 W IP T P V + +E + Sbjct: 150 WARLREIPVAYTNPSIVNHEHRLPTTIVERTSGY 183 >gi|61806140|ref|YP_214500.1| hypothetical protein PSSM2_269 [Prochlorococcus phage P-SSM2] gi|61374649|gb|AAX44646.1| hypothetical protein PSSM2_269 [Prochlorococcus phage P-SSM2] gi|265525352|gb|ACY76149.1| conserved hypothetical protein [Prochlorococcus phage P-SSM2] Length = 212 Score = 39.5 bits (91), Expect = 0.49, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 29/92 (31%), Gaps = 22/92 (23%) Query: 27 IHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK--------------RLLSLPEIGCYIS 72 ++L A+Y + F ++ F R L+ GC +S Sbjct: 1 MNLDLRTVSAVYINLKKDVKKDFEMKRLMVDFGFKNVIRVEGNVIPDRHLA----GCSLS 56 Query: 73 HIHLWKRIAYSPAIGAIILEDDADFSDEFSQL 104 H + I II EDD + ++ Sbjct: 57 HYNALHEI----DPPFIIFEDDCVIKNNTPEI 84 >gi|327284397|ref|XP_003226924.1| PREDICTED: UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 4-like, partial [Anolis carolinensis] Length = 404 Score = 39.5 bits (91), Expect = 0.50, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 38/125 (30%), Gaps = 14/125 (11%) Query: 61 LLSLPEIGCYISHIHLWKRIAYSPAIGAIIL--EDDADFSDEFSQLLPHLSKCDINNILI 118 L+L E+ H W + +L +DD + ++ L D L Sbjct: 218 NLTLKEL-----HFLRW--LVEDCPQTRFVLKGDDDVFVNTY--NIIEFLKDLDSGKDLF 268 Query: 119 KFDALRKKP--KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-P 175 D + K + +P + L GY + + A L ++ P Sbjct: 269 AGDVISKARPIRNTKAKYFIPESMYPAPFYPLYAGGGGYVMSQRTAQRLQATAEDTELFP 328 Query: 176 IDMDM 180 ID Sbjct: 329 IDDVF 333 >gi|327254544|gb|EGE66160.1| glycosyl transferase family 2 family protein [Escherichia coli STEC_7v] Length = 814 Score = 39.5 bits (91), Expect = 0.51, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 51/167 (30%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R + A +H+ R+ + + Q R L Sbjct: 265 IVLLNLDKRPDRLRQIREELALLHIP-----------PEKITRLAACEDENGQRGRRL-- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHAQDCRKV----CAYLVNSRYYPQLAQQM 406 >gi|34420272|gb|AAQ67421.1| envelope protein P32 [Sheeppox virus] Length = 323 Score = 39.2 bits (90), Expect = 0.53, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + N + + Sbjct: 100 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDSIIIHNIIIEVQEKNIDIFQLRETFHNS 159 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF P + Y I +A Sbjct: 160 NSRILFNQENNNFMYSYPGGYDFTLSAYVIRLSSA 194 >gi|312377431|gb|EFR24263.1| hypothetical protein AND_11270 [Anopheles darlingi] Length = 388 Score = 39.2 bits (90), Expect = 0.58, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 59/197 (29%), Gaps = 36/197 (18%) Query: 12 FSHARREKFCHRAARIHL-------------QFSFFDAIYGEN---NPICNRIFSHQKRQ 55 S R+ + + L +F AI E + F R Sbjct: 106 RSAHRQAISQQKLLSMGLLRIFSLATIPATERFIRQAAIEAEQRLHGDLIQGNFIEAYRN 165 Query: 56 CQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQL-LPHLSKCDIN 114 +K L+SL + +H + A I ++DD F + Q L L++ D Sbjct: 166 LTYKHLMSLQ----WATH-------SCRGAKYIIKMDDDIVFDPFYIQNHLSDLNQRDER 214 Query: 115 NILIKFDALRKKP---KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKN 171 +L F KK + + + + P LS Y + A LL + Sbjct: 215 YLLAGFVFRSKKVIRLRANKWYVSPAEYEKAVYPAYLSGWL--YITNQRTARALLAESQE 272 Query: 172 IYRPIDMDMKHWWEHNI 188 + + + Sbjct: 273 ASF---FWIDDTFITGV 286 >gi|291240268|ref|XP_002740042.1| PREDICTED: UDP-Gal:betaGlcNAc beta 1,3-galactosyltransferase 1-like [Saccoglossus kowalevskii] Length = 386 Score = 39.2 bits (90), Expect = 0.58, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 42/153 (27%), Gaps = 21/153 (13%) Query: 84 PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP--KKDSYLCTLPGNFD 141 A + ++DD L+P LS+ + + ++ KP K + + Sbjct: 214 NAKYVMKVDDDVFV--NLDNLIPLLSEAPREGYAVGYVYVQSKPIRKTWNKWYVSEEEWS 271 Query: 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR--PIDMDMKHWW---------EHNIPS 190 Y + + A +L + I D+ + + Sbjct: 272 YEFYPPYPTGP-AYVLSMDVARAVLKSARRIRMFRMEDVYIGMNLLKLSIKPVHHNGFDR 330 Query: 191 LVT---EPGAVYEAIDTNDSTIEESRLVRKPTF 220 P V I T+ I R+ P Sbjct: 331 YGICQSLPCCVRNVIATH--YITSVRMATLPRR 361 >gi|227537841|ref|ZP_03967890.1| glycosyltransferase [Sphingobacterium spiritivorum ATCC 33300] gi|227242455|gb|EEI92470.1| glycosyltransferase [Sphingobacterium spiritivorum ATCC 33300] Length = 222 Score = 39.2 bits (90), Expect = 0.65, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 42/119 (35%), Gaps = 24/119 (20%) Query: 3 IPVYVISLPFSHARREK-FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRL 61 IPVY+I+LP R + + + +A C+ +R Sbjct: 7 IPVYLINLPERKDRLKHSMTQFKDKSEFYITIIEA-------------------CKHER- 46 Query: 62 LSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 G ++S + K + I+ EDD F+ ++++ + + + N K Sbjct: 47 ---GNYGLWMSIVKAVKMAKDNDDDVMILCEDDHKFTSDYTKEILFRNIIESNEEGAKI 102 >gi|119947061|ref|YP_944741.1| methyltransferase type 11 [Psychromonas ingrahamii 37] gi|119865665|gb|ABM05142.1| Methyltransferase type 11 [Psychromonas ingrahamii 37] Length = 866 Score = 39.2 bits (90), Expect = 0.66, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 20/45 (44%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL 108 L IGC +S+ + ++ I EDD +F +F ++ Sbjct: 687 LGWIGCGLSYKFIIRKAKEQNFETITICEDDVEFKTDFESRYQNI 731 >gi|85059584|ref|YP_455286.1| acetate kinase [Sodalis glossinidius str. 'morsitans'] gi|123519194|sp|Q2NSJ4|ACKA_SODGM RecName: Full=Acetate kinase; AltName: Full=Acetokinase gi|84780104|dbj|BAE74881.1| acetate kinase [Sodalis glossinidius str. 'morsitans'] Length = 400 Score = 39.2 bits (90), Expect = 0.66, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 46/121 (38%), Gaps = 31/121 (25%) Query: 79 RIAYSPAIGA--IILEDDAD--------FSD--------EFSQLLPHLSKCDINNILIKF 120 RI + I++DD F+ + L + N+ + Sbjct: 91 RIVHGGERFIQSAIIDDDVVKGIENSIPFAPLHNPAHLIGIREALKEFPQLTQKNVAVFD 150 Query: 121 DALRKKPKKDSYLCTLPGNFDIHQ----PRILSPRTTGYFIGKEAAIHLLNVRKNIYRPI 176 A + ++SYL LP + ++Q R + T+ Y++ +EAA LL+ P+ Sbjct: 151 TAFHQTMPEESYLYALP--YSLYQDHAIRRYGAHGTSHYYVSREAA-KLLDK------PV 201 Query: 177 D 177 D Sbjct: 202 D 202 >gi|254786294|ref|YP_003073723.1| hypothetical protein TERTU_2275 [Teredinibacter turnerae T7901] gi|237686772|gb|ACR14036.1| hypothetical protein TERTU_2275 [Teredinibacter turnerae T7901] Length = 244 Score = 38.8 bits (89), Expect = 0.70, Method: Composition-based stats. Identities = 13/54 (24%), Positives = 26/54 (48%), Gaps = 1/54 (1%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKF 120 +GC+I+H + K I + +++EDD F +F + + + + I F Sbjct: 79 VGCWIAHNNALKSI-KDKSGFTLVIEDDLIFRKDFIAAIQNTLRNYSEHFDIAF 131 >gi|59889773|emb|CAH19128.1| Putative glysosyltransferase [Escherichia coli] gi|284920037|emb|CBG33093.1| putative glysosyltransferase [Escherichia coli 042] Length = 823 Score = 38.8 bits (89), Expect = 0.76, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 50/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ A +++ S + + +R Sbjct: 265 IVLLNLDKRPDRLQQIREELALLYIPPEKI----------TRLAASEDENGQRGRRQ--- 311 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ + ++LEDDA + Q+L L + Sbjct: 312 -------SHLQALRLAQQHGWQNYLLLEDDAVILKQEKHIQVLNTLLASLAKIPW-QVMI 363 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 364 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 406 >gi|251792684|ref|YP_003007410.1| acetate kinase [Aggregatibacter aphrophilus NJ8700] gi|247534077|gb|ACS97323.1| acetate kinase [Aggregatibacter aphrophilus NJ8700] Length = 444 Score = 38.8 bits (89), Expect = 0.80, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 37/98 (37%), Gaps = 16/98 (16%) Query: 80 IAYSPAIGAIILEDDADFSD--------EFSQLLPHLSKCDINNILIKFDALRKKPKKDS 131 I + I ED F+ + N+++ A + +++ Sbjct: 148 IVTDEVVKGI--EDAIQFAPLHNPAHLIGIKESFKLFPHLKDKNVVVFDTAFHQTMPEEA 205 Query: 132 YLCTLPGNFDIHQP----RILSPRTTGYFIGKEAAIHL 165 YL LP + +++ R + T+ Y++ +EAA L Sbjct: 206 YLYALP--YSLYREHGVRRYGAHGTSHYYVSREAAKRL 241 >gi|262201964|ref|YP_003273172.1| NAD-glutamate dehydrogenase [Gordonia bronchialis DSM 43247] gi|262085311|gb|ACY21279.1| NAD-glutamate dehydrogenase [Gordonia bronchialis DSM 43247] Length = 1578 Score = 38.8 bits (89), Expect = 0.81, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 46/140 (32%), Gaps = 16/140 (11%) Query: 17 REKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHL 76 + H+ AR + + + ++ + + R L+ PE+ ++H+ L Sbjct: 1204 HARMLHQMARER-------GVDLRLEALPDAAELRKRLRGEGHRGLTSPELATLMAHVKL 1256 Query: 77 WKR--IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLC 134 + + S + DA F + L D + I+ LR++ + + Sbjct: 1257 LAKADLLASDLPDNDVF--DARVGRYFPRRLA-----DEYSSAIRAHRLRREIVTTTLVN 1309 Query: 135 TLPGNFDIHQPRILSPRTTG 154 + I L T Sbjct: 1310 DVVDQAGITHLFRLGEGTGA 1329 >gi|306842199|ref|ZP_07474866.1| Glycosyltransferase [Brucella sp. BO2] gi|306287693|gb|EFM59131.1| Glycosyltransferase [Brucella sp. BO2] Length = 877 Score = 38.8 bits (89), Expect = 0.82, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 71/217 (32%), Gaps = 42/217 (19%) Query: 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLS 63 ++ +SLP + +RR P + IF + + Sbjct: 662 NIFALSLPETISRRLMLMA------------------TKPQESVIFDAARFR-------- 695 Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINNILIK 119 +GC +S L + EDD + ++ + +L + Sbjct: 696 PGWLGCGMSFNLLANYALAKNLDQLTVFEDDVELHADYRSKISNIHEYLESRKEPWDV-- 753 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY-RPIDM 178 F + + D+ + ++ + + I + Y I ++A++LL D+ Sbjct: 754 FSGIMAEVHPDTRVLSVDVHNGVTYITINRMVSMVYNIYNKSALNLLA---RWDSENHDV 810 Query: 179 ---DMKHWWEH--NIPSLVTEPGAVYEAIDTNDSTIE 210 + + ++ ++ +V+ P V + ST+ Sbjct: 811 NTNVIDRYLQNIGDLKVVVSLPFLVGHR-EELHSTLW 846 >gi|115384416|ref|XP_001208755.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114196447|gb|EAU38147.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 359 Score = 38.8 bits (89), Expect = 0.82, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 50/172 (29%), Gaps = 41/172 (23%) Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQ----------LLPHLSKCDIN----- 114 +++H+ L K + S A+ILEDD D+ + L HL + Sbjct: 133 WLAHLDLIKHVLQSNLDTALILEDDVDWDRALREQMVRTASAVRNLTHLPPHQESDAPYG 192 Query: 115 --NILIKFDA--------------------LRKKPKKDS--YLCTLPGNFDIHQPRILSP 150 ++ K + Y+ LP Sbjct: 193 RAWDVLWIGHCGEYWDEGVETVVFEDTGVCPHDKYFGWAGQYVSRLPDRKRAVYWSYNPV 252 Query: 151 RTTGYFIGKEAAIHLLNV-RKNIYRPIDMDMKHW-WEHNIPSLVTEPGAVYE 200 + Y + + A +L + D+ M H + + + P V++ Sbjct: 253 CSFAYGMTRAGARRVLEQVGGSQDESFDVAMMHACRQRRLDCISVVPEVVHQ 304 >gi|195997153|ref|XP_002108445.1| hypothetical protein TRIADDRAFT_51346 [Trichoplax adhaerens] gi|190589221|gb|EDV29243.1| hypothetical protein TRIADDRAFT_51346 [Trichoplax adhaerens] Length = 409 Score = 38.8 bits (89), Expect = 0.84, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 23/58 (39%), Gaps = 8/58 (13%) Query: 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDIN--------NILIKFDALRK 125 + K+ + A IILEDDA + +F + L ++ + IK K Sbjct: 186 ILKKCIQTNARYIIILEDDALPTTDFIENLHYIIHKTLPTDSLSRSTWAFIKLYYPEK 243 >gi|152979319|ref|YP_001344948.1| acetate kinase [Actinobacillus succinogenes 130Z] gi|150841042|gb|ABR75013.1| acetate kinase [Actinobacillus succinogenes 130Z] Length = 404 Score = 38.8 bits (89), Expect = 0.86, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 16/98 (16%) Query: 80 IAYSPAIGAIILEDDADFSD--------EFSQLLPHLSKCDINNILIKFDALRKKPKKDS 131 I + I ED F+ + N+++ A + +++ Sbjct: 103 IVTDEVVKGI--EDAIQFAPLHNPAHLIGIKEAFKIFPHLKDKNVVVFDTAFHQTMPEEA 160 Query: 132 YLCTLPGNFDIHQP----RILSPRTTGYFIGKEAAIHL 165 +L LP + +++ R + T+ YFI +EAA L Sbjct: 161 FLYALP--YSLYKEHGIRRYGAHGTSHYFISREAAKRL 196 >gi|146231444|gb|ABQ12870.1| envelope protein [Goatpox virus] Length = 322 Score = 38.4 bits (88), Expect = 0.91, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 30/100 (30%), Gaps = 5/100 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 NF + Y I +A+ ++N Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSAVKIIN 198 >gi|52425054|ref|YP_088191.1| acetate kinase [Mannheimia succiniciproducens MBEL55E] gi|52307106|gb|AAU37606.1| ackA protein [Mannheimia succiniciproducens MBEL55E] Length = 423 Score = 38.4 bits (88), Expect = 0.93, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 16/98 (16%) Query: 80 IAYSPAIGAIILEDDADFSD--------EFSQLLPHLSKCDINNILIKFDALRKKPKKDS 131 I + I ED F+ + N+++ A + +++ Sbjct: 120 IVTDEVVKGI--EDAIQFAPLHNPAHLIGIKEAFKIFPHLKDKNVVVFDTAFHQTMPEEA 177 Query: 132 YLCTLPGNFDIHQP----RILSPRTTGYFIGKEAAIHL 165 YL LP + +++ R + T+ YF+ +EAA L Sbjct: 178 YLYALP--YSLYKEHGVRRYGAHGTSHYFVSREAAKRL 213 >gi|60389368|sp|Q65TV4|ACKA_MANSM RecName: Full=Acetate kinase; AltName: Full=Acetokinase Length = 407 Score = 38.4 bits (88), Expect = 0.93, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 16/98 (16%) Query: 80 IAYSPAIGAIILEDDADFSD--------EFSQLLPHLSKCDINNILIKFDALRKKPKKDS 131 I + I ED F+ + N+++ A + +++ Sbjct: 104 IVTDEVVKGI--EDAIQFAPLHNPAHLIGIKEAFKIFPHLKDKNVVVFDTAFHQTMPEEA 161 Query: 132 YLCTLPGNFDIHQP----RILSPRTTGYFIGKEAAIHL 165 YL LP + +++ R + T+ YF+ +EAA L Sbjct: 162 YLYALP--YSLYKEHGVRRYGAHGTSHYFVSREAAKRL 197 >gi|315634144|ref|ZP_07889433.1| acetate kinase [Aggregatibacter segnis ATCC 33393] gi|315477394|gb|EFU68137.1| acetate kinase [Aggregatibacter segnis ATCC 33393] Length = 400 Score = 38.4 bits (88), Expect = 1.0, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 34/96 (35%), Gaps = 12/96 (12%) Query: 80 IAYSPAIGAIILEDDADFSD--------EFSQLLPHLSKCDINNILIKFDALRKKPKKDS 131 I + I ED F+ + N+++ A + +++ Sbjct: 104 IVTDEVVKGI--EDAIQFAPLHNPAHLIGIKEAFKLFPYLKEKNVVVFDTAFHQTMPEEA 161 Query: 132 YLCTLP-GNFDIHQPRILSP-RTTGYFIGKEAAIHL 165 YL LP + H R T+ Y++ +EAA L Sbjct: 162 YLYALPYSLYRDHGVRRYGAHGTSHYYVSREAAKRL 197 >gi|306813312|ref|ZP_07447505.1| putative glycosyltransferase [Escherichia coli NC101] gi|305854075|gb|EFM54514.1| putative glycosyltransferase [Escherichia coli NC101] Length = 812 Score = 38.4 bits (88), Expect = 1.0, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 49/167 (29%), Gaps = 31/167 (18%) Query: 5 VYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSL 64 + +++L R ++ +H+ + + + +R Q Sbjct: 263 IVLLNLDKRPDRLQQIREELTLLHIPPE-------KITRLAASENENGQRGRQQ------ 309 Query: 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFS--QLLPHLSKCDINNILIKFDA 122 SH+ ++LEDDA + Q+L L + Sbjct: 310 -------SHLQALHLAQQHGWQNYLLLEDDAVILKQEKHIQVLNALLASLAKIPW-QVMI 361 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIG----KEAAIHL 165 L + + + L +LPG R + Y + + A + Sbjct: 362 LGGEISQGTMLKSLPGLVHARDCRKV----CAYLVNSRYYPQLAQQM 404 >gi|146231438|gb|ABQ12867.1| envelope protein [Goatpox virus] Length = 322 Score = 38.4 bits (88), Expect = 1.1, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 30/100 (30%), Gaps = 5/100 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 NF + Y I +AI ++N Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSAIKVIN 198 >gi|159474068|ref|XP_001695151.1| predicted protein [Chlamydomonas reinhardtii] gi|158276085|gb|EDP01859.1| predicted protein [Chlamydomonas reinhardtii] Length = 319 Score = 38.0 bits (87), Expect = 1.3, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 49/127 (38%), Gaps = 20/127 (15%) Query: 87 GAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDALRKKPKKDSYLCTLPGNF----- 140 +++EDDADF++E +L +++ D ++ D + L F Sbjct: 189 HLLLIEDDADFNEERVAMLQRTVAQLDPCYDMVGLDYKESFCSLQRFKDGLASWFLPRSW 248 Query: 141 ----DIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWE---HN--IPSL 191 + + ++ RTTG + A+ + + P+ ++ W+ + + Sbjct: 249 RSSPHLVRAKLAFSRTTGLLFSYKGALRIQSA-----MPVTRELDLWYRDLITDGLLKVY 303 Query: 192 VTEPGAV 198 VT P V Sbjct: 304 VTCPRVV 310 >gi|270261094|ref|ZP_06189367.1| hypothetical protein SOD_a03190 [Serratia odorifera 4Rx13] gi|270044578|gb|EFA17669.1| hypothetical protein SOD_a03190 [Serratia odorifera 4Rx13] Length = 1152 Score = 38.0 bits (87), Expect = 1.4, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 4/59 (6%) Query: 64 LPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEF----SQLLPHLSKCDINNILI 118 + IGC +S+ ++ +A A+I EDD F ++F +++ HL + + + Sbjct: 967 IGWIGCGMSYKYMLSTLADRKTEMAMICEDDVLFPEDFDIQLEKVVGHLRSTEWDWHVF 1025 >gi|18075747|emb|CAD11297.1| lipopolysacharide biosynthesis glycosyl transferase [Helicobacter pylori] gi|18075751|emb|CAD11300.1| lipopolysacharide biosynthesis glycosyl transferase [Helicobacter pylori] Length = 87 Score = 38.0 bits (87), Expect = 1.5, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 36/85 (42%), Gaps = 7/85 (8%) Query: 174 RPIDMDMKHWWEHNIPSLVTEPGAVYEAIDTNDSTIEESR---LVRKPTFSP----LYFY 226 P+DM M + H++ + P + +STI+ ++ + KP L+++ Sbjct: 1 EPVDMFMNNPTYHDVANFTYLPCPISLNKHAFNSTIQNAKKPDISLKPPRKSYFDNLFYH 60 Query: 227 RNTCYQWNLHYNAWRKDLPPVSTTK 251 + + ++ + + P+ T K Sbjct: 61 KFNAQKCLKAFHKYSRRYAPLKTPK 85 >gi|240167863|ref|ZP_04746522.1| short chain dehydrogenase [Mycobacterium kansasii ATCC 12478] Length = 668 Score = 38.0 bits (87), Expect = 1.5, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 16/79 (20%) Query: 9 SLPFSHAR-------REKFCHRAA--RIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFK 59 SLP S A R K A ++ + FDA+ + S + R+ Sbjct: 274 SLPRSVAAPVLNASGRAKVLRNMAAAQLGIPAEIFDAVD-----LKPTFASDETRKALHG 328 Query: 60 RLLSLPEIGCYISHIHLWK 78 + +PE G Y LW+ Sbjct: 329 TGIEVPEFGAYAP--RLWR 345 >gi|23097521|ref|NP_690987.1| hypothetical protein OB0066 [Oceanobacillus iheyensis HTE831] gi|22775744|dbj|BAC12022.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831] Length = 489 Score = 37.6 bits (86), Expect = 1.6, Method: Composition-based stats. Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 14/104 (13%) Query: 66 EIGCYISHIHLWKRIAYSPAIGAIILEDDADFS------DEFSQLLPHLSKCDINNILIK 119 E+G + H+ L +I +I DD S + + + D N + Sbjct: 284 ELGDLLMHVLLHSQIGEDNGYFSI---DDVILSITNKMIHRHPHVFSNQNVEDANQVEKN 340 Query: 120 FDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAI 163 ++ L+++ K L G P L + Y I K+AA Sbjct: 341 WEELKREEKGHQRELLLDG-----IPNHLPSLSAAYKITKKAAK 379 >gi|145341140|ref|XP_001415672.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144575895|gb|ABO93964.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 383 Score = 37.6 bits (86), Expect = 1.9, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 32/102 (31%), Gaps = 10/102 (9%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDE--FSQLLPHLSKCDINNILIKFDALR 124 +GC +H+ W+ + A+I E D + F++ + + + L+ F Sbjct: 218 VGCMYAHLFQWQLVKEQKLKKALIFESDGVQQTDLPFAEAQKAIDRLPEDADLLLFSFGN 277 Query: 125 KKPKKDSYLCTLPGN----FDIHQPRIL----SPRTTGYFIG 158 G+ D+H + Y I Sbjct: 278 TPGGDLVDQWKGHGDDGTPVDVHLYKWNKWNAVAGLQSYVIT 319 >gi|116192239|ref|XP_001221932.1| hypothetical protein CHGG_05837 [Chaetomium globosum CBS 148.51] gi|88181750|gb|EAQ89218.1| hypothetical protein CHGG_05837 [Chaetomium globosum CBS 148.51] Length = 440 Score = 37.6 bits (86), Expect = 2.0, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 35/103 (33%), Gaps = 1/103 (0%) Query: 51 HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK 110 + + +F+ E G ++ H ++ + + + +E + F + Sbjct: 34 NTSEEHRFRFDELPDEFGNFL-HRRAYEEMVRTLHLPFRAIEGTSAVGPFFWAAFDQDDE 92 Query: 111 CDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 I+ + +RKK K + L +F H + T Sbjct: 93 DPHLQIIFRKSDVRKKGKTRGWELMLSHSFRTHITSGFAKGTA 135 >gi|241676967|ref|XP_002412578.1| GLT25D1 protein, putative [Ixodes scapularis] gi|215506380|gb|EEC15874.1| GLT25D1 protein, putative [Ixodes scapularis] Length = 143 Score = 37.2 bits (85), Expect = 2.1, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 27/83 (32%), Gaps = 18/83 (21%) Query: 148 LSPRTTGYFIGKEAAIHLLNVRKNIYR-PIDMDMK-HWWEH------------NIPSLVT 193 S T Y + A LL+ + P+D + + +H ++ + Sbjct: 30 YSYWTLCYALTLAGARKLLDAQPLSKMVPVDEYLPIMFDKHPEATWTAHFPNRDLKAFSV 89 Query: 194 EPGAVYE----AIDTNDSTIEES 212 P VY + S E+S Sbjct: 90 YPLLVYPTHYTGEENYISDTEDS 112 >gi|294880211|ref|XP_002768924.1| hypothetical protein Pmar_PMAR008223 [Perkinsus marinus ATCC 50983] gi|239871953|gb|EER01642.1| hypothetical protein Pmar_PMAR008223 [Perkinsus marinus ATCC 50983] Length = 289 Score = 37.2 bits (85), Expect = 2.2, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 20/53 (37%), Gaps = 2/53 (3%) Query: 70 YISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHL--SKCDINNILIKF 120 Y+ HI + II E+DA +D F +K + ++K Sbjct: 42 YVDHIRALDECVSLNSTWCIIFENDAMLTDHFLAKFHKYVAAKAPSDTAMVKL 94 >gi|291226171|ref|XP_002733068.1| PREDICTED: conserved hypothetical protein-like, partial [Saccoglossus kowalevskii] Length = 123 Score = 37.2 bits (85), Expect = 2.4, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 2/60 (3%) Query: 122 ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-PIDMDM 180 + +K S + G+ + P T GY I ++ A LL+ P+D + Sbjct: 1 YIGRKIMDSSKESWVRGSHYLVWPYYTY-WTLGYIINQQGARKLLDANPLPKMVPVDEYL 59 >gi|157939697|ref|YP_001497069.1| IMV envelope protein p35 [Tanapox virus] gi|9719381|gb|AAF97774.1| J4R [Tanapox virus] gi|146746413|gb|ABQ43549.1| IMV envelope protein p35 [Tanapox virus] gi|146746569|gb|ABQ43704.1| IMV envelope protein p35 [Tanapox virus] Length = 323 Score = 37.2 bits (85), Expect = 2.5, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 33/105 (31%), Gaps = 15/105 (14%) Query: 73 HIHLWKRIAYSPAI-----GAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LW++ A + ++LEDD D L + C + I D L+ + Sbjct: 100 HFSLWEQYAKANIKDPEKKFFVVLEDDNTLHD-----LTTIHVCISSMIEKNIDILQLRE 154 Query: 128 KKDSYLCTLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLN 167 + N Q + Y I AI +N Sbjct: 155 TMYNNNVRTLLNPGYKQAMYSYTGGYDFSLSAYIIRVSTAIRFIN 199 >gi|134078885|emb|CAK40570.1| unnamed protein product [Aspergillus niger] Length = 386 Score = 37.2 bits (85), Expect = 2.5, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 44/175 (25%), Gaps = 44/175 (25%) Query: 34 FDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILED 93 D + G I + + E G +H++ +RI + A+I+ED Sbjct: 87 IDWVNG----ITPDEVDPRTYPYNWNYDHKPTEYGARRAHVNAMQRIVHERIGSAMIMED 142 Query: 94 DADFSDEFSQLLPH----------------LSKCDINNILIKF----------------- 120 D D+ L S + +I Sbjct: 143 DVDWDVNLKTQLQSFALGVRALQGADKKVTASPYGDDWDIIWLGHCGMECRTNEPYFVSS 202 Query: 121 -DALRKKPKKDSYLCTLPGNFDIHQPRILSP------RTTGYFIGKEAAIHLLNV 168 D P P F I L+ + Y + A +L+ Sbjct: 203 NDTTVLPPHHFLPYWRDPPPFGIPDYTRLTCAVNDAVCSIVYAVSYHGAQKILSA 257 >gi|294669017|ref|ZP_06734103.1| hypothetical protein NEIELOOT_00931 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291309009|gb|EFE50252.1| acto-N-neotetraose biosynthesis glycosyltransferase LgtB [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 35 Score = 36.8 bits (84), Expect = 2.8, Method: Composition-based stats. Identities = 9/28 (32%), Positives = 12/28 (42%) Query: 7 VISLPFSHARREKFCHRAARIHLQFSFF 34 VISL + ARR+ + F F Sbjct: 6 VISLSSAQARRKHIADTFGARGIPFRFL 33 >gi|308509706|ref|XP_003117036.1| hypothetical protein CRE_01997 [Caenorhabditis remanei] gi|308241950|gb|EFO85902.1| hypothetical protein CRE_01997 [Caenorhabditis remanei] Length = 362 Score = 36.8 bits (84), Expect = 3.0, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 35/83 (42%), Gaps = 5/83 (6%) Query: 77 WKRIAYS-PAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT 135 WK +A + + +++EDDA EFS LL L + ++ + F L +YL Sbjct: 164 WKCMALTTDSRYILLIEDDAVVIPEFSNLLKSLVRKLDDHEYVDFVKLY----HPNYLRK 219 Query: 136 LPGNFDIHQPRILSPRTTGYFIG 158 LP + I + Y I Sbjct: 220 LPSYALMGAVSISLSFFSCYAIK 242 >gi|319943088|ref|ZP_08017371.1| transketolase [Lautropia mirabilis ATCC 51599] gi|319743630|gb|EFV96034.1| transketolase [Lautropia mirabilis ATCC 51599] Length = 752 Score = 36.8 bits (84), Expect = 3.0, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 19/61 (31%), Gaps = 4/61 (6%) Query: 8 ISLPFSHA--RREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLP 65 IS+ RE R A DA+ G + + + + + R + Sbjct: 255 ISIDGDVKGWYRENVAERFAAYGWN--VIDAVDGHDAWAVAQAIAQARDWGETGRPVVDG 312 Query: 66 E 66 E Sbjct: 313 E 313 >gi|47096446|ref|ZP_00234039.1| transcriptional antiterminator, bglG family, putative [Listeria monocytogenes str. 1/2a F6854] gi|254900630|ref|ZP_05260554.1| hypothetical protein LmonJ_12479 [Listeria monocytogenes J0161] gi|254913669|ref|ZP_05263681.1| conserved hypothetical protein [Listeria monocytogenes J2818] gi|254938000|ref|ZP_05269697.1| PRD/PTS system IIA 2 domain-containing regulatory protein [Listeria monocytogenes F6900] gi|47015167|gb|EAL06107.1| transcriptional antiterminator, bglG family, putative [Listeria monocytogenes str. 1/2a F6854] gi|258610612|gb|EEW23220.1| PRD/PTS system IIA 2 domain-containing regulatory protein [Listeria monocytogenes F6900] gi|293591682|gb|EFG00017.1| conserved hypothetical protein [Listeria monocytogenes J2818] Length = 653 Score = 36.8 bits (84), Expect = 3.0, Method: Composition-based stats. Identities = 32/141 (22%), Positives = 62/141 (43%), Gaps = 12/141 (8%) Query: 91 LEDDADFSDEFSQLLP-HLSKCDINNILIKFDALRKKPKK--DSYLCTLPGNFDIHQPRI 147 E+ F +F Q +++ + I + F+A +++ + D L + + R+ Sbjct: 373 FEEALYFKAKFEQKYDVQVNEDETAYIALHFEAYKERSRSFPDQIRVVLVCSTGLGSSRL 432 Query: 148 LSPRTTGYF--IGKEAAIHLLNVRKNIYRPIDMDMK----HWWEHNIPSLVTEPGAVYEA 201 L+ R YF I E +L+V+ + +D+D+ + +IPS+V P Sbjct: 433 LAARIKKYFPMITIE---KILSVQALMETEVDVDLVISTIYLELEDIPSIVVSPMMSKAD 489 Query: 202 IDTNDSTIEESRLVRKPTFSP 222 + +S IE +RL +K P Sbjct: 490 LQQVESQIERTRLKKKKRSQP 510 >gi|12085057|ref|NP_073459.1| 74L protein [Yaba-like disease virus] gi|12056233|emb|CAC21312.1| 74L protein [Yaba-like disease virus] Length = 323 Score = 36.8 bits (84), Expect = 3.1, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 33/105 (31%), Gaps = 15/105 (14%) Query: 73 HIHLWKRIAYSPAI-----GAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LW++ A + ++LEDD D L + C + I D L+ + Sbjct: 100 HFSLWEQYAKANIKDPEKKFFVVLEDDNTLHD-----LTTIHVCISSMIEKNIDILQLRE 154 Query: 128 KKDSYLCTLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLN 167 + N Q + Y I AI +N Sbjct: 155 IMYNNNVRTLLNPGYKQAMYSYTGGYDFSLSAYIIRVSTAIRFIN 199 >gi|238507942|ref|XP_002385172.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|220688691|gb|EED45043.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 294 Score = 36.8 bits (84), Expect = 3.1, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 39/104 (37%), Gaps = 24/104 (23%) Query: 10 LPFSHARREKFCHRAARIHLQFS-------FFDAIYGENNPICNRIFSHQKRQCQFKRLL 62 +P +R+ A LQF F D I + P + H K Sbjct: 1 MPNRPDKRDYIT--LASSILQFHPEPVNGVFVDDIDKKAYP---SNWDHGK--------- 46 Query: 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLP 106 E+G + +H+++ +R A +LEDDAD+ + L Sbjct: 47 LPAEMGAWRAHMNVMQR---DRISTAFVLEDDADWDVNLKKQLQ 87 >gi|196010119|ref|XP_002114924.1| predicted protein [Trichoplax adhaerens] gi|190582307|gb|EDV22380.1| predicted protein [Trichoplax adhaerens] Length = 1702 Score = 36.8 bits (84), Expect = 3.3, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 43/136 (31%), Gaps = 14/136 (10%) Query: 18 EKFCHRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQFKRLLSLPE---- 66 E R A D I+ + + I+ K E Sbjct: 1181 EMIATRLANKLKNAPVVDTIWWKLVIVISAKDLCNVNIYKSLLSVYGIKENFRESENFTI 1240 Query: 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKK 126 C ISH+ R+ + I+ EDD S + +N+ ++ ++++ Sbjct: 1241 KTCLISHV---HRVINANLCYRIVFEDDITDSTLIGTNGILFLLNETDNVSLQSHWMKQR 1297 Query: 127 PKKDSYLCTLPGNFDI 142 + + +L N+ I Sbjct: 1298 KRLEKLFKSLSNNYQI 1313 >gi|291233267|ref|XP_002736575.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 283 Score = 36.5 bits (83), Expect = 3.5, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 51/152 (33%), Gaps = 27/152 (17%) Query: 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCT 135 ++ A IIL+DDA +F +L H+ K I N K + ++ + + Sbjct: 57 CLDAVSRMNASYTIILQDDALVRWDFFDILDHILKTRIENTYEKGELVKNERLDEVAAVK 116 Query: 136 LP-----GNFDIHQPRILSPRTTG------------YFIGKEA-----AIH---LLNVRK 170 L + Q +L+ G Y I + A LL+V Sbjct: 117 LFMPEKWQGYSWSQKTVLTLFAIGCTWGPPLAAIYIYLINQHVNTSYLARLLVFLLSVFY 176 Query: 171 NIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAI 202 + P+ + + + + + +Y A Sbjct: 177 VMLLPV--LIGQHYVLELRRISVQLYNMYHAP 206 >gi|90024994|gb|ABD85023.1| P32 attachment protein [Sheeppox virus strain Ranipet] Length = 265 Score = 36.5 bits (83), Expect = 3.8, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 24/95 (25%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + N + + Sbjct: 100 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDSIIIHNIIIEMQKKNIDIFQLRETFHNS 159 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 160 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 194 >gi|21492527|ref|NP_659646.1| IMV envelope protein p35 [Sheeppox virus] gi|4884709|gb|AAD31777.1|AF124517_4 P32 antigen [sheeppox virus] gi|163676293|gb|ABY40376.1| envelope protein [Sheeppox virus] gi|194267707|gb|ACF35790.1| major envelope protein [Sheeppox virus] gi|237690220|gb|ACR15801.1| envelope protein [Sheeppox virus] gi|237825461|gb|ACR20671.1| P32 envelope protein [Sheeppox virus] gi|311088580|gb|ADP68577.1| envelope protein [Sheeppox virus] Length = 323 Score = 36.5 bits (83), Expect = 3.8, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 25/95 (26%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + N + + Sbjct: 100 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDSIIIHNIIIEMQEKNIDIFQLRETFHNS 159 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 160 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 194 >gi|308799085|ref|XP_003074323.1| unnamed protein product [Ostreococcus tauri] gi|116000494|emb|CAL50174.1| unnamed protein product [Ostreococcus tauri] Length = 605 Score = 36.5 bits (83), Expect = 3.9, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 2/54 (3%) Query: 43 PICNRIFSHQKRQCQFKRLLSLPE--IGCYISHIHLWKRIAYSPAIGAIILEDD 94 ++ + C+ L E GC +SH+ +WK+ ++ E D Sbjct: 344 NEVKQVLDSKDHVCRTDSCLGHREHHFGCLLSHMAVWKKALERGEERVVLWEQD 397 >gi|145352750|ref|XP_001420700.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144580935|gb|ABO98993.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 714 Score = 36.5 bits (83), Expect = 4.1, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 37/125 (29%), Gaps = 23/125 (18%) Query: 67 IGCYISHIHLWKRIAYSPAIGAIILE------------DDADFSDEFSQLLPHLSKCDIN 114 +G +SH+ +WK+ S A ++ E D A+ E + + Sbjct: 397 LGSMLSHMAVWKKALDSGADSFVVWESTALAKHAVSPLDYAELESELPENTDMVFLEPDY 456 Query: 115 NILIKFDALRKKPKKDSYLCTLPG-------NFDIHQPRILSP----RTTGYFIGKEAAI 163 +F K ++ G + ++ + T+ K A Sbjct: 457 LSPGQFVKKIKSSATGTWGEHTHGAEECVNQSSSVYLYKHNHMCGGVGTSALMFTKRGAQ 516 Query: 164 HLLNV 168 L + Sbjct: 517 RLRDY 521 >gi|146231442|gb|ABQ12869.1| envelope protein [Goatpox virus] Length = 322 Score = 36.5 bits (83), Expect = 4.2, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 27/95 (28%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF ++ Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTSSAYVIRLSSA 193 >gi|38229236|ref|NP_938329.1| 74L [Yaba monkey tumor virus] gi|38000507|gb|AAR07430.1| 74L [Yaba monkey tumor virus] Length = 320 Score = 36.5 bits (83), Expect = 4.4, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 34/108 (31%), Gaps = 15/108 (13%) Query: 73 HIHLWKR-----IAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LW++ I ++LEDD D L L C + K D L+ + Sbjct: 97 HFSLWEQYTNVEIKDPEKKFFVVLEDDNTLHD-----LTTLHICISSMFEKKIDILQLRE 151 Query: 128 KKDSYLCTLPGNFDIHQPRILSPR-----TTGYFIGKEAAIHLLNVRK 170 + N Q + Y I A+ L+N K Sbjct: 152 IMYNNNVRTLLNQGYKQSMYSYTGGYDFSLSAYIIRVSTAMALINNIK 199 >gi|170576926|ref|XP_001893815.1| GLT25D1 protein [Brugia malayi] gi|158599945|gb|EDP37347.1| GLT25D1 protein, putative [Brugia malayi] Length = 111 Score = 36.5 bits (83), Expect = 4.5, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 25/85 (29%), Gaps = 23/85 (27%) Query: 148 LSPRTTGYFIGKEAAIHLLN-VRKNIYRPIDMDMK-HWWEH------------NIPSLVT 193 S T GY + A LL P+D + + +H N+ + Sbjct: 9 YSYWTLGYVLSLNGARKLLAGNPLKKLLPVDEYIPIMFNKHPNTTWSNAFPHRNLIAYTI 68 Query: 194 EPGAVYEA---------IDTNDSTI 209 P V DT DS I Sbjct: 69 YPTIVVPERYTYQHGYFSDTEDSDI 93 >gi|301121818|ref|XP_002908636.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262103667|gb|EEY61719.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 434 Score = 36.1 bits (82), Expect = 4.9, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 1/54 (1%) Query: 14 HARREKFCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEI 67 R E + +++ Q FDA+ + + SH+ KR + +I Sbjct: 17 QERIESVREQLSQLQDQVDRFDAV-ADTLTELPKKLSHKVMVPLGKRAMVPGKI 69 >gi|13876683|gb|AAK43571.1| envelope protein [lumpy skin disease virus] Length = 323 Score = 36.1 bits (82), Expect = 4.9, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 28/96 (29%), Gaps = 6/96 (6%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIHNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRT-TGYFIGKEAA 162 NF R T + Y I +A Sbjct: 159 NSRILFNQENNNFMYSLTRGGYDFTLSAYVIRLSSA 194 >gi|255950410|ref|XP_002565972.1| Pc22g20720 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592989|emb|CAP99360.1| Pc22g20720 [Penicillium chrysogenum Wisconsin 54-1255] Length = 337 Score = 36.1 bits (82), Expect = 5.2, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 6/70 (8%) Query: 32 SFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIIL 91 D++ N + + + + E G Y +I+L + + A ++L Sbjct: 57 RVVDSVSTYNVSAEEFRYLQKLEEERNFY-----EKGVY-DYIYLLNQCLETGAPYMLVL 110 Query: 92 EDDADFSDEF 101 EDD F+D + Sbjct: 111 EDDVIFADGW 120 >gi|239977163|sp|B4S1F8|MIAA_ALTMD RecName: Full=tRNA dimethylallyltransferase; AltName: Full=Dimethylallyl diphosphate:tRNA dimethylallyltransferase; Short=DMAPP:tRNA dimethylallyltransferase; Short=DMATase; AltName: Full=Isopentenyl-diphosphate:tRNA isopentenyltransferase; Short=IPP transferase; Short=IPPT; Short=IPTase Length = 301 Score = 36.1 bits (82), Expect = 5.3, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 54/184 (29%), Gaps = 32/184 (17%) Query: 8 IS-LPFSHAR-REKFCHRAARIHL-----QFSFFDAIYGENN--PICNRIFSHQKRQCQF 58 IS LP S + R++ +A R+ + D + GE RI + Sbjct: 108 ISPLPKSDEKIRDEITQQAQRLGWSKLHDELRSVDPVSGERIHPNDPQRITRALEVYRNT 167 Query: 59 KRLLSLPEIGCYISHIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDI 113 + L+ W+ + Y+ A AI ED A + + + + Sbjct: 168 GKTLTY------------WQQQKGEKCPYNIAQFAIAPEDRAVLHNRIATRFDMMLEQGF 215 Query: 114 NNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT-----GYFIGKEAAIHLLNV 168 ++K R +D G + Q G ++ A L Sbjct: 216 EKEVVKL-YERSDLHEDLPSIRSVGYRQMWQYLDGHLSYAEMRERGIIATRQLAKRQLTW 274 Query: 169 RKNI 172 + Sbjct: 275 LRGW 278 >gi|40556124|ref|NP_955209.1| CNPV186 putative IMV envelope protein [Canarypox virus] gi|40233949|gb|AAR83532.1| CNPV186 putative IMV envelope protein [Canarypox virus] Length = 330 Score = 36.1 bits (82), Expect = 5.4, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 44/106 (41%), Gaps = 10/106 (9%) Query: 71 ISHIHLWKRIAYSPA---IGAIILEDD-----ADFSDEFSQLLPHLSKCDINNILIKFDA 122 +HI LWK + + I++ED+ F + ++ L K +I+ + + Sbjct: 104 ANHISLWKYVIDNSDKLSEYIIVIEDNNTITSEVFITKIDDIIAVLKKNNIDFLQLVTSN 163 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K + + + LP + +I++ + + Y + KE L + Sbjct: 164 NLLKDRTSNRIMILP-DLEIYK-GGIDISLSAYIVSKETINKLYSY 207 >gi|238756674|ref|ZP_04617960.1| Glutamate--cysteine ligase [Yersinia ruckeri ATCC 29473] gi|238705106|gb|EEP97537.1| Glutamate--cysteine ligase [Yersinia ruckeri ATCC 29473] Length = 519 Score = 35.7 bits (81), Expect = 6.5, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 34/99 (34%), Gaps = 12/99 (12%) Query: 99 DEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI-HQPRILSPRTTGYFI 157 + SQ L L +R+ ++++ T G P L T +I Sbjct: 3 PDVSQALTWLEAHPKA-----LKGIRRGIERETLRVTADGQLAATSHPESLGAALTHQWI 57 Query: 158 GKEAAIHLLNVRKNIYRPIDMDMKHW--WEHNIPSLVTE 194 + A LL P+D D+ H + H+I T Sbjct: 58 TTDFAEALL----EFITPVDGDIDHLLAFLHDIHRYTTR 92 >gi|84618267|emb|CAJ43770.1| CNPV186 protein [Avipoxvirus CVL] gi|84618272|emb|CAJ43774.1| CNPV186 protein [Avipoxvirus CVL] gi|84618277|emb|CAJ43789.1| CNVP186 protein [Avipoxvirus CVL] gi|84618282|emb|CAJ43780.1| CNPV186 protein [Avipoxvirus CVL] Length = 330 Score = 35.3 bits (80), Expect = 8.1, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 44/106 (41%), Gaps = 10/106 (9%) Query: 71 ISHIHLWKRIAYSPA---IGAIILEDD-----ADFSDEFSQLLPHLSKCDINNILIKFDA 122 +HI LWK + + I++ED+ F + ++ L K +I+ + + Sbjct: 104 ANHISLWKYVIDNSDKLSEYIIVIEDNNTITSEVFITKIDDIITVLKKNNIDFLQLVTSN 163 Query: 123 LRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNV 168 K + + + LP + +I++ + + Y + KE L + Sbjct: 164 NLLKDRTSNRIMILP-DLEIYK-GGIDVSLSAYIVSKETINKLYSY 207 >gi|148912951|ref|YP_001293265.1| hypothetical protein GTPV_gp070 [Goatpox virus Pellor] gi|194307596|gb|ACF42325.1| IMV envelope protein P32 [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 8.9, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|146231446|gb|ABQ12871.1| envelope protein [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 8.9, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|145308237|gb|ABP57382.1| envelope protein P32 [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 9.0, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQEDNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|55274609|gb|AAV49158.1| IMV envelope protein p35 [Goatpox virus] gi|145308233|gb|ABP57380.1| envelope protein P32 [Goatpox virus] gi|145308235|gb|ABP57381.1| envelope protein P32 [Goatpox virus] gi|146231440|gb|ABQ12868.1| envelope protein [Goatpox virus] gi|194307598|gb|ACF42326.1| IMV envelope protein P32 [Goatpox virus] gi|237690222|gb|ACR15802.1| envelope protein [Goatpox virus] gi|307448528|gb|ADN44124.1| envelope protein P32 [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 9.3, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|58531776|gb|AAW78648.1| P32 [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 9.5, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|145308231|gb|ABP57379.1| envelope protein P32 [Goatpox virus] Length = 322 Score = 35.3 bits (80), Expect = 9.7, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 99 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 158 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 159 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 193 >gi|146231448|gb|ABQ12872.1| envelope protein [Goatpox virus] gi|307448531|gb|ADN44125.1| envelope protein P32 [Goatpox virus] Length = 324 Score = 35.3 bits (80), Expect = 9.8, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 5/95 (5%) Query: 73 HIHLWK-----RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP 127 H LWK I S +++EDD D + + + N + + Sbjct: 101 HFSLWKSYADADIKNSENKFIVVIEDDNTLKDLITIYNIIIEMQEKNIDIFQLRETFHNS 160 Query: 128 KKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 NF + Y I +A Sbjct: 161 NSRILFNQENNNFMYSYTGGYDFTLSAYVIRLSSA 195 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.311 0.145 0.432 Lambda K H 0.267 0.0449 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,193,485,526 Number of Sequences: 14124377 Number of extensions: 217157080 Number of successful extensions: 671194 Number of sequences better than 10.0: 1392 Number of HSP's better than 10.0 without gapping: 1430 Number of HSP's successfully gapped in prelim test: 717 Number of HSP's that attempted gapping in prelim test: 666145 Number of HSP's gapped (non-prelim): 2470 length of query: 263 length of database: 4,842,793,630 effective HSP length: 136 effective length of query: 127 effective length of database: 2,921,878,358 effective search space: 371078551466 effective search space used: 371078551466 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 80 (35.3 bits)