BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001292
(1106 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356533887|ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 [Glycine max]
Length = 1044
Score = 1302 bits (3370), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 667/1033 (64%), Positives = 800/1033 (77%), Gaps = 31/1033 (3%)
Query: 87 RDRFRFKRNPNHTQDKTQTKPSLHRYLLRHRHVNSTPSAANAATSGPRFNRKGFSS-LFP 145
R F FKRNP+H + + S R L N+ ++ N RKG LFP
Sbjct: 28 RGGFPFKRNPSHHRHRG----SFDRQL---PRSNNNSNSNNNINRSHLHKRKGLLLWLFP 80
Query: 146 F-RGAYLLYFMIFLAVFAFAMASMVLQNSIASVFG--AERGRPIREELRFGSRLKFVPDQ 202
F + Y I VF FA+AS+V+Q+SI SVF AER IR +RFGS L+FVP +
Sbjct: 81 FPKSKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAERASYIRGGIRFGSALRFVPGK 140
Query: 203 VG----FGNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIY 258
+ G+GLD +RS PR GVR PRI LILG+M D +SL+L+TV++NLQKLGYVFKI+
Sbjct: 141 ISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVTVIRNLQKLGYVFKIF 200
Query: 259 AVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIP 318
AV G + S+WE I G IS L + LIDWSIF+GII DSLEAK AISS+MQ+PF S+P
Sbjct: 201 AVGHGKARSIWENIGGGISPLSAKHQGLIDWSIFEGIIVDSLEAKVAISSVMQDPFCSVP 260
Query: 319 LVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFV 378
L+WIIQEDSL++RLPVY + G+++++S+W+S FSR V+VFPD+T PMLYS LD GNFFV
Sbjct: 261 LIWIIQEDSLSSRLPVYEQMGWEHIVSHWRSAFSRAGVVVFPDFTYPMLYSELDTGNFFV 320
Query: 379 IPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVG 438
IPGSP DVWA E+YSK+H K QLR+ +GF K++++V+VVGSS FY+ LSWDYAVAMH VG
Sbjct: 321 IPGSPVDVWAAESYSKTHAKDQLRELSGFGKNDMLVLVVGSSVFYDNLSWDYAVAMHSVG 380
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
PLL KYARRN SFKFVFLCGNSTDGY+DALQ VASR+GL + S+RHYG NGDVN VL
Sbjct: 381 PLLTKYARRNGATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVL 440
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
LMADI+LYGS+Q QGFP L++RAMTF IPV+ PDF ++K+Y+ +G IFF K NPE L
Sbjct: 441 LMADIILYGSAQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEAL 500
Query: 559 SRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPI 618
AFSL +SNG+LSKFA+ +AS+GR AKN+LALDC+T YAR+LENVLNFPSDALLPGP+
Sbjct: 501 MNAFSLLLSNGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGPV 560
Query: 619 SQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEE-----FTKNITE 673
SQ+QQ SWEWNLFR EIDL D S+R S+V +E E ++ +I E
Sbjct: 561 SQIQQGSWEWNLFRNEIDLSKID--------GDFSNRKVSIVYAVEHELASLNYSTSIFE 612
Query: 674 NENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
N +D +++LDWD+L +IE SEE E E+E+ EER + WDDIYRNARKSE+
Sbjct: 613 NGTEVPLRDELTQLDWDILREIEISEENEMFEVEEAEERREKGVGVWDDIYRNARKSEKL 672
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
KFE NERDEGELERTGQPVCIYEIY+G+G WPFLHHGSLYRGL+LS A+R SDDVDAV
Sbjct: 673 KFEVNERDEGELERTGQPVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQSSDDVDAV 732
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
RL LLN T+YRDILCE+GGMF+IAN+VDNIH+RPWIGFQSWRAAGRKV+LS AEKVLE
Sbjct: 733 GRLPLLNDTYYRDILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLE 792
Query: 854 ETVQET-EGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
ET+QE GDV+YFW DMD N+N +FW MCDILNGG+CR F++ FRQMY L
Sbjct: 793 ETMQENFRGDVIYFWGRFDMDQSVIGNHN-ANSFWYMCDILNGGNCRIVFQEGFRQMYAL 851
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLS 972
P H EALPPMPED G WSALH WVM TPSFLEFIMFSRMFVDS+DAL+ +S+K + CLL
Sbjct: 852 PPHAEALPPMPED-GYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLG 910
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
SSE+EKKHCYCRVLELL+NVWAYHS RKMVY++P +GS++EQHPIE+R+GFMW KYFN +
Sbjct: 911 SSEIEKKHCYCRVLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNIS 970
Query: 1033 LLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMF 1092
LLKSMDEDLAEAADDGD+PRE WLWP TGEVHW+GIYEREREERYR KMDKKRK KEK+F
Sbjct: 971 LLKSMDEDLAEAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLF 1030
Query: 1093 DRLTKGYRQKTLG 1105
+R+ GY+QK+LG
Sbjct: 1031 ERMKYGYKQKSLG 1043
>gi|356574667|ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790929 [Glycine max]
Length = 1045
Score = 1298 bits (3359), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 662/1035 (63%), Positives = 800/1035 (77%), Gaps = 28/1035 (2%)
Query: 85 SIRDRFRFKRNPNHTQDKTQTKPSLHRYLLRHRHVNSTPSAANAATSGPRFNRKGFSS-L 143
+IR F FKRNP H + + L R +S + + + RKG L
Sbjct: 24 AIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSSNNNNISIRSH--LHKRKGLLLWL 81
Query: 144 FPF-RGAYLLYFMIFLAVFAFAMASMVLQNSIASVF--GAERGRPIREELRFGSRLKFVP 200
FPF + Y I + VF FA+ASMVLQ+SI SVF A+ R I +RFGS L+FVP
Sbjct: 82 FPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFRQSADSARYISGGIRFGSALRFVP 141
Query: 201 DQVG----FGNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFK 256
++ G+GLD +RS PR GVR PRI LILG+M D +SL+L+TV+ NLQKLGYVFK
Sbjct: 142 GRISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVTVIWNLQKLGYVFK 201
Query: 257 IYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHS 316
I+AV G + S+WE I G+I L E LIDWSIF+GII DSLEAK AISS+MQEPF S
Sbjct: 202 IFAVGHGKARSIWENIGGRICPLSTEHQGLIDWSIFEGIIVDSLEAKVAISSVMQEPFCS 261
Query: 317 IPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNF 376
+PL+WIIQEDSL++RLPVY + G+++++S+W+S FSR +V+VFPD+T PMLYS LD GNF
Sbjct: 262 VPLIWIIQEDSLSSRLPVYEQMGWEHIVSHWRSAFSRASVVVFPDFTYPMLYSELDTGNF 321
Query: 377 FVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHD 436
FVIPGSP DVWA E+Y K+H K QLR+ +GF K++++V+VVGSS F+++LSWDYAVAMH
Sbjct: 322 FVIPGSPVDVWAAESYHKTHAKEQLRELSGFGKNDMLVLVVGSSVFFDDLSWDYAVAMHS 381
Query: 437 VGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNG 496
VGPLL +YARRN SFKFVFLCGNSTDGY+DALQ VASR+GL + S+RHYG NGDVN
Sbjct: 382 VGPLLTRYARRNDATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNS 441
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
VLLMADI+LYGS+Q QGFP L++RAMTF IPV+ PDF ++K+Y+ +G IFF K NPE
Sbjct: 442 VLLMADIILYGSAQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPE 501
Query: 557 GLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPG 616
L AFSL +SNG+LSKFA+ +AS+GR AKN+LALDC+T YAR+LENVLNFPSDALLPG
Sbjct: 502 ALMNAFSLLLSNGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPG 561
Query: 617 PISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEE-----FTKNI 671
+SQ+QQ SWEWNLF+ EIDL D S+R S+V +E E ++ +I
Sbjct: 562 AVSQIQQGSWEWNLFQNEIDLSKID-----------SNRKVSIVYAVEHELASLNYSTSI 610
Query: 672 TENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSE 731
EN QD +++LD D L +IE SEE E E+E+ EERM+ + WDDIYRNARKSE
Sbjct: 611 VENGTEVPLQDELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSE 670
Query: 732 RFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVD 791
+ KFE NERDEGELERTGQ VCIYEIY+G+G WPFLHHGSLYRGL+LS A+R SDDVD
Sbjct: 671 KLKFEVNERDEGELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVD 730
Query: 792 AVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKV 851
AV RL LLN T+YRDILCE+GGMF+IAN+VD+IH+RPWIGFQSWRAAGRKV+LS AE V
Sbjct: 731 AVGRLPLLNDTYYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENV 790
Query: 852 LEETVQET-EGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMY 910
LEET+QE GDV+YFW LDMD RN+N ++FW MCDILNGG+CR F+D FRQMY
Sbjct: 791 LEETMQENFRGDVIYFWGRLDMDQSAIRNHN-AISFWYMCDILNGGNCRIVFQDGFRQMY 849
Query: 911 GLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCL 970
LP H EALPPMPEDGG WSALH WVM T SFLEFIMFSRMFVDS+DA + +S+K + CL
Sbjct: 850 ALPPHAEALPPMPEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCL 909
Query: 971 LSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFN 1030
L SSE+EKKHCYCR+LELL+NVWAYHS RKMVY++P +GS++EQHPIE+R+GFMW KYFN
Sbjct: 910 LGSSEIEKKHCYCRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFN 969
Query: 1031 FTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEK 1090
F+LLKSMDEDLAEAADDGD+PRE WLWP TGEVHW+GIYEREREERYR KMDKKRK KEK
Sbjct: 970 FSLLKSMDEDLAEAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEK 1029
Query: 1091 MFDRLTKGYRQKTLG 1105
+F+R+ GY+QK+LG
Sbjct: 1030 LFERMKYGYKQKSLG 1044
>gi|18414574|ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80 [Arabidopsis thaliana]
gi|24111433|gb|AAN46867.1| At5g04480/T32M21_80 [Arabidopsis thaliana]
gi|332003367|gb|AED90750.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
Length = 1050
Score = 1285 bits (3324), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 651/1063 (61%), Positives = 804/1063 (75%), Gaps = 42/1063 (3%)
Query: 63 DDNAAAA------ATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQDKTQT---KPSLHRYL 113
DDN A A +GN D H SIRDR R KRN + +D++ + +PSL
Sbjct: 11 DDNGGAGRDGNHNANNVAGNGDTSFH--SIRDRLRLKRNSSDRRDRSHSGLDRPSLR--- 65
Query: 114 LRHRHVNSTPSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNS 173
R H+ G NRKG SL RG LLYF++ V AF M+S++LQNS
Sbjct: 66 TRPHHI------------GRSLNRKGLLSLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNS 113
Query: 174 IASVFGAERGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRSTPRFGVRPPRIGLIL 229
I + G +G +R ++ GS LK+VP + G GLD LRS R GVRPPR+ L+L
Sbjct: 114 I-TWQGNVKGGQVRSQIGLGSTLKYVPGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVL 172
Query: 230 GNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDW 289
GNM KD R+L+L+TV+KNLQKLGYVFK++AV +G + SLWEQ+AG + +L EQ DW
Sbjct: 173 GNMKKDPRTLMLVTVMKNLQKLGYVFKVFAVENGEARSLWEQLAGHVKVLVSEQLGHADW 232
Query: 290 SIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKS 349
+IF+G+IADSLEAKEAISSLMQEPF S+PL+WI+ ED LANRLPVY G +L+S+W+S
Sbjct: 233 TIFEGVIADSLEAKEAISSLMQEPFRSVPLIWIVHEDILANRLPVYQRMGQNSLISHWRS 292
Query: 350 VFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLK 409
F+R +V+VFP +TLPML+SVLD GNF VIP S DVWA E+YS++H K LR+ N F +
Sbjct: 293 AFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVDVWAAESYSETHTKQNLREINEFGE 352
Query: 410 DEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYND 469
D+++++V+GSSFFY+E SWD AVAMH +GPLL +Y RR GSFKFVFL GNST G +D
Sbjct: 353 DDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSD 412
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
A+QEVASRLGL E +VRH+G N DVN VL MADI++Y SSQ EQ FP LIVRAM+FGIP+
Sbjct: 413 AVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYASSQEEQNFPPLIVRAMSFGIPI 472
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
ITPDFPI+K+Y+A+ IFF++++P+ L +AFS IS+G+LSKFA+T+AS+GRL KN+
Sbjct: 473 ITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNL 532
Query: 590 LALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWG 649
+A +C+T YAR+LEN+L+FPSD LPG ISQLQ +WEWN FR E++ IL+
Sbjct: 533 MATECITGYARLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD----S 588
Query: 650 TSTSSRNSSVVDLLEEEF-----TKNITENENRSADQDTISELDWDVLHDIESSEEYERL 704
S +V +EE+F + N +N + S+LDWDVL +IE +EEYE++
Sbjct: 589 AYAFIGKSGIVFQVEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKV 648
Query: 705 EMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAW 764
E E+LE+RM+ W++IYRNARKSE+ KFE NERDEGELERTG+P+CIYEIY+G+GAW
Sbjct: 649 ESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAW 708
Query: 765 PFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNI 824
PFLHHGSLYRGL+LSS RRL SDDVDA RL LLN T+YRDILCEIGGMFS+ANKVD+I
Sbjct: 709 PFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSI 768
Query: 825 HKRPWIGFQSWRAAGRKVSLSISAEKVLEETV-QETEGDVMYFWAHLDMDGGFTRNNNDV 883
H RPWIGFQSWRAAGRKVSLS AE+ LE + QET+G+++YFW LD+DG + N
Sbjct: 769 HMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKN-A 827
Query: 884 LTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFL 943
LTFWSMCDILN G+CRT FEDAFR MYGLP H+EALPPMPEDG WS+LH WVM TPSFL
Sbjct: 828 LTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFL 887
Query: 944 EFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVY 1003
EF+MFSRMF +SLDAL+ N + SC L+SS LE+KHCYCRVLELLVNVWAYHSGRKMVY
Sbjct: 888 EFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVY 947
Query: 1004 LDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEV 1063
++P GSL+EQHP+++R+G MW KYFNFTLLKSMDEDLAEAADD D+PRE+WLWP TGEV
Sbjct: 948 INPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEV 1007
Query: 1064 HWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLGG 1106
HWKG+YEREREERYR KMDKKRK KEK++DR+ GY+QK+LGG
Sbjct: 1008 HWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1050
>gi|224113167|ref|XP_002316413.1| predicted protein [Populus trichocarpa]
gi|222865453|gb|EEF02584.1| predicted protein [Populus trichocarpa]
Length = 1073
Score = 1279 bits (3310), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 664/1102 (60%), Positives = 798/1102 (72%), Gaps = 105/1102 (9%)
Query: 68 AAATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQDKTQTKPSLHRYLLRHRHVNSTPSAAN 127
+ AT T Q+ HSI DRF FKRNPN + + LR H + S
Sbjct: 14 SPATNTGSEGVSDQNFHSISDRFLFKRNPNPSTNSPHKSSKSPPDRLRRWHHYTNKSN-- 71
Query: 128 AATSGPRFNRKG-FSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERG--- 183
NRKG + S PFRG L YF+IFLAVFAF +AS++LQ+SI + +G
Sbjct: 72 --------NRKGGWFSCIPFRGICLFYFVIFLAVFAFVLASILLQSSITGMVVFSKGWID 123
Query: 184 --RPIREELRFGSRLKFVPDQ----VGFGNGLDGLRSTP-RFGVRPPRIGLI-------- 228
R IRE L+ G+ LKFVP + G+GLD R R G+RPPR+ +I
Sbjct: 124 HRRSIREGLKSGTTLKFVPGLRSRLLLEGHGLDHARVLANRVGLRPPRLAVIVKGLGVSL 183
Query: 229 ---------------------------------------LGNMAKDSRSLLLITVVKNLQ 249
LGNM KD +SL+L++V+KNL+
Sbjct: 184 NGHNGNWEIVCWLHPMAMHGPPSMTNMIKLKYGGGGRKILGNMKKDPQSLMLLSVMKNLR 243
Query: 250 KLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSL 309
KLGY KIYA+ +G + ++WE I GQIS+L +QY LIDWSIF+G++ DSLEAKE +SSL
Sbjct: 244 KLGYALKIYALGNGETRTMWEDIGGQISVLRPKQYDLIDWSIFEGVMVDSLEAKEVVSSL 303
Query: 310 MQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYS 369
QEPF SIPLVWIIQED+LANRLP+Y + Q+L+S+W+S F+R NV+VFPD+ LPMLYS
Sbjct: 304 SQEPFQSIPLVWIIQEDTLANRLPLYQDMNLQHLVSHWRSTFNRANVVVFPDFALPMLYS 363
Query: 370 VLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWD 429
VLD GNFFVIPGSP DVW E+YSK+H K+QLR ++GF +D++VV+VVGSSFFY+ELSWD
Sbjct: 364 VLDTGNFFVIPGSPVDVWDAESYSKTHAKHQLRVDHGFSEDDLVVLVVGSSFFYDELSWD 423
Query: 430 YAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYG 489
Y VA+H +GP+L +YAR EGSFKFVFLCGNSTD +DA QE+ SR+GL SVRHYG
Sbjct: 424 YTVALHTLGPVLAEYARSKDAEGSFKFVFLCGNSTD--DDAFQEIVSRVGLHPSSVRHYG 481
Query: 490 FNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIF 549
NGD N VLL ADIVLYGSSQ EQGFP +++RAMTFGIPVI PD P +K+YV++ A IF
Sbjct: 482 LNGDANSVLLAADIVLYGSSQDEQGFPPVLIRAMTFGIPVIAPDIPTMKKYVSDEAHGIF 541
Query: 550 FQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFP 609
F K NPE L+RAFSL ISNGKLSKFA TVA +GRL AKNMLA +C+T YAR+LEN+L+FP
Sbjct: 542 FSKYNPEALTRAFSLLISNGKLSKFAETVAFSGRLLAKNMLASECITGYARLLENMLSFP 601
Query: 610 SDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTK 669
SD LLPGP+S+L+Q WEWNLF KE++ T D+ M E + SSR +S+V LE+E++
Sbjct: 602 SDTLLPGPVSKLEQREWEWNLFNKELEQETDDLSGMYE--SLFSSRETSIVYSLEKEWSN 659
Query: 670 -----NITENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIY 724
I+EN DT +E DWDVL +IES EE+ER+ E+LEERMD T WDDIY
Sbjct: 660 LVNSTIISENGTEILVPDTPTESDWDVLMEIESFEEHERVVKEELEERMDKTRGLWDDIY 719
Query: 725 RNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARR 784
R+ARKSE+ KFE+NERDEGELERTGQP S+ ARR
Sbjct: 720 RSARKSEKLKFESNERDEGELERTGQP---------------------------STKARR 752
Query: 785 LRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSL 844
RSDDVDAV+RL LLN ++Y++ILCEIGGMFSIA +VD IHKRPWIGFQSW AAGRKVSL
Sbjct: 753 SRSDDVDAVARLPLLNESYYQNILCEIGGMFSIAIRVDAIHKRPWIGFQSWHAAGRKVSL 812
Query: 845 SISAEKVLEETVQETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFED 904
S AEKVLEE QE DVMYFWA L MDGG T +N + LTFWSMCD+LNGG CRTAFED
Sbjct: 813 SFKAEKVLEEKTQEENKDVMYFWARLGMDGGVTGSNEE-LTFWSMCDVLNGGRCRTAFED 871
Query: 905 AFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSS 964
AFRQMY LPS++EALPPMPEDGG WSALH WVM TPSFLEFIMFSRMFVDSLDAL +NSS
Sbjct: 872 AFRQMYDLPSYLEALPPMPEDGGHWSALHSWVMPTPSFLEFIMFSRMFVDSLDALQSNSS 931
Query: 965 KVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFM 1024
+VN CLLSS+ELE+KHCYCR++E+LVNVWAYHS R+MVY+DP +GS++EQHPI++R+
Sbjct: 932 QVNKCLLSSTELEEKHCYCRIMEVLVNVWAYHSARRMVYIDPHTGSVEEQHPIKQRKEIA 991
Query: 1025 WMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKK 1084
W KYFN T+LKSMDEDLAEAADDGD+PRE+WLWP TGEVHW+GIYEREREERYR KMDKK
Sbjct: 992 WKKYFNLTVLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYRIKMDKK 1051
Query: 1085 RKMKEKMFDRLTKGYRQKTLGG 1106
RK +EK+ +RL GY+QK LGG
Sbjct: 1052 RKTREKLVERLKAGYKQKPLGG 1073
>gi|297810537|ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arabidopsis lyrata subsp.
lyrata]
gi|297318989|gb|EFH49411.1| hypothetical protein ARALYDRAFT_487229 [Arabidopsis lyrata subsp.
lyrata]
Length = 1051
Score = 1274 bits (3297), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/1036 (61%), Positives = 795/1036 (76%), Gaps = 34/1036 (3%)
Query: 84 HSIRDRFRFKRNPNHTQDKTQTKPSLHRYLLRHR--HVNSTPSAANAATSGPRFNRKGFS 141
HSIRDR R KRN + +D++ + L R LR+R H+ + NRKG
Sbjct: 37 HSIRDRLRLKRNSSDRRDRSHS--GLDRPSLRNRPHHIARS------------LNRKGLI 82
Query: 142 SLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERGRPIREELRFGSRLKFVPD 201
SL RG LLYF++ V AF M+S++LQNSI +RG+ +R ++ GS LK+VP
Sbjct: 83 SLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNSITWQGNVKRGQ-VRSQIGLGSTLKYVPG 141
Query: 202 QVGF----GNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKI 257
+ G GLD LRST R GVRPPR+ L+LGNM KD R+L+L+TV+KNLQKLGYVFK+
Sbjct: 142 GIARTLIEGEGLDPLRSTVRIGVRPPRLALVLGNMKKDPRTLMLVTVMKNLQKLGYVFKV 201
Query: 258 YAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSI 317
+AV +G + SLWE +AG + +L EQ DW+IF+G+IADSLEAKEAISSLMQEPF S+
Sbjct: 202 FAVENGEARSLWEHLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFRSV 261
Query: 318 PLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFF 377
PL+WI+ ED LAN+LPVY G +L+S+W+S F+R +V+VFP +TLPML+S+LD GNF
Sbjct: 262 PLIWIVHEDILANQLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSILDDGNFV 321
Query: 378 VIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDV 437
VIP S D+WA E+YS++H K +LR+ N F +++++++V+GSSFFYNE SWD AVAMH +
Sbjct: 322 VIPESVVDIWAAESYSETHTKQKLREINEFGEEDVIILVLGSSFFYNEFSWDNAVAMHML 381
Query: 438 GPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGV 497
GPLL +Y RR SFKFVFL GNST G +DA+QEVA+RLGL E +VRH+G N DVN V
Sbjct: 382 GPLLTRYGRRKDTSNSFKFVFLYGNSTKGQSDAVQEVAARLGLTEGTVRHFGLNEDVNKV 441
Query: 498 LLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG 557
L MADI++Y SSQ EQ FP LIVRAM+FGIP+ITPDFP++K+Y+A+ IFF++++P+
Sbjct: 442 LRMADILVYASSQEEQNFPPLIVRAMSFGIPIITPDFPVMKKYLADEVHGIFFRRNDPDA 501
Query: 558 LSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGP 617
L +AFS IS+G+LS+FA+T+AS+GRL KN++A +C+T YAR+LEN+L+FPSD LPG
Sbjct: 502 LLKAFSPLISDGRLSEFAQTIASSGRLLTKNLMATECITGYARLLENILHFPSDTFLPGS 561
Query: 618 ISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEF-----TKNIT 672
ISQLQ SWEW+ FR E++ IL+ S S +V +EE++ + N
Sbjct: 562 ISQLQGASWEWSFFRSELEQPKSFILD----SAYASIGKSGIVFQVEEKYMGVIESTNPV 617
Query: 673 ENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSER 732
+N + S+LDWDVL +IE +EEYE +E E+LE+RM+ W++IYRNARKSE+
Sbjct: 618 DNSTLFVSDELPSKLDWDVLEEIEGAEEYENVESEELEDRMERDVEDWEEIYRNARKSEK 677
Query: 733 FKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDA 792
KFE NERDEGELERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+LSS RRL SDDVDA
Sbjct: 678 LKFEVNERDEGELERTGQPVCIYEIYDGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDA 737
Query: 793 VSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVL 852
RL LLN T+YRDILCEIGGMFS+ANKVD+IH RPWIGFQSWRAAGRKVSLS AE+ L
Sbjct: 738 ADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESL 797
Query: 853 EETV-QETEGDVMYFWAHLDMDG-GFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMY 910
E + QET+G+++YFW LD+DG + R N LTFWSMCDILN G+CRT FEDAFR +Y
Sbjct: 798 ENIIKQETKGEIIYFWTRLDIDGDAYGRKN--ALTFWSMCDILNQGNCRTTFEDAFRHIY 855
Query: 911 GLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCL 970
GLP H+EALPPMPEDG WS+LH WVM TPSFLEF+MFSRMF +SLDAL+ N + SC
Sbjct: 856 GLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCS 915
Query: 971 LSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFN 1030
L+SS LE+KHCYCRVLELLVNVWAYHSGRKMVY++P GSL+EQHP+ +R+G MW KYFN
Sbjct: 916 LASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLLQRKGLMWAKYFN 975
Query: 1031 FTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEK 1090
FTLLKSMDEDLAEAADD D+PRE+WLWP TGEVHWKG+YEREREERYR KMDKKRK KEK
Sbjct: 976 FTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEK 1035
Query: 1091 MFDRLTKGYRQKTLGG 1106
++DR+ GY+QK+LGG
Sbjct: 1036 LYDRIKNGYKQKSLGG 1051
>gi|225448875|ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254795 [Vitis vinifera]
Length = 1028
Score = 1263 bits (3268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 649/1002 (64%), Positives = 772/1002 (77%), Gaps = 37/1002 (3%)
Query: 123 PSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAER 182
P A + R F +++ F + + ++F+ MV+Q I E+
Sbjct: 44 PQGIRVARAHRGARRGSFCTMWRFLSGSIAFSVLFIVFL------MVMQTKIRV---PEQ 94
Query: 183 GRPIREELRFGSRLKF----VPDQVGFGNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRS 238
G + ++ G LKF + + G NGLD LRS R GVR P + LILGNM K+ S
Sbjct: 95 GWSFLDGIKSGKSLKFGQGSLLRRFGQRNGLDHLRSEMRIGVRRPTLALILGNMKKNPPS 154
Query: 239 LLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIAD 298
L+L TV+KNLQ LGY+FKIYAV NS S+WEQ+ GQISIL E YS DW+ F+GII D
Sbjct: 155 LMLFTVIKNLQGLGYLFKIYAVHDDNSRSIWEQLGGQISILSPEIYSHNDWTTFEGIIVD 214
Query: 299 SLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIV 358
SLEAKEAI SLMQEPF IPL+WIIQED+LA RLP Y + G+++L+SYW+S FSR +V+V
Sbjct: 215 SLEAKEAILSLMQEPFCFIPLIWIIQEDTLAKRLPFYEKLGWEHLVSYWRSAFSRADVVV 274
Query: 359 FPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVG 418
FPD++LPMLYSVLD GNFFVIP SP DVWA E+YSK+H KYQLR++ GF KD+++V+VVG
Sbjct: 275 FPDFSLPMLYSVLDTGNFFVIPASPVDVWAAESYSKTHSKYQLREDLGFNKDDMLVLVVG 334
Query: 419 SSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRL 478
SSFFY+ELSWDYAVAM+D+GPLL KYAR + F+FVFLCGNSTDGYND L+EVAS L
Sbjct: 335 SSFFYDELSWDYAVAMNDIGPLLSKYARSKNAGAMFRFVFLCGNSTDGYNDHLKEVASHL 394
Query: 479 GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
LL SVR YG N DVNG++LMAD+V+Y SSQVEQGFP L+ RAM+FGIPVI PD P I+
Sbjct: 395 KLLPGSVRQYGMNSDVNGLILMADVVIYASSQVEQGFPPLLTRAMSFGIPVIAPDLPDIR 454
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRY 598
+YV +G V+ F K+NP+ L RAFSL ISNGKLSKFA+ VA +GRL AKNMLA +CV Y
Sbjct: 455 KYVVDGVHVVIFPKNNPDALMRAFSLLISNGKLSKFAKAVALSGRLLAKNMLASECVNSY 514
Query: 599 ARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSS 658
A++LENVL+FPSD LLPG ISQ Q +WEWN FR T D+ ++ S S R SS
Sbjct: 515 AKLLENVLSFPSDVLLPGHISQSQHDAWEWNSFR------TADMPLIE--NGSASMRKSS 566
Query: 659 VVDLLEEEFTKNITEN--ENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGT 716
VVD+LEE + + N + D +++LDWDVL +IES EE ERLEME+LEERM+
Sbjct: 567 VVDVLEETLSNQLDSGNISNSETENDVLTQLDWDVLREIESIEEMERLEMEELEERMEKN 626
Query: 717 FASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGL 776
WD+IYRNARK ER KFE NERDEGELERTGQP+CIYEIY+G+GAWPFLHHGS+YRGL
Sbjct: 627 PGIWDEIYRNARKVERVKFETNERDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGL 686
Query: 777 ALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWR 836
+L+++ARRLRSDDVDAV RL +LN T+YRDI C+IGGMFSIA +VD IHKRPWIGFQSW
Sbjct: 687 SLTTSARRLRSDDVDAVDRLPVLNDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWH 746
Query: 837 AAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNG 895
A G KVSLS AEKVLEET+Q ET+GDV+YFWAHL++D G T+ N + TFWSMCDILNG
Sbjct: 747 AVGSKVSLSSRAEKVLEETIQEETKGDVLYFWAHLNVDDGPTQKNR-IPTFWSMCDILNG 805
Query: 896 GHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDS 955
G+CRTAFEDAFRQMY +PS++EALPPMPEDGG WSALH WVM TPSFLEFIMFSRMF DS
Sbjct: 806 GNCRTAFEDAFRQMYAMPSYIEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADS 865
Query: 956 LDALN------------ANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVY 1003
LDAL+ NSS+ CLL SS+LEKKHCYCRVLELLVNVWAYHS RKMVY
Sbjct: 866 LDALHMNSRQSMNLSQSMNSSQPTVCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVY 925
Query: 1004 LDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEV 1063
++P SG L+EQHP+E+RRGFMW KYFN TLLKSMDEDLAEAADDGD+PRE+WLWP TGEV
Sbjct: 926 INPYSGQLEEQHPVEQRRGFMWAKYFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEV 985
Query: 1064 HWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
HW+GIYEREREERYR KMDKKRK KEK+ +R+ GY+QK +G
Sbjct: 986 HWQGIYEREREERYRSKMDKKRKAKEKLVERMKHGYKQKPIG 1027
>gi|147820655|emb|CAN65363.1| hypothetical protein VITISV_036074 [Vitis vinifera]
Length = 1037
Score = 1252 bits (3240), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 649/1011 (64%), Positives = 772/1011 (76%), Gaps = 46/1011 (4%)
Query: 123 PSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAER 182
P A + R F +++ F + + ++F+ MV+Q I E+
Sbjct: 44 PQGIRVARAHRGARRGSFCTMWRFLSGSIAFSVLFIVFL------MVMQTKIRV---PEQ 94
Query: 183 GRPIREELRFGSRLKF----VPDQVGFGNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRS 238
G + ++ G LKF + + G NGLD LRS R GVR P + LILGNM K+ S
Sbjct: 95 GWSFLDGIKSGKSLKFGQGSLLRRFGQRNGLDHLRSEMRIGVRRPXLALILGNMKKNPPS 154
Query: 239 LLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIAD 298
L+L TV+KNLQ LGY+FKIYAV NS S+WEQ+ GQISIL E YS DW+ F+GII D
Sbjct: 155 LMLFTVIKNLQGLGYLFKIYAVXDDNSRSIWEQLGGQISILSPEIYSHNDWTTFEGIIVD 214
Query: 299 SLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIV 358
SLEAKEAI SLMQEPF IPL+WIIQED+LA RLP Y + G+++L+SYW+S FSR +V+V
Sbjct: 215 SLEAKEAILSLMQEPFCFIPLIWIIQEDTLAKRLPFYEKLGWEHLVSYWRSAFSRADVVV 274
Query: 359 FPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVG 418
FPD++LPMLYSVLD GNFFVIP SP DVWA E+YSK+H KYQLR++ GF KD+++V+VVG
Sbjct: 275 FPDFSLPMLYSVLDTGNFFVIPASPVDVWAAESYSKTHSKYQLREDLGFNKDDMLVLVVG 334
Query: 419 SSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTD---------GYND 469
SSFFY+ELSWDYAVAM+D+GPLL KYAR + +FVFLCGNSTD GYND
Sbjct: 335 SSFFYDELSWDYAVAMNDIGPLLSKYARSKNAGAMXRFVFLCGNSTDGYNDHLKVYGYND 394
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
L+EVAS L LL SVR YG N DVNG++LMAD+V+Y SSQVEQGFP L+ RAM+FGIPV
Sbjct: 395 HLKEVASHLKLLPGSVRQYGMNSDVNGLMLMADVVIYASSQVEQGFPPLLTRAMSFGIPV 454
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
I PD P I++YV +G V+ F K+NP+ L RAFSL ISNGKLSKFA+ VA +GRL AKNM
Sbjct: 455 IAPDLPDIRKYVVDGVHVVIFPKNNPDALMRAFSLLISNGKLSKFAKAVALSGRLLAKNM 514
Query: 590 LALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWG 649
LA +CV YA++LENVL+FPSD LLPG ISQ Q +WEWN FR T D+ ++
Sbjct: 515 LASECVNSYAKLLENVLSFPSDVLLPGHISQSQHDAWEWNSFR------TADMPLIE--N 566
Query: 650 TSTSSRNSSVVDLLEEEFTKNITEN--ENRSADQDTISELDWDVLHDIESSEEYERLEME 707
S S R SSVVD+LEE + + N + D +++LDWDVL +IES EE ERLEME
Sbjct: 567 GSASMRKSSVVDVLEETLSNQLDSGNISNSETENDVLTQLDWDVLREIESIEEMERLEME 626
Query: 708 QLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFL 767
+LEERM+ WD+IYRNARK ER KFEANERDEGELERTGQP+CIYEIY+G+GAWPFL
Sbjct: 627 ELEERMEKNPGIWDEIYRNARKVERVKFEANERDEGELERTGQPLCIYEIYNGAGAWPFL 686
Query: 768 HHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKR 827
HHGS+YRGL+L+++ARRLRSDDVDAV RL +LN T+YRDI C+IGGMFSIA +VD IHKR
Sbjct: 687 HHGSMYRGLSLTTSARRLRSDDVDAVDRLPVLNDTYYRDIFCDIGGMFSIAFRVDKIHKR 746
Query: 828 PWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTF 886
PWIGFQSW A G KVSLS AEKVLEET+Q ET+GDV+YFWAHL++D G T+ N + TF
Sbjct: 747 PWIGFQSWHAVGSKVSLSSRAEKVLEETIQEETKGDVLYFWAHLNVDDGPTQKNR-IPTF 805
Query: 887 WSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFI 946
WSMCDILNGG+CRTAFEDAFRQMY +PS++EALPPMPEDGG WSALH WVM TPSFLEFI
Sbjct: 806 WSMCDILNGGNCRTAFEDAFRQMYAMPSYIEALPPMPEDGGYWSALHSWVMPTPSFLEFI 865
Query: 947 MFSRMFVDSLDALN------------ANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWA 994
MFSRMF DSLDAL+ NSS+ CLL SS+LEKKHCYCRVLELLVNVWA
Sbjct: 866 MFSRMFADSLDALHMNSRQSMNLSQSMNSSQPTVCLLGSSKLEKKHCYCRVLELLVNVWA 925
Query: 995 YHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREK 1054
YHS RKMVY++P SG L+EQHP+E+RRGFMW KYFN TLLKSMDEDLAEAADDGD+PRE+
Sbjct: 926 YHSARKMVYINPYSGQLEEQHPVEQRRGFMWAKYFNSTLLKSMDEDLAEAADDGDHPRER 985
Query: 1055 WLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
WLWP TGEVHW+GIYEREREERYR KMDKKRK KEK+ +R+ GY+QK +G
Sbjct: 986 WLWPLTGEVHWQGIYEREREERYRSKMDKKRKAKEKLVERMKHGYKQKPIG 1036
>gi|334187426|ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
gi|332003368|gb|AED90751.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
Length = 1035
Score = 1247 bits (3226), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/1063 (60%), Positives = 789/1063 (74%), Gaps = 57/1063 (5%)
Query: 63 DDNAAAA------ATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQDKTQT---KPSLHRYL 113
DDN A A +GN D H SIRDR R KRN + +D++ + +PSL
Sbjct: 11 DDNGGAGRDGNHNANNVAGNGDTSFH--SIRDRLRLKRNSSDRRDRSHSGLDRPSLR--- 65
Query: 114 LRHRHVNSTPSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNS 173
R H+ G NRKG SL RG LLYF++ V AF M+S++LQNS
Sbjct: 66 TRPHHI------------GRSLNRKGLLSLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNS 113
Query: 174 IASVFGAERGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRSTPRFGVRPPRIGLIL 229
I + G +G +R ++ GS LK+VP + G GLD LRS R GVRPPR+ L+L
Sbjct: 114 I-TWQGNVKGGQVRSQIGLGSTLKYVPGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVL 172
Query: 230 GNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDW 289
GNM KD R+L+L+ +AV +G + SLWEQ+AG + +L EQ DW
Sbjct: 173 GNMKKDPRTLMLV---------------FAVENGEARSLWEQLAGHVKVLVSEQLGHADW 217
Query: 290 SIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKS 349
+IF+G+IADSLEAKEAISSLMQEPF S+PL+WI+ ED LANRLPVY G +L+S+W+S
Sbjct: 218 TIFEGVIADSLEAKEAISSLMQEPFRSVPLIWIVHEDILANRLPVYQRMGQNSLISHWRS 277
Query: 350 VFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLK 409
F+R +V+VFP +TLPML+SVLD GNF VIP S DVWA E+YS++H K LR+ N F +
Sbjct: 278 AFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVDVWAAESYSETHTKQNLREINEFGE 337
Query: 410 DEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYND 469
D+++++V+GSSFFY+E SWD AVAMH +GPLL +Y RR GSFKFVFL GNST G +D
Sbjct: 338 DDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSD 397
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
A+QEVASRLGL E +VRH+G N DVN VL MADI++Y SSQ EQ FP LIVRAM+FGIP+
Sbjct: 398 AVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYASSQEEQNFPPLIVRAMSFGIPI 457
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
ITPDFPI+K+Y+A+ IFF++++P+ L +AFS IS+G+LSKFA+T+AS+GRL KN+
Sbjct: 458 ITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNL 517
Query: 590 LALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWG 649
+A +C+T YAR+LEN+L+FPSD LPG ISQLQ +WEWN FR E++ IL+
Sbjct: 518 MATECITGYARLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD----S 573
Query: 650 TSTSSRNSSVVDLLEEEF-----TKNITENENRSADQDTISELDWDVLHDIESSEEYERL 704
S +V +EE+F + N +N + S+LDWDVL +IE +EEYE++
Sbjct: 574 AYAFIGKSGIVFQVEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKV 633
Query: 705 EMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAW 764
E E+LE+RM+ W++IYRNARKSE+ KFE NERDEGELERTG+P+CIYEIY+G+GAW
Sbjct: 634 ESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAW 693
Query: 765 PFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNI 824
PFLHHGSLYRGL+LSS RRL SDDVDA RL LLN T+YRDILCEIGGMFS+ANKVD+I
Sbjct: 694 PFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSI 753
Query: 825 HKRPWIGFQSWRAAGRKVSLSISAEKVLEETV-QETEGDVMYFWAHLDMDGGFTRNNNDV 883
H RPWIGFQSWRAAGRKVSLS AE+ LE + QET+G+++YFW LD+DG + N
Sbjct: 754 HMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKN-A 812
Query: 884 LTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFL 943
LTFWSMCDILN G+CRT FEDAFR MYGLP H+EALPPMPEDG WS+LH WVM TPSFL
Sbjct: 813 LTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFL 872
Query: 944 EFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVY 1003
EF+MFSRMF +SLDAL+ N + SC L+SS LE+KHCYCRVLELLVNVWAYHSGRKMVY
Sbjct: 873 EFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVY 932
Query: 1004 LDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEV 1063
++P GSL+EQHP+++R+G MW KYFNFTLLKSMDEDLAEAADD D+PRE+WLWP TGEV
Sbjct: 933 INPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEV 992
Query: 1064 HWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLGG 1106
HWKG+YEREREERYR KMDKKRK KEK++DR+ GY+QK+LGG
Sbjct: 993 HWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1035
>gi|449441828|ref|XP_004138684.1| PREDICTED: uncharacterized protein LOC101206364 [Cucumis sativus]
Length = 1034
Score = 1243 bits (3217), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 642/1056 (60%), Positives = 799/1056 (75%), Gaps = 43/1056 (4%)
Query: 63 DDNAAAAATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQDKTQTKPSLHRYLLRHRHVNST 122
DDNA+A A T + HSIRDRF FKRN +H + + K SL
Sbjct: 10 DDNASANAVTGT---------HSIRDRFPFKRNSSHF--RLRVKDSLDH----------- 47
Query: 123 PSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAF--AMASMVLQNSIASVFGA 180
+A+ + + R NRKG S P RG L YF++ AVF F + S+ S G+
Sbjct: 48 -AASRSRSHQTRINRKGLLSWIPARGQTLFYFLVVFAVFGFFTGSMLLQSSISLLSSHGS 106
Query: 181 ERGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRSTPRFGVRPPRIGLILGNMAKDS 236
+R R + E ++FGS LKFVP ++ G+GL+ +R R GVR PR+ LILG+M D
Sbjct: 107 QRERWLMERIKFGSSLKFVPGRISKRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDP 166
Query: 237 RSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGII 296
+SL+LITV+KN+QKLGYVF+I+AV GN S+WEQI GQ SIL Y +DWSI+DGII
Sbjct: 167 QSLMLITVMKNIQKLGYVFEIFAVERGNKQSMWEQI-GQPSILSPGHYGRVDWSIYDGII 225
Query: 297 ADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNV 356
ADSLE + AI+SLMQEPF S+PL+WI++ED+LA+RLP+Y +RG+++L+S+WK F R NV
Sbjct: 226 ADSLETEGAIASLMQEPFCSLPLIWIVREDTLASRLPMYEQRGWKHLISHWKRSFRRANV 285
Query: 357 IVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVV 416
+VFPD+ LPMLYS+LD GNF VIPGSPADV+A E Y H K QLR++NGF +D+I+V+V
Sbjct: 286 VVFPDFALPMLYSILDNGNFHVIPGSPADVYAAEDYMNVHSKSQLREKNGFNEDDILVLV 345
Query: 417 VGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVAS 476
VGS FF NELSWDYAVAMH +GPLL YARR VEGSFKFVFLC NSTDG +DAL+E+AS
Sbjct: 346 VGSLFFPNELSWDYAVAMHSIGPLLSIYARRREVEGSFKFVFLCCNSTDGSHDALKEIAS 405
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
RLGL + S+ HYG NGDVN VL+MADIVLYGSSQ Q FP L++RAM+FGIP++ PD P
Sbjct: 406 RLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPA 465
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT 596
+K Y+ +G + F K NP+ L +FS IS+GKLS+FA+++AS+GRL AKN+LA +CVT
Sbjct: 466 LKNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQSIASSGRLLAKNILASECVT 525
Query: 597 RYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRN 656
YA++LENVLNFPSD LPGP+SQLQ +WEWNLFRKE+ + T D +E +T S+
Sbjct: 526 GYAQLLENVLNFPSDVKLPGPVSQLQLGAWEWNLFRKEM-VKTIDENADNEERIATISK- 583
Query: 657 SSVVDLLEEEFTKNI-----TENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEE 711
+SV+ LE + T ++ +ENEN + +QD + DWD+L IES+EEYE +EME+ +E
Sbjct: 584 ASVIFALEAQLTNSVNLTILSENENGTLEQDIPTPQDWDILEKIESAEEYETVEMEEFQE 643
Query: 712 RMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGS 771
RM+ +WD+IYRNARKSE+ KFE+NERDEGELERTGQ V IYEIYSG+GAWPF+HHGS
Sbjct: 644 RMERDLGAWDEIYRNARKSEKLKFESNERDEGELERTGQTVSIYEIYSGAGAWPFMHHGS 703
Query: 772 LYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIG 831
LYRGL+LS+ A RL+SDDV+AV RL LL+ ++Y D LCEIGGMF+IANK+DNIHKRPWIG
Sbjct: 704 LYRGLSLSTRALRLKSDDVNAVGRLPLLDDSYYLDALCEIGGMFAIANKIDNIHKRPWIG 763
Query: 832 FQSWRAAGRKVSLSISAEKVLEETVQET-EGDVMYFWAHLDMDGGFTRNNNDVLTFWSMC 890
FQSW+A+GRKVSL AE VLE+T+Q+ +GDV+YFWAHL ++ G TFWS+C
Sbjct: 764 FQSWQASGRKVSLGKKAENVLEDTIQDNPKGDVIYFWAHLQVNRGTIPP-----TFWSVC 818
Query: 891 DILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSR 950
DILNGG CRT F FR+M+GL S++ ALPPMPEDGG WSALH WVM TPSFLEFIMFSR
Sbjct: 819 DILNGGLCRTTFRSTFREMFGLSSNMGALPPMPEDGGHWSALHSWVMPTPSFLEFIMFSR 878
Query: 951 MFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGS 1010
MF LDALN N S+ N CLL+SSE+EKKHCYCR+LE+LVNVWAYHSGR+MVY++P SG
Sbjct: 879 MFTHYLDALNRNQSQPNGCLLASSEIEKKHCYCRILEMLVNVWAYHSGRRMVYINPHSGF 938
Query: 1011 LQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYE 1070
L+EQHP+E+R+ FMW KYFNFTLLKSMDEDLAEAADD + LWP TGEVHW+GIYE
Sbjct: 939 LEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYE 998
Query: 1071 REREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLGG 1106
REREERYR KMDKKR K K+ +R+ GY+QK+LGG
Sbjct: 999 REREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG 1034
>gi|449522726|ref|XP_004168377.1| PREDICTED: uncharacterized protein LOC101229264 [Cucumis sativus]
Length = 1037
Score = 1237 bits (3200), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 641/1059 (60%), Positives = 799/1059 (75%), Gaps = 46/1059 (4%)
Query: 63 DDNAAAAATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQDKTQTKPSLHRYLLRHRHVNST 122
DDNA+A A T + HSIRDRF FKRN +H + + K SL
Sbjct: 10 DDNASANAVTGT---------HSIRDRFPFKRNSSHF--RLRVKDSLDH----------- 47
Query: 123 PSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAF--AMASMVLQNSIASVFGA 180
+A+ + + R NRKG S P RG L YF++ AVF F + S+ S G+
Sbjct: 48 -AASRSRSHQTRINRKGLLSWIPARGQTLFYFLVVFAVFGFFTGSMLLQSSISLLSSHGS 106
Query: 181 ERGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRSTPRFGVRPPRIGLILGNMAKDS 236
+R R + E ++FGS LKFVP ++ G+GL+ +R R GVR PR+ LILG+M D
Sbjct: 107 QRERWLMERIKFGSSLKFVPGRISKRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDP 166
Query: 237 RSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGII 296
+SL+LITV+KN+QKLGYVF+I+AV GN S+WEQI GQ SIL Y +DWSI+DGII
Sbjct: 167 QSLMLITVMKNIQKLGYVFEIFAVERGNKQSMWEQI-GQPSILSPGHYGRVDWSIYDGII 225
Query: 297 ADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNV 356
ADSLE + AI+SLMQEPF S+PL+WI++ED+LA+RLP+Y +RG+++L+S+WK F R NV
Sbjct: 226 ADSLETEGAIASLMQEPFCSLPLIWIVREDTLASRLPMYEQRGWKHLISHWKRSFRRANV 285
Query: 357 IVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVV 416
+VFPD+ LPMLYS+LD GNF VIPGSPADV+A E Y H K QLR++NGF +D+I+V+V
Sbjct: 286 VVFPDFALPMLYSILDNGNFHVIPGSPADVYAAEDYMNVHSKSQLREKNGFNEDDILVLV 345
Query: 417 VGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVAS 476
VGS FF NELSWDYAVAMH +GPLL YARR VEGSFKFVFLC NSTDG +DAL+E+AS
Sbjct: 346 VGSLFFPNELSWDYAVAMHSIGPLLSIYARRREVEGSFKFVFLCCNSTDGSHDALKEIAS 405
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
RLGL + S+ HYG NGDVN VL+MADIVLYGSSQ Q FP L++RAM+FGIP++ PD P
Sbjct: 406 RLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPA 465
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT 596
+K Y+ +G + F K NP+ L +FS IS+GKLS+FA+++AS+GRL AKN+LA +CVT
Sbjct: 466 LKNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQSIASSGRLLAKNILASECVT 525
Query: 597 RYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRN 656
YA++LENVLNFPSD LPGP+SQLQ +WEWNLFRKE+ + T D +E +T S+
Sbjct: 526 GYAQLLENVLNFPSDVKLPGPVSQLQLGAWEWNLFRKEM-VKTIDENADNEERIATISK- 583
Query: 657 SSVVDLLEEEFTKNI-----TENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEE 711
+SV+ LE + T ++ +ENEN + +QD + DWD+L IES+EEYE +EME+ +E
Sbjct: 584 ASVIFALEAQLTNSVNLTILSENENGTLEQDIPTPQDWDILEKIESAEEYETVEMEEFQE 643
Query: 712 RMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGS 771
RM+ +WD+IYRNARKSE+ KFE+NERDEGELERTGQ V IYEIYSG+GAWPF+HHGS
Sbjct: 644 RMERDLGAWDEIYRNARKSEKLKFESNERDEGELERTGQTVSIYEIYSGAGAWPFMHHGS 703
Query: 772 LYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIG 831
LYRGL+LS+ A RL+SDDV+AV RL LL+ ++Y D LCEIGGMF+IANK+DNIHKRPWIG
Sbjct: 704 LYRGLSLSTRALRLKSDDVNAVGRLPLLDDSYYLDALCEIGGMFAIANKIDNIHKRPWIG 763
Query: 832 FQSWRAAGRKVSLSISAEKVLEETVQET-EGDVMYFWAHLDMDGGFTRNNNDVLTFWSMC 890
FQSW+A+GRKVSL AE VLE+T+Q+ +GDV+YFWAHL ++ G TFWS+C
Sbjct: 764 FQSWQASGRKVSLGKKAENVLEDTIQDNPKGDVIYFWAHLQVNRGTIPP-----TFWSVC 818
Query: 891 DILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSR 950
DILNGG CRT F FR+M+GL S++ ALPPMPEDGG WSALH WVM TPSFLEFIMFSR
Sbjct: 819 DILNGGLCRTTFRSTFREMFGLSSNMGALPPMPEDGGHWSALHSWVMPTPSFLEFIMFSR 878
Query: 951 MFVDSLDALNANSSKVNSCLLSSSELE---KKHCYCRVLELLVNVWAYHSGRKMVYLDPL 1007
MF LDALN N S+ N CLL+SSE+E +KHCYCR+LE+LVNVWAYHSGR+MVY++P
Sbjct: 879 MFTHYLDALNRNQSQPNGCLLASSEIEVRIQKHCYCRILEMLVNVWAYHSGRRMVYINPH 938
Query: 1008 SGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKG 1067
SG L+EQHP+E+R+ FMW KYFNFTLLKSMDEDLAEAADD + LWP TGEVHW+G
Sbjct: 939 SGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQG 998
Query: 1068 IYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLGG 1106
IYEREREERYR KMDKKR K K+ +R+ GY+QK+LGG
Sbjct: 999 IYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG 1037
>gi|255584368|ref|XP_002532918.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223527311|gb|EEF29460.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 1020
Score = 1215 bits (3143), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 669/1061 (63%), Positives = 785/1061 (73%), Gaps = 98/1061 (9%)
Query: 69 AATTTSGNNDQQQHPHSIRDRFRFKRNPNHTQ-------DKTQTKPSLHRYLLRHRHVNS 121
+ S N + HSIRDRF FKRNPN T ++ +TK SL R R R ++
Sbjct: 23 GGSNASNRNTTDKVFHSIRDRFIFKRNPNCTNIAATINNNQNRTKSSLDR---RSRWHHN 79
Query: 122 TPSAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAE 181
S N S + +RKGF FRG YL YF+IF AVF FAMASM+LQ+SIA + ++
Sbjct: 80 YNSNNNTNRSNHQ-HRKGFV----FRGIYLFYFVIFFAVFGFAMASMILQSSIAGIVFSK 134
Query: 182 -------RGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRSTPRFGVRPPRIGLILG 230
R IR++LR GS LKFVP + G+GLD +R
Sbjct: 135 GRWSEDRRRGSIRDDLRLGSTLKFVPLRRSLRLVKGDGLDRVR----------------- 177
Query: 231 NMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWS 290
LQ+ +G S +WEQI+G+ISIL EQY IDWS
Sbjct: 178 -----------------LQE-----------NGKSQPVWEQISGRISILRPEQYGGIDWS 209
Query: 291 IFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSV 350
IF+G+I DSLEAKEAISSLMQEPF SIP++WIIQED+LANRLPVY E G+++L+S+W+
Sbjct: 210 IFEGVIVDSLEAKEAISSLMQEPFCSIPVIWIIQEDTLANRLPVYEEMGWEDLVSHWRRA 269
Query: 351 FSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKD 410
F R NV+VFPD+T+PMLYSVLDAGNFFVIPGSP DVWA E+YSK+H ++QLR NGF +D
Sbjct: 270 FKRANVVVFPDFTMPMLYSVLDAGNFFVIPGSPIDVWAAESYSKTHARHQLRTSNGFNED 329
Query: 411 EIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDA 470
++VV+VVGSSFFY+ELS DYAVAMH +GPLL+KYARR EG FKFVFLCGNSTDG DA
Sbjct: 330 DMVVLVVGSSFFYDELSLDYAVAMHTLGPLLVKYARRKDTEGLFKFVFLCGNSTDG--DA 387
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
LQ+VASRLGLL VRH+ NGDVNGVLLMADIVLYGSSQ EQGFP LI+RAMTFGIPVI
Sbjct: 388 LQDVASRLGLLHGFVRHFSLNGDVNGVLLMADIVLYGSSQDEQGFPPLIIRAMTFGIPVI 447
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
PD PI+K+YV +G + F+K NP+ L RAFSL IS+GKLS+F +TVAS+GRL AKNML
Sbjct: 448 APDIPIMKKYVIDGVHALLFKKYNPDSLMRAFSLLISDGKLSRFGKTVASSGRLLAKNML 507
Query: 591 ALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGT 650
A +C YAR+LEN ++FPSDALLPGP S LQQ WEWNLF EI T D+L MD
Sbjct: 508 ASECTMGYARLLENAVSFPSDALLPGPTSPLQQSVWEWNLFWNEIVPETDDLLGMDG--R 565
Query: 651 STSSRNSSVVDLLEEEFT-----KNITENENRSADQDTISELDWDVLHDIESSEEYERLE 705
++SSR SSVV LEEE T ++++N D +E DWD+L +I+S EEYERLE
Sbjct: 566 NSSSRGSSVVYSLEEELTYHTDSTSVSKNGTEVLVPDLPTESDWDILREIDSLEEYERLE 625
Query: 706 MEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWP 765
E+L+ER D + WD+IYRNARKSE+ KFE NERDEGELERTGQPVCIYEIY+G GAWP
Sbjct: 626 TEELKERTDRSPGVWDEIYRNARKSEKLKFETNERDEGELERTGQPVCIYEIYNGPGAWP 685
Query: 766 FLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIH 825
FLHHGSLYRGL+LSS +RR RSDDVDAV RL +LN T+YRDILCEIGGMFS+AN VDNIH
Sbjct: 686 FLHHGSLYRGLSLSSKSRRSRSDDVDAVGRLPILNDTYYRDILCEIGGMFSVANVVDNIH 745
Query: 826 KRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVL 884
+RPWIGFQSWRAAGRKVSLS AEKVLEE +Q ETEGDVMYFWA LD+D G T +NN+ L
Sbjct: 746 QRPWIGFQSWRAAGRKVSLSFEAEKVLEEKIQRETEGDVMYFWACLDVDSGVTGSNNE-L 804
Query: 885 TFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLE 944
TFWSMCDILNGGHCR+ A+ H WSALH WVM TPSFLE
Sbjct: 805 TFWSMCDILNGGHCRSV---AYSTKILCHCH-------------WSALHCWVMPTPSFLE 848
Query: 945 FIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYL 1004
FIMF+RMFVDSLDAL+ NS+ N CLLSSSELE+KHCYCR+LE+L+NVWAYHS RKMVY+
Sbjct: 849 FIMFARMFVDSLDALHTNSTLDNVCLLSSSELEEKHCYCRILEILINVWAYHSARKMVYI 908
Query: 1005 DPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVH 1064
DP +GS +EQHPIE+R+ +W KYFN TLLKSMDEDLAEAADDGD+PRE+WLWP TGEVH
Sbjct: 909 DPRTGSSEEQHPIEQRKEIIWAKYFNLTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVH 968
Query: 1065 WKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
W+GIYEREREERYRQKMDKKRK KEK+++RL GY+QK LG
Sbjct: 969 WQGIYEREREERYRQKMDKKRKTKEKLYERLKSGYKQKPLG 1009
>gi|7406452|emb|CAB85554.1| putative protein [Arabidopsis thaliana]
Length = 1091
Score = 1161 bits (3004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 570/885 (64%), Positives = 699/885 (78%), Gaps = 15/885 (1%)
Query: 228 ILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLI 287
+LGNM KD R+L+L+TV+KNLQKLGYVFK++AV +G + SLWEQ+AG + +L EQ
Sbjct: 216 VLGNMKKDPRTLMLVTVMKNLQKLGYVFKVFAVENGEARSLWEQLAGHVKVLVSEQLGHA 275
Query: 288 DWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYW 347
DW+IF+G+IADSLEAKEAIS EPF S+PL+WI+ ED LANRLPVY G +L+S+W
Sbjct: 276 DWTIFEGVIADSLEAKEAISRW--EPFRSVPLIWIVHEDILANRLPVYQRMGQNSLISHW 333
Query: 348 KSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGF 407
+S F+R +V+VFP +TLPML+SVLD GNF VIP S DVWA E+YS++H K LR+ N F
Sbjct: 334 RSAFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVDVWAAESYSETHTKQNLREINEF 393
Query: 408 LKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGY 467
+D+++++V+GSSFFY+E SWD AVAMH +GPLL +Y RR GSFKFVFL GNST G
Sbjct: 394 GEDDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQ 453
Query: 468 NDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGI 527
+DA+QEVASRLGL E +VRH+G N DVN VL MADI++Y SSQ EQ FP LIVRAM+FGI
Sbjct: 454 SDAVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYASSQEEQNFPPLIVRAMSFGI 513
Query: 528 PVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAK 587
P+ITPDFPI+K+Y+A+ IFF++++P+ L +AFS IS+G+LSKFA+T+AS+GRL K
Sbjct: 514 PIITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTK 573
Query: 588 NMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDE 647
N++A +C+T YAR+LEN+L+FPSD LPG ISQLQ +WEWN FR E++ IL+
Sbjct: 574 NLMATECITGYARLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD--- 630
Query: 648 WGTSTSSRNSSVVDLLEEEF-----TKNITENENRSADQDTISELDWDVLHDIESSEEYE 702
S +V +EE+F + N +N + S+LDWDVL E E
Sbjct: 631 -SAYAFIGKSGIVFQVEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLE--EIEGAEE 687
Query: 703 RLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSG 762
++E E+RM+ W++IYRNARKSE+ KFE NERDEGELERTG+P+CIYEIY+G+G
Sbjct: 688 YEKVESEEDRMERDVEDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAG 747
Query: 763 AWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVD 822
AWPFLHHGSLYRGL+LSS RRL SDDVDA RL LLN T+YRDILCEIGGMFS+ANKVD
Sbjct: 748 AWPFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVD 807
Query: 823 NIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETV-QETEGDVMYFWAHLDMDGGFTRNNN 881
+IH RPWIGFQSWRAAGRKVSLS AE+ LE + QET+G+++YFW LD+DG + N
Sbjct: 808 SIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKN 867
Query: 882 DVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPS 941
LTFWSMCDILN G+CRT FEDAFR MYGLP H+EALPPMPEDG WS+LH WVM TPS
Sbjct: 868 -ALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPS 926
Query: 942 FLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKM 1001
FLEF+MFSRMF +SLDAL+ N + SC L+SS LE+KHCYCRVLELLVNVWAYHSGRKM
Sbjct: 927 FLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKM 986
Query: 1002 VYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTG 1061
VY++P GSL+EQHP+++R+G MW KYFNFTLLKSMDEDLAEAADD D+PRE+WLWP TG
Sbjct: 987 VYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTG 1046
Query: 1062 EVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLGG 1106
EVHWKG+YEREREERYR KMDKKRK KEK++DR+ GY+QK+LGG
Sbjct: 1047 EVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1091
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 56/89 (62%), Gaps = 5/89 (5%)
Query: 147 RGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERGRPIREELRFGSRLKFVPDQVGF- 205
RG LLYF++ V AF M+S++LQNSI + G +G +R ++ GS LK+VP +
Sbjct: 47 RGTCLLYFLVAFTVCAFVMSSLLLQNSI-TWQGNVKGGQVRSQIGLGSTLKYVPGGIART 105
Query: 206 ---GNGLDGLRSTPRFGVRPPRIGLILGN 231
G GLD LRS R GVRPPR+ L++ +
Sbjct: 106 LIEGKGLDPLRSAVRIGVRPPRLALVMTD 134
>gi|296083473|emb|CBI23431.3| unnamed protein product [Vitis vinifera]
Length = 976
Score = 1117 bits (2888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/911 (63%), Positives = 681/911 (74%), Gaps = 82/911 (9%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQI 276
R GVR P + LILGNM K+ SL+L TV+KNLQ LGY+FKIYAV NS S+WEQ+ GQI
Sbjct: 2 RIGVRRPTLALILGNMKKNPPSLMLFTVIKNLQGLGYLFKIYAVHDDNSRSIWEQLGGQI 61
Query: 277 SILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYV 336
SIL E YS DW+ F+GII DSLEAKEAI SLMQEPF IPL+WIIQED+LA RLP Y
Sbjct: 62 SILSPEIYSHNDWTTFEGIIVDSLEAKEAILSLMQEPFCFIPLIWIIQEDTLAKRLPFYE 121
Query: 337 ERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSH 396
+ G+++L+SYW+S FSR +V+VFPD++LPMLYSVLD GNFFVIP SP DVWA E+YSK+H
Sbjct: 122 KLGWEHLVSYWRSAFSRADVVVFPDFSLPMLYSVLDTGNFFVIPASPVDVWAAESYSKTH 181
Query: 397 EKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPL--------------LI 442
KYQLR++ GF KD+++V+VVGSSFFY+ELSWDYAVAM+D+GPL ++
Sbjct: 182 SKYQLREDLGFNKDDMLVLVVGSSFFYDELSWDYAVAMNDIGPLLSKYARSKNAGAILIM 241
Query: 443 KYARRNSVEGS---------------------------FKFVFLC--------------- 460
K A + GS ++F FL
Sbjct: 242 KCALFYILIGSKHNKKYIVPFVGDEAFSLIPPFMMRVLWRFQFLALHLDLVQRTDNYIAH 301
Query: 461 --GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
GN+ D + EVAS L LL SVR YG N DVNG++LMAD+V+Y SSQVEQGFP L
Sbjct: 302 CKGNAWSQDCDMICEVASHLKLLPGSVRQYGMNSDVNGLILMADVVIYASSQVEQGFPPL 361
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTV 578
+ RAM+FGIPVI PD P I++YV +G V+ F K+NP+ L RAFSL ISNGKLSKFA+ V
Sbjct: 362 LTRAMSFGIPVIAPDLPDIRKYVVDGVHVVIFPKNNPDALMRAFSLLISNGKLSKFAKAV 421
Query: 579 ASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLG 638
A +GRL AKNMLA +CV YA++LENVL+FPSD LLPG ISQ Q +WEWN FR
Sbjct: 422 ALSGRLLAKNMLASECVNSYAKLLENVLSFPSDVLLPGHISQSQHDAWEWNSFR------ 475
Query: 639 TGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITEN--ENRSADQDTISELDWDVLHDIE 696
T D+ ++ S S R SSVVD+LEE + + N + D +++LDWDVL +IE
Sbjct: 476 TADMPLIE--NGSASMRKSSVVDVLEETLSNQLDSGNISNSETENDVLTQLDWDVLREIE 533
Query: 697 SSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYE 756
S EE ERLEME+LEERM+ WD+IYRNARK ER KFE NERDEGELERTGQP+CIYE
Sbjct: 534 SIEEMERLEMEELEERMEKNPGIWDEIYRNARKVERVKFETNERDEGELERTGQPLCIYE 593
Query: 757 IYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFS 816
IY+G+GAWPFLHHGS+YRGL+L+++ARRLRSDDVDAV RL +LN T+YRDI C+IGGMFS
Sbjct: 594 IYNGAGAWPFLHHGSMYRGLSLTTSARRLRSDDVDAVDRLPVLNDTYYRDIFCDIGGMFS 653
Query: 817 IANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGG 875
IA +VD IHKRPWIGFQSW A G KVSLS AEKVLEET+Q ET+GDV+YFWAHL++D G
Sbjct: 654 IAFRVDKIHKRPWIGFQSWHAVGSKVSLSSRAEKVLEETIQEETKGDVLYFWAHLNVDDG 713
Query: 876 FTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGW 935
T+ N + TFWSMCDILNGG+CRTAFEDAFRQMY +PS++EALPPMPEDGG WSALH W
Sbjct: 714 PTQKNR-IPTFWSMCDILNGGNCRTAFEDAFRQMYAMPSYIEALPPMPEDGGYWSALHSW 772
Query: 936 VMQTPSFLEFIMFSRMFVDSLDALN------------ANSSKVNSCLLSSSELEKKHCYC 983
VM TPSFLEFIMFSRMF DSLDAL+ NSS+ CLL SS+LEKKHCYC
Sbjct: 773 VMPTPSFLEFIMFSRMFADSLDALHMNSRQSMNLSQSMNSSQPTVCLLGSSKLEKKHCYC 832
Query: 984 RVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAE 1043
RVLELLVNVWAYHS RKMVY++P SG L+EQHP+E+RRGFMW KYFN TLLKSMDEDLAE
Sbjct: 833 RVLELLVNVWAYHSARKMVYINPYSGQLEEQHPVEQRRGFMWAKYFNSTLLKSMDEDLAE 892
Query: 1044 AADDGDYPREK 1054
AADDGD+PRE+
Sbjct: 893 AADDGDHPRER 903
>gi|449449513|ref|XP_004142509.1| PREDICTED: uncharacterized protein LOC101206932 [Cucumis sativus]
Length = 1025
Score = 969 bits (2506), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 522/1009 (51%), Positives = 693/1009 (68%), Gaps = 75/1009 (7%)
Query: 161 FAFAMASMVLQNSIASVF--GAERGRPIREELRFGSRLKFVPDQVGF----GNGLDGLRS 214
+A + L + + SVF +E+ EL+ G L F P ++ N +D + S
Sbjct: 27 LCYAFSIKPLPSLMTSVFLKRSEKAWSRDAELKVGVTLMFAPRRIPRKFIESNEVDQMHS 86
Query: 215 TPRFGVRPPRIGL-----------------------------ILGNMAKDSRSLLLITVV 245
RFG R PR+ L +L NM KDS+SL L TV+
Sbjct: 87 ENRFGFRNPRLALSPGYIGYIWSINTLQHVLRIYGCVSVHITVLRNMEKDSQSLFLFTVM 146
Query: 246 KNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEA 305
KN+++LGY F+I+AV +G + +W+++ G++ +L +Q+ IDW +F+GII DS+E KEA
Sbjct: 147 KNMKELGYAFEIFAVGNGEARQMWQEL-GRLVLLSPKQFGQIDWLLFEGIIVDSIEGKEA 205
Query: 306 ISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLP 365
I+S+M EPF S+PL+WIIQ+D L+ RL +Y +RG++NL+S+W+S FSR +V+VFP++ LP
Sbjct: 206 ITSIMVEPFCSVPLIWIIQDDMLSKRLNMYKDRGWENLVSHWRSTFSRASVVVFPNFALP 265
Query: 366 MLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNE 425
MLYS LD GNF VI GSP DVW+ E Y K+H K++L + GF ++IVV+VVGSSF YNE
Sbjct: 266 MLYSALDTGNFHVIQGSPVDVWSAEIYKKTHFKHELGNKLGFDVEDIVVLVVGSSF-YNE 324
Query: 426 LSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSV 485
LS +YAVA++ +GP+L K R+N E SFKFVFLCGNST+ NDALQE ASRLGL +
Sbjct: 325 LSPEYAVALNRMGPVLTKLPRKNP-EVSFKFVFLCGNSTNRCNDALQETASRLGLPSGYL 383
Query: 486 RHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
HYGF+ DVNG+L ADIVLY S+Q Q FPSL++RAMTF +P++ PD PII +YV +G
Sbjct: 384 SHYGFDQDVNGILYFADIVLYESAQNVQDFPSLLIRAMTFEVPIVAPDLPIINQYVIDGF 443
Query: 546 QVIFFQKDNPEGLSRAFSLFIS--NGKLSKFARTVASAGRLHAKNMLALDCVTRYARILE 603
+ F K + + L A + S +G+L+ A ++AS+GRL AKN+LA +CVT YA +L+
Sbjct: 444 HGLLFPKFSSDALISALTDLTSTSDGRLTMIANSIASSGRLLAKNILASECVTGYANLLK 503
Query: 604 NVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLL 663
VLNFPSD +LP I++L + WEW+LF E+ + +E + R SSVV L
Sbjct: 504 EVLNFPSDVVLPSSITRLPKAVWEWDLFWNELIQ-----VPPNEQRSEKIKRKSSVVIKL 558
Query: 664 EEEFTK-----NITENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFA 718
EEEF+ NI+ + D ++ DWD++ +IE EEY+R+EME+L+ER +
Sbjct: 559 EEEFSDLVSPLNISSPGKEISAHDIPTQQDWDIIEEIELIEEYDRVEMEELQERTESILG 618
Query: 719 SWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLAL 778
SW+ +YR ARKS+R K E+DE ELER GQ VCIYEIY+G GAWPFLHH +L+RGL+L
Sbjct: 619 SWEQVYRIARKSDRIK-LEKEKDEEELERAGQIVCIYEIYNGPGAWPFLHHAALFRGLSL 677
Query: 779 -----------------SSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKV 821
S A RL+SDDV A RL LL Y+DILCEIGGMF+IANK+
Sbjct: 678 DANFLFFIVCNLHCDLQSPKALRLKSDDVSAPQRLPLLKSRFYQDILCEIGGMFAIANKI 737
Query: 822 DNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGDVMYFWAHLDMDGGFTRNN 880
D IH+RPWIGFQ W+A GRKVSLS A +VLEE +QE T G+V+YFWA+LD+ GF +
Sbjct: 738 DTIHRRPWIGFQPWQADGRKVSLSKKAGQVLEEAIQENTGGEVIYFWAYLDV--GFEVID 795
Query: 881 NDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLP-SHVEALPPMPEDGGCWSALHGWVMQT 939
+D FW +CDI N GHCR+ F+DAFR MYGLP +H+EALPPMP+DG WS+LH WVM T
Sbjct: 796 SDDSPFWQICDIFNRGHCRSTFKDAFRHMYGLPRAHLEALPPMPDDGDLWSSLHSWVMPT 855
Query: 940 PSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGR 999
P+FLEFIMFSRMFVDS+DA+N N S CLL+SS LE++ CYCR+LE+L+NVWAYHSGR
Sbjct: 856 PTFLEFIMFSRMFVDSIDAVNGNLSDDYKCLLASSGLERRQCYCRMLEMLINVWAYHSGR 915
Query: 1000 KMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREK-WLWP 1058
+MVYL+P SGSL+EQHP+E R+ FMW K+FN TLLK+MD DLAEAA+DGD+ + WLWP
Sbjct: 916 RMVYLNPRSGSLEEQHPLEERQDFMWSKFFNITLLKTMDADLAEAANDGDHSTQNTWLWP 975
Query: 1059 WTGEVHWKGI-YEREREERYRQKMDKKRKMKEKM-FDRLTKGYRQKTLG 1105
TGE+ +GI E E+R+RQKM+K+R +EK + L ++QK LG
Sbjct: 976 LTGEMFREGINEMEEEEKRHRQKMEKRRISREKKPGNHLNHEHKQKPLG 1024
>gi|326520029|dbj|BAK03939.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1013
Score = 901 bits (2328), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/886 (49%), Positives = 600/886 (67%), Gaps = 26/886 (2%)
Query: 209 LDGLRSTPRF--GVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSH 266
L+ RS R+ G+ P R+ L++G M D+ SL+L T+ K+L LGY ++ A G +H
Sbjct: 112 LEAARSARRWWPGLEPVRLALVVGTMNIDAESLMLTTLAKSLVGLGYEIEVLAFVDGKAH 171
Query: 267 SLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQED 326
+W+ I ++++ E+ +DWS ++ ++ SLE K +S LMQEPF +P+VW+I +D
Sbjct: 172 DIWKAIC-HVNVVSFEKLKYVDWSKYNTVLVSSLEGKRVVSILMQEPFRLLPVVWLIHDD 230
Query: 327 SLANRLPVYVER--GFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPA 384
+L L Y + N + W++ F+ +VFPD LP+LYS +D GNF VIPGSP
Sbjct: 231 ALGQHLRNYPDPHLSIPNHIEDWRAHFNACTYVVFPDSHLPLLYSPVDTGNFLVIPGSPV 290
Query: 385 DVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKY 444
D+WA + Y S + +R ++G ++++VV+VVGS F++EL WDY + P ++
Sbjct: 291 DIWAAKRYGLSQSQGTIRNQHGIKEEDVVVLVVGSYLFFDELPWDYVTVLRASAPHVMDM 350
Query: 445 ARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIV 504
+R + +F+F CGN TD YN A QE+AS +G + SV+H+ D+ +L+ DIV
Sbjct: 351 SRTKKL--GVQFIFFCGNGTDAYNSAFQELASHMGFPDGSVKHFPMTHDIGNLLMFVDIV 408
Query: 505 LYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSL 564
LYGS + E GFP L++R+M+ IP+I P+ +I +YV +G F +P + AF
Sbjct: 409 LYGSLRQEPGFPPLLLRSMSSEIPIIAPNLTVITKYVTDGIHAFLFNSGDPSTAALAFMR 468
Query: 565 FISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQV 624
+ + L A +VA G+L +KNMLA DC+ + ++LE+VL++PSDA LP S+++
Sbjct: 469 ILGDKGLLDTAYSVALEGKLLSKNMLAYDCIVAHVKLLESVLHYPSDARLPLSFSKVKDK 528
Query: 625 SWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENE--NRSADQD 682
+W W+ F + LG + DE T RN+ + LL E N T N N ++ D
Sbjct: 529 TWLWDPFESKAALGNSS--SEDERHIHT--RNAGI--LLGESAQTNWTTNSDSNDTSSYD 582
Query: 683 TISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDE 742
S+ DWD L ++E E+ E EME+++ER++ +WD++YRNARKSER K E NERDE
Sbjct: 583 YPSQSDWDDLSEVEIFEDIEMREMEEIDERVEKPLLTWDEVYRNARKSERMKPEGNERDE 642
Query: 743 GELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYT 802
GELERTGQPVCIYEIY G GAWPFLHHGSLYRG+ LS RR RSDDVDAV RL +L+ T
Sbjct: 643 GELERTGQPVCIYEIYHGEGAWPFLHHGSLYRGITLSKGGRRPRSDDVDAVMRLSVLDNT 702
Query: 803 HYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETV-QETEG 861
+YRD+LCE G MF+IAN++D +HK PWIGFQSWRAAGRKVSLS+SAE+ LE+T+ +E
Sbjct: 703 YYRDLLCEFGAMFAIANRIDTVHKLPWIGFQSWRAAGRKVSLSVSAEETLEKTMAEENHE 762
Query: 862 DVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPP 921
DV+Y+W +D D T N FWS CD LN GHCRT FEDAFR MYGLP V ALPP
Sbjct: 763 DVIYYWVPMDTDQ--TSN------FWSTCDCLNAGHCRTLFEDAFRNMYGLPKGVAALPP 814
Query: 922 MPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHC 981
MP DG WS LH WVM TPSFL+FIMFSRMFVDSL +LN N ++ SCLL +S+ EK+HC
Sbjct: 815 MPNDGDYWSTLHSWVMPTPSFLKFIMFSRMFVDSLHSLNGNGTEPASCLLGASQPEKRHC 874
Query: 982 YCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDL 1041
YCR+LE+LVN+WAYHSGRKMVYL+P++G +EQHP E R+ MW+K+FNFTLLKSMDEDL
Sbjct: 875 YCRILEILVNIWAYHSGRKMVYLNPVTGESREQHPQEERKE-MWVKFFNFTLLKSMDEDL 933
Query: 1042 AEAADDGDYP-REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRK 1086
AE ADDG +P +++WLWP TG+V W G+ +REREE+Y +K ++ +
Sbjct: 934 AEEADDGMHPGKDQWLWPLTGQVFWPGMADREREEKYIKKTGQEAQ 979
>gi|449513237|ref|XP_004164270.1| PREDICTED: uncharacterized LOC101206932, partial [Cucumis sativus]
Length = 824
Score = 898 bits (2320), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/831 (56%), Positives = 608/831 (73%), Gaps = 29/831 (3%)
Query: 294 GIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSR 353
GII DS+E KEAI+S+M EPF S+PL+WIIQ+D L+ RL +Y +RG++NL+S+W+S FSR
Sbjct: 3 GIIVDSIEGKEAITSIMVEPFCSVPLIWIIQDDMLSKRLNMYKDRGWENLVSHWRSTFSR 62
Query: 354 VNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIV 413
+V+VFP++ LPMLYS LD GNF VI GSP DVW+ E Y K+H K++L + GF ++IV
Sbjct: 63 ASVVVFPNFALPMLYSALDTGNFHVIQGSPVDVWSAEIYKKTHFKHELGNKLGFDVEDIV 122
Query: 414 VVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQE 473
V+VVGSSF YNELS +YAVA++ +GP+L K R+N E SFKFVFLCGNST+ NDALQE
Sbjct: 123 VLVVGSSF-YNELSPEYAVALNRMGPVLTKLPRKNP-EVSFKFVFLCGNSTNRCNDALQE 180
Query: 474 VASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPD 533
ASRLGL + HYGF+ DVNG+L ADIVLY S+Q Q FPSL++RAMTF +P++ PD
Sbjct: 181 TASRLGLPSGYLSHYGFDQDVNGILYFADIVLYESAQNVQDFPSLLIRAMTFEVPIVAPD 240
Query: 534 FPIIKEYVA-------EGAQVIFFQKDNPEGLSRAFSLFIS--NGKLSKFARTVASAGRL 584
PII +YV +G + F K + + L A + S +G+L+ A ++AS+GRL
Sbjct: 241 LPIINQYVGLLFIQVIDGFHGLLFPKFSSDALISALTDLTSTSDGRLTMIANSIASSGRL 300
Query: 585 HAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEWNLFRKEIDLGTGDILN 644
AKN+LA +CVT YA +L+ VLNFPSD +LP I++L + WEW+LF E+ +
Sbjct: 301 LAKNILASECVTGYANLLKEVLNFPSDVVLPSSITRLPKAVWEWDLFWNELIQ-----VP 355
Query: 645 MDEWGTSTSSRNSSVVDLLEEEFTK-----NITENENRSADQDTISELDWDVLHDIESSE 699
+E + R SSVV LEEEF+ NI+ + D ++ DWD++ +IE E
Sbjct: 356 PNEQRSEKIKRKSSVVIKLEEEFSDLVSPLNISSPGKEISAHDIPTQQDWDIIEEIELIE 415
Query: 700 EYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYS 759
EY+R+EME+L+ER + SW+ +YR ARKS+R K E+DE ELER GQ VCIYEIY+
Sbjct: 416 EYDRVEMEELQERTESILGSWEQVYRIARKSDRIK-LEKEKDEEELERAGQIVCIYEIYN 474
Query: 760 GSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIAN 819
G GAWPFLHH +L+RGL+LS A RL+SDDV A RL LL Y+DILCEIGGMF+IAN
Sbjct: 475 GPGAWPFLHHAALFRGLSLSPKALRLKSDDVSAPQRLPLLKSRFYQDILCEIGGMFAIAN 534
Query: 820 KVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGDVMYFWAHLDMDGGFTR 878
+D IH+ PWIGFQSW+A GRKVSLS A +VLEE +QE T G+V+YFWA+LD+ GF
Sbjct: 535 NIDTIHRTPWIGFQSWQADGRKVSLSKKAGQVLEEAIQENTGGEVIYFWAYLDV--GFEV 592
Query: 879 NNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLP-SHVEALPPMPEDGGCWSALHGWVM 937
++D FW +CDI N GHCR+ F+DAFR MYGLP +H+EALPPMP+DG WS+LH WVM
Sbjct: 593 IDSDDSPFWQICDIFNRGHCRSTFKDAFRHMYGLPRAHLEALPPMPDDGDLWSSLHSWVM 652
Query: 938 QTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHS 997
TP+FLEFIMFSRMFVDS+DA+N N S CLL+SS LE++ CYCR+LE+L+NVWAYHS
Sbjct: 653 PTPTFLEFIMFSRMFVDSIDAVNGNLSDDYKCLLASSGLERRQCYCRMLEMLINVWAYHS 712
Query: 998 GRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREK-WL 1056
GR+MVYL+P SGSL+EQHP+E R+ FMW K+FN TLLK+MD DLAEAA+DGD+ + WL
Sbjct: 713 GRRMVYLNPRSGSLEEQHPLEERQDFMWSKFFNITLLKTMDADLAEAANDGDHSTQNTWL 772
Query: 1057 WPWTGEVHWKGI-YEREREERYRQKMDKKRKMKEKM-FDRLTKGYRQKTLG 1105
WP TGE+ +GI E E+R+RQKM+K+R +EK + L ++QK LG
Sbjct: 773 WPLTGEMFREGINEMEEEEKRHRQKMEKRRISREKKPGNHLNHEHKQKPLG 823
>gi|13357249|gb|AAK20046.1|AC025783_6 hypothetical protein [Oryza sativa Japonica Group]
gi|31433369|gb|AAP54888.1| expressed protein [Oryza sativa Japonica Group]
Length = 1012
Score = 852 bits (2201), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/914 (48%), Positives = 602/914 (65%), Gaps = 37/914 (4%)
Query: 209 LDGLRSTPRF--GVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSH 266
L+ RS R+ G+ P R+ L +G M +++SL++ T+ K+L+ LGY ++ A G ++
Sbjct: 118 LEAARSERRWWPGLAPVRLALFVGTMNINAQSLMVATLAKSLKNLGYEVEVLAFADGKAN 177
Query: 267 SLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQED 326
+ E I ++++ IDWS ++ ++ SLE K +S LMQEPF +P+VW+I ED
Sbjct: 178 DILENIC-HVNVVSPPSLKYIDWSKYNAVLLSSLEGKMVVSILMQEPFQFLPVVWLIHED 236
Query: 327 SLANRL-PVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPAD 385
+L L + + N + W++ F+ +VFPD LP+L+S LD GNF VI GSP D
Sbjct: 237 ALGQFLRNPELHQSIPNHIEDWRTHFNACTYVVFPDSYLPLLHSALDTGNFLVISGSPVD 296
Query: 386 VWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYA 445
+ A + YS SH + R + G +D++VV+VVGS F++EL WD+A + P ++ A
Sbjct: 297 ILATKRYSSSHTQESARNQYGSKEDDVVVLVVGSYLFFDELPWDFATVLRASAPHIMDMA 356
Query: 446 RRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVL 505
+ ++ +F+F CGN TD YN A QE+AS +G SV+H+ D+ +L+ ADIVL
Sbjct: 357 KTKNL--GVQFIFFCGNDTDAYNSAFQELASHMGFPLGSVKHFSLTHDIRNLLVFADIVL 414
Query: 506 YGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF 565
YGSS+ E FP L++R+M IP+I P+ +I +Y+ +G F D+P + AF+
Sbjct: 415 YGSSRQEPVFPPLLLRSMASEIPIIVPNLTVITKYITDGTHGFLFNADDPSTMVSAFAQI 474
Query: 566 ISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVS 625
+ +LS A +VA G+L +KNMLA DC+T + +LE+VL++PS A LP P+S+LQ+ +
Sbjct: 475 LGEKRLSARAFSVALEGKLLSKNMLAYDCITAHVMLLESVLHYPSYAKLPSPVSKLQERT 534
Query: 626 WEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVD-LLEEEFTKNIT-ENENRSADQDT 683
W W+LF + L S + ++D L++E N T + N ++ +
Sbjct: 535 WLWDLFETKAALENSS-------SDDDSQLLTRIIDNLVDESHESNQTYSDSNDTSLYNY 587
Query: 684 ISELDWDVLHDIE----------SSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
S DW+ L+++E + L ++ER++ SWD++Y+NARKSER
Sbjct: 588 PSLSDWNDLNEVEIFEDIERREIEEASFSPLPF-PIDERVERPLLSWDEVYKNARKSERL 646
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K E NERDEGELERTGQPVCIYEIYSG GAWPFLHHGSLYRG+ LS RR RSDDVDAV
Sbjct: 647 KPEGNERDEGELERTGQPVCIYEIYSGEGAWPFLHHGSLYRGITLSKGGRRPRSDDVDAV 706
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
+ L +L+ +Y D LCE G MF++AN++D +HK PWIGFQSW+AAGRKVSLS AE+ LE
Sbjct: 707 THLSVLDNPYYSDRLCEFGAMFAVANRIDAVHKLPWIGFQSWQAAGRKVSLSEKAEETLE 766
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
ET+ E DV+Y+WA +DMD T N FWSMCD LN G CRT FEDAFR MYGL
Sbjct: 767 ETMAGENNEDVIYYWAPMDMDQ--TSN------FWSMCDWLNAGRCRTLFEDAFRTMYGL 818
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLS 972
+ ALPPMP DG WS LH WVM TPSFL+FIMFSRMFVD L +LN N + SCLL
Sbjct: 819 SDGITALPPMPNDGDYWSTLHSWVMPTPSFLKFIMFSRMFVDYLHSLNVNGTDPASCLLG 878
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
+S+LEK+HCYCR+LE+LVNVWAYHSG+KM YL+P++G ++EQHP++ R MW+K+FNFT
Sbjct: 879 ASQLEKRHCYCRILEVLVNVWAYHSGKKMAYLNPVTGDIREQHPLDDRNE-MWVKFFNFT 937
Query: 1033 LLKSMDEDLAEAADDGDYP-REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LLKSMDEDLAE ADDG + ++WLWP TG+V W GI +REREE+Y +K+DKK K K K+
Sbjct: 938 LLKSMDEDLAEEADDGMHAGDDQWLWPLTGQVFWPGIADREREEKYIKKLDKKLKNKVKL 997
Query: 1092 FDRLTKGYRQKTLG 1105
+R GY+QK LG
Sbjct: 998 LERQKSGYKQKPLG 1011
>gi|218184963|gb|EEC67390.1| hypothetical protein OsI_34544 [Oryza sativa Indica Group]
Length = 1013
Score = 851 bits (2198), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/914 (48%), Positives = 607/914 (66%), Gaps = 36/914 (3%)
Query: 209 LDGLRSTPRF--GVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSH 266
L+ RS R+ G+ P R+ L +G M +++SL++ T+ K+L+ LGY ++ A G ++
Sbjct: 118 LEAARSERRWWPGLAPVRLALFVGTMNINAQSLMVATLAKSLKNLGYEVEVLAFADGKAN 177
Query: 267 SLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQED 326
+ E I ++++ IDWS ++ ++ SLE K +S LMQEPF +P+VW+I ED
Sbjct: 178 DILENIC-HVNVVSPPSLKYIDWSKYNAVLLSSLEGKMVVSILMQEPFQFLPVVWLIHED 236
Query: 327 SLANRL-PVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPAD 385
+L L + + N + W++ F+ +VFPD LP+L+S LD GNF VI GSP D
Sbjct: 237 ALGQFLRNPELHQSIPNHIEDWRTHFNACTYVVFPDSYLPLLHSALDTGNFLVISGSPVD 296
Query: 386 VWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYA 445
+ A + YS SH + R + G +D++VV+VVGS F++EL WD+A + P ++ A
Sbjct: 297 ILATKRYSSSHTQESARNQYGSKEDDVVVLVVGSYLFFDELPWDFATVLRASAPHIMDMA 356
Query: 446 RRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVL 505
+ ++ +F+F CGN TD YN A QE+AS +G SV+H+ D+ +L+ ADIVL
Sbjct: 357 KTKNL--GVQFIFFCGNDTDAYNSAFQELASHMGFPLGSVKHFSLTHDIRNLLVFADIVL 414
Query: 506 YGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF 565
YGSS+ E FP L++R+M IP+I P+ +I +Y+ +G F D+P + AF+
Sbjct: 415 YGSSRQEPVFPPLLLRSMASEIPIIVPNLTVITKYITDGTHGFLFNADDPSTMVSAFAQI 474
Query: 566 ISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVS 625
+ +LS A +VA G+L +KNMLA DC+T + +LE+VL++PS A LP P+S+LQ+ +
Sbjct: 475 LGEKRLSARAFSVALEGKLLSKNMLAYDCITAHVMLLESVLHYPSYAKLPSPVSKLQERT 534
Query: 626 WEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVD-LLEEEFTKNIT--ENENRSADQD 682
W W+LF + L S + ++D L++E N T + N ++ +
Sbjct: 535 WLWDLFETKAALENSS-------SDDDSQLLTRIIDNLVDESHESNQTTYSDSNDTSLYN 587
Query: 683 TISELDWDVLHDIESSEEYERLEMEQ---------LEERMDGTFASWDDIYRNARKSERF 733
S DW+ L+++E E+ ER E+E+ ++ER++ SWD++Y+NARKSER
Sbjct: 588 YPSLSDWNDLNEVEIFEDIERREIEEASFFPLPFPIDERVERPLLSWDEVYKNARKSERL 647
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K E NERDEGELERTGQPVCIYEIYSG GAWPFLHHGSLYRG+ LS RR RSDDVDAV
Sbjct: 648 KPEGNERDEGELERTGQPVCIYEIYSGEGAWPFLHHGSLYRGITLSKGGRRPRSDDVDAV 707
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
+ L +L+ +Y D LCE G MF++AN++D +HK PWIGFQSW+AAGRKVSLS AE LE
Sbjct: 708 THLSVLDNPYYSDRLCEFGAMFAVANRIDAVHKLPWIGFQSWQAAGRKVSLSEKAEATLE 767
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
ET+ E DV+Y+WA +DMD T N FWSMCD LN G CRT FEDAFR MYGL
Sbjct: 768 ETMAGENNEDVIYYWAPMDMDQ--TSN------FWSMCDWLNAGRCRTLFEDAFRTMYGL 819
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLS 972
+ ALPPMP DG WS LH WVM TPSFL+FIMFSRMFVD L +LN N + SCLL
Sbjct: 820 SDGITALPPMPNDGDYWSTLHSWVMPTPSFLKFIMFSRMFVDYLHSLNVNGTDPASCLLG 879
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
+S+LEK+HCYCR+LE+LVNVWAYHSG+KM YL+P++G ++EQHP++ R MW+K+FNFT
Sbjct: 880 ASQLEKRHCYCRILEVLVNVWAYHSGKKMAYLNPVTGDIREQHPLDDRNE-MWVKFFNFT 938
Query: 1033 LLKSMDEDLAEAADDGDYP-REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LLKSMDEDLAE ADDG + ++WLWP TG+V W GI +REREE+Y +K+DKK K K K+
Sbjct: 939 LLKSMDEDLAEEADDGMHAGDDQWLWPLTGQVFWPGIADREREEKYIKKLDKKLKNKVKL 998
Query: 1092 FDRLTKGYRQKTLG 1105
+R GY+QK LG
Sbjct: 999 LERQKSGYKQKPLG 1012
>gi|222613213|gb|EEE51345.1| hypothetical protein OsJ_32351 [Oryza sativa Japonica Group]
Length = 913
Score = 849 bits (2194), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/914 (47%), Positives = 601/914 (65%), Gaps = 37/914 (4%)
Query: 209 LDGLRSTPRF--GVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSH 266
L+ RS R+ G+ P R+ L + M +++SL++ T+ K+L+ LGY ++ A G ++
Sbjct: 19 LEAARSERRWWPGLAPVRLALFVEPMNINAQSLMVATLAKSLKNLGYEVEVLAFADGKAN 78
Query: 267 SLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQED 326
+ E I ++++ IDWS ++ ++ SLE K +S LMQEPF +P+VW+I ED
Sbjct: 79 DILENIC-HVNVVSPPSLKYIDWSKYNAVLLSSLEGKMVVSILMQEPFQFLPVVWLIHED 137
Query: 327 SLANRL-PVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPAD 385
+L L + + N + W++ F+ +VFPD LP+L+S LD GNF VI GSP D
Sbjct: 138 ALGQFLRNPELHQSIPNHIEDWRTHFNACTYVVFPDSYLPLLHSALDTGNFLVISGSPVD 197
Query: 386 VWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYA 445
+ A + YS SH + R + G +D++VV+VVGS F++EL WD+A + P ++ A
Sbjct: 198 ILATKRYSSSHTQESARNQYGSKEDDVVVLVVGSYLFFDELPWDFATVLRASAPHIMDMA 257
Query: 446 RRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVL 505
+ ++ +F+F CGN TD YN A QE+AS +G SV+H+ D+ +L+ ADIVL
Sbjct: 258 KTKNL--GVQFIFFCGNDTDAYNSAFQELASHMGFPLGSVKHFSLTHDIRNLLVFADIVL 315
Query: 506 YGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF 565
YGSS+ E FP L++R+M IP+I P+ +I +Y+ +G F D+P + AF+
Sbjct: 316 YGSSRQEPVFPPLLLRSMASEIPIIVPNLTVITKYITDGTHGFLFNADDPSTMVSAFAQI 375
Query: 566 ISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVS 625
+ +LS A +VA G+L +KNMLA DC+T + +LE+VL++PS A LP P+S+LQ+ +
Sbjct: 376 LGEKRLSARAFSVALEGKLLSKNMLAYDCITAHVMLLESVLHYPSYAKLPSPVSKLQERT 435
Query: 626 WEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVD-LLEEEFTKNIT-ENENRSADQDT 683
W W+LF + L S + ++D L++E N T + N ++ +
Sbjct: 436 WLWDLFETKAALENSS-------SDDDSQLLTRIIDNLVDESHESNQTYSDSNDTSLYNY 488
Query: 684 ISELDWDVLHDIE----------SSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
S DW+ L+++E + L ++ER++ SWD++Y+NARKSER
Sbjct: 489 PSLSDWNDLNEVEIFEDIERREIEEASFSPLPF-PIDERVERPLLSWDEVYKNARKSERL 547
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K E NERDEGELERTGQPVCIYEIYSG GAWPFLHHGSLYRG+ LS RR RSDDVDAV
Sbjct: 548 KPEGNERDEGELERTGQPVCIYEIYSGEGAWPFLHHGSLYRGITLSKGGRRPRSDDVDAV 607
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
+ L +L+ +Y D LCE G MF++AN++D +HK PWIGFQSW+AAGRKVSLS AE+ LE
Sbjct: 608 THLSVLDNPYYSDRLCEFGAMFAVANRIDAVHKLPWIGFQSWQAAGRKVSLSEKAEETLE 667
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
ET+ E DV+Y+WA +DMD T N FWSMCD LN G CRT FEDAFR MYGL
Sbjct: 668 ETMAGENNEDVIYYWAPMDMDQ--TSN------FWSMCDWLNAGRCRTLFEDAFRTMYGL 719
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLS 972
+ ALPPMP DG WS LH WVM TPSFL+FIMFSRMFVD L +LN N + SCLL
Sbjct: 720 SDGITALPPMPNDGDYWSTLHSWVMPTPSFLKFIMFSRMFVDYLHSLNVNGTDPASCLLG 779
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
+S+LEK+HCYCR+LE+LVNVWAYHSG+KM YL+P++G ++EQHP++ R MW+K+FNFT
Sbjct: 780 ASQLEKRHCYCRILEVLVNVWAYHSGKKMAYLNPVTGDIREQHPLDDRNE-MWVKFFNFT 838
Query: 1033 LLKSMDEDLAEAADDGDYP-REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LLKSMDEDLAE ADDG + ++WLWP TG+V W GI +REREE+Y +K+DKK K K K+
Sbjct: 839 LLKSMDEDLAEEADDGMHAGDDQWLWPLTGQVFWPGIADREREEKYIKKLDKKLKNKVKL 898
Query: 1092 FDRLTKGYRQKTLG 1105
+R GY+QK LG
Sbjct: 899 LERQKSGYKQKPLG 912
>gi|359489827|ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262009 [Vitis vinifera]
Length = 1026
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/974 (40%), Positives = 587/974 (60%), Gaps = 41/974 (4%)
Query: 154 FMIFLAVFAFAMASMVLQNSIASVFGAERGRP----IRE--ELRFGSRLKFVPDQV--GF 205
F F+ +F + ++++ S S+ E G I+ L FG ++F P ++ F
Sbjct: 54 FCFFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKF 113
Query: 206 GNGLDGL------RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYA 259
D + R RFG R P++ L+ ++ D + LL++TV L ++GY ++Y+
Sbjct: 114 QKEADEVNLSSASRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYS 173
Query: 260 VRSGNSHSLWEQIAGQISIL--GQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSI 317
+ G +++W + ++I+ + +++DW +DGII +SLEA+ +S +QEPF S+
Sbjct: 174 LEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSL 233
Query: 318 PLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFF 377
PL+W I E +LA RL Y G L++ WK VF+R +VFP+Y LPM+YS D+GN+F
Sbjct: 234 PLIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYF 293
Query: 378 VIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDV 437
VIPGSPA W V+ + SH R + G+ D+ V+ +V S F Y L ++A+ + +
Sbjct: 294 VIPGSPAQAWEVDNFMASHRD-SPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQAL 352
Query: 438 GPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFN-GDVNG 496
PL+ ++ N+ K + GNS + Y+ A++ +A +L + V+H + G+ +
Sbjct: 353 LPLVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADN 412
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
VL ADIV+YGS EQ FP ++++AM+FG +I PD IIK+YV + F K+
Sbjct: 413 VLAAADIVIYGSFLEEQSFPDILIKAMSFGKLIIAPDLSIIKKYVDDRVNGYLFPKEKIS 472
Query: 557 GLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPG 616
L++ IS GKLS +AS G+ AKN++ ++ V YA +LEN+L FPS+ P
Sbjct: 473 VLTQVILQMISEGKLSPLVHNIASLGKSTAKNLMVMETVEGYASLLENLLKFPSEVASPK 532
Query: 617 PISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITE 673
++++ + W+WNLF +G T+ +SR+ +D EE+++++ T
Sbjct: 533 AVTEIPPKLKEEWQWNLF-----AASGHSTY-----TNRTSRSHRFLDKFEEQWSQSQTG 582
Query: 674 NENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
++ W+ I + +R E ++L++R D SW+D+YR+A++++R
Sbjct: 583 GSGSVTTDESFPYSIWEEEKLIGIANAKKRREEDELKDRTDQPRGSWEDVYRSAKRADRA 642
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K + +ERD+GELERTGQP+CIYE Y G G WPFLH SLYRG+ LS+ RR +DD+DA
Sbjct: 643 KNDLHERDDGELERTGQPLCIYEPYFGEGTWPFLHATSLYRGIGLSTKGRRREADDIDAP 702
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
SRL LLN +YRD L E G F+IAN+VD IH+ WIGFQSWRA R SLS AE L
Sbjct: 703 SRLPLLNNPYYRDALGEYGAFFAIANRVDRIHRNAWIGFQSWRATARNASLSKIAETALL 762
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
+Q GD +YFW +DMD N L FWS CD +N G+C+ AF +A ++MYG+
Sbjct: 763 NAIQARKHGDTLYFWVRMDMDP----RNPSQLDFWSFCDAINAGNCKFAFSEALKKMYGI 818
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLL 971
++LPPMP DG WS + W + T SFLEF+MFSRMFVD+LDA + + + C L
Sbjct: 819 KRDWDSLPPMPVDGDAWSVMQSWALPTRSFLEFVMFSRMFVDALDAQIYNDHHQRGHCYL 878
Query: 972 SSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNF 1031
S S + KHCY RVLELLVNVWAYH ++MVY++P +G + E H ++ RRG MW+K+F++
Sbjct: 879 SLS--KDKHCYSRVLELLVNVWAYHGAKRMVYVNPQTGEMHEHHKLKNRRGHMWVKWFSY 936
Query: 1032 TLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LKSMDE+LAE +DD D+P +WLWP TGEV W+GIY RER +R +QK ++++ K+K+
Sbjct: 937 ATLKSMDEELAEESDD-DHPMRRWLWPSTGEVFWQGIYLRERNQRLQQKEKRRQQSKDKL 995
Query: 1092 FDRLTKGYRQKTLG 1105
R+ + QK +G
Sbjct: 996 L-RMRRRSHQKVIG 1008
>gi|297745376|emb|CBI40456.3| unnamed protein product [Vitis vinifera]
Length = 1026
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/974 (40%), Positives = 587/974 (60%), Gaps = 41/974 (4%)
Query: 154 FMIFLAVFAFAMASMVLQNSIASVFGAERGRP----IRE--ELRFGSRLKFVPDQV--GF 205
F F+ +F + ++++ S S+ E G I+ L FG ++F P ++ F
Sbjct: 54 FCFFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKF 113
Query: 206 GNGLDGL------RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYA 259
D + R RFG R P++ L+ ++ D + LL++TV L ++GY ++Y+
Sbjct: 114 QKEADEVNLSSASRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYS 173
Query: 260 VRSGNSHSLWEQIAGQISIL--GQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSI 317
+ G +++W + ++I+ + +++DW +DGII +SLEA+ +S +QEPF S+
Sbjct: 174 LEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSL 233
Query: 318 PLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFF 377
PL+W I E +LA RL Y G L++ WK VF+R +VFP+Y LPM+YS D+GN+F
Sbjct: 234 PLIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYF 293
Query: 378 VIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDV 437
VIPGSPA W V+ + SH R + G+ D+ V+ +V S F Y L ++A+ + +
Sbjct: 294 VIPGSPAQAWEVDNFMASHRD-SPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQAL 352
Query: 438 GPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFN-GDVNG 496
PL+ ++ N+ K + GNS + Y+ A++ +A +L + V+H + G+ +
Sbjct: 353 LPLVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADN 412
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
VL ADIV+YGS EQ FP ++++AM+FG +I PD IIK+YV + F K+
Sbjct: 413 VLAAADIVIYGSFLEEQSFPDILIKAMSFGKLIIAPDLSIIKKYVDDRVNGYLFPKEKIS 472
Query: 557 GLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPG 616
L++ IS GKLS +AS G+ AKN++ ++ V YA +LEN+L FPS+ P
Sbjct: 473 VLTQVILQMISEGKLSPLVHNIASLGKSTAKNLMVMETVEGYASLLENLLKFPSEVASPK 532
Query: 617 PISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITE 673
++++ + W+WNLF +G T+ +SR+ +D EE+++++ T
Sbjct: 533 AVTEIPPKLKEEWQWNLF-----AASGHSTY-----TNRTSRSHRFLDKFEEQWSQSQTG 582
Query: 674 NENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
++ W+ I + +R E ++L++R D SW+D+YR+A++++R
Sbjct: 583 GSGSVTTDESFPYSIWEEEKLIGIANAKKRREEDELKDRTDQPRGSWEDVYRSAKRADRA 642
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K + +ERD+GELERTGQP+CIYE Y G G WPFLH SLYRG+ LS+ RR +DD+DA
Sbjct: 643 KNDLHERDDGELERTGQPLCIYEPYFGEGTWPFLHATSLYRGIGLSTKGRRREADDIDAP 702
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
SRL LLN +YRD L E G F+IAN+VD IH+ WIGFQSWRA R SLS AE L
Sbjct: 703 SRLPLLNNPYYRDALGEYGAFFAIANRVDRIHRNAWIGFQSWRATARNASLSKIAETALL 762
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
+Q GD +YFW +DMD N L FWS CD +N G+C+ AF +A ++MYG+
Sbjct: 763 NAIQARKHGDTLYFWVRMDMDP----RNPSQLDFWSFCDAINAGNCKFAFSEALKKMYGI 818
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLL 971
++LPPMP DG WS + W + T SFLEF+MFSRMFVD+LDA + + + C L
Sbjct: 819 KRDWDSLPPMPVDGDAWSVMQSWALPTRSFLEFVMFSRMFVDALDAQIYNDHHQRGHCYL 878
Query: 972 SSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNF 1031
S S + KHCY RVLELLVNVWAYH ++MVY++P +G + E H ++ RRG MW+K+F++
Sbjct: 879 SLS--KDKHCYSRVLELLVNVWAYHGAKRMVYVNPQTGEMHEHHKLKNRRGHMWVKWFSY 936
Query: 1032 TLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LKSMDE+LAE +DD D+P +WLWP TGEV W+GIY RER +R +QK ++++ K+K+
Sbjct: 937 ATLKSMDEELAEESDD-DHPMRRWLWPSTGEVFWQGIYLRERNQRLQQKEKRRQQSKDKL 995
Query: 1092 FDRLTKGYRQKTLG 1105
R+ + QK +G
Sbjct: 996 L-RMRRRSHQKVIG 1008
>gi|356522554|ref|XP_003529911.1| PREDICTED: uncharacterized protein LOC100806723 [Glycine max]
Length = 1034
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/992 (40%), Positives = 596/992 (60%), Gaps = 68/992 (6%)
Query: 151 LLYFMIFLAVFAFAMASMVLQNSIASVFGAERGRP---------------IREELRFGSR 195
++ F+ + VF + VL+NS A R R I E+ F +
Sbjct: 53 VVVFLCLVIVFQMFLPGSVLENSEEGSLEAVRMRSDNLFQYGDIHDVVLDIGEDAVFLPK 112
Query: 196 LKFVPDQVGFGNGLDGL-RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYV 254
+ + G G +D P FG R P++ L+ G + DS+ LL++TV LQ++GY
Sbjct: 113 ISEKFSRAGEGRDVDLFNHKVPHFGYRKPQLALVFGELLVDSQQLLMVTVGSALQEIGYE 172
Query: 255 FKIYAVRSGNSHSLWEQIAGQISIL--GQEQYSLIDWSIFDGIIADSLEAKEAISSLMQE 312
+++++ G H++W + I+I+ ++ + +DW +DGII SLEAK A S +QE
Sbjct: 173 IQVFSLEDGPGHNVWRNLRVPITIIRTCDKRNNTVDWLNYDGIIVSSLEAKSAFSCFLQE 232
Query: 313 PFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLD 372
PF SIPL+WI+ E++LA R Y G LL+ W VF+R V+VFP+Y LPM+YS D
Sbjct: 233 PFKSIPLIWIVHENALAYRSRQYTTNGQIELLNDWGRVFNRSTVVVFPNYALPMIYSTFD 292
Query: 373 AGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAV 432
AGNF+VIPGSPA+ EA+ + +K LR G+ +++++ +VGS F Y + +A+
Sbjct: 293 AGNFYVIPGSPAETLEAEAF-MALQKDNLRANMGYGPEDVIIAIVGSRFLYKGMWLGHAI 351
Query: 433 AMHDVGPLLIKYA-RRNSVEGSFKFVFLCGNSTDGYNDALQEVASRL----GLLEHSVRH 487
+ + PLL + +++ F+ + T+ Y AL+ +A L G++EH
Sbjct: 352 VLRALKPLLEDFLLNKDNSSAQFRIIVHSEELTNNYTVALETMAHSLKYPGGIIEH---- 407
Query: 488 YGFNGDVNG--VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
GD+N VL AD+V+YGS EQ FP ++++AM+F P+I PD P+I++YV +
Sbjct: 408 --IAGDLNADSVLGTADVVIYGSFLEEQSFPEILIKAMSFEKPIIAPDVPMIRKYVDDRV 465
Query: 546 QVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENV 605
F KDN L + IS GK+S A +AS GR AKN++A + + YA +L+N+
Sbjct: 466 NGYLFPKDNIRVLRQILLEVISKGKISPLACNIASIGRSTAKNLMASEAIDGYASLLQNI 525
Query: 606 LNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDL 662
L PS+ P +S++ + W+W+LF ++ + + R+++ +D
Sbjct: 526 LRLPSEVSPPKAVSEIAPNFKEQWQWHLFEAFPNMTY----------QNRALRSNTFLDK 575
Query: 663 LEEEFTKNITENENRSADQDTISELDWDVLHDIESSEEYERL-------EMEQLEERMDG 715
E + + +NRS + +++ ++ + E+Y +L E E+L++RM+
Sbjct: 576 YEHQLNHS---QKNRSTTAVSANDV---FVYSLWEEEKYTQLAITKKRREDEELKDRMEQ 629
Query: 716 TFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRG 775
+ +W+D+Y++A++++R K + +ERDEGELERTGQP+CIYE Y G G+WPFLH SLYRG
Sbjct: 630 SHGTWEDVYKSAKRADRSKNDLHERDEGELERTGQPLCIYEPYFGEGSWPFLHKKSLYRG 689
Query: 776 LALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSW 835
+ LS RR DDVDA SRL LLN +YRD+L + G F+IANK+D +H+ WIGFQSW
Sbjct: 690 IGLSGKGRRPGRDDVDAPSRLPLLNNGYYRDLLSDYGAFFAIANKIDRLHRNAWIGFQSW 749
Query: 836 RAAGRKVSLSISAEKVLEETVQETE-GDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILN 894
RA RK SLSI AE L + +Q GD +YFW +DMD +RN + FWS CD +N
Sbjct: 750 RATARKASLSIIAENALLDAIQSKRYGDALYFWVRMDMD---SRNPSQT-DFWSFCDAVN 805
Query: 895 GGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVD 954
G+C+ AF +A R MYG+ ++LPPMP DG WS + W M T SF+EF+MFSRMFVD
Sbjct: 806 AGNCKFAFSEAMRGMYGVKGDADSLPPMPVDGDTWSVMQSWAMPTRSFMEFVMFSRMFVD 865
Query: 955 SLDALNANSSKVNS-CLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQE 1013
+LDA + + C LS S + KHCY R+LELLVNVW YHS R+MV++DP +G +QE
Sbjct: 866 ALDAQMYDEHHLTGHCSLSLS--KDKHCYSRLLELLVNVWTYHSARRMVFVDPETGLMQE 923
Query: 1014 QHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYERER 1073
QH + RRG MW+K+F+++ LKSMDEDLAE +D D P WLWP TGEV W+G++ERER
Sbjct: 924 QHKFKSRRGQMWIKWFSYSTLKSMDEDLAELSDSED-PTRHWLWPSTGEVFWQGVFERER 982
Query: 1074 EERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
R+++K +K+K EK +R+ K +RQ+ +G
Sbjct: 983 SLRHKEKEKRKQKSIEKQ-NRIRKRHRQQVIG 1013
>gi|147777557|emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]
Length = 1040
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/988 (39%), Positives = 588/988 (59%), Gaps = 55/988 (5%)
Query: 154 FMIFLAVFAFAMASMVLQNSIASVFGAERGRP----IRE--ELRFGSRLKFVPDQV--GF 205
F F+ +F + ++++ S S+ E G I++ L FG ++F P ++ F
Sbjct: 54 FCFFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKKIGGLDFGEGIRFEPSKLLQKF 113
Query: 206 GNGLDGL------RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFK--- 256
D + R RFG R P++ L+ ++ D + LL++TV L ++GY +
Sbjct: 114 QKEADEVNLSSASRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQALP 173
Query: 257 -----------IYAVRSGNSHSLWEQIAGQISIL--GQEQYSLIDWSIFDGIIADSLEAK 303
+Y++ G +++W + ++I+ + +++DW +DGII +SLEA+
Sbjct: 174 YLVSIYVAWIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEAR 233
Query: 304 EAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYT 363
+S +QEPF S+PL+W I E +LA RL Y G L++ WK VF+R +VFP+Y
Sbjct: 234 GVVSCFVQEPFKSLPLIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYV 293
Query: 364 LPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFY 423
LPM+YS D+GN+FVIPGSPA W V+ + SH R + G+ D+ V+ +V S F Y
Sbjct: 294 LPMIYSTFDSGNYFVIPGSPAQAWEVDNFMASHRD-SPRVKMGYGPDDFVIALVRSQFLY 352
Query: 424 NELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEH 483
L ++A+ + + PL+ ++ N+ K + GNS + Y+ A++ +A +L +
Sbjct: 353 KGLWLEHALILQALLPLVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKG 412
Query: 484 SVRHYGFN-GDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVA 542
V+H + G+ + VL ADIV+YGS EQ FP ++++AM+FG +I PD IIK+YV
Sbjct: 413 VVKHIAIDVGEADNVLAAADIVIYGSFLEEQSFPDILIKAMSFGKXIIAPDLSIIKKYVD 472
Query: 543 EGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARIL 602
+ F K+ L++ IS GKLS +AS G+ AKN++ ++ V YA +L
Sbjct: 473 DRVXGYLFPKEKISVLTQVILQMISEGKLSPLVHNIASLGKSTAKNLMVMETVEGYASLL 532
Query: 603 ENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSV 659
EN+L FPS+ P ++++ + W+WNLF +G T+ +SR+
Sbjct: 533 ENLLKFPSEVASPKAVTEIPPKLKEEWQWNLF-----AASGHSTY-----TNRTSRSHRF 582
Query: 660 VDLLEEEFTKNITENENRSADQDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFAS 719
+D EE+++++ T ++ W+ I + +R E ++L++R D S
Sbjct: 583 LDKFEEQWSQSQTGGSGSVTTDESFPYSIWEEEKLIGIANAKKRREEDELKDRTDQPRGS 642
Query: 720 WDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALS 779
W+D+YR+A++++R K + +ERD+GELERTGQP+CIYE Y G G WPFLH SLYRG+ LS
Sbjct: 643 WEDVYRSAKRADRAKNDLHERDDGELERTGQPLCIYEPYFGEGTWPFLHATSLYRGIGLS 702
Query: 780 SAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAG 839
+ RR +DD+DA SRL LLN +YRD L E G F+IAN+VD IH+ WIGFQSWRA
Sbjct: 703 TKGRRREADDIDAPSRLPLLNNPYYRDALGEYGAFFAIANRVDRIHRNAWIGFQSWRATA 762
Query: 840 RKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHC 898
R SLS AE L +Q GD +YFW +DMD N L FWS CD +N G+C
Sbjct: 763 RNASLSKIAETALLNAIQARKHGDTLYFWVRMDMDP----RNPSQLDFWSFCDAINAGNC 818
Query: 899 RTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA 958
+ AF +A ++MYG+ ++LPPMP DG WS + W + T SFLEF+MFSRMFVD+LDA
Sbjct: 819 KFAFSEALKKMYGIKRDWDSLPPMPVDGDAWSVMQSWALPTRSFLEFVMFSRMFVDALDA 878
Query: 959 -LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPI 1017
+ + + C LS S + KHCY RVLELLVNVWAYH ++MVY++P +G + E H +
Sbjct: 879 QIYNDHHQRGHCYLSLS--KDKHCYSRVLELLVNVWAYHGAKRMVYVNPQTGEMHEHHKL 936
Query: 1018 ERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERY 1077
+ RRG MW+K+F++ LKSMDE+LAE +DD D+P +WLWP TGEV W+GIY RER +R
Sbjct: 937 KNRRGHMWVKWFSYATLKSMDEELAEESDD-DHPMRRWLWPSTGEVFWQGIYLRERNQRL 995
Query: 1078 RQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
+QK ++++ K+K+ R+ + QK +G
Sbjct: 996 QQKEKRRQQSKDKLL-RMRRRSHQKVIG 1022
>gi|297610864|ref|NP_001065214.2| Os10g0546200 [Oryza sativa Japonica Group]
gi|255679599|dbj|BAF27128.2| Os10g0546200 [Oryza sativa Japonica Group]
Length = 948
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/914 (44%), Positives = 554/914 (60%), Gaps = 101/914 (11%)
Query: 209 LDGLRSTPRF--GVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSH 266
L+ RS R+ G+ P R+ L +G M +++SL++ T+ K+L+ LGY +I
Sbjct: 118 LEAARSERRWWPGLAPVRLALFVGTMNINAQSLMVATLAKSLKNLGYEVEIQCC------ 171
Query: 267 SLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQED 326
A L +E + EPF +P+VW+I ED
Sbjct: 172 ------------------------------ASKLSGRE--NGRFNEPFQFLPVVWLIHED 199
Query: 327 SLANRL-PVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPAD 385
+L L + + N + W++ F+ +VFPD LP+L+S LD GNF VI GSP D
Sbjct: 200 ALGQFLRNPELHQSIPNHIEDWRTHFNACTYVVFPDSYLPLLHSALDTGNFLVISGSPVD 259
Query: 386 VWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYA 445
+ A + YS SH + R + G +D++VV+VVGS F++EL WD+A + P ++ A
Sbjct: 260 ILATKRYSSSHTQESARNQYGSKEDDVVVLVVGSYLFFDELPWDFATVLRASAPHIMDMA 319
Query: 446 RRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVL 505
+ ++ +F+F CGN TD YN A QE+AS +G SV+H+ D+ +L+ ADIVL
Sbjct: 320 KTKNL--GVQFIFFCGNDTDAYNSAFQELASHMGFPLGSVKHFSLTHDIRNLLVFADIVL 377
Query: 506 YGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF 565
YGSS+ E FP L++R+M IP+I P+ +I +Y+ +G F D+P + AF+
Sbjct: 378 YGSSRQEPVFPPLLLRSMASEIPIIVPNLTVITKYITDGTHGFLFNADDPSTMVSAFAQI 437
Query: 566 ISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVS 625
+ +LS A +VA G+L +KNMLA DC+T + +LE+VL++PS A LP P+S+LQ+ +
Sbjct: 438 LGEKRLSARAFSVALEGKLLSKNMLAYDCITAHVMLLESVLHYPSYAKLPSPVSKLQERT 497
Query: 626 WEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVD-LLEEEFTKNIT-ENENRSADQDT 683
W W+LF + L S + ++D L++E N T + N ++ +
Sbjct: 498 WLWDLFETKAALENSS-------SDDDSQLLTRIIDNLVDESHESNQTYSDSNDTSLYNY 550
Query: 684 ISELDWDVLHDIE----------SSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERF 733
S DW+ L+++E + L ++ER++ SWD++Y+NARKSER
Sbjct: 551 PSLSDWNDLNEVEIFEDIERREIEEASFSPLPF-PIDERVERPLLSWDEVYKNARKSERL 609
Query: 734 KFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAV 793
K E NERDEGELERTGQP S RR RSDDVDAV
Sbjct: 610 KPEGNERDEGELERTGQP---------------------------SKGGRRPRSDDVDAV 642
Query: 794 SRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLE 853
+ L +L+ +Y D LCE G MF++AN++D +HK PWIGFQSW+AAGRKVSLS AE+ LE
Sbjct: 643 THLSVLDNPYYSDRLCEFGAMFAVANRIDAVHKLPWIGFQSWQAAGRKVSLSEKAEETLE 702
Query: 854 ETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGL 912
ET+ E DV+Y+WA +DMD T N FWSMCD LN G CRT FEDAFR MYGL
Sbjct: 703 ETMAGENNEDVIYYWAPMDMDQ--TSN------FWSMCDWLNAGRCRTLFEDAFRTMYGL 754
Query: 913 PSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLS 972
+ ALPPMP DG WS LH WVM TPSFL+FIMFSRMFVD L +LN N + SCLL
Sbjct: 755 SDGITALPPMPNDGDYWSTLHSWVMPTPSFLKFIMFSRMFVDYLHSLNVNGTDPASCLLG 814
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
+S+LEK+HCYCR+LE+LVNVWAYHSG+KM YL+P++G ++EQHP++ R MW+K+FNFT
Sbjct: 815 ASQLEKRHCYCRILEVLVNVWAYHSGKKMAYLNPVTGDIREQHPLDDRNE-MWVKFFNFT 873
Query: 1033 LLKSMDEDLAEAADDGDYP-REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
LLKSMDEDLAE ADDG + ++WLWP TG+V W GI +REREE+Y +K+DKK K K K+
Sbjct: 874 LLKSMDEDLAEEADDGMHAGDDQWLWPLTGQVFWPGIADREREEKYIKKLDKKLKNKVKL 933
Query: 1092 FDRLTKGYRQKTLG 1105
+R GY+QK LG
Sbjct: 934 LERQKSGYKQKPLG 947
>gi|449441374|ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus]
Length = 1037
Score = 730 bits (1884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/907 (41%), Positives = 547/907 (60%), Gaps = 39/907 (4%)
Query: 213 RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQI 272
R+ RFG R P++ L+ ++ DS +L++T+ LQ++GYVF++Y+++ G ++ +W Q+
Sbjct: 131 RTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQGGPANDVWRQM 190
Query: 273 AGQISILGQ--EQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLAN 330
++++ E ++DW +DGI+ SL K+ S +QEPF S+PL+W I E++LA
Sbjct: 191 GVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLPLIWTIHEEALAI 250
Query: 331 RLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVE 390
R Y G ++L+ WK VF+ V+VFP+Y +PM+YS D+GNFFVIP PA+ E
Sbjct: 251 RSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFVIPSFPAEALEAE 310
Query: 391 AYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSV 450
S + LR + G+ D++V+ +VGS F Y + ++A+ + + PLL +++
Sbjct: 311 IDVTS-DADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLPLLHEFSFYEHS 369
Query: 451 EGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQ 510
K L G+S Y A++ +A RL V+H+ D + L MAD+V+YGS
Sbjct: 370 NSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALSMADLVIYGSCL 429
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
EQ FP ++V+AM G P+I PD II+++V + F K N LS+ IS G+
Sbjct: 430 EEQSFPKVLVKAMGMGKPIIAPDLAIIRKHVDDRVNGYLFPKGNFNVLSQIILQVISEGR 489
Query: 571 LSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWE 627
LS A+++AS GR N++ + V YA +L+ VL PS+A ++++ + W+
Sbjct: 490 LSPLAQSIASIGRDTVINLMVSETVEGYASLLDAVLKLPSEAAPAKEVAEIPSKLKEKWQ 549
Query: 628 WNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQDTISEL 687
W LF+ G ++ + RN +L+E F KN R L
Sbjct: 550 WQLFK-----GVSNL--------TVLQRNEKSFTVLDE-FEKNWNHTPKRKPGSSF--AL 593
Query: 688 DWDVLHDIESSEEY-------ERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANER 740
+ ++ I E Y R E +++++R + +W+D+YR+A+K++R K + +ER
Sbjct: 594 NESFIYGIWEEERYTVMSNIKRRREEDEIKDRTEQPHNTWEDVYRSAKKADRSKNDLHER 653
Query: 741 DEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLN 800
DEGELERTGQP+CIYE Y G G WPFLH SLYRG+ LSS RR DDVDA SRL LLN
Sbjct: 654 DEGELERTGQPLCIYEPYFGEGVWPFLHRYSLYRGIGLSSKGRRSGIDDVDAPSRLPLLN 713
Query: 801 YTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQETE 860
+YR++L E G F+IAN+VD IHK WIGF SWRA R VSLS AE L + +Q
Sbjct: 714 NPYYRNVLGEYGAFFAIANRVDRIHKNAWIGFHSWRATARNVSLSKIAETALLDAIQTRR 773
Query: 861 -GDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEAL 919
GD +YFW +D D N L FWS CD +N G+C+ AF ++ + MYG+ S E L
Sbjct: 774 YGDALYFWVRMDSDP----RNPLQLDFWSFCDSINAGNCKFAFSESLKMMYGIKSDQEFL 829
Query: 920 PPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANS-SKVNSCLLSSSELEK 978
PPMP DG WSA+ W + T SFLEF+MFSRMFVD+LD N C LS S +
Sbjct: 830 PPMPADGYTWSAMQSWALPTRSFLEFVMFSRMFVDALDVQMYNEHHSTGRCYLSLS--KD 887
Query: 979 KHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMD 1038
KHCY R+LELLVNVWAYHS R++VY+ P +G++QEQH + RRG MW+K+F++T++KSMD
Sbjct: 888 KHCYSRLLELLVNVWAYHSARRIVYVHPETGAMQEQHKFDIRRGQMWIKWFSYTMIKSMD 947
Query: 1039 EDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKG 1098
EDL E A D D+P +WLWP TGEV W+G+YERE+ R+RQK +K+K K K+ DR+
Sbjct: 948 EDLGEEA-DADHPTRRWLWPSTGEVFWQGVYEREKNLRFRQKEHRKQKSKAKL-DRMRHR 1005
Query: 1099 YRQKTLG 1105
QK +G
Sbjct: 1006 RHQKVIG 1012
>gi|449495248|ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212216 [Cucumis
sativus]
Length = 1037
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/907 (41%), Positives = 546/907 (60%), Gaps = 39/907 (4%)
Query: 213 RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQI 272
R+ RFG R P++ L+ ++ DS +L++T+ LQ++GYVF++Y+++ G ++ +W Q+
Sbjct: 131 RTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQGGPANDVWRQM 190
Query: 273 AGQISILGQ--EQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLAN 330
++++ E ++DW +DGI+ SL K+ S +QEPF S+PL+W I E++LA
Sbjct: 191 GVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLPLIWTIHEEALAI 250
Query: 331 RLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVE 390
R Y G ++L+ WK VF+ V+VFP+Y +PM+YS D+GNFFVIP PA+ E
Sbjct: 251 RSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFVIPSFPAEALEAE 310
Query: 391 AYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSV 450
S + LR + G+ D++V+ +VGS F Y + ++A+ + + PLL +++
Sbjct: 311 IDVTS-DADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLPLLHEFSFYEHS 369
Query: 451 EGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQ 510
K L G+S Y A++ +A RL V+H+ D + L MAD+V+YGS
Sbjct: 370 NSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALSMADLVIYGSCL 429
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
EQ FP ++V+AM G P+I PD II+++V + F K N LS+ IS G+
Sbjct: 430 EEQSFPKVLVKAMGMGKPIIAPDLAIIRKHVDDRVNGYLFPKGNFNVLSQIILQVISEGR 489
Query: 571 LSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWE 627
LS A+++AS GR N++ + V YA +L+ VL PS+A ++++ + W+
Sbjct: 490 LSPLAQSIASIGRDTVINLMVSETVEGYASLLDAVLKLPSEAAPAKEVAEIPSKLKEKWQ 549
Query: 628 WNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQDTISEL 687
W LF+ G ++ + RN +L+E F KN R L
Sbjct: 550 WQLFK-----GVSNL--------TVLQRNEKSFTVLDE-FEKNWNHTPKRKPGSSF--AL 593
Query: 688 DWDVLHDIESSEEY-------ERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANER 740
+ ++ I E Y R E +++++R + +W+D+YR+A+K++R K + +ER
Sbjct: 594 NESFIYGIWEEERYTVMSNIKRRREEDEIKDRTEQPHNTWEDVYRSAKKADRSKNDLHER 653
Query: 741 DEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLN 800
DEGELERTGQP+CIYE Y G G WPFLH SLYRG+ LSS RR DDVDA SRL LLN
Sbjct: 654 DEGELERTGQPLCIYEPYFGEGVWPFLHRYSLYRGIGLSSKGRRSGIDDVDAPSRLPLLN 713
Query: 801 YTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQETE 860
+YR++L E G F+IAN+VD IHK WIGF SWRA R VSLS AE L + +Q
Sbjct: 714 NPYYRNVLGEYGAFFAIANRVDRIHKNAWIGFHSWRATARNVSLSKIAETALLDAIQTRR 773
Query: 861 -GDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEAL 919
GD +YFW +D D N L FWS CD +N G+C+ AF ++ + MYG+ S E L
Sbjct: 774 YGDALYFWVRMDSDP----RNPLQLDFWSFCDSINAGNCKFAFSESLKMMYGIKSDQEFL 829
Query: 920 PPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANS-SKVNSCLLSSSELEK 978
PPMP DG WSA+ W + T FLEF+MFSRMFVD+LD N C LS S +
Sbjct: 830 PPMPADGYTWSAMQSWALPTRXFLEFVMFSRMFVDALDVQMYNEHHSTGRCYLSLS--KD 887
Query: 979 KHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMD 1038
KHCY R+LELLVNVWAYHS R++VY+ P +G++QEQH + RRG MW+K+F++T++KSMD
Sbjct: 888 KHCYSRLLELLVNVWAYHSARRIVYVHPETGAMQEQHKFDIRRGQMWIKWFSYTMIKSMD 947
Query: 1039 EDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKG 1098
EDL E A D D+P +WLWP TGEV W+G+YERE+ R+RQK +K+K K K+ DR+
Sbjct: 948 EDLGEEA-DADHPTRRWLWPSTGEVFWQGVYEREKNLRFRQKEHRKQKSKAKL-DRMRHR 1005
Query: 1099 YRQKTLG 1105
QK +G
Sbjct: 1006 RHQKVIG 1012
>gi|357130522|ref|XP_003566897.1| PREDICTED: uncharacterized protein LOC100822323 [Brachypodium
distachyon]
Length = 1039
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/905 (41%), Positives = 549/905 (60%), Gaps = 44/905 (4%)
Query: 213 RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQI 272
R R GVR PR+ L+ G++ D+ L +++V L+ +GY K+++ G ++W I
Sbjct: 147 RPVKRVGVRKPRLALVFGDLFPDAMQLQMVSVASVLEAMGYEMKVFSFEDGPCSNIWRAI 206
Query: 273 AGQISILGQEQYSLI--DWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLAN 330
+ IL + LI DW +DGI+ +S+EA+ SSLMQEPF SIP++W +QE SLA+
Sbjct: 207 GVPVQILPVDTKLLISVDWLDYDGILVNSVEARPVFSSLMQEPFKSIPVIWTVQESSLAH 266
Query: 331 RLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVE 390
R+ Y +L WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPGSP + +
Sbjct: 267 RISEYNSSEMVQILDGWKEVFSRANVIVFPNYILPVMYAAFDSGNYFVIPGSPKVAFQAD 326
Query: 391 AYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLI---KYARR 447
+R G + V+ +VGS F Y D A+ + VGPLL +Y+
Sbjct: 327 RLVAKSYDRNVRISLGLGPRDFVIAIVGSQFSYGGHLMDEALVLQAVGPLLQHLQQYSSE 386
Query: 448 NSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYG 507
NS + K L N T+ ++ AL +A +G +V H V+ +L +A +V+YG
Sbjct: 387 NSTQTELKVRILSRNVTEKHSPALDVIALNVGFPRGAVEHVAAEY-VDNLLGVASLVIYG 445
Query: 508 SSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS 567
S EQ FPS++V+AM+ VI PD +I +Y+ +G + F + N L++ +S
Sbjct: 446 SCLDEQSFPSVLVQAMSLEKLVIAPDLGMITKYIDDGVNGLLFPRKNIAMLTQVLLQAVS 505
Query: 568 NGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLP---GPISQLQQV 624
NG+LS + VAS G++HAK+++A + + YA +L+NV+ FP++AL P G I +
Sbjct: 506 NGELSVLGQKVASVGKIHAKDLMASETIEGYAVLLQNVIKFPAEALTPLSAGEIPLALKQ 565
Query: 625 SWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQ--D 682
W+W+LF + +M+E S ++ +EEE+ N ++ S + +
Sbjct: 566 EWKWHLFEH-----VKHLYHMNE-----SLTGYKILQKIEEEWRSNQKDDARGSTPKIDE 615
Query: 683 TISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDE 742
S + W+ + RLE E+L++R D T +W+++YRN +K +R K E +ERDE
Sbjct: 616 AFSTIAWEEERENRIMSIKMRLEEEELKDRSDQTHGTWEEVYRNVKKVDRMKNELHERDE 675
Query: 743 GELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYT 802
ELERTGQP+CIYE + G G WPFLH SLYRG+ LSS RR +DD+DA SRL LLN
Sbjct: 676 KELERTGQPLCIYEPFFGEGTWPFLHQSSLYRGIGLSSKGRRPGADDIDASSRLPLLNNG 735
Query: 803 HYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEG 861
+YRDIL E G F++AN++D IHK WIGFQSWR RKV+LS +AE + E +Q + G
Sbjct: 736 YYRDILGEFGAFFALANRIDRIHKNSWIGFQSWRVTARKVNLSKNAESAMLEAIQTQKHG 795
Query: 862 DVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPP 921
D YFW +D D N + FWS+CD +N G+CR S++ +LP
Sbjct: 796 DAFYFWVRMDQDPRSHANQD----FWSLCDTINAGNCRL-------------SNLNSLPH 838
Query: 922 MPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKH 980
MP GG WS + WV++T SFLEF+MFSRMFVD+LDA + N + C+LS KH
Sbjct: 839 MPNAGGTWSVMQSWVLRTRSFLEFVMFSRMFVDALDAQMYDNHHQTGHCILSLH--RDKH 896
Query: 981 CYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDED 1040
CY RVLEL+VNVWA+HS R+MV+++P +G++ EQHP++ RRG M +++F++ LKSMDED
Sbjct: 897 CYSRVLELIVNVWAFHSARRMVFVNPETGAMHEQHPLDGRRGQMSIQWFSYATLKSMDED 956
Query: 1041 LAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYR 1100
LAE D+ ++P KWLWP TGEV W+G+YERE+ R ++K +K++ K+K+ R+ K R
Sbjct: 957 LAEEYDE-EHPDRKWLWPQTGEVFWQGVYEREKNMRQQEKERRKQQTKDKI-QRIKKRAR 1014
Query: 1101 QKTLG 1105
QKT+G
Sbjct: 1015 QKTIG 1019
>gi|356560308|ref|XP_003548435.1| PREDICTED: uncharacterized protein LOC100807455 [Glycine max]
Length = 1035
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/898 (41%), Positives = 553/898 (61%), Gaps = 30/898 (3%)
Query: 218 FGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQIS 277
+G R P++ L+ G + DS+ LL++TV LQ++ Y +++++ G H++W + +
Sbjct: 136 YGYRKPQLALVFGELLVDSQQLLMVTVASALQEIDYEIQVFSLADGPGHNVWRNLRVPVI 195
Query: 278 IL--GQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVY 335
+L ++ +++DW +DGII SLEAK A S +QEPF SIPL+W + E++LA R Y
Sbjct: 196 VLRACDKRNNIVDWLNYDGIIVSSLEAKGAFSCFLQEPFKSIPLIWAVHENALAYRSRQY 255
Query: 336 VERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKS 395
G +L+ W VF+R V+VFP+Y LPM+YS DAGNF+VIPGSPA+ EA+ +
Sbjct: 256 TTNGQIEVLNDWGRVFNRSTVVVFPNYALPMIYSAFDAGNFYVIPGSPAETLEAEAF-MA 314
Query: 396 HEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYA-RRNSVEGSF 454
+K LR G+ +++++ +VGS F Y L +A+ + + PLL + +++
Sbjct: 315 LQKDNLRVNMGYGPEDVIIAIVGSQFLYKGLWLGHAIVLRALEPLLADFPLNKDNSSAQL 374
Query: 455 KFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
+ + G T+ Y AL+ +A L + H + +V+ VL +D+V+YGS EQ
Sbjct: 375 RIIVHSGELTNNYTVALKTMAHSLKYPRGIIEHIAGDLNVDSVLGTSDVVIYGSFLEEQS 434
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKF 574
FP ++++AM+F P+I PD P+I++YV + F KDN L + IS GK+S
Sbjct: 435 FPEILIKAMSFEKPIIAPDVPMIRKYVDDRVNGYLFPKDNIRVLRQILLEVISKGKISPL 494
Query: 575 ARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQL---QQVSWEWNLF 631
AR +AS GR AKN++ + + YA +LENVL PS+ P +S++ + W+W+LF
Sbjct: 495 ARNIASIGRSTAKNLMVSEAIDGYASLLENVLRLPSEVAPPKAVSEIPPSAKEQWQWHLF 554
Query: 632 RKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKN-ITENENRSADQDTISELDWD 690
+ NM + R+++ +D E ++ + T + A D W+
Sbjct: 555 EA--------VPNMT--FQNRVLRSNTFLDKYEGQWNHSQKTRSTPSVAANDIFVYSIWE 604
Query: 691 VLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQ 750
+ + +R E E+L++R + + +W+D+Y++A+K++R K + +ERDEGELERTGQ
Sbjct: 605 EEKYTQLAITKKRREDEELKDRTEQSHGTWEDVYKSAKKADRLKNDLHERDEGELERTGQ 664
Query: 751 PVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCE 810
P+CIYE Y G G+W FLH SLYRG+ LS RR DDVDA SRL LLN +YRD+L E
Sbjct: 665 PLCIYEPYFGEGSWSFLHQKSLYRGIGLSGKGRRPGRDDVDAPSRLPLLNNGYYRDLLGE 724
Query: 811 IGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQETE-GDVMYFWAH 869
G F+IAN++D +HK WIGFQSWRA RK SLSI+AE L + +Q GD +YFW
Sbjct: 725 YGAFFAIANRIDRLHKNAWIGFQSWRATARKASLSITAENALLDAIQSKRYGDALYFWVR 784
Query: 870 LDMDGGFTRNNNDVLT-FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGC 928
+DM + N + T FWS CD +N G+C+ F A R+MYG+ V++LPPMP DG
Sbjct: 785 MDM-----YSQNPLQTDFWSFCDAVNAGNCKLTFSKAMRRMYGVKDAVDSLPPMPVDGDT 839
Query: 929 WSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLE 987
WS + W + T SF+EF+MFSRMFVD+LDA + C LS S + KHCY R+LE
Sbjct: 840 WSVMQSWALPTRSFMEFVMFSRMFVDALDAQMYDEHHSTGRCSLSLS--KDKHCYSRLLE 897
Query: 988 LLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADD 1047
LLVNVW YHS R+MV++DP +G +QEQH RRG MW+K+F+++ LKSMDEDLAE +D
Sbjct: 898 LLVNVWTYHSARRMVFVDPETGLMQEQHKFPSRRGQMWIKWFSYSTLKSMDEDLAELSDS 957
Query: 1048 GDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
D P WLWP TGEV W+GI++RER R ++K +K+K EK +R+ K +RQ+ +G
Sbjct: 958 ED-PARHWLWPSTGEVFWQGIFDRERSLRQKEKEKRKQKSIEKQ-NRMRKRHRQQVIG 1013
>gi|414881397|tpg|DAA58528.1| TPA: hypothetical protein ZEAMMB73_570175 [Zea mays]
Length = 1058
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/913 (40%), Positives = 553/913 (60%), Gaps = 45/913 (4%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFK-----------IYAVRSGNS 265
R GVR PR+ ++ G++ + L +++V L+ +GY K ++++ G
Sbjct: 146 RLGVRKPRLAMVFGDLYPSAMQLQMVSVASVLEAMGYEMKCMSLVVGLYVLVFSLEDGPC 205
Query: 266 HSLWEQIAGQISILGQEQYSLI--DWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWII 323
++W I +SIL ++ I DW +DGI+ +S+EA+ SSL+ EPF SIP++W +
Sbjct: 206 GNIWGAIGVPVSILPEDANLPISVDWLDYDGILVNSIEARPVFSSLLHEPFKSIPIIWTV 265
Query: 324 QEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSP 383
E SLA+R Y G L++ WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPG P
Sbjct: 266 HEYSLAHRAKEYNVSGMIQLINAWKDVFSRTNVIVFPNYILPVMYAAFDSGNYFVIPGPP 325
Query: 384 ADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIK 443
++ + V+++ ++ G + ++ +VG+SF Y + + A+ + + PLL +
Sbjct: 326 SEAFQVDSFIAKSYHEDVKISLGLNPKDFLIAIVGNSFSYGDNLMEEALVLQALSPLLQR 385
Query: 444 YARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADI 503
Y NS + K GN T+ + AL+ VA +G +V H D + +L AD+
Sbjct: 386 YRSENSAQSELKVKIFTGNITEKHRMALESVALSVGFPRGAVEHVAAE-DKDNLLGTADL 444
Query: 504 VLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
V+Y S EQ FPS++V+AM+ VI PD II++++ +G + F + N L++
Sbjct: 445 VIYYSCLEEQLFPSVLVQAMSLEKLVIAPDLAIIRKHINDGVNGLLFPRKNIGMLAQVLL 504
Query: 564 LFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLP-----GPI 618
+SN K+S + +ASAG+ HAKN++A + + YA +L+NV+ P+DAL P P+
Sbjct: 505 RAVSNSKVSLSGQKIASAGKAHAKNLMASETIEGYAMLLKNVVKLPTDALSPLTADEIPL 564
Query: 619 SQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRS 678
+ Q+ W+W+LF D+ ++ TS S ++ LE+E+ N EN + S
Sbjct: 565 ALKQE--WKWHLF--------DDVKHLHRVNTSLSG--YKILQKLEQEWHSNQMENSSLS 612
Query: 679 ADQ--DTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFE 736
D S + W+ E + ++E ++L++R D +W+++YRN ++ ER K E
Sbjct: 613 TKNINDAFSAIAWEEQRVNEVMDIKRKMEEDELKDRNDQPHGTWEEVYRNVKRVERLKNE 672
Query: 737 ANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRL 796
+ERD+ ELERTGQP+CIYE + G G WPFLH SLYRG+ LSS RR +DD+DA SRL
Sbjct: 673 LHERDDKELERTGQPLCIYEPFFGEGTWPFLHQSSLYRGVGLSSKGRRPGADDIDASSRL 732
Query: 797 HLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETV 856
LLN +YRDIL E G F++AN++D IHK WIGFQSWRA RK +LS +AE + E +
Sbjct: 733 PLLNNVYYRDILGEFGAFFALANRIDRIHKNSWIGFQSWRATARKANLSTNAESAILEAI 792
Query: 857 Q-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYG--LP 913
Q + D YFW +D D N + FWS CD++N G+CR A +AF++MYG L
Sbjct: 793 QSQKHRDSFYFWVRMDQDPRNHANKD----FWSFCDVINAGNCRLAVLEAFQRMYGVHLD 848
Query: 914 SHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLLS 972
++ L MP DG WS + WV+ T SFLEF+MFSRMFVD+LDA + + C+LS
Sbjct: 849 RDLDTLLHMPNDGDTWSVMQSWVLPTRSFLEFVMFSRMFVDALDAQMYDKHHQTGHCILS 908
Query: 973 SSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFT 1032
+ +HCY R+LEL+VNVWA+HS R+MVY++P +G +QEQH + RRG M +++F++
Sbjct: 909 LH--KDQHCYSRLLELIVNVWAFHSARRMVYVNPATGQMQEQHHLSGRRGQMSVQFFSYA 966
Query: 1033 LLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMF 1092
LKSMDE+LAE D D+P +WLWP TGEV W+G+YERER R +K +K++ +EK+
Sbjct: 967 TLKSMDEELAEEF-DLDHPDRRWLWPQTGEVFWQGLYERERSMRQHEKERRKQQSREKI- 1024
Query: 1093 DRLTKGYRQKTLG 1105
R+ RQKTLG
Sbjct: 1025 QRIKSRARQKTLG 1037
>gi|240255711|ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidopsis thaliana]
gi|332656594|gb|AEE81994.1| glycosyl transferase family 1 protein [Arabidopsis thaliana]
Length = 1031
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/1004 (38%), Positives = 575/1004 (57%), Gaps = 67/1004 (6%)
Query: 136 NRKGFSSLFPFRG-AYLLYFMIFLAVFAFA------MASMVLQNSIASVFGAERGRP--- 185
NR S F + YLL+ I F FA + +V+ S E P
Sbjct: 37 NRSRLSRFFLLKSFNYLLWISIICVFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLV 96
Query: 186 -IREE--LRFGSR---------LKFVPDQVGFGNGLDGLRST-PRFGVRPPRIGLILGNM 232
RE+ L FG +KF D GF L +T RFG R P++ L+ G++
Sbjct: 97 GFREKGFLDFGDDVRIEPTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDL 156
Query: 233 AKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISIL--GQEQYSLIDWS 290
D +L++++ K LQ++GY ++Y++ G +S+W+++ ++IL QE +IDW
Sbjct: 157 LADPEQVLMVSLSKALQEVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWL 216
Query: 291 IFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSV 350
+DGII +SL A+ + MQEPF S+PL+W+I E++LA R Y G LL+ WK +
Sbjct: 217 SYDGIIVNSLRARSMFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKI 276
Query: 351 FSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKD 410
FSR +V+VF +Y LP+LY+ DAGNF+VIPGSP +V + +K D
Sbjct: 277 FSRASVVVFHNYLLPILYTEFDAGNFYVIPGSPEEVCKAKNLEFPPQK-----------D 325
Query: 411 EIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDA 470
++V+ +VGS F Y ++A+ + + PL S K + L G + Y+ A
Sbjct: 326 DVVISIVGSQFLYKGQWLEHALLLQALRPLF-SGNYLESDNSHLKIIVLGGETASNYSVA 384
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
++ ++ L + +V+H G+V+ +L +D+V+YGS EQ FP ++++AM+ G P++
Sbjct: 385 IETISQNLTYPKEAVKHVRVAGNVDKILESSDLVIYGSFLEEQSFPEILMKAMSLGKPIV 444
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
PD I++YV + F K N + LS+ I+ GK+S A+ +A G+ KNM+
Sbjct: 445 APDLFNIRKYVDDRVTGYLFPKQNLKVLSQVVLEVITEGKISPLAQKIAMMGKTTVKNMM 504
Query: 591 ALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDE 647
A + + YA +LEN+L F S+ P + ++ + W W+ F +D
Sbjct: 505 ARETIEGYAALLENMLKFSSEVASPKDVQKVPPELREEWSWHPFEAFMD----------- 553
Query: 648 WGTSTSSRNSSVVDLLEE-EFTKNITENENR---SADQDTISELDWDVLHDIESSEEYER 703
TS ++R + + L + E N T E + + D+ W+ ++ +R
Sbjct: 554 --TSPNNRIARSYEFLAKVEGHWNYTPGEAMKFGAVNDDSFVYEIWEEERYLQMMNSKKR 611
Query: 704 LEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGA 763
E E+L+ R+ +W+D+Y++A++++R K + +ERDEGEL RTGQP+CIYE Y G G
Sbjct: 612 REDEELKSRVLQYRGTWEDVYKSAKRADRSKNDLHERDEGELLRTGQPLCIYEPYFGEGT 671
Query: 764 WPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDN 823
W FLH LYRG+ LS RR R DDVDA SRL L N +YRD L + G F+I+NK+D
Sbjct: 672 WSFLHQDPLYRGVGLSVKGRRPRMDDVDASSRLPLFNNPYYRDALGDFGAFFAISNKIDR 731
Query: 824 IHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNND 882
+HK WIGFQSWRA RK SLS AE L +Q GD +YFW +D D N
Sbjct: 732 LHKNSWIGFQSWRATARKESLSKIAEDALLNAIQTRKHGDALYFWVRMDKDP----RNPL 787
Query: 883 VLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSF 942
FWS CD +N G+CR A+ + ++MY + ++++LPPMPEDG WS + W + T SF
Sbjct: 788 QKPFWSFCDAINAGNCRFAYNETLKKMYSI-KNLDSLPPMPEDGDTWSVMQSWALPTRSF 846
Query: 943 LEFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKM 1001
LEF+MFSRMFVDSLDA + + N C LS + + KHCY RVLELLVNVWAYHS R++
Sbjct: 847 LEFVMFSRMFVDSLDAQIYEEHHRTNRCYLSLT--KDKHCYSRVLELLVNVWAYHSARRI 904
Query: 1002 VYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTG 1061
VY+DP +G +QEQH + RRG MW+K+F++T LK+MDEDLAE A D D WLWPWTG
Sbjct: 905 VYIDPETGLMQEQHKQKNRRGKMWVKWFDYTTLKTMDEDLAEEA-DSDRRVGHWLWPWTG 963
Query: 1062 EVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
E+ W+G E+E++++ +K +KK+K ++K+ ++ RQK +G
Sbjct: 964 EIVWRGTLEKEKQKKNLEKEEKKKKSRDKLSRMRSRSGRQKVIG 1007
>gi|255541752|ref|XP_002511940.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223549120|gb|EEF50609.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 935
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/901 (39%), Positives = 522/901 (57%), Gaps = 78/901 (8%)
Query: 213 RSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQI 272
R+ RFG R P++ L+ ++ D + LL++TV LQ++GY ++++V G H +W++I
Sbjct: 82 RTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSVNDGPVHDIWKRI 141
Query: 273 AGQISILGQEQYSL---IDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLA 329
++I Q + + +DW IFD II +SLEAK MQEPF SIPL+W I E +L
Sbjct: 142 GVPVTIF-QTNHKMEIAVDWLIFDSIIVNSLEAKVVFPCFMQEPFKSIPLIWTIHEKTLG 200
Query: 330 NRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAV 389
R Y+ G L+S WK VF+R V+VFP++ LPM+YS DA N++VIPGSPA+VW
Sbjct: 201 IRSRQYISNGQIELVSDWKRVFNRATVVVFPNHVLPMMYSAFDAENYYVIPGSPAEVWEA 260
Query: 390 EAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNS 449
EA + + K +R + G+ D+I++ +VGS F Y L ++A+ + + PL ++ ++
Sbjct: 261 EAMAAVY-KDSIRMKMGYRPDDIIIAIVGSQFLYRGLWLEHALILQALSPLFSDFSFDDN 319
Query: 450 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 509
K + L GNST Y+ A++ +A L +V+H +GDV L ADIV YGS
Sbjct: 320 SNPHLKIIVLSGNSTSNYSVAIEAIAINLHYPIGAVKHIAIDGDVGSFLTAADIVTYGSF 379
Query: 510 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG 569
Q FP ++++AM P+I PD +I++YV + F K+N L++ IS G
Sbjct: 380 HDGQSFPEMLMKAMCMEKPIIAPDLSVIRKYVDDRVNGYIFPKENIRVLTQIILQVISKG 439
Query: 570 KLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSW 626
KLS AR +AS G+ AKN++ + V YA +LE+++ PS+ P ++Q+ + W
Sbjct: 440 KLSPLARNIASIGKGTAKNLMVAEAVEGYASLLESIIKLPSEVAPPKAVAQIPPKLKEEW 499
Query: 627 EWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQDTISE 686
W+LF ++ D TSSR + +EE++ + E + A D
Sbjct: 500 SWHLFEAFLNSTYED-------RVLTSSR---FLIKVEEQWNHSQREISSSIASNDESFS 549
Query: 687 LD-WDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGEL 745
D W+ +I+ +R E ++L++R D +W+++YR+ ++++R + + +ERDEGEL
Sbjct: 550 YDIWEEEKNIQILNAKKRREEQELKDRTDQPHGTWEEVYRSTKRADRTRNDLHERDEGEL 609
Query: 746 ERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYR 805
ERTGQP+CIYE Y G W FLH SLYRG+ LS+ RR R+DDVDA SRL LL+ +YR
Sbjct: 610 ERTGQPLCIYEPYLGEATWSFLHSSSLYRGVGLSTKGRRPRTDDVDAPSRLPLLSSPYYR 669
Query: 806 DILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVM 864
D L E G F+IAN++D IHK WIGFQSWRA RK SLS +AE+ L ++ GD +
Sbjct: 670 DALGEYGAFFAIANRIDRIHKNAWIGFQSWRATARKASLSRTAERALLNAIETRRHGDTL 729
Query: 865 YFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPE 924
YFW +D D N FWS CD +N G+C+ + Q+Y L +
Sbjct: 730 YFWVRMDTDP----RNRLQQDFWSFCDTINAGNCKIFVDALDAQIYDLHH---------Q 776
Query: 925 DGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCR 984
+G C+ +L + KHCY R
Sbjct: 777 NGHCYLSL-------------------------------------------TKDKHCYSR 793
Query: 985 VLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEA 1044
VLELL+NVWAYHS R+MVY++P +G +QEQH I+ RRG MW+K+F++T LKSMDEDLAE
Sbjct: 794 VLELLINVWAYHSARQMVYVNPETGLMQEQHRIKSRRGKMWVKWFSYTTLKSMDEDLAEE 853
Query: 1045 ADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTL 1104
A D D+P+ +WLWP TGEV W+G++E+ER R +QK ++++ K+K+ R+ +RQK +
Sbjct: 854 A-DSDHPKRRWLWPSTGEVFWQGVFEKERNLRNQQKERRRQQSKDKL-KRMKSKHRQKVI 911
Query: 1105 G 1105
G
Sbjct: 912 G 912
>gi|224127812|ref|XP_002320170.1| predicted protein [Populus trichocarpa]
gi|222860943|gb|EEE98485.1| predicted protein [Populus trichocarpa]
Length = 990
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/1006 (36%), Positives = 560/1006 (55%), Gaps = 112/1006 (11%)
Query: 137 RKGFSSLFPFRGAYLLYFMIFLAVFAF--AMASMVLQNSIA--SVFGAE--RGRPIREE- 189
R FS F+ + ++ +AVF F + M L S+ S G+ RG + ++
Sbjct: 37 RSSFSRFLRFKKLDYIQWICTVAVFLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVDKD 96
Query: 190 ---------LRFGSRLKFVPDQV-------------GFGNGLDGLRSTPRFGVRPPRIGL 227
L FG +KF P ++ F N R+ RF R P++ L
Sbjct: 97 LWYLKEIGGLDFGEDIKFQPSKILQHFRKENREMNMSFSN-----RTLSRFPYRKPQLAL 151
Query: 228 ILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISIL--GQEQYS 285
+ ++ D LL++TV LQ++GY +Y++ G + S+W+ + ++I+ +
Sbjct: 152 VFADLLVDPHQLLMVTVATALQEIGYTIHVYSLGDGPAQSIWKSMRSPVNIIQISHKMEI 211
Query: 286 LIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLS 345
+DW +DGI+ +SLE K S MQEPF S+PL+W I E +LA Y LL
Sbjct: 212 AVDWLNYDGILVNSLETKSVFSCFMQEPFKSVPLIWTINERTLATHSRQYTSSWQIELLY 271
Query: 346 YWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKEN 405
W+ F+R V+VFP++ LPM+YS D GN++VIPGSPAD+W E + ++ +
Sbjct: 272 DWRKAFNRATVVVFPNHVLPMMYSAFDTGNYYVIPGSPADIWETETTMALYND-EIHVKM 330
Query: 406 GFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTD 465
G+ D+IV+ +VGS F Y L ++A+ + + PL +++ N+ + K + L G+ T
Sbjct: 331 GYEPDDIVIAIVGSQFLYRGLWLEHALVLKALLPLFAEFSLDNNSKSHLKIIILSGDPTG 390
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
Y+ A++ +A+ L +V+H+ + DV L AD+V+YGS EQ FP ++V+AM+
Sbjct: 391 NYSVAVEAIAANLSYPRGTVKHFAVDDDVGSPLGAADLVIYGSFLEEQSFPEILVKAMSI 450
Query: 526 GIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLH 585
G P+ITPD +I++YV + F K+N + L++ IS G LS AR +AS G+
Sbjct: 451 GKPIITPDLSMIRKYVDDRVNGYLFPKENLKVLTQIVLQAISKGTLSPLARNIASMGKNT 510
Query: 586 AKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDI 642
AKN++ L+ V YA +LENV+ PS+ P +S++ + W W+LF+ ++ D
Sbjct: 511 AKNLMVLETVEGYATLLENVVELPSEVTPPKAVSEIPPKLKKEWCWHLFKAFMNSTHED- 569
Query: 643 LNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENR-SADQDTISELDWDVLHDIESSEEY 701
+ ++S + +EE++ E+ +A D+ S W+ +I
Sbjct: 570 ---------RTLKSSRYLKTVEEQWNYMQKESSGSIAATNDSFSYDIWEEERNIMMLNTR 620
Query: 702 ERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGS 761
+R E E+L++R D +W+D+Y++A++++R + + +ERDEGEL RTGQP+CIYE Y G
Sbjct: 621 KRREEEELKDRTDQPHGTWEDVYKSAKRADRSRNDLHERDEGELLRTGQPLCIYEPYFGE 680
Query: 762 GAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKV 821
G W FLH SLYRG+ LS+ RR R+DD+DA SRL LL+ +YRD L E G F+IAN++
Sbjct: 681 GTWSFLHQSSLYRGIGLSTKGRRPRTDDIDAPSRLSLLSKPYYRDALGEYGAFFAIANRI 740
Query: 822 DNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQETE-GDVMYFWAHLDMDGGFTRNN 880
D IHK WIGFQSWRA RK SLS AEK L + ++ + GD +YFW +DMD
Sbjct: 741 DRIHKNSWIGFQSWRATARKASLSRIAEKALVDAIESRKHGDSLYFWVRMDMDP-----R 795
Query: 881 NDVLT-FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQT 939
ND+ + FWS CD +N G+C+ + QMY H ++ G C+ +L
Sbjct: 796 NDLQSDFWSFCDAINAGNCKMFVDALDAQMYD--EHHQS-------GRCYLSL------- 839
Query: 940 PSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGR 999
+ KHCY RVLELL+NVWAYHS R
Sbjct: 840 ------------------------------------AKDKHCYSRVLELLINVWAYHSAR 863
Query: 1000 KMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPW 1059
+MVY++P +G +QEQH ++ RRG +W+K+F++++LKSMDEDLAE A D D P+ +WLWP
Sbjct: 864 QMVYVNPETGLMQEQHAVKSRRGNIWVKWFSYSILKSMDEDLAEEA-DSDQPKRRWLWPS 922
Query: 1060 TGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
TGEV W+G++E+ER R QK ++++ K+K R+ K RQ LG
Sbjct: 923 TGEVVWQGLFEKERNLRNHQKEKRRQQSKDKQ-QRMRKKRRQPVLG 967
>gi|224064104|ref|XP_002301386.1| predicted protein [Populus trichocarpa]
gi|222843112|gb|EEE80659.1| predicted protein [Populus trichocarpa]
Length = 990
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/937 (37%), Positives = 536/937 (57%), Gaps = 96/937 (10%)
Query: 190 LRFGSRLKFVPDQV-------------GFGNGLDGLRSTPRFGVRPPRIGLILGNMAKDS 236
L FG +KF P ++ F NG + RF R P++ L+ ++ D
Sbjct: 106 LDFGEDIKFEPSKILQKFRKENREMNMPFTNG-----TLSRFPYRKPQLALVFADLLVDP 160
Query: 237 RSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSL---IDWSIFD 293
+ LL++TV LQ++GY +Y +R G ++W+ + ++I+ Q + L +DW +D
Sbjct: 161 QQLLMVTVATALQEIGYTIHVYTLRDGPVQNIWKSMGYPVTII-QMSHKLEIAVDWLNYD 219
Query: 294 GIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSR 353
GI+ +SLE + IS MQEPF S+PL+W I E +LA R Y LL+ W+ F+R
Sbjct: 220 GILVNSLETRSVISCFMQEPFKSVPLIWTIHERALAIRSRQYTSSWQIELLNDWRKAFNR 279
Query: 354 VNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIV 413
V+VFP++ LPM+YS DAGN++VIPGSPA+VW + + +R + G+ +IV
Sbjct: 280 ATVVVFPNHVLPMMYSAFDAGNYYVIPGSPAEVWEADTTMALYND-DIRVKMGYEPTDIV 338
Query: 414 VVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQE 473
+ VVGS F Y L ++A+ + + PLL + ++ K + L G+ST Y+ A++
Sbjct: 339 IAVVGSQFLYRGLWLEHALVLKALLPLLQDFPLDSNSISHLKIIVLSGDSTGNYSAAVEA 398
Query: 474 VASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPD 533
+A L +V+H+ +GDV+ L D+V+YGS EQ FP +VRAM+ G P+I PD
Sbjct: 399 IAVNLSYPRGTVKHFAVDGDVSSALSAVDLVIYGSFLEEQSFPEFLVRAMSIGKPIIAPD 458
Query: 534 FPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALD 593
+I +YV + F K+N + L++ IS G LS AR +AS G+ AKN++ L+
Sbjct: 459 LSMIGKYVDDRVNGYLFPKENLKALTQIVLQAISKGTLSPLARNIASIGKSTAKNLMVLE 518
Query: 594 CVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGT 650
+ YA +LENVL PS+ LP + ++ + W WNLF+ ++ D+
Sbjct: 519 TIEGYATLLENVLKLPSEVALPKAVPEIPPKLKKEWCWNLFKAFLNSTHEDV-------- 570
Query: 651 STSSRNSSVVDLLEEEFTKNITENENR-SADQDTISELDWDVLHDIESSEEYERLEMEQL 709
+ ++S ++ +EE++ E+ +A D+ S W+ +I +R E E+L
Sbjct: 571 --TLKSSRYLNKVEEQWNHEQGESTGSIAATDDSFSYDIWEEEKNILMLNTRKRREEEEL 628
Query: 710 EERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHH 769
++R D +W+++YR+A++++R + + +ERDEGEL RTGQP+CIYE Y G G W FLH
Sbjct: 629 KDRTDQPRGTWEEVYRSAKRADRSRNDLHERDEGELLRTGQPLCIYEPYFGEGTWSFLHL 688
Query: 770 GSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPW 829
SLYRG+ LS+ RR R+DD+DA SRL LL+ ++YRD L + G F+IAN++D IHK W
Sbjct: 689 SSLYRGIGLSTKGRRPRTDDIDAPSRLSLLSNSYYRDALGDYGAFFAIANRIDRIHKNSW 748
Query: 830 IGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWS 888
IGFQSWRA RK SLS AEK L + ++ +T D +YFW +DMD + FWS
Sbjct: 749 IGFQSWRATARKASLSRIAEKALIDAIETQTHRDAVYFWVPMDMDPRSHLRRD----FWS 804
Query: 889 MCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMF 948
CD +N G+C+ + QMY H ++ G C+
Sbjct: 805 FCDAINAGNCKMFVDALDAQMYD--EHHQS-------GRCY------------------- 836
Query: 949 SRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLS 1008
S + KHCY RVLELL+NVWAYHS R+MVY++P +
Sbjct: 837 ------------------------LSPAKDKHCYSRVLELLINVWAYHSARQMVYVNPET 872
Query: 1009 GSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGI 1068
G ++EQH ++ RRG MW+++F++++LKSMDEDLAE A D D P+ +WLWP TGEV W+G+
Sbjct: 873 GLMKEQHTVKSRRGKMWVRWFSYSVLKSMDEDLAEEA-DSDRPKRRWLWPSTGEVVWEGV 931
Query: 1069 YEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
YE+ER R QK ++++ K+K R+ K +RQK LG
Sbjct: 932 YEKERNLRNHQKEKRRQQSKDKQ-QRMRKKHRQKVLG 967
>gi|297810039|ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp.
lyrata]
gi|297318740|gb|EFH49162.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp.
lyrata]
Length = 1018
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/938 (38%), Positives = 544/938 (57%), Gaps = 68/938 (7%)
Query: 190 LRFGSRLKFVPDQVGF-----GNGLDGLRST-----PRFGVRPPRIGLILGNMAKDSRSL 239
L FG ++F P ++ NGL+ S+ RFG R P++ L+ ++ D +
Sbjct: 103 LDFGDDVRFEPTKLLMKFQREANGLNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQV 162
Query: 240 LLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYS--LIDWSIFDGIIA 297
L++++ K LQ++GY ++Y++ G +S+W ++ ++IL S +IDW +DGII
Sbjct: 163 LMVSLSKALQEIGYAIEVYSLEDGPVNSIWRKMGVPVTILKTNHASSCVIDWLSYDGIIV 222
Query: 298 DSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVI 357
+SL AK + MQEPF S+PL+W+I E++LA R Y G LL+ WK +FSR +V+
Sbjct: 223 NSLRAKSMFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSIGQTELLNDWKKIFSRASVV 282
Query: 358 VFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVV 417
VF +Y LP+LY+ DAGNF+VIPGSP DVW + +K D++V+ +V
Sbjct: 283 VFHNYLLPILYTEFDAGNFYVIPGSPEDVWKAKNLEFPPQK-----------DDVVISIV 331
Query: 418 GSSFFYNELSWDYAVAMHDVGPLLI-KYARRNSVEGSFKFVFLCGNSTDGYNDALQEVAS 476
GS F Y ++A+ + + PL Y ++ K + L G S Y+ A++ ++
Sbjct: 332 GSQFLYKGQWLEHALLLQALRPLFPGNYLESDT--SHLKIIVLGGESASNYSVAIETISQ 389
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
L + +V+H G+V+ +L +D+V+YGS EQ FP ++++AM+ G P++ PD
Sbjct: 390 NLTYPKDAVKHVSIAGNVDKILESSDLVIYGSFLEEQSFPEILMKAMSLGKPIVAPDLFN 449
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT 596
I+++V + F K + + LS+ I+ GK+S A+ +A G+ KNM+A + +
Sbjct: 450 IRKHVDDRVTGYLFPKQDLKVLSQIVLEVITEGKISSLAQKIAMMGKTTVKNMMARETIE 509
Query: 597 RYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTS 653
YA +L+N+L F S+ P ++ + W W+ F +D TS +
Sbjct: 510 GYAALLDNMLKFSSEVASPKDAQKVPPELREEWSWHPFEAFLD-------------TSPN 556
Query: 654 SRNSSVVDLLEE-EFTKNITENEN---RSADQDTISELDWDVLHDIESSEEYERLEMEQL 709
+R + + L + E N T E+ + + D+ W+ ++ +R E E+L
Sbjct: 557 NRTARSYEFLAKVEGHWNHTPGESIIFGAVNDDSFVYEIWEEERYLQMMNSKKRREDEEL 616
Query: 710 EERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHH 769
+ R +W+++Y++A++++R K + +ERDEGEL RTGQP+CIYE Y G G W FLH
Sbjct: 617 KSRALQYRGTWEEVYKSAKRADRSKNDLHERDEGELLRTGQPLCIYEPYYGEGTWSFLHQ 676
Query: 770 GSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPW 829
LYRG+ LS RR R DDVDA SRL L N +YRD L + G F+I+NK+D +HK W
Sbjct: 677 YPLYRGVGLSVKGRRPRMDDVDASSRLPLFNNPYYRDALGDFGAFFAISNKIDRLHKNSW 736
Query: 830 IGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWS 888
IGFQSWRA RK SLS AE L +Q GD +YFW +D D N FWS
Sbjct: 737 IGFQSWRATARKESLSKIAENALLNAIQTRKHGDALYFWVRMDKDP----RNPLQKPFWS 792
Query: 889 MCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMF 948
CD +N G+C ++++LPPMPED WS + W + T SFLEF+MF
Sbjct: 793 FCDAINAGNCSV-------------KNLDSLPPMPEDEDTWSVMQSWTLPTRSFLEFVMF 839
Query: 949 SRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPL 1007
SRMFVDSLDA + + N C LS + + KHCY RVLELLVNVWAYHS R++VY+DP
Sbjct: 840 SRMFVDSLDAQIYEEHHRTNRCYLSLT--KDKHCYSRVLELLVNVWAYHSARRIVYIDPG 897
Query: 1008 SGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKG 1067
+G +QEQH + RRG MW+K+F++T LK+MDEDLAE A D D WLWPWTGE+ W+G
Sbjct: 898 TGLMQEQHKQKNRRGKMWVKWFDYTTLKTMDEDLAEEA-DSDRRVGHWLWPWTGEIVWRG 956
Query: 1068 IYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
E+E++++ +K +KK+K ++K+ ++ RQK +G
Sbjct: 957 TLEKEKQKKNLEKEEKKKKSRDKLTRMRSRNGRQKVIG 994
>gi|168060741|ref|XP_001782352.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666144|gb|EDQ52806.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 869
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 347/919 (37%), Positives = 521/919 (56%), Gaps = 88/919 (9%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQI 276
RF VRPPR+ L+ N+ ++ SL ++T+ LQ+LGY +++ +R G WE + +
Sbjct: 2 RFPVRPPRLALVCPNLVRNPLSLYVLTLATGLQELGYDLEVHTLRDGELRPAWEALGVPV 61
Query: 277 SIL--GQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPV 334
L + + +DW ++G+I+ S+ K I+SL QEPF +IP+VW+I EDSL RL
Sbjct: 62 RQLRINVQNVTSVDWLNYEGVISFSVNTKAVINSLAQEPFRNIPVVWVINEDSLGRRLED 121
Query: 335 YVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSK 394
Y + L+S W+ F R +V+VF DY L M+Y++LD GNFFVIPGSP +VW E Y
Sbjct: 122 YGSAAAEGLISDWRHSFKRADVVVFQDYALTMVYTMLDTGNFFVIPGSPLEVWTAENYKI 181
Query: 395 SHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSF 454
+H + LR + G D V+ VVGS F Y+ L ++A+ M + + + ++
Sbjct: 182 THSRDSLRAQYGVHHDSSVITVVGSPFLYHGLWREHALVMRSLARSVSTFGDASTKGDRL 241
Query: 455 KFVFLCG--NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVE 512
+F+ G N + Y +LQ +A LGL +VR+ G DV GV+ M+D V+YGS + E
Sbjct: 242 IQLFIIGHGNQSSSYGASLQIMAEHLGLKNGTVRYVGAGEDVTGVVWMSDAVVYGSFRNE 301
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
Q FPS++ AM+ PVI P+ + +E + +G I F + LS A + N
Sbjct: 302 QSFPSILSLAMSLQRPVIIPNRRVFREQIVDGENGILFPVGDDVKLSEAMVRVLRN---K 358
Query: 573 KFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQV---SWEWN 629
A +A +G++ A+NM AL+ Y +LE+VL FP +A LP + + ++ W W+
Sbjct: 359 AEAGALALSGQMKAQNMCALNVSLSYGELLESVLEFPVEAELPRRLDESEKSLKEGWRWD 418
Query: 630 LF-RKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFT--KNITENENRSADQDTISE 686
LF L M + S++++LEE++ + +N + S D+++E
Sbjct: 419 LFFPANAPLWEYFQSRMQGPMVANDDIGGSIIEVLEEQWNVHSGVVQNFSDSMGFDSLTE 478
Query: 687 LDWDVLHDIESSEEYERLEMEQLEERMDGTF--------------ASWDDIYRNARKSER 732
D ++ +D + +++ E + +E ++ ++W+DI + +K+E
Sbjct: 479 -DNNIANDFVT--DFDLHEAKAIENELEAERIEREELLERKILLQSTWEDISKMVKKTEV 535
Query: 733 FKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLH-HGSLYRGLALSSAARRLRSDDVD 791
K + ER + E++R+GQP+CIYE Y G+GAWPFLH +LYR ++LS+ ARR DDVD
Sbjct: 536 HKDDLKERSDAEIQRSGQPLCIYEPYHGTGAWPFLHDENTLYRAISLSTHARRPTDDDVD 595
Query: 792 AVSRL-HLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEK 850
A RL LLN ++YR++LCE G F+IAN++D +HK PWIGFQ WRA+ R V+L+ +AE
Sbjct: 596 AQERLPGLLNDSYYRNVLCEFGTFFAIANRIDVVHKNPWIGFQPWRASSRNVALTAAAEA 655
Query: 851 VLEETVQE-TEGDVMYFWAHLDMDGGFTRNNNDV---LTFWSMCDILNGGHCRTAFEDAF 906
L E V +GD +YFWA D + G + + V + FW+ CD +N G CR F DA
Sbjct: 656 ALIEAVSTGRDGDAVYFWARTDGNAGTSGEGSFVEPAVNFWTYCDSVNSGQCRI-FVDAL 714
Query: 907 RQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKV 966
Y H +
Sbjct: 715 DAQYYEEHHY-------------------------------------------------L 725
Query: 967 NSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWM 1026
C +S +++HCYCR+LE LVNVWAYHS R M+Y++P +G ++EQH +E R G MW+
Sbjct: 726 GKCRFAS--FQREHCYCRLLEGLVNVWAYHSARSMIYVNPKTGEMEEQHALESRLGNMWV 783
Query: 1027 KYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRK 1086
K+F+ + LK+MDEDLAEAADDGD+PR +WLWP TGEV W+GI+ERE+EER R K++KK+K
Sbjct: 784 KFFSLSTLKTMDEDLAEAADDGDHPRARWLWPQTGEVFWQGIFEREKEERLRLKLEKKKK 843
Query: 1087 MKEKMFDRLTKGYRQKTLG 1105
+KEK+ K YRQK+LG
Sbjct: 844 LKEKLARMRNKTYRQKSLG 862
>gi|7267618|emb|CAB80930.1| hypothetical protein [Arabidopsis thaliana]
Length = 963
Score = 568 bits (1463), Expect = e-159, Method: Compositional matrix adjust.
Identities = 348/995 (34%), Positives = 525/995 (52%), Gaps = 117/995 (11%)
Query: 136 NRKGFSSLFPFRG-AYLLYFMIFLAVFAFA------MASMVLQNSIASVFGAERGRP--- 185
NR S F + YLL+ I F FA + +V+ S E P
Sbjct: 37 NRSRLSRFFLLKSFNYLLWISIICVFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLV 96
Query: 186 -IREE--LRFGSRLKFVPDQV--GFGNGLDGLRSTPRFGVRPPRIGLILGNMAKDSRSLL 240
RE+ L FG ++ P ++ F G T + G++ D +L
Sbjct: 97 GFREKGFLDFGDDVRIEPTKLLMKFQRDAHGFNFTSS----------VFGDLLADPEQVL 146
Query: 241 LITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISIL--GQEQYSLIDWSIFDGIIAD 298
++++ K LQ++GY ++Y++ G +S+W+++ ++IL QE +IDW +DGII +
Sbjct: 147 MVSLSKALQEVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSYDGIIVN 206
Query: 299 SLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIV 358
SL A+ + MQEPF S+PL+W+I E++LA R Y G LL+ WK +FSR +V+V
Sbjct: 207 SLRARSMFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVV 266
Query: 359 FPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVG 418
F +Y LP+LY+ DAGNF+VIPGSP +V + +K D++V+ +VG
Sbjct: 267 FHNYLLPILYTEFDAGNFYVIPGSPEEVCKAKNLEFPPQK-----------DDVVISIVG 315
Query: 419 SSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRL 478
S F Y ++A+ + + PL S K + L G + Y+ A++ ++ L
Sbjct: 316 SQFLYKGQWLEHALLLQALRPLF-SGNYLESDNSHLKIIVLGGETASNYSVAIETISQNL 374
Query: 479 GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
+ +V+H G+V+ +L +D+V+YGS EQ FP ++++AM+ G P++ PD I+
Sbjct: 375 TYPKEAVKHVRVAGNVDKILESSDLVIYGSFLEEQSFPEILMKAMSLGKPIVAPDLFNIR 434
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRY 598
+YV + F K N + LS+ I+ GK+S A+ +A G+ KNM+A + + Y
Sbjct: 435 KYVDDRVTGYLFPKQNLKVLSQVVLEVITEGKISPLAQKIAMMGKTTVKNMMARETIEGY 494
Query: 599 ARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSR 655
A +LEN+L F S+ P + ++ + W W+ F +D TS ++R
Sbjct: 495 AALLENMLKFSSEVASPKDVQKVPPELREEWSWHPFEAFMD-------------TSPNNR 541
Query: 656 NSSVVDLLEE-EFTKNITENENR---SADQDTISELDWDVLHDIESSEEYERLEMEQLEE 711
+ + L + E N T E + + D+ W+ ++ +R E E+L+
Sbjct: 542 IARSYEFLAKVEGHWNYTPGEAMKFGAVNDDSFVYEIWEEERYLQMMNSKKRREDEELKS 601
Query: 712 RMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGS 771
R+ +W+D+Y++A++++R K + +ERDEGEL RTGQP+CIYE Y G G W FLH
Sbjct: 602 RVLQYRGTWEDVYKSAKRADRSKNDLHERDEGELLRTGQPLCIYEPYFGEGTWSFLHQDP 661
Query: 772 LYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIG 831
LYRG+ LS RR R DDVDA SRL L N +YRD L + G F+I+NK+D +HK WIG
Sbjct: 662 LYRGVGLSVKGRRPRMDDVDASSRLPLFNNPYYRDALGDFGAFFAISNKIDRLHKNSWIG 721
Query: 832 FQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMC 890
FQSWRA RK SLS AE L +Q GD +YFW +D D N FWS C
Sbjct: 722 FQSWRATARKESLSKIAEDALLNAIQTRKHGDALYFWVRMDKDP----RNPLQKPFWSFC 777
Query: 891 DILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSR 950
D +N G+CR + Q+Y
Sbjct: 778 DAINAGNCRMFVDSLDAQIY---------------------------------------- 797
Query: 951 MFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGS 1010
+ N C LS + + KHCY RVLELLVNVWAYHS R++VY+DP +G
Sbjct: 798 ----------EEHHRTNRCYLSLT--KDKHCYSRVLELLVNVWAYHSARRIVYIDPETGL 845
Query: 1011 LQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYE 1070
+QEQH + RRG MW+K+F++T LK+MDEDLAE A D D WLWPWTGE+ W+G E
Sbjct: 846 MQEQHKQKNRRGKMWVKWFDYTTLKTMDEDLAEEA-DSDRRVGHWLWPWTGEIVWRGTLE 904
Query: 1071 REREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
+E++++ +K +KK+K ++K+ ++ RQK +G
Sbjct: 905 KEKQKKNLEKEEKKKKSRDKLSRMRSRSGRQKVIG 939
>gi|414881398|tpg|DAA58529.1| TPA: hypothetical protein ZEAMMB73_570175 [Zea mays]
Length = 890
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 292/757 (38%), Positives = 443/757 (58%), Gaps = 40/757 (5%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFK-----------IYAVRSGNS 265
R GVR PR+ ++ G++ + L +++V L+ +GY K ++++ G
Sbjct: 146 RLGVRKPRLAMVFGDLYPSAMQLQMVSVASVLEAMGYEMKCMSLVVGLYVLVFSLEDGPC 205
Query: 266 HSLWEQIAGQISILGQEQYSLI--DWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWII 323
++W I +SIL ++ I DW +DGI+ +S+EA+ SSL+ EPF SIP++W +
Sbjct: 206 GNIWGAIGVPVSILPEDANLPISVDWLDYDGILVNSIEARPVFSSLLHEPFKSIPIIWTV 265
Query: 324 QEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSP 383
E SLA+R Y G L++ WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPG P
Sbjct: 266 HEYSLAHRAKEYNVSGMIQLINAWKDVFSRTNVIVFPNYILPVMYAAFDSGNYFVIPGPP 325
Query: 384 ADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIK 443
++ + V+++ ++ G + ++ +VG+SF Y + + A+ + + PLL +
Sbjct: 326 SEAFQVDSFIAKSYHEDVKISLGLNPKDFLIAIVGNSFSYGDNLMEEALVLQALSPLLQR 385
Query: 444 YARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADI 503
Y NS + K GN T+ + AL+ VA +G +V H D + +L AD+
Sbjct: 386 YRSENSAQSELKVKIFTGNITEKHRMALESVALSVGFPRGAVEHVAAE-DKDNLLGTADL 444
Query: 504 VLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
V+Y S EQ FPS++V+AM+ VI PD II++++ +G + F + N L++
Sbjct: 445 VIYYSCLEEQLFPSVLVQAMSLEKLVIAPDLAIIRKHINDGVNGLLFPRKNIGMLAQVLL 504
Query: 564 LFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLP-----GPI 618
+SN K+S + +ASAG+ HAKN++A + + YA +L+NV+ P+DAL P P+
Sbjct: 505 RAVSNSKVSLSGQKIASAGKAHAKNLMASETIEGYAMLLKNVVKLPTDALSPLTADEIPL 564
Query: 619 SQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRS 678
+ Q+ W+W+LF D+ ++ TS S ++ LE+E+ N EN + S
Sbjct: 565 ALKQE--WKWHLF--------DDVKHLHRVNTSLSGYK--ILQKLEQEWHSNQMENSSLS 612
Query: 679 ADQ--DTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFE 736
D S + W+ E + ++E ++L++R D +W+++YRN ++ ER K E
Sbjct: 613 TKNINDAFSAIAWEEQRVNEVMDIKRKMEEDELKDRNDQPHGTWEEVYRNVKRVERLKNE 672
Query: 737 ANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRL 796
+ERD+ ELERTGQP+CIYE + G G WPFLH SLYRG+ LSS RR +DD+DA SRL
Sbjct: 673 LHERDDKELERTGQPLCIYEPFFGEGTWPFLHQSSLYRGVGLSSKGRRPGADDIDASSRL 732
Query: 797 HLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETV 856
LLN +YRDIL E G F++AN++D IHK WIGFQSWRA RK +LS +AE + E +
Sbjct: 733 PLLNNVYYRDILGEFGAFFALANRIDRIHKNSWIGFQSWRATARKANLSTNAESAILEAI 792
Query: 857 Q-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYG--LP 913
Q + D YFW +D D N + FWS CD++N G+CR A +AF++MYG L
Sbjct: 793 QSQKHRDSFYFWVRMDQDPRNHANKD----FWSFCDVINAGNCRLAVLEAFQRMYGVHLD 848
Query: 914 SHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSR 950
++ L MP DG WS + WV+ T SFLEF+MFSR
Sbjct: 849 RDLDTLLHMPNDGDTWSVMQSWVLPTRSFLEFVMFSR 885
>gi|168009211|ref|XP_001757299.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691422|gb|EDQ77784.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 874
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 327/897 (36%), Positives = 473/897 (52%), Gaps = 90/897 (10%)
Query: 223 PRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISIL--G 280
P I N+A S SL ++++ L+++GY ++ ++ G S WE + + +L
Sbjct: 3 PLFSQICPNLALSSLSLFVLSIATGLREIGYHVEVLTLKDGEFRSTWETMGVAVKLLKAN 62
Query: 281 QEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGF 340
E + +DW ++G+IA SL K + SL QEPF +P++W+I +D+L RL Y
Sbjct: 63 SENVTTVDWLNYEGVIAVSLNTKSILDSLAQEPFTKVPVIWVITDDALGRRLGKYKAENS 122
Query: 341 QNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQ 400
L++ W F R +V+VFPDY LPM+Y+ +D GNFFVI GSP +VW E Y +H +
Sbjct: 123 TELITDWIRSFKRADVVVFPDYALPMIYTTMDTGNFFVISGSPREVWTAEEYKLTHSRDS 182
Query: 401 LRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHD-------VGPLLIKYARRNSVEGS 453
LR + D+ V+ VVG F YN ++A+ + +G K RR +
Sbjct: 183 LRAQFEVHPDDSVIAVVGGPFLYNWRWREHALVIRAFTRTVSMLGNASQKGGRRIQL--- 239
Query: 454 FKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
F+ GN + Y ALQ +A L L +VR+ G DV GV+ M+D V+YGS + EQ
Sbjct: 240 --FIIGHGNHSSSYGAALQVMADHLKLKNGTVRYVGAGEDVIGVVWMSDAVVYGSFRDEQ 297
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
FP ++ AM+ G PVI P+ +E +++G + F + LS A + + L
Sbjct: 298 AFPPILGLAMSLGRPVIAPNRRAFREQISDGESGVLFPVGDEMKLSEAMIRVLGDNTL-- 355
Query: 574 FARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQV---SWEWN- 629
A +A G+ A+NM A + Y +LE++L FP++A LP + + ++ W W
Sbjct: 356 -AVAIALTGQWKARNMHASNVSLGYGELLESILEFPAEAALPRRLDEAEKSLKGGWRWEM 414
Query: 630 LFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENEN---------RSAD 680
LF M + S V+D LE + +E+ N S D
Sbjct: 415 LFPANTSFQVSFQSRMGKRVRENDDVGSGVIDTLEVHWKLINSESRNFAEEIGSHASSVD 474
Query: 681 QDTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANER 740
+ S D D I+ E++E E+LE R + S + I + RK+E K E ER
Sbjct: 475 LEIPSHSDLDEAKAIQEDLVAEKIEREELEWRKEVVQTSLEYITKMIRKTEVRKDELKER 534
Query: 741 DEGELERTGQPVCIYEIYSGSGAWPFLHH-GSLYRGLALSSAARRLRSDDVDAVSRL-HL 798
+ E++R QP+ IYE Y G GAWPFLH +L+RGL+LS+ RR DDVDA RL L
Sbjct: 535 SDAEIQRNAQPLTIYEPYHGKGAWPFLHEENTLFRGLSLSTLDRRPGDDDVDAEVRLPGL 594
Query: 799 LNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE 858
LN T+YR++LCE G FSIAN++D IH+ PWIGFQ WRA+ + V+L+ +AE L + +
Sbjct: 595 LNETYYRNLLCEYGAFFSIANRIDLIHRNPWIGFQPWRASSKDVALTPAAEVALVKAIST 654
Query: 859 -TEGDVMYFWAHLDMDGGFTRNNNDVL---TFWSMCDILNGGHCRTAFEDAFRQMYGLPS 914
+GD +YFWA D G ++ V FW CD +N G
Sbjct: 655 GRDGDAVYFWARTDGHAGTWHRDSSVEHPEDFWGYCDSVNSG------------------ 696
Query: 915 HVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSS-KVNSCLLSS 973
RMFVD+LDA + C
Sbjct: 697 -----------------------------------RMFVDALDAQYYEEHLDLGRCPFVK 721
Query: 974 SELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTL 1033
E +HCYC +L+ LVNVWAYHS R+++Y+DP +G+++EQH ++ R G MW+K+ +
Sbjct: 722 FPREGQHCYCGLLDGLVNVWAYHSARRLIYVDPKTGTMEEQHALDSRHGKMWVKFLSRDT 781
Query: 1034 LKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEK 1090
LK MDEDLA+AADD D PRE+WLWP TGEV W+GI+ERER ER+R KM++ +K KEK
Sbjct: 782 LKGMDEDLAQAADDSDLPRERWLWPLTGEVFWEGIFERERLERWRLKMERNQKQKEK 838
>gi|414881399|tpg|DAA58530.1| TPA: hypothetical protein ZEAMMB73_570175 [Zea mays]
Length = 595
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 258/592 (43%), Positives = 371/592 (62%), Gaps = 27/592 (4%)
Query: 523 MTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAG 582
M+ VI PD II++++ +G + F + N L++ +SN K+S + +ASAG
Sbjct: 1 MSLEKLVIAPDLAIIRKHINDGVNGLLFPRKNIGMLAQVLLRAVSNSKVSLSGQKIASAG 60
Query: 583 RLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGT 639
+ HAKN++A + + YA +L+NV+ P+DAL P ++ + W+W+LF
Sbjct: 61 KAHAKNLMASETIEGYAMLLKNVVKLPTDALSPLTADEIPLALKQEWKWHLF-------- 112
Query: 640 GDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQ--DTISELDWDVLHDIES 697
D+ ++ TS S ++ LE+E+ N EN + S D S + W+ E
Sbjct: 113 DDVKHLHRVNTSLSG--YKILQKLEQEWHSNQMENSSLSTKNINDAFSAIAWEEQRVNEV 170
Query: 698 SEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEI 757
+ ++E ++L++R D +W+++YRN ++ ER K E +ERD+ ELERTGQP+CIYE
Sbjct: 171 MDIKRKMEEDELKDRNDQPHGTWEEVYRNVKRVERLKNELHERDDKELERTGQPLCIYEP 230
Query: 758 YSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSI 817
+ G G WPFLH SLYRG+ LSS RR +DD+DA SRL LLN +YRDIL E G F++
Sbjct: 231 FFGEGTWPFLHQSSLYRGVGLSSKGRRPGADDIDASSRLPLLNNVYYRDILGEFGAFFAL 290
Query: 818 ANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGF 876
AN++D IHK WIGFQSWRA RK +LS +AE + E +Q + D YFW +D D
Sbjct: 291 ANRIDRIHKNSWIGFQSWRATARKANLSTNAESAILEAIQSQKHRDSFYFWVRMDQDPRN 350
Query: 877 TRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYG--LPSHVEALPPMPEDGGCWSALHG 934
N + FWS CD++N G+CR A +AF++MYG L ++ L MP DG WS +
Sbjct: 351 HANKD----FWSFCDVINAGNCRLAVLEAFQRMYGVHLDRDLDTLLHMPNDGDTWSVMQS 406
Query: 935 WVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVW 993
WV+ T SFLEF+MFSRMFVD+LDA + + C+LS + +HCY R+LEL+VNVW
Sbjct: 407 WVLPTRSFLEFVMFSRMFVDALDAQMYDKHHQTGHCILSLH--KDQHCYSRLLELIVNVW 464
Query: 994 AYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPRE 1053
A+HS R+MVY++P +G +QEQH + RRG M +++F++ LKSMDE+LAE D D+P
Sbjct: 465 AFHSARRMVYVNPATGQMQEQHHLSGRRGQMSVQFFSYATLKSMDEELAEEF-DLDHPDR 523
Query: 1054 KWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
+WLWP TGEV W+G+YERER R +K +K++ +EK+ R+ RQKTLG
Sbjct: 524 RWLWPQTGEVFWQGLYERERSMRQHEKERRKQQSREKI-QRIKSRARQKTLG 574
>gi|242058065|ref|XP_002458178.1| hypothetical protein SORBIDRAFT_03g028300 [Sorghum bicolor]
gi|241930153|gb|EES03298.1| hypothetical protein SORBIDRAFT_03g028300 [Sorghum bicolor]
Length = 560
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 254/557 (45%), Positives = 357/557 (64%), Gaps = 27/557 (4%)
Query: 558 LSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLP-- 615
L++ +SNGK+S + +ASAG+ HAKN++A + + YA +LEN + P+DAL P
Sbjct: 2 LAQVLLRAVSNGKVSVSGQKIASAGKAHAKNLMASETIEGYAVLLENTVKIPTDALSPLT 61
Query: 616 -GPISQLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITEN 674
G I + W+W+LF D+ ++ TS S ++ LE+E+ N EN
Sbjct: 62 AGEIPLALKQEWKWHLFE--------DVKHLHRMNTSLSG--YKILQKLEQEWHSNQMEN 111
Query: 675 ENRSADQ--DTISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSER 732
+ S + D S + W+ E + + E ++L++R D +W+++YRN ++ ER
Sbjct: 112 SSLSTTKISDAFSAIAWEEQRANEVMDIKRKTEEDELKDRNDQPHGTWEEVYRNVKRVER 171
Query: 733 FKFEANERDEGELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDA 792
K E +ERD+ ELERTGQP+CIYE + G G WPFLH SLYRG+ LSS RR +DD+DA
Sbjct: 172 LKNELHERDDKELERTGQPLCIYEPFFGEGTWPFLHQSSLYRGVGLSSKGRRPGADDIDA 231
Query: 793 VSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVL 852
SRL LLN +YRDIL E G F+++N++D IHK WIGFQSWRA RK +LS +AE +
Sbjct: 232 SSRLPLLNNIYYRDILGEFGAFFALSNRIDRIHKNSWIGFQSWRATARKANLSKNAESAI 291
Query: 853 EETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYG 911
E VQ + GD YFW +D D N + FWS CD +N G+CR A +AF++MYG
Sbjct: 292 LEAVQSQKHGDSFYFWVRMDQDPRNHANKD----FWSFCDAINAGNCRLAVLEAFQRMYG 347
Query: 912 --LPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSKVNS 968
L +++L MP DG WS + WV+ T SFLEF+MFSRMFVD+LDA + +
Sbjct: 348 VHLDHDLDSLLHMPNDGDTWSVMQSWVLPTRSFLEFVMFSRMFVDALDAQMYDKHHQTGH 407
Query: 969 CLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKY 1028
C+LS + +HCY RVLEL+VNVWA+HS R+MVY++P +G++QEQH + RRG M +++
Sbjct: 408 CILSLH--KDQHCYSRVLELIVNVWAFHSARRMVYVNPETGAMQEQHHLSGRRGQMLVQF 465
Query: 1029 FNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMK 1088
F++T LKSMDE+LAE D D+P +WLWP TGEV W+G+YERER R ++K +K++ +
Sbjct: 466 FSYTTLKSMDEELAEEF-DLDHPDRRWLWPQTGEVFWQGLYERERSMRQQEKERRKQQSR 524
Query: 1089 EKMFDRLTKGYRQKTLG 1105
EK+ R+ RQKTLG
Sbjct: 525 EKI-QRIKNRARQKTLG 540
>gi|242040313|ref|XP_002467551.1| hypothetical protein SORBIDRAFT_01g030080 [Sorghum bicolor]
gi|241921405|gb|EER94549.1| hypothetical protein SORBIDRAFT_01g030080 [Sorghum bicolor]
Length = 286
Score = 369 bits (946), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 187/294 (63%), Positives = 228/294 (77%), Gaps = 11/294 (3%)
Query: 814 MFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEE-TVQETEGDVMYFWAHLDM 872
MF+IAN+VDN+HK PWIGFQSWRAAGRKVSLS SAEK LEE T E+ DV+Y+W+ +DM
Sbjct: 1 MFAIANRVDNVHKLPWIGFQSWRAAGRKVSLSESAEKTLEEITSGESNEDVVYYWSPMDM 60
Query: 873 DGGFTRNNNDVLTFWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSAL 932
D FW CD LN G+CR+ FEDAFR MYGLP +V ALPPMP DG WS L
Sbjct: 61 D--------QTSDFWLTCDSLNAGNCRSLFEDAFRTMYGLPENVMALPPMPSDGDHWSTL 112
Query: 933 HGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNV 992
H WVM TPSFL+FIMFSR+FVDSL +LN NS++ SC L +SE E++HCYCR+LE+LVNV
Sbjct: 113 HSWVMPTPSFLKFIMFSRIFVDSLHSLNVNSTEPTSCFLGASEPERRHCYCRILEVLVNV 172
Query: 993 WAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYP- 1051
WAYHSGRKMVYL+P +G +EQH +++R MW+K+F+FTLLKSMDEDLAE ADDG +P
Sbjct: 173 WAYHSGRKMVYLNPFTGDTREQHLLDKRNR-MWVKFFDFTLLKSMDEDLAEEADDGMHPG 231
Query: 1052 REKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
++WLWP TG+V W GI +REREE+Y +K+DKK K K K+ +R GY+QK LG
Sbjct: 232 NDQWLWPLTGQVFWPGIADREREEKYIKKLDKKLKNKVKLLERQKSGYKQKPLG 285
>gi|218188684|gb|EEC71111.1| hypothetical protein OsI_02908 [Oryza sativa Indica Group]
Length = 1096
Score = 342 bits (877), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 172/335 (51%), Positives = 223/335 (66%), Gaps = 12/335 (3%)
Query: 775 GLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSIANKVDNIHKRPWIGFQS 834
+A SS RR +DD+DA SRL LLN +YRDIL E G F++AN++D IHK WIGFQS
Sbjct: 749 AVAASSKGRRPGADDIDASSRLPLLNNGYYRDILGEFGAFFALANRIDRIHKNSWIGFQS 808
Query: 835 WRAAGRKVSLSISAEKVLEETVQ-ETEGDVMYFWAHLDMDGGFTRNNNDVLTFWSMCDIL 893
WR RK +LS AE L E +Q + GD YFW +D D N FWS CD +
Sbjct: 809 WRVTARKANLSKKAETALLEAIQTQKHGDAFYFWVRMDQD----ERNLASQDFWSFCDAI 864
Query: 894 NGGHCRTAFEDAFRQMYG--LPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRM 951
N G+CR+A AF++MYG L + LP MP DG WS + WV+ T SFLEF+MFSRM
Sbjct: 865 NAGNCRSAVLKAFQRMYGVQLDDDLHTLPLMPNDGDTWSVMQSWVLPTRSFLEFVMFSRM 924
Query: 952 FVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGS 1010
FVD+LDA + + C+LS +HCY RVLEL+VNVWA+HS R+MVY+DP +G+
Sbjct: 925 FVDALDAQMYDKHHETGHCILSLHR--DQHCYSRVLELIVNVWAFHSSRRMVYVDPETGA 982
Query: 1011 LQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYE 1070
+QEQH + RRG M +++F+ LKSMDEDLAE D+ D+P +WLWP TGEV W+G+YE
Sbjct: 983 MQEQHLLNGRRGQMSIQWFSLATLKSMDEDLAEEFDE-DHPDRRWLWPKTGEVFWQGVYE 1041
Query: 1071 REREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
RER R ++K +K++ K+K+ R+ K RQKTLG
Sbjct: 1042 RERNMRQQEKERRKQQSKDKI-QRIKKRARQKTLG 1075
Score = 296 bits (758), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 164/480 (34%), Positives = 267/480 (55%), Gaps = 37/480 (7%)
Query: 206 GNGLDGLRS-TPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGN 264
+GL L S RFGVR PR+ ++ G+++ + L +++V L+ +GY K++ + G
Sbjct: 143 ADGLAKLGSPVRRFGVRKPRLAMVFGDLSPGAMQLQMVSVASVLEAMGYEMKVFTFKDGP 202
Query: 265 SHSLWEQIAGQISILGQEQ--YSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWI 322
++W I + +L ++ + +DW +DGI+ +S+EA+ SSL+QEPF SIP++W
Sbjct: 203 CTNIWRTIGVTVDLLPEDTDLHISVDWLDYDGILVNSIEARPVFSSLLQEPFKSIPVIWN 262
Query: 323 IQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGS 382
+QE SLA+R+ Y G +L WK FSR NVIVFP+Y LP++Y+ D+GN+FVIPGS
Sbjct: 263 VQESSLAHRISEYNSSGMIQILDGWKEAFSRANVIVFPNYVLPVMYAAFDSGNYFVIPGS 322
Query: 383 PADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLI 442
PA + +KS+++ SF Y + A+ + VG LL
Sbjct: 323 PAVPFQDRISAKSYDQ--------------------DSFSYGGFLMEEALVLQAVGSLLQ 362
Query: 443 KYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMAD 502
+Y NS + K L N T+ + AL+ V+ +G +V H + D + +L ++D
Sbjct: 363 QYPSENSTQLELKVRILAENVTEKHRMALEAVSLNVGFPRGAVEHVA-SEDKDNLLGISD 421
Query: 503 IVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
+V+YGS EQ FPS++V+AM VI PD II++Y+ +G + F N L++
Sbjct: 422 LVIYGSCLNEQSFPSVLVQAMCLEKLVIAPDLEIIRKYIDDGMNALLFPSKNIGKLTQVL 481
Query: 563 SLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLP---GPIS 619
+SNGK+S + +ASAG++HAKN++A + + YA +LENV+ FP++ L P G I
Sbjct: 482 LQAVSNGKISVLGKKIASAGKVHAKNLMASETIEGYAVLLENVIKFPAEVLTPLTAGEIP 541
Query: 620 QLQQVSWEWNLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA 679
+ W+W+LF D+ ++ + + N ++ LEE++ N E+ +R A
Sbjct: 542 VALKQEWKWHLFE--------DVKHLYHMNGTLAGYN--ILQKLEEKWRSNQMEDHHRRA 591
>gi|413950656|gb|AFW83305.1| hypothetical protein ZEAMMB73_733472, partial [Zea mays]
Length = 428
Score = 301 bits (770), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 167/445 (37%), Positives = 259/445 (58%), Gaps = 24/445 (5%)
Query: 342 NLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQL 401
L++ WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPG P+ EA+S HE +
Sbjct: 3 QLINAWKDVFSRANVIVFPNYILPVMYAAFDSGNYFVIPGPPS-----EAFS-YHE--DV 54
Query: 402 RKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCG 461
R G + ++ +VG+ F Y + + A+ + V PLL +Y NS + K G
Sbjct: 55 RISLGLSPKDFLIAIVGTPFSYGDNLMEEALVLQAVSPLLRRYHSENSAQSELKVKIFTG 114
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
N T+ + AL+ VA +G + H D + +L AD+V+Y S EQ FPS++V
Sbjct: 115 NITEKHRMALESVALSVGFPRGAAEHVAAE-DKDILLRTADLVIYYSCLEEQLFPSVLVE 173
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA 581
AM+ VI PD II++++ +G + F + N L++ +SNGK+S + +ASA
Sbjct: 174 AMSLEKLVIAPDLGIIRKHIDDGVNGLLFPRKNVGMLAQVLLQAVSNGKVSISGQKIASA 233
Query: 582 GRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLG 638
G+ HAKN++A + + YA +LENV+ FP+DAL P + ++ + W+W+LF
Sbjct: 234 GKAHAKNLMASETIEGYAVLLENVVKFPTDALHPLTVDEIPLALKKEWKWHLFE------ 287
Query: 639 TGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSADQ--DTISELDWDVLHDIE 696
D+ ++ TS SS ++ LE+E+ +N EN + S + DT S + W+ E
Sbjct: 288 --DVKHLHHMNTSLSSYK--ILKKLEQEWHRNQMENSSLSTTKISDTFSAIAWEEQRANE 343
Query: 697 SSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYE 756
+ ++E ++L++R D +W+++YRN ++ +R K E +ERD+ ELERTGQP+CIYE
Sbjct: 344 VMDIKRKMEEDELKDRNDQPHGTWEEVYRNVKRVDRLKNELHERDDKELERTGQPLCIYE 403
Query: 757 IYSGSGAWPFLHHGSLYRGLALSSA 781
+ G G WPFLH SLYRG+ L S
Sbjct: 404 PFFGEGTWPFLHQSSLYRGVGLVSC 428
>gi|2191144|gb|AAB61031.1| A_IG002N01.24 gene product [Arabidopsis thaliana]
Length = 981
Score = 291 bits (746), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 172/504 (34%), Positives = 269/504 (53%), Gaps = 49/504 (9%)
Query: 362 YTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSF 421
Y + +LY+ DAGNF+VIPGSP +V + +K D++V+ +VGS F
Sbjct: 177 YAIEILYTEFDAGNFYVIPGSPEEVCKAKNLEFPPQK-----------DDVVISIVGSQF 225
Query: 422 FYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLL 481
Y ++A+ + + PL S K + L G + Y+ A++ ++ L
Sbjct: 226 LYKGQWLEHALLLQALRPLF-SGNYLESDNSHLKIIVLGGETASNYSVAIETISQNLTYP 284
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
+ +V+H G+V+ +L +D+V+YGS EQ FP ++++AM+ G P++ PD I++YV
Sbjct: 285 KEAVKHVRVAGNVDKILESSDLVIYGSFLEEQSFPEILMKAMSLGKPIVAPDLFNIRKYV 344
Query: 542 A--------EGAQVI---------FFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL 584
+G ++ F K N + LS+ I+ GK+S A+ +A G+
Sbjct: 345 CSFFPCMWLDGNVLLHVDDRVTGYLFPKQNLKVLSQVVLEVITEGKISPLAQKIAMMGKT 404
Query: 585 HAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQ---QVSWEWNLFRKEIDLGTGD 641
KNM+A + + YA +LEN+L F S+ P + ++ + W W+ F +D
Sbjct: 405 TVKNMMARETIEGYAALLENMLKFSSEVASPKDVQKVPPELREEWSWHPFEAFMD----- 459
Query: 642 ILNMDEWGTSTSSRNSSVVDLLEE-EFTKNITENENR---SADQDTISELDWDVLHDIES 697
TS ++R + + L + E N T E + + D+ W+ ++
Sbjct: 460 --------TSPNNRIARSYEFLAKVEGHWNYTPGEAMKFGAVNDDSFVYEIWEEERYLQM 511
Query: 698 SEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEGELERTGQPVCIYEI 757
+R E E+L+ R+ +W+D+Y++A++++R K + +ERDEGEL RTGQP+CIYE
Sbjct: 512 MNSKKRREDEELKSRVLQYRGTWEDVYKSAKRADRSKNDLHERDEGELLRTGQPLCIYEP 571
Query: 758 YSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTHYRDILCEIGGMFSI 817
Y G G W FLH LYRG+ LS RR R DDVDA SRL L N +YRD L + G F+I
Sbjct: 572 YFGEGTWSFLHQDPLYRGVGLSVKGRRPRMDDVDASSRLPLFNNPYYRDALGDFGAFFAI 631
Query: 818 ANKVDNIHKRPWIGFQSWRAAGRK 841
+NK+D +HK WIGFQSWRA RK
Sbjct: 632 SNKIDRLHKNSWIGFQSWRATARK 655
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/127 (54%), Positives = 96/127 (75%), Gaps = 1/127 (0%)
Query: 979 KHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMD 1038
KHCY RVLELLVNVWAYHS R++VY+DP +G +QEQH + RRG MW+K+F++T LK+MD
Sbjct: 657 KHCYSRVLELLVNVWAYHSARRIVYIDPETGLMQEQHKQKNRRGKMWVKWFDYTTLKTMD 716
Query: 1039 EDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKG 1098
EDLAE A D D WLWPWTGE+ W+G E+E++++ +K +KK+K ++K+ ++
Sbjct: 717 EDLAEEA-DSDRRVGHWLWPWTGEIVWRGTLEKEKQKKNLEKEEKKKKSRDKLSRMRSRS 775
Query: 1099 YRQKTLG 1105
RQK +G
Sbjct: 776 GRQKVIG 782
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 63/145 (43%), Gaps = 23/145 (15%)
Query: 136 NRKGFSSLFPFRG-AYLLYFMIFLAVFAFA------MASMVLQNSIASVFGAERGRP--- 185
NR S F + YLL+ I F FA + +V+ S E P
Sbjct: 37 NRSRLSRFFLLKSFNYLLWISIICVFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLV 96
Query: 186 -IREE--LRFGSRLKFVP---------DQVGFGNGLDGLRST-PRFGVRPPRIGLILGNM 232
RE+ L FG ++ P D GF L +T RFG R P++ L+ G++
Sbjct: 97 GFREKGFLDFGDDVRIEPTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDL 156
Query: 233 AKDSRSLLLITVVKNLQKLGYVFKI 257
D +L++++ K LQ++GY +I
Sbjct: 157 LADPEQVLMVSLSKALQEVGYAIEI 181
>gi|302796717|ref|XP_002980120.1| hypothetical protein SELMODRAFT_112065 [Selaginella moellendorffii]
gi|302820458|ref|XP_002991896.1| hypothetical protein SELMODRAFT_134458 [Selaginella moellendorffii]
gi|300140282|gb|EFJ07007.1| hypothetical protein SELMODRAFT_134458 [Selaginella moellendorffii]
gi|300152347|gb|EFJ18990.1| hypothetical protein SELMODRAFT_112065 [Selaginella moellendorffii]
Length = 240
Score = 239 bits (611), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 114/223 (51%), Positives = 160/223 (71%), Gaps = 7/223 (3%)
Query: 886 FWSMCDILNGGHCRTA--FEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFL 943
FW+ CD N G+CR F + F+ M+GL LP MP G WS+ H W M T SF+
Sbjct: 1 FWTSCDASNAGNCRYVRIFMETFKTMFGLNKDQLELPTMP--SGVWSSKHCWAMSTSSFV 58
Query: 944 EFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMV 1002
EF+MFSRMFVD+LD+ L +C L++++LE +HCYCR+LE+LVNVWAYHS R+++
Sbjct: 59 EFVMFSRMFVDALDSRLYIEHHDHGNCPLATTQLEAQHCYCRLLEVLVNVWAYHSARRLI 118
Query: 1003 YLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGE 1062
Y+DP +G + EQ+ +E RRG M +K+F+F++LK MDED+AE DD ++P +WLWP TGE
Sbjct: 119 YVDPETGIMMEQNALESRRGQMKVKWFSFSVLKGMDEDMAEKVDD-EHPTYRWLWPHTGE 177
Query: 1063 VHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
V W+GI ERER+ERY K+++KR+ KE++ R+ Y+QK+LG
Sbjct: 178 VFWQGILERERQERYNMKLERKRRNKERL-ARMRSRYKQKSLG 219
>gi|302796719|ref|XP_002980121.1| hypothetical protein SELMODRAFT_112125 [Selaginella moellendorffii]
gi|300152348|gb|EFJ18991.1| hypothetical protein SELMODRAFT_112125 [Selaginella moellendorffii]
Length = 240
Score = 239 bits (609), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 114/223 (51%), Positives = 159/223 (71%), Gaps = 7/223 (3%)
Query: 886 FWSMCDILNGGHCRTA--FEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFL 943
FW+ CD N G+CR F + F+ MYGL LP MP G WS+ H W M T SF+
Sbjct: 1 FWTSCDASNAGNCRYVRIFMETFKTMYGLNKDQLELPTMP--SGVWSSKHCWAMSTSSFV 58
Query: 944 EFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMV 1002
EF+MFSRMFVD+LD+ L +C L++++LE +HCYC +LE+LVNVWAYHS R+++
Sbjct: 59 EFVMFSRMFVDALDSRLYVEHHDHGNCPLATTQLEAQHCYCHLLEVLVNVWAYHSARRLI 118
Query: 1003 YLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGE 1062
Y+DP +G + EQ+ +E RRG M +K+F+F++LK MDED+AE DD ++P +WLWP TGE
Sbjct: 119 YVDPETGIMMEQNALESRRGQMKVKWFSFSVLKGMDEDMAEKVDD-EHPTYRWLWPHTGE 177
Query: 1063 VHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
V W+GI ERER+ERY K+++KR+ KE++ R+ Y+QK+LG
Sbjct: 178 VFWQGILERERQERYNMKLERKRRNKERL-ARMRSRYKQKSLG 219
>gi|302820462|ref|XP_002991898.1| hypothetical protein SELMODRAFT_134452 [Selaginella moellendorffii]
gi|300140284|gb|EFJ07009.1| hypothetical protein SELMODRAFT_134452 [Selaginella moellendorffii]
Length = 240
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 113/223 (50%), Positives = 160/223 (71%), Gaps = 7/223 (3%)
Query: 886 FWSMCDILNGGHCRTA--FEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFL 943
FW+ CD N G+CR F + F+ M+GL LP MP G WS+ H W M T SF+
Sbjct: 1 FWTSCDASNAGNCRYVRIFMETFKTMFGLNKDQLDLPTMP--SGVWSSKHCWAMSTSSFV 58
Query: 944 EFIMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMV 1002
+F+MFSRMFVD+LD+ L +C L++++LE +HCYCR+LE+LVNVWAYHS R+++
Sbjct: 59 KFVMFSRMFVDALDSRLYIEHHDHGNCPLATTQLEAQHCYCRLLEVLVNVWAYHSARRLI 118
Query: 1003 YLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGE 1062
Y+DP +G + EQ+ +E RRG M +K+F+F++LK MDED+AE DD ++P +WLWP TGE
Sbjct: 119 YVDPETGIMMEQNALESRRGQMKVKWFSFSVLKGMDEDMAEKVDD-EHPTYRWLWPHTGE 177
Query: 1063 VHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
V W+GI ERER+ERY K+++KR+ KE++ R+ Y+QK+LG
Sbjct: 178 VFWQGILERERQERYNMKLERKRRNKERL-ARMRSRYKQKSLG 219
>gi|26451288|dbj|BAC42745.1| unknown protein [Arabidopsis thaliana]
Length = 253
Score = 226 bits (576), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 159/221 (71%), Gaps = 5/221 (2%)
Query: 886 FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEF 945
FWS CD +N G+CR A+ + ++MY + ++++LPPMPEDG WS + W + T SFLEF
Sbjct: 13 FWSFCDAINAGNCRFAYNETLKKMYSI-KNLDSLPPMPEDGDTWSVMQSWALPTRSFLEF 71
Query: 946 IMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYL 1004
+MFSRMFVDSLDA + + N C LS ++ KHCY RVLELLVNVWAYHS R++VY+
Sbjct: 72 VMFSRMFVDSLDAQIYEEHHRTNRCYLSLTK--DKHCYSRVLELLVNVWAYHSARRIVYI 129
Query: 1005 DPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVH 1064
DP +G +QEQH + RRG MW+K+F++T LK+MDEDLAE A D D WLWPWTGE+
Sbjct: 130 DPETGLMQEQHKQKNRRGKMWVKWFDYTTLKTMDEDLAEEA-DSDRRVGHWLWPWTGEIV 188
Query: 1065 WKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
W+G E+E++++ +K +KK+K ++K+ ++ RQK +G
Sbjct: 189 WRGTLEKEKQKKNLEKEEKKKKSRDKLSRMRSRSGRQKVIG 229
>gi|62321122|dbj|BAD94235.1| hypothetical protein [Arabidopsis thaliana]
Length = 253
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 113/221 (51%), Positives = 158/221 (71%), Gaps = 5/221 (2%)
Query: 886 FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEF 945
FWS CD +N G+CR A+ + ++MY + ++++LPPMPEDG WS + W + T SFLEF
Sbjct: 13 FWSFCDAINAGNCRFAYNETLKKMYSI-KNLDSLPPMPEDGDTWSVMQSWALPTRSFLEF 71
Query: 946 IMFSRMFVDSLDA-LNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYL 1004
+MFSRMFVDSLDA + + N C LS ++ KHCY RVLELLVNVWAYHS R++VY+
Sbjct: 72 VMFSRMFVDSLDAQIYEEHHRTNRCYLSLTK--DKHCYSRVLELLVNVWAYHSARRIVYI 129
Query: 1005 DPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVH 1064
DP +G +QEQH + RRG MW+K+F++ LK+MDEDLAE A D D WLWPWTGE+
Sbjct: 130 DPETGLMQEQHKQKNRRGKMWVKWFDYITLKTMDEDLAEEA-DSDRRVGHWLWPWTGEIV 188
Query: 1065 WKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
W+G E+E++++ +K +KK+K ++K+ ++ RQK +G
Sbjct: 189 WRGTLEKEKQKKNLEKEEKKKKSRDKLSRMRSRSGRQKVIG 229
>gi|413950657|gb|AFW83306.1| hypothetical protein ZEAMMB73_662473 [Zea mays]
Length = 217
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 102/200 (51%), Positives = 142/200 (71%), Gaps = 7/200 (3%)
Query: 909 MYG--LPSHVEALPPMPEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDA-LNANSSK 965
MYG L +++L MP DG WS + WV+ T SFLEF+MFSRMFVD+LDA + +
Sbjct: 1 MYGVHLDHDLDSLLHMPNDGDTWSVMQSWVLPTRSFLEFVMFSRMFVDALDAQMYDKHHQ 60
Query: 966 VNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMW 1025
C+LS + +HCY RVLEL+VNVWA+HS R+MVY++P +G++QEQH + RRG M
Sbjct: 61 TGHCILSLHK--DRHCYSRVLELIVNVWAFHSARRMVYVNPQTGAMQEQHHLSGRRGQMS 118
Query: 1026 MKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKR 1085
+++F++T LKSMDE+LAE D D+P +WLWP TGEV W+G+Y+RER R ++K +K+
Sbjct: 119 VQFFSYTTLKSMDEELAEEF-DLDHPDRRWLWPQTGEVFWQGLYQRERIMRQQEKERRKQ 177
Query: 1086 KMKEKMFDRLTKGYRQKTLG 1105
+ +EK+ R+ RQKTLG
Sbjct: 178 QSREKI-QRIKNRARQKTLG 196
>gi|242058067|ref|XP_002458179.1| hypothetical protein SORBIDRAFT_03g028310 [Sorghum bicolor]
gi|241930154|gb|EES03299.1| hypothetical protein SORBIDRAFT_03g028310 [Sorghum bicolor]
Length = 424
Score = 193 bits (490), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 95/261 (36%), Positives = 152/261 (58%), Gaps = 4/261 (1%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQI 276
R GVR PR+ ++ G++ + L +++V L+ +GY K++++ G ++W I +
Sbjct: 155 RLGVRKPRLAMVFGDLYPSAMQLQMVSVASVLEAMGYEMKVFSLEDGPCGNIWRAIGVPV 214
Query: 277 SILGQEQYSL---IDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLP 333
SIL E SL +DW +DGI+ +S+EA+ SSL+ EPF SIP++W + E SLA+R
Sbjct: 215 SIL-PEDTSLPIFVDWLDYDGILVNSIEARPVFSSLLHEPFKSIPIIWTVHEYSLAHRAK 273
Query: 334 VYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYS 393
Y G L++ WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPG P++ + V+ +
Sbjct: 274 EYNASGMIQLINAWKDVFSRANVIVFPNYILPVMYAAFDSGNYFVIPGPPSEAFQVDRFI 333
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGS 453
+R G + ++ +VGS F Y + + A+ + V PLL +Y NS +
Sbjct: 334 AKSYHEDVRISLGLSPKDFLIAIVGSPFSYGDNLMEEALVLQAVSPLLQRYRSENSTQSE 393
Query: 454 FKFVFLCGNSTDGYNDALQEV 474
K GN T+ + AL+ +
Sbjct: 394 LKVKVFTGNITEKHRMALEVI 414
>gi|413950653|gb|AFW83302.1| hypothetical protein ZEAMMB73_990643, partial [Zea mays]
Length = 252
Score = 162 bits (411), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 128/211 (60%), Gaps = 10/211 (4%)
Query: 217 RFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQI 276
R G R PR+ ++ G++ + L +++V L+ +GY K++++ G ++W I +
Sbjct: 39 RLGARKPRLAMVFGDLYPSAMQLQMVSVASVLEAMGYEMKVFSLEDGPCGNIWRAIGVPV 98
Query: 277 SILGQEQYSLI--DWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPV 334
SIL ++ I DW +DGI+ +S+EA+ SSL+ EPF SIP++W + E SLA+R
Sbjct: 99 SILPEDTNLPISVDWLDYDGILVNSIEARPVFSSLLHEPFKSIPVIWTVHEHSLAHRAKE 158
Query: 335 YVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSK 394
Y G L++ WK VFSR NVIVFP+Y LP++Y+ D+GN+FVIPG P+ EA+S
Sbjct: 159 YNASGMIQLINAWKDVFSRANVIVFPNYILPVMYAAFDSGNYFVIPGPPS-----EAFSY 213
Query: 395 SHEKYQLRKENGFLKDEIVVVVVGSSFFYNE 425
HE +R G + ++ +VG+ F Y +
Sbjct: 214 -HE--DVRISLGLSPKDFLIAIVGTPFSYGD 241
>gi|361069891|gb|AEW09257.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127516|gb|AFG44406.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127517|gb|AFG44407.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127518|gb|AFG44408.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127519|gb|AFG44409.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127520|gb|AFG44410.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127521|gb|AFG44411.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127522|gb|AFG44412.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127523|gb|AFG44413.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127524|gb|AFG44414.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127525|gb|AFG44415.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127527|gb|AFG44416.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127528|gb|AFG44417.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127529|gb|AFG44418.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127530|gb|AFG44419.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127531|gb|AFG44420.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127532|gb|AFG44421.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127533|gb|AFG44422.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
gi|383127534|gb|AFG44423.1| Pinus taeda anonymous locus UMN_1455_01 genomic sequence
Length = 137
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 95/118 (80%), Gaps = 2/118 (1%)
Query: 988 LLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADD 1047
LLVNVWAYHS R+M+Y++P +G +QE H ++ RRG MW+K+F+FTLLKSMDED+AE D
Sbjct: 1 LLVNVWAYHSARRMIYVNPETGHMQEHHKLKSRRGNMWVKWFDFTLLKSMDEDMAEEV-D 59
Query: 1048 GDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
++P ++WLWP TGEV+W+G+YERER R R +++KKRK K+K+ R+ YRQK+LG
Sbjct: 60 TEHPTKRWLWPSTGEVYWQGVYERERHRRNRLRIEKKRKSKDKLV-RMRHRYRQKSLG 116
>gi|302820466|ref|XP_002991900.1| hypothetical protein SELMODRAFT_430173 [Selaginella moellendorffii]
gi|300140286|gb|EFJ07011.1| hypothetical protein SELMODRAFT_430173 [Selaginella moellendorffii]
Length = 801
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 105/226 (46%), Gaps = 66/226 (29%)
Query: 886 FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSA------LHGWVMQT 939
FW+ C+ N G+CR F + F+ MYGL LP MP G WS L G+++
Sbjct: 65 FWTSCNASNAGNCRRIFMETFKTMYGLNKDQLELPTMP--FGVWSVCCTYSNLTGYLVLG 122
Query: 940 PSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGR 999
S+ I+ S + + ++ + +HCYC +LE+LVNVWA
Sbjct: 123 CSWTLSILSSTLTITIMETVLLQPHMA------------QHCYCCLLEVLVNVWAE---- 166
Query: 1000 KMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLAEAADDGDYPREKWLWPW 1059
WM AE DD ++P +WLWP
Sbjct: 167 -------------------------WM---------------AEKVDD-EHPTYRWLWPH 185
Query: 1060 TGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQKTLG 1105
TGEV W+GI ERER+ERY K+++KR+ KE++ R+ Y+QK+LG
Sbjct: 186 TGEVFWQGILERERQERYNMKLERKRRNKERL-ARMRSRYKQKSLG 230
>gi|302796715|ref|XP_002980119.1| hypothetical protein SELMODRAFT_419667 [Selaginella moellendorffii]
gi|300152346|gb|EFJ18989.1| hypothetical protein SELMODRAFT_419667 [Selaginella moellendorffii]
Length = 676
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 80/149 (53%), Gaps = 20/149 (13%)
Query: 886 FWSMCDILNGGHCRTAFEDAFRQMYGLPSHVEALPPMPEDGGCWSA------LHGWVMQT 939
FW+ C+ N G+CR F + F+ MYGL LP MP G WS L G+++
Sbjct: 4 FWTSCNASNAGNCRRIFMETFKTMYGLNKDQLELPTMP--FGVWSVCCTYSNLTGYLVLG 61
Query: 940 PSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCYCRVLELLVNVWAYHSGR 999
S+ I+ S + + ++ + +HCYC +LE+LVNVWAYHS R
Sbjct: 62 CSWTLSILSSTLTITIMETVLLQPHMA------------QHCYCCLLEVLVNVWAYHSAR 109
Query: 1000 KMVYLDPLSGSLQEQHPIERRRGFMWMKY 1028
+++Y+DP +G + EQ+ +E RR M +K+
Sbjct: 110 RLIYVDPETGIMMEQNALESRRRQMKVKW 138
>gi|302820470|ref|XP_002991902.1| hypothetical protein SELMODRAFT_430175 [Selaginella moellendorffii]
gi|300140288|gb|EFJ07013.1| hypothetical protein SELMODRAFT_430175 [Selaginella moellendorffii]
Length = 255
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/74 (54%), Positives = 56/74 (75%), Gaps = 2/74 (2%)
Query: 1032 TLLKSMDEDLAEAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKM 1091
T L+ MDED+AE DD ++P +WLWP TGEV W+GI ERER+ERY K+++KR+ KE++
Sbjct: 163 TQLEGMDEDMAEKVDD-EHPTYRWLWPHTGEVFWQGILERERQERYNMKLERKRRNKERL 221
Query: 1092 FDRLTKGYRQKTLG 1105
R+ Y+QK+LG
Sbjct: 222 A-RMCSRYKQKSLG 234
>gi|414873844|tpg|DAA52401.1| TPA: putative ribosomal protein S8e family protein [Zea mays]
Length = 669
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 68/133 (51%), Gaps = 21/133 (15%)
Query: 254 VFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLI--DWSIFDGIIADSLEAKEAIS---- 307
V ++++ G ++W I +SIL ++ I DW +DGI+ +S+EA+ S
Sbjct: 483 VSVVFSLEDGPCGNIWGTIGVPVSILPEDTNLPISVDWLDYDGILVNSIEARPVFSRILS 542
Query: 308 ---------------SLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFS 352
SL+ EPF IP++W + E SLA+R Y G L+ WK VFS
Sbjct: 543 QSQMFDLDHVSVASYSLLHEPFKYIPIIWTVHEYSLAHRAKEYNVSGMIQLIKAWKDVFS 602
Query: 353 RVNVIVFPDYTLP 365
R NVIVFP+Y P
Sbjct: 603 RTNVIVFPNYIWP 615
>gi|340781523|ref|YP_004748130.1| glycosyl transferase family protein [Acidithiobacillus caldus SM-1]
gi|340555676|gb|AEK57430.1| glycosyl transferase group 1 [Acidithiobacillus caldus SM-1]
Length = 1418
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 128/289 (44%), Gaps = 28/289 (9%)
Query: 317 IPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNF 376
+P +W ++E + Y + +L S + F+ ++F + +++ +D
Sbjct: 1129 LPTLWNVRESEPWD---TYFDFLSPDLRSIPYNCFAYPYRVIFVAHATRQIWAPVDQYFN 1185
Query: 377 FVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHD 436
F + + D+ +E S ++ + R+ +++ VV+VG+ D +
Sbjct: 1186 FTVIHNGLDLSRLETRLASIDRVEARRVLSIAPNDVAVVLVGTVC-------DRKGQIDL 1238
Query: 437 VGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRL-GLLEHSVRHYGFNGDVN 495
V LL A SV G + F+ G+ Y+ A+ + SRL L + DV+
Sbjct: 1239 VRALL---ALPQSVAGRLR-AFIVGDRAGDYSTAMHQEISRLPAALRARIIVEPETSDVS 1294
Query: 496 GVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNP 555
ADI L +S++E +P +++ AM G+P+IT I+E + E +F++ N
Sbjct: 1295 VYYRAADIAL-CTSRIE-SYPRVVLEAMASGLPLITTPVFGIREQIRENINGLFYEAGNA 1352
Query: 556 EGLSRAFSLFISN-GKLSKFARTVASAGRLHAKNMLALDCVTRYARILE 603
E L++A + I N + + FA + L L+ +T Y ++E
Sbjct: 1353 EALAQALTSLIENDDQRAAFAEA----------SKLVLESLTTYPEMVE 1391
>gi|389795079|ref|ZP_10198217.1| group 1 glycosyl transferase [Rhodanobacter fulvus Jip2]
gi|388431530|gb|EIL88601.1| group 1 glycosyl transferase [Rhodanobacter fulvus Jip2]
Length = 385
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 70/167 (41%), Gaps = 11/167 (6%)
Query: 421 FFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGL 480
+ L WD VA + + R+ +G L G G A+ E R +
Sbjct: 199 LLASRLLWDKGVAEY------VDAIRQLRQQGRRFKALLAGTPDPGNPAAVPEATIRGWV 252
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
E + G D+ G+L DIV+ S + +G P +V A G+P+IT D P +E
Sbjct: 253 DEGLLTWLGHVDDMAGLLGAVDIVVLPSYR--EGLPRTLVEAAACGLPLITTDVPGCREV 310
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAK 587
V +G + N + L+RA + + FAR + A R A+
Sbjct: 311 VTDGIDGLLVPMGNSDALARAMARLQDD---PAFARRLGDAARRKAR 354
>gi|389807965|ref|ZP_10204436.1| group 1 glycosyl transferase [Rhodanobacter thiooxydans LCS2]
gi|388443300|gb|EIL99455.1| group 1 glycosyl transferase [Rhodanobacter thiooxydans LCS2]
Length = 400
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 69/168 (41%), Gaps = 11/168 (6%)
Query: 421 FFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGL 480
+ L WD V + + R+ +G L G G A+ E R
Sbjct: 214 LLASRLLWDKGVGEY------VAAIRQLRGQGRSLHALLAGAPDPGNPAAVPESTIRAWA 267
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
E + G D+ G+L DIV+ S + +G P +V A G+P+IT D P +E
Sbjct: 268 DEGLLTWLGHVDDMAGLLGSVDIVVLPSYR--EGLPRTLVEAAACGLPLITTDVPGCREV 325
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
VA+G + N + L+RA + + FAR + A R A+
Sbjct: 326 VADGIDGLLVPVRNSDALARAIARLQDD---PPFARRLGDAARSKART 370
>gi|410100363|ref|ZP_11295326.1| hypothetical protein HMPREF1076_04504 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215884|gb|EKN08874.1| hypothetical protein HMPREF1076_04504 [Parabacteroides goldsteinii
CL02T12C30]
Length = 387
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 80/171 (46%), Gaps = 22/171 (12%)
Query: 361 DYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSS 420
DY + + G + +PG D+ ++K +K +LRKE DEIV V +G
Sbjct: 163 DYERAKKFKLRKNGKVYYVPGVGVDL---SVFNKGEDKSELRKEYNIKDDEIVFVSMGDL 219
Query: 421 FFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGL 480
+YAVA++ + A N+++ +CGN D ++++ +LG
Sbjct: 220 V----PRKNYAVALNAIAK-----AGVNNLK-----YLICGNGPQ--MDEMKQLCVKLG- 262
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
+E V G GDV+ ++ +D+ L+ S Q +G ++ AM G+P +
Sbjct: 263 IEKQVEFLGRRGDVSKIVKASDVFLFTSKQ--EGLARSLMEAMASGLPCVV 311
>gi|347360056|ref|YP_389380.2| group 1 glycosyl transferase [Desulfovibrio alaskensis G20]
gi|342906591|gb|ABB39685.2| glycosyl transferase group 1 [Desulfovibrio alaskensis G20]
Length = 374
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 73/151 (48%), Gaps = 9/151 (5%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ LCG D ++++A +L L H +R G DV G+L +DI + S + +G P
Sbjct: 231 LVLCGRD-DSNAPVIRKLAGKLHLDNH-IRFAGATDDVAGILNASDIAAF--SFISEGLP 286
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
++++ +M G+PV+ D P ++ + I +P ++ + + + L A
Sbjct: 287 NVVIESMAAGLPVVATDIPGVRAAIGSCDDNILVPPKDPAAMAHGIAKILQDPALRSRA- 345
Query: 577 TVASAGRLHAKNMLALDCV-TRYAR-ILENV 605
A+A A + A D V + +A+ IL N+
Sbjct: 346 --AAANITQATSTFAPDVVFSAFAKLILSNI 374
>gi|352085446|ref|ZP_08953066.1| sugar transferase, PEP-CTERM/EpsH1 system associated [Rhodanobacter
sp. 2APBS1]
gi|351681867|gb|EHA64981.1| sugar transferase, PEP-CTERM/EpsH1 system associated [Rhodanobacter
sp. 2APBS1]
Length = 398
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 10/146 (6%)
Query: 479 GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
G L VR G DV L+A+ ++ S + +G P +++ AM G+PV+ D +
Sbjct: 254 GGLSTQVRLLGNRSDVAA--LLAEFDVFALSSIAEGMPGVLLEAMASGLPVVATDVGGVS 311
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKF---ARTVASAGRLHAKNMLALDCV 595
E V G +P L+ A ++++ KL + A A R ++M V
Sbjct: 312 EVVVAGVTGTLVPAGDPHALAAALRAYVADEKLRRRHGEAGCERVAARFGLRSM-----V 366
Query: 596 TRYARILENVLNFPSDALLPGPISQL 621
+ Y + + +L P++A+ P +S L
Sbjct: 367 SAYVALYDELLGRPTNAVQPRMMSGL 392
>gi|389798123|ref|ZP_10201150.1| sugar transferase [Rhodanobacter sp. 116-2]
gi|388445778|gb|EIM01836.1| sugar transferase [Rhodanobacter sp. 116-2]
Length = 398
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 10/146 (6%)
Query: 479 GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
G L VR G DV L+A+ ++ S + +G P +++ AM G+PV+ D +
Sbjct: 254 GGLSTQVRLLGNRSDVAA--LLAEFDVFALSSIAEGMPGVLLEAMASGLPVVATDVGGVS 311
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKF---ARTVASAGRLHAKNMLALDCV 595
E V G +P L+ A ++++ KL + A A R ++M V
Sbjct: 312 EVVVAGVTGTLVPAGDPHALAAALRAYVADEKLRRRHGEAGCERVAARFGLRSM-----V 366
Query: 596 TRYARILENVLNFPSDALLPGPISQL 621
+ Y + + +L P++A+ P +S L
Sbjct: 367 SAYVALYDELLGRPTNAVQPRMMSGL 392
>gi|389806134|ref|ZP_10203274.1| sugar transferase [Rhodanobacter thiooxydans LCS2]
gi|388446133|gb|EIM02179.1| sugar transferase [Rhodanobacter thiooxydans LCS2]
Length = 398
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 69/152 (45%), Gaps = 7/152 (4%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L++ RLGL E +VR G DV +L D+ + S V +G P ++ AM G+PV+
Sbjct: 247 LEQQIVRLGLTE-TVRLLGNRSDVAQLLAECDV--FALSSVAEGMPITLLEAMAAGLPVV 303
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
D + + +G +P L+ A ++++ L R AGR
Sbjct: 304 ATDVGGVASVIEDGVTGTLVPPGDPHALAAALHFYVADEPLR---RRHGDAGRARVAAHF 360
Query: 591 AL-DCVTRYARILENVLNFPSDALLPGPISQL 621
+L V+ Y + + +L ++A+ P +S L
Sbjct: 361 SLRSMVSAYVALYDELLGRQTNAVQPRMVSGL 392
>gi|417819146|ref|ZP_12465765.1| glycosyl transferases group 1 family protein [Vibrio cholerae HE39]
gi|423944526|ref|ZP_17733207.1| glycosyl transferases group 1 family protein [Vibrio cholerae
HE-40]
gi|423973974|ref|ZP_17736754.1| glycosyl transferases group 1 family protein [Vibrio cholerae
HE-46]
gi|340043036|gb|EGR03997.1| glycosyl transferases group 1 family protein [Vibrio cholerae HE39]
gi|408662484|gb|EKL33427.1| glycosyl transferases group 1 family protein [Vibrio cholerae
HE-40]
gi|408666430|gb|EKL37218.1| glycosyl transferases group 1 family protein [Vibrio cholerae
HE-46]
Length = 360
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 66/139 (47%), Gaps = 6/139 (4%)
Query: 471 LQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
L+ +AS LG+ HSVR G+ N +++ + DI + S + F ++ A + GIP
Sbjct: 221 LKSLASELGI-SHSVRFEGYVDNKNISQYINEFDIFIALSRLESESFGVAVIEASSCGIP 279
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
V+ + KE V + I QKD P+ + A S +S+ L + + +GR+H N
Sbjct: 280 VVVSNVSGFKEVVVQEITGIVVQKDCPKSAADAISRLVSDHDLR---QKMGCSGRVHVMN 336
Query: 589 MLALDCVTRYARILENVLN 607
+ D L N +N
Sbjct: 337 NYSWDISLNKMICLYNEIN 355
>gi|291520146|emb|CBK75367.1| Glycosyltransferase [Butyrivibrio fibrisolvens 16/4]
Length = 404
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 95/218 (43%), Gaps = 16/218 (7%)
Query: 379 IPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFF---YNELSWDYAVAMH 435
IPG+ D A + K +R+E G D IV++ VG + E+ + M
Sbjct: 184 IPGAGVDTHKFVASDDARTK--VRRELGIDDDTIVLLSVGELIHRKNHAEVLRALKI-MK 240
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVN 495
D G LL + V+ +K +L N+ L+E LGL ++ V+ GF DV
Sbjct: 241 DNGTLLTPDSDER-VQPKYKIKYLIAGRGKIQNE-LEETIKHLGLQDY-VQLLGFRRDVA 297
Query: 496 GVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNP 555
V +DI ++ S Q +G P ++ AM+ G+PV+ + +++G F N
Sbjct: 298 DVFAASDIYVFPSHQ--EGLPVALMEAMSVGMPVVCSRIRGNTDLISDGEGGYLFDSRNA 355
Query: 556 EGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALD 593
+ L A + + + + + A G ++ + M D
Sbjct: 356 KSLVAALNKALVDNETKR-----AQMGEINVETMREFD 388
>gi|322419984|ref|YP_004199207.1| group 1 glycosyl transferase [Geobacter sp. M18]
gi|320126371|gb|ADW13931.1| glycosyl transferase group 1 [Geobacter sp. M18]
Length = 379
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 83/174 (47%), Gaps = 19/174 (10%)
Query: 402 RKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARR--NSVEGSFKFVFL 459
R++ G + D ++V +V + F+ D+A LI+ RR +++ G+ + + +
Sbjct: 189 RRDLGIVDDCLLVCMVANLHFFK----DHAT--------LIRAWRRVLDALRGNREAMLV 236
Query: 460 CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
G LQ + S LG+ E VR G DV+G+L +D LY S +G P+ I
Sbjct: 237 LPGRFSGAEHELQALVSELGI-EERVRFLGKLDDVSGLLRASD--LYVHSSHTEGVPNAI 293
Query: 520 VRAMTFGIPVITPDFPIIKEYVA-EGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+ AM G+PV+ D +E V EG + +D P+ ++ F + LS
Sbjct: 294 LEAMAAGLPVVATDIAGNREAVGPEGTDGLVPVRD-PDAMAAKIVEFSEDRGLS 346
>gi|187736562|ref|YP_001878674.1| group 1 glycosyl transferase [Akkermansia muciniphila ATCC BAA-835]
gi|187426614|gb|ACD05893.1| glycosyl transferase group 1 [Akkermansia muciniphila ATCC BAA-835]
Length = 390
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
++E S GL H + +G G++ I+L+ S +GF + V AM G+P++
Sbjct: 262 IEEYVSSNGL-HHRIILHGPCGNMEEEFPRHSILLFPSRC--EGFGLVQVEAMCCGLPIL 318
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA 581
D PI +E V + I F+ DNPE L RA ++ S A +A+A
Sbjct: 319 AADIPICREIVEKHHAGILFESDNPEDLCRAMREMTASDLSSYAANGLAAA 369
>gi|88800558|ref|ZP_01116120.1| predicted glycosyltransferase [Reinekea blandensis MED297]
gi|88776703|gb|EAR07916.1| predicted glycosyltransferase [Reinekea sp. MED297]
Length = 1009
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 132/294 (44%), Gaps = 47/294 (15%)
Query: 295 IIADSLEA-KEAISSLMQEPFHSIPLVWIIQE-----DSLANRLPVYVERGFQNLLSYWK 348
I+A++L++ + AI+ L + +P V + +E D +N LP Y+ RG Y
Sbjct: 709 IVANTLQSYQTAIAGLCLD----MPTVLVPRESEEPADYFSN-LPKYL-RG------YAD 756
Query: 349 SVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFL 408
S+ S V+ VF L+ + N VI A + +E K R GF
Sbjct: 757 SLVSSVDQAVFVAKATQKLWKHANMDNQRVIYNGLA-ISELEKKVAGETKESARISFGFS 815
Query: 409 KDEIVVVVVGSSFFYNELSWDYAVAMHDVGPL----LIKYARRNSVEGSFKFVFLCGNST 464
++ V++ VG+ G L I RN+VE S KFV + G +
Sbjct: 816 SNDKVILSVGT-------------VCERKGQLDLIRSIPSILRNNVEESIKFV-IVGMND 861
Query: 465 DGYNDALQEVASRL-GLLEHSV-----RHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
+ Y+ AL++ SR ++ SV + V +LL +D L+ S + + +P +
Sbjct: 862 NEYSMALKDAVSRFPQKVQESVFLLPQTEQSNDTLVQKLLLASD--LFVISSIYESYPRV 919
Query: 519 IVRAMTFGIPVI-TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
++ A+ FG+PVI TP F ++ E + +G F+Q+ N LS + + +L
Sbjct: 920 VLEALYFGLPVIATPCFGVL-EQIDDGKSGFFYQEGNYHDLSEKICTLVCDDRL 972
>gi|20807430|ref|NP_622601.1| glycosyltransferase [Thermoanaerobacter tengcongensis MB4]
gi|20515953|gb|AAM24205.1| predicted glycosyltransferases [Thermoanaerobacter tengcongensis
MB4]
Length = 374
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 66/150 (44%), Gaps = 8/150 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
F F+ G+ Y +L+E GL E GF D+ L D+ + S E+GF
Sbjct: 231 FFFVAGDGP--YRKSLEEKIKEKGL-ESRFFLLGFRDDIPSFLRNLDVFVLPSH--EEGF 285
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
++ AM G+PV+ I E + EG I +K N E LS+A + + L +
Sbjct: 286 GISVIEAMNEGVPVVATAVGGIPEIIQEGVNGILVEKGNIESLSKAIKSLLKDAHLKE-- 343
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILENV 605
T++ G+ AK + V R + E +
Sbjct: 344 -TLSLKGKEAAKKFSCEEMVKRVEELYERI 372
>gi|150376703|ref|YP_001313299.1| group 1 glycosyl transferase [Sinorhizobium medicae WSM419]
gi|150031250|gb|ABR63366.1| glycosyl transferase group 1 [Sinorhizobium medicae WSM419]
Length = 371
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 11/146 (7%)
Query: 460 CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
GN N + E+ ++ GL H+V G D+ D+++ S +GFP++I
Sbjct: 232 AGNGLVCDNPEVMELIAKAGLPAHAVDLRGEVSDMPAFYRSIDLLVLSSRT--EGFPNVI 289
Query: 520 VRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF--ISNGKLSKFART 577
AM++G P++T D + V G I NP+ L+ A F +S + +++ART
Sbjct: 290 AEAMSYGKPIVTTD--VGDAAVVAGRAGIAVPPRNPQALAEAMRAFLDLSEAEYARYART 347
Query: 578 VASAGRLHAKNMLALDCVT-RYARIL 602
R +N AL V+ +Y++ L
Sbjct: 348 ----ARERIENEYALAAVSAKYSKFL 369
>gi|338739958|ref|YP_004676920.1| group 1 glycosyl transferase [Hyphomicrobium sp. MC1]
gi|337760521|emb|CCB66354.1| Glycosyl transferase group 1 [Hyphomicrobium sp. MC1]
Length = 381
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 80/189 (42%), Gaps = 19/189 (10%)
Query: 416 VVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDG-YNDALQEV 474
V+G NE+ D+A +H Y R VE V DG L+E
Sbjct: 194 VIGMVARLNEIK-DHATLLHA-----FAYLRGRHVEAELLLV------GDGPLRRQLEEQ 241
Query: 475 ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDF 534
A RLG+ R G DV +L D+ + +++ E GF ++ AM GIP++ D
Sbjct: 242 AKRLGI-SMQTRFVGERVDVLDLLGGIDVYAFSTTRAE-GFGIALIEAMAAGIPIVATDV 299
Query: 535 PIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALD- 593
P +E + GA + N + A S + N ++ R V A R K ++D
Sbjct: 300 PACREVLGGGAAGLLVPPGNAAAFAYALSELLQND--ARRCRLV-EAARDRVKREYSIDV 356
Query: 594 CVTRYARIL 602
C R+ ++L
Sbjct: 357 CARRWEQVL 365
>gi|440680033|ref|YP_007154828.1| glycosyl transferase group 1 [Anabaena cylindrica PCC 7122]
gi|428677152|gb|AFZ55918.1| glycosyl transferase group 1 [Anabaena cylindrica PCC 7122]
Length = 384
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 8/134 (5%)
Query: 456 FVFLCGNSTDGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVE 512
V L G++ G + +QE+ + LGL E+ V+ GF D+ ++ M D+V + S+ E
Sbjct: 230 IVMLVGDALFGEQEYVQELHQKVTALGL-ENQVKFLGFRSDIPQLMTMCDLVAHTSTAPE 288
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
F +IV AM G PV+ + E V +G +NP+ L++ + I+ +
Sbjct: 289 P-FGRVIVEAMLCGKPVVAAKAGGVMELVEDGINGFLVTPNNPQELAQVINTCINE---T 344
Query: 573 KFARTVASAGRLHA 586
T+A+ R++A
Sbjct: 345 TRTATIANNARINA 358
>gi|428769457|ref|YP_007161247.1| group 1 glycosyl transferase [Cyanobacterium aponinum PCC 10605]
gi|428683736|gb|AFZ53203.1| glycosyl transferase group 1 [Cyanobacterium aponinum PCC 10605]
Length = 381
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 65/136 (47%), Gaps = 3/136 (2%)
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
PLLI+ E +F+CG+ ++ L+++ + L E S+ G+ D + ++
Sbjct: 213 PLLIEAIGILKKENYPVVLFVCGHGSERETLILKKLVEKHKL-EDSIIFLGYRKDWHILV 271
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
AD+++ S + +G P++++ AM GIPV+ D IK V F + + L
Sbjct: 272 KKADLLVLPS--IREGMPNVMLEAMALGIPVLVSDISEIKALVEHKKNAYLFHVNRLDSL 329
Query: 559 SRAFSLFISNGKLSKF 574
+ S G+L K+
Sbjct: 330 VNSLKELYSLGELRKY 345
>gi|163846943|ref|YP_001634987.1| group 1 glycosyl transferase [Chloroflexus aurantiacus J-10-fl]
gi|222524767|ref|YP_002569238.1| group 1 glycosyl transferase [Chloroflexus sp. Y-400-fl]
gi|163668232|gb|ABY34598.1| glycosyl transferase group 1 [Chloroflexus aurantiacus J-10-fl]
gi|222448646|gb|ACM52912.1| glycosyl transferase group 1 [Chloroflexus sp. Y-400-fl]
Length = 392
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 9/145 (6%)
Query: 465 DGYNDALQ---EVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
DGY + Q E A+ LGL +V G D ++ D++L S+ + +G P++I+
Sbjct: 249 DGYGEKPQRFREQATALGL-SQAVFWLGARPDAIDIIAGLDVLL--SASLWEGLPTVILE 305
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA 581
AM G PV+ D P +E V + +P L++A ISN L A+ +A++
Sbjct: 306 AMAVGTPVVATDIPGTRELVLHEQTGLLAPPQSPPALAQAMIRLISNPGL---AQRLATS 362
Query: 582 GRLHAKNMLALDCVTRYARILENVL 606
R A +Y R+ +++
Sbjct: 363 ARERAAQFSITAAARQYERLYRSLI 387
>gi|385799180|ref|YP_005835584.1| group 1 glycosyl transferase [Halanaerobium praevalens DSM 2228]
gi|309388544|gb|ADO76424.1| glycosyl transferase group 1 [Halanaerobium praevalens DSM 2228]
Length = 372
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 93/195 (47%), Gaps = 17/195 (8%)
Query: 398 KYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFV 457
K+ LR E KDE+++V +G + + Y + + P +I+ + +FK +
Sbjct: 180 KFDLRDEFEIKKDELILVNIGR--MCRQKAQQYLI---EALPKVIEKHQ------NFKVL 228
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
F+ G + +++A LG+ + +V GF D+ +L AD +++ ++ E G P
Sbjct: 229 FV--GKLGGKENKYKKLAEELGV-KDNVIFTGFRKDIPSILKQADFMVH-TAIYEGGSPW 284
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
+I+ AM G+P+++ + I E+V +G + NPE ++ I N K +
Sbjct: 285 VILEAMMAGLPIVSTEAITIPEFVQDGVNGYLAENKNPEDIANQVIKMIENKDRVKMGQK 344
Query: 578 VA--SAGRLHAKNML 590
A +A + K M+
Sbjct: 345 SAEIAAEKYTFKKMI 359
>gi|398337304|ref|ZP_10522009.1| group 1 glycosyl transferase [Leptospira kmetyi serovar Malaysia
str. Bejo-Iso9]
Length = 391
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
L G Y D L+++++ ++ V GF+ DVN L D+++ SS E F +
Sbjct: 251 LLGPGDPTYIDYLKQISAAY---KNPVSFLGFHNDVNQFLKQVDLLVVPSSHFE-SFGMV 306
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
I+ +M PVI DF +KE VA+G + + + + +A + N K+SK
Sbjct: 307 ILESMRMKKPVICTDFGGMKEVVADGKTGMVVPAKDVKAMGKAIDHLLKNPKISK 361
>gi|254479245|ref|ZP_05092589.1| glycosyl transferase, group 1 family [Carboxydibrachium pacificum
DSM 12653]
gi|214034814|gb|EEB75544.1| glycosyl transferase, group 1 family [Carboxydibrachium pacificum
DSM 12653]
Length = 371
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 66/150 (44%), Gaps = 8/150 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
F F+ G+ Y +L+E GL E GF D+ L D+ + S E+GF
Sbjct: 228 FFFVAGDGP--YRKSLEEKIKEKGL-ESRFFLLGFRDDIPSFLRNLDVFVLPSH--EEGF 282
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
++ AM G+PV+ I E + EG I +K N E LS+A + + L +
Sbjct: 283 GISVIEAMNEGVPVVATAVGGIPEIIQEGVNGILVEKGNIESLSKAIKSLLKDAHLKE-- 340
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILENV 605
T++ G+ AK + V R + E +
Sbjct: 341 -TLSLKGKEVAKKFSCEEMVKRVEELYERI 369
>gi|347527976|ref|YP_004834723.1| putative glycosyltransferase [Sphingobium sp. SYK-6]
gi|345136657|dbj|BAK66266.1| putative glycosyltransferase [Sphingobium sp. SYK-6]
Length = 379
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 74/190 (38%), Gaps = 17/190 (8%)
Query: 396 HEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFK 455
E+ LR + G EI VV+V F + V P L+ R + G F
Sbjct: 183 QEREALRAKLGIAPGEIAVVMVAGIFAKKR----QLEVIRLVVPQLLDLPARFHMVGDF- 237
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
T+ Y LG E + H GF DV L ADIVL S + +G
Sbjct: 238 -----APETELYARTCAAAVDELGFSERVLFH-GFRPDVADWLAAADIVLVASER--EGL 289
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GKL 571
++ AM G PV++ D E + + D+ +GL++A + ++ +
Sbjct: 290 ARCMIEAMACGTPVVSVDVSSAHEMLLRTGAGLVSGADDWQGLAQALRVLAADPDRRTAM 349
Query: 572 SKFARTVASA 581
+ RT A A
Sbjct: 350 GQAGRTTAEA 359
>gi|171916002|ref|ZP_02931472.1| glycosyl transferase, group 1 [Verrucomicrobium spinosum DSM 4136]
Length = 1982
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 79/373 (21%), Positives = 154/373 (41%), Gaps = 42/373 (11%)
Query: 220 VRPPRIGLILGNMAKDSRSLLLITVVKNLQKLGYVFKIY-AVRSGNSHSLWEQIAGQISI 278
R R N+ + LL+ ++ L+ G+V + + G +EQ Q+++
Sbjct: 1321 TRATRTAFATHNLNHEGAPRLLLELITGLKTRGHVEPVVVSPVDGPLRQEFEQAGIQVAV 1380
Query: 279 ------------LGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQE- 325
L + ++ + ++A++L+ A++ +E + P +W + E
Sbjct: 1381 GPISEGGSSIEELQEYLAKILRECRAELVVANTLQTWWAVTLAHRE---NTPSIWHVHES 1437
Query: 326 ---DSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGS 382
D+ + LP+ V +++ +I TL ++ NF IPG+
Sbjct: 1438 EPWDTYFDYLPLSVRS------EAYRAFAMPYRIIFVARATLRGWEALNSRKNFECIPGA 1491
Query: 383 PADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLI 442
D+ ++A + + +LR + DE+V+++VG+ + D AM + ++
Sbjct: 1492 -LDLDYLDAKIDNASRDELRSQLNIAADEVVLILVGT-VCDRKGQADAVEAMEILAQDVL 1549
Query: 443 KYARRNSVEGSFKFVFLCGNSTDGYNDAL-QEVASRLGLLEHSVRHYGFNGDVNGVLLMA 501
K R + + + Y L ++V S L + D + A
Sbjct: 1550 KRVR---------LILVGYRESLHYGQTLVRKVESLAPALRGKIVIEKERADCIRYVKAA 1600
Query: 502 DIVLYGSSQVEQGFPSLIVRAMTFGIPVI-TPDFPIIKEYVAEGAQVIFFQKDNPEGLSR 560
DI L +S+VE +P +I+ AM G+P+I TP F I+ E V G IF+Q L+R
Sbjct: 1601 DIALC-TSRVE-SYPRVILEAMACGLPIITTPVFGIV-EQVRPGRNAIFYQPGRISELAR 1657
Query: 561 AFSLFISNGKLSK 573
A + + + +
Sbjct: 1658 AIETLVRDEDMRR 1670
>gi|345020259|ref|ZP_08783872.1| capsular polysaccharide biosynthesis protein [Ornithinibacillus
scapharcae TW25]
Length = 395
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 117/291 (40%), Gaps = 38/291 (13%)
Query: 299 SLEAKEAISSLMQE------PFHSIPLVWIIQEDSLANRLPVYVERGFQ-----NLLSYW 347
+++A AI L+QE H+ ++++ + + +Y GF + LS W
Sbjct: 66 NMKAYAAIKKLLQEGKYDIVHVHTPIAAFLVRMAAPKKQQIIYTAHGFHFHEHGSRLSNW 125
Query: 348 ---------KSVFSRVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEK 398
S ++ VI D+ + +D ++ G AD + YS + K
Sbjct: 126 IYYVAEKVAASKTRKLIVINEEDFQSAQRFLPIDKIHYIHGVGLDADRFNPRNYSDKN-K 184
Query: 399 YQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVF 458
Y+LRKE G D IV+ + + F N+ D +A ++ + + FV
Sbjct: 185 YELRKELGISNDTIVITHL-AEFNENKRQIDIVLAAEELK------------KHTHNFVV 231
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
L + E+ +R LE V G+ GD++ +L + DI L S + +G P
Sbjct: 232 LLLGDGSLAAEIKNEITNRK--LEDYVFCLGYRGDIDQILSVTDIGLLVS--LREGLPKS 287
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG 569
++ M IP+I D ++ V + NP+ L A I NG
Sbjct: 288 VMEMMAMEIPLIVTDIRGNRDLVIDEKNGYVIPIRNPKALMEACLRLIHNG 338
>gi|418404691|ref|ZP_12978139.1| group 1 glycosyl transferase [Sinorhizobium meliloti CCNWSX0020]
gi|359501359|gb|EHK73973.1| group 1 glycosyl transferase [Sinorhizobium meliloti CCNWSX0020]
Length = 358
Score = 49.3 bits (116), Expect = 0.012, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 69/145 (47%), Gaps = 11/145 (7%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
GN N A+ E+ ++ GL H+V G D+ D+++ S +GFP++I
Sbjct: 220 GNGLVRDNPAVIELMTQAGLPAHAVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVIA 277
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS--NGKLSKFARTV 578
AM+FG P++T D + V G I +P+ L+ A F+ + +++ART
Sbjct: 278 EAMSFGKPIVTTD--VGDAAVVAGKAGIAVPARDPQALADAMRAFLDLPEAEYARYART- 334
Query: 579 ASAGRLHAKNMLALDCV-TRYARIL 602
R +N A+ V T+Y+ L
Sbjct: 335 ---ARERIENEYAIAAVTTKYSEFL 356
>gi|186681832|ref|YP_001865028.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
gi|186464284|gb|ACC80085.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
Length = 405
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 104/247 (42%), Gaps = 32/247 (12%)
Query: 370 VLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFF------Y 423
VLD + V+ GS DV + S EK + L+ ++VV ++ Y
Sbjct: 180 VLDGRDDLVL-GSGIDVEGLMKNSAEPEKLAATRRELGLEGQLVVTMISRLVISKGVREY 238
Query: 424 NELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEH 483
+++ M +V LLI GS +S G +QE+ + G+
Sbjct: 239 LQVASIVCQQMKNVTFLLI---------GSV-------SSEGGQAIPIQEIHQQAGV--- 279
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
VR+ G D+ +L ++D+ + S +G P +++ A T +P+IT D P KE V +
Sbjct: 280 -VRYLGPRNDIPTLLNLSDVFVL-PSYYREGVPRVLLEAATMELPLITTDMPGCKEVVKD 337
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT-RYARIL 602
G + + + L+ A +++ + + R+H + +L+ V YA I
Sbjct: 338 GWNGLLVPPRDTKALATAILKLLNSPEQRNL---MGKRSRVHVQTNFSLNQVADAYADIY 394
Query: 603 ENVLNFP 609
VL P
Sbjct: 395 NRVLKLP 401
>gi|53713292|ref|YP_099284.1| glycosyltransferase [Bacteroides fragilis YCH46]
gi|52216157|dbj|BAD48750.1| putative glycosyltransferase [Bacteroides fragilis YCH46]
Length = 392
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/124 (20%), Positives = 61/124 (49%), Gaps = 6/124 (4%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+E+S++ + F ++ +D+ + S +GFP ++ AM+ G+PVI D P++KE
Sbjct: 275 MENSIKIHPFTQNIASYYSASDVFILSSRW--EGFPLVLCEAMSMGLPVIASDLPVVKEI 332
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
+++ + FQ L+ ++ K T+ A +++ L D + ++ +
Sbjct: 333 LSDEQDCLIFQNGCVASLAEKMDRMVT----LKDIHTMGKAAFAKSRHFLVSDIIEQWKK 388
Query: 601 ILEN 604
++ +
Sbjct: 389 LIND 392
>gi|423257693|ref|ZP_17238616.1| hypothetical protein HMPREF1055_00893 [Bacteroides fragilis
CL07T00C01]
gi|423265340|ref|ZP_17244343.1| hypothetical protein HMPREF1056_02030 [Bacteroides fragilis
CL07T12C05]
gi|387778061|gb|EIK40157.1| hypothetical protein HMPREF1055_00893 [Bacteroides fragilis
CL07T00C01]
gi|392702998|gb|EIY96142.1| hypothetical protein HMPREF1056_02030 [Bacteroides fragilis
CL07T12C05]
Length = 392
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/124 (20%), Positives = 61/124 (49%), Gaps = 6/124 (4%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+E+S++ + F ++ +D+ + S +GFP ++ AM+ G+PVI D P++KE
Sbjct: 275 MENSIKIHPFTQNIASYYSASDVFILSSRW--EGFPLVLCEAMSMGLPVIASDLPVVKEI 332
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
+++ + FQ L+ ++ K T+ A +++ L D + ++ +
Sbjct: 333 LSDEQDCLIFQNGCVASLAEKMDRMVT----LKDIHTMGKAAFAKSRHFLVSDIIEQWKK 388
Query: 601 ILEN 604
++ +
Sbjct: 389 LIND 392
>gi|433654776|ref|YP_007298484.1| glycosyltransferase [Thermoanaerobacterium thermosaccharolyticum
M0795]
gi|433292965|gb|AGB18787.1| glycosyltransferase [Thermoanaerobacterium thermosaccharolyticum
M0795]
Length = 374
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 9/123 (7%)
Query: 456 FVFLCGNSTDG-YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
F F+ G DG Y + LQ++ + L ++R G+ D+N L DI + S +G
Sbjct: 230 FAFVAG---DGPYLNYLQDMVQKAKL--ENIRFLGYRNDINDFLRNIDIFVLPSHS--EG 282
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SK 573
F + AMT G+PVI + I E + I ++P L+ A + +N L +K
Sbjct: 283 FGISVAEAMTLGVPVIATNVGGIPEIIENNEDGIIVNPESPNDLANAIEILATNADLRNK 342
Query: 574 FAR 576
F++
Sbjct: 343 FSK 345
>gi|16264482|ref|NP_437274.1| glycosyltransferase [Sinorhizobium meliloti 1021]
gi|15140619|emb|CAC49134.1| putative glycosyltransferase protein [Sinorhizobium meliloti 1021]
Length = 360
Score = 48.9 bits (115), Expect = 0.018, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 68/145 (46%), Gaps = 11/145 (7%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
GN N A+ E+ ++ GL HSV G D+ D+++ S +GFP++I
Sbjct: 222 GNGLVRDNPAVIELMTQAGLPAHSVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVIA 279
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS--NGKLSKFARTV 578
AM+FG P++T D + G I +P+ L+ A F+ + +++ART
Sbjct: 280 EAMSFGKPIVTTD--VGDAAAVAGKAGIAVPARDPQALADAMRAFLDLPEAEYARYART- 336
Query: 579 ASAGRLHAKNMLALDCV-TRYARIL 602
R +N A+ V T+Y+ L
Sbjct: 337 ---ARERIENEYAIAAVTTKYSEFL 358
>gi|435855032|ref|YP_007316351.1| glycosyltransferase [Halobacteroides halobius DSM 5150]
gi|433671443|gb|AGB42258.1| glycosyltransferase [Halobacteroides halobius DSM 5150]
Length = 358
Score = 48.5 bits (114), Expect = 0.020, Method: Composition-based stats.
Identities = 36/133 (27%), Positives = 56/133 (42%), Gaps = 18/133 (13%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE S+ GF D+ +L DI + S ++GFP I+ AM+ G+P+I+ + I E
Sbjct: 238 LEESISILGFRKDIKAILNKTDIFVLPS--YDEGFPLSILEAMSMGVPIISTNIAGIPEM 295
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
+ E + + L L I++ KL K DC +R
Sbjct: 296 IEENKSGFLIKPGDINNLKAKIELLINDLKLRK----------------QMADCCMNKSR 339
Query: 601 ILENVLNFPSDAL 613
L +V N S+ L
Sbjct: 340 QLFDVSNMVSNTL 352
>gi|153004292|ref|YP_001378617.1| group 1 glycosyl transferase [Anaeromyxobacter sp. Fw109-5]
gi|152027865|gb|ABS25633.1| glycosyl transferase group 1 [Anaeromyxobacter sp. Fw109-5]
Length = 387
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 73/165 (44%), Gaps = 12/165 (7%)
Query: 442 IKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMA 501
+ ARR E + FL G D L+ +A+RLGL H+ G DV VL A
Sbjct: 187 VSLARRERPELA---AFLVGEGE--RVDGLKALAARLGLEGHAF-FLGRRPDVPAVLARA 240
Query: 502 DIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRA 561
+ + S+Q +G + ++ M G+P++ D E V +G + + PE L+RA
Sbjct: 241 AVGVLSSAQ--EGLSNAVIEGMAAGLPMVVTDAGGNGELVRDGERGLVVPVGQPEALARA 298
Query: 562 FSLFISNGKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENV 605
IS + AR AGR + + +D V + R+ V
Sbjct: 299 IVEVISE---PERARAFGQAGRRFVEREITVDRMVAEHERLYRRV 340
>gi|167590846|ref|ZP_02383234.1| glycosyl transferase, group 1 [Burkholderia ubonensis Bu]
Length = 370
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 62/141 (43%), Gaps = 5/141 (3%)
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
P L++ R + +G L G + D +N L + LG+ E V G D+ +L
Sbjct: 195 PNLLRALRMLAQQGFRGRCVLVGRNMDEHNAELMSLIRELGI-ERRVMLVGLRDDIPSIL 253
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQ--KDNPE 556
D+++ SS + FP+++V AM G PV+T D + V + V+ Q D
Sbjct: 254 NAVDLLVLSSSG--EAFPNVLVEAMACGTPVVTTDVGDARMIVGDTGWVVRPQDSADLAS 311
Query: 557 GLSRAFSLFISNGKLSKFART 577
GL A + + + AR
Sbjct: 312 GLRSALDWLRTTDRAALAARC 332
>gi|304316637|ref|YP_003851782.1| group 1 glycosyl transferase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302778139|gb|ADL68698.1| glycosyl transferase group 1 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 374
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 7/122 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
F F+ G+ Y + LQ++ + L ++R G+ D+N L DI + S +GF
Sbjct: 230 FAFVAGDGP--YLNYLQDMVQKAKL--ENIRFLGYRNDINDFLRNIDIFVLPSHS--EGF 283
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SKF 574
+ AMT G+PVI + I E + I ++P L+ A + +N L +KF
Sbjct: 284 GISVAEAMTLGVPVIATNVGGIPEIIENNEDGIIVNPESPNDLANAIEILATNTDLRNKF 343
Query: 575 AR 576
++
Sbjct: 344 SK 345
>gi|16127403|ref|NP_421967.1| glycosyl transferase family protein [Caulobacter crescentus CB15]
gi|221236212|ref|YP_002518649.1| glycosyltransferase [Caulobacter crescentus NA1000]
gi|13424845|gb|AAK25135.1| glycosyl transferase, group 1 family protein [Caulobacter
crescentus CB15]
gi|220965385|gb|ACL96741.1| glycosyltransferase [Caulobacter crescentus NA1000]
Length = 349
Score = 48.1 bits (113), Expect = 0.027, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE SV G+ GDV G L D+ + S Q +GFP ++ AM G+PV+ + E
Sbjct: 225 LEASVTLKGWTGDVAGFLATGDLFAFPSHQ--EGFPLTLLEAMAVGLPVVASEIDGPLEI 282
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ +G ++P+ L+ A + IS+
Sbjct: 283 LTDGRDGRLVPDNDPDRLAEALAELISD 310
>gi|384533390|ref|YP_005716054.1| group 1 glycosyl transferase [Sinorhizobium meliloti BL225C]
gi|333815566|gb|AEG08233.1| glycosyl transferase group 1 [Sinorhizobium meliloti BL225C]
Length = 369
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 11/146 (7%)
Query: 460 CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
GN N A+ E+ ++ GL HSV G D+ D+++ S +GFP++I
Sbjct: 221 AGNGLVRDNPAVIELMTQAGLPAHSVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVI 278
Query: 520 VRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF--ISNGKLSKFART 577
AM+FG P++T D + G I +P+ L+ A F + + +++ART
Sbjct: 279 AEAMSFGKPIVTTD--VGDAAAVAGKAGIAVPARDPQALADAMRAFLDLPEAEYARYART 336
Query: 578 VASAGRLHAKNMLALDCV-TRYARIL 602
R +N A+ V T+Y+ L
Sbjct: 337 ----ARERIENEYAIAAVTTKYSEFL 358
>gi|312199871|ref|YP_004019932.1| group 1 glycosyl transferase [Frankia sp. EuI1c]
gi|311231207|gb|ADP84062.1| glycosyl transferase group 1 [Frankia sp. EuI1c]
Length = 412
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 12/132 (9%)
Query: 452 GSFKFVFL-CGNSTDGYNDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGS 508
G+ L G G DA E A++LG+ E R G + D+ VL A +++ S
Sbjct: 250 GAVDLPLLHVGQPGWGVLDAAAE-AAKLGVGEGRFRSLGRIDDADLAVVLSRASVLVMPS 308
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
G P+ + AM G+PV+ D P + E A V+ DNP L+ A S ++N
Sbjct: 309 RSEGFGLPA--IEAMAHGVPVVVSDVPALVEVTGPAALVVPI--DNPTALADAVSHILTN 364
Query: 569 ----GKLSKFAR 576
G+LS+ R
Sbjct: 365 HALRGRLSRTGR 376
>gi|171059760|ref|YP_001792109.1| group 1 glycosyl transferase [Leptothrix cholodnii SP-6]
gi|170777205|gb|ACB35344.1| glycosyl transferase group 1 [Leptothrix cholodnii SP-6]
Length = 379
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 81/193 (41%), Gaps = 21/193 (10%)
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGS 453
+S + +LR E G L ++++ V+G + D VA+ + R V+
Sbjct: 179 ESADGRRLRAELGVLPGQVLIGVIGR-LHPLKGHADLIVALAQL--------YREGVD-- 227
Query: 454 FKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
F+ VF+ G +DALQ+ GL VR G DV VL DI S + +
Sbjct: 228 FQCVFVGGGD---LHDALQQQVDEAGL-NGVVRLLGQRSDVADVLAAIDIFAMPSRR--E 281
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG---- 569
G P ++ +M V+ I E + +G + + NP L+ A S + +
Sbjct: 282 GLPMALLESMAMARAVLATAVGSIPEVITDGENGMLVEPSNPSRLAAALSRLLRDAPLRE 341
Query: 570 KLSKFARTVASAG 582
K+ + AR AG
Sbjct: 342 KIGRAARATVEAG 354
>gi|384539104|ref|YP_005723188.1| putative glycosyltransferase protein [Sinorhizobium meliloti SM11]
gi|336037757|gb|AEH83687.1| putative glycosyltransferase protein [Sinorhizobium meliloti SM11]
Length = 369
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 70/151 (46%), Gaps = 11/151 (7%)
Query: 460 CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
GN N A+ E+ ++ GL HSV G D+ D+++ S +GFP++I
Sbjct: 221 AGNGLVRDNPAVIELMTQAGLPAHSVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVI 278
Query: 520 VRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF--ISNGKLSKFART 577
AM+FG P++T D + G I +P+ L+ A F + + +++ART
Sbjct: 279 AEAMSFGKPIVTTD--VGDAAAVAGKAGIAVPARDPQALADAMRAFLDLPEAEYARYART 336
Query: 578 VASAGRLHAKNMLALDCV-TRYARILENVLN 607
R +N A+ V T+Y+ L ++
Sbjct: 337 ----ARERIENEYAIAAVTTKYSEFLMALMK 363
>gi|399077996|ref|ZP_10752653.1| glycosyltransferase [Caulobacter sp. AP07]
gi|398034350|gb|EJL27621.1| glycosyltransferase [Caulobacter sp. AP07]
Length = 388
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 71/167 (42%), Gaps = 11/167 (6%)
Query: 445 ARRNSVEGSFKFVFLCGNST--DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMAD 502
AR +V+ + + L G+ Y L+ + ++ GL E SV+ G D+ L+AD
Sbjct: 224 ARLKAVDDARILLLLVGDDQGRKAYRAELERMIAQAGL-EDSVKLVGHCDDMPAAYLVAD 282
Query: 503 IVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
I + S + E F V G PV+ D +E V G + +PE + A
Sbjct: 283 IAIAPSLEPE-AFGRTAVEPQVMGKPVLAADHGAARETVLNGDTGWLVKPGDPEAWAEAL 341
Query: 563 SLFISNGKLSKFARTVASAGRLHAKNMLALDCVTR-----YARILEN 604
S G + + + A R A+ + ++D + YAR+LE
Sbjct: 342 SNACEAGAARR--QVMGMAARARARKLYSVDAMVEATLKVYARVLET 386
>gi|113475084|ref|YP_721145.1| group 1 glycosyl transferase [Trichodesmium erythraeum IMS101]
gi|110166132|gb|ABG50672.1| glycosyl transferase, group 1 [Trichodesmium erythraeum IMS101]
Length = 1991
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 112/265 (42%), Gaps = 24/265 (9%)
Query: 316 SIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPMLYSVLDAGN 375
+IP VW + E + Y R + + F I+F Y L++ +
Sbjct: 1714 NIPSVWNVHE---SEPWQTYFNRFGNEIAARALECFRYPYRIIFVSDATRNRYLPLNSHH 1770
Query: 376 FFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMH 435
F + + D+ ++ S + + R G +DEIV++++G+ H
Sbjct: 1771 NFTVIHNGLDLELLKKASAKWSRQEARSVLGVKEDEIVILLLGTVCERK--------GQH 1822
Query: 436 DVGPLL--IKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGL-LEHSVRHYGFNG 492
D+ L + R ++ FL G+ + Y+ L E+ L ++ V G
Sbjct: 1823 DLIRALSFMPEEERQKIK-----CFLVGDRPNLYSLKLHELVKNLPEEIQQRVEIVGETP 1877
Query: 493 DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI-TPDFPIIKEYVAEGAQVIFFQ 551
+ ADI + +S++E FP +I+ AM++ +P++ TP F I+ E V +F+
Sbjct: 1878 ETAKYYQAADIFV-CTSRIE-SFPRVILEAMSYSLPIVTTPVFGIV-EQVKPNINGLFYT 1934
Query: 552 KDNPEGLSRAF-SLFISNGKLSKFA 575
+NPE L+ SL I KFA
Sbjct: 1935 PENPEELANVLTSLLIDEELRHKFA 1959
>gi|323488277|ref|ZP_08093526.1| capsular polysaccharide biosynthesis protein [Planococcus
donghaensis MPA1U2]
gi|323398029|gb|EGA90826.1| capsular polysaccharide biosynthesis protein [Planococcus
donghaensis MPA1U2]
Length = 373
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 44/206 (21%), Positives = 86/206 (41%), Gaps = 23/206 (11%)
Query: 361 DYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSS 420
DY + + + G+ + IPG + E + +K +LRK G KD ++++ +G
Sbjct: 150 DYFAAKKFKLKNEGSVYSIPG--VGINTNEYQDITIDKDKLRKSLGLEKDNVILISMGDL 207
Query: 421 FFYNELSWDY-AVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLG 479
+ A+AM + S + +CG + N L+++A G
Sbjct: 208 IKRKNYTLSIEALAM---------------TKNSNVYYLICGQGPELEN--LKKLAKEKG 250
Query: 480 LLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKE 539
+E VR G D+ +L ++DI L+ + +++G P ++ AM G+P I +
Sbjct: 251 -VEQQVRFLGHRSDIKELLTISDIFLF--TTLQEGLPRSMMEAMASGLPCIASKIRGNTD 307
Query: 540 YVAEGAQVIFFQKDNPEGLSRAFSLF 565
+ G ++ G + A +
Sbjct: 308 LIESGNGGFLVNLNDTHGFTNAIEVL 333
>gi|337267111|ref|YP_004611166.1| group 1 glycosyl transferase [Mesorhizobium opportunistum WSM2075]
gi|336027421|gb|AEH87072.1| glycosyl transferase group 1 [Mesorhizobium opportunistum WSM2075]
Length = 374
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 14/125 (11%)
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
R G LE +VR GF D L A+ + + +GF IV A +FG PV+T +
Sbjct: 242 RAGGLEANVRFTGFVDDRTLATLYANCLFLAMPSLYEGFGFPIVEAQSFGKPVLTSNTSS 301
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVAS-----AGRL----HAK 587
+ E A+ A + +NP ++ AF+ F + +F + +A+ AGR HAK
Sbjct: 302 MPEVAADNA--VLVDPNNPVAIAAAFNRFCVD---VEFRQGIAADAKENAGRFTWENHAK 356
Query: 588 NMLAL 592
ML +
Sbjct: 357 GMLQI 361
>gi|167644792|ref|YP_001682455.1| group 1 glycosyl transferase [Caulobacter sp. K31]
gi|167347222|gb|ABZ69957.1| glycosyl transferase group 1 [Caulobacter sp. K31]
Length = 350
Score = 47.8 bits (112), Expect = 0.035, Method: Composition-based stats.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
VR G+ GDV+ L D+ S Q +GFP +++ AM G+PV + P E + +G
Sbjct: 230 VRLVGWRGDVSAFLAAGDVFALPSYQ--EGFPLVLLEAMAVGLPVASAAIPGPVEMIEDG 287
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL 584
+ +P L++A S I+ + S R + A R+
Sbjct: 288 VDGVLVPPGDPAALAQALSGLIAAPEKS---RALGVAARV 324
>gi|334320877|ref|YP_004557506.1| group 1 glycosyl transferase [Sinorhizobium meliloti AK83]
gi|407723538|ref|YP_006843199.1| group 1 glycosyl transferase [Sinorhizobium meliloti Rm41]
gi|334098616|gb|AEG56626.1| glycosyl transferase group 1 [Sinorhizobium meliloti AK83]
gi|407323598|emb|CCM72199.1| group 1 glycosyl transferase [Sinorhizobium meliloti Rm41]
Length = 360
Score = 47.8 bits (112), Expect = 0.039, Method: Composition-based stats.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 11/145 (7%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
GN N A+ E+ ++ GL H+V G D+ D+++ S +GFP++I
Sbjct: 222 GNGLVRDNPAVIELMTQAGLPAHAVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVIA 279
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS--NGKLSKFARTV 578
AM+FG P++T D + G I +P+ L+ A F+ + +++ART
Sbjct: 280 EAMSFGKPIVTTD--VGDAAAVAGKAGIAVPARDPQALADAMRAFLDLPEAEYARYART- 336
Query: 579 ASAGRLHAKNMLALDCV-TRYARIL 602
R +N A+ V T+Y+ L
Sbjct: 337 ---ARERIENEYAIAAVTTKYSEFL 358
>gi|430857861|ref|ZP_19475494.1| hypothetical protein OI3_02922 [Enterococcus faecium E1552]
gi|430547071|gb|ELA87013.1| hypothetical protein OI3_02922 [Enterococcus faecium E1552]
Length = 382
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 87/193 (45%), Gaps = 21/193 (10%)
Query: 379 IPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVG 438
IPG D+ ++ KS +K +R+E GF +++VV VG EL+ + H+
Sbjct: 177 IPGIGLDINKIDKLKKSTQK-DIREEVGFSNSDVIVVSVG------ELNEN---KNHETI 226
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
IK ++ K+ +CG T + L+ + ++LGL E V G+ D+ +L
Sbjct: 227 IRAIKL-----LDSKVKY-LICGIGTQ--KEYLESLITKLGLSER-VLLLGYRTDIVDIL 277
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
+DI + S + +G + AM +P++T + I +Y G ++ + G
Sbjct: 278 CKSDIFAFPS--IREGLGMAALEAMRCKLPLVTSNSHGINDYSVNGITGFSYKAKDTVGF 335
Query: 559 SRAFSLFISNGKL 571
S + + KL
Sbjct: 336 SNGIKMLSEDSKL 348
>gi|440749495|ref|ZP_20928741.1| hypothetical protein C943_1305 [Mariniradius saccharolyticus AK6]
gi|436481781|gb|ELP37927.1| hypothetical protein C943_1305 [Mariniradius saccharolyticus AK6]
Length = 359
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 493 DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQK 552
D+ L AD LY S+ + +G P+ I+ AM++G+P+I D + E V G+ K
Sbjct: 251 DLQSYFLDAD--LYLSTSINEGMPNTILEAMSYGLPIIATDVGDVSELVTTGSNGFVVMK 308
Query: 553 DNPEGLSRAFSLFI 566
N +G+ + S F+
Sbjct: 309 RNIDGIVESISYFL 322
>gi|375010491|ref|YP_004984124.1| glycosyl transferase group 1 [Geobacillus thermoleovorans
CCB_US3_UF5]
gi|359289340|gb|AEV21024.1| Glycosyl transferase group 1 [Geobacillus thermoleovorans
CCB_US3_UF5]
Length = 386
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L V+ GF D+ + MAD+++ S + +G P +I+ AM F PV+ D +E
Sbjct: 256 LSDVVKIIGFQTDIYPYIKMADLIVLTSEK--EGVPRIIMEAMAFSKPVVATDVLGTREL 313
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V +G + + N E L+ + + +S+ + K
Sbjct: 314 VVDGETGVLVEYKNVEQLASSIHVMLSDERKRK 346
>gi|332716343|ref|YP_004443809.1| Glycosyltransferase [Agrobacterium sp. H13-3]
gi|325063028|gb|ADY66718.1| Glycosyltransferase [Agrobacterium sp. H13-3]
Length = 382
Score = 47.0 bits (110), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 52/109 (47%), Gaps = 2/109 (1%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
D Y ++E ASRLG L+ VR GF DV ++ D V + +S V + F ++V AM
Sbjct: 245 DAYETRIREQASRLG-LDGRVRFLGFRSDVPELMASMDAVAH-TSVVAEPFGRVVVEAMM 302
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G PV+ + E + +G + + L+ A +S L++
Sbjct: 303 CGRPVVATRGGGVTEIIRDGETGLLVPPGDASALAAALGCILSQPALAE 351
>gi|423225463|ref|ZP_17211930.1| hypothetical protein HMPREF1062_04116 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392632391|gb|EIY26351.1| hypothetical protein HMPREF1062_04116 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 384
Score = 47.0 bits (110), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 92/214 (42%), Gaps = 23/214 (10%)
Query: 361 DYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQ-LRKENGFLKDEIVVVVVGS 419
DYT + + A N I G DV + S + Y+ LRK G D+++++ VG
Sbjct: 160 DYTRALKFK---ARNVRKIHGIGLDVKKFMDHILSVDFYKKLRKSIGISDDDVMILSVGE 216
Query: 420 SFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLG 479
+ +H ++IK + +++ V CG D L+ +AS L
Sbjct: 217 ------------LKIHKNQKVIIKAIAKCNLDNQHYVV--CGKGPD--KTKLELLASSLN 260
Query: 480 LLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKE 539
+ V GF D+ + +ADI ++ S V +G + AM G+P+I+ IK+
Sbjct: 261 I-RQKVHFVGFRKDIPDLCRVADIFVHPS--VREGLGIAPLEAMASGLPLISSTVGGIKD 317
Query: 540 YVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
Y +G + +G ++A + S+ L K
Sbjct: 318 YAHDGETGFCLSPYDVDGFAKALMILSSDKSLRK 351
>gi|311105200|ref|YP_003978053.1| glycosyl transferase group 1 [Achromobacter xylosoxidans A8]
gi|310759889|gb|ADP15338.1| glycosyl transferase, group 1 family protein 4 [Achromobacter
xylosoxidans A8]
Length = 365
Score = 47.0 bits (110), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 80/206 (38%), Gaps = 19/206 (9%)
Query: 378 VIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDV 437
V G A V++ E+ LR+E G D+IVV V + + + +
Sbjct: 148 VPAGKVATVYSPIVLPPPVERSTLREELGLADDDIVVGCVA----VMRATKGHKDLIDAI 203
Query: 438 GPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGV 497
PL+ + VF+ G S + Q+ + LGL + + G DV +
Sbjct: 204 APLMASRPK-------LHLVFVGGGSP--LFEQTQDYVAELGL-QDRIHLMGMRRDVPNL 253
Query: 498 LLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG 557
L D+ + Q G ++ V A G+PVI D + E +G I NPE
Sbjct: 254 LAGFDLFALATQQEASG--TVYVEAQASGLPVIGTDVGGVSEMFRDGETGILVPPKNPEA 311
Query: 558 LSRAFSLFISNGKLSKFARTVASAGR 583
L+ A I + L R + AGR
Sbjct: 312 LTAALVRLIDDADLR---RRMGEAGR 334
>gi|327398502|ref|YP_004339371.1| group 1 glycosyl transferase [Hippea maritima DSM 10411]
gi|327181131|gb|AEA33312.1| glycosyl transferase group 1 [Hippea maritima DSM 10411]
Length = 355
Score = 47.0 bits (110), Expect = 0.061, Method: Composition-based stats.
Identities = 35/138 (25%), Positives = 65/138 (47%), Gaps = 5/138 (3%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D ++E+ R + E + GF D+ + D+ + S +G ++ AM IP
Sbjct: 219 DKIRELIERKKMAE-KFKLIGFKKDIENYIKAFDLFVLPSDF--EGLSGAVLNAMLLKIP 275
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
V++ D + E V + I Q++NPE L++A + + L K + V +A RL +N
Sbjct: 276 VVSTDAGGLSEVVFDKETGILVQRNNPEILAKAIETVLEDKDLRK--KIVENAYRLVKEN 333
Query: 589 MLALDCVTRYARILENVL 606
V +Y ++ + +L
Sbjct: 334 FSVDKMVEKYIKLYKELL 351
>gi|430805345|ref|ZP_19432460.1| glycosyl transferase group 1 [Cupriavidus sp. HMR-1]
gi|429502382|gb|ELA00693.1| glycosyl transferase group 1 [Cupriavidus sp. HMR-1]
Length = 346
Score = 46.6 bits (109), Expect = 0.070, Method: Composition-based stats.
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 4/94 (4%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D L+E A +L + VR G ++ +L A + L S +GFP ++ AM +P
Sbjct: 202 DTLEEQAGQL--VPGRVRFVGSVSEIPNLLASAQVFLLASDH--EGFPISVLEAMRAELP 257
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
V+ D P I+E +A G I Q + P+ ++A
Sbjct: 258 VVASDLPGIREQLAGGRCGILVQGNEPQAFAQAL 291
>gi|149915995|ref|ZP_01904518.1| glycosyl transferase, group 1 [Roseobacter sp. AzwK-3b]
gi|149810069|gb|EDM69917.1| glycosyl transferase, group 1 [Roseobacter sp. AzwK-3b]
Length = 385
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 57/119 (47%), Gaps = 7/119 (5%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
LQ + LGL E V GF D + ADI + S +GF +++V AM G PV+
Sbjct: 249 LQVMVMELGLAER-VTFLGFQRDPFSYMRAADIFVLSSRW--EGFGNVLVEAMAMGTPVV 305
Query: 531 TPDFP-IIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
+ D P E +A+G + D PE L+ + I + L R + AG++ A++
Sbjct: 306 STDCPHGPAEIIADGETGLLVPVDQPEALAESLQRLIDDPALR---RRLGEAGKVRAQD 361
>gi|149915996|ref|ZP_01904519.1| putative transferase [Roseobacter sp. AzwK-3b]
gi|149810070|gb|EDM69918.1| putative transferase [Roseobacter sp. AzwK-3b]
Length = 380
Score = 46.6 bits (109), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 58/137 (42%), Gaps = 9/137 (6%)
Query: 434 MHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGD 493
+ DV P L R + G FK T+ Y ++ + LGL E V +GF D
Sbjct: 214 IRDVLPQLADLHVRLHLVGDFK------PHTESYAQVCADMVAALGL-EDRVVFHGFRSD 266
Query: 494 VNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
V + DIVL S + +G ++ AM G PV++ D +E + I D
Sbjct: 267 VADWMAALDIVLVASRR--EGLARCMIEAMACGTPVVSVDVCSAREMLESTGAGIVVGMD 324
Query: 554 NPEGLSRAFSLFISNGK 570
+ GL+ A ++ K
Sbjct: 325 DWAGLAAALRDLSTDSK 341
>gi|15891381|ref|NP_357053.1| glycosyltransferase [Agrobacterium fabrum str. C58]
gi|15159774|gb|AAK89838.1| glycosyltransferase [Agrobacterium fabrum str. C58]
Length = 382
Score = 46.6 bits (109), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
+ Y ++E ASRLGL + VR GF DV ++ D+V + +S V + F ++V AM
Sbjct: 245 EAYEARIREQASRLGL-DGRVRFLGFRSDVPELMASMDVVAH-TSIVAEPFGRVVVEAMM 302
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G PV+ + E + +G + + L+ A +S+ L++
Sbjct: 303 CGRPVVATRGGGVTEIIRDGETGLLVPPGDASALAAALGTILSDPALAQ 351
>gi|89095715|ref|ZP_01168609.1| capsular polysaccharide biosynthesis protein [Bacillus sp. NRRL
B-14911]
gi|89089461|gb|EAR68568.1| capsular polysaccharide biosynthesis protein [Bacillus sp. NRRL
B-14911]
Length = 375
Score = 46.6 bits (109), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D+ +E+A++ G+ + SV G+ D+ +L ++DI + SS +G P I+ AM G+P
Sbjct: 243 DSYRELANKFGISD-SVNFLGYRNDIPNLLAISDIGV--SSSRREGLPVNILEAMATGLP 299
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
+I + ++ V EG ++++ EG +RA + L K
Sbjct: 300 IIATECRGNRDLVHEGENGYILRENDIEGFARAIEELYKSQNLRK 344
>gi|399051248|ref|ZP_10741170.1| glycosyltransferase [Brevibacillus sp. CF112]
gi|398050825|gb|EJL43170.1| glycosyltransferase [Brevibacillus sp. CF112]
Length = 408
Score = 46.6 bits (109), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 71/180 (39%), Gaps = 24/180 (13%)
Query: 397 EKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKF 456
E ++R G D++V+ VG V + LL ++R G +
Sbjct: 188 EAREVRSALGIGADDLVIAYVGR-----------MVGEKGIFELLDAFSRLARESGRVRL 236
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRH-----YGFNGDVNGVLLMADIVLYGSSQV 511
L G+ +D D + L EH +H GF D+ +L +DI + S +
Sbjct: 237 -LLVGDVSDSERDQRGK-----ALREHCRKHPQIVLTGFRRDIPQLLAASDIFVLPSHR- 289
Query: 512 EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+G P I+ AM G P++ + +E V +G I + E L +A + +L
Sbjct: 290 -EGLPRSIIEAMAMGKPIVATNIRGCREEVTDGVNGILVEPKQSEHLYKALKKLACDARL 348
>gi|288959487|ref|YP_003449828.1| glycosyltransferase [Azospirillum sp. B510]
gi|288911795|dbj|BAI73284.1| glycosyltransferase [Azospirillum sp. B510]
Length = 420
Score = 46.6 bits (109), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 62/141 (43%), Gaps = 5/141 (3%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
GY + L+ + GL VR GD+ L++ +V+ S + E F +IV A
Sbjct: 280 GYREELENRIANAGL-RGMVRMTDHCGDMAAAYLLSTVVVSASREPE-AFGRVIVEAQAM 337
Query: 526 GIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS---NGKLSKFARTVASAG 582
G PVI +E V G D+P+ L+RA +S + + AR +A
Sbjct: 338 GKPVIVSAIGAYQETVLPGETAWVVPPDDPDALARALDEALSLTPEQREAIGARAMAFVA 397
Query: 583 RLHAKNMLALDCVTRYARILE 603
+ ++ + D + YA +L+
Sbjct: 398 DRYTRDRMCADTLAVYAELLQ 418
>gi|169835624|ref|ZP_02868812.1| glycosyl transferase group 1 [candidate division TM7 single-cell
isolate TM7a]
Length = 477
Score = 46.6 bits (109), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 90/210 (42%), Gaps = 19/210 (9%)
Query: 375 NFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAM 434
N +PG D + + +K +LRK G KD+ V++ + N+ +
Sbjct: 164 NVIYMPGVGVDPKRFKPKLTAKQKLELRKSLGLKKDDFVMIY-PAELNKNKNQTLLLHVL 222
Query: 435 HDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDV 494
H++ N + S + ++ GY+ L A +LG+ + +V G+ D+
Sbjct: 223 HEI----------NKEDKSVHLLLAGKDNLSGYHKKL---ADKLGVAK-NVHFLGYRSDI 268
Query: 495 NGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDN 554
+L M+D+ + S + +G P ++ AM G+P++T E V +G D
Sbjct: 269 PQLLQMSDLSVSASHR--EGLPVHLIEAMFAGLPIVTTKCRGATELVEDGVNGFVVGFDK 326
Query: 555 PEGLSRAFSLFISNGKLSK-FARTVASAGR 583
E + R + I N +L K + ++A R
Sbjct: 327 VEFVERVQKI-IENTQLRKSLGKKSSAAAR 355
>gi|148657743|ref|YP_001277948.1| group 1 glycosyl transferase [Roseiflexus sp. RS-1]
gi|148569853|gb|ABQ91998.1| glycosyl transferase, group 1 [Roseiflexus sp. RS-1]
Length = 387
Score = 46.6 bits (109), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 80/173 (46%), Gaps = 22/173 (12%)
Query: 397 EKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKF 456
E +R+E G DE +V+++ + F + D A +G RR++
Sbjct: 191 EAEAVRREIGVAPDEALVLMI-AEFIPRKRHTDVLRAFAQLG-------RRDT------- 235
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHY-GFNGDVNGVLLMADIVLYGSSQVEQGF 515
L T + ++ +A LG+ + H+ G+ DV +L A +++ S Q +G
Sbjct: 236 -HLALAGTGPLLEPMRRLAVDLGIAGQT--HFLGYRSDVPALLRAATVMILPSRQ--EGL 290
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
P I+ AM G+PVI D +++ +A+GA ++ D E L+ A + I +
Sbjct: 291 PRSILEAMAMGVPVIGSDIRGVRDLLADGAGMLVPVGDI-EALAHAMARVIDD 342
>gi|114321479|ref|YP_743162.1| group 1 glycosyl transferase [Alkalilimnicola ehrlichii MLHE-1]
gi|114227873|gb|ABI57672.1| glycosyl transferase, group 1 [Alkalilimnicola ehrlichii MLHE-1]
Length = 375
Score = 46.2 bits (108), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 50/100 (50%), Gaps = 3/100 (3%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
+L+E + R GL E+ V GF + + AD+ + S+ +GF + I+ A G+ +
Sbjct: 245 SLEERSRRHGL-ENFVSFPGFFPNPYACMKRADV--FVSASRYEGFANAILEAAALGVKI 301
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG 569
+ D P + E A++ F+ +NP L RA I NG
Sbjct: 302 VASDCPGANREIKELARIRLFENENPADLCRALKEIIDNG 341
>gi|434385579|ref|YP_007096190.1| glycosyltransferase [Chamaesiphon minutus PCC 6605]
gi|428016569|gb|AFY92663.1| glycosyltransferase [Chamaesiphon minutus PCC 6605]
Length = 481
Score = 46.2 bits (108), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 53/117 (45%), Gaps = 7/117 (5%)
Query: 505 LYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQV--IFFQKDNPEGLSRAF 562
+Y ++GFP + AM G+P++ D P I + + +G + +D+ L+ A
Sbjct: 364 VYAMPSRQEGFPVAPIEAMACGLPIVATDAPGIPDILEDGERSGGSMVPRDHAWALADAL 423
Query: 563 SLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR--ILENVLNFPSDALLPGP 617
FI +A+TV A RL + A D V R R +L N+ P+ L P
Sbjct: 424 GRFIDR---PDWAQTVGDAARLRVEQQFAADVVGRQLRRCLLPNLPLVPNSMQLAQP 477
>gi|225872792|ref|YP_002754249.1| glycosyl transferase family protein [Acidobacterium capsulatum ATCC
51196]
gi|225792866|gb|ACO32956.1| glycosyl transferase, group 1 family [Acidobacterium capsulatum
ATCC 51196]
Length = 512
Score = 46.2 bits (108), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 77/173 (44%), Gaps = 18/173 (10%)
Query: 374 GNFFVIP-GSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAV 432
G VIP G DV+ + K +LR DE++ + G Y+++
Sbjct: 153 GRLDVIPLGLDTDVF------RPGNKAELRSRLNLPADEVLALYFGRFSHYDKM------ 200
Query: 433 AMHDVGPLLIKYARRNSVEGSFKFVFLCGN-STDGYNDALQEVASRLGLLEHSVRHYGF- 490
D+ PLL+ + G + L G+ S Y++ +++ A+++G+ + +
Sbjct: 201 ---DLFPLLLAFKMVLDGCGQKTSLVLAGSESFHKYSERVRQFAAQIGIADRVILRTDIP 257
Query: 491 NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
DV AD+ + S +++ F IV AM+ G+PVI D+ KE V +
Sbjct: 258 EADVTAYYAAADLFVSPSDNLQETFGQSIVEAMSSGLPVICSDWDGYKELVVQ 310
>gi|433611090|ref|YP_007194551.1| Glycosyltransferase [Sinorhizobium meliloti GR4]
gi|429556032|gb|AGA10952.1| Glycosyltransferase [Sinorhizobium meliloti GR4]
Length = 360
Score = 46.2 bits (108), Expect = 0.098, Method: Composition-based stats.
Identities = 33/119 (27%), Positives = 57/119 (47%), Gaps = 6/119 (5%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
GN N A+ E+ ++ GL H+V G D+ D+++ S +GFP++I
Sbjct: 222 GNGLVSDNPAVIELMTQAGLPAHAVDLRGEISDMPAFYQSIDLLVLSSRT--EGFPNVIA 279
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS--NGKLSKFART 577
AM+FG P++T D + G I +P+ L+ A F+ + +++ART
Sbjct: 280 EAMSFGKPIVTTD--VGDAAAVAGKAGIAVPARDPQALAGAMRAFLDLPEAEYARYART 336
>gi|312112581|ref|YP_003990897.1| group 1 glycosyl transferase [Geobacillus sp. Y4.1MC1]
gi|311217682|gb|ADP76286.1| glycosyl transferase group 1 [Geobacillus sp. Y4.1MC1]
Length = 311
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 72/164 (43%), Gaps = 18/164 (10%)
Query: 440 LLIKYARRNSVEGSFKFVFLCGNSTDGYN----DALQEVASRLGLLEHSVRHYGF--NGD 493
+L YAR + + + G++ Y +E++ L ++ VR G N
Sbjct: 134 ILRAYARVKEIIPESVLLIVGGDTLFDYQYYRKSFFEELSRLLSYVQKGVRIVGAVDNQT 193
Query: 494 VNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
+ + +AD + S +++G+ I+ AM G+PVI D P+ +EY+ + + +
Sbjct: 194 LFHLYHIADCFVQPS--IKEGWGLAILEAMAVGVPVIASDIPVFQEYLVDSHNALLANPE 251
Query: 554 NPEGLSRAF----------SLFISNGKLSKFARTVASAGRLHAK 587
+ E ++R + +SNGK + T +A R H K
Sbjct: 252 DDEAIARQMIRAVKEKDLAEMLVSNGKATAGRYTWMAAAREHMK 295
>gi|322418497|ref|YP_004197720.1| group 1 glycosyl transferase [Geobacter sp. M18]
gi|320124884|gb|ADW12444.1| glycosyl transferase group 1 [Geobacter sp. M18]
Length = 973
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 68/159 (42%), Gaps = 15/159 (9%)
Query: 401 LRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLC 460
LRKE G ++ + V V S Y + LI A + V+ +
Sbjct: 191 LRKELGASPEQQIAVTVTSLQAYKGVDN------------LIAAAAKACVQDERLVFAIA 238
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
G+ AL++ L L E V H GF D+ +L AD+ + S++ + FP +++
Sbjct: 239 GSGPPEAVAALRDQVKELKL-EGKVFHLGFRNDIADILAGADLFVLPSAK--EAFPLVVL 295
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS 559
AM+ G PV+ D +E V +G +P+ L+
Sbjct: 296 EAMSHGRPVVATDCGGTREMVIDGETGFVVPVKDPDALA 334
>gi|253701079|ref|YP_003022268.1| group 1 glycosyl transferase [Geobacter sp. M21]
gi|251775929|gb|ACT18510.1| glycosyl transferase group 1 [Geobacter sp. M21]
Length = 412
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 465 DGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
DGY D L++ +RLGL E V G D+ V + D+ L S V +G P+ ++
Sbjct: 254 DGYGDELEQARGEVARLGL-EKVVHFTGHRNDLRDVYVSFDVFLMTS--VTEGLPNTLLE 310
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SKFAR 576
AM G+P ++ D I E + +G + E L+R + + L +F+R
Sbjct: 311 AMALGVPSVSTDVGGIPELLQDGEGGYLAPAGDAEKLARRVLELLGSADLRERFSR 366
>gi|88812786|ref|ZP_01128032.1| glycosyl transferase, group 1 family protein [Nitrococcus mobilis
Nb-231]
gi|88790024|gb|EAR21145.1| glycosyl transferase, group 1 family protein [Nitrococcus mobilis
Nb-231]
Length = 365
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
E V GF DV VL AD +++ + +++ F + +V M G+ VI PD P +E V
Sbjct: 244 EEHVHFLGFTDDVAAVLKAADFLVHPA--LKEPFGTSLVEGMAAGLAVIAPDLPGPREIV 301
Query: 542 AEGAQVIFFQKDNPEGLSRAFS 563
E +F + N + L+R +
Sbjct: 302 VEDESALFHEPGNQDELARRMA 323
>gi|297624651|ref|YP_003706085.1| glycosyl transferase group 1 protein [Truepera radiovictrix DSM
17093]
gi|297165831|gb|ADI15542.1| glycosyl transferase group 1 [Truepera radiovictrix DSM 17093]
Length = 380
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 81/183 (44%), Gaps = 16/183 (8%)
Query: 430 YAVAMHDVGP------LLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEH 483
Y VA+ + P LL YAR + + V + G + L+E+A+ LG+ E
Sbjct: 203 YLVAVGRLEPQKGFDILLRAYARLRAAGVTHPLV-IVGEGREAAR--LRELAASLGV-ED 258
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
VR GF + + A++ + SS +GF +I AM G PV+ D P V E
Sbjct: 259 GVRFPGFQENPYAWIRGAEV--FVSSSRFEGFCRVIAEAMAVGTPVVATDCPSGPAEVLE 316
Query: 544 GAQV-IFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARIL 602
G + + + ++PE L++ + +S+ + R GR + V R+ +L
Sbjct: 317 GGRAGVLVRSEDPEALAKGIAGLLSDPEARARFR---ERGRERVRAFSPERVVARFGEVL 373
Query: 603 ENV 605
E V
Sbjct: 374 EGV 376
>gi|423721540|ref|ZP_17695722.1| glycosyltransferase, GT1 family [Geobacillus thermoglucosidans
TNO-09.020]
gi|383365343|gb|EID42639.1| glycosyltransferase, GT1 family [Geobacillus thermoglucosidans
TNO-09.020]
Length = 133
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF-------- 562
+++G+ I+ AM G+PVI D P+ +EY+ + + ++ E ++R
Sbjct: 31 IKEGWGLAILEAMAVGVPVIASDIPVFQEYLVDSHNALLANPEDDEAIARQMIRAVKEKD 90
Query: 563 --SLFISNGKLSKFARTVASAGRLHAK 587
+ +SNGK + T +A R H K
Sbjct: 91 LAEMLVSNGKATAGRYTWMAAAREHMK 117
>gi|51891918|ref|YP_074609.1| lipopolysaccharide N-acetylglucosaminyltransferase [Symbiobacterium
thermophilum IAM 14863]
gi|51855607|dbj|BAD39765.1| putative lipopolysaccharide N-acetylglucosaminyltransferase
[Symbiobacterium thermophilum IAM 14863]
Length = 375
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 54/109 (49%), Gaps = 4/109 (3%)
Query: 468 NDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
+D ++++A R L +VR G+ G+V+G MAD V +SQ E+ + AM
Sbjct: 239 DDYVRDLARRAAELGDAVRMAGYVPYGEVDGYFRMAD-VFVCASQWEEPLARVHYEAMAC 297
Query: 526 GIPVITPDFPIIKEYVAEG-AQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G+P++T D E VAEG +I D PE + A + + L +
Sbjct: 298 GLPIVTTDRGGNAEVVAEGRGGLIVRPHDRPEAFAAAIRTLLDDPALRR 346
>gi|418054774|ref|ZP_12692830.1| glycosyl transferase group 1 [Hyphomicrobium denitrificans 1NES1]
gi|353212399|gb|EHB77799.1| glycosyl transferase group 1 [Hyphomicrobium denitrificans 1NES1]
Length = 220
Score = 45.8 bits (107), Expect = 0.13, Method: Composition-based stats.
Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 2/101 (1%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ A LG+ E S G DV +L D+ + +++ ++GF +++ AM GIPV+
Sbjct: 78 LEAQARSLGIAE-STSFLGNQTDVARLLGQVDVFAFSTTR-DEGFGIVLIEAMAAGIPVV 135
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
D +E +A+G I +P L+ A S +++ +L
Sbjct: 136 ASDVAACREVLADGEAGILVSPSDPVALASAISRVLNSPEL 176
>gi|334120199|ref|ZP_08494281.1| glycosyl transferase group 1 [Microcoleus vaginatus FGP-2]
gi|333456987|gb|EGK85614.1| glycosyl transferase group 1 [Microcoleus vaginatus FGP-2]
Length = 1785
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 114/284 (40%), Gaps = 25/284 (8%)
Query: 292 FDGIIADSLEAKEAISSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVF 351
D I A++LE + + Q IP+VW + E + Y R + + F
Sbjct: 1490 IDAIYANTLENFFVVDAAHQ---IGIPVVWNVHE---SEPWQTYFNRFGSEIAARALECF 1543
Query: 352 S-RVNVIVFPDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKD 410
VI D T +S+ NF VI D+ +E S RK G +
Sbjct: 1544 RFPYKVIFVADATRDRYWSLNSHHNFTVIHNG-LDLSKLENSDNSESA---RKSLGVAAE 1599
Query: 411 EIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDA 470
++V++++G+ + D A+ L+ N + F+ G+ Y++
Sbjct: 1600 DVVILLLGT-VCERKGQQDLVKALS-----LLSDKWHNKIR-----CFIVGDRPSIYSNK 1648
Query: 471 LQEVASRLGL-LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
L E+ L L V G+ ADI + +S+VE FP +I+ AM +P+
Sbjct: 1649 LAELVGELPEELRQRVTVVPETGETGKYYKAADIFV-CTSRVE-SFPRVILEAMACELPI 1706
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
IT I+E V G +F+ D PE L+ + + + L +
Sbjct: 1707 ITTPVFGIREQVRPGINGLFYTPDRPEELAASLHTLLEDKSLRQ 1750
>gi|160879353|ref|YP_001558321.1| glycosyl transferase group 1 [Clostridium phytofermentans ISDg]
gi|160428019|gb|ABX41582.1| glycosyl transferase group 1 [Clostridium phytofermentans ISDg]
Length = 409
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 56/108 (51%), Gaps = 8/108 (7%)
Query: 467 YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLL-MADIVLYGSSQVEQGFPSLIVRAMTF 525
YN+ Q++ + H + NG N V+ + D L+ + +GFP+ +V AM
Sbjct: 259 YNNLYQKIQAL-----HLENNIFLNGIKNNVMQSVHDAKLFVMTSDFEGFPNALVEAMAS 313
Query: 526 GIPVITPDFP--IIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
G+PVI+ DFP I KE + +G +++ + L++A ++N +L
Sbjct: 314 GLPVISTDFPTGIAKELIKDGVNGYVVPRNDKKELAKAMVKILNNSEL 361
>gi|224113169|ref|XP_002316414.1| predicted protein [Populus trichocarpa]
gi|222865454|gb|EEF02585.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 45.8 bits (107), Expect = 0.15, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 26/30 (86%)
Query: 228 ILGNMAKDSRSLLLITVVKNLQKLGYVFKI 257
ILGNM KD +SL+L++V+KNL+KLGY K+
Sbjct: 71 ILGNMKKDPQSLMLLSVMKNLRKLGYALKV 100
>gi|433543079|ref|ZP_20499493.1| polysaccharide biosynthesis protein [Brevibacillus agri BAB-2500]
gi|432185618|gb|ELK43105.1| polysaccharide biosynthesis protein [Brevibacillus agri BAB-2500]
Length = 408
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 70/180 (38%), Gaps = 24/180 (13%)
Query: 397 EKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKF 456
E ++R G D++V+ VG V + LL ++R G +
Sbjct: 188 EARKVRSALGIGADDLVIAYVGR-----------MVGEKGIFELLDAFSRLARESGRVRL 236
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRH-----YGFNGDVNGVLLMADIVLYGSSQV 511
L G+ +D D + L EH H GF D+ +L +DI + S +
Sbjct: 237 -LLVGDVSDSERDQRGK-----ALREHCREHPQIVLTGFRQDIPQLLAASDIFVLPSHR- 289
Query: 512 EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+G P I+ AM G P++ + +E V +G I + E L +A + +L
Sbjct: 290 -EGLPRSIIEAMAMGKPIVATNIRGCREEVTDGVNGILVEPKQSEHLYKALKKLACDARL 348
>gi|124008210|ref|ZP_01692907.1| glycosyl transferase, group 1 family protein [Microscilla marina
ATCC 23134]
gi|123986309|gb|EAY26131.1| glycosyl transferase, group 1 family protein [Microscilla marina
ATCC 23134]
Length = 416
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 59/135 (43%), Gaps = 9/135 (6%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGD-----VNGVLLMADIVLYGSSQV 511
+ +C ++ +V +LGL + Y V+G + + S +
Sbjct: 262 LVVCASTKQKKARFYHKVIEKLGLTNRVIWQYQLPKRTLYDWVHGAVFSLAPLKECSRNL 321
Query: 512 EQGF-PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
EQG P I+ +M G PV+ D P++KE +++ + + D P L+RA + +
Sbjct: 322 EQGCCPLKILESMAVGTPVVASDLPVVKEIISDASLGKLVRADRPAELARAMRILLDYPH 381
Query: 571 LSKFARTVASAGRLH 585
L + ++ AG+ H
Sbjct: 382 LRQ---QMSEAGKAH 393
>gi|158424541|ref|YP_001525833.1| glycosyltransferase [Azorhizobium caulinodans ORS 571]
gi|158331430|dbj|BAF88915.1| glycosyltransferase [Azorhizobium caulinodans ORS 571]
Length = 794
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/123 (24%), Positives = 56/123 (45%), Gaps = 2/123 (1%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
+QE+ LGL +H V DV L + L +S VE FP ++ A+ +G PV+
Sbjct: 648 VQEICDDLGLEQHIVFMRRVGDDVLAWLYRHAVALCLTSSVEGNFPPQVLEALAYGCPVV 707
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEG-LSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
+ P I E + G +++ + + +R ++ + + R A G + +N
Sbjct: 708 STRLPTITEVLPGGFEMLLLCNERDQADFTRKIAIAMEQ-RDQTLLRQAAVMGFMRERNS 766
Query: 590 LAL 592
+A+
Sbjct: 767 IAV 769
>gi|423722316|ref|ZP_17696492.1| hypothetical protein HMPREF1078_00555 [Parabacteroides merdae
CL09T00C40]
gi|409242457|gb|EKN35219.1| hypothetical protein HMPREF1078_00555 [Parabacteroides merdae
CL09T00C40]
Length = 313
Score = 45.4 bits (106), Expect = 0.16, Method: Composition-based stats.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
NS D +++L+++ + GL + G D+N +L ++DI L S +G I+
Sbjct: 177 NSED--SNSLRQLIAEAGL-NRKIMSLGIRNDINDILNISDIYLQPSRT--EGLSLSIME 231
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
A+ + +PVI I E V EG F+K+N E L+ + ++N
Sbjct: 232 ALNYSLPVIGTRVGGIPEIVHEGENGYLFEKENVEELADRIEILVNN 278
>gi|423336858|ref|ZP_17314605.1| hypothetical protein HMPREF1059_00557 [Parabacteroides distasonis
CL09T03C24]
gi|409239877|gb|EKN32660.1| hypothetical protein HMPREF1059_00557 [Parabacteroides distasonis
CL09T03C24]
Length = 364
Score = 45.4 bits (106), Expect = 0.16, Method: Composition-based stats.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
NS D +++L+++ + GL + G D+N +L ++DI L S +G I+
Sbjct: 228 NSED--SNSLRQLIAEAGL-NRKIMSLGIRNDINDILNISDIYLQPSRT--EGLSLSIME 282
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
A+ + +PVI I E V EG F+K+N E L+ + ++N
Sbjct: 283 ALNYSLPVIGTRVGGIPEIVHEGENGYLFEKENVEELADRIEILVNN 329
>gi|301310609|ref|ZP_07216548.1| putative glycosyltransferase, group 1 family [Bacteroides sp. 20_3]
gi|300832183|gb|EFK62814.1| putative glycosyltransferase, group 1 family [Bacteroides sp. 20_3]
Length = 317
Score = 45.4 bits (106), Expect = 0.16, Method: Composition-based stats.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
NS D +++L+++ + GL + G D+N +L ++DI L S +G I+
Sbjct: 181 NSED--SNSLRQLIAEAGL-NRKIMSLGIRNDINDILNISDIYLQPSRT--EGLSLSIME 235
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
A+ + +PVI I E V EG F+K+N E L+ + ++N
Sbjct: 236 ALNYSLPVIGTRVGGIPEIVHEGENGYLFEKENVEELADRIEILVNN 282
>gi|293604442|ref|ZP_06686849.1| UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide)
pyrophosphoryl-undecaprenol N-acetylglucosamine
transferase [Achromobacter piechaudii ATCC 43553]
gi|292817319|gb|EFF76393.1| UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide)
pyrophosphoryl-undecaprenol N-acetylglucosamine
transferase [Achromobacter piechaudii ATCC 43553]
Length = 366
Score = 45.4 bits (106), Expect = 0.16, Method: Composition-based stats.
Identities = 59/235 (25%), Positives = 94/235 (40%), Gaps = 33/235 (14%)
Query: 376 FFVIPGSPAD----VWAVEAYSKSHEKYQLRKENGFLKDEIV---VVVVGSSFFYNELSW 428
+ + G PAD V++ E LR E G D++V V V+ ++ + +L
Sbjct: 143 YLIERGVPADRIATVYSPIVLPAPVEHSTLRDELGLADDDVVMGCVAVMRATKGHKDL-- 200
Query: 429 DYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHY 488
+ + PL+ VF+ G S + QE ++LGL EH +
Sbjct: 201 -----IDAMTPLM-------QTRPKLHLVFVGGGSP--VFEQTQEYVAQLGL-EHRIHLM 245
Query: 489 GFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVI 548
G DV +L D+ + Q G ++ V A G+PV+ D + E +G I
Sbjct: 246 GMRRDVPNLLAGFDVFALATQQEASG--TVFVEAQASGLPVVGTDVGGVSEMFRDGETGI 303
Query: 549 FFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILE 603
NP+ L A I + L R + +AGR M+ + V AR+ E
Sbjct: 304 LVPPKNPQALVDALQRLIDDPALR---RQMGAAGR----KMVWEEAVFSPARLAE 351
>gi|333395367|ref|ZP_08477186.1| glycosyltransferase [Lactobacillus coryniformis subsp. coryniformis
KCTC 3167]
Length = 382
Score = 45.4 bits (106), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 79/170 (46%), Gaps = 24/170 (14%)
Query: 379 IPGSPADVWAVEAYSKSHEKYQL-RKENGFLKDEIVVVVVGSSFFYNELSW--DYAVAMH 435
IPG + + + + +K L RKE G ++ V++ VG ELS ++ V +
Sbjct: 177 IPGVGINYKNITMMAPTQDKISLLRKEFGIPENSFVIISVG------ELSQRKNHQVIIK 230
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVN 495
+ L + +V F +CG+ + N L+ +A +L + V+ G+ D++
Sbjct: 231 ALAKL-----KSQNV-----FYLICGHGNEKKN--LELLAEKLNI-SKKVKLLGYRTDIS 277
Query: 496 GVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
+L MADI + S + G L AM G+P+IT + IK+Y G
Sbjct: 278 DLLHMADISAFPSRREGLGLAGL--EAMAAGLPLITSNVQGIKDYSINGV 325
>gi|282856186|ref|ZP_06265469.1| glycosyltransferase, family 1 [Pyramidobacter piscolens W5455]
gi|282585945|gb|EFB91230.1| glycosyltransferase, family 1 [Pyramidobacter piscolens W5455]
Length = 365
Score = 45.4 bits (106), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 92/215 (42%), Gaps = 18/215 (8%)
Query: 390 EAYSKSHE-KYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRN 448
E Y + E + Q R++ DE V++G F +WD + + P
Sbjct: 155 EKYVRDAEVRRQFRRQLSVRGDE--TVILGMGRFVGWKAWDDYLRAVALLP--------K 204
Query: 449 SVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGS 508
V+ F +L G+ + +L+E+A RL LEH VR + F DV L AD+ + S
Sbjct: 205 DVKAHF---WLVGSGPE--EASLKELA-RLLELEHRVRFFPFAADVRPWLWAADLFVQTS 258
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ E GF +++ AM G+ I D + V +G + F+ + + LS +
Sbjct: 259 KKPE-GFSLMLIEAMAAGVVPIATDIGGTLDIVEDGENGLLFRPSDVKFLSVLMMRGMDP 317
Query: 569 GKLSKFARTVASAGRLHAKNMLALDCVTRYARILE 603
K + + + + + +A+ V Y R L+
Sbjct: 318 SLRRKLSENASKSAAEVSVSRIAVQTVALYKRTLQ 352
>gi|428297829|ref|YP_007136135.1| group 1 glycosyl transferase [Calothrix sp. PCC 6303]
gi|428234373|gb|AFZ00163.1| glycosyl transferase group 1 [Calothrix sp. PCC 6303]
Length = 379
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 4/100 (4%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D +QE+ S LG+ + VR G D+ ++ +A L S Q +G P ++ +M+ G P
Sbjct: 247 DKMQELTSELGV-QKQVRFLGLRQDIPTLMRVATATLLASEQ--EGLPRCVMESMSLGTP 303
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
VI ++ + +G + + EGLS A + + N
Sbjct: 304 VIGTSIRGTQDLLKDGCG-FLVEVGDVEGLSNAIAYVLDN 342
>gi|288928640|ref|ZP_06422486.1| lipopolysaccharide biosynthesis protein [Prevotella sp. oral taxon
317 str. F0108]
gi|288329624|gb|EFC68209.1| lipopolysaccharide biosynthesis protein [Prevotella sp. oral taxon
317 str. F0108]
Length = 399
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
SV++ G+ DV LL AD +++ S + +GFP+++++A G+P I D E V +
Sbjct: 275 SVKYVGYQTDVRPYLLAADTLVFPSYR--EGFPNVVMQAGAMGLPSIVTDINGCNEIVVD 332
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISN-GKLSKFA 575
G + L A +F+ G ++K A
Sbjct: 333 GTNGKIIPPKDETALLAAIEVFLKERGAVAKMA 365
>gi|325269294|ref|ZP_08135911.1| capsular polysaccharide biosynthesis glycosyl transferase
[Prevotella multiformis DSM 16608]
gi|324988215|gb|EGC20181.1| capsular polysaccharide biosynthesis glycosyl transferase
[Prevotella multiformis DSM 16608]
Length = 401
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 79/194 (40%), Gaps = 21/194 (10%)
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGS 453
+S K LR+E GF ++ V V VG ++D G + +A + + G
Sbjct: 204 ESPVKESLRRELGFESEDFVFVFVGR-------------IVNDKGMRELSFAMQGLLGGK 250
Query: 454 FKF---VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQ 510
+ L GN D L + + SVR + DV L AD +++ S +
Sbjct: 251 HPLRPKLLLVGNFEPEL-DPLDAEDTSFFMHNPSVRFVNYQKDVRPYLATADALVFPSYR 309
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
+GFP++++ A G+P I D E V EG NPE L +S+
Sbjct: 310 --EGFPNVVLEAGCMGLPSIVTDINGSNEIVREGINGRIVPPRNPEALLAVMEWTLSHP- 366
Query: 571 LSKFARTVASAGRL 584
S AR ++ R+
Sbjct: 367 -SDVARMAQNSRRI 379
>gi|167755361|ref|ZP_02427488.1| hypothetical protein CLORAM_00875 [Clostridium ramosum DSM 1402]
gi|167704300|gb|EDS18879.1| glycosyltransferase, group 1 family protein [Clostridium ramosum
DSM 1402]
Length = 375
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
+CGN + N LQ++A LG+ E V G+ D+ +L ++D L+ S Q +G P
Sbjct: 233 LICGNGPEKNN--LQDLAKNLGIRE-QVHFLGYRTDIKELLAISDCFLFTSLQ--EGLPR 287
Query: 518 LIVRAMTFGIPVITPDF 534
++ AM G+P I D
Sbjct: 288 SLMEAMASGLPCIISDI 304
>gi|406921291|gb|EKD59193.1| hypothetical protein ACD_55C00117G0002 [uncultured bacterium]
Length = 383
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 3/117 (2%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ + DG L +A LG+ ++ V+ G DV G+L D+ Y S +G P
Sbjct: 235 LLVLAGRCDGVETRLTGLALELGI-QNRVKILGPVDDVAGLLGAIDLFAYSSK--SEGLP 291
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
+ ++ AM G+PV+ D P I+E + + + L++ + + + +L K
Sbjct: 292 NGVLEAMAAGLPVVGTDIPGIREALGSQGEGYLSPVGDSTDLAKKIVMLLQDQQLRK 348
>gi|333897348|ref|YP_004471222.1| group 1 glycosyl transferase [Thermoanaerobacterium xylanolyticum
LX-11]
gi|333112613|gb|AEF17550.1| glycosyl transferase group 1 [Thermoanaerobacterium xylanolyticum
LX-11]
Length = 373
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 60/131 (45%), Gaps = 8/131 (6%)
Query: 448 NSVEGSFK-FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLY 506
N + G K FVF+ G+ Y + L+E+ L L +V GF D+ L DI +
Sbjct: 220 NILRGKIKAFVFIAGDGP--YMEYLKEMVGELKL--DNVEFLGFIEDIFKFLSSIDIFVL 275
Query: 507 GSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFI 566
S +GF + AM G+PVI D I E V I + + P+ L+ A +
Sbjct: 276 PSRS--EGFGISVAEAMALGVPVIATDVGGIPEIVKNDENGIIVKSEAPKDLANAIEVLA 333
Query: 567 SNGKL-SKFAR 576
N L +KF++
Sbjct: 334 LNEDLRNKFSK 344
>gi|408382127|ref|ZP_11179673.1| phosphatidylinositol alpha-mannosyltransferase [Methanobacterium
formicicum DSM 3637]
gi|407815134|gb|EKF85754.1| phosphatidylinositol alpha-mannosyltransferase [Methanobacterium
formicicum DSM 3637]
Length = 389
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 1/105 (0%)
Query: 501 ADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSR 560
ADI S+ + +P I+ AM G+PV+ D I + + + NPE L
Sbjct: 284 ADIFCLPSTMSTECYPLAILEAMASGVPVVASDIGGIPDIIENNVNGLLVTPTNPEKLED 343
Query: 561 AFSLFISNGKL-SKFARTVASAGRLHAKNMLALDCVTRYARILEN 604
+L + N ++ +KF+ + ++ +A + + Y +LEN
Sbjct: 344 NLNLLLQNPEIRAKFSENALKGIKKYSWKNIATETLKLYESLLEN 388
>gi|197119804|ref|YP_002140231.1| group glycosyltransferase [Geobacter bemidjiensis Bem]
gi|197089164|gb|ACH40435.1| glycosyltransferase, WabH-like family [Geobacter bemidjiensis Bem]
Length = 385
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 3/117 (2%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ + DG L +A LG+ ++ V+ G DV G+L D+ Y S +G P
Sbjct: 237 LLVLAGRCDGVETRLTGLALELGI-QNRVKILGPVDDVAGLLGAIDLFAYSSK--SEGLP 293
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
+ ++ AM G+PV+ D P I+E + + + L++ + + + +L K
Sbjct: 294 NGVLEAMAAGLPVVGTDIPGIREALGSQGEGYLSPVGDSTDLAKKIVMLLQDQQLRK 350
>gi|417860743|ref|ZP_12505798.1| glycosyltransferase [Agrobacterium tumefaciens F2]
gi|338821147|gb|EGP55116.1| glycosyltransferase [Agrobacterium tumefaciens F2]
Length = 382
Score = 45.1 bits (105), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
+ Y +++ ASRLGL + VR GF DV ++ D+V + +S V + F ++V AM
Sbjct: 245 EAYEARIRDQASRLGL-DGRVRFLGFRSDVPELMAAMDVVAH-TSIVAEPFGRVVVEAMM 302
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G PV+ + E + +G + + L+ A +S+ L++
Sbjct: 303 CGRPVVATRGGGVTEIIRDGETGLLVPPGDASALAAAVGCILSHPALAE 351
>gi|149372907|ref|ZP_01891904.1| hypothetical protein SCB49_01642 [unidentified eubacterium SCB49]
gi|149354400|gb|EDM42966.1| hypothetical protein SCB49_01642 [unidentified eubacterium SCB49]
Length = 395
Score = 45.1 bits (105), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 34/54 (62%)
Query: 500 MADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
M + L+ S +E+G P+++V AM G+PVI+ D +KE V EG ++ ++D
Sbjct: 289 MKEASLFLLSSLEEGLPNVLVEAMALGVPVISTDCGGVKELVEEGTGIVVPKRD 342
>gi|304316393|ref|YP_003851538.1| group 1 glycosyl transferase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302777895|gb|ADL68454.1| glycosyl transferase group 1 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 374
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 129/327 (39%), Gaps = 29/327 (8%)
Query: 248 LQKLGYV-FKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAI 306
L+KLG V + I RS S S + + I I+ +++YSLI G L AK
Sbjct: 45 LKKLGIVCYDIDFERSPYSFSNLKALNQLIKIMREDKYSLIHVHTPVGAFLGRLAAKITG 104
Query: 307 SSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPM 366
+ + H + S+ N L Y + L ++W +N DY
Sbjct: 105 TKPVIYTAHGFHFY---KGASIKNWLIYYT---MERLAAHWTDGLITMND---EDYNAAK 155
Query: 367 LYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNEL 426
+ + + F + G D+ S +E+ Q RK GF + +IV++ V EL
Sbjct: 156 RFKLRKSNAVFYVHGVGLDIDKYNI-SDKNERKQTRKAFGFGEKDIVILTVA------EL 208
Query: 427 SWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVR 486
+ A + +L + S++ + + G Y L++ GL V+
Sbjct: 209 N-----ANKNHKQILDSLIKLKSLDNVYYLIVGIGE----YECYLKKYVEENGL-SGKVK 258
Query: 487 HYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQ 546
GF D+ +L ++DI + + + +G P I+ AM G PV+ D ++ V +G
Sbjct: 259 LLGFRLDIPDLLNISDI--FALTSMREGLPRCIMEAMAAGKPVVATDVRGNRDLVKDGTN 316
Query: 547 VIFFQKDNPEGLSRAFSLFISNGKLSK 573
D+ A I + L K
Sbjct: 317 GYLVPLDDVNATVDALQNLIESEDLRK 343
>gi|168704319|ref|ZP_02736596.1| Glycosyltransferase-like protein [Gemmata obscuriglobus UQM 2246]
Length = 388
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 58/127 (45%), Gaps = 6/127 (4%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L VR G+ DV G D+ + S + +G P++++ AM +PV++ +
Sbjct: 266 LGQRVRLAGWQADVRGFFEAMDV--FALSSLREGLPNVLLEAMALEVPVVSTRVNGVPRL 323
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDC-VTRYA 599
V +G + +GL+RA + + N L + R +AGR + + + R A
Sbjct: 324 VQDGRNGFLVNAGDLDGLTRALAGLLKNDGLREMFR---AAGRRTVETRYSFATRMQRLA 380
Query: 600 RILENVL 606
R+ + +L
Sbjct: 381 RLYDELL 387
>gi|297531516|ref|YP_003672791.1| glycosyl transferase family protein [Geobacillus sp. C56-T3]
gi|297254768|gb|ADI28214.1| glycosyl transferase group 1 [Geobacillus sp. C56-T3]
Length = 386
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 24/93 (25%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L V+ GF D+ + MAD+++ S + +G P +I+ AM F PV+ + +E
Sbjct: 256 LSDVVKIIGFQTDIYPYIKMADLIVLTSEK--EGVPRIIMEAMAFSKPVVATNVLGTREL 313
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V +G + + ++ E L+ + + +S+ + K
Sbjct: 314 VVDGETGVLVEYEDVEQLASSIHMMLSDERKRK 346
>gi|300865168|ref|ZP_07109992.1| Glycosyl transferase [Oscillatoria sp. PCC 6506]
gi|300336858|emb|CBN55142.1| Glycosyl transferase [Oscillatoria sp. PCC 6506]
Length = 1167
Score = 44.7 bits (104), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 78/175 (44%), Gaps = 12/175 (6%)
Query: 442 IKYARRNSVEGSFKFVFLCGNST--DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLL 499
I +++ V FV+ +T D L+E S LG+ + V+ G D++ L
Sbjct: 891 IAQLKQSPVWSQIYFVWAGPGATTHDNMEPELREKVSNLGVSDR-VKFLGQRWDISDWLD 949
Query: 500 MADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS 559
+DI + S +G P ++ AM G+PVI I E + E +++ +P+G
Sbjct: 950 ASDIFILPSKA--EGMPLAVMEAMAKGLPVIATAVSGIPEELGETGKLLPNPNRDPKGTV 1007
Query: 560 RAFSLFISNGKL-SKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDAL 613
R ++ I S+ ++V +L A+ M R R+L+ LN +AL
Sbjct: 1008 RELAMTIEAWVANSELRQSVGKECKLRAEQMF------REERMLQEYLNTIVEAL 1056
>gi|197106431|ref|YP_002131808.1| glycoside hydrolase family protein [Phenylobacterium zucineum HLK1]
gi|196479851|gb|ACG79379.1| glycosyl transferase, group 1 family protein [Phenylobacterium
zucineum HLK1]
Length = 348
Score = 44.7 bits (104), Expect = 0.29, Method: Composition-based stats.
Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 8/169 (4%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNSTD---GYNDALQEVASRLGLLEHSVRHYGFNGDVNGV 497
+I+ ARR + EG F+ L GY++ L + + GL E +VR G D+
Sbjct: 178 VIEAARRLAAEGRRDFLILFAGDDQGRTGYSEELAQAIAAAGLQE-AVRIVGHCDDMPAA 236
Query: 498 LLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG 557
L+AD L ++ V + F V G PVI + + E V +G + EG
Sbjct: 237 YLLADFALLPTT-VPESFGRAAVEPQAMGRPVIASNHGGVTETVLDGVTGWLVPPQDAEG 295
Query: 558 LSRAF--SLFISNGKLSKFARTVASAGR-LHAKNMLALDCVTRYARILE 603
+ A ++ I GK + + R L++ + + Y R+LE
Sbjct: 296 WAAALGRAIDIGPGKRGEMGQAGQKRARQLYSAEAMCAATLAAYERVLE 344
>gi|303327601|ref|ZP_07358042.1| glycosyl transferase, group 2 family [Desulfovibrio sp. 3_1_syn3]
gi|302862541|gb|EFL85474.1| glycosyl transferase, group 2 family [Desulfovibrio sp. 3_1_syn3]
Length = 761
Score = 44.7 bits (104), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 2/90 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L ++R G DV +L+ AD++L +S++E GFP++++ AM+ G PV+ I E
Sbjct: 630 LTDNIRLLGPRKDVLNLLIAADVLLM-TSKIE-GFPNVVMEAMSAGRPVVATRVGAIPEL 687
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
V EG + GL + +S+ K
Sbjct: 688 VREGKDGFLHNVGDVVGLCESLQFLLSDSK 717
>gi|118588497|ref|ZP_01545906.1| putative glycosyltransferase protein [Stappia aggregata IAM 12614]
gi|118439203|gb|EAV45835.1| putative glycosyltransferase protein [Stappia aggregata IAM 12614]
Length = 430
Score = 44.7 bits (104), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 57/128 (44%), Gaps = 4/128 (3%)
Query: 440 LLIKYARRNSVEG-SFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
LL + A+R EG F+ L TD DA++ + E +R+ G DV
Sbjct: 240 LLAEAAKRLKSEGHEFELDIL--GPTDSNPDAIEPEQLKAWQDEGVLRYRGATNDVVP-F 296
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
L A VL + + +G P I+ AM+ G VIT D P E + G +D+ E L
Sbjct: 297 LHASSVLVLPTSLREGIPRSILEAMSCGRAVITTDAPGCGETIEHGVSGFVVPRDDVEAL 356
Query: 559 SRAFSLFI 566
+ A FI
Sbjct: 357 AGAMKSFI 364
>gi|88601701|ref|YP_501879.1| glycosyl transferase, group 1 [Methanospirillum hungatei JF-1]
gi|88187163|gb|ABD40160.1| glycosyl transferase, group 1 [Methanospirillum hungatei JF-1]
Length = 396
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 88/176 (50%), Gaps = 27/176 (15%)
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
PL+IK ++N + F+F+ GN D Y ++L + L + ++++ F G V+
Sbjct: 226 PLIIKKTKKNVL-----FLFI-GNG-DYYYESLN-LVKELEIEKNTL----FTGSVSKKD 273
Query: 499 LMADIVLYGSSQV------EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQK 552
L+A Y SS + +G P+ I+ AM F +PVI+ D P ++++ A+ A I Q
Sbjct: 274 LIA---FYQSSNLFILPSLSEGLPTTILEAMYFNLPVISSDIPGVRDHFADHA--ILVQP 328
Query: 553 DNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENVLN 607
+ + ++ A + N +L AR ++S G+ + D + Y +I N+ N
Sbjct: 329 RDSQKIADAVIHILDNEEL---ARELSSKGKEFILSHYTWDKIICEYEKIFLNLKN 381
>gi|398344471|ref|ZP_10529174.1| glycosyl transferase [Leptospira inadai serovar Lyme str. 10]
Length = 393
Score = 44.7 bits (104), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 72/173 (41%), Gaps = 18/173 (10%)
Query: 401 LRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLC 460
LRKE KDE+VV + A+ H L+ E FK VF+
Sbjct: 192 LRKEFHLQKDELVVGNIA------------ALVDHKDQKTLLNAIALIETEQKFK-VFIV 238
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
G L+ +A + GLL+ V GF D+ L + DI S E+G + I+
Sbjct: 239 GEG--ALRKELENLARQKGLLDRVV-FTGFREDIQEFLSLFDIFTLTSK--EEGLGTSIL 293
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
AM G+P++ + I E + Q + E L++A+ + + +L K
Sbjct: 294 DAMAAGLPIVATNAGGISEMLTSEKGAFVAQVGDAEFLAKAYKTLLQDSRLRK 346
>gi|344203641|ref|YP_004788784.1| group 1 glycosyl transferase [Muricauda ruestringensis DSM 13258]
gi|343955563|gb|AEM71362.1| glycosyl transferase group 1 [Muricauda ruestringensis DSM 13258]
Length = 370
Score = 44.7 bits (104), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 4/108 (3%)
Query: 466 GYNDALQEVA--SRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAM 523
G+ D QE S+ ++H + +G + D+ L +DI + S +G P I AM
Sbjct: 226 GHGDLHQEFVDLSKKMKIDHLITFHGESNDIPSFLNKSDIFVLTSRF--EGLPLSICEAM 283
Query: 524 TFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+ G+P++ D + E V +G K+NP L+ S I + +L
Sbjct: 284 SVGVPIVASDVGGVHEMVRDGYNGYLIPKENPNHLAEKLSNLIRDKEL 331
>gi|291297931|ref|YP_003509209.1| group 1 glycosyl transferase [Stackebrandtia nassauensis DSM 44728]
gi|290567151|gb|ADD40116.1| glycosyl transferase group 1 [Stackebrandtia nassauensis DSM 44728]
Length = 388
Score = 44.7 bits (104), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE +V G GD+ G L A GS +G P +++ A T G+PV++ +P +E
Sbjct: 261 LEDNVVLMGRTGDLGGELAKASFCALGSRS--EGLPMVVLEAFTHGLPVVSCAYPGAEEL 318
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN 568
V++G I ++ E L A + ++
Sbjct: 319 VSDGRDGILVPPEDVEELGTAMAWLATD 346
>gi|302866604|ref|YP_003835241.1| group 1 glycosyl transferase protein [Micromonospora aurantiaca
ATCC 27029]
gi|302569463|gb|ADL45665.1| glycosyl transferase group 1 [Micromonospora aurantiaca ATCC 27029]
Length = 409
Score = 44.7 bits (104), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 15/136 (11%)
Query: 400 QLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFL 459
+LR E G D++V VV+G S + P ++ R + FV
Sbjct: 204 RLRDEIGAAPDDLVCVVIGRPHPLKRRS---------MVPSIVAALRERGIGCRAVFV-- 252
Query: 460 CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
G D +DA+++ A+R +L+ + G DV L AD++L S+ +G P +
Sbjct: 253 -GPHDDRDDDAVRQSATRHDVLDR-MHLVGPRADVGEFLAQADLLLQPSAL--EGLPGTV 308
Query: 520 VRAMTFGIPVITPDFP 535
+ A G PV+ D P
Sbjct: 309 LEARAVGTPVVASDLP 324
>gi|374290690|ref|YP_005037725.1| putative glycosyltransferase, group 1 [Azospirillum lipoferum 4B]
gi|357422629|emb|CBS85465.1| putative glycosyltransferase, group 1 [Azospirillum lipoferum 4B]
Length = 463
Score = 44.7 bits (104), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 62/141 (43%), Gaps = 5/141 (3%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
GY + L+ + GL VR D+ L++ +V+ S + E F +IV A
Sbjct: 323 GYREELENRIATNGL-RGVVRMTDHCSDMAAAYLLSTVVVSASREPE-AFGRVIVEAQAM 380
Query: 526 GIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS---NGKLSKFARTVASAG 582
G PVI +E V G D+P+ L+RA +S + + + AR +A
Sbjct: 381 GKPVIVSAIGAYQETVVPGETAWVVPPDDPDALARALDEALSLSPDQRDAIGARAMAFVA 440
Query: 583 RLHAKNMLALDCVTRYARILE 603
+ K+ + D + YA +L+
Sbjct: 441 DRYTKDRMCADTLAVYAELLQ 461
>gi|434395047|ref|YP_007129994.1| glycosyl transferase group 1 [Gloeocapsa sp. PCC 7428]
gi|428266888|gb|AFZ32834.1| glycosyl transferase group 1 [Gloeocapsa sp. PCC 7428]
Length = 382
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 80/199 (40%), Gaps = 19/199 (9%)
Query: 398 KYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFV 457
+ ++R G D +++ G Y+ + D+A H L KYA +
Sbjct: 195 RSEVRTSLGLAADTLLI---GLFCRYHPMK-DHANFFHAAALLTKKYANVH--------F 242
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
L G N LQ++ LGL + V G D+ ++ DI S+ E FP
Sbjct: 243 LLAGTDITPKNQNLQQLIDNLGLSQ--VHLLGERSDIPQLMAALDIATLASAYGE-AFPL 299
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
+I AM GIP + D K V + V+ NP+ L+ A+ I G ++ A
Sbjct: 300 VIGEAMACGIPCVVTDVGDSKWIVGDTGIVV--PPKNPQALADAWQELIVLGTTARTA-- 355
Query: 578 VASAGRLHAKNMLALDCVT 596
+ A R + ALD V
Sbjct: 356 LGKAARTRVQENFALDMVV 374
>gi|443311804|ref|ZP_21041428.1| glycosyltransferase [Synechocystis sp. PCC 7509]
gi|442778204|gb|ELR88473.1| glycosyltransferase [Synechocystis sp. PCC 7509]
Length = 381
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLL--EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
L G++ G D +Q++ +++ L EH V+ GF DV ++ D+V + +S + +
Sbjct: 230 IALLIGDALFGEQDYVQKLHAQVAELNLEHRVKFLGFRNDVTSLMATCDMVTH-TSTIAE 288
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
F +IV AM G P++ +E V G + + + L+ A + I+
Sbjct: 289 PFGRVIVEAMLCGTPIVATSAGGAQELVESGKTGLLVAPGDVKELAAAINTCIAQ 343
>gi|300113357|ref|YP_003759932.1| group 1 glycosyl transferase [Nitrosococcus watsonii C-113]
gi|299539294|gb|ADJ27611.1| glycosyl transferase group 1 [Nitrosococcus watsonii C-113]
Length = 379
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 72/181 (39%), Gaps = 16/181 (8%)
Query: 390 EAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNS 449
E Y + Q+RKE + V+VG Y+ M D L AR
Sbjct: 177 EYYPDKGAREQVRKEFNIPPN---TVLVGLLARYHP--------MKDHANFLQAAARLMK 225
Query: 450 VEGSFKFVFL-CGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGS 508
E FL G D N L + RLGL V G DV ++ DI S
Sbjct: 226 EEAGINVAFLLAGRGIDPSNGLLIGLICRLGL-SSKVILLGERQDVPSLITALDIATVSS 284
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ E GFP+++ AM GIP + D +A+ +++ +P+ L+ A+ + I
Sbjct: 285 AWGE-GFPNVLGEAMACGIPCVATDVGDSAYLIADTGKIV--SPRDPQALAAAWQVLIGA 341
Query: 569 G 569
G
Sbjct: 342 G 342
>gi|322418190|ref|YP_004197413.1| group 1 glycosyl transferase [Geobacter sp. M18]
gi|320124577|gb|ADW12137.1| glycosyl transferase group 1 [Geobacter sp. M18]
Length = 404
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
+CG+ D A++ + LG+ SV F D +L AD++L GS + E F
Sbjct: 263 LICGHGNDSEIGAVKGIVQNLGI-ASSVHVLDFRPDAIQILRQADLLLVGSQEFES-FGL 320
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS 559
V AM IPV+ + E V +G F+KD+ +G +
Sbjct: 321 TCVEAMANRIPVLATRVGGLPEVVQDGDGGFTFEKDDADGYA 362
>gi|410463290|ref|ZP_11316819.1| glycosyltransferase [Desulfovibrio magneticus str. Maddingley
MBC34]
gi|409983598|gb|EKO39958.1| glycosyltransferase [Desulfovibrio magneticus str. Maddingley
MBC34]
Length = 863
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 66/157 (42%), Gaps = 14/157 (8%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ L G T G L + A+RLGL E V G+ D LM D + +GF
Sbjct: 583 LVLTGADT-GLRAELGQAAARLGLAER-VTFAGYVSDEELSALMTDAAALVFPSLFEGFG 640
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
+V AM G P++ D + E E A ++F P+ ++ A S + L A
Sbjct: 641 MPLVEAMAVGTPIVCSDVTSLPEVGGEAA--LYFDPKRPDTITAALSRLLDEPGL---AE 695
Query: 577 TVASAGRLHAKNMLAL----DCVTRYARILENVLNFP 609
+ + GR + + A D +Y + E+VL P
Sbjct: 696 GLVAKGR---ERLAAFGGPEDMARKYLAVFEDVLARP 729
>gi|428315500|ref|YP_007113382.1| glycosyl transferase group 1 [Oscillatoria nigro-viridis PCC 7112]
gi|428239180|gb|AFZ04966.1| glycosyl transferase group 1 [Oscillatoria nigro-viridis PCC 7112]
Length = 1781
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 53/118 (44%), Gaps = 3/118 (2%)
Query: 457 VFLCGNSTDGYNDALQEVASRL-GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
F+ G+ Y++ L E+ L L V G+ ADI + +S+VE F
Sbjct: 1631 CFIVGDRPSIYSNKLAELVGELPAELRERVTVVPETGETGKYYKAADIFV-CTSRVE-SF 1688
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
P +I+ AM +P+IT IKE V G +F+ D PE L A + + L +
Sbjct: 1689 PRVILEAMASDLPIITTPVFGIKEQVRPGINGLFYTPDRPEELVAALISLLEDKSLRQ 1746
>gi|118579846|ref|YP_901096.1| group 1 glycosyl transferase [Pelobacter propionicus DSM 2379]
gi|118502556|gb|ABK99038.1| glycosyl transferase, group 1 [Pelobacter propionicus DSM 2379]
Length = 383
Score = 44.3 bits (103), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 2/91 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L S+R G DV+G+L D+ Y S +G P+ ++ AM G+PV+ P I E
Sbjct: 255 LADSIRLLGPVADVSGLLHAVDLFAYSSR--SEGIPNAVLEAMAAGLPVVGTAIPGILEA 312
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
V N ++ L + N L
Sbjct: 313 VGAEGAAFLAPVGNHAAMAEKICLLLENAPL 343
>gi|372222387|ref|ZP_09500808.1| putative glycosyltransferase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 515
Score = 44.3 bits (103), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 56/127 (44%), Gaps = 4/127 (3%)
Query: 447 RNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLY 506
R S+ V+ ++ Y + Q + + L L +H + GF+ + L DI +
Sbjct: 345 RKSIPNVLFLVYGNKDAVPEYTEQCQNLIAELALEDH-FKLAGFHAKPEKLFLEGDISIL 403
Query: 507 GSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFI 566
S + +GFP ++ +M+ GIPV+ D + E + I +D EG+S +
Sbjct: 404 TS--ISEGFPYTVIESMSCGIPVVATDVGGVSEALTPACGYICKPRDL-EGISEKVIAIL 460
Query: 567 SNGKLSK 573
+ L K
Sbjct: 461 RDATLRK 467
>gi|86748665|ref|YP_485161.1| group 1 glycosyl transferase [Rhodopseudomonas palustris HaA2]
gi|86571693|gb|ABD06250.1| Glycosyl transferase, group 1 [Rhodopseudomonas palustris HaA2]
Length = 414
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D L +A RLG+ + VR G D+N ++ +DI ++ +S+VE G + ++ AMT G+P
Sbjct: 280 DNLIALAGRLGIAD-KVRFPGHVTDINCRIVNSDIFVH-ASEVE-GMSNAVLEAMTLGLP 336
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+ D P + E EG F + NP ++ I + +L
Sbjct: 337 SVVVDAPGVSECHIEG-DTGFIVERNPNAMAARLIALIDDAEL 378
>gi|402493402|ref|ZP_10840155.1| glycosyltransferase [Aquimarina agarilytica ZC1]
Length = 376
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
G+ Y + L++ ++GL E+ V G ++ +L A LY + + + FP +++
Sbjct: 235 GDGEPSYVEVLEKQIKKIGL-ENKVTLKGATDNLKNEMLNAS--LYAMTSITECFPMVLL 291
Query: 521 RAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
AM+ G+P+++ D P + + +G I + ++ E L+R I N K
Sbjct: 292 EAMSCGLPIVSFDSPTGPRNIIKDGVDGILVEYNDVEDLAREIIYLIKNPK 342
>gi|119357614|ref|YP_912258.1| group 1 glycosyl transferase [Chlorobium phaeobacteroides DSM 266]
gi|119354963|gb|ABL65834.1| glycosyl transferase, group 1 [Chlorobium phaeobacteroides DSM 266]
Length = 382
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 10/90 (11%)
Query: 439 PLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL 498
PLL++ + VEG LCG G + +Q+V +R L ++RH + + V
Sbjct: 221 PLLLEAWTKAGVEGEL---LLCG----GIDPDIQDVVARY-LTASTIRHIAYTRHIGEVY 272
Query: 499 LMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
MAD+ ++ S +E+G P + AM GIP
Sbjct: 273 RMADVFVFPS--LEEGGPMVTYEAMAHGIP 300
>gi|406979096|gb|EKE00946.1| hypothetical protein ACD_21C00250G0019 [uncultured bacterium]
Length = 386
Score = 44.3 bits (103), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 124/314 (39%), Gaps = 37/314 (11%)
Query: 260 VRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAISSLMQEPFHSIPL 319
+ S N H +++ + S+L Q+ L+ ++ + + AK + +P+
Sbjct: 65 ISSKNIHEIYKLFSLS-SLLKQQAPHLVQTWMYHANVIGGIAAK----------INKLPV 113
Query: 320 VWIIQEDSLANRLPVYVE-RGFQNLLSYWKSVFSRV--NVIVFPDYTLPMLYSV--LDAG 374
+W I+ SL+ P ++ F N + +V S+ IV + +L+
Sbjct: 114 IWNIRNSSLSKETPESLKLNTFTNYVVKTGAVLSKYIPETIVSCSHYAALLHQKWGYRKD 173
Query: 375 NFFVIPGSPADVWAVEAYS-KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVA 433
NF VIP +E +S K K QLRKE G EIV+ VG ++
Sbjct: 174 NFIVIPNG----IDIELFSPKPQIKTQLRKELGIGDLEIVIGFVGRLHPQKDI------- 222
Query: 434 MHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGD 493
P +K AR LCG+ N + + L H D
Sbjct: 223 -----PNFLKAARLFLEHEPNAQFLLCGDGLTKENPLVNKWIENENLNNHC-HLLDKISD 276
Query: 494 VNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
+ + + DI S+ E FP++I AM+ GIP + D + + G Q I
Sbjct: 277 THNIYNVLDIYCSPSAYGE-AFPNVIAEAMSCGIPCVGTD--VGDTRLILGDQGITVPPK 333
Query: 554 NPEGLSRAFSLFIS 567
+P+ L++A+ +S
Sbjct: 334 DPQALAQAWKHLLS 347
>gi|429214737|ref|ZP_19205900.1| glycosyl transferase family protein [Pseudomonas sp. M1]
gi|428155023|gb|EKX01573.1| glycosyl transferase family protein [Pseudomonas sp. M1]
Length = 384
Score = 44.3 bits (103), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
E SVR G+ DV G L + +++ S +G P I AM G PVIT D+ +E +
Sbjct: 267 EGSVRWVGWVRDVRGWLNESSVLVL-PSYYREGIPRCIQEAMALGRPVITTDWVGCRESI 325
Query: 542 AEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+G +P L++A + FI + +L
Sbjct: 326 VDGVNGFLVPIRSPTALTQAMARFIHDPQL 355
>gi|399073911|ref|ZP_10750729.1| glycosyltransferase [Caulobacter sp. AP07]
gi|398041068|gb|EJL34150.1| glycosyltransferase [Caulobacter sp. AP07]
Length = 350
Score = 44.3 bits (103), Expect = 0.43, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L SVR G+ DV+ L D+ + S Q +GFP +++ AM G+PV + P E
Sbjct: 226 LGDSVRLVGWRSDVSAFLATGDVFAFPSYQ--EGFPLVLLEAMAVGLPVASSAIPGPVEM 283
Query: 541 VAEG 544
+AEG
Sbjct: 284 IAEG 287
>gi|302392890|ref|YP_003828710.1| group 1 glycosyl transferase [Acetohalobium arabaticum DSM 5501]
gi|302204967|gb|ADL13645.1| glycosyl transferase group 1 [Acetohalobium arabaticum DSM 5501]
Length = 369
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 57/127 (44%), Gaps = 4/127 (3%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
V L G+ G + ++ LGL E V GF D+ +L D +++ ++ E G P
Sbjct: 228 VLLVGSGKKG--EVFKDRVKELGL-EDKVIFTGFREDIPSILKQLDFMVH-TAIYEGGAP 283
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
+I+ AM G+P++T + I E+V +G + N ++ I + + K +
Sbjct: 284 WVILEAMMAGVPIVTTEATTISEFVIDGENGYLAENKNSTDIAEKILKMIKHPEREKLGQ 343
Query: 577 TVASAGR 583
A +
Sbjct: 344 QGAEIAK 350
>gi|386811508|ref|ZP_10098733.1| putative glycosyltransferase [planctomycete KSU-1]
gi|386403778|dbj|GAB61614.1| putative glycosyltransferase [planctomycete KSU-1]
Length = 374
Score = 44.3 bits (103), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 6/116 (5%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
LQ +A LG+ + + G + + + DI L S +VE F + I+ A++ PVI
Sbjct: 243 LQSLAHELGIWRNVI-WLGLQKEPKKYISIFDIFLMASYRVET-FSNAIIEALSMSKPVI 300
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG----KLSKFARTVASAG 582
D E V +G + NPE +S + FI N + S+ AR A G
Sbjct: 301 ATDVGGTSEMVKDGVNGFLVRPKNPEDISEKITYFIKNPQCFRRFSQNARLCAVEG 356
>gi|386347344|ref|YP_006045593.1| group 1 glycosyl transferase [Spirochaeta thermophila DSM 6578]
gi|339412311|gb|AEJ61876.1| glycosyl transferase group 1 [Spirochaeta thermophila DSM 6578]
Length = 369
Score = 44.3 bits (103), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 8/116 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
++++E A LG+ + V+ GF DV GVL + I + S +GFP I+ AM G+P
Sbjct: 232 ESMKEEAKVLGIGDR-VQFLGFCEDVEGVLASSQIFVLTSRW--EGFPISILEAMRAGLP 288
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLS-RAFSLFISNGKLSKFARTVASAGR 583
V+ D KE V EG + + L+ R L + GK ++ R AGR
Sbjct: 289 VVASDVGGCKESVVEGETGYLIPRGDHMVLAERLRELILDPGKRARMGR----AGR 340
>gi|399074599|ref|ZP_10751100.1| glycosyltransferase [Caulobacter sp. AP07]
gi|398040263|gb|EJL33376.1| glycosyltransferase [Caulobacter sp. AP07]
Length = 419
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 470 ALQEVASRLGLLEHSVRHYG--FNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGI 527
AL+ A RLG+ VR G + D+ GV AD+++ SS+ +G+ ++++ AM G
Sbjct: 260 ALETHARRLGV-AGRVRFLGEVAHADLPGVYAAADVLVLASSR--EGWANVLLEAMACGT 316
Query: 528 PVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
PV+ + E + GA + + PE L A L N
Sbjct: 317 PVVATNVNGAGEVIRSGAAGVLMSRRTPECLVEALDLLRQN 357
>gi|108803534|ref|YP_643471.1| group 1 glycosyl transferase [Rubrobacter xylanophilus DSM 9941]
gi|108764777|gb|ABG03659.1| glycosyl transferase, group 1 [Rubrobacter xylanophilus DSM 9941]
Length = 398
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 62/135 (45%), Gaps = 9/135 (6%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L ++A+ LGL E VR GF D ++ D+++ S +G P +++ AM G+PV+
Sbjct: 240 LSQLAAGLGL-EGRVRFLGFRPDAREIIRRLDVLVVPS--FTEGAPLVVLEAMASGVPVV 296
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
I + + G + + +P L A + N L AR + AGR A +
Sbjct: 297 ASAVGGIPDQIRHGREGLLVPTGDPVALGEALLSLLDNPGL---ARRMGEAGRERAFSAF 353
Query: 591 ALDCVTRYARILENV 605
D R R +E+V
Sbjct: 354 GHD---RMVRRIESV 365
>gi|403237455|ref|ZP_10916041.1| group 1 glycosyl transferase [Bacillus sp. 10403023]
Length = 390
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 57/117 (48%), Gaps = 7/117 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D +++A+ LGL++ V G D+ +L ++DI + +S + +G P ++ AM G+P
Sbjct: 247 DECRQLANELGLVD-KVEFLGLRDDIEMLLRISDIAV--ASSLREGLPVNVMEAMACGLP 303
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GKLSKFARTVASA 581
+I + KE + Q D+ +G S +N +L K +R++ +
Sbjct: 304 IIATENRGHKELIVTNENGWILQHDDLDGFSTKMKELANNEYVRNRLGKMSRSLMES 360
>gi|365843168|ref|ZP_09384121.1| glycosyltransferase, group 1 family protein [Flavonifractor plautii
ATCC 29863]
gi|364573176|gb|EHM50686.1| glycosyltransferase, group 1 family protein [Flavonifractor plautii
ATCC 29863]
Length = 368
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 3/97 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V+ GF D+ + AD LY +S + LI+ AM G+PVI D +
Sbjct: 244 LEDCVKFIGFRKDIKNLYKGAD--LYVNSSQHEALSFLIIEAMAAGLPVIATDMGGNSDI 301
Query: 541 VAEGAQV-IFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
V + A + + DNP+ ++ A F+ + + R
Sbjct: 302 VNDEAGCGMLVEYDNPDSMAVAMKCFLEDPAFAARCR 338
>gi|336322553|ref|YP_004602520.1| group 1 glycosyl transferase [Flexistipes sinusarabici DSM 4947]
gi|336106134|gb|AEI13952.1| glycosyl transferase group 1 [Flexistipes sinusarabici DSM 4947]
Length = 359
Score = 43.9 bits (102), Expect = 0.46, Method: Composition-based stats.
Identities = 34/116 (29%), Positives = 57/116 (49%), Gaps = 8/116 (6%)
Query: 453 SFKF-VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQV 511
SF F +F GN ++ + D +++ L+ +V+ GF DV +L D L S
Sbjct: 206 SFHFDIFGEGNLSEWFEDFIKK-----NKLDDNVQIKGFESDVRHLLPKYDFGLLTS--F 258
Query: 512 EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS 567
+GFP++I+ M+ G+PVI+ + + E V + F + + L R F IS
Sbjct: 259 SEGFPNVILEYMSAGLPVISSNVSGVAEIVEDDFNGFLFMAGDEDKLERLFKKAIS 314
>gi|403511562|ref|YP_006643200.1| glycosyl transferases group 1 family protein [Nocardiopsis alba
ATCC BAA-2165]
gi|402799880|gb|AFR07290.1| glycosyl transferases group 1 family protein [Nocardiopsis alba
ATCC BAA-2165]
Length = 383
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 9/136 (6%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
++E A RLG+ + +V G D+ L A I L SS+VE GF IV A + G+PV+
Sbjct: 252 IRETAERLGV-QDNVHLMGKTKDLTSELSKASI-LAVSSKVE-GFGMTIVEAFSVGLPVV 308
Query: 531 TPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
+ D P E + +G + + EGL RA + N + R A L +
Sbjct: 309 SFDCPHGPGEIIEDGHDGLLVPPQDVEGLGRALLRLVENDE----ERHRMGAHALKSAER 364
Query: 590 LAL-DCVTRYARILEN 604
L D V R+ R LE+
Sbjct: 365 YGLEDSVARWERFLED 380
>gi|51893850|ref|YP_076541.1| glycosyl transferase family protein [Symbiobacterium thermophilum
IAM 14863]
gi|51857539|dbj|BAD41697.1| putative glycosyl transferase [Symbiobacterium thermophilum IAM
14863]
Length = 361
Score = 43.9 bits (102), Expect = 0.47, Method: Composition-based stats.
Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 3/102 (2%)
Query: 472 QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
+++A +LG+++ VR G D+ +L A I + S+ +GFP I+ AM G+PVI
Sbjct: 231 RQLAHQLGMMDR-VRFLGSRHDIADILADAQIFVLTSNW--EGFPITILEAMRAGLPVIA 287
Query: 532 PDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
D ++E V G + + + L R + +++ +L +
Sbjct: 288 SDVGGVREAVLHGRTGYLVARGDVDALQRYLTTLLTDPQLRE 329
>gi|149909974|ref|ZP_01898623.1| hypothetical protein PE36_18379 [Moritella sp. PE36]
gi|149806988|gb|EDM66947.1| hypothetical protein PE36_18379 [Moritella sp. PE36]
Length = 402
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 494 VNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
VN L+A + S V+ G P ++ AM G+PVIT D KE VA G +I QKD
Sbjct: 292 VNFDCLVAPFCVSQSGCVDTG-PLVLKEAMASGVPVITTDLMGCKEIVAPGTGLIVKQKD 350
Query: 554 NPEGLSRAFSLFI 566
+ + LS A + FI
Sbjct: 351 S-QSLSHAITEFI 362
>gi|334135154|ref|ZP_08508651.1| glycosyltransferase, group 1 family protein [Paenibacillus sp.
HGF7]
gi|333607292|gb|EGL18609.1| glycosyltransferase, group 1 family protein [Paenibacillus sp.
HGF7]
Length = 597
Score = 43.9 bits (102), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 48/118 (40%), Gaps = 4/118 (3%)
Query: 451 EGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQ 510
+G FV L L+ +A L+ V G D+ L MADIV+ S +
Sbjct: 228 QGITDFVCLLAGKGPLEKQILERIAELS--LQKEVLLIGHRSDLYAFLKMADIVVLTSEK 285
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+G P ++ AM F PV+ D +E V + N L++AF I N
Sbjct: 286 --EGIPRFLMEAMAFSKPVVASDVLGTRELVRHEDTGLLVPYKNTGALAKAFRTLIEN 341
>gi|307719251|ref|YP_003874783.1| glycosyl transferase, group 1 family, partial [Spirochaeta
thermophila DSM 6192]
gi|306532976|gb|ADN02510.1| glycosyl transferase, group 1 family [Spirochaeta thermophila DSM
6192]
Length = 242
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 8/116 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
++++E A LG+ + V GF DV GVL + I + S +GFP I+ AM G+P
Sbjct: 105 ESMKEEAKALGIGDR-VEFLGFCEDVEGVLASSQIFVLASRW--EGFPISILEAMRAGLP 161
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLS-RAFSLFISNGKLSKFARTVASAGR 583
V+ D +E V EG + + L+ R L + GK + R AGR
Sbjct: 162 VVASDVGGCREAVVEGETGYLVPRGDHMVLAERLRELILDPGKRERMGR----AGR 213
>gi|302390071|ref|YP_003825892.1| group 1 glycosyl transferase [Thermosediminibacter oceani DSM
16646]
gi|302200699|gb|ADL08269.1| glycosyl transferase group 1 [Thermosediminibacter oceani DSM
16646]
Length = 371
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 43/100 (43%), Gaps = 3/100 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V GF D+ ++ +D+++ S +G ++ AM G PVI D E
Sbjct: 246 LEGRVNFLGFCRDIYRIMQNSDMLVLSSRS--EGLSLSLLEAMAMGKPVIATDVGGNPEI 303
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN-GKLSKFARTVA 579
+ G + DNP L+ A I N G K ART
Sbjct: 304 IRHGVTGMLVPPDNPRALAEAMEYVIKNPGDAEKMARTAC 343
>gi|308535263|ref|YP_002138592.2| group glycosyltransferase [Geobacter bemidjiensis Bem]
gi|308052608|gb|ACH38796.2| glycosyltransferase [Geobacter bemidjiensis Bem]
Length = 413
Score = 43.9 bits (102), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 7/116 (6%)
Query: 465 DGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
DGY D L+ +RLGL E V G D+ V + D+ L S V +G P+ ++
Sbjct: 255 DGYGDELERARREVARLGL-EEVVHFTGHRNDLRDVYVSFDVFLMTS--VTEGLPNTLLE 311
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS-RAFSLFISNGKLSKFAR 576
AM G+P ++ D I E + +G + E L+ R L S +F+R
Sbjct: 312 AMALGVPSVSTDVGGIPELLYDGEGGYLAPAGDAERLALRVLELLASVELRERFSR 367
>gi|37528671|ref|NP_932016.1| WalR protein [Photorhabdus luminescens subsp. laumondii TTO1]
gi|36788110|emb|CAE17234.1| WalR protein [Photorhabdus luminescens subsp. laumondii TTO1]
Length = 373
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
+L+ + R GL +SV G DV L D+ S E G P I++AM GIPV
Sbjct: 236 SLEPLVKREGL-SNSVIFLGNRQDVPDCLNAMDLFALPSFGNE-GVPQGIMQAMACGIPV 293
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
++ I E V +G + N E L+++ L I N +L
Sbjct: 294 VSTSVGAITEAVVDGETGYIVEPRNTELLTKSLELLIHNNEL 335
>gi|226365209|ref|YP_002782992.1| glycosyltransferase [Rhodococcus opacus B4]
gi|226243699|dbj|BAH54047.1| putative glycosyltransferase [Rhodococcus opacus B4]
Length = 757
Score = 43.9 bits (102), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 65/139 (46%), Gaps = 7/139 (5%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ +AS LG+ + R G DV +L D+ S V +G P ++ +M G+PV+
Sbjct: 624 LRSLASSLGIGGRT-RFTGIRRDVQQLLPGLDVSCL--SSVHEGLPVTVIESMAAGLPVV 680
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
+ D +++ + +G Q + + L+RA S + L + GRL A+
Sbjct: 681 STDCGSLRDVITDGEQGYLVPVGDVDALARALSDLADDPGLRA---RLGEQGRLRAEQKY 737
Query: 591 ALDCVTR-YARILENVLNF 608
++ R + R+L +++
Sbjct: 738 RIELTARGFERLLADLVGV 756
>gi|390935162|ref|YP_006392667.1| group 1 glycosyl transferase [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
gi|389570663|gb|AFK87068.1| glycosyl transferase group 1 [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
Length = 373
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 59/131 (45%), Gaps = 8/131 (6%)
Query: 448 NSVEGSFK-FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLY 506
N + G K FVF+ G+ Y + L+++ L L +V GF D+ L DI +
Sbjct: 220 NILRGKVKAFVFIAGDGP--YKEHLKDMVRDLKL--DNVEFLGFIEDIFNFLSSIDIFVL 275
Query: 507 GSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFI 566
S +GF + AM G+PVI D I E V I + + P L+ A +
Sbjct: 276 PSHS--EGFGISVAEAMALGVPVIATDVGGIPEIVRNDENGIIVKSEAPNDLANAIEILA 333
Query: 567 SNGKL-SKFAR 576
N L +KF++
Sbjct: 334 LNEDLRNKFSK 344
>gi|331699993|ref|YP_004336232.1| group 1 glycosyl transferase [Pseudonocardia dioxanivorans CB1190]
gi|326954682|gb|AEA28379.1| glycosyl transferase group 1 [Pseudonocardia dioxanivorans CB1190]
Length = 394
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 5/118 (4%)
Query: 467 YNDALQEVASRLGL-LEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAM 523
Y DA LGL L H V G D+ G AD + + S V++G+ ++ AM
Sbjct: 252 YRDAALAELPELGLELGHDVHLLGTVSEADLAGWYHAADALCFPS--VKEGWGLVVFEAM 309
Query: 524 TFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA 581
G+PVI D + EY+ +G + ++ +GL++A + + L R A
Sbjct: 310 AAGLPVIASDLDVFHEYLEDGVSALLPPVEDSDGLAQAMRTMVRDAALRDRLRAGGEA 367
>gi|443323787|ref|ZP_21052790.1| glycosyltransferase [Gloeocapsa sp. PCC 73106]
gi|442786573|gb|ELR96303.1| glycosyltransferase [Gloeocapsa sp. PCC 73106]
Length = 393
Score = 43.5 bits (101), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 78/191 (40%), Gaps = 37/191 (19%)
Query: 447 RNSVEGSFKFVFLCG--NSTDGYND--------------------------ALQEVASRL 478
+ S++ K + CG N G++D LQ L
Sbjct: 198 KESIQSGTKLIVACGRLNQQKGFSDLIKALVTVQKLIPTHLWIIGEGELRPVLQSQIESL 257
Query: 479 GLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI-I 537
GL E SVR GF + + AD+ + S + +GF ++IV AM G PV+ D P
Sbjct: 258 GLSE-SVRLLGFQANPYQYMAKADVFVL--SSIYEGFGNVIVEAMACGTPVVATDCPYGP 314
Query: 538 KEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS-KFARTVASAGRLHAKNMLALDCVT 596
+E + G + + + E L+ + + +L +FAR G++ A++ ++
Sbjct: 315 REIINSGVNGLLVKPGDSEALAEGIIQVLQDPQLQVQFARQ----GKIRAQDFDSVKIAK 370
Query: 597 RYARILENVLN 607
Y + + L+
Sbjct: 371 MYGELFISCLS 381
>gi|302389071|ref|YP_003824892.1| group 1 glycosyl transferase [Thermosediminibacter oceani DSM
16646]
gi|302199699|gb|ADL07269.1| glycosyl transferase group 1 [Thermosediminibacter oceani DSM
16646]
Length = 380
Score = 43.5 bits (101), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 46/100 (46%), Gaps = 5/100 (5%)
Query: 489 GFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVI 548
G+ DV +L +DIV S + +G P I+ AM G PV+ + ++ V G
Sbjct: 266 GYRRDVPQILRESDIVTLTSKR--EGLPKSIMEAMAAGKPVVATNVRGSRDLVEHGKTGF 323
Query: 549 FFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
+ EGL A I NG+L K T+ AGR K+
Sbjct: 324 LVDLGDDEGLFFALKSLIENGELRK---TMGKAGREKIKD 360
>gi|345017396|ref|YP_004819749.1| group 1 glycosyl transferase [Thermoanaerobacter wiegelii Rt8.B1]
gi|344032739|gb|AEM78465.1| glycosyl transferase group 1 [Thermoanaerobacter wiegelii Rt8.B1]
Length = 372
Score = 43.5 bits (101), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 66/137 (48%), Gaps = 7/137 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ F+ G+ + + LQ++ L L + G+ D+ L ++ + S E+GF
Sbjct: 228 YFFVAGDGP--FREELQKMIDSLNLKDRFFL-LGYRNDIPSFLRNLNVFVLPSH--EEGF 282
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK-- 573
++ A++ G+PV+ I E + +G + I +K++PE L++A F+ + +L K
Sbjct: 283 GISVIEALSEGVPVVATKVGGIPEIIQDGVEGILVEKESPEELAKAIEKFLKDEELRKNI 342
Query: 574 FARTVASAGRLHAKNML 590
R S G+ M+
Sbjct: 343 SLRGKESVGKYSCNKMI 359
>gi|392941256|ref|ZP_10306900.1| glycosyltransferase [Thermoanaerobacter siderophilus SR4]
gi|392293006|gb|EIW01450.1| glycosyltransferase [Thermoanaerobacter siderophilus SR4]
Length = 372
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 66/137 (48%), Gaps = 7/137 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ F+ G+ + + LQ++ L L + G+ D+ L ++ + S E+GF
Sbjct: 228 YFFVAGDGP--FREELQKMIDSLNLKDRFFL-LGYRNDIPSFLRNLNVFVLPSH--EEGF 282
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK-- 573
++ A++ G+PV+ I E + +G + I +K++PE L++A F+ + +L K
Sbjct: 283 GISVIEALSEGVPVVATKVGGIPEIIQDGVEGILVEKESPEELAKAIEKFLKDEELRKNI 342
Query: 574 FARTVASAGRLHAKNML 590
R S G+ M+
Sbjct: 343 SLRGKESVGKYSCNKMI 359
>gi|326389870|ref|ZP_08211434.1| glycosyl transferase group 1 [Thermoanaerobacter ethanolicus JW
200]
gi|325994138|gb|EGD52566.1| glycosyl transferase group 1 [Thermoanaerobacter ethanolicus JW
200]
Length = 372
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 66/137 (48%), Gaps = 7/137 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ F+ G+ + + LQ++ L L + G+ D+ L ++ + S E+GF
Sbjct: 228 YFFVAGDGP--FREELQKMIDSLNLKDRFFL-LGYRNDIPSFLRNLNVFVLPSH--EEGF 282
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK-- 573
++ A++ G+PV+ I E + +G + I +K++PE L++A F+ + +L K
Sbjct: 283 GISVIEALSEGVPVVATKVGGIPEIIQDGVEGILVEKESPEELAKAIEKFLKDEELRKNI 342
Query: 574 FARTVASAGRLHAKNML 590
R S G+ M+
Sbjct: 343 SLRGKESVGKYSCNKMI 359
>gi|345886326|ref|ZP_08837581.1| hypothetical protein HMPREF0178_00355 [Bilophila sp. 4_1_30]
gi|345038506|gb|EGW42940.1| hypothetical protein HMPREF0178_00355 [Bilophila sp. 4_1_30]
Length = 372
Score = 43.5 bits (101), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
E + + G DV + A++V+ S + +G P ++ AM+ G P++ D P ++ V
Sbjct: 252 EGIIEYLGVTRDVRPYVGQANVVVLPSWR--EGLPCSLMEAMSMGRPIVATDVPGCRDVV 309
Query: 542 AEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G PE L++A F+ + L+
Sbjct: 310 VDGKNGFLVPVRTPEALAKALESFLEDSALT 340
>gi|429200227|ref|ZP_19191939.1| glycosyltransferase, group 1 family protein [Streptomyces ipomoeae
91-03]
gi|428664058|gb|EKX63369.1| glycosyltransferase, group 1 family protein [Streptomyces ipomoeae
91-03]
Length = 365
Score = 43.5 bits (101), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 4/102 (3%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L++ + LG+ + SVR G DV G L A + + S +GFP ++ AM G+P
Sbjct: 234 LRKQCTALGI-DGSVRWMGRTSDVQGALRGASV--FALSSRGEGFPLALMEAMAMGVPCA 290
Query: 531 TPDF-PIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
D P + E V +G + N L+R L +S+ L
Sbjct: 291 AFDCAPGVHEMVRDGEDGLLAHPGNTGELARKLDLLMSDRGL 332
>gi|317485313|ref|ZP_07944193.1| glycosyl transferase group 1 [Bilophila wadsworthia 3_1_6]
gi|316923439|gb|EFV44645.1| glycosyl transferase group 1 [Bilophila wadsworthia 3_1_6]
Length = 372
Score = 43.5 bits (101), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
R G+LE+ G DV + A++V+ S + +G P ++ AM+ G P++ D P
Sbjct: 251 REGILEY----LGVTRDVRPYVGQANVVVLPSWR--EGLPCSLMEAMSMGRPIVATDVPG 304
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
++ V +G PE L++A F+ + L+
Sbjct: 305 CRDVVVDGKNGFLVPVRTPEALAKALESFLEDSALT 340
>gi|288963135|ref|YP_003453414.1| glycosyltransferase [Azospirillum sp. B510]
gi|288915387|dbj|BAI76870.1| glycosyltransferase [Azospirillum sp. B510]
Length = 397
Score = 43.5 bits (101), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 5/99 (5%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAM 523
G+ AL+E A+RLGL VR +G+ + G+ DI + S ++G P++++ AM
Sbjct: 250 GWRTALEEQAARLGL-AGRVRLHGWLDRAVLAGLYRTVDIFVLPSR--DEGMPNVVLEAM 306
Query: 524 TFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
G+PV+ ++ V EG + P+ L+ A
Sbjct: 307 ASGLPVVASAVAGARDLVVEGETGFLVPPEQPDALAGAL 345
>gi|443320891|ref|ZP_21049965.1| glycosyltransferase [Gloeocapsa sp. PCC 73106]
gi|442789396|gb|ELR99055.1| glycosyltransferase [Gloeocapsa sp. PCC 73106]
Length = 371
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 9/144 (6%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
+G A E A G++E G+ D+ L DI + S +G P +++ A
Sbjct: 232 NGVPSAYIEAAVAKGVIEF----LGYLKDMPLQLQGTDIFVL-PSYYREGVPRVLLEAAA 286
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL 584
G+PV+T DFP +E V + + LS A +L +S+ +L + + AGR
Sbjct: 287 SGVPVVTTDFPGCREAVIANETGYLVPPQDSKALSEAIALLLSDAQLRE---NMGQAGRR 343
Query: 585 HAKNMLALD-CVTRYARILENVLN 607
+ + L D + Y R+ +++ +
Sbjct: 344 YILDNLTDDQIIEAYFRVYQDLCH 367
>gi|431931288|ref|YP_007244334.1| glycosyltransferase [Thioflavicoccus mobilis 8321]
gi|431829591|gb|AGA90704.1| glycosyltransferase [Thioflavicoccus mobilis 8321]
Length = 400
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 6/138 (4%)
Query: 470 ALQEVASRLGLLEHSVRHYGF---NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFG 526
ALQ +A++LG+ EH VR G+ G + D+ ++ S QG +++ A+ G
Sbjct: 250 ALQTLAAQLGVAEH-VRFVGYLDRGGPLEDCYSAGDVFVFASRTETQGL--VVLEALALG 306
Query: 527 IPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHA 586
IPV++ +E +A+G + ++D + ++ L + G + RT +
Sbjct: 307 IPVVSTAVMGTREVLADGRGSLIAEEDETDFANKVVRLLRNPGLREELGRTARHYAEEWS 366
Query: 587 KNMLALDCVTRYARILEN 604
+LA + Y ++LE
Sbjct: 367 APVLADRMLAFYRKVLER 384
>gi|307719249|ref|YP_003874781.1| group 1 glycosyl transferase [Spirochaeta thermophila DSM 6192]
gi|306532974|gb|ADN02508.1| glycosyl transferase group 1 [Spirochaeta thermophila DSM 6192]
Length = 370
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 55/112 (49%), Gaps = 6/112 (5%)
Query: 472 QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
+++A RLGL E + GF GD+ VL + I + S +G P I+ AM G+PV+
Sbjct: 235 KKLAYRLGL-EERISFPGFRGDIEAVLAQSQIYVLISHW--EGLPRSILEAMRAGLPVVA 291
Query: 532 PDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
++E V +G + + + L++ I++ +L ++ AGR
Sbjct: 292 SRVGGVEEAVEDGVTGYVVDRRDVDALAQRLERLIADPRLRA---SMGRAGR 340
>gi|169831244|ref|YP_001717226.1| group 1 glycosyl transferase [Candidatus Desulforudis audaxviator
MP104C]
gi|169638088|gb|ACA59594.1| glycosyl transferase, group 1 [Candidatus Desulforudis audaxviator
MP104C]
Length = 389
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 73/174 (41%), Gaps = 19/174 (10%)
Query: 390 EAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNS 449
+ +S E LR+E G D VV ++ + +Y L++ A
Sbjct: 175 QPVPRSGEAEALRRELGLPADRPVVGT--AARLAPQKGVEY----------LLRAAAVLR 222
Query: 450 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 509
EG + + G+ L++++ LG+ R G DV G+L + D+ + S
Sbjct: 223 DEGRPVALVIAGDGP--LKAPLEKMSRALGV---DTRFLGHRPDVAGLLQLFDVFVLPS- 276
Query: 510 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
V +G P +++ AM G PV+ + E V +G +PE L++A +
Sbjct: 277 -VTEGLPLVVLEAMAVGCPVVATKVGGVPEVVEDGRTGRLVPAGDPEALAQAVA 329
>gi|325853252|ref|ZP_08171330.1| glycosyltransferase, group 1 family protein [Prevotella denticola
CRIS 18C-A]
gi|325484376|gb|EGC87300.1| glycosyltransferase, group 1 family protein [Prevotella denticola
CRIS 18C-A]
Length = 378
Score = 43.1 bits (100), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
VR G+ DV + AD++++ S + +GFP+++++A G+P I + E + EG
Sbjct: 260 VRFVGYQRDVRPFFIAADVLVFPSYR--EGFPNVVLQAGAMGLPSIVTNINGCNEIIKEG 317
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
F + + L + + I N F + +AS R
Sbjct: 318 QNGRIFPSKDADALFKEMNWCIEN---KDFIKVMASQSR 353
>gi|339504311|ref|YP_004691731.1| adenylate cyclase [Roseobacter litoralis Och 149]
gi|338758304|gb|AEI94768.1| putative adenylate cyclase [Roseobacter litoralis Och 149]
Length = 588
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 10/90 (11%)
Query: 682 DTISELDWDVLHD-IESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANER 740
D I E D D++ D + + E L + G +D YR R R + E ++
Sbjct: 104 DVIEEGDGDLMGDGVNIAARLEALAVP-------GAICMSEDAYRQVRS--RLEVEISDL 154
Query: 741 DEGELERTGQPVCIYEIYSGSGAWPFLHHG 770
E EL+ +P+ +Y IYSG+ P L HG
Sbjct: 155 GEHELKNIAEPMRVYAIYSGTMTAPHLEHG 184
>gi|407010196|gb|EKE25156.1| glycosyl transferase group 1 [uncultured bacterium]
Length = 373
Score = 43.1 bits (100), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 492 GDVNGVLLMADIVLYGSSQVEQGF--PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIF 549
D+ + +ADIV++ + ++ F P ++ AM PVI D PI+KE+ + VI
Sbjct: 261 SDMPKIYNIADIVIFPVADMKGKFDVPLAVIEAMACEKPVIISDLPILKEFASSDNSVII 320
Query: 550 FQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT-RYARILEN 604
+N + + L SN K + + + A R+ A+ ++ V +Y I +N
Sbjct: 321 ETGNNEQMICEIVRL--SNNK--EVCQKIGQAARIFAQENFDINQVAQKYEEIYKN 372
>gi|281424216|ref|ZP_06255129.1| putative lipopolysaccharide biosynthesis protein [Prevotella oris
F0302]
gi|281401485|gb|EFB32316.1| putative lipopolysaccharide biosynthesis protein [Prevotella oris
F0302]
Length = 392
Score = 43.1 bits (100), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 5/102 (4%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
+ +V G+ DV + AD++++ S + +GFP+++++A G+P I D E +
Sbjct: 271 DTNVLFVGYQNDVRPFFIAADVLVFPSYR--EGFPNVVLQAGAMGLPAIVTDINGCNEII 328
Query: 542 AEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
E F N E L A + + N + K +AS R
Sbjct: 329 QENVNGKIFSPRNAEALYEAMNWCLDNKETVK---NMASCSR 367
>gi|154494115|ref|ZP_02033435.1| hypothetical protein PARMER_03460 [Parabacteroides merdae ATCC
43184]
gi|154086375|gb|EDN85420.1| glycosyltransferase, group 1 family protein [Parabacteroides merdae
ATCC 43184]
Length = 312
Score = 43.1 bits (100), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
NS D +++L+++ + GL + G D+N +L ++DI L S +G I+
Sbjct: 176 NSED--SNSLRQLIAEAGL-NRKIMSLGIRNDINDILNISDIYLQPSRT--EGLSLSIME 230
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
A+ + +PVI I E V EG F+K+N E L+ + ++N
Sbjct: 231 ALNYSLPVIGTRVGGIPEIVHEGENGYLFEKENVEELADRIEILVNN 277
>gi|349699669|ref|ZP_08901298.1| glycosyltransferase [Gluconacetobacter europaeus LMG 18494]
Length = 401
Score = 43.1 bits (100), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 10/110 (9%)
Query: 488 YGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQV 547
+G+ DV V+ ADI + S +G P ++ AM G+PV+ D E V G
Sbjct: 258 FGYRADVARVMAAADIFVLPSH--FEGLPMSVIEAMLCGLPVVASDISGPCEQVVSGQTG 315
Query: 548 IFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTR 597
+ N L+ A + +G L R + AGR M AL C T+
Sbjct: 316 LLVPPGNVPRLAAALEHLVHDGALR---RHMGEAGR-----MRALACYTQ 357
>gi|15890377|ref|NP_356049.1| glycosyltransferase [Agrobacterium fabrum str. C58]
gi|15158594|gb|AAK88834.1| glycosyltransferase [Agrobacterium fabrum str. C58]
Length = 787
Score = 43.1 bits (100), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 65/139 (46%), Gaps = 7/139 (5%)
Query: 473 EVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITP 532
E A ++ + +++ GF +V+G+ ++D+ L S + FP +++A G+P IT
Sbjct: 634 EDARKIAASDPTIQFLGFISEVHGLYRLSDVALLPSRYPGESFPLSLIQAFQVGVPCITT 693
Query: 533 DFPIIKEYVAEGAQ---VIFFQKDNPEGLSRAFSLF---ISNGKLSKFARTVA-SAGRLH 585
D I+ G++ +IF D+ E S F I L + R A S G+
Sbjct: 694 DVGEIRSMTTLGSKQAGIIFQPIDDTEMFVDELSQFMERILEASLREELRQTANSLGKCF 753
Query: 586 AKNMLALDCVTRYARILEN 604
LA + + +Y ++++
Sbjct: 754 EIEQLASEYLQQYLLLIKS 772
>gi|404254479|ref|ZP_10958447.1| group 1 glycosyl transferase [Sphingomonas sp. PAMC 26621]
Length = 399
Score = 43.1 bits (100), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 2/92 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+++A+RLG+ + V GF D+ ++ D+VL+ S++ E F +IV M G+PVI
Sbjct: 256 LRDLAARLGIADR-VHFCGFRQDIARLMSAMDVVLHCSTEPEP-FGRVIVEGMMAGVPVI 313
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
E +A + +P+ L+ A
Sbjct: 314 ATHGGGASEIIAGSGAGLLIAPGDPQALAEAI 345
>gi|410449930|ref|ZP_11303977.1| glycosyltransferase, group 1 family protein [Leptospira sp. Fiocruz
LV3954]
gi|410016216|gb|EKO78301.1| glycosyltransferase, group 1 family protein [Leptospira sp. Fiocruz
LV3954]
Length = 407
Score = 43.1 bits (100), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 4/115 (3%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
L G Y L++++ + E+ V GF+ +VN L D+++ SS E F +
Sbjct: 267 LLGPGDPIYMKYLRDISLKY---ENPVSFLGFHNNVNQFLEQVDLLVVPSSNFE-SFGMV 322
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
I+ M PVI DF +KE V +G N + + A L + N LSK
Sbjct: 323 ILEGMRMKKPVICTDFGGMKEVVVDGETGFVVPAMNLKMMGMAIDLLLKNPNLSK 377
>gi|365856631|ref|ZP_09396644.1| glycosyltransferase, group 1 family protein [Acetobacteraceae
bacterium AT-5844]
gi|363717691|gb|EHM01055.1| glycosyltransferase, group 1 family protein [Acetobacteraceae
bacterium AT-5844]
Length = 487
Score = 43.1 bits (100), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
S +++GF ++ A G PVI D P+ +E G QV +F +PEGL+RA
Sbjct: 403 SSIDEGFGLPVIEAAAHGAPVIASDLPVFRE--VGGDQVTYFPVASPEGLARAM 454
>gi|418045202|ref|ZP_12683298.1| glycosyl transferase group 1 [Thermotoga maritima MSB8]
gi|351678284|gb|EHA61431.1| glycosyl transferase group 1 [Thermotoga maritima MSB8]
Length = 391
Score = 43.1 bits (100), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V GF+ ++ +L ADIV+ S + + FP++++ A G PV+T D E
Sbjct: 268 LEKQVFFKGFSANIIDEMLKADIVISTSVE-DDPFPTILLEASMIGKPVVTTDAGGSGEI 326
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
V G +KD E L+ A I N +L A L ++LA + R+
Sbjct: 327 VKNGETGFVVKKDKRE-LASALCELIQNKELRDVFSKNARDRFLKEFSILAFE--KRFLE 383
Query: 601 ILENVL 606
LE++L
Sbjct: 384 TLESIL 389
>gi|50954848|ref|YP_062136.1| mannosyltransferase [Leifsonia xyli subsp. xyli str. CTCB07]
gi|50951330|gb|AAT89031.1| mannosyltransferase [Leifsonia xyli subsp. xyli str. CTCB07]
Length = 360
Score = 43.1 bits (100), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 476 SRLGLLEHSVR---HYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITP 532
SRL L R H G + + L + L +S+ ++GF +V +MT G PV+
Sbjct: 229 SRLSALAPGTRLVFHDGASDEEYAETLASATALVTASR-DEGFGIPLVESMTLGTPVVVS 287
Query: 533 DFPIIKEYVAEGAQVIFFQKDNPEGLS 559
D PI +E E A ++F D+PE L+
Sbjct: 288 DIPIFREIGGEAA--LYFSPDSPEELA 312
>gi|254446245|ref|ZP_05059721.1| glycosyl transferase, group 1 family protein [Verrucomicrobiae
bacterium DG1235]
gi|198260553|gb|EDY84861.1| glycosyl transferase, group 1 family protein [Verrucomicrobiae
bacterium DG1235]
Length = 364
Score = 43.1 bits (100), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 58/134 (43%), Gaps = 8/134 (5%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGD---VNGV 497
L +A+ +V KFV + D +A+ LGL E +R D
Sbjct: 197 LEAFAQTLAVNPKAKFVLIGNGPAD---EAVDAKIGELGLGESVIRTGRIERDRLMAENY 253
Query: 498 LLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG 557
L+ D+ + S Q P I+ A+ FG+P+I P I E V G+ + F+ D+ G
Sbjct: 254 PLLGDVFITASKTENQ--PVSILEALAFGLPLIGPRAKGIPELVDHGSNGLLFEPDDVYG 311
Query: 558 LSRAFSLFISNGKL 571
+S A + + +L
Sbjct: 312 MSSAMVRLMQDQQL 325
>gi|15643392|ref|NP_228436.1| lipopolysaccharide biosynthesis protein [Thermotoga maritima MSB8]
gi|4981148|gb|AAD35711.1|AE001737_4 lipopolysaccharide biosynthesis protein [Thermotoga maritima MSB8]
Length = 393
Score = 43.1 bits (100), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V GF+ ++ +L ADIV+ S + + FP++++ A G PV+T D E
Sbjct: 270 LEKQVFFKGFSANIIDEMLKADIVISTSVE-DDPFPTILLEASMIGKPVVTTDAGGSGEI 328
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
V G +KD E L+ A I N +L A L ++LA + R+
Sbjct: 329 VKNGETGFVVKKDKRE-LASALCELIQNKELRDVFSKNARDRFLKEFSILAFE--KRFLE 385
Query: 601 ILENVL 606
LE++L
Sbjct: 386 TLESIL 391
>gi|227529796|ref|ZP_03959845.1| glycosyltransferase [Lactobacillus vaginalis ATCC 49540]
gi|227350280|gb|EEJ40571.1| glycosyltransferase [Lactobacillus vaginalis ATCC 49540]
Length = 514
Score = 43.1 bits (100), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 63/135 (46%), Gaps = 6/135 (4%)
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
G + D ++ L+++ L +SV +G+ D N V A +++ S +G P +V
Sbjct: 368 GYANDNFDKTLKKIVKDEDL-GNSVTFHGYTNDPNAVYNDAQLLILPSRA--EGLPLSLV 424
Query: 521 RAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRA-FSLFISNGKLSKFARTV 578
A + G+P++ D + V +G I Q + +GL++A L KL+KF+
Sbjct: 425 EAQSHGLPIVANDIKYGPADVVIDGQDGILTQNGDVDGLAKAIIDLLTDQEKLAKFSENA 484
Query: 579 -ASAGRLHAKNMLAL 592
+ R N++ L
Sbjct: 485 YQDSERYSEPNVMKL 499
>gi|262382505|ref|ZP_06075642.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262295383|gb|EEY83314.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 375
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 3/90 (3%)
Query: 480 LLEHS-VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
+L HS + G+ DV M+D +++ S + +GFP+++++A G+P I D
Sbjct: 250 ILNHSAISFMGWQKDVRPYFAMSDALVFPSYR--EGFPNVVMQAGAMGLPSIVTDINGCN 307
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
E + +G I + E L ++ S F+ N
Sbjct: 308 EIIVDGENGIIIPPKDEEALYKSMSYFLDN 337
>gi|365891125|ref|ZP_09429585.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3809]
gi|365332955|emb|CCE02116.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3809]
Length = 399
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 64/138 (46%), Gaps = 8/138 (5%)
Query: 437 VGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGF--NGDV 494
VG LL AR +G + L G DG AL AS L E++V G N DV
Sbjct: 210 VGDLLQATARLRQ-QGYPVSLALAGRDADGSMAAL---ASSLKT-ENAVSFLGVVPNEDV 264
Query: 495 NGVLLMADIVLYGSSQ-VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKD 553
++ AD+V+ S +G P I A++ P++ D P+ + +A+G + F+
Sbjct: 265 PRLMREADLVVIPSRHDYPEGLPLTIYEALSARTPIVASDHPMFRNALADGESAVIFEAG 324
Query: 554 NPEGLSRAFSLFISNGKL 571
+ L+RA +++ L
Sbjct: 325 DVNQLARAIVKVLNDPPL 342
>gi|325294207|ref|YP_004280721.1| group 1 glycosyl transferase [Desulfurobacterium thermolithotrophum
DSM 11699]
gi|325064655|gb|ADY72662.1| glycosyl transferase group 1 [Desulfurobacterium thermolithotrophum
DSM 11699]
Length = 361
Score = 43.1 bits (100), Expect = 0.95, Method: Composition-based stats.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 3/103 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
LQ + RL L E + + GF D+ + DI + S +G S I+ AM +PV+
Sbjct: 224 LQSLIERLNLQE-NFKLLGFKKDIQNYIKAFDIFVLPSDF--EGLGSSILIAMFLKVPVV 280
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
+ D KE V +G I K NP+ L+ + + KL +
Sbjct: 281 STDAGGTKEVVIDGKTGILVPKKNPQALAEGILRLLEDEKLKE 323
>gi|163848974|ref|YP_001637018.1| group 1 glycosyl transferase [Chloroflexus aurantiacus J-10-fl]
gi|222526928|ref|YP_002571399.1| group 1 glycosyl transferase [Chloroflexus sp. Y-400-fl]
gi|163670263|gb|ABY36629.1| glycosyl transferase group 1 [Chloroflexus aurantiacus J-10-fl]
gi|222450807|gb|ACM55073.1| glycosyl transferase group 1 [Chloroflexus sp. Y-400-fl]
Length = 407
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 65/151 (43%), Gaps = 5/151 (3%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
FVF+ N LQ +A+ G+ + + + LL A VL S+ E GF
Sbjct: 256 FVFVTHNPAQ--RAELQRLAAEAGVAANLHFLGTLSEEQKLALLRASAVLPFPSRYE-GF 312
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
++ M G+PV++ D P+I E + +G + ++ L+ A + + L A
Sbjct: 313 GLPVLEGMAAGVPVVSTDIPVINELIRDGEDGLLVPYNDAAALANAILRLLDDEALR--A 370
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILENVL 606
R +A R A+ VT+ + E V+
Sbjct: 371 RIIAGGRRAIAERFAPQRLVTQVLAVYEQVV 401
>gi|83645503|ref|YP_433938.1| glycosyltransferase [Hahella chejuensis KCTC 2396]
gi|83633546|gb|ABC29513.1| Glycosyltransferase [Hahella chejuensis KCTC 2396]
Length = 418
Score = 43.1 bits (100), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 59/133 (44%), Gaps = 4/133 (3%)
Query: 452 GSFKFVFLCGNSTDGYNDA-LQEVASRLGLL--EHSVRHYGFNGDVNGVLLMADIVLYGS 508
G ++ L G ST+G+ + ++E+ + LL E ++ GF DV+ V D+VL+ S
Sbjct: 260 GKDVYLVLVGGSTEGFGNVYIEEIKKQAELLGVEKNIIWAGFIRDVHEVYRDVDLVLHSS 319
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ E F ++ AM +P+I KE + G + NP+ S A + +
Sbjct: 320 IEPEP-FGLVVTEAMASDVPIIASKLGAPKELINNGVTGLLVDPRNPKEFSMAIKKVVMD 378
Query: 569 GKLSKFARTVASA 581
+ A A
Sbjct: 379 AAFRELVTQNAKA 391
>gi|119488484|ref|ZP_01621657.1| glycosyltransferase, family 4 [Lyngbya sp. PCC 8106]
gi|119455295|gb|EAW36435.1| glycosyltransferase, family 4 [Lyngbya sp. PCC 8106]
Length = 407
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 70/154 (45%), Gaps = 9/154 (5%)
Query: 440 LLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLL 499
+L+K + +G V + G G++ ++E+ S G+ + V ++ F D +
Sbjct: 173 ILLKALEKLGNQGEQFGVLITG---PGWDKVVKEIKS-YGI--NQVYYFPFLPDRLMPIC 226
Query: 500 MADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS 559
+ LY + +G P ++ M G+PV+T I+K+Y+ +G + KD+ E +
Sbjct: 227 YNALDLYVVTSKVEGGPVPVLNCMACGVPVVTTPVGIVKDYLEDGVNGLIVPKDDAEATA 286
Query: 560 RAFSLFISNGKLSKFARTVASAGRLHAKNMLALD 593
A S + + L + +A AG N L D
Sbjct: 287 NAISRLLKSEDLRE---QLAKAGLETVNNYLTWD 317
>gi|338992027|ref|ZP_08634807.1| Glycosyl transferase, group 1 [Acidiphilium sp. PM]
gi|338205064|gb|EGO93420.1| Glycosyl transferase, group 1 [Acidiphilium sp. PM]
Length = 355
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 61/145 (42%), Gaps = 12/145 (8%)
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNSTDGY---------NDALQEVASRLGLLEHSVR 486
+ P++I +R + +G + D Y D L+ +A +LG+ VR
Sbjct: 162 ETAPVVITLSRLHPTKGIDTLIQALVKLDDVYLWIAGDGPLRDDLELLAEKLGVAPR-VR 220
Query: 487 HYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQ 546
G+ D +L ADI + S + F ++I+ A GIP++ D K +V
Sbjct: 221 FLGWRTDRGSLLRAADICVLPSRY--EPFGTVIIDAWIAGIPLVACDSAGPKAHVRTREN 278
Query: 547 VIFFQKDNPEGLSRAFSLFISNGKL 571
+ KDNP+ L A + N L
Sbjct: 279 GMLVPKDNPDALGEAIRELLGNDGL 303
>gi|425466594|ref|ZP_18845892.1| putative Glycosyl transferase, group 1 [Microcystis aeruginosa PCC
9809]
gi|389830859|emb|CCI26867.1| putative Glycosyl transferase, group 1 [Microcystis aeruginosa PCC
9809]
Length = 345
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 66/142 (46%), Gaps = 13/142 (9%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
L G S Y LQ +AS+L + + GF + ++L +D+ ++ S +GF
Sbjct: 205 LIGGSNIQYIQNLQSLASKLDVGQRC-YFLGFVDNPWEIILESDVFIFPSRS--EGFGIA 261
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTV 578
+V A + GIP+I D P +E + + +V FF+ D+ + L+ + +
Sbjct: 262 LVEAASLGIPIICSDIPTFRE-MFDDNEVTFFKLDDIDDLNNKILMLDE-------IKDK 313
Query: 579 ASAGRLHAKNMLALD--CVTRY 598
S +L A+ + +L+ C Y
Sbjct: 314 VSLAKLKAEKLFSLESMCYNYY 335
>gi|384134495|ref|YP_005517209.1| group 1 glycosyl transferase [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339288580|gb|AEJ42690.1| glycosyl transferase group 1 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 500
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 11/119 (9%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE+ VR G DVNG L DI + SS +GFP V A+ G P++ + + E
Sbjct: 365 LENVVRFAGPTDDVNGALNSGDIAVMSSSS--EGFPYAAVEAVMAGRPIVATNVGGMSEI 422
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN-GKLSKFARTVASAGRLHAKNMLALDCVTRY 598
V + I P L+ A + +N G+L A+ GRL + MLA + R+
Sbjct: 423 V-QPPYGILVPPRRPRELADAITRLAANRGQL-------AALGRLGRERMLAEFTLDRF 473
>gi|406977632|gb|EKD99753.1| glycosyl transferase group 1 [uncultured bacterium]
Length = 402
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 63/138 (45%), Gaps = 6/138 (4%)
Query: 471 LQEVASRLGLLEHSVRHYGFN-GDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
L+ +RLGL E V H N V LL +DI + + + E GFP +V AM +P+
Sbjct: 263 LKNNITRLGLSEKFVFHGKLNRQQVTERLLQSDIYVMPTLRAE-GFPMTLVEAMFASLPI 321
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
I + I + V + + + G + I++ L T+ GR+ A+N
Sbjct: 322 IANNIGGISDAVEDAKTGYLIKVGDLSGFKAKLTTLIADTSLRA---TLGQNGRIKAQNE 378
Query: 590 LALDC-VTRYARILENVL 606
L+ + +Y + ++ VL
Sbjct: 379 FTLETMLNKYQQEMQKVL 396
>gi|193215650|ref|YP_001996849.1| group 1 glycosyl transferase [Chloroherpeton thalassium ATCC 35110]
gi|193089127|gb|ACF14402.1| glycosyl transferase group 1 [Chloroherpeton thalassium ATCC 35110]
Length = 387
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF-SLFISNGKLSKF 574
P+ + ++ GIP++ D P I++++AEG + +F +NPE L+ A +F S F
Sbjct: 299 PNKLFDYLSRGIPIVASDLPSIRDFIAEGNEGLFVPPENPEALAAAIRKIFESEQSYEAF 358
Query: 575 -ARTVASAGRL----HAKNML 590
AR SA + A+NM+
Sbjct: 359 HARARKSAIKYLWENQARNMI 379
>gi|377830918|ref|ZP_09813908.1| glycosyltransferase protein [Lactobacillus mucosae LM1]
gi|377555202|gb|EHT16891.1| glycosyltransferase protein [Lactobacillus mucosae LM1]
Length = 535
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 65/137 (47%), Gaps = 6/137 (4%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
L G + D ++ L+ S +E+SV G+ DVN V A +++ S +G P
Sbjct: 388 LWGYANDNFDQKLKAQVSA-EQIENSVTFCGYTTDVNAVYERAQLLILPSR--AEGLPLS 444
Query: 519 IVRAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRA-FSLFISNGKLSKF-A 575
+V A + G+P++ D + V + + + + +GL++A SL +L++F A
Sbjct: 445 LVEAQSHGLPIVANDIKYGPADVVIDQQDGLLTKNGDIDGLAQAIISLLTDQKRLAEFSA 504
Query: 576 RTVASAGRLHAKNMLAL 592
+ R A N++ L
Sbjct: 505 HAYQDSARYSAANVMKL 521
>gi|373119167|ref|ZP_09533276.1| hypothetical protein HMPREF0995_04112 [Lachnospiraceae bacterium
7_1_58FAA]
gi|371664517|gb|EHO29689.1| hypothetical protein HMPREF0995_04112 [Lachnospiraceae bacterium
7_1_58FAA]
Length = 368
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 3/97 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V+ GF D+ + +D LY +S + LI+ AM G+PVI D +
Sbjct: 244 LEDCVKFIGFRKDIKNLYKGSD--LYVNSSQHEALSFLIIEAMAAGLPVIATDMGGNSDI 301
Query: 541 VAEGAQV-IFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
V + A + + DNP+ ++ A F+ + + R
Sbjct: 302 VNDEAGCGMLVEYDNPDSMAVAMKCFLEDPAFAARCR 338
>gi|262392542|ref|YP_003284396.1| glycosyl transferase group 1 [Vibrio sp. Ex25]
gi|262336136|gb|ACY49931.1| glycosyl transferase group 1 [Vibrio sp. Ex25]
Length = 383
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 3/98 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
++++V + G DV + + ++ +G P ++ AM G P++T D P +E
Sbjct: 250 IDNAVEYKGATADVRP--FIEECTVFVLPSYHEGLPRTVLEAMAIGRPILTTDVPGCRET 307
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNG-KLSKFART 577
V +G ++ N E L+ F+ N KLS A+
Sbjct: 308 VVDGLNGFLVERGNVEQLASRMIWFVENSEKLSGMAQV 345
>gi|423522477|ref|ZP_17498950.1| hypothetical protein IGC_01860 [Bacillus cereus HuA4-10]
gi|401174413|gb|EJQ81621.1| hypothetical protein IGC_01860 [Bacillus cereus HuA4-10]
Length = 400
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 3/104 (2%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++A++LG+ + V G+ D+ +L M+D+ + +S +G P I+ AM G+P
Sbjct: 257 EECKKIATQLGV-SNMVHFLGYRNDIASLLQMSDLAV--ASSYREGLPVNIMEAMACGLP 313
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
VI D +E + +D+ + +S + N KL
Sbjct: 314 VIVTDNRGHRELIISNKNGWIIDRDDIKTMSERINFISKNTKLQ 357
>gi|389737375|ref|ZP_10190822.1| glycosyl transferase family protein [Rhodanobacter sp. 115]
gi|388435974|gb|EIL92861.1| glycosyl transferase family protein [Rhodanobacter sp. 115]
Length = 183
Score = 42.7 bits (99), Expect = 1.2, Method: Composition-based stats.
Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 8/120 (6%)
Query: 426 LSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSV 485
L WD V + I+ R +G L G+S G A+ E R + E V
Sbjct: 7 LLWDKGVGEY------IQACRYLLAQGREIRFLLAGSSDLGNPAAIPEDVIRAWVNEGIV 60
Query: 486 RHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
G D+ +L D+V+ S + +G P+++V A G+P+I D P +E V +GA
Sbjct: 61 EWLGHVSDMRALLGSVDLVVLPSYR--EGLPTILVEAAACGLPLIATDVPGCREVVTDGA 118
>gi|448681497|ref|ZP_21691588.1| group 1 glycosyl transferase [Haloarcula argentinensis DSM 12282]
gi|445767367|gb|EMA18470.1| group 1 glycosyl transferase [Haloarcula argentinensis DSM 12282]
Length = 380
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 6/128 (4%)
Query: 465 DGYN-DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAM 523
DG N + LQ+ R GL E SV G+ DV +L AD+ + S + +G P +I A
Sbjct: 240 DGENREELQDAVKRRGL-EDSVSFLGYCDDVPSLLATADVFVLPSYR--EGMPRVITEAR 296
Query: 524 TFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS-RAFSLFISNGKLSKFA-RTVASA 581
G+P+++ I E V+ G + + LS R L S + + + RT A
Sbjct: 297 AAGLPIVSTAVAGIPEQVSHGETGFLVEPGDVSELSARIEQLVASPSRRREMSERTTADL 356
Query: 582 GRLHAKNM 589
R + M
Sbjct: 357 NRFDIERM 364
>gi|373955345|ref|ZP_09615305.1| glycosyl transferase group 1 [Mucilaginibacter paludis DSM 18603]
gi|373891945|gb|EHQ27842.1| glycosyl transferase group 1 [Mucilaginibacter paludis DSM 18603]
Length = 372
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G P I+ A + G+P+++ IKE V G + + EG++ LF +N +
Sbjct: 278 EGTPVTILEAGSSGLPIVSTLHAGIKEAVVNGKTGYLVAEHDIEGMAEKMMLFAAN---A 334
Query: 573 KFARTVASAGRLH 585
KFA V SAGR H
Sbjct: 335 KFAEQVGSAGREH 347
>gi|448576703|ref|ZP_21642579.1| group 1 glycosyl transferase [Haloferax larsenii JCM 13917]
gi|445728891|gb|ELZ80491.1| group 1 glycosyl transferase [Haloferax larsenii JCM 13917]
Length = 411
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+ +SV G DV VL AD+ L+ +G P ++ AM G+P++ D P E
Sbjct: 288 VSNSVHFLGTRDDVPNVL--ADLDLFAFPSHWEGLPGALLEAMAAGVPIVATDIPGNDEL 345
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN 568
V +G + +P L A S +++
Sbjct: 346 VTDGETGVLVPAHDPAALCGAISGVLAH 373
>gi|428206249|ref|YP_007090602.1| group 1 glycosyl transferase [Chroococcidiopsis thermalis PCC 7203]
gi|428008170|gb|AFY86733.1| glycosyl transferase group 1 [Chroococcidiopsis thermalis PCC 7203]
Length = 378
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/165 (23%), Positives = 70/165 (42%), Gaps = 10/165 (6%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLM 500
+++ A R S G + + G+ + N ++VA R + G D++ +L
Sbjct: 222 ILRIADRMSKNGVNVYFVMVGSHGELLNTLKEKVAERNDM----AMMVGVE-DISELLKT 276
Query: 501 ADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSR 560
AD+ S +E+GF + + A G+PV+ D P I+E + F +N E +
Sbjct: 277 ADLFFLPS--LEEGFGVVAIEAAAAGLPVVATDLPTIREACSPSHHPFMFPPNNDELACK 334
Query: 561 AFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENV 605
+ N +L K +++ R A N + + A + EN
Sbjct: 335 NILSILENQELRK---QLSADARKWANNFSIFSSIKQLAHLYENC 376
>gi|154496600|ref|ZP_02035296.1| hypothetical protein BACCAP_00892 [Bacteroides capillosus ATCC
29799]
gi|150274233|gb|EDN01324.1| glycosyltransferase, group 1 family protein [Pseudoflavonifractor
capillosus ATCC 29799]
Length = 382
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 49/106 (46%), Gaps = 4/106 (3%)
Query: 475 ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDF 534
A LGL + V+ GF D+ + +D LY +S + LI+ AM G+PV+ D
Sbjct: 253 AKELGL-DDCVKFIGFRKDIKNLYKASD--LYVNSSRHEALSFLIIEAMAAGLPVVVTDI 309
Query: 535 PIIKEYVAEGAQV-IFFQKDNPEGLSRAFSLFISNGKLSKFARTVA 579
+ V + + + DNPE ++ A + +L + RT A
Sbjct: 310 AGNPDIVNDQTNCGLLAEYDNPESMAGALKRMMEEPELLERCRTNA 355
>gi|376295981|ref|YP_005167211.1| group 1 glycosyl transferase [Desulfovibrio desulfuricans ND132]
gi|323458542|gb|EGB14407.1| glycosyl transferase group 1 [Desulfovibrio desulfuricans ND132]
Length = 404
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 64/137 (46%), Gaps = 7/137 (5%)
Query: 439 PLLIKYARRNSVE--GSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNG 496
PL ++ RR E GS + GN ++ +A LG+ H V GF D++
Sbjct: 232 PLYLELIRREMTENPGSNINYLIVGNGE--LQADMEALAESLGI-RHRVVFTGFVSDMDT 288
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
V ADI++ S+ +G P ++ A ++G+P++ P + V G + +
Sbjct: 289 VYRGADILVQTSTW--EGTPMCLLEARSYGLPIVAPAVGGNVDVVESGLDGTLYPTGDLA 346
Query: 557 GLSRAFSLFISNGKLSK 573
L ++ ++S+G+L +
Sbjct: 347 ALWEGYAAYVSSGELRQ 363
>gi|313673222|ref|YP_004051333.1| group 1 glycosyl transferase [Calditerrivibrio nitroreducens DSM
19672]
gi|312939978|gb|ADR19170.1| glycosyl transferase group 1 [Calditerrivibrio nitroreducens DSM
19672]
Length = 351
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 78/173 (45%), Gaps = 18/173 (10%)
Query: 388 AVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARR 447
+E ++ +LRKE KDE++ ++ S +E D A+ D + RR
Sbjct: 152 VIEPLVTVEDRLRLRKELCIDKDEVLFGMI--SRIIHEKGHDIALEAFD------EIGRR 203
Query: 448 NSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYG 507
K +F+ G+ + +++ S++GL E +V G +V L M DI L
Sbjct: 204 G------KLIFV-GDFNTEFGQVVKDKISKMGLSE-NVFIVGQQDNVYPYLAMIDIFLAP 255
Query: 508 SSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSR 560
S + + P I+ A+ G+PV+ + I E V G FQ DN + ++R
Sbjct: 256 SRR--EAMPLAILEALGAGLPVVGANTGGIPEAVEHGVNGFVFQSDNAKEMAR 306
>gi|456864900|gb|EMF83274.1| glycosyltransferase, group 1 family protein [Leptospira weilii
serovar Topaz str. LT2116]
Length = 271
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 61/143 (42%), Gaps = 11/143 (7%)
Query: 437 VGPLLIKYARRNSVEGSFKFVF------LCGNSTDGYNDALQEVASRLGLLEHSVRHYGF 490
+G L A VE +F+FV L G Y L++++S E+ V GF
Sbjct: 104 IGSLTPPKAPHTLVE-AFRFVTSKSELALLGPGDPIYIKYLRDISSEY---ENPVSFLGF 159
Query: 491 NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFF 550
+ DVN L D+++ SS E F +I+ +M PVI DF +KE V G
Sbjct: 160 HNDVNQFLEQVDLLVVPSSHFE-SFGMVILESMRMKKPVICTDFGGMKEVVVNGETGFVV 218
Query: 551 QKDNPEGLSRAFSLFISNGKLSK 573
N + + A + N L K
Sbjct: 219 PSKNIKAMGMAIDHLLRNPNLRK 241
>gi|254421922|ref|ZP_05035640.1| glycosyl transferase, group 1 family protein [Synechococcus sp. PCC
7335]
gi|196189411|gb|EDX84375.1| glycosyl transferase, group 1 family protein [Synechococcus sp. PCC
7335]
Length = 379
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 458 FLCGNSTDGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
L G++ G +D + E+ +LGL + VR GF D+ ++ D+V + S+ E
Sbjct: 231 ILVGDALFGEDDYVLELHRQVEQLGL-QDRVRFLGFRSDIPQLMSACDLVTHTSTAPEP- 288
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
F +IV AM G PV+ E V G DNPE L++A S
Sbjct: 289 FGRVIVEAMLCGTPVVAAAAGGATELVDAGKNGWLCPPDNPELLAKAIS 337
>gi|300021978|ref|YP_003754589.1| group 1 glycosyl transferase [Hyphomicrobium denitrificans ATCC
51888]
gi|299523799|gb|ADJ22268.1| glycosyl transferase group 1 [Hyphomicrobium denitrificans ATCC
51888]
Length = 379
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/93 (25%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
+L+ A LG+ S +G DV +L AD+ + +++ ++GF +++ AM GIP+
Sbjct: 236 SLEAHARNLGI-SKSTTFFGNRTDVASLLGQADVFAFSTTR-DEGFGIVLIEAMAAGIPI 293
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
+ D +E +A G + + + L+ A
Sbjct: 294 VATDVAACREVLANGEAGLLVAPSDADALALAL 326
>gi|406831617|ref|ZP_11091211.1| group 1 glycosyl transferase [Schlesneria paludicola DSM 18645]
Length = 372
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 60/138 (43%), Gaps = 7/138 (5%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+E A ++G H + G D + + D+ SS +G + ++ AM+ G PV+
Sbjct: 240 LEEFAKQIGATNH-IHFVGHRDDAPRWMSLFDVFCLASS--FEGMSNSVMEAMSMGKPVL 296
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
D P +E VA+G + + G + I L RT+ AGR +
Sbjct: 297 ASDIPANRELVAQGETGFLPKLTDTVGFMQFLRRLIDEPGL---GRTLGQAGRERIQQSF 353
Query: 591 ALD-CVTRYARILENVLN 607
++ V+ YA I +LN
Sbjct: 354 SIPRMVSAYADIYRQLLN 371
>gi|329962799|ref|ZP_08300700.1| glycosyltransferase, group 1 family protein [Bacteroides fluxus YIT
12057]
gi|328529454|gb|EGF56362.1| glycosyltransferase, group 1 family protein [Bacteroides fluxus YIT
12057]
Length = 389
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D L++ R+ H + +GF DV L AD++++ S + +GFP+++++A G+P
Sbjct: 252 DPLEKDVERIIKQNHKIVIWGFQSDVRPFLASADVLVFPSYR--EGFPNVVLQAGAMGLP 309
Query: 529 VITPDFPIIKEYVAEG 544
I D E + +G
Sbjct: 310 SIVTDINGCNEIIQDG 325
>gi|425733953|ref|ZP_18852273.1| group 1 glycosyl transferase [Brevibacterium casei S18]
gi|425482393|gb|EKU49550.1| group 1 glycosyl transferase [Brevibacterium casei S18]
Length = 376
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
V H G + + LL L +S+ E GF ++ AM G+PV D PI E +
Sbjct: 258 VFHRGVSDEEYADLLAEATALVTASRAE-GFGLPVIEAMAKGVPVAVSDIPIFHEITGDA 316
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSKFART---VASAGRLHAK 587
A +FF +D+P ++ A G+L+ ART A+AG++ ++
Sbjct: 317 A--VFFDEDDPASIAAAV------GELTD-ARTWEETAAAGQVASQ 353
>gi|379724034|ref|YP_005316165.1| galactosyltransferase [Paenibacillus mucilaginosus 3016]
gi|378572706|gb|AFC33016.1| galactosyltransferase [Paenibacillus mucilaginosus 3016]
Length = 396
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNST----DGYNDALQEVASRLGLLEHSVRHYGFNGDVNG 496
LI+ A R +G+ + G++T Y + L +A LG+ + V G DV
Sbjct: 220 LIRAAERFKADGNRGHFVIVGDATFEHEKVYKEKLFALAGELGMTDR-VHFLGLRRDVPE 278
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
+L AD+ + S V FP+ ++ AM G+PVI+ D + E V
Sbjct: 279 LLCQADVFVL--SSVYDTFPTAVLEAMRAGLPVISTDVGGVPEMV 321
>gi|418051272|ref|ZP_12689357.1| glycosyl transferase group 1 [Mycobacterium rhodesiae JS60]
gi|353184929|gb|EHB50453.1| glycosyl transferase group 1 [Mycobacterium rhodesiae JS60]
Length = 398
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE SVR GF D+ +L D V+ S+ V++ P+ ++ A G+PV+ D +E
Sbjct: 253 LEDSVRRLGFRNDIADILGTVDGVV--SASVDEALPTALIEAGACGLPVVAADAGGTREI 310
Query: 541 VAEG 544
V +G
Sbjct: 311 VVDG 314
>gi|189500750|ref|YP_001960220.1| group 1 glycosyl transferase [Chlorobium phaeobacteroides BS1]
gi|189496191|gb|ACE04739.1| glycosyl transferase group 1 [Chlorobium phaeobacteroides BS1]
Length = 380
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 61/140 (43%), Gaps = 3/140 (2%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
DG + EV +R +E VR G D+ +L ++D++L SS G +L AM
Sbjct: 240 DGPERSEAEVWARENGIERHVRFLGKIDDIVPLLSVSDLMLMPSSGESFGLAAL--EAMA 297
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLS-RAFSLFISNGKLSKFARTVASAGR 583
G+PVI D E+++ G + N +S +A L G+ F+R R
Sbjct: 298 CGVPVIVTDTGGFPEFISNGRHGYLVELGNVGRMSEKALELLTDTGRWEAFSRNCVEQAR 357
Query: 584 LHAKNMLALDCVTRYARILE 603
+ + L Y+ +LE
Sbjct: 358 NYHVSTLVERYEAFYSSLLE 377
>gi|421861286|ref|ZP_16293330.1| glycosyltransferase [Paenibacillus popilliae ATCC 14706]
gi|410829179|dbj|GAC43767.1| glycosyltransferase [Paenibacillus popilliae ATCC 14706]
Length = 387
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 65/141 (46%), Gaps = 9/141 (6%)
Query: 467 YNDALQEVASRLGL-LEHSVRHYGFNGDVNGVLLMADI-VLYGSSQVEQGFPSLIVRAMT 524
Y L + R GL L+ + GF D + + L+ L S E G + A+
Sbjct: 243 YIPTLVSLIRRTGLKLDEHIYPLGFVPDNDVIPLVRQAQALIMPSLSEGGGSYPVEEALK 302
Query: 525 FGIPVITPDFPIIKEYVA-EGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA-G 582
G+PV+ D P+++E+++ A+V++F ++P+ ++RA N +A ASA G
Sbjct: 303 LGVPVLCSDIPVMREHLSRHSAEVVWFDPESPDAITRALEQLFGN-----YAHYKASAMG 357
Query: 583 RLHAKNMLALDCVTRYARILE 603
+H + D +Y +
Sbjct: 358 GIHDSSETWNDIALKYILVFH 378
>gi|392373321|ref|YP_003205154.1| Glycosyl transferase, group 1 [Candidatus Methylomirabilis oxyfera]
gi|258591014|emb|CBE67309.1| putative Glycosyl transferase, group 1 [Candidatus Methylomirabilis
oxyfera]
Length = 409
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 56/128 (43%), Gaps = 6/128 (4%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L+ +V G DV+ +L +DI S + +G P+ ++ AM+ +PV+ + E
Sbjct: 281 LDRTVTLLGIREDVSDILSESDIFCLAS--LWEGLPTCVMEAMSSALPVVATTVGGVPEL 338
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLAL-DCVTRYA 599
V +G + NP L+ A + + +L AGR K L D +
Sbjct: 339 VCDGETGLLVPPRNPSALAEALTKLLDAPRLRA---AFGEAGRSVVKRHFGLTDKIRDLE 395
Query: 600 RILENVLN 607
++ E +L+
Sbjct: 396 KVYEGLLS 403
>gi|224369098|ref|YP_002603262.1| protein RfaG2 [Desulfobacterium autotrophicum HRM2]
gi|223691815|gb|ACN15098.1| RfaG2 [Desulfobacterium autotrophicum HRM2]
Length = 377
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 53/122 (43%), Gaps = 7/122 (5%)
Query: 457 VFLCGNSTDG-YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
V + G +D Y LQE + L + GF D + + AD+ + S +G
Sbjct: 234 VAVAGKLSDSPYCRMLQESLVKENL-KQCFSFLGFRSDTDALYNAADVFVLSSR--SEGL 290
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF-SLFISNGKLSKF 574
P +I+ AM G+PVI I + + G I + NPE LS A NG + KF
Sbjct: 291 PMVILEAMMAGLPVIATRVGGIPKMM--GKNGILVEAANPEELSSAMHCCLFKNGIIEKF 348
Query: 575 AR 576
R
Sbjct: 349 GR 350
>gi|374310437|ref|YP_005056867.1| group 1 glycosyl transferase [Granulicella mallensis MP5ACTX8]
gi|358752447|gb|AEU35837.1| glycosyl transferase group 1 [Granulicella mallensis MP5ACTX8]
Length = 366
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 8/137 (5%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+++A LG+ V G+ DV VL +DI ++ + GF AM G+PV+
Sbjct: 228 LEQMAQSLGI-RSRVTFLGWRKDVGAVLKASDIYVHPTH--SDGFGIAACEAMAAGLPVV 284
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNML 590
D P + + VA G ++F +D+ + L+ + I + + R AG +++
Sbjct: 285 ASDVPGLAQLVA-GVGILFPAEDD-KALASHLNALIQSPERQNEMRM---AGLQRVRHLS 339
Query: 591 ALDCVTRYARILENVLN 607
+ V Y R+ E+VL
Sbjct: 340 IENTVDGYIRMYESVLQ 356
>gi|85708006|ref|ZP_01039072.1| glycosyl transferase group 1 [Erythrobacter sp. NAP1]
gi|85689540|gb|EAQ29543.1| glycosyl transferase group 1 [Erythrobacter sp. NAP1]
Length = 379
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+ H V G D ++ M DI SS+ EQ FP +V AM G+PV PD +K
Sbjct: 254 ISHRVHLPGDVEDPAKLIGMFDIFAL-SSKSEQ-FPISVVEAMAAGLPVAAPDVGDVKSI 311
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
VAE + D+ + L+ S ++N L
Sbjct: 312 VAEANRPFIAAPDSTDALAAMLSELVANEAL 342
>gi|226945015|ref|YP_002800088.1| group 1 glycosyl transferase [Azotobacter vinelandii DJ]
gi|226719942|gb|ACO79113.1| Glycosyl transferase, group 1 [Azotobacter vinelandii DJ]
Length = 451
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 61/138 (44%), Gaps = 6/138 (4%)
Query: 440 LLIKYARRNSVEGSFKFVFLCGNSTDG--YNDALQEVASRLGLLEHSVRHYGFNGDVNGV 497
LL+ AR G + +CG + G + ++A+ LGL H +G++
Sbjct: 271 LLLYAARLLKARGVELQLVVCGATAKGESCKRVVADLAAHLGLPIHQA--GAVSGEIRDT 328
Query: 498 LLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQV--IFFQKDNP 555
LL + S + F ++ AM G PV+ PD+ I E + +G + + F+ +
Sbjct: 329 LLAHSHCVVYPSVNREAFGLVVAEAMGHGTPVLVPDYGGITEVMRDGERAGGLTFKAWDS 388
Query: 556 EGLSRAFSLFISNGKLSK 573
L+R + ++N L +
Sbjct: 389 GDLARQLEVLLANASLHR 406
>gi|386713280|ref|YP_006179603.1| glycosyltransferase EpsD [Halobacillus halophilus DSM 2266]
gi|384072836|emb|CCG44326.1| probable glycosyltransferase EpsD [Halobacillus halophilus DSM
2266]
Length = 383
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 472 QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
+E+A++L + +H V G+ D+ G++ M D+V +S + +G P I+ AM G+P++
Sbjct: 243 KELAAKLEI-DHMVDFLGYCDDIKGIVSMCDMV--AASSLREGLPVNIMEAMACGLPIVA 299
Query: 532 PD 533
D
Sbjct: 300 ID 301
>gi|448746273|ref|ZP_21727941.1| Glycosyl transferase, group 1 [Halomonas titanicae BH1]
gi|445566135|gb|ELY22242.1| Glycosyl transferase, group 1 [Halomonas titanicae BH1]
Length = 402
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 5/121 (4%)
Query: 455 KFVFLCGNSTDGYNDAL--QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVE 512
+ VF D D + QE+ + +G E S+ H G+ DV L + + + S
Sbjct: 237 EVVFQLAGDIDSNPDTISQQELDTWVG--EGSLEHLGWLNDVRLALAESSVFVL-PSYYR 293
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G P I+ AM+ G VIT D P +E V G N + L +A FI L
Sbjct: 294 EGTPRSILEAMSMGRAVITTDAPGCRETVTNGDNGYLIPVKNVDALVKAMQHFIDRPALV 353
Query: 573 K 573
K
Sbjct: 354 K 354
>gi|251796452|ref|YP_003011183.1| group 1 glycosyl transferase [Paenibacillus sp. JDR-2]
gi|247544078|gb|ACT01097.1| glycosyl transferase group 1 [Paenibacillus sp. JDR-2]
Length = 387
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 8/141 (5%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V G DV V+ MAD++L S + + F + + AM G+P + + I E
Sbjct: 253 LEDRVHFLGKQEDVAQVISMADLLLLPSEK--ESFGLVALEAMACGVPTVGSNAGGIPEL 310
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCV-TRYA 599
V G + EG++ + + + KL + + A A+N D + T+Y
Sbjct: 311 VTHGETGYLCPIGDVEGMAAYATQLLKDEKLHQ---SFKEACLFRARNEFCNDVITTQYE 367
Query: 600 RILENVLNFPSDALLPGPISQ 620
+I VL +D LP P+ +
Sbjct: 368 QIYYRVLGMEAD--LPMPVCE 386
>gi|406927156|gb|EKD63231.1| glycosyl transferase [uncultured bacterium]
Length = 384
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFG 526
D L +A LG+ E V+ G + V +L AD+ + S +GF V AM G
Sbjct: 247 DELSHLAKELGV-EDRVKFLGTLPHKMVPDLLRSADLFVRPSRA--EGFGVSFVEAMACG 303
Query: 527 IPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF-SLFISNGKL 571
+PVIT I ++V +G I + D+P+GL+ A ++F GKL
Sbjct: 304 VPVITCPSGGIVDFVVDGETGILVKPDDPKGLAEAIEAVFSDVGKL 349
>gi|53804222|ref|YP_113890.1| glycosyl transferase family protein [Methylococcus capsulatus str.
Bath]
gi|53757983|gb|AAU92274.1| glycosyl transferase, group 1 family protein [Methylococcus
capsulatus str. Bath]
Length = 409
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 54/116 (46%), Gaps = 6/116 (5%)
Query: 493 DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQK 552
+++G AD + S +G P +++ AM G+P IT + E + +GA+ +
Sbjct: 297 EISGYYAAADAFVLPS--FAEGLPVVLMEAMALGVPCITTHITGVPELIKDGAEGLLVAP 354
Query: 553 DNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENVLN 607
+ +GL A + + L AR +A AGR+ L+ + R A + E L+
Sbjct: 355 SDVDGLVAAIERLMDDPAL---ARRLAEAGRIKVLAEYELNGSIARLADVFERRLS 407
>gi|226311240|ref|YP_002771134.1| glycosyltransferase [Brevibacillus brevis NBRC 100599]
gi|226094188|dbj|BAH42630.1| putative glycosyltransferase [Brevibacillus brevis NBRC 100599]
Length = 378
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 5/112 (4%)
Query: 451 EGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQ 510
+G+ ++ L G DG A+ E + LE VR GF D+ L D ++ S
Sbjct: 229 KGAAPYLVLIG---DGSERAMLEAKVKELGLESRVRFAGFRQDIPACLHALDGFVH--SS 283
Query: 511 VEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
+ +G I+ AM +PV+ + +KE+V +G + + NP L++A
Sbjct: 284 LYEGLGYTIIEAMASEVPVVASNVGGVKEFVFDGETGLIVEPGNPALLAQAM 335
>gi|4927425|gb|AAD33103.1| glucosyltransferase I [Pseudomonas aeruginosa PAO1]
Length = 373
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 51/234 (21%), Positives = 101/234 (43%), Gaps = 38/234 (16%)
Query: 324 QEDSLANRLPVYVERG-FQNLLSYWKSVF---SRVNVIVFPDYTLPML--YSVLDAGNFF 377
+E + R P+Y + G +++ Y ++VF S+ +++ + P+L + A F
Sbjct: 104 EEKAQTLRNPLYRQWGRYRHFAGYERAVFDPASKTEILMISEVQQPLLVKHYGTQAERFH 163
Query: 378 VIPG-------SPADVWAVEAYSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDY 430
++P +PA+ V A + R+E G +D++++V +GS F L
Sbjct: 164 LLPPGISQDRRAPANAADVRA--------EFRREFGLEEDDLLLVQIGSGFKTKGLDRSL 215
Query: 431 AVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGF 490
A+ + L + R ++ FL + G ND +Q + R
Sbjct: 216 K-ALSALPKALRRRTRLIAIGQDDPKPFLLQIAALGLNDQVQILKGR------------- 261
Query: 491 NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
D+ LL AD++++ + G ++++ A+ G+PV+ D Y+AE
Sbjct: 262 -SDIPRFLLGADLLIHPAYNENTG--TVLLEALVSGLPVLVTDVCGYAHYIAEA 312
>gi|448591913|ref|ZP_21651288.1| group 1 glycosyl transferase [Haloferax elongans ATCC BAA-1513]
gi|445733202|gb|ELZ84777.1| group 1 glycosyl transferase [Haloferax elongans ATCC BAA-1513]
Length = 411
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+ +S+ G DV VL AD+ L+ +G P ++ AM G+P++ D P E
Sbjct: 288 VSNSIHFLGTRDDVPNVL--ADLDLFAFPSHWEGLPGALLEAMAAGVPIVATDIPGNDEL 345
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN 568
V +G + +P L A S +++
Sbjct: 346 VTDGETGVLVPAHDPAALCGAISGVLAH 373
>gi|383781747|ref|YP_005466314.1| putative glycosyltransferase [Actinoplanes missouriensis 431]
gi|381374980|dbj|BAL91798.1| putative glycosyltransferase [Actinoplanes missouriensis 431]
Length = 408
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 59/118 (50%), Gaps = 9/118 (7%)
Query: 465 DGYNDALQEVASRLGLLEHSVR---HYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
D Y D+L+ +A G+ + VR Y D++ ++ AD V+ EQ +++
Sbjct: 243 DQYRDSLRALAEERGVAD-CVRWSDVYLKPDDLSRLICSADAVVLPYDSTEQVTSGVLIE 301
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GKLSKFA 575
A+ GIPV+ +FP E +A+G ++ +D PE ++ A ++ G+L+ A
Sbjct: 302 AVAAGIPVVATEFPHAVELLADGPGLLVPHQD-PEAMAMAIRRVLAEPGLPGRLTGLA 358
>gi|392989004|ref|YP_006487597.1| group 1 glycosyl transferase [Enterococcus hirae ATCC 9790]
gi|392336424|gb|AFM70706.1| glycosyl transferase group 1 [Enterococcus hirae ATCC 9790]
Length = 363
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 67/134 (50%), Gaps = 6/134 (4%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
F+CG T L++ LGL E V G+ D+ ++ M+D+ L+ S + +G P
Sbjct: 224 FICG--TGPLKQTLEQKIKFLGL-EKKVSLLGYRKDIIQIMKMSDLFLFPS--LREGLPV 278
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVA-EGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
I+ AM+ G+PV+ + K+ + E +F K++ E ++ F ++K++
Sbjct: 279 SIMEAMSVGLPVVASNIRGNKDLIYDENTGKLFDVKESEELVNILQDFFAGTLPVTKYSE 338
Query: 577 TVASAGRLHAKNML 590
A+ + ++N++
Sbjct: 339 NAAAHIQQFSENVI 352
>gi|325981972|ref|YP_004294374.1| group 1 glycosyl transferase [Nitrosomonas sp. AL212]
gi|325531491|gb|ADZ26212.1| glycosyl transferase group 1 [Nitrosomonas sp. AL212]
Length = 364
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 4/141 (2%)
Query: 467 YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFG 526
Y ++E+A RLG+ + + GF DVN D VL S +G P +I+ AM G
Sbjct: 224 YEKTIKEMADRLGVGDM-IDWIGFAQDVNAEFTQMD-VLVLPSLFGEGMPMVILEAMATG 281
Query: 527 IPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHA 586
+PV+ D I E + G + + + L+ I N + R +A +L
Sbjct: 282 VPVVASDVEGIPEVLEYGKTGLIVPAGDIKQLATVIGDLIDNKYDWRAMRE--AAYQLQV 339
Query: 587 KNMLALDCVTRYARILENVLN 607
+ AR+ + +LN
Sbjct: 340 ASFSDYSMAKGVARVYDAILN 360
>gi|384044141|ref|YP_005497408.1| hypothetical protein BMWSH_p10035 [Bacillus megaterium WSH-002]
gi|345447083|gb|AEN92099.1| hypothetical protein BMWSH_p10035 [Bacillus megaterium WSH-002]
Length = 434
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ L++ LGL V G DV +L DI + + + P I+ AM G
Sbjct: 300 EKLEKQKKALGL--SMVNFLGSRDDVPSLLNKTDIFVLPT--INDSLPISIIEAMHSGTA 355
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKN 588
VI+ + I E + I + +PE L+ A I+N ++ T A + HAKN
Sbjct: 356 VISTNCGGIPELIKHNKTGIIVEPGDPEQLAHALKFLITNKEVRNKMSTTA---KNHAKN 412
Query: 589 MLALDCV 595
L +D +
Sbjct: 413 HLTVDSM 419
>gi|344210662|ref|YP_004794982.1| group 1 glycosyl transferase [Haloarcula hispanica ATCC 33960]
gi|343782017|gb|AEM55994.1| glycosyl transferase group 1 [Haloarcula hispanica ATCC 33960]
Length = 402
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 64/138 (46%), Gaps = 12/138 (8%)
Query: 453 SFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQ 510
+F+F + G+ GY DA+Q A++LG+ + V YG + DV G+L D+ +
Sbjct: 244 TFEF-HIAGDGQSGYIDAMQAKAAKLGVADQIV-WYGRVPHSDVPGLLTRTDVAISPLED 301
Query: 511 VEQ---GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS 567
+E +P+ ++ M G V+ D P + + +G + + GL A +
Sbjct: 302 IESYRISYPAKLLEYMAAGAVVMATDIPPHQRLITDGDNGLLYDG-TATGLCSAIEQCLD 360
Query: 568 N----GKLSKFARTVASA 581
+ G++ + AR A A
Sbjct: 361 DPALAGQIERTARETAEA 378
>gi|428297828|ref|YP_007136134.1| group 1 glycosyl transferase [Calothrix sp. PCC 6303]
gi|428234372|gb|AFZ00162.1| glycosyl transferase group 1 [Calothrix sp. PCC 6303]
Length = 390
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 71/151 (47%), Gaps = 8/151 (5%)
Query: 455 KFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
KFV L G + D ND L + +LGL E G D+ ++ DI+ S+ E
Sbjct: 246 KFV-LAGTNVDNKNDFLLDKIKQLGL-EDKTYLLGERQDIPNIIPALDIMTMASAYGE-A 302
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKF 574
FP ++ +M+ G+P + D V + +V+ NPE L+ A+ I+ ++
Sbjct: 303 FPLIVGESMSCGVPCVVTDVGDASWIVGDTGKVV--PPRNPEALADAWQELIAMDMEARD 360
Query: 575 ARTVASAGRLHAKNMLALD-CVTRYARILEN 604
+A+ R+ N +LD V++Y + E+
Sbjct: 361 KLGIAARSRV--INSFSLDSIVSKYEHLYES 389
>gi|414174740|ref|ZP_11429144.1| hypothetical protein HMPREF9695_02790 [Afipia broomeae ATCC 49717]
gi|410888569|gb|EKS36372.1| hypothetical protein HMPREF9695_02790 [Afipia broomeae ATCC 49717]
Length = 396
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 67/150 (44%), Gaps = 7/150 (4%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
+G L+ A RLG+ + S+ G DV +L AD+ L S Q +GF + ++ AM
Sbjct: 247 EGEGPRLRAQAERLGMAD-SILWLGERRDVEAILPAADVALLVSHQ--EGFSNALIEAMA 303
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL 584
G+PVI + VA+ + P L A L ++ +L + R + A R
Sbjct: 304 QGLPVIATAVGGNLDAVADDVSGLLVPPREPAALGAAI-LGLAMDELKR--RAMGHAARQ 360
Query: 585 HAKNMLALD-CVTRYARILENVLNFPSDAL 613
+M + C+ RY + + P+ A+
Sbjct: 361 RTLDMFSQGACIARYDHLYRGLGRSPARAV 390
>gi|194335539|ref|YP_002017333.1| group 1 glycosyl transferase [Pelodictyon phaeoclathratiforme BU-1]
gi|194308016|gb|ACF42716.1| glycosyl transferase group 1 [Pelodictyon phaeoclathratiforme BU-1]
Length = 364
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE S G+ D+ L D+ + S + +G P++++ AM PVI D +E
Sbjct: 236 LEDSFIFLGYADDIYPYLKGCDLFVLAS--LFEGMPNVVMEAMAMKKPVIATDVNGAREL 293
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNG-KLSKFAR 576
+ EG + +PE L+ A + I N KL++F R
Sbjct: 294 MDEGRTGLIVPPKDPEALASAITSIIDNPVKLAEFGR 330
>gi|260592243|ref|ZP_05857701.1| putative lipopolysaccharide biosynthesis protein [Prevotella
veroralis F0319]
gi|260535877|gb|EEX18494.1| putative lipopolysaccharide biosynthesis protein [Prevotella
veroralis F0319]
Length = 392
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 2/89 (2%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
VR G+ DV + AD++++ S + +GFP+++++A G+P I + E + EG
Sbjct: 274 VRFVGYQTDVRPFFIAADVLVFPSYR--EGFPNVVLQAGAMGLPSIVTNINGCNEIIKEG 331
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
F + + L + I N L K
Sbjct: 332 QNGKIFPSKDADTLFKEMKWCIENKDLIK 360
>gi|428307005|ref|YP_007143830.1| group 1 glycosyl transferase [Crinalium epipsammum PCC 9333]
gi|428248540|gb|AFZ14320.1| glycosyl transferase group 1 [Crinalium epipsammum PCC 9333]
Length = 378
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 64/149 (42%), Gaps = 8/149 (5%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
L G D N L + ++L + E + G DV+ + DI SS +GFP+
Sbjct: 232 LLGGKEVDSTNHVLNQQIAQLEIGEQ-IHLLGERQDVHHLTAALDIA--SSSSYSEGFPN 288
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
++ AM+ G+P D V +V+ +NPE L+ A+ I G +
Sbjct: 289 VVGEAMSCGVPCAVTDVGDSAWIVGNTGRVV--PPNNPEALANAWQELIEMGAEGR--SK 344
Query: 578 VASAGRLHAKNMLALD-CVTRYARILENV 605
+ SA R +LD VT+Y + E +
Sbjct: 345 LGSAARSRIIECFSLDSVVTQYETLYETM 373
>gi|125973872|ref|YP_001037782.1| group 1 glycosyl transferase [Clostridium thermocellum ATCC 27405]
gi|125714097|gb|ABN52589.1| glycosyl transferase group 1 [Clostridium thermocellum ATCC 27405]
Length = 364
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 18/99 (18%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
G ND +Q + ++ +++H +AD L+ S +GFP+ + AM
Sbjct: 232 GLNDRIQLMGTKTNVMKH----------------IADAQLFVMSSNFEGFPNALAEAMAS 275
Query: 526 GIPVITPDFP--IIKEYVAEGAQVIFFQKDNPEGLSRAF 562
G+PVI+ +FP + KE + +G +N E ++ A
Sbjct: 276 GLPVISTNFPSGVAKELIIDGENGYVVDINNREQMADAM 314
>gi|390933715|ref|YP_006391220.1| group 1 glycosyl transferase [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
gi|389569216|gb|AFK85621.1| glycosyl transferase group 1 [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
Length = 378
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 129/329 (39%), Gaps = 33/329 (10%)
Query: 248 LQKLGYV-FKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFDGIIADSLEAKEAI 306
L+KLG V + I RS S S + + I I+ +++YSL+ G L AK
Sbjct: 49 LKKLGIVCYDIDFERSPYSFSNLKALNQLIKIMREDKYSLVHVHTPVGAFLGRLAAKITG 108
Query: 307 SSLMQEPFHSIPLVWIIQEDSLANRLPVYVERGFQNLLSYWKSVFSRVNVIVFPDYTLPM 366
+ + H + S+ N L Y + L ++W +N DY
Sbjct: 109 TKPVIYTAHGFHFY---KGASIKNWLVYYT---MERLAAHWTDGLITMND---EDYNAAK 159
Query: 367 LYSVLDAGNFFVIPGSPADVWAVEAYSKSHE--KYQLRKENGFLKDEIVVVVVGSSFFYN 424
+ + F + G D+ + Y+ S E + Q RK GF + +IV++ V
Sbjct: 160 RFKLRKNNAVFYVHGVGLDI---DKYNMSDENKRKQTRKAFGFDEKDIVILTVA------ 210
Query: 425 ELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHS 484
EL+ A + +L + S++ + + G Y L++ GL
Sbjct: 211 ELN-----ANKNHKQILDSLIKLKSLDNVYYLIVGIGE----YECYLKKYVKENGL-SGK 260
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
V+ GF D+ +L ++DI + + + +G P I+ AM G PV+ D ++ V +G
Sbjct: 261 VKLLGFRHDIPDLLNISDI--FALTSMREGLPRCIMEAMAAGKPVVATDVRGNRDLVRDG 318
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
D+ A N L K
Sbjct: 319 VNGYLVPLDDVNATVDALQNLSENEDLRK 347
>gi|159899874|ref|YP_001546121.1| group 1 glycosyl transferase [Herpetosiphon aurantiacus DSM 785]
gi|159892913|gb|ABX05993.1| glycosyl transferase group 1 [Herpetosiphon aurantiacus DSM 785]
Length = 388
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
+L G T N +++ + L + EH+V+ G GD+ VL D+++ S+ +GF +
Sbjct: 241 WLAGEGTQEAN--FRQLTANLAI-EHAVKFLGPRGDIPEVLSQVDVLV--STSRWEGFAT 295
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
+I+ AM PVI D E + +G +NP ++ A
Sbjct: 296 VILEAMAARTPVIATDIGGNNEQIVDGENGRLVASENPSAVADAM 340
>gi|423558756|ref|ZP_17535058.1| hypothetical protein II3_03960 [Bacillus cereus MC67]
gi|401190525|gb|EJQ97566.1| hypothetical protein II3_03960 [Bacillus cereus MC67]
Length = 385
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++A++LG+ + V G+ D+ +L M D+ + +S +G P I+ AM G+P
Sbjct: 242 EECKKIATQLGV-SNMVHFLGYRNDIASLLQMCDLAV--ASSYREGLPVNIMEAMACGLP 298
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
V+ D +E + +D+ + +S + N KL
Sbjct: 299 VVATDNRGHRELIISNKNGWIIDRDDIKTMSERINFISKNTKLQ 342
>gi|433776255|ref|YP_007306722.1| glycosyltransferase [Mesorhizobium australicum WSM2073]
gi|433668270|gb|AGB47346.1| glycosyltransferase [Mesorhizobium australicum WSM2073]
Length = 374
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 54/122 (44%), Gaps = 8/122 (6%)
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
R G L +++ GF D L ++ + + +GF IV A +FG PV+T +
Sbjct: 242 RDGALGTNIKFTGFVDDATLATLYSNCLFLAMPSLYEGFGFPIVEAQSFGKPVLTSNTSS 301
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK--FARTVASAGRL----HAKNML 590
+ E G + ++P ++ AFS ++ + A +AGR HAK ML
Sbjct: 302 MPE--VAGNNAVLVDPEDPAAIAAAFSRLCTDAEFRNGIAADAKKNAGRFTWENHAKGML 359
Query: 591 AL 592
+
Sbjct: 360 QI 361
>gi|256004327|ref|ZP_05429309.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 2360]
gi|385778250|ref|YP_005687415.1| group 1 glycosyl transferase [Clostridium thermocellum DSM 1313]
gi|419722144|ref|ZP_14249292.1| glycosyl transferase group 1 [Clostridium thermocellum AD2]
gi|419724241|ref|ZP_14251309.1| glycosyl transferase group 1 [Clostridium thermocellum YS]
gi|255991761|gb|EEU01861.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 2360]
gi|316939930|gb|ADU73964.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 1313]
gi|380772247|gb|EIC06099.1| glycosyl transferase group 1 [Clostridium thermocellum YS]
gi|380781715|gb|EIC11365.1| glycosyl transferase group 1 [Clostridium thermocellum AD2]
Length = 381
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 18/99 (18%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
G ND +Q + ++ +++H +AD L+ S +GFP+ + AM
Sbjct: 249 GLNDRIQLMGTKTNVMKH----------------IADAQLFVMSSNFEGFPNALAEAMAS 292
Query: 526 GIPVITPDFP--IIKEYVAEGAQVIFFQKDNPEGLSRAF 562
G+PVI+ +FP + KE + +G +N E ++ A
Sbjct: 293 GLPVISTNFPSGVAKELIIDGENGYVVDINNREQMADAM 331
>gi|302345621|ref|YP_003813974.1| glycosyltransferase, group 1 family protein [Prevotella
melaninogenica ATCC 25845]
gi|302149561|gb|ADK95823.1| glycosyltransferase, group 1 family protein [Prevotella
melaninogenica ATCC 25845]
Length = 384
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/153 (21%), Positives = 67/153 (43%), Gaps = 15/153 (9%)
Query: 393 SKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEG 452
S+ + +R E GF ++ + V +G ++D G + A R +
Sbjct: 179 SEKEARSLVRSELGFSNNDFIFVFIGR-------------LVNDKGLGELADAIRKLEDE 225
Query: 453 SFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVE 512
F+ L DG +DAL + + +V++ G D+ L+ +D++++ S +
Sbjct: 226 KFEIKLLLIGEIDGEDDALAKDKLNYLMQSKNVKYIGVQSDIRPYLMASDVLVFPSYR-- 283
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
+GFP++ + A G+P I + E + +G
Sbjct: 284 EGFPNVPLEAGALGLPAIVTNINGSNEIIEDGV 316
>gi|335037444|ref|ZP_08530750.1| glycosyltransferase [Agrobacterium sp. ATCC 31749]
gi|333791109|gb|EGL62500.1| glycosyltransferase [Agrobacterium sp. ATCC 31749]
Length = 382
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
Query: 465 DGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
+ Y ++E ASRLGL + VR GF DV ++ D+V + +S V + F ++V AM
Sbjct: 245 EAYEARIREQASRLGL-DDRVRFLGFRSDVPELMASMDVVAH-TSIVAEPFGRVVVEAMM 302
Query: 525 FGIPVITPDFPIIKEYVAEG 544
G PV+ + E + +G
Sbjct: 303 CGRPVVATRGGGVTEIIRDG 322
>gi|392406934|ref|YP_006443542.1| glycosyltransferase [Anaerobaculum mobile DSM 13181]
gi|390620070|gb|AFM21217.1| glycosyltransferase [Anaerobaculum mobile DSM 13181]
Length = 354
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 40/71 (56%), Gaps = 3/71 (4%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+++ +LGL H V+ +GF D + + +L+ S +E+G ++RA+ G+PV+
Sbjct: 218 LEDLVEKLGL-SHKVKFWGFREDADEWIARCSCLLFPS--LEEGMGMTLMRAIQMGVPVL 274
Query: 531 TPDFPIIKEYV 541
+ P ++E
Sbjct: 275 ASNLPPVRELC 285
>gi|375148544|ref|YP_005010985.1| group 1 glycosyl transferase [Niastella koreensis GR20-10]
gi|361062590|gb|AEW01582.1| glycosyl transferase group 1 [Niastella koreensis GR20-10]
Length = 399
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 81/177 (45%), Gaps = 16/177 (9%)
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNST-----DG-YNDALQEVASRLGLLEHSVRHYG 489
+VG L+ +R+ ++ F+ C ++ +G ++L + LGL +H+V G
Sbjct: 225 NVGRLVPVKGQRDLIQAFAAFLPACPDAELHIAGEGPARESLTALIDELGL-QHNVELLG 283
Query: 490 FNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIF 549
+ D+ ++ D ++ S +GF +V AM G+PV+ D + KE + F
Sbjct: 284 -SQDIPRIINGYDCFVFPSHS--EGFSGAVVEAMFAGLPVLASDIAVNKEVITHLETGYF 340
Query: 550 FQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVL 606
F+K + E + +A + N + A T A HA+ LD + A LEN L
Sbjct: 341 FEKGSVESIKQALLWYKDNVAV---ANTFAVKANEHARQHFELD---KIAGKLENYL 391
>gi|428201878|ref|YP_007080467.1| glycosyltransferase [Pleurocapsa sp. PCC 7327]
gi|427979310|gb|AFY76910.1| glycosyltransferase [Pleurocapsa sp. PCC 7327]
Length = 424
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 75/164 (45%), Gaps = 15/164 (9%)
Query: 470 ALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGI 527
AL+ + ++LGL E V+ G+ +V L D+ + S +G P +++ AM G+
Sbjct: 266 ALERMTAQLGLQER-VKFVGYKSQAEVRQYLQQTDVFVLPS--FAEGVPVVLMEAMAAGV 322
Query: 528 PVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SKFARTVASAGRLHA 586
PV+ E V +G + L++ + +++ +L +KF ++GRL
Sbjct: 323 PVVATQIAGTSELVEDGIGGYLVPPGDAVSLAQRIEVLLTDSQLRAKF----GASGRLKV 378
Query: 587 KNMLALDC-VTRYARILENVLNFPSDALLP----GPISQLQQVS 625
+ LD V R R++ + L +A+ P I LQ+ S
Sbjct: 379 EKDFNLDKEVDRLYRVMTSALQGKVEAIRPNTSESAIEDLQKAS 422
>gi|154252367|ref|YP_001413191.1| group 1 glycosyl transferase [Parvibaculum lavamentivorans DS-1]
gi|154156317|gb|ABS63534.1| glycosyl transferase group 1 [Parvibaculum lavamentivorans DS-1]
Length = 407
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 3/102 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L VR GF DV VL DI ++ S + +G P IV+ G P++ P I+
Sbjct: 259 LAARVRILGFRNDVESVLAACDICVFTSQR--EGLPRSIVQYGLAGKPIVVTRLPGIEAV 316
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SKFARTVASA 581
V G D+ G + + + N L ++F++ A A
Sbjct: 317 VRHGENGYVVASDDFSGFASSIEKLLENPDLRARFSQASADA 358
>gi|392392421|ref|YP_006429023.1| glycosyltransferase, MSMEG_0565 family [Desulfitobacterium
dehalogenans ATCC 51507]
gi|390523499|gb|AFL99229.1| glycosyltransferase, MSMEG_0565 family [Desulfitobacterium
dehalogenans ATCC 51507]
Length = 390
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 13/143 (9%)
Query: 464 TDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAM 523
+GYN + E LG+L+ +V + AD + S V++G+ + + A+
Sbjct: 254 VEGYNLKMGEDVINLGVLKEE--------EVPLIYQAADCFAFPS--VQEGWGLVALEAL 303
Query: 524 TFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G P + D P+ E++ G +F DN E + A I + L R + S G
Sbjct: 304 LSGTPTVVSDIPVFHEFLTPGEDALFVNPDNAEQFAAAMIETIMDTNLR---RRLISGGL 360
Query: 584 LHAKNMLALDCVTRYARILENVL 606
+ A N D ++ + +L
Sbjct: 361 MTASNFSWEDTARKHEQFYYEIL 383
>gi|395490399|ref|ZP_10421978.1| group 1 glycosyl transferase [Sphingomonas sp. PAMC 26617]
Length = 399
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 2/92 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+++A+RLG+ + V GF D+ ++ D+VL+ S++ E F +IV M G+PVI
Sbjct: 256 LRDLAARLGIADR-VHFCGFRQDIARLMAAMDLVLHCSTEPEP-FGRVIVEGMMAGVPVI 313
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
E +A + +P L+ A
Sbjct: 314 ATHGGGASEIIAGSGAGLLIAPGDPHALADAI 345
>gi|344342103|ref|ZP_08773011.1| glycosyl transferase group 1 [Thiocapsa marina 5811]
gi|343798013|gb|EGV15979.1| glycosyl transferase group 1 [Thiocapsa marina 5811]
Length = 368
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 467 YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIV----LYGSSQVEQGFPSLIVRA 522
Y ++ +A RLG+ + ++ GF DV L D+ LYG +G P +I+ A
Sbjct: 230 YESVIKGLARRLGVSD-AIDWTGFTRDVESQLAQMDLFVLPSLYG-----EGMPMVILEA 283
Query: 523 MTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
M G+PV+ D + E + G + ++P L++ +
Sbjct: 284 MAAGVPVVASDVEGVSEVLEHGRSGLVVPPNDPHTLAKTIA 324
>gi|300113359|ref|YP_003759934.1| group 1 glycosyl transferase [Nitrosococcus watsonii C-113]
gi|299539296|gb|ADJ27613.1| glycosyl transferase group 1 [Nitrosococcus watsonii C-113]
Length = 378
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 37/188 (19%), Positives = 80/188 (42%), Gaps = 16/188 (8%)
Query: 424 NELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEH 483
+ + WD V ++ AR G L G++ G A+ + + +
Sbjct: 201 SRMLWDKGVGE------FVEAARILQTAGVKARFVLVGDTDPGNPAAVPQSMLKEWRDQG 254
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+V +G+ ++ +L A+I+ S + +G P +++ A G P++T D P +E V
Sbjct: 255 TVEWWGYCDNMPAILAGANIICLPSYR--EGLPKVLIEAAACGRPIVTTDMPGCREIVRH 312
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL-----HAKNMLALDCVTRY 598
G + + + L++A I + S+ + + GR H+ +++ + + Y
Sbjct: 313 GKNGLLVSVRDSKELAQALRTLIKD---SEMRQRMGQEGRALVVAEHSVDLINMQTINLY 369
Query: 599 ARILENVL 606
++L L
Sbjct: 370 EKLLPKSL 377
>gi|423488813|ref|ZP_17465495.1| hypothetical protein IEU_03436 [Bacillus cereus BtB2-4]
gi|423494538|ref|ZP_17471182.1| hypothetical protein IEW_03436 [Bacillus cereus CER057]
gi|423498672|ref|ZP_17475289.1| hypothetical protein IEY_01899 [Bacillus cereus CER074]
gi|401152152|gb|EJQ59593.1| hypothetical protein IEW_03436 [Bacillus cereus CER057]
gi|401158754|gb|EJQ66143.1| hypothetical protein IEY_01899 [Bacillus cereus CER074]
gi|402433820|gb|EJV65870.1| hypothetical protein IEU_03436 [Bacillus cereus BtB2-4]
Length = 385
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/117 (23%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++ ++LG+ + V G+ D+ +L M+D+ + +S +G P I+ AM GIP
Sbjct: 242 EECKKITTQLGV-SNMVHFLGYRNDIASLLQMSDLAV--ASSYREGLPVNIMEAMACGIP 298
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS----KFARTVASA 581
VI D +E + +D+ + +S + N KL + RT+ ++
Sbjct: 299 VIATDNRGHRELIISNKNGWIIDRDDIKTMSERINTISKNTKLQAQFGQCGRTIITS 355
>gi|374605217|ref|ZP_09678155.1| group 1 glycosyl transferase [Paenibacillus dendritiformis C454]
gi|374389196|gb|EHQ60580.1| group 1 glycosyl transferase [Paenibacillus dendritiformis C454]
Length = 386
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 3/102 (2%)
Query: 472 QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
+ V+++LG+ H V G+ D+ +L M+DI + GS + +G P ++ AM G+PVI
Sbjct: 244 KSVSAKLGVDRH-VHFLGYRNDIPNLLHMSDIAVSGS--LREGLPVQMMEAMACGLPVIA 300
Query: 532 PDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
E V +G PE + + ++ L +
Sbjct: 301 TRNRGHLELVCDGRNGYIVPPQAPEQFAERIATLAASADLRE 342
>gi|256752417|ref|ZP_05493276.1| glycosyl transferase group 1 [Thermoanaerobacter ethanolicus CCSD1]
gi|256748686|gb|EEU61731.1| glycosyl transferase group 1 [Thermoanaerobacter ethanolicus CCSD1]
Length = 376
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 3/94 (3%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ ++ +A LG+ H V G DV ++ AD + SS +G P +++ A G+P
Sbjct: 236 EKIKHLAGELGI-THQVYFLGVRKDVPELMNAADAYVMSSSW--EGMPLVLLEASAVGLP 292
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
++ D KE V +G NPE L++A
Sbjct: 293 IVATDVGGNKEIVVDGESGFLVLSRNPESLAQAM 326
>gi|448681495|ref|ZP_21691586.1| LPS biosynthesis RfbU related protein [Haloarcula argentinensis DSM
12282]
gi|445767365|gb|EMA18468.1| LPS biosynthesis RfbU related protein [Haloarcula argentinensis DSM
12282]
Length = 386
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+AL+ ASR G+ + V H G + +++ ++ + +G P +++ AM G P
Sbjct: 246 NALEARASRQGVGDQVVFH-GRIPHEQVLEIVSQSRVFALNSTYEGLPHVVLEAMACGTP 304
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVA 579
V+ E + +G Q+D+ E S+ F +SN L RT A
Sbjct: 305 VVASAVCGTPEAITDGEDGFLVQQDDIESFSKRFEQLLSNPSLRNRLRTQA 355
>gi|50120375|ref|YP_049542.1| glycosyl transferase family protein [Pectobacterium atrosepticum
SCRI1043]
gi|49610901|emb|CAG74346.1| putative glycosyl transferase [Pectobacterium atrosepticum
SCRI1043]
Length = 369
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 64/131 (48%), Gaps = 13/131 (9%)
Query: 417 VGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVAS 476
+ ++F SW + A H + Y + N + V +CG + Y D L+ + +
Sbjct: 193 IKNTFLMVGASWKHKNA-HSFLKNFLAYHKDN-----YSAVIVCGQT--DYFDELKSLVA 244
Query: 477 RLGLLEHSV--RHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDF 534
LGL E++V +H+ +++ + + ++Y S + +GF + AM+ +PVI D
Sbjct: 245 TLGL-ENNVTFKHHISEQELSSLYKTSFALVYPS--LSEGFGIPPIEAMSLSLPVIVSDI 301
Query: 535 PIIKEYVAEGA 545
P+ KE + E A
Sbjct: 302 PVFKEILGEAA 312
>gi|374601963|ref|ZP_09674959.1| group 1 glycosyl transferase [Paenibacillus dendritiformis C454]
gi|374392405|gb|EHQ63731.1| group 1 glycosyl transferase [Paenibacillus dendritiformis C454]
Length = 387
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 53/110 (48%), Gaps = 3/110 (2%)
Query: 467 YNDALQEVASRLGL-LEHSVRHYGFNGDVNGVLLMADI-VLYGSSQVEQGFPSLIVRAMT 524
Y L + R GL L+ + GF D + + L+ L S E G + A+
Sbjct: 243 YIPTLVSLIHRKGLRLDEHIYPLGFVPDNDVIPLVRQAQALIMPSLSEGGGSYPVEEALK 302
Query: 525 FGIPVITPDFPIIKEYVA-EGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G+PV+ D P+++E+++ A+V++F ++P+ + RA N L K
Sbjct: 303 LGVPVLCSDIPVMREHLSRHSAEVVWFDPESPDAIVRALEQLFGNYALYK 352
>gi|448310094|ref|ZP_21499946.1| group 1 glycosyl transferase [Natronorubrum bangense JCM 10635]
gi|445588424|gb|ELY42667.1| group 1 glycosyl transferase [Natronorubrum bangense JCM 10635]
Length = 407
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 68/161 (42%), Gaps = 15/161 (9%)
Query: 457 VFLCGNSTDGYNDA-LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
V+L G DG L+ R GL E +V G+ DV ++ D+ + S + +G
Sbjct: 228 VYLVG---DGPQKTFLEREIDRRGLAE-TVSMLGYRSDVPAIMAACDVFVLPSYR--EGT 281
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
P +I AM G+PV+ D I E VA+G + + L+ +++ + +
Sbjct: 282 PRVITEAMASGLPVVATDIAGIPEQVADGESGYLIEPGDDAALANRLEALLASREQREAL 341
Query: 576 RTVA--SAGRLHAKNMLA-LDCVTRYARILENVLNFPSDAL 613
V R MLA LD V R LE P+D +
Sbjct: 342 GAVGRKRVERFSDTQMLADLDAVYREVLGLE-----PADTM 377
>gi|88603619|ref|YP_503797.1| glycosyl transferase, group 1 [Methanospirillum hungatei JF-1]
gi|88189081|gb|ABD42078.1| glycosyl transferase, group 1 [Methanospirillum hungatei JF-1]
Length = 418
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 45/93 (48%), Gaps = 2/93 (2%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G P I+ A +FGIPVI E V E ++ + +PE ++ A FI N +++
Sbjct: 328 EGIPVSIMEAQSFGIPVIATAVGGTPEIVTEDVGILLSENPSPEEIANAIEFFIDNPEIT 387
Query: 573 KFARTVASAGRLHAKNMLALDCVTRYARILENV 605
K R + KN A+ T +A+ L+ +
Sbjct: 388 KRMRLKSIEN--WEKNFNAVKNFTEFAKTLKMI 418
>gi|406954756|gb|EKD83497.1| glycosyl transferase group 1 [uncultured bacterium]
Length = 378
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 49/101 (48%), Gaps = 11/101 (10%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL-LMADIVLYGSSQVEQGF 515
V +CG D N L+++A G + + F G +N V M L+ S + +G
Sbjct: 225 VVICGEGADRKN--LEQIAIDSGFADR----FHFTGYINNVWQYMKAAKLFCSISLVEGQ 278
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
P+ ++ AM G PV+ D P +E++ A + +F NPE
Sbjct: 279 PNTVLEAMANGCPVVVSDIPAHREFI--NADMAYFV--NPE 315
>gi|307942482|ref|ZP_07657832.1| glycosyl transferase, group 1 family [Roseibium sp. TrichSKD4]
gi|307774320|gb|EFO33531.1| glycosyl transferase, group 1 family [Roseibium sp. TrichSKD4]
Length = 388
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 62/151 (41%), Gaps = 8/151 (5%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
LC G LQ A +LG+ + V G DV L +ADI + S E+GF +
Sbjct: 234 LLCIGGDSGILGELQAHAEKLGISDRIV-WLGRRADVERFLKIADIGVLASH--EEGFSN 290
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
I+ M +PVI E + +G I NP L+ A + R+
Sbjct: 291 SILEYMAAQLPVIVTRVGGAAEAIKDGETGILVPPHNPPALAEAIVRLAD----PEIRRS 346
Query: 578 VASAGRLHAKNMLALD-CVTRYARILENVLN 607
+ AGR + + D C+ Y + ++++
Sbjct: 347 MGKAGREQLISKFSYDKCIENYETLYRSLMS 377
>gi|398349462|ref|ZP_10534165.1| glycosyl transferase [Leptospira broomii str. 5399]
Length = 393
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 73/173 (42%), Gaps = 18/173 (10%)
Query: 401 LRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLC 460
LRKE KDE+V+ + A+ H L+ E FK VF+
Sbjct: 192 LRKEFHLQKDELVIGNIA------------ALVDHKDQRTLLNAIALIETEQKFK-VFIV 238
Query: 461 GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
G L+++A + GLL+ V GF D+ L + DI S E+G + I+
Sbjct: 239 GEG--ALRKELEDLARQKGLLDRVV-FTGFREDIPEFLSLFDIFTLTSK--EEGLGTSIL 293
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
AM G+P++ + I E + Q + E L++++ + + +L K
Sbjct: 294 DAMAAGLPIVATNGGGISEMLTPEKGAFVAQVGDAEFLAKSYKTLLEDSRLRK 346
>gi|413958668|ref|ZP_11397907.1| putative glycosyl transferase [Burkholderia sp. SJ98]
gi|413941248|gb|EKS73208.1| putative glycosyl transferase [Burkholderia sp. SJ98]
Length = 379
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
EH V + G DV ++ AD V+ S + +G P ++ A G P++ D P +E V
Sbjct: 254 EHIVEYLGEANDVRPLVAAADCVVLPSYR--EGVPRTLMEASAMGRPIVATDVPGCREVV 311
Query: 542 AEGAQVIFFQKDNPEGLSRAFS--LFISNGKLSKFAR 576
+G + + + + L+ A L +S + + AR
Sbjct: 312 EDGVNGLLCEVKSADSLTEALERMLTLSADERAAIAR 348
>gi|423599000|ref|ZP_17575000.1| hypothetical protein III_01802 [Bacillus cereus VD078]
gi|401235984|gb|EJR42450.1| hypothetical protein III_01802 [Bacillus cereus VD078]
Length = 385
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/117 (23%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++ ++LG+ + V G+ D+ +L M+D+ + +S +G P I+ AM GIP
Sbjct: 242 EECKKITTQLGV-SNMVHFLGYRNDIASLLQMSDLAV--ASSYREGLPVNIMEAMACGIP 298
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS----KFARTVASA 581
VI D +E + +D+ + +S + N KL + RT+ ++
Sbjct: 299 VIATDNRGHRELIISNKNGWIIDRDDIKTMSERINTISKNTKLQAQFGQCGRTIITS 355
>gi|448455860|ref|ZP_21594792.1| Glycosyl transferase, group 1 [Halorubrum lipolyticum DSM 21995]
gi|445813079|gb|EMA63061.1| Glycosyl transferase, group 1 [Halorubrum lipolyticum DSM 21995]
Length = 382
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/85 (25%), Positives = 41/85 (48%), Gaps = 2/85 (2%)
Query: 489 GFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVI 548
GF D+ VL +D+++ S + +G P +I AM G+PV+ D I E + +
Sbjct: 259 GFRDDIPCVLAASDVLVLPSFR--EGTPRVITEAMASGLPVVATDIAGIPEQIEDSENGY 316
Query: 549 FFQKDNPEGLSRAFSLFISNGKLSK 573
Q + E L+ +++ +L +
Sbjct: 317 LIQTGDSEALANRLETLLADAELRE 341
>gi|393766484|ref|ZP_10355040.1| group 1 glycosyl transferase [Methylobacterium sp. GXF4]
gi|392728265|gb|EIZ85574.1| group 1 glycosyl transferase [Methylobacterium sp. GXF4]
Length = 408
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 57/144 (39%), Gaps = 13/144 (9%)
Query: 493 DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQK 552
D+ G+ +AD +++ S G ++ AM G PVI P EY+A +V+ +
Sbjct: 277 DMPGLYRIADTLVFPSLSEGNGL--CVLEAMACGTPVIVSARPPFTEYLAP-EEVLAVEP 333
Query: 553 DNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDA 612
D P+ ++ A + + + R HA A + YA ++
Sbjct: 334 DEPDAIADAMQASLDPSRRVRLRVAGFEVARAHAWEACAARHLPTYASLVPG-------- 385
Query: 613 LLPGPISQLQQVSWEWNLFRKEID 636
P P++ + S WN R D
Sbjct: 386 --PAPVAAIATGSGAWNTTRIPGD 407
>gi|339321356|ref|YP_004680250.1| glycosyltransferase [Cupriavidus necator N-1]
gi|338167964|gb|AEI79018.1| glycosyltransferase [Cupriavidus necator N-1]
Length = 349
Score = 41.6 bits (96), Expect = 2.8, Method: Composition-based stats.
Identities = 24/91 (26%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+GF +++ AM +G+PV+ + +K + G D+P L+R + I++ +L
Sbjct: 247 EGFSLVLLEAMAYGLPVVATNCETGVKAMLTHGENAQLSPVDDPAALARGIAAVITSPEL 306
Query: 572 SKFARTVASAGRLHAKNMLALDCVTRYARIL 602
A ++ AGR HA+ + T + R+L
Sbjct: 307 ---AARLSEAGRRHAQRFTPGEIGTHWERLL 334
>gi|434406570|ref|YP_007149455.1| glycosyltransferase [Cylindrospermum stagnale PCC 7417]
gi|428260825|gb|AFZ26775.1| glycosyltransferase [Cylindrospermum stagnale PCC 7417]
Length = 383
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 57/130 (43%), Gaps = 9/130 (6%)
Query: 458 FLCGNSTDGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
L G++ G D +QE+ + LGL E+ V+ GF D+ ++ D+V + S+ E
Sbjct: 232 ILVGDALFGEQDYVQELHQQVAELGL-ENRVKFLGFRSDIPQLMAACDLVAHTSTSPEP- 289
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GK 570
F +IV M G PV+ E V G P+ L+ S + +
Sbjct: 290 FGRVIVEGMLCGKPVVAAKAGGATELVEPGINGFLVTPGEPQELANIISFCVQEREQIAR 349
Query: 571 LSKFARTVAS 580
++ ART AS
Sbjct: 350 IANNARTSAS 359
>gi|289580605|ref|YP_003479071.1| group 1 glycosyl transferase [Natrialba magadii ATCC 43099]
gi|448284272|ref|ZP_21475532.1| group 1 glycosyl transferase [Natrialba magadii ATCC 43099]
gi|289530158|gb|ADD04509.1| glycosyl transferase group 1 [Natrialba magadii ATCC 43099]
gi|445570607|gb|ELY25166.1| group 1 glycosyl transferase [Natrialba magadii ATCC 43099]
Length = 376
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 504 VLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
+L SQ +G P++I+ ++ G PV + + V EG ++ +PE ++ S
Sbjct: 278 LLILPSQPTEGLPTVILESLACGTPVYATPVSGVPDVVKEGKTGYLMRESDPETIASRVS 337
Query: 564 LFISNGKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENV 605
+ KL+ TV+ GR + + + V RY RILE++
Sbjct: 338 QIQRDEKLA----TVSENGRTLIEQKFSFEGAVERYRRILESL 376
>gi|134097289|ref|YP_001102950.1| group 1 glycosyl transferase [Saccharopolyspora erythraea NRRL
2338]
gi|291009818|ref|ZP_06567791.1| glycosyl transferase, group 1 [Saccharopolyspora erythraea NRRL
2338]
gi|133909912|emb|CAM00024.1| glycosyl transferase, group 1 [Saccharopolyspora erythraea NRRL
2338]
Length = 378
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 8/115 (6%)
Query: 491 NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFF 550
+GD + +L A ++ S ++GF I+ AM G PV+ D I +E G FF
Sbjct: 270 DGDYHELLSRAFALVTASK--DEGFGLPIIEAMNVGTPVVCSDLTIFRE--VTGGHARFF 325
Query: 551 QKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENV 605
Q D+P+G + A + +G + A V SA R+ AK+ R ++ +V
Sbjct: 326 QPDSPDGFAAAID-GLEDGAVR--AELVESA-RVQAKHFTWASSAERLLEVMRSV 376
>gi|402816170|ref|ZP_10865761.1| glycosyl transferase group 1 [Paenibacillus alvei DSM 29]
gi|402506074|gb|EJW16598.1| glycosyl transferase group 1 [Paenibacillus alvei DSM 29]
Length = 391
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 52/105 (49%), Gaps = 3/105 (2%)
Query: 467 YNDALQEVASRLGL-LEHSVRHYGFNGDVNGVLLMADI-VLYGSSQVEQGFPSLIVRAMT 524
Y L + R GL ++ + GF D N + ++ L S E G + A+
Sbjct: 246 YVPTLISLMERSGLRVDADIYPLGFIQDSNVIPIIRKAQALIMPSMSEGGGSYPVEEALR 305
Query: 525 FGIPVITPDFPIIKEYVA-EGAQVIFFQKDNPEGLSRAFSLFISN 568
G+PV+ D P+++E++A A +++F ++P+ +++A L N
Sbjct: 306 LGVPVLCSDIPVMREHLARHSADILWFDPESPDSIAQALQLLFDN 350
>gi|386001787|ref|YP_005920086.1| group 1 glycosyl transferase [Methanosaeta harundinacea 6Ac]
gi|357209843|gb|AET64463.1| group 1 glycosyl transferase [Methanosaeta harundinacea 6Ac]
Length = 372
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 7/118 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGD--VNGVLLMADI-VLYGSSQVEQGFPSLIVRAMTF 525
D + ++RLGL E +V+ GF + V DI +L +S+ ++GF + + A+
Sbjct: 234 DHYRRASARLGL-EENVQFLGFVPEEMVAKHYNRCDIFILPSTSRSQEGFGMVALEALAC 292
Query: 526 GIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G PVI + E V + + ++PEGL+RA + + +G AR + +AGR
Sbjct: 293 GRPVICTRIVGVAEDVERVGAGLVVEPNDPEGLARAIARLLQSG---PEARRMGAAGR 347
>gi|357502221|ref|XP_003621399.1| hypothetical protein MTR_7g012910 [Medicago truncatula]
gi|355496414|gb|AES77617.1| hypothetical protein MTR_7g012910 [Medicago truncatula]
Length = 87
Score = 41.2 bits (95), Expect = 3.1, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 42/92 (45%), Gaps = 16/92 (17%)
Query: 434 MHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGD 493
MH + PLL YAR+ + K V + VAS L L + HYG +
Sbjct: 8 MHPIRPLLPNYARQYDL--VMKTVLV--------------VASCLELPRGFLWHYGLHDT 51
Query: 494 VNGVLLMADIVLYGSSQVEQGFPSLIVRAMTF 525
N VLLM DIVL+ Q FPSL + + +
Sbjct: 52 TNNVLLMLDIVLHDFVHDVQCFPSLYDKCLCY 83
>gi|339326925|ref|YP_004686618.1| glycosyltransferase EpsD [Cupriavidus necator N-1]
gi|338167082|gb|AEI78137.1| glycosyltransferase EpsD [Cupriavidus necator N-1]
Length = 391
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ L G+ + +A+Q++A + V G D+ G+L A S +GFP
Sbjct: 239 LVLAGDGPE--REAMQQLAQ--AIAPGRVTFPGSVTDIAGLLATAQAFTLASDH--EGFP 292
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
++ AM G+P+I D P I+E + +G + + L+ AF
Sbjct: 293 LSVLEAMRAGLPIIASDLPGIREQLEDGKHGLLVNHHDESALAAAF 338
>gi|423306720|ref|ZP_17284719.1| hypothetical protein HMPREF1072_03659 [Bacteroides uniformis
CL03T00C23]
gi|423308692|ref|ZP_17286682.1| hypothetical protein HMPREF1073_01432 [Bacteroides uniformis
CL03T12C37]
gi|392678094|gb|EIY71503.1| hypothetical protein HMPREF1072_03659 [Bacteroides uniformis
CL03T00C23]
gi|392686797|gb|EIY80098.1| hypothetical protein HMPREF1073_01432 [Bacteroides uniformis
CL03T12C37]
Length = 392
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 62/147 (42%), Gaps = 8/147 (5%)
Query: 452 GSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQV 511
G KF+ L G+ L E R L++ + G+ + + +DIV+ S +
Sbjct: 240 GKVKFI-LAGDCDKENLAVLGEEKLRSLLVDDYIEWIGYQAQMFPIYTDSDIVVLPSYR- 297
Query: 512 EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-NGK 570
+G P ++ A G P++T D P +E V G + L+ A L I + K
Sbjct: 298 -EGLPKSLIEACAVGRPIVTTDVPGCRECVKSGYNGYLVPAKDSRALADAIGLLIEDDAK 356
Query: 571 LSKFARTVASAGRLHAKNMLALDCVTR 597
+F R RL A+ ++D V R
Sbjct: 357 RKEFGRN----SRLLAEEEFSIDKVVR 379
>gi|258647957|ref|ZP_05735426.1| glycosyl transferase, group 1 family [Prevotella tannerae ATCC
51259]
gi|260851796|gb|EEX71665.1| glycosyl transferase, group 1 family [Prevotella tannerae ATCC
51259]
Length = 378
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 61/155 (39%), Gaps = 5/155 (3%)
Query: 453 SFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVE 512
FKF L G D A+ E + ++ + G DV ++ V+ S
Sbjct: 225 QFKFQLL-GPKGDNNPAAISEEQLEKDVTSGAIEYLGITDDVQSIVGRPGTVVVLPS-YH 282
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G ++ A G P+I D P +E V +G N L +A +I K
Sbjct: 283 EGLSRSLMEACAMGKPIIASDIPGCRETVEDGVNGFLVPPQNTTALIKAMQRYIDLSKEE 342
Query: 573 KFARTVASAGRLHAKNMLALDCVT-RYARILENVL 606
K +T + AGR A+ + V RY +IL V
Sbjct: 343 K--QTFSKAGRRIAEERFDVRNVAERYEQILAEVF 375
>gi|212694702|ref|ZP_03302830.1| hypothetical protein BACDOR_04234 [Bacteroides dorei DSM 17855]
gi|265750615|ref|ZP_06086678.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423239127|ref|ZP_17220243.1| hypothetical protein HMPREF1065_00866 [Bacteroides dorei
CL03T12C01]
gi|212662681|gb|EEB23255.1| glycosyltransferase, group 1 family protein [Bacteroides dorei DSM
17855]
gi|263237511|gb|EEZ22961.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392646914|gb|EIY40620.1| hypothetical protein HMPREF1065_00866 [Bacteroides dorei
CL03T12C01]
Length = 381
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+++ G+ D+ L AD ++ S + +GFP++I++A G+P I + E +
Sbjct: 258 AIKFMGWQEDIRPFLAAADAFVFPSYR--EGFPNVILQAGAMGLPCIVTNINGCNEIIEN 315
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G I + E L R F+S+ +L +ASA R
Sbjct: 316 GKNGIIIPPHDSEYLYRTMCGFLSSPRL---VEQLASAAR 352
>gi|423225769|ref|ZP_17212236.1| hypothetical protein HMPREF1062_04422 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392631354|gb|EIY25327.1| hypothetical protein HMPREF1062_04422 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 351
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLL-MADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
L+EV +L L E V G N D+ + + D L + +GF I M GIPV
Sbjct: 213 LKEVTRQLKL-EKQVNFLG-NCDIEWIYHHLCDYHLLVQPSISEGFGLTIAEGMAAGIPV 270
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ D P E + +G Q +F+ ++ + L R F I +
Sbjct: 271 LVSDLPAPLEIIDKGKQGYYFRHNDVDDLCRMFRYIIKH 309
>gi|148655004|ref|YP_001275209.1| group 1 glycosyl transferase [Roseiflexus sp. RS-1]
gi|148567114|gb|ABQ89259.1| glycosyl transferase, group 1 [Roseiflexus sp. RS-1]
Length = 405
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 65/155 (41%), Gaps = 13/155 (8%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNG----VLLMADIVLYGSSQV 511
FVF+ N + L A+ G+ R+ F G ++ LL A VL S+
Sbjct: 257 FVFVTHNQAQRAD--LMRRAAAYGI----ERNLHFPGTISEEEKLALLRASDVLPFPSRY 310
Query: 512 EQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
E GF ++ M G+PVI+ D P++ E V G + D+ + L+RA + + L
Sbjct: 311 E-GFGLPLLEGMAAGVPVISTDIPVVNEIVVHGENGLLIPYDDTDALARAILSVLDDQNL 369
Query: 572 SKFARTVASAGRLHAKNMLALDCVTRYARILENVL 606
R +A R + V + E+VL
Sbjct: 370 RN--RLIAGGQRALTERFAPERLVRHIIDVYEDVL 402
>gi|383785474|ref|YP_005470044.1| glycosyl transferase, group 1 [Leptospirillum ferrooxidans C2-3]
gi|383084387|dbj|BAM07914.1| putative glycosyl transferase, group 1 [Leptospirillum ferrooxidans
C2-3]
Length = 374
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 3/103 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ A RL L + SVR G DV +L D+ L SS+ E F + I+ AM G+PVI
Sbjct: 242 LEGEAKRLSLGD-SVRFLGVRPDVERILPAFDVFLL-SSKTES-FSNAILEAMAAGLPVI 298
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
E V EG + NPE +++A N L +
Sbjct: 299 ATRVGGNPECVKEGETGLLVPSGNPEEMAKAMLTLARNPDLRR 341
>gi|423227993|ref|ZP_17214399.1| hypothetical protein HMPREF1063_00219 [Bacteroides dorei
CL02T00C15]
gi|423243253|ref|ZP_17224329.1| hypothetical protein HMPREF1064_00535 [Bacteroides dorei
CL02T12C06]
gi|392637740|gb|EIY31606.1| hypothetical protein HMPREF1063_00219 [Bacteroides dorei
CL02T00C15]
gi|392646128|gb|EIY39847.1| hypothetical protein HMPREF1064_00535 [Bacteroides dorei
CL02T12C06]
Length = 396
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+++ G+ D+ L AD ++ S + +GFP++I++A G+P I + E +
Sbjct: 273 AIKFMGWQEDIRPFLAAADAFVFPSYR--EGFPNVILQAGAMGLPCIVTNINGCNEIIEN 330
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G I + E L R F+S+ +L +ASA R
Sbjct: 331 GKNGIIIPPHDSEYLYRTMCGFLSSPRL---VEQLASAAR 367
>gi|229917099|ref|YP_002885745.1| group 1 glycosyl transferase [Exiguobacterium sp. AT1b]
gi|229468528|gb|ACQ70300.1| glycosyl transferase group 1 [Exiguobacterium sp. AT1b]
Length = 376
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 37/83 (44%), Gaps = 3/83 (3%)
Query: 487 HY-GFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGA 545
HY GF D + D+ L S + +G P I+ AM G+PVI D +E V +G
Sbjct: 255 HYAGFVRDTPSLFAAHDVFLLPSYR--EGLPRSILEAMASGLPVIATDIRGCREEVTDGE 312
Query: 546 QVIFFQKDNPEGLSRAFSLFISN 568
PE L+ A FI +
Sbjct: 313 TGFLVPVGRPEALTLAMRWFIEH 335
>gi|94311659|ref|YP_584869.1| putative glycosyl transferase [Cupriavidus metallidurans CH34]
gi|93355511|gb|ABF09600.1| putative glycosyl transferase, group 1 [Cupriavidus metallidurans
CH34]
Length = 403
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 43/76 (56%), Gaps = 7/76 (9%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ L+ +AS + + H GF D+ L++AD+++ S + +GFP++I++A + +P
Sbjct: 272 NTLEMIAS-----DQRIHHLGFMDDIRPALMLADVLVLPSYR--EGFPNVILQAGSMELP 324
Query: 529 VITPDFPIIKEYVAEG 544
VI D E + +G
Sbjct: 325 VIATDINGCNEVIEDG 340
>gi|237711019|ref|ZP_04541500.1| sugar transferase [Bacteroides sp. 9_1_42FAA]
gi|229454863|gb|EEO60584.1| sugar transferase [Bacteroides sp. 9_1_42FAA]
Length = 440
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+++ G+ D+ L AD ++ S + +GFP++I++A G+P I + E +
Sbjct: 317 AIKFMGWQEDIRPFLAAADAFVFPSYR--EGFPNVILQAGAMGLPCIVTNINGCNEIIEN 374
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G I + E L R F+S+ +L +ASA R
Sbjct: 375 GKNGIIIPPHDSEYLYRTMCGFLSSPRL---VEQLASAAR 411
>gi|345516049|ref|ZP_08795544.1| sugar transferase [Bacteroides dorei 5_1_36/D4]
gi|229434176|gb|EEO44253.1| sugar transferase [Bacteroides dorei 5_1_36/D4]
Length = 440
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+++ G+ D+ L AD ++ S + +GFP++I++A G+P I + E +
Sbjct: 317 AIKFMGWQEDIRPFLAAADAFVFPSYR--EGFPNVILQAGAMGLPCIVTNINGCNEIIEN 374
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
G I + E L R F+S+ +L +ASA R
Sbjct: 375 GKNGIIIPPHDSEYLYRTMCGFLSSPRL---VEQLASAAR 411
>gi|283798121|ref|ZP_06347274.1| YveN, capsular polysaccharide biosynthesis protein, Glycosyl
transferase family 4 [Clostridium sp. M62/1]
gi|291074099|gb|EFE11463.1| glycosyltransferase, group 1 family protein [Clostridium sp. M62/1]
Length = 374
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 48/114 (42%), Gaps = 5/114 (4%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
+CGN L E+ + GL V GF D+N + ++DI L+ S Q +G P
Sbjct: 233 LICGNGR--LEKKLTELIEKNGL-RKQVYLLGFRNDINAICQVSDIFLFPSLQ--EGLPV 287
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
++ AM +PVI D + + I ++ E + A +N L
Sbjct: 288 ALMEAMANKMPVIATDIRGNHDLIQNNEGGILISPNSTEDMKNAILYLTANSHL 341
>gi|147677432|ref|YP_001211647.1| glycosyltransferase [Pelotomaculum thermopropionicum SI]
gi|146273529|dbj|BAF59278.1| glycosyltransferase [Pelotomaculum thermopropionicum SI]
Length = 423
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 49/113 (43%), Gaps = 4/113 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L VR GF D+ V + DI Y S + +GF I AM+ +PV++ + ++
Sbjct: 295 LSGHVRLLGFRKDIENVYSLFDI--YVSPSLMEGFGMAIAEAMSRRLPVVSTAAGGVTDF 352
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISN--GKLSKFARTVASAGRLHAKNMLA 591
+ G + +P L+ F++N G + AGR + ++A
Sbjct: 353 LEHGKNSYLVRPRDPSALAEGIRYFVNNKAGAVKIGETAFIDAGRFRTERLVA 405
>gi|357061105|ref|ZP_09121865.1| hypothetical protein HMPREF9332_01422 [Alloprevotella rava F0323]
gi|355375122|gb|EHG22412.1| hypothetical protein HMPREF9332_01422 [Alloprevotella rava F0323]
Length = 381
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 2/84 (2%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
VR+ G+ D+ L+ AD +++ S + +GFP+++++A +P I D E + G
Sbjct: 261 VRYVGWQTDIRPYLIAADTLVFPSYR--EGFPNVVLQAGAMELPAIVTDINGANEIIVNG 318
Query: 545 AQVIFFQKDNPEGLSRAFSLFISN 568
I + E L A FI +
Sbjct: 319 QNGIIIPPKDKEALYHAMISFIDS 342
>gi|227530687|ref|ZP_03960736.1| possible glycosyltransferase, partial [Lactobacillus vaginalis ATCC
49540]
gi|227349395|gb|EEJ39686.1| possible glycosyltransferase [Lactobacillus vaginalis ATCC 49540]
Length = 125
Score = 41.2 bits (95), Expect = 3.5, Method: Composition-based stats.
Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 3/83 (3%)
Query: 472 QEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVIT 531
Q + S+ L E +V+ GF D+N VL DI + + +G P I+ AM +G+PV+
Sbjct: 41 QRLVSKFNL-ESNVKFIGFREDINKVLEKKDIFVLSTHY--EGLPISIIEAMAYGLPVLA 97
Query: 532 PDFPIIKEYVAEGAQVIFFQKDN 554
D E V F N
Sbjct: 98 TDVGGNSEMVKNNINGFLFSSKN 120
>gi|186683232|ref|YP_001866428.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
gi|186465684|gb|ACC81485.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
Length = 379
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 62/133 (46%), Gaps = 8/133 (6%)
Query: 457 VFLCGNSTDGYNDALQEV---ASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
V L G++ G D Q++ + LGL E+ V+ GF D+ ++ D+V + S+ E
Sbjct: 231 VILVGDALFGEQDYAQKLHKQVTDLGL-ENRVKFLGFRSDIPQLMAACDLVAHTSTSPEP 289
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
F +IV AM G PV+ + E V G P+ L++ + I+ + ++
Sbjct: 290 -FGRVIVEAMLCGKPVVAAKAGGVMELVEHGLNGFLVTPGEPQELAQ---VIITCLQETE 345
Query: 574 FARTVASAGRLHA 586
T+A+ R A
Sbjct: 346 ITATIANNARTTA 358
>gi|414079281|ref|YP_007000705.1| group 1 glycosyl transferase [Anabaena sp. 90]
gi|413972560|gb|AFW96648.1| glycosyl transferase group 1 [Anabaena sp. 90]
Length = 383
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 3/108 (2%)
Query: 457 VFLCGNSTDGYNDALQEVASRL-GL-LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQG 514
V L G++ G + +Q++ ++ GL LE+ V GF D+ ++ M D++ + S+ E
Sbjct: 231 VILVGDALFGEQEYIQDLHQKVAGLGLENRVNFLGFRSDIPKLMAMCDLITHTSTAPEP- 289
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
F +IV AM G PVI E V +G P+ L++
Sbjct: 290 FGRVIVEAMLCGKPVIAAQAGGAIELVEDGINGFLVTPGEPQELAKVI 337
>gi|406880456|gb|EKD28806.1| Glycosyltransferase [uncultured bacterium]
Length = 170
Score = 41.2 bits (95), Expect = 3.5, Method: Composition-based stats.
Identities = 31/123 (25%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 449 SVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGS 508
S++ + K V LCG+ + L++ + L L E V G D+ ++ + D+ Y
Sbjct: 18 SIKDNLK-VLLCGSGP--MLEELKKQVAELNLNE-IVEFLGVRKDIPELMSVLDV--YTL 71
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ +G P I+ A+ G P++ + I E + +G + + +NPE L+ A + I++
Sbjct: 72 VSIYEGMPLSILEAIAAGKPIVATNVLGINEVIKDGENGLLAESNNPESLANALTRMIND 131
Query: 569 GKL 571
+L
Sbjct: 132 KEL 134
>gi|337750991|ref|YP_004645153.1| galactosyltransferase [Paenibacillus mucilaginosus KNP414]
gi|336302180|gb|AEI45283.1| galactosyltransferase [Paenibacillus mucilaginosus KNP414]
Length = 396
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNST----DGYNDALQEVASRLGLLEHSVRHYGFNGDVNG 496
LI+ A R +G+ + G++T Y + L +A LG+ + V G DV
Sbjct: 220 LIRAAERFKADGNRGHFVIVGDATFEHEKVYKEKLFALAGELGVTDR-VHFLGLRRDVPE 278
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
+L AD+ + S V FP+ ++ AM G+PVI+ D + E V
Sbjct: 279 LLRQADVFVL--SSVYDTFPTAVLEAMRAGLPVISTDVGGVPEMV 321
>gi|374312445|ref|YP_005058875.1| group 1 glycosyl transferase [Granulicella mallensis MP5ACTX8]
gi|358754455|gb|AEU37845.1| glycosyl transferase group 1 [Granulicella mallensis MP5ACTX8]
Length = 404
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 58/128 (45%), Gaps = 6/128 (4%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
ALQ AS LG+ + + G D+ +LL +D+ S +G P++++ AM G+PV
Sbjct: 265 ALQAAASNLGIADRML-FLGRVDDIAALLLASDVFALLSR--SEGMPNVVLEAMAAGLPV 321
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNM 589
+ E V +G ++ E + IS+ L AR + AG H N
Sbjct: 322 VATAVGGTPEVVVDGVTGYLVPNEDSEAAAERIGQLISDPYLR--AR-IGDAGITHIHNH 378
Query: 590 LALDCVTR 597
+L+ + R
Sbjct: 379 FSLEKMIR 386
>gi|187922918|ref|YP_001894560.1| group 1 glycosyl transferase [Burkholderia phytofirmans PsJN]
gi|187714112|gb|ACD15336.1| glycosyl transferase group 1 [Burkholderia phytofirmans PsJN]
Length = 382
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 47/102 (46%), Gaps = 4/102 (3%)
Query: 482 EHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
E + + G DV ++ AD V+ S + +G P ++ A G P++ D P +E V
Sbjct: 257 EGIIEYLGEAHDVRPLIAEADCVVLPSYR--EGVPRTLMEASAIGRPIVATDVPGCREVV 314
Query: 542 AEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
A+G + + N E L+ A + + + R +A GR
Sbjct: 315 ADGVNGLLCEVRNAESLAAALARMLDMSGAER--RAMAERGR 354
>gi|372323987|ref|ZP_09518576.1| glycosyl transferase group 1 [Oenococcus kitaharae DSM 17330]
gi|366982795|gb|EHN58194.1| glycosyl transferase group 1 [Oenococcus kitaharae DSM 17330]
Length = 375
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 30/59 (50%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+GFP + AM+FG+P+I D P ++E + G + N + ++ S + N L
Sbjct: 282 EGFPLVAAEAMSFGLPIIAFDIPSLEEVTSNGTYGVLVTNSNTKEMAEQISKLLENRNL 340
>gi|406931585|gb|EKD66859.1| hypothetical protein ACD_49C00005G0001, partial [uncultured
bacterium (gcode 4)]
Length = 630
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 53/109 (48%), Gaps = 8/109 (7%)
Query: 501 ADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSR 560
ADIVL S +GF + V AM F PVI KE V + +F++ N LS+
Sbjct: 276 ADIVLMCSKS--EGFGRVTVEAMLFEKPVIGSFSGGTKEIVVDNKNGLFYEPGNISDLSK 333
Query: 561 AFSLFISN-GKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENVLN 607
F N K+++F + G++ +++ + + V R ILEN+ N
Sbjct: 334 KIEFFYRNRNKIAEFGKN----GKIFCEDIFSEERYVGRIRNILENLKN 378
>gi|386726770|ref|YP_006193096.1| galactosyltransferase [Paenibacillus mucilaginosus K02]
gi|384093895|gb|AFH65331.1| galactosyltransferase [Paenibacillus mucilaginosus K02]
Length = 396
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 441 LIKYARRNSVEGSFKFVFLCGNST----DGYNDALQEVASRLGLLEHSVRHYGFNGDVNG 496
LI+ A R +G+ + G++T Y + L +A LG+ + V G DV
Sbjct: 220 LIRAAERFKADGNRGHFVIVGDATFEHEKVYKEKLFALAGELGVTDR-VHFLGLRRDVPE 278
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYV 541
+L AD+ + S V FP+ ++ AM G+PVI+ D + E V
Sbjct: 279 LLRQADVFVL--SSVYDTFPTAVLEAMRAGLPVISTDVGGVPEMV 321
>gi|251798151|ref|YP_003012882.1| group 1 glycosyl transferase [Paenibacillus sp. JDR-2]
gi|247545777|gb|ACT02796.1| glycosyl transferase group 1 [Paenibacillus sp. JDR-2]
Length = 457
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 458 FLCGNSTDGY-NDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
++C DG+ D L ++ ++LGL H + +G DV ++ AD V+ S Q Q F
Sbjct: 260 WVCWLVGDGFLRDELIQLTNQLGLANH-ILFWGHREDVPEMMQQADFVVLPSMQDNQPFA 318
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
I+ A G PVIT D I E V + + +PE L
Sbjct: 319 --IIEAQIAGKPVITSDAGGIPEMVTHLQNGLISRLGDPEQL 358
>gi|320449388|ref|YP_004201484.1| glycosyltransferase [Thermus scotoductus SA-01]
gi|320149557|gb|ADW20935.1| glycosyltransferase [Thermus scotoductus SA-01]
Length = 386
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 17/149 (11%)
Query: 396 HEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFK 455
H +LR+E G +D + + VG E S DY +L +Y R
Sbjct: 188 HAGLRLREELGIERDTFLWLAVGRL----EESKDYPTLFKAFSRVLTEYPRAQ------- 236
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ + G+ L+++ LGL + SVR G DV ++ AD + S+ +G
Sbjct: 237 -LLVAGHG--ALEKELRQMVGSLGL-QTSVRFLGLRKDVPDLMKAADAFVMSSAW--EGM 290
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
P +++ A G+P++ D +E V +G
Sbjct: 291 PMVLLEAHATGLPIVATDVGGNREVVQDG 319
>gi|434394651|ref|YP_007129598.1| glycosyl transferase group 1 [Gloeocapsa sp. PCC 7428]
gi|428266492|gb|AFZ32438.1| glycosyl transferase group 1 [Gloeocapsa sp. PCC 7428]
Length = 372
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 8/117 (6%)
Query: 505 LYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSL 564
+Y +GFP IV AM +PV+ + E V +G K++ GL+ A
Sbjct: 258 IYVQPSRSEGFPLAIVEAMLASLPVVATRVGSVAEAVIDGETGFLVNKNDVAGLAAALCR 317
Query: 565 FISNGKLS-KFARTVASAGRLHAKNMLALDCVTR-YARILENVLNFP--SDALLPGP 617
+NG+L KF + GR A+ + +TR Y R+ ++N P S +P P
Sbjct: 318 LRNNGELRWKFGQK----GRAIAQASFTVKHMTRSYERLWYKLVNQPRASRLFVPRP 370
>gi|119899550|ref|YP_934763.1| glycosyltransferase [Azoarcus sp. BH72]
gi|119671963|emb|CAL95877.1| glycosyltransferase [Azoarcus sp. BH72]
Length = 410
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
VR GF DV V M D+ L S Q +GF AM G+PV+ D P ++ VA+G
Sbjct: 282 VRILGFQPDVENVFQMLDVHLLLSRQ--EGFGIATAEAMACGLPVVATDVPGSRDVVADG 339
Query: 545 AQVI 548
+
Sbjct: 340 CGTL 343
>gi|257053201|ref|YP_003131034.1| glycosyl transferase group 1 [Halorhabdus utahensis DSM 12940]
gi|256691964|gb|ACV12301.1| glycosyl transferase group 1 [Halorhabdus utahensis DSM 12940]
Length = 370
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 66/155 (42%), Gaps = 5/155 (3%)
Query: 450 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 509
VE + V L + D L E+A LGL V G+ + + L+ D +Y +
Sbjct: 219 VERQTETVSLTVAGSGPLEDELHELAGDLGL--DRVNFPGYVSESHKKGLLRDASIYVTP 276
Query: 510 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG 569
+ +GF + AM G V++ D +K+Y+ +G +P ++ A + + N
Sbjct: 277 TIYEGFGISPLEAMASGCAVVSSDTWGVKDYIEDGVNGRLVPTRSPHQVATAVTDLLEND 336
Query: 570 KLSKFARTVASAGRLHAKNMLALDCVTRYARILEN 604
+ +VA GR AK + R +LEN
Sbjct: 337 ERRC---SVAEHGRATAKVYSMDKSLDREVTVLEN 368
>gi|332158755|ref|YP_004424034.1| LPS biosynthesis rfbu-like protein [Pyrococcus sp. NA2]
gi|331034218|gb|AEC52030.1| LPS biosynthesis rfbu related protein [Pyrococcus sp. NA2]
Length = 369
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVL--LMADIVLYGSSQVEQGFPSLIVRAMTFG 526
+ L ++AS L L ++ V+ GF D + V+ L + V S+ E GF +++ A G
Sbjct: 234 ERLVKLASILDLSDN-VKFLGFLKDHDEVISYLKSSKVFVLPSKRE-GFGIVVLEANASG 291
Query: 527 IPVITPDFPI--IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRL 584
+PVIT D+P+ KE + G F P+ L++ + +SN K KF R R
Sbjct: 292 LPVITLDYPMNASKELIIHGYNG-FVSSPTPDHLAKYIEISLSNRK--KFRRNCTKNARR 348
Query: 585 HAKNMLALDCVTRYARIL 602
+ N++A Y R+L
Sbjct: 349 YDWNIIAKLTEKFYERVL 366
>gi|352095224|ref|ZP_08956327.1| glycosyl transferase group 1 [Synechococcus sp. WH 8016]
gi|351679235|gb|EHA62377.1| glycosyl transferase group 1 [Synechococcus sp. WH 8016]
Length = 369
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ +A LGL+ V G D+ +L AD+ + S+ +GF +++ AM G+P+I
Sbjct: 238 LEGLARDLGLVPSQV-FLGRRSDIPELLGQADLFAF-STTFAEGFGIVLIEAMAAGLPII 295
Query: 531 TPDFPIIKEYVAEGA 545
D P +E + +GA
Sbjct: 296 ASDVPACREVLRDGA 310
>gi|146295262|ref|YP_001179033.1| group 1 glycosyl transferase [Caldicellulosiruptor saccharolyticus
DSM 8903]
gi|145408838|gb|ABP65842.1| glycosyl transferase, group 1 [Caldicellulosiruptor saccharolyticus
DSM 8903]
Length = 425
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 7/114 (6%)
Query: 469 DALQEVASR-LGLLEHSVRHY-GF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMT 524
++L+E A R L E+ ++ GF N + + D L+ ++ +G P I+ AM+
Sbjct: 272 ESLKEYAQRKLSKKENITYNFMGFLPNSKILDLYAKEDFNLFVNTSSSEGLPVSIMEAMS 331
Query: 525 FGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG-LSRAFSLFIS--NGKLSKFA 575
FGIPV+ D ++E V +G+ +KD E LS FI G+ +F
Sbjct: 332 FGIPVVATDVGGVREIVKDGSNGYLLEKDFTEDELSSLIEKFIQMPEGEYQRFC 385
>gi|260910845|ref|ZP_05917490.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635034|gb|EEX53079.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
Length = 401
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+VR G+ DV LL AD +++ S + +GFP+++++A G+P I D E + +
Sbjct: 275 NVRFVGYQTDVRPYLLAADALVFPSYR--EGFPNVVMQAGAMGLPSIVTDINGCNEIIVD 332
Query: 544 GAQVIFFQKDNPEGLSRAFSLFI 566
G + L A F+
Sbjct: 333 GTNGKIIPPKDEAALFSAMQFFL 355
>gi|256847893|ref|ZP_05553338.1| lipopolysaccharide 1,6-galactosyltransferase [Lactobacillus
coleohominis 101-4-CHN]
gi|256715582|gb|EEU30558.1| lipopolysaccharide 1,6-galactosyltransferase [Lactobacillus
coleohominis 101-4-CHN]
Length = 352
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 4/102 (3%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
DA Q+ A G+ + + +GF + AD +L S+ +G P +I+ AM++G+P
Sbjct: 219 DACQQYAQNKGI-SNKINWHGFTQKPFEKIKTADALLLSSTY--EGLPMVILEAMSYGVP 275
Query: 529 VITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNG 569
I+ + P E + G + + + L++ +I+NG
Sbjct: 276 CISSNCPSGTTELIKNGKNGYLYNMGDSDDLAKKVIKYITNG 317
>gi|406904722|gb|EKD46408.1| glycosyltransferase [uncultured bacterium]
Length = 370
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 71/160 (44%), Gaps = 17/160 (10%)
Query: 455 KFVFLCGNSTDGYNDALQ--EVASRL---GLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 509
KFVF N DA + EV S L G+LE V F D+ + +D+V++
Sbjct: 216 KFVF--ANRVKNEKDARKKDEVISTLKEKGILEKVVFTDTF-SDMPKIYNFSDVVVFPVR 272
Query: 510 QVEQGF--PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS 567
++ F P ++ AM+ PVI + PI+KE+ + ++ D+ + + LF +
Sbjct: 273 NMQGKFDVPLAVIEAMSCAKPVIISNLPILKEFTSNETSMVIDPTDSNQLIGSIVDLFEN 332
Query: 568 NGKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLN 607
K +K + A R + ++ V A+ EN N
Sbjct: 333 KEKCAK----IGEAARKYVVENFDINSV---AKQYENAYN 365
>gi|297585310|ref|YP_003701090.1| glycosyl transferase group 1 protein [Bacillus selenitireducens
MLS10]
gi|297143767|gb|ADI00525.1| glycosyl transferase group 1 [Bacillus selenitireducens MLS10]
Length = 364
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 4/123 (3%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE V GF D+ ++ DI + S+ +GF + V AM GIPV+ D P ++E
Sbjct: 245 LEDRVSFLGFRKDIPEIMKTCDIFVLSSNW--EGFGLVAVEAMASGIPVLVSDVPGLREV 302
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
V G + + F K + L++ + + K ++ + + ++ + RY R
Sbjct: 303 V--GRKDMLFPKGDSRLLAKQMKDLLIDEKRNEIVEYGINRSQQYSIFSMVTQIQQRYNR 360
Query: 601 ILE 603
+L+
Sbjct: 361 LLK 363
>gi|346226412|ref|ZP_08847554.1| group 1 glycosyl transferase [Anaerophaga thermohalophila DSM
12881]
Length = 362
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 68/148 (45%), Gaps = 7/148 (4%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ FL G+ L++ +LGL E GF +++ +L ADI ++ S + +G
Sbjct: 220 YFFLAGHGK--LQSELRKQIKQLGL-EKRFILLGFIENIHPLLKSADIFVF--SSLYEGM 274
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
P+ I+ AM G+PV++ + I E + G + + L+ A I + + K
Sbjct: 275 PNSILEAMAHGLPVVSTNVNGISELIENGVNGYMVKPGKTDELTTALDNVIRDKE--KIK 332
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILE 603
R A+A ++N V + ++L+
Sbjct: 333 RVGANARHFVSQNFAVQKMVEKLNKLLD 360
>gi|257054680|ref|YP_003132512.1| glycosyltransferase [Saccharomonospora viridis DSM 43017]
gi|256584552|gb|ACU95685.1| glycosyltransferase [Saccharomonospora viridis DSM 43017]
Length = 379
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 57/109 (52%), Gaps = 11/109 (10%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYG--FNGDVNGVLLMADIVLYGSSQVEQG 514
+ + G G D L+ +A+R GL++ VR G +G++ VL A +V S + +G
Sbjct: 222 LLVVGQPGWGGLDPLR-LAARHGLVDGRVRVLGRLTDGELATVLRNATVVAVPS--LAEG 278
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFS 563
F ++ AM G+PV+ D P + E A+ A+V+ P G +RA +
Sbjct: 279 FGLPVLEAMAAGVPVVHSDDPALVEVSAKAAEVV------PRGDARALA 321
>gi|158521844|ref|YP_001529714.1| group 1 glycosyl transferase [Desulfococcus oleovorans Hxd3]
gi|158510670|gb|ABW67637.1| glycosyl transferase group 1 [Desulfococcus oleovorans Hxd3]
Length = 368
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 495 NGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDN 554
GV A + ++ S+ E G P+ ++ AM FG+PVIT I ++ +G +F +
Sbjct: 255 TGVFKSAHVYVFPSAHGE-GMPTSVLEAMAFGLPVITTRVSGISDFFEDGKMGLFLDTRD 313
Query: 555 PEGLSRAFSLFISNGKL 571
PE +++ + +L
Sbjct: 314 PEHIAKKIRYLLDRPEL 330
>gi|15643387|ref|NP_228431.1| lipopolysaccharide biosynthesis protein [Thermotoga maritima MSB8]
gi|418045206|ref|ZP_12683302.1| glycosyl transferase group 1 [Thermotoga maritima MSB8]
gi|4981142|gb|AAD35706.1|AE001736_4 lipopolysaccharide biosynthesis protein, putative [Thermotoga
maritima MSB8]
gi|351678288|gb|EHA61435.1| glycosyl transferase group 1 [Thermotoga maritima MSB8]
Length = 388
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 45/92 (48%), Gaps = 3/92 (3%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
++E+ +LGL E V+ +G DV +L ADI + S +GF ++ AM G+PVI
Sbjct: 242 IEELVKQLGL-EEKVKFFGVRSDVPELLSQADIFVLSSDY--EGFGLVVAEAMAAGLPVI 298
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
I E + G I + + L++A
Sbjct: 299 ATAIGGIPEILEGGRAGILVPPKDVDALAKAI 330
>gi|428307003|ref|YP_007143828.1| group 1 glycosyl transferase [Crinalium epipsammum PCC 9333]
gi|428248538|gb|AFZ14318.1| glycosyl transferase group 1 [Crinalium epipsammum PCC 9333]
Length = 371
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 5/116 (4%)
Query: 457 VFLC-GNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
VF+C G + Y + ++A++L + E + G DV V DI + SS E GF
Sbjct: 229 VFVCIGGGLENYAREMHQLANQLEISEKVI-WAGARADVPDVYNALDIAVSSSSYGE-GF 286
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
P++I AM G+P + D V E V+ NPE L + + + + K+
Sbjct: 287 PNVIGEAMATGVPCVVTDVGDSAWIVGETGIVV--PPKNPEALVKGWLVCLEIDKI 340
>gi|119720627|ref|YP_921122.1| glycosyl transferase, group 1 [Thermofilum pendens Hrk 5]
gi|119525747|gb|ABL79119.1| glycosyl transferase, group 1 [Thermofilum pendens Hrk 5]
Length = 383
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLF 565
+GF +++ A+ G+PVI D P + E V GA I KD+ L+RA F
Sbjct: 286 EGFGIVLIEALAAGVPVIAYDIPPLNEVVEHGATGILVPKDDVRVLARAIVRF 338
>gi|158317623|ref|YP_001510131.1| group 1 glycosyl transferase [Frankia sp. EAN1pec]
gi|158113028|gb|ABW15225.1| glycosyl transferase group 1 [Frankia sp. EAN1pec]
Length = 395
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 12/110 (10%)
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
V+L VL S+ E GF ++ AM G+PV+ D P + E + A V + +P
Sbjct: 279 VVLSRATVLVAPSRSE-GFGLPVIEAMAHGVPVVVSDAPALVEVAGDAALV--ARIGDPA 335
Query: 557 GLSRAFSLFISNGKL-SKFARTVASAGRLHAK----NMLALDCVTRYARI 601
G + A + + N +L S+ +R+ GR+ A N A C Y RI
Sbjct: 336 GFAEALARIVQNPRLHSRLSRS----GRVRAAGYTWNGAARSCWELYRRI 381
>gi|448668354|ref|ZP_21686485.1| group 1 glycosyl transferase [Haloarcula amylolytica JCM 13557]
gi|445768436|gb|EMA19521.1| group 1 glycosyl transferase [Haloarcula amylolytica JCM 13557]
Length = 403
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 63/138 (45%), Gaps = 12/138 (8%)
Query: 453 SFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQ 510
+F+F + G+ GY DA+Q A +LG+ + V YG + DV G+L D+ +
Sbjct: 244 TFEF-HIAGDGQSGYIDAMQAKAVKLGVADQIV-WYGRVPHSDVPGLLTRTDVAISPLED 301
Query: 511 VEQ---GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS 567
+E +P+ ++ M G V+ D P + + +G + + GL A +
Sbjct: 302 IESYRISYPAKLLEYMAAGAVVMATDIPPHQRLITDGDNGLLYDG-TATGLCSAIEQCLD 360
Query: 568 N----GKLSKFARTVASA 581
+ G++ + AR A A
Sbjct: 361 DPALAGQIERTARETAEA 378
>gi|254507999|ref|ZP_05120127.1| glycosyltransferase [Vibrio parahaemolyticus 16]
gi|219549107|gb|EED26104.1| glycosyltransferase [Vibrio parahaemolyticus 16]
Length = 402
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 2/105 (1%)
Query: 473 EVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITP 532
E + L L H + G +V G L D + G+ + F +I A IPVI P
Sbjct: 240 ESQAELLRLAHCIHFVGEKSNVVGWLKDCDAFISGARN--EAFGLVIAEASLANIPVIAP 297
Query: 533 DFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
I E+V G + ++ +++A L I+N KLS + T
Sbjct: 298 YEGGIPEFVRHGETGLLYKNVGIRSMTKAMRLVIANPKLSAYLAT 342
>gi|373112080|ref|ZP_09526313.1| hypothetical protein HMPREF9466_00346 [Fusobacterium necrophorum
subsp. funduliforme 1_1_36S]
gi|371656158|gb|EHO21489.1| hypothetical protein HMPREF9466_00346 [Fusobacterium necrophorum
subsp. funduliforme 1_1_36S]
Length = 275
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 6/115 (5%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+F+ G T+ N L+ ++ L L E + G + + A + L+ S +++GF
Sbjct: 136 LFILGTGTEQSN--LEALSKSLSL-ESEITFLGQKQNPYPYMKQAKLFLHAS--LQEGFG 190
Query: 517 SLIVRAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK 570
++V AM G PVI D PI KE + +G I N + +A ++N K
Sbjct: 191 LVLVEAMACGTPVIATDCPIGPKEILLDGKCGILVSMKNERSMFQAIQQLMTNSK 245
>gi|156743817|ref|YP_001433946.1| group 1 glycosyl transferase [Roseiflexus castenholzii DSM 13941]
gi|156235145|gb|ABU59928.1| glycosyl transferase group 1 [Roseiflexus castenholzii DSM 13941]
Length = 408
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 64/134 (47%), Gaps = 8/134 (5%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYG-FNGDVNGVLLMADIVLYGSSQVEQG 514
FVF+ N + L A++ G+ EH++ G + + LL A +L S+ E G
Sbjct: 257 FVFVTHNQAQRAD--LMRRAAQQGI-EHNLHFLGTISEEEKLALLRASDLLPFPSRYE-G 312
Query: 515 FPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKF 574
F ++ M G+PVI+ + P++ E V G + D+ L++A + + ++ K
Sbjct: 313 FGLPLLEGMAAGVPVISTNIPVVNEIVIHGENGLLIPYDDTHALAQAMLTLLEDQEMRK- 371
Query: 575 ARTVASAGR-LHAK 587
R +A R LH +
Sbjct: 372 -RLIAGGKRALHER 384
>gi|83594676|ref|YP_428428.1| group 1 glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
gi|386351441|ref|YP_006049689.1| group 1 glycosyl transferase [Rhodospirillum rubrum F11]
gi|83577590|gb|ABC24141.1| Glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170]
gi|346719877|gb|AEO49892.1| glycosyl transferase, group 1 [Rhodospirillum rubrum F11]
Length = 400
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 61/131 (46%), Gaps = 9/131 (6%)
Query: 478 LGLLE-HSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
LGL + + R +G D+ A + ++G+ + F +I AM G+PV+
Sbjct: 267 LGLPDAETYRFFGRQNDLRPYYQAASVFIHGA--FRESFGLVIAEAMASGLPVVATHAHG 324
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVT 596
E +AEG +D+ EGL +A ++ + +L RT AGR +AL +T
Sbjct: 325 PAEIIAEGETGRLVGRDDWEGLGKAVIEYLDDAELR---RTHGLAGR---ARCVALFSIT 378
Query: 597 RYARILENVLN 607
R A+ ++L
Sbjct: 379 RQAQEFADLLR 389
>gi|319957723|ref|YP_004168986.1| group 1 glycosyl transferase [Nitratifractor salsuginis DSM 16511]
gi|319420127|gb|ADV47237.1| glycosyl transferase group 1 [Nitratifractor salsuginis DSM 16511]
Length = 347
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 4/92 (4%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPV 529
+L+ +A LG+ + + H G+ D+ + D + S +GFP + G P+
Sbjct: 219 SLEALAETLGVKKRVLFH-GYVSDIASFIAEIDFFIVASRS--EGFPLALQEIACKGKPL 275
Query: 530 ITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRA 561
I D P+ +E E +V FF+ +N E L RA
Sbjct: 276 ICSDIPLFREIFTED-EVRFFELENIESLVRA 306
>gi|350530035|ref|ZP_08908976.1| group 1 glycosyl transferase [Vibrio rotiferianus DAT722]
Length = 375
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L++++ +L L EH + G+ DV +L +D+++ S Q +G P ++ AM+ G PVI
Sbjct: 244 LEQLSRQLKLNEH-ILFLGYRTDVKRLLSASDVLVLSSKQ--EGLPRCVMEAMSVGKPVI 300
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+ ++ + GA I + ++ EG A + N L
Sbjct: 301 ASNIRGNRDLLQSGAG-ILVELNDVEGWQVAIQELVDNESL 340
>gi|228934997|ref|ZP_04097828.1| Glycosyl transferase group 1 [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228824897|gb|EEM70698.1| Glycosyl transferase group 1 [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
Length = 374
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 55/114 (48%), Gaps = 7/114 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++A++LGL + V G+ D+ +L M D+ + +S +G P I+ AM G+P
Sbjct: 231 EECKKLATQLGL-SNKVHFLGYRNDIASLLQMCDLAV--ASSYREGLPVNIMEAMACGLP 287
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS----KFARTV 578
VI +E + ++D+ + ++ + N KL ++ RT+
Sbjct: 288 VIATSNRGHRELIIHNKNGWTIERDDIKTMAERINSISKNTKLQAQFGRYGRTI 341
>gi|330816925|ref|YP_004360630.1| group 1 glycosyl transferase [Burkholderia gladioli BSR3]
gi|327369318|gb|AEA60674.1| glycosyl transferase group 1 [Burkholderia gladioli BSR3]
Length = 369
Score = 40.4 bits (93), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 481 LEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
L V+H GF +G++ + A ++ S G P V AMT G P++ D P ++
Sbjct: 248 LPDYVKHVGFVSDGELKALFRRATCFVHPSRYEGFGLPP--VEAMTLGCPIVVADLPAVR 305
Query: 539 EYVAEGAQVIFFQKDNP 555
E E A I+ D+P
Sbjct: 306 EACGEAA--IYVSPDDP 320
>gi|312136907|ref|YP_004004244.1| group 1 glycosyl transferase [Methanothermus fervidus DSM 2088]
gi|311224626|gb|ADP77482.1| glycosyl transferase group 1 [Methanothermus fervidus DSM 2088]
Length = 362
Score = 40.4 bits (93), Expect = 5.0, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 482 EHSVRHYGFNG---DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIK 538
E ++ F G D+N V+ AD+V+ S + + F ++ AM G PV+ IK
Sbjct: 238 EEKIKDVVFTGVRHDINNVMAAADLVVLPS--ISESFGLALLEAMACGKPVVATKVGGIK 295
Query: 539 EYVAEGAQVIFFQKDNPEGLSRAFSLFISNGK----LSKFARTVA 579
E V E ++ +D P+ L+ A + N K + K AR +A
Sbjct: 296 EIVTEDVGLLVNPRD-PKALANAIDYILKNEKKKKEMGKNARKIA 339
>gi|148272186|ref|YP_001221747.1| putative glycosyltransferase,mannosyltransferase [Clavibacter
michiganensis subsp. michiganensis NCPPB 382]
gi|147830116|emb|CAN01045.1| putative glycosyltransferase,mannosyltransferase [Clavibacter
michiganensis subsp. michiganensis NCPPB 382]
Length = 394
Score = 40.4 bits (93), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ + G +T G + L +VA G+ VR G D + + + ++ + +GF
Sbjct: 248 LLIVGPATWGDVE-LAQVADEAGVDPSRVRSLGSLTDADLAVTLDRATVFVHPSLSEGFG 306
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG----LSRAFSLFISN 568
+V A++FG PV+ D P + E A+ V+ +++P+G L+ A +S+
Sbjct: 307 LPVVEALSFGTPVVHSDAPALLEVAADAGVVV--PREDPDGYPLRLAEAIGGLLSD 360
>gi|430746176|ref|YP_007205305.1| glycosyltransferase [Singulisphaera acidiphila DSM 18658]
gi|430017896|gb|AGA29610.1| glycosyltransferase [Singulisphaera acidiphila DSM 18658]
Length = 368
Score = 40.4 bits (93), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ + + GL + VR G D+ L D+ + S + +G P I+ AM G+PVI
Sbjct: 235 LERLVAERGLTDR-VRFLGMRRDIAEQLREMDVFVLSSHR--EGRPISIMEAMASGLPVI 291
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-SKFA---RTVASAG 582
+ I + EG + + + EGL+RA + + S+FA R +A AG
Sbjct: 292 STRVGSIPGLIGEGQAGLLVEPGDVEGLARAIMTLADDSEARSRFAVESRRLAVAG 347
>gi|196038348|ref|ZP_03105657.1| EpsD [Bacillus cereus NVH0597-99]
gi|196030756|gb|EDX69354.1| EpsD [Bacillus cereus NVH0597-99]
Length = 385
Score = 40.4 bits (93), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 55/114 (48%), Gaps = 7/114 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++A++LGL + V G+ D+ +L M D+ + +S +G P I+ AM G+P
Sbjct: 242 EECKKLATQLGL-SNKVHFLGYRNDIASLLQMCDLAV--ASSYREGLPVNIMEAMACGLP 298
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS----KFARTV 578
VI +E + ++D+ + ++ + N KL ++ RT+
Sbjct: 299 VIATSNRGHRELIIHNKNGWTIERDDIKTMAERINSISKNTKLQAQFGRYGRTI 352
>gi|347754324|ref|YP_004861888.1| glycosyltransferase [Candidatus Chloracidobacterium thermophilum B]
gi|347586842|gb|AEP11372.1| Glycosyltransferase [Candidatus Chloracidobacterium thermophilum B]
Length = 378
Score = 40.4 bits (93), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 20/78 (25%), Positives = 37/78 (47%), Gaps = 3/78 (3%)
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
S F +V AM G+P + D P ++E A+G F+ + +R + + +
Sbjct: 281 SSCSDTFGLALVEAMLAGLPCVASDIPALREVTADGTAAALFRAGDAADCARVLNHVLDD 340
Query: 569 GKLSKFARTVASAGRLHA 586
+ + RT+A+ G+ HA
Sbjct: 341 ---ADYRRTLAARGQAHA 355
>gi|305666578|ref|YP_003862865.1| putative Capsular polysaccharide biosynthesis glycosyl transferase
[Maribacter sp. HTCC2170]
gi|88708849|gb|EAR01084.1| putative Capsular polysaccharide biosynthesis glycosyl transferase
[Maribacter sp. HTCC2170]
Length = 388
Score = 40.4 bits (93), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
+ H G DV L M+D +++ S + +GFP++++ A G+P I D E + EG
Sbjct: 270 IIHVGPQKDVRPFLAMSDALVFPSYR--EGFPNVVLEAGAMGLPSIVTDINGCNEIITEG 327
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
+ + L + F+++ L T+A+ R
Sbjct: 328 QNGVIIPSRDEAALHKNMKTFVTDRNL---VNTLAANAR 363
>gi|388600258|ref|ZP_10158654.1| group 1 glycosyl transferase [Vibrio campbellii DS40M4]
Length = 378
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
L+ +A +GL H + G DV +L D+V S + +G P ++ AM+ G P+I
Sbjct: 246 LENLAKAVGL-SHQIMFLGQRDDVPDLLSACDVVTLSS--LREGLPRCLLEAMSVGRPII 302
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
+ K+ ++ GA V+ K++ + ++++S + A+ +A GR
Sbjct: 303 ASNIRGCKDLLSSGAGVLVDPKNSQQWAKAIQNVYLSPNQ----AKAMADKGR 351
>gi|242399385|ref|YP_002994809.1| Glycosyltransferase, family 4 [Thermococcus sibiricus MM 739]
gi|242265778|gb|ACS90460.1| Glycosyltransferase, family 4 [Thermococcus sibiricus MM 739]
Length = 367
Score = 40.4 bits (93), Expect = 5.7, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 50/100 (50%), Gaps = 7/100 (7%)
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
+V+ G+ DV G+L +++ S + +GF +I+ A + G+PVI IKE + E
Sbjct: 227 NVKFLGYKEDVGGILSKTKLLVVPS--LREGFGLVIIEANSLGVPVIGRAVGGIKELIRE 284
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNG---KLSKFARTVAS 580
G F+ E L + +SN K+ K +T++S
Sbjct: 285 GKNGYTFK--TFEELVEKVEILLSNKKALKMGKIGKTISS 322
>gi|187736561|ref|YP_001878673.1| group 1 glycosyl transferase [Akkermansia muciniphila ATCC BAA-835]
gi|187426613|gb|ACD05892.1| glycosyl transferase group 1 [Akkermansia muciniphila ATCC BAA-835]
Length = 392
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 66/146 (45%), Gaps = 13/146 (8%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
+ G+ D D L++ A +LGL F+G N + ++ + +GF +
Sbjct: 258 IVGDGPDA--DMLKDSAQKLGL-----SRIAFHGFQNPEPYYSRASVFCMTSTFEGFGLV 310
Query: 519 IVRAMTFG-IPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN-GKLSKFAR 576
+V AM G +PV +P +++ ++ G I E S A + I+N G+L KF+R
Sbjct: 311 LVEAMQHGCVPVAFDSYPAVRDIISHGENGILVPPFQEEIYSNALTSLINNPGELEKFSR 370
Query: 577 TVASAGRLHAKNMLALDCVTRYARIL 602
R + + LA R+A IL
Sbjct: 371 HSLVTSRNFSSSNLA----ARWAAIL 392
>gi|189345826|ref|YP_001942355.1| group 1 glycosyl transferase [Chlorobium limicola DSM 245]
gi|189339973|gb|ACD89376.1| glycosyl transferase group 1 [Chlorobium limicola DSM 245]
Length = 364
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 7/104 (6%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
LE S GF D+ L +D+ + S + +G P++++ AM PV+ D +E
Sbjct: 236 LEQSFIFEGFAADIYPYLKGSDLFVLAS--LFEGMPNVVMEAMAVEKPVVATDVNGAREL 293
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNG-KLSKFARTVASAGR 583
+ + + NPE L+RA I N +L+ F + AGR
Sbjct: 294 MLDDVTGLIVPPSNPEALARAIGSIIDNPMRLASFGQ----AGR 333
>gi|398818383|ref|ZP_10576974.1| glycosyltransferase [Brevibacillus sp. BC25]
gi|398027825|gb|EJL21355.1| glycosyltransferase [Brevibacillus sp. BC25]
Length = 369
Score = 40.4 bits (93), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
++ L G+ ++ AL+ LGL H VR GF D+ L D ++ S + +G
Sbjct: 225 YLVLIGDGSE--RAALEAKVKELGLESH-VRFAGFRQDIPACLHALDGFVH--SSLYEGL 279
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAF 562
I+ AM +PV+ +KE+V +G + + NP L++A
Sbjct: 280 GYTIIEAMASEVPVVASSVGGVKEFVFDGDTGLIVEPGNPALLAQAM 326
>gi|375010296|ref|YP_004983929.1| glycosyl transferase group 1 [Geobacillus thermoleovorans
CCB_US3_UF5]
gi|359289145|gb|AEV20829.1| Glycosyl transferase group 1 [Geobacillus thermoleovorans
CCB_US3_UF5]
Length = 360
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 472 QEVASRLGLLEHSVRHYGF---NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+E+ +RLG+ SV + N V + DI S++ + F V AM G+P
Sbjct: 220 EELCARLGI--QSVTTFAGKVPNEQVPLYINQMDIFAVPSTEDSESFGVAAVEAMACGVP 277
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V+ + + E V EG + K++PE L+ AF + + +L +
Sbjct: 278 VVVSNVGGLPEVVREGTTGLIVPKNSPEKLAEAFERLLLDERLRQ 322
>gi|441497294|ref|ZP_20979510.1| glycosyltransferase [Fulvivirga imtechensis AK7]
gi|441438960|gb|ELR72288.1| glycosyltransferase [Fulvivirga imtechensis AK7]
Length = 405
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 3/83 (3%)
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
G P++++ AM G+P+IT I E + + ++ +PE ++ I+N L K
Sbjct: 309 GLPNVLLEAMASGLPIITTPVSAIPELIIHKKNGLLVRERSPEDIAHHVEELIANPSLYK 368
Query: 574 FARTVASAGRLHAKNMLALDCVT 596
++S GR K +DC T
Sbjct: 369 ---EISSLGRKLLKGEFRIDCST 388
>gi|428297825|ref|YP_007136131.1| group 1 glycosyl transferase [Calothrix sp. PCC 6303]
gi|428234369|gb|AFZ00159.1| glycosyl transferase group 1 [Calothrix sp. PCC 6303]
Length = 376
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDNP 555
+L ADI + S +GF ++I+ AM G+PVI+ D P +E + +G I + +N
Sbjct: 266 ILKCADIFVLSSRY--EGFGNVIIEAMACGLPVISTDCPSGPREIIRDGIDGILVENENI 323
Query: 556 EGLSRAFSLFISNGKLSKFARTVASAG 582
LS A + IS+ + + AS G
Sbjct: 324 SSLSTAIARLISDPQERQRLSKNASEG 350
>gi|56421694|ref|YP_149012.1| glycosyltransferase [Geobacillus kaustophilus HTA426]
gi|56381536|dbj|BAD77444.1| glycosyltransferase [Geobacillus kaustophilus HTA426]
Length = 377
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 472 QEVASRLGLLEHSVRHYGF---NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+E+ +RLG+ SV + N V + DI S++ + F V AM G+P
Sbjct: 237 EELCARLGI--QSVTTFAGKVPNEQVPLYINQMDIFAVPSTEDSESFGVAAVEAMACGVP 294
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V+ + + E V EG + K++PE L+ AF + + +L +
Sbjct: 295 VVVSNVGGLPEVVREGTTGLIVPKNSPEKLAEAFERLLLDERLRQ 339
>gi|392429461|ref|YP_006470475.1| hypothetical protein SCIM_1575 [Streptococcus intermedius JTH08]
gi|419776892|ref|ZP_14302811.1| glycosyltransferase 4-like / glycosyltransferase, group 1 family
multi-domain protein [Streptococcus intermedius SK54]
gi|423071672|ref|ZP_17060445.1| hypothetical protein HMPREF9177_01762 [Streptococcus intermedius
F0413]
gi|355363446|gb|EHG11183.1| hypothetical protein HMPREF9177_01762 [Streptococcus intermedius
F0413]
gi|383845577|gb|EID82980.1| glycosyltransferase 4-like / glycosyltransferase, group 1 family
multi-domain protein [Streptococcus intermedius SK54]
gi|391758610|dbj|BAM24227.1| conserved hypothetical protein [Streptococcus intermedius JTH08]
Length = 376
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
+CG T + L ++ + GL E+ VR G+ DV+ +L +DI ++ S +G P
Sbjct: 237 ICGEGT--LKEELTQLIKQYGL-ENQVRLLGYRDDVHRLLAESDIFVFPS--FREGLPVS 291
Query: 519 IVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGL 558
++ AM G+PV+ D + V F+ N E L
Sbjct: 292 VMEAMATGLPVLASDIRGNHDLVIHEKGGFLFESGNVEDL 331
>gi|307566346|ref|ZP_07628785.1| glycosyltransferase, group 1 family protein [Prevotella amnii CRIS
21A-A]
gi|307344923|gb|EFN90321.1| glycosyltransferase, group 1 family protein [Prevotella amnii CRIS
21A-A]
Length = 377
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 42/84 (50%), Gaps = 2/84 (2%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
+ + G DV LL +D +++ S + +GFP++++ A G+P I D E + +G
Sbjct: 260 ILYVGSQKDVRPYLLASDALVFPSYR--EGFPNVVMEAGAMGLPSIVTDINGSNEIIEDG 317
Query: 545 AQVIFFQKDNPEGLSRAFSLFISN 568
+ + L +A +LFI N
Sbjct: 318 VNGLIIPSKSAIALLKAMNLFIQN 341
>gi|345007211|ref|YP_004810063.1| group 1 glycosyl transferase [halophilic archaeon DL31]
gi|344322837|gb|AEN07690.1| glycosyl transferase group 1 [halophilic archaeon DL31]
Length = 364
Score = 40.4 bits (93), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/91 (24%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+ V G ++ V + D+ +Y S G+P++I+ + G+PV+T D + E
Sbjct: 242 IRERVYTLGHVDEIKNVYSIGDVFVYNSYL--DGYPNVILESQMAGVPVVTTDQEGMSEQ 299
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+ + +FF +NP L+ S + N ++
Sbjct: 300 IDDFETGLFFSPNNPNNLNGKISTLLDNPEV 330
>gi|336436590|ref|ZP_08616302.1| hypothetical protein HMPREF0988_01887 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336007455|gb|EGN37480.1| hypothetical protein HMPREF0988_01887 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 407
Score = 40.4 bits (93), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 459 LCGNSTDGY-NDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
+CG DG+ + ++ + LGL EH+VR +G+ DV +L AD+ ++ S + G
Sbjct: 246 ICG---DGFFREKIERMIQTLGL-EHNVRMFGYCMDVREILGCADVSVFPSRREGLGMAG 301
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDN 554
L A+ IP++ + +EY+ G F + D+
Sbjct: 302 L--EALAMEIPLLAAENRGTREYLKPGINGYFCESDS 336
>gi|159901116|ref|YP_001547363.1| group 1 glycosyl transferase [Herpetosiphon aurantiacus DSM 785]
gi|159894155|gb|ABX07235.1| glycosyl transferase group 1 [Herpetosiphon aurantiacus DSM 785]
Length = 371
Score = 40.4 bits (93), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/92 (25%), Positives = 45/92 (48%), Gaps = 1/92 (1%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ G T + + + A +LG+ EH V G+ D + L++ +Y + +GF
Sbjct: 223 ILALGGKTGWLTEQITQQAQQLGISEH-VAFLGYVADDDLPALLSGATIYLLPSLYEGFG 281
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVI 548
++ AM+ G+PVIT + + E + A ++
Sbjct: 282 MTVLEAMSSGVPVITSNVSSLPEVAGDAALLV 313
>gi|170781059|ref|YP_001709391.1| glycosyl transferase [Clavibacter michiganensis subsp. sepedonicus]
gi|169155627|emb|CAQ00745.1| putative glycosyl transferase [Clavibacter michiganensis subsp.
sepedonicus]
Length = 379
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 457 VFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
+ + G +T G + L +VA G+ VR G D + + + ++ + +GF
Sbjct: 233 LLIVGPATWGDVE-LAQVADEAGVDPSRVRSLGSLTDADLAVALDRATVFVHPSLSEGFG 291
Query: 517 SLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEG----LSRAFSLFISN 568
+V A++FG PV+ D P + E A+ V+ +++P+G L+ A +S+
Sbjct: 292 LPVVEALSFGTPVVHSDAPALLEVAADAGVVV--PREDPDGYPLRLAEAIGGLLSD 345
>gi|448239418|ref|YP_007403476.1| glycosyl transferase group 1 [Geobacillus sp. GHH01]
gi|445208260|gb|AGE23725.1| glycosyl transferase group 1 [Geobacillus sp. GHH01]
Length = 359
Score = 40.0 bits (92), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 472 QEVASRLGLLEHSVRHYGF---NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+E+ +RLG+ SV + N V + DI S++ + F V AM G+P
Sbjct: 220 EELCARLGI--QSVTTFAGKVPNEQVPLYINQMDIFAVPSTEDSESFGVAAVEAMACGVP 277
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V+ + + E V EG + K++PE L+ AF + + +L +
Sbjct: 278 VVVSNVGGLPEVVREGTTGLIVPKNSPEKLAEAFERLLLDERLRQ 322
>gi|257052202|ref|YP_003130035.1| glycosyl transferase group 1 [Halorhabdus utahensis DSM 12940]
gi|256690965|gb|ACV11302.1| glycosyl transferase group 1 [Halorhabdus utahensis DSM 12940]
Length = 371
Score = 40.0 bits (92), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 59/128 (46%), Gaps = 7/128 (5%)
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVN 495
D LL A+ S + K V L S L+++A+ LG+ E SV G+ D
Sbjct: 216 DFSTLLHAVAKFKSDRVNTKLVLLSEGSE---TPNLKKLAANLGI-EQSVSFEGYVSDPY 271
Query: 496 GVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI-IKEYVAEGAQVIFFQKDN 554
AD+ + S +GF ++IV AM G PV+ D P+ E +++G +
Sbjct: 272 PYFSAADLFVLSSRW--EGFGNVIVEAMMVGTPVVATDCPVGPAEVLSDGEFGKLAPVGD 329
Query: 555 PEGLSRAF 562
P+GL+ A
Sbjct: 330 PDGLAHAI 337
>gi|357403451|ref|YP_004915375.1| exopolysaccharide biosynthesis glycosyltransferase
[Methylomicrobium alcaliphilum 20Z]
gi|351716116|emb|CCE21767.1| putative exopolysaccharide biosynthesis glycosyltransferase
[Methylomicrobium alcaliphilum 20Z]
Length = 369
Score = 40.0 bits (92), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 49/101 (48%), Gaps = 3/101 (2%)
Query: 471 LQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVI 530
+ ++A GL E V+ G DV+ +L ADI S+ +G P I+ AM G+PVI
Sbjct: 235 MTDLARDYGLAER-VKFLGARNDVDTLLEKADIFCLISNW--EGLPLTILEAMRAGLPVI 291
Query: 531 TPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
+ E V +G I +D+ L++A + + + L
Sbjct: 292 ASRVGGVPEAVQDGKTGILVDRDDEGALAQAITSLVESESL 332
>gi|440718179|ref|ZP_20898643.1| glycosyl transferase group 1 [Rhodopirellula baltica SWK14]
gi|436436721|gb|ELP30441.1| glycosyl transferase group 1 [Rhodopirellula baltica SWK14]
Length = 416
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 60/148 (40%), Gaps = 16/148 (10%)
Query: 464 TDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIV----LYGSSQVEQGFPSLI 519
T+ Y + S LG+ H V GF DV G L D + LYG +G P ++
Sbjct: 269 TEAYRQQIDAQISDLGI-GHLVDWVGFTQDVPGELARLDAMVLPSLYG-----EGLPMVV 322
Query: 520 VRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GKLSKFA 575
+ +M G PVI E + G + + + + E L+ +S +S FA
Sbjct: 323 LESMAAGTPVIATSVEGTPEAIRHGIEGVLAEPRDAESLAEQIQAMVSGQYDWQSMSDFA 382
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILE 603
V + + + +A Y RIL+
Sbjct: 383 --VTRHHQCFSDHTMAHGVAKVYRRILD 408
>gi|448399468|ref|ZP_21570757.1| glycosyltransferase [Haloterrigena limicola JCM 13563]
gi|445668836|gb|ELZ21458.1| glycosyltransferase [Haloterrigena limicola JCM 13563]
Length = 377
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 477 RLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPI 536
+LGL ++ G+ V AD L+ S V +GF +++ AM +PVI + P
Sbjct: 249 KLGL-NDNITITGYTSSVEDYYAAAD--LFVLSSVLEGFGIVVIEAMASELPVIGTNIPA 305
Query: 537 IKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL 571
I+E + + + ++P LS+A +S+ +L
Sbjct: 306 IEEIIDQNESGLLVSPESPSELSKAILKLLSSQRL 340
>gi|423367674|ref|ZP_17345106.1| hypothetical protein IC3_02775 [Bacillus cereus VD142]
gi|401084224|gb|EJP92474.1| hypothetical protein IC3_02775 [Bacillus cereus VD142]
Length = 385
Score = 40.0 bits (92), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 27/117 (23%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++A++LG+ + V G+ D+ +L M+D+ + +S +G P I+ AM G+P
Sbjct: 242 EECKKIATQLGV-SNMVHFLGYRNDIASLLQMSDLAV--ASSYREGLPVNIMEAMACGLP 298
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS----KFARTVASA 581
VI D +E + +D+ + + + N KL + RT+ ++
Sbjct: 299 VIATDNRGHRELIISNKNGWIIDRDDIKTMPERINTISKNTKLQAQFGQCGRTIITS 355
>gi|297531397|ref|YP_003672672.1| glycosyl transferase family protein [Geobacillus sp. C56-T3]
gi|297254649|gb|ADI28095.1| glycosyl transferase group 1 [Geobacillus sp. C56-T3]
Length = 360
Score = 40.0 bits (92), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 472 QEVASRLGLLEHSVRHYGF---NGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+E+ +RLG+ SV + N V + DI S++ + F V AM G+P
Sbjct: 220 EELCARLGI--QSVTTFAGKVPNEQVPLYINQMDIFAVPSTEDSESFGVAAVEAMACGVP 277
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V+ + + E V EG + K++PE L+ AF + + +L +
Sbjct: 278 VVVSNVGGLPEVVREGTTGLIVPKNSPEKLAEAFERLLLDERLRQ 322
>gi|420262649|ref|ZP_14765290.1| glycosyl transferase family protein [Enterococcus sp. C1]
gi|394770406|gb|EJF50210.1| glycosyl transferase family protein [Enterococcus sp. C1]
Length = 359
Score = 40.0 bits (92), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 8/140 (5%)
Query: 436 DVGPLLIKYARRNSVEGSFKFVFLCGNSTDG-YNDALQEVASRLGLLEHSVRHYGFNGDV 494
D G L I A R S + KF+ + DG + LQE S + H + +G +V
Sbjct: 197 DKGILDICEAARKSDVENIKFLIV----GDGPLKEELQEEYST-EISNHKLLFFGKTSNV 251
Query: 495 NGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDN 554
+L +D+ L+G+ + + ++ AMT G+PV+ + E V E F ++
Sbjct: 252 LSLLCASDVFLFGTYH--ENLSNALLEAMTVGLPVVVTNVGGNMEVVEENYNGYFISPND 309
Query: 555 PEGLSRAFSLFISNGKLSKF 574
P+ ++ + N KL K+
Sbjct: 310 PDDINAKINSLNENKKLCKY 329
>gi|237653931|ref|YP_002890245.1| sugar transferase [Thauera sp. MZ1T]
gi|237625178|gb|ACR01868.1| sugar transferase, PEP-CTERM/EpsH1 system associated [Thauera sp.
MZ1T]
Length = 388
Score = 40.0 bits (92), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 62/144 (43%), Gaps = 4/144 (2%)
Query: 463 STDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRA 522
+ DG A E R LE +V G DV V+ D+ S +G + ++ A
Sbjct: 243 AGDGPLRATLEAEIRSSGLEDAVWLAGERADVPAVMRALDVFALPSRA--EGISNTVLEA 300
Query: 523 MTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAG 582
M G+PV+ E V G +NP G++ A + +N L + +A+
Sbjct: 301 MASGLPVVATAVGGNAELVVPGETGALVAAENPRGMAEALLCYAANAALRQ-KHGLAARR 359
Query: 583 RLHAKNMLALDCVTRYARILENVL 606
R+ A+ + V RYA + E++L
Sbjct: 360 RVEAEFSIE-RMVARYAELYESLL 382
>gi|212212703|ref|YP_002303639.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
CbuG_Q212]
gi|212011113|gb|ACJ18494.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
CbuG_Q212]
Length = 377
Score = 40.0 bits (92), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 52/236 (22%), Positives = 93/236 (39%), Gaps = 41/236 (17%)
Query: 347 WKSVFSRVNVIVF----PDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLR 402
+K VF N V D T + +LD N +IPGS + A Y +
Sbjct: 139 YKLVFQHKNTCVIFQNEDDRTQFLQTKLLDPNNSVLIPGSGVSMSAF---------YPME 189
Query: 403 KENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGN 462
+ G + V++ S +++ ++ A K ++ ++ F V
Sbjct: 190 EMPGVPR-----VILASRLLWDKGVGEFVEAA--------KILKQKKIDACFILV----- 231
Query: 463 STDGYNDALQEVASRLGLLEH-----SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
G D+ A LEH + +G + ++ ++ A+IV S + +G P
Sbjct: 232 ---GGIDSENPAAINQKQLEHWESEGLIEWWGESTEMLAIMHRANIVCLPSYR--EGLPR 286
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
++V A G ++T D P ++ V +G + N E L+ A + I N +L K
Sbjct: 287 VLVEAAASGRAIVTTDVPGCRDVVCDGENGLLVPVKNSEELASAIEILIQNPELRK 342
>gi|120609897|ref|YP_969575.1| group 1 glycosyl transferase [Acidovorax citrulli AAC00-1]
gi|120588361|gb|ABM31801.1| glycosyl transferase, group 1 [Acidovorax citrulli AAC00-1]
Length = 373
Score = 40.0 bits (92), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 53/121 (43%), Gaps = 26/121 (21%)
Query: 452 GSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGV----------LLMA 501
G L G+ D A + +A+RLGL + SV F GDV+ V +LM+
Sbjct: 222 GQHGSATLLGDGPD--RPAHEALAARLGL-QSSV---DFKGDVDDVPAQLARHGIFVLMS 275
Query: 502 DIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRA 561
D +G P ++ AM G+ ++ D P ++E + E + D PE L+ A
Sbjct: 276 D---------HEGLPVSLIEAMRAGMAIVASDLPGVRELLPEPGHALLVPAD-PEALAAA 325
Query: 562 F 562
Sbjct: 326 L 326
>gi|317152076|ref|YP_004120124.1| group 1 glycosyl transferase [Desulfovibrio aespoeensis Aspo-2]
gi|316942327|gb|ADU61378.1| glycosyl transferase group 1 [Desulfovibrio aespoeensis Aspo-2]
Length = 386
Score = 40.0 bits (92), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 55/134 (41%), Gaps = 4/134 (2%)
Query: 458 FLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
L G D N AL + GL VR G D+ ++ D+++ SS V + FP+
Sbjct: 235 VLAGRGADHGNAALAGWLAEAGLAPERVRLLGERADMERLMAAMDVLV--SSSVGESFPN 292
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
++ AM G+P + D VA V+ D P L RA G + A
Sbjct: 293 VVGEAMACGVPCVVTDVGDSASLVASTGLVV-PPADAPA-LGRAMLDMAGLGPTGRAALG 350
Query: 578 VASAGRLHAKNMLA 591
A+ R+ A LA
Sbjct: 351 QAARSRVMAGFSLA 364
>gi|161829796|ref|YP_001596863.1| glycoside hydrolase family protein [Coxiella burnetii RSA 331]
gi|161761663|gb|ABX77305.1| glycosyl transferase, group 1 family protein [Coxiella burnetii RSA
331]
Length = 380
Score = 40.0 bits (92), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 52/236 (22%), Positives = 93/236 (39%), Gaps = 41/236 (17%)
Query: 347 WKSVFSRVNVIVF----PDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLR 402
+K VF N V D T + +LD N +IPGS + A Y +
Sbjct: 142 YKLVFQHKNTCVIFQNEDDRTQFLQTKLLDPNNSVLIPGSGVSMSAF---------YPME 192
Query: 403 KENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGN 462
+ G + V++ S +++ ++ A K ++ ++ F V
Sbjct: 193 EMPGVPR-----VILASRLLWDKGVGEFVEAA--------KILKQKKIDACFILV----- 234
Query: 463 STDGYNDALQEVASRLGLLEH-----SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
G D+ A LEH + +G + ++ ++ A+IV S + +G P
Sbjct: 235 ---GGIDSENPAAINQKQLEHWESEGLIEWWGESTEMLAIMHRANIVCLPSYR--EGLPR 289
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
++V A G ++T D P ++ V +G + N E L+ A + I N +L K
Sbjct: 290 VLVEAAASGRAIVTTDVPGCRDVVCDGENGLLVPVKNSEELASAIEILIQNPELRK 345
>gi|359687344|ref|ZP_09257345.1| glycosyltransferase [Leptospira licerasiae serovar Varillal str.
MMD0835]
gi|418756842|ref|ZP_13313030.1| glycosyltransferase, group 1 family protein [Leptospira licerasiae
serovar Varillal str. VAR 010]
gi|384116513|gb|EIE02770.1| glycosyltransferase, group 1 family protein [Leptospira licerasiae
serovar Varillal str. VAR 010]
Length = 407
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 46/214 (21%), Positives = 89/214 (41%), Gaps = 20/214 (9%)
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGS 453
K+++ LRKE KDE+++ + A+ H L+ + +
Sbjct: 200 KANDTSYLRKEFNLSKDELIIGNIA------------ALVDHKDQKTLLDALSKVETDKK 247
Query: 454 FKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
+K V + G L+ +AS LL+ + GF D++ +L + DI S E+
Sbjct: 248 YK-VLIVGEGE--LRKELERIASEKKLLDKVI-FTGFRTDISELLSIFDIFTLTSK--EE 301
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-- 571
G + ++ AM G+P++ + I E + EG + + L+ ++ + + K+
Sbjct: 302 GLGTSVLDAMASGLPIVATNGGGIAEMLTEGKGAFISEVGDAVSLASSYKTLLEDPKIRR 361
Query: 572 SKFARTVASAGRLHAKNMLALDCVTRYARILENV 605
S A S R KN + + Y+ + E +
Sbjct: 362 SMGAFNKESVKRFSVKNTIKKTELAYYSLLGEEI 395
>gi|258406152|ref|YP_003198894.1| group 1 glycosyl transferase [Desulfohalobium retbaense DSM 5692]
gi|257798379|gb|ACV69316.1| glycosyl transferase group 1 [Desulfohalobium retbaense DSM 5692]
Length = 355
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 458 FLCGN-STDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFP 516
F+C T + + LQ++AS LGL H + H GF DV L ADI L S + +G P
Sbjct: 205 FVCHILGTGPFENQLQDLASELGLDHHLIWH-GFQPDVRPFLRQADIFLLPSYR--EGLP 261
Query: 517 SLIVRAMTFGI 527
+ ++ AM G+
Sbjct: 262 NTLLEAMAEGL 272
>gi|154707211|ref|YP_001424289.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
Dugway 5J108-111]
gi|154356497|gb|ABS77959.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
Dugway 5J108-111]
Length = 377
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 52/236 (22%), Positives = 93/236 (39%), Gaps = 41/236 (17%)
Query: 347 WKSVFSRVNVIVF----PDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLR 402
+K VF N V D T + +LD N +IPGS + A Y +
Sbjct: 139 YKLVFQHKNTCVIFQNEDDRTQFLQTKLLDPNNSVLIPGSGVSMSAF---------YPME 189
Query: 403 KENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGN 462
+ G + V++ S +++ ++ A K ++ ++ F V
Sbjct: 190 EMPGVPR-----VILASRLLWDKGVGEFVEAA--------KILKQKKIDACFILV----- 231
Query: 463 STDGYNDALQEVASRLGLLEH-----SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPS 517
G D+ A LEH + +G + ++ ++ A+IV S + +G P
Sbjct: 232 ---GGIDSENPAAINQKQLEHWESEGLIEWWGESTEMLAIMHRANIVCLPSYR--EGLPR 286
Query: 518 LIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
++V A G ++T D P ++ V +G + N E L+ A + I N +L K
Sbjct: 287 VLVEAAASGRAIVTTDVPGCRDVVCDGENGLLVPVKNSEELASAIEILIQNPELRK 342
>gi|354568825|ref|ZP_08987987.1| glycosyl transferase group 1 [Fischerella sp. JSC-11]
gi|353539630|gb|EHC09114.1| glycosyl transferase group 1 [Fischerella sp. JSC-11]
Length = 384
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 65/137 (47%), Gaps = 10/137 (7%)
Query: 473 EVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITP 532
++A+ LGL E V G DV ++ D +++ S F +++ AM G+PVIT
Sbjct: 254 QMAASLGLNER-VHFLGLCRDVPELMRAVDFLVFPSRY--DPFGLVVIEAMACGLPVITA 310
Query: 533 DFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN----GKLSKFARTVASAGRLHAKN 588
E V A ++ ++ E L +A S S+ ++ K ART+A H+
Sbjct: 311 VTTGAAELVQPEAGIVLSDPNDTEALIQALSSLTSDRTLRSQMGKAARTIAEQ---HSWQ 367
Query: 589 MLALDCVTRYARILENV 605
++A V + +++++
Sbjct: 368 LMAKSYVDLFEELVKSI 384
>gi|188995104|ref|YP_001929356.1| hypothetical protein PGN_1240 [Porphyromonas gingivalis ATCC 33277]
gi|188594784|dbj|BAG33759.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC
33277]
Length = 383
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 74/173 (42%), Gaps = 31/173 (17%)
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVE-- 451
+ H+K QLR G + +VVG F + PL+I A ++ +E
Sbjct: 189 EDHQK-QLRHRLGLPVSGKIFLVVGRLFSRKD-------------PLIIIRAFKSLLESQ 234
Query: 452 ---GSFKFVFLCGNSTDG-YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYG 507
G F FL DG +DA Q+ +G H++R GF +V+ ++ AD +
Sbjct: 235 KLSGDFYLFFL----GDGPLSDACQK---EIGDKTHNIRLLGFQKNVHEYMMAADYAITA 287
Query: 508 SSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQ--VIFFQKDNPEGL 558
S +GF + A+ G PV+ D P +E G Q +FF N + L
Sbjct: 288 SH--SEGFGLNYLEAIACGKPVVATDLPPFREICGTGFQDNELFFSVGNQKEL 338
>gi|424788649|ref|ZP_18215399.1| glycosyl transferases group 1 family protein [Streptococcus
intermedius BA1]
gi|422112429|gb|EKU16216.1| glycosyl transferases group 1 family protein [Streptococcus
intermedius BA1]
Length = 377
Score = 40.0 bits (92), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 459 LCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSL 518
+CG T + L ++ + GL E+ VR G+ DV+ +L +DI ++ S +G P
Sbjct: 238 ICGEGT--LKEELTQLIKQYGL-ENQVRLLGYRDDVHRLLAESDIFVFPS--FREGLPVS 292
Query: 519 IVRAMTFGIPVITPD 533
++ AM G+PV+ D
Sbjct: 293 VMEAMATGLPVLASD 307
>gi|392411148|ref|YP_006447755.1| glycosyltransferase [Desulfomonile tiedjei DSM 6799]
gi|390624284|gb|AFM25491.1| glycosyltransferase [Desulfomonile tiedjei DSM 6799]
Length = 396
Score = 40.0 bits (92), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 5/112 (4%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
L+ VR G DV ++ +D+ L S + +G P + + A G+PV+ P + E
Sbjct: 258 LDEFVRFLGLRDDVLDLMTKSDMFLLPS--LHEGLPVVALEAQAAGLPVVGSRIPGLSEA 315
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLAL 592
V EG + + +G++ + + N A +A AGR + +L
Sbjct: 316 VEEGTTALLHDVHDTDGMADSVLKLLRN---PGSAEELARAGRERIQQQFSL 364
>gi|381396745|ref|ZP_09922159.1| glycosyl transferase group 1 [Microbacterium laevaniformans OR221]
gi|380775704|gb|EIC08994.1| glycosyl transferase group 1 [Microbacterium laevaniformans OR221]
Length = 373
Score = 40.0 bits (92), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 2/94 (2%)
Query: 462 NSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVN-GVLLMADIVLYGSSQVEQGFPSLIV 520
+ +G A+ E+A+ GL E V G D + +L + L S++ GFPS +V
Sbjct: 235 GAPEGDEPAIAELAAASGLAEGRVHVRGALEDADRAAVLGGAVALLAPSRL-AGFPSRVV 293
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDN 554
A+ G+PVI D +E V +G + D
Sbjct: 294 EALAVGVPVIAADSDTHREVVWDGGVLASASDDG 327
>gi|448666542|ref|ZP_21685187.1| group 1 glycosyl transferase [Haloarcula amylolytica JCM 13557]
gi|445771673|gb|EMA22729.1| group 1 glycosyl transferase [Haloarcula amylolytica JCM 13557]
Length = 242
Score = 40.0 bits (92), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 481 LEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEY 540
+ SV Y F D++ + +A + + +G P ++ AM G+P + + + E
Sbjct: 121 ISESVTVYNFVEDISEMYDIASV--FCRPSYSEGLPLTMLEAMASGVPPVVTEIAGVPEV 178
Query: 541 VAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V +G + + P+ + R S+ N +L K
Sbjct: 179 VTDGESGVLLEPGRPDEIEREISILFKNNELRK 211
>gi|407986413|ref|ZP_11166956.1| glycosyl transferases group 1 family protein [Mycobacterium
hassiacum DSM 44199]
gi|407372019|gb|EKF21092.1| glycosyl transferases group 1 family protein [Mycobacterium
hassiacum DSM 44199]
Length = 364
Score = 40.0 bits (92), Expect = 7.6, Method: Composition-based stats.
Identities = 35/145 (24%), Positives = 64/145 (44%), Gaps = 6/145 (4%)
Query: 464 TDGYNDALQEVASRLGLLEHSVRHYGF--NGDVNGVLLMADIVLYGSSQVEQGFPSLIVR 521
TD Y A A+ LG+ + G+ + ++ A +V+ +Q ++V+
Sbjct: 214 TDAYRTARITQAAELGVADSVTFDGGYYTGARLTELICQAAVVVLPYDSTDQVTSGVLVQ 273
Query: 522 AMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASA 581
A+ G PV+ FP +E + GA I D+P L+ A ++ +L A +A+A
Sbjct: 274 AVAHGRPVVATAFPHARELLGGGAGTI-VDHDDPVALATALRQILTRPRL---AGDMAAA 329
Query: 582 GRLHAKNMLALDCVTRYARILENVL 606
R A + + T Y ++ +L
Sbjct: 330 ARDLAPQLAWSEVATDYLGVVRRLL 354
>gi|89069354|ref|ZP_01156713.1| Glycosyl transferase group 1 [Oceanicola granulosus HTCC2516]
gi|89045121|gb|EAR51192.1| Glycosyl transferase group 1 [Oceanicola granulosus HTCC2516]
Length = 368
Score = 40.0 bits (92), Expect = 7.6, Method: Composition-based stats.
Identities = 37/136 (27%), Positives = 61/136 (44%), Gaps = 9/136 (6%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
D L +A ++GL E V G+ + V+ AD Y S+ +GFP+ +V AM G P
Sbjct: 238 DRLAALARKMGLAER-VHMPGYLDNPQAVVARADA--YVSASRSEGFPNALVEAMALGRP 294
Query: 529 VITPDFPIIKEYVAEGAQV--IFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHA 586
VI D + +G + D+ + L+RA L ++ ++A A
Sbjct: 295 VIATDCASGPREILQGPPPAGLLVPVDDVDALARALDLV----NEAETRHSLAGRASDRA 350
Query: 587 KNMLALDCVTRYARIL 602
++ V+ YAR+L
Sbjct: 351 QDFRPGPVVSAYARVL 366
>gi|410462088|ref|ZP_11315693.1| glycosyltransferase [Desulfovibrio magneticus str. Maddingley
MBC34]
gi|409984774|gb|EKO41058.1| glycosyltransferase [Desulfovibrio magneticus str. Maddingley
MBC34]
Length = 415
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 60/116 (51%), Gaps = 8/116 (6%)
Query: 470 ALQEVASRLGLLEHSVRHYGFNG--DVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGI 527
AL E A+ LG+ +R G+ G + +L AD+ ++ S ++G P+ ++ AM G+
Sbjct: 279 ALTEQAAALGV-SARLRFAGWAGREAMPELLRRADLFVFPSR--DEGMPNAVLEAMASGL 335
Query: 528 PVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGR 583
PV+ +E V +G + D+P+ L+ A + +S+ L R + +AGR
Sbjct: 336 PVVATRISGNEEVVIDGETGLLVPPDDPDALAGALARCLSDVALR---RRMGAAGR 388
>gi|154247726|ref|YP_001418684.1| group 1 glycosyl transferase [Xanthobacter autotrophicus Py2]
gi|154161811|gb|ABS69027.1| glycosyl transferase group 1 [Xanthobacter autotrophicus Py2]
Length = 466
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 497 VLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPE 556
VL D+++ S+ E G P +++A+ PV+ D P + E++ EG F + + +
Sbjct: 354 VLSQVDVLVLPSTWFENG-PLTLLKALATHTPVVVSDVPGMTEFIQEGIDGFAFPRGDVD 412
Query: 557 GLSRAFSLFISNGKLSKFARTVASAGRLHAKNMLALDCVTRYAR 600
L+ F+ L++ R A+ + +AL+ + YAR
Sbjct: 413 ALAAVLRRFVEAPDLAR--RMSAATAYPRTERAMALEVLDLYAR 454
>gi|302392897|ref|YP_003828717.1| group 1 glycosyl transferase [Acetohalobium arabaticum DSM 5501]
gi|302204974|gb|ADL13652.1| glycosyl transferase group 1 [Acetohalobium arabaticum DSM 5501]
Length = 364
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 3/108 (2%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ +++ LG+ E+ + GF DV ++ D +L+ + + +GF +I AM G P
Sbjct: 236 NKIKKQVKSLGVEEYII-FTGFRSDVYNIMEQIDFLLH--TALWEGFGFVIAEAMAVGKP 292
Query: 529 VITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFAR 576
V++ D I E + +G + NP ++ I+ K K R
Sbjct: 293 VVSTDVSNISEIMVDGQTGYLAESKNPADIAEKTIKMINTTKKEKMGR 340
>gi|334144919|ref|YP_004538128.1| group 1 glycosyl transferase [Novosphingobium sp. PP1Y]
gi|333936802|emb|CCA90161.1| glycosyl transferase, group 1 [Novosphingobium sp. PP1Y]
Length = 396
Score = 40.0 bits (92), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
DAL +A+ LG+ E VR G+ ++ +L AD L S +G P +IV A+ G+P
Sbjct: 242 DALVRLATELGIAER-VRFPGYLASIDPLLADADAFLLSSDY--EGLPGVIVEALAAGLP 298
Query: 529 VITPD 533
V+ D
Sbjct: 299 VLATD 303
>gi|29654169|ref|NP_819861.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
RSA 493]
gi|29541435|gb|AAO90375.1| alpha-D-QuiNAc alpha-1,3-galactosyltransferase [Coxiella burnetii
RSA 493]
Length = 377
Score = 40.0 bits (92), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 53/234 (22%), Positives = 91/234 (38%), Gaps = 37/234 (15%)
Query: 347 WKSVFSRVNVIVF----PDYTLPMLYSVLDAGNFFVIPGSPADVWAVEAYSKSHEKYQLR 402
+K VF N V D T + +LD N +IPGS + A Y +
Sbjct: 139 YKLVFQHKNTCVIFQNEDDRTQFLQTKLLDPNNSVLIPGSGVSMSAF---------YPME 189
Query: 403 KENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGN 462
+ G V + L WD V ++K + ++ F+ + G
Sbjct: 190 EMPG----------VPRVILASRLLWDKGVGEFVEAAKILKQKKIDAC-----FILVGGI 234
Query: 463 STD---GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLI 519
++ N E GL+E +G + ++ ++ A+IV S + +G P ++
Sbjct: 235 DSENPAAINQKQLEYWESEGLIEW----WGESTEMLAIMHRANIVCLPSYR--EGLPRVL 288
Query: 520 VRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
V A G ++T D P ++ V +G + N E L+ A + I N +L K
Sbjct: 289 VEAAASGRAIVTTDVPGCRDVVCDGENGLLVPVKNSEELASAIEILIQNPELRK 342
>gi|213155472|ref|YP_002317517.1| group 1 glycosyl transferase [Acinetobacter baumannii AB0057]
gi|301348158|ref|ZP_07228899.1| glycosyl transferase, group 1 [Acinetobacter baumannii AB056]
gi|301597504|ref|ZP_07242512.1| glycosyl transferase, group 1 [Acinetobacter baumannii AB059]
gi|213054632|gb|ACJ39534.1| glycosyl transferase, group 1 [Acinetobacter baumannii AB0057]
Length = 360
Score = 40.0 bits (92), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 41/89 (46%), Gaps = 2/89 (2%)
Query: 485 VRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEG 544
V +G D+ L A +V+ S + +G P +++ A G PV+T D P +E + EG
Sbjct: 237 VEWWGHQSDMAETLSKATVVVLPSYR--EGMPKVLLEAQALGRPVVTTDVPGCREAIEEG 294
Query: 545 AQVIFFQKDNPEGLSRAFSLFISNGKLSK 573
+ + L+ A ISN L +
Sbjct: 295 VTGFLAEVKDENSLATAIEKLISNDSLCE 323
>gi|410666653|ref|YP_006919024.1| glycosyl transferase family protein [Thermacetogenium phaeum DSM
12270]
gi|409104400|gb|AFV10525.1| glycosyl transferase [Thermacetogenium phaeum DSM 12270]
Length = 380
Score = 39.7 bits (91), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 49/219 (22%), Positives = 90/219 (41%), Gaps = 26/219 (11%)
Query: 392 YSKSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYAR---RN 448
Y E+ +LR++ G +E + + VG E DY + L++K R R
Sbjct: 175 YIDLKERIKLRQQLGISYNEFLWLAVGRL----EEQKDYPTLLDAFRILILKNVRVQLRI 230
Query: 449 SVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGS 508
+ +G V G +D + G D+ +L AD + S
Sbjct: 231 AGQGPLLEVLKRQAVDSGLSD--------------RIVFLGLRRDIPSLLNAADGFVLSS 276
Query: 509 SQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
+ +G P++++ AM PV+ + + E V +G +PE L+ A +S
Sbjct: 277 AW--EGLPNVVMEAMAACKPVVATNVGGVPELVEDGVSGYIVPPHDPEALAAAMLKIMSL 334
Query: 569 GKLSKFARTVASAGRLHAKNMLALD-CVTRYARILENVL 606
+ + R + AGR H + +LD V R+ ++ +++L
Sbjct: 335 SEDER--RAMGRAGRAHIEAKYSLDQVVDRWEKLYQDLL 371
>gi|218960752|ref|YP_001740527.1| putative glycosyl transferase [Candidatus Cloacamonas
acidaminovorans]
gi|167729409|emb|CAO80320.1| putative glycosyl transferase [Candidatus Cloacamonas
acidaminovorans str. Evry]
Length = 375
Score = 39.7 bits (91), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 46/108 (42%), Gaps = 6/108 (5%)
Query: 464 TDGYNDALQE---VASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIV 520
T G D L E +A LGL H + GF +V L DI + S +G + ++
Sbjct: 237 TVGGGDLLSEMKNIAQELGLQGH-IAFTGFQKEVGHFLKAFDIFVLASYL--EGLGTSVL 293
Query: 521 RAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
AM+ G+PV+ I E + G + NP LS+A N
Sbjct: 294 EAMSIGLPVVGTKAGGITEMIISGENGLLVPPQNPSELSKAILYLAQN 341
>gi|423375180|ref|ZP_17352517.1| hypothetical protein IC5_04233 [Bacillus cereus AND1407]
gi|401092759|gb|EJQ00883.1| hypothetical protein IC5_04233 [Bacillus cereus AND1407]
Length = 372
Score = 39.7 bits (91), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 68/153 (44%), Gaps = 19/153 (12%)
Query: 456 FVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGF 515
+ +CG D L+ +A +LG+ E+ V+ G+ D+ + ++D+ + S + +G
Sbjct: 231 YYVICGEGQ--LKDYLENLAKKLGV-ENQVKLLGYRTDIVEISKISDVFAFPS--IREGL 285
Query: 516 PSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFA 575
++ AM+ G+PVI D + + G Q D+ EG A + K+++
Sbjct: 286 SLALMEAMSSGLPVIASDIRGNSDLIHGGKGGYLVQPDDIEGFVIALN------KINQVD 339
Query: 576 RTVASAGRLHAKNMLALDCVTRYARILENVLNF 608
S G + K + D L+NVL+
Sbjct: 340 SLRYSMGEFNKKTVSEFD--------LDNVLSI 364
>gi|374594304|ref|ZP_09667309.1| glycosyl transferase group 1 [Gillisia limnaea DSM 15749]
gi|373872379|gb|EHQ04376.1| glycosyl transferase group 1 [Gillisia limnaea DSM 15749]
Length = 505
Score = 39.7 bits (91), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 6/140 (4%)
Query: 467 YNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFG 526
Y + ++ LGL E+ + G D + L D+ + S + +GFP ++ +M G
Sbjct: 358 YTNECLDLIDELGLQEN-FKFMGPRKDPQNIFLEGDLSILTS--ISEGFPYTVIESMGCG 414
Query: 527 IPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAGRLHA 586
IPV++ + +KE + E KD EG+ A + + + L K A L
Sbjct: 415 IPVVSTEVGGVKEALDESCGFTCKPKDT-EGIGNAVTKLLLDKDLRKSMGKNARERVL-- 471
Query: 587 KNMLALDCVTRYARILENVL 606
KN V+ Y + N++
Sbjct: 472 KNFTLNKFVSEYETVYNNLM 491
>gi|389581263|ref|ZP_10171290.1| glycosyltransferase [Desulfobacter postgatei 2ac9]
gi|389402898|gb|EIM65120.1| glycosyltransferase [Desulfobacter postgatei 2ac9]
Length = 348
Score = 39.7 bits (91), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 61/134 (45%), Gaps = 7/134 (5%)
Query: 466 GYNDALQEVASRLGLLEHSVRHYGFNGDVNGVL---LMADIVLYGSSQVEQGFPSLIVRA 522
G + A++E SR +E + H G + G + L++ LY +G P I+ A
Sbjct: 206 GGDGAIEETKSRAHKIEVGL-HVETLGWIAGDMKNKLLSQSTLYVLPSYNEGLPMSILEA 264
Query: 523 MTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLSKFARTVASAG 582
M G+P+++ I E V +G + + + E L++ + +S+ L + +AG
Sbjct: 265 MAAGLPIVSTKVGGIPEAVTDGVEGFLIEPGDVEALAKCIDVLLSDLDLR---HRMGAAG 321
Query: 583 RLHAKNMLALDCVT 596
R + + + D +
Sbjct: 322 RKKIQTVFSADIIV 335
>gi|418750274|ref|ZP_13306560.1| glycosyltransferase, group 1 family protein [Leptospira licerasiae
str. MMD4847]
gi|404272877|gb|EJZ40197.1| glycosyltransferase, group 1 family protein [Leptospira licerasiae
str. MMD4847]
Length = 375
Score = 39.7 bits (91), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 46/214 (21%), Positives = 89/214 (41%), Gaps = 20/214 (9%)
Query: 394 KSHEKYQLRKENGFLKDEIVVVVVGSSFFYNELSWDYAVAMHDVGPLLIKYARRNSVEGS 453
K+++ LRKE KDE+++ + A+ H L+ + +
Sbjct: 168 KANDTSYLRKEFNLSKDELIIGNIA------------ALVDHKDQKTLLDALSKVETDKK 215
Query: 454 FKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQ 513
+K V + G L+ +AS LL+ + GF D++ +L + DI S E+
Sbjct: 216 YK-VLIVGEGE--LRKELERIASEKKLLDKVI-FTGFRTDISELLSIFDIFTLTSK--EE 269
Query: 514 GFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKL-- 571
G + ++ AM G+P++ + I E + EG + + L+ ++ + + K+
Sbjct: 270 GLGTSVLDAMASGLPIVATNGGGIAEMLTEGKGAFISEVGDAVSLASSYKTLLEDPKIRR 329
Query: 572 SKFARTVASAGRLHAKNMLALDCVTRYARILENV 605
S A S R KN + + Y+ + E +
Sbjct: 330 SMGAFNKESVKRFSVKNTIKKTELAYYSLLGEEI 363
>gi|89095704|ref|ZP_01168598.1| glycosyltransferase group 1 family protein [Bacillus sp. NRRL
B-14911]
gi|89089450|gb|EAR68557.1| glycosyltransferase group 1 family protein [Bacillus sp. NRRL
B-14911]
Length = 365
Score = 39.7 bits (91), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 469 DALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIP 528
+ L++ A+R G+ E VR GF+ DV+ L ++D+ L S + FP +I+ A +P
Sbjct: 232 EELEKEAARKGIAE-KVRFLGFHDDVHSYLSLSDVKLLAS--YSESFPLVILEAARAHVP 288
Query: 529 VITPDFPIIKEYVAE 543
VI+ D +K+ +++
Sbjct: 289 VISTDVGGVKDLISD 303
>gi|404497026|ref|YP_006721132.1| glycosyltransferase, Cap1E-like family [Geobacter metallireducens
GS-15]
gi|78194633|gb|ABB32400.1| glycosyltransferase, Cap1E-like family [Geobacter metallireducens
GS-15]
Length = 390
Score = 39.7 bits (91), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 27/154 (17%)
Query: 424 NELSWDYAVAMHDVGPLLIKYARRNSVEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEH 483
N L D +H +G + ARR S EG ++ CG
Sbjct: 234 NALEPDAYAFLH-LGLVDADAARRISKEGMEQYAADCG---------------------- 270
Query: 484 SVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAE 543
V++ GF ++ ++ +DIV+ SS E G P ++ A+ G ++T D P +E V +
Sbjct: 271 -VKYLGFKDNIKDYMIASDIVVLPSSYRE-GTPRSLIEALALGKAIVTTDMPGCRETVID 328
Query: 544 GAQVIFFQKDNPEGLSRAFSLFISNGKLSKFART 577
G +K + + L L +G++ AR+
Sbjct: 329 GWNGYLCRKGDEQSL--VAKLLAVDGEMLTAARS 360
>gi|212225010|ref|YP_002308246.1| phosphatidylinositol glycantransferase-class A [Thermococcus
onnurineus NA1]
gi|212009967|gb|ACJ17349.1| phosphatidylinositol glycantransferase-class A [Thermococcus
onnurineus NA1]
Length = 373
Score = 39.7 bits (91), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 15/61 (24%), Positives = 33/61 (54%)
Query: 513 QGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISNGKLS 572
+G P++++ AM G+PV+ D P ++ + G + ++ +PE ++ + N KL
Sbjct: 281 EGLPTVVLEAMASGLPVVASDIPAHRDVIINGHNGLLSKRGSPESIAENVLTLLENEKLQ 340
Query: 573 K 573
+
Sbjct: 341 R 341
>gi|229523280|ref|ZP_04412687.1| hypothetical protein VIF_000138 [Vibrio cholerae TM 11079-80]
gi|229339643|gb|EEO04658.1| hypothetical protein VIF_000138 [Vibrio cholerae TM 11079-80]
Length = 351
Score = 39.7 bits (91), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 49/106 (46%), Gaps = 1/106 (0%)
Query: 463 STDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSSQVEQGFPSLIVRA 522
+ DG + Q +A +LGL SV G+ + + L+ +Y +GFP ++ A
Sbjct: 209 AGDGEIETYQAMAEQLGL-NGSVHCLGWIAGEDKLKLLTQTDIYCLPSYNEGFPMGVIEA 267
Query: 523 MTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFISN 568
M+ GIPV+ I + +++G Q + + L++A I
Sbjct: 268 MSAGIPVVASRAGGIPDAISDGEQGRLIEAGDVVALAQALGDLIEQ 313
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,003,059,968
Number of Sequences: 23463169
Number of extensions: 796813971
Number of successful extensions: 2180174
Number of sequences better than 100.0: 567
Number of HSP's better than 100.0 without gapping: 84
Number of HSP's successfully gapped in prelim test: 483
Number of HSP's that attempted gapping in prelim test: 2179541
Number of HSP's gapped (non-prelim): 652
length of query: 1106
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 952
effective length of database: 8,745,867,341
effective search space: 8326065708632
effective search space used: 8326065708632
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 83 (36.6 bits)