RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|255764474|ref|YP_003064854.2| glycosyl transferase group 1 [Candidatus Liberibacter asiaticus str. psy62] (352 letters) >gnl|CDD|132132 TIGR03088, stp2, sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Length = 374 Score = 102 bits (255), Expect = 2e-22 Identities = 62/217 (28%), Positives = 105/217 (48%), Gaps = 12/217 (5%) Query: 138 IMHGVDTERFRPTS-NKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINIL---- 192 I +GVDTERF P+ ++ ++++ ++G GR++ +K V ++ Sbjct: 165 IYNGVDTERFHPSRGDRSPILPPDFFADESVVVGTVGRLQAVKDQPTLVRAFALLVRQLP 224 Query: 193 PHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAP 252 V+VG + ++ + A GL + E+ + +AL++FV P Sbjct: 225 EGAERLRLVIVGDGPAR-----GACEQMVRAAGLAHLVWLPGERDDVPALMQALDLFVLP 279 Query: 253 PLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN 312 L EG T LEAMASG+PV+A+ G EL+ ++ G +VPP + AL +A+ +++ Sbjct: 280 SLAEGISNTILEAMASGLPVIATAVGGNPELV--QHGVTGALVPPGDAVALARALQPYVS 337 Query: 313 SKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLL 349 G GR RA + FSI + +YD+LL Sbjct: 338 DPAARRAHGAAGRARAEQQFSINAMVAAYAGLYDQLL 374 >gnl|CDD|132490 TIGR03449, mycothiol_MshA, UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species. Length = 405 Score = 89.8 bits (223), Expect = 1e-18 Identities = 59/228 (25%), Positives = 92/228 (40%), Gaps = 30/228 (13%) Query: 137 VIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP 196 V+ G D ERFRP ++ R L + D K++ GRI+ LK D+ + + +L P Sbjct: 191 VVAPGADLERFRP-GDRATERARLGLPLDTKVVAFVGRIQPLKAPDVLLRAVAELLDRDP 249 Query: 197 GWTAVVV--------GKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIE--DWYRAL 246 V+ G T L + G+ R+ F+ + E YRA Sbjct: 250 DRNLRVIVVGGPSGSGLATPDAL---IELAAEL---GIADRVRFLPPRPPEELVHVYRAA 303 Query: 247 NIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKA 306 ++ P E FGL +EA A G PVVA+ G + + + G++V + A Sbjct: 304 DVVAVPSYNESFGLVAMEAQACGTPVVAARVGGLPVAVA--DGETGLLVDGHDPADWADA 361 Query: 307 VLYFMNSKKIMSDTGNRGR--ERAVKH---FSIVKEASDIGKVYDRLL 349 + +++ D R R AV+H FS A + Y L Sbjct: 362 L------ARLLDDPRTRIRMGAAAVEHAAGFSWAATADGLLSSYRDAL 403 >gnl|CDD|178460 PLN02871, PLN02871, UDP-sulfoquinovose:DAG sulfoquinovosyltransferase. Length = 465 Score = 66.7 bits (163), Expect = 8e-12 Identities = 53/197 (26%), Positives = 81/197 (41%), Gaps = 23/197 (11%) Query: 141 GVDTERFRPTSNKQEARRHLKISE-DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWT 199 GVD+E F P +E R L E + LI GR+ K D M + PG Sbjct: 237 GVDSESFHPRFRSEEMRARLSGGEPEKPLIVYVGRLGAEKNLDFLKRVMERL----PGAR 292 Query: 200 AVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFI-----DEQSSIEDWYRALNIFVAPPL 254 VG +++ L++ G +F DE S Y + ++FV P Sbjct: 293 LAFVGDGP-----YREELEK--MFAGT--PTVFTGMLQGDELSQA---YASGDVFVMPSE 340 Query: 255 YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENA-KAGVIVPPRNLHALEKAVLYFMNS 313 E G LEAMASG+PVVA+ G +++ P+ K G + P ++ + + + Sbjct: 341 SETLGFVVLEAMASGVPVVAARAGGIPDIIPPDQEGKTGFLYTPGDVDDCVEKLETLLAD 400 Query: 314 KKIMSDTGNRGRERAVK 330 ++ G RE K Sbjct: 401 PELRERMGAAAREEVEK 417 >gnl|CDD|162731 TIGR02149, glgA_Coryne, glycogen synthase, Corynebacterium family. This model describes Corynebacterium glutamicum GlgA and closely related proteins in several other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related TIGR02095 family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor. Length = 388 Score = 64.1 bits (156), Expect = 5e-11 Identities = 63/254 (24%), Positives = 98/254 (38%), Gaps = 33/254 (12%) Query: 118 DEVITTSQKSARFIER--PST------VIMHGVDTERFRPTSNKQEARRHLKISEDAKLI 169 D VI S I + P VI +G+DT+ ++P + I I Sbjct: 146 DRVIAVSGGMREDILKYYPDLDPEKVHVIYNGIDTKEYKP-DDGNVVLDRYGIDRSRPYI 204 Query: 170 GCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLK--------HYLFKKNLQRRI 221 GRI + KG +D + I VV+ + R Sbjct: 205 LFVGRITRQKGVPHLLDAVHYIPKD----VQVVLCAGAPDTPEVAEEVRQAVALLDRNRT 260 Query: 222 YANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS 281 + K + + + + +FV P +YE G+ LEAMA G PVVAS TG Sbjct: 261 GIIWINKML----PKEELVELLSNAEVFVCPSIYEPLGIVNLEAMACGTPVVASATGGIP 316 Query: 282 ELLDPENAKAGVIVPPRN------LHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIV 335 E++ + + G +VPP N L KA+ + ++ G GR+RA + FS Sbjct: 317 EVV--VDGETGFLVPPDNSDADGFQAELAKAINILLADPELAKKMGIAGRKRAEEEFSWG 374 Query: 336 KEASDIGKVYDRLL 349 A ++Y ++L Sbjct: 375 SIAKKTVEMYRKVL 388 >gnl|CDD|182369 PRK10307, PRK10307, putative glycosyl transferase; Provisional. Length = 412 Score = 51.5 bits (124), Expect = 3e-07 Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 29/257 (11%) Query: 94 LKLVFTSPSQRNHSRWTRYLISRMDEVITTSQK-SARFIER---PSTVIM--HGVDTERF 147 L L+ R + + R L+ R D V T S+ + E+ VI + + RF Sbjct: 149 LGLLKGGKVARLATAFERSLLRRFDNVSTISRSMMNKAREKGVAAEKVIFFPNWSEVARF 208 Query: 148 RPTSNKQEA--RRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGK 205 +P ++ R L + + K++ G I + +G +L +D L P V+ G+ Sbjct: 209 QPVADADVDALRAQLGLPDGKKIVLYSGNIGEKQGLELVID-AARRLRDRPDLIFVICGQ 267 Query: 206 TTLKHYLFKKNLQRRIYANGLKKRILFIDEQSS--IEDWYRALNIFVAPPLYEGFGL--- 260 K L++ GL + F+ Q + + + + P L Sbjct: 268 GG-----GKARLEKMAQCRGLP-NVHFLPLQPYDRLPALLKMADCHLLPQKAGAADLVLP 321 Query: 261 TPLEAM-ASGIPVVAS---NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKI 316 + L M ASG VVA+ T + + GV V P ++ AL A+ + Sbjct: 322 SKLTNMLASGRNVVATAEPGTELGQLV-----EGIGVCVEPESVEALVAAIAALARQALL 376 Query: 317 MSDTGNRGRERAVKHFS 333 G RE A + Sbjct: 377 RPKLGTVAREYAERTLD 393 >gnl|CDD|131525 TIGR02472, sucr_P_syn_N, sucrose-phosphate synthase, putative, glycosyltransferase domain. This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (TIGR02471), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant SPS. At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase. Length = 439 Score = 51.3 bits (123), Expect = 4e-07 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 14/92 (15%) Query: 248 IFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNL----HAL 303 IFV P L E FGLT LEA A G+P+VA++ G +++ N + G++V +L AL Sbjct: 343 IFVNPALTEPFGLTLLEAAACGLPIVATDDGGPRDII--ANCRNGLLVDVLDLEAIASAL 400 Query: 304 EKAVLYFMNSK--KIMSDTGNRGRERAVKHFS 333 E A +S ++ S G G R H+S Sbjct: 401 EDA---LSDSSQWQLWSRNGIEGVRR---HYS 426 >gnl|CDD|185387 PRK15490, PRK15490, Vi polysaccharide biosynthesis protein TviE; Provisional. Length = 578 Score = 50.9 bits (121), Expect = 5e-07 Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 5/99 (5%) Query: 184 FVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWY 243 ++D L HHP V+VG L+ Q+R G+ +RILF+ + W Sbjct: 416 WIDFAARYLQHHPATRFVLVGDGDLR-----AEAQKRAEQLGILERILFVGASRDVGYWL 470 Query: 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSE 282 + +N+F+ YEG +EA G+PV+++ G +E Sbjct: 471 QKMNVFILFSRYEGLPNVLIEAQMVGVPVISTPAGGSAE 509 >gnl|CDD|162872 TIGR02470, sucr_synth, sucrose synthase. This model represents sucrose synthase, an enzyme that, despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyzes the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria. Length = 784 Score = 49.7 bits (119), Expect = 1e-06 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 3/64 (4%) Query: 248 IFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNL-HALEKA 306 IFV P LYE FGLT LEAM G+P A+ G E++ ++ +G + P + A EK Sbjct: 646 IFVQPALYEAFGLTVLEAMTCGLPTFATRFGGPLEII--QDGVSGFHIDPYHGEEAAEKI 703 Query: 307 VLYF 310 V +F Sbjct: 704 VDFF 707 >gnl|CDD|185381 PRK15484, PRK15484, lipopolysaccharide 1,2-N-acetylglucosaminetransferase; Provisional. Length = 380 Score = 49.4 bits (118), Expect = 1e-06 Identities = 40/151 (26%), Positives = 69/151 (45%), Gaps = 6/151 (3%) Query: 137 VIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP 196 ++ +G E ++ ++ + R+ L IS D ++ GRI KG L + + H Sbjct: 165 IVPNGFCLETYQ-SNPQPNLRQQLNISPDETVLLYAGRISPDKGILLLMQAFEKLATAHS 223 Query: 197 GWTAVVVGKTTLKHYLFKKNLQRRIY--ANGLKKRILFIDEQS--SIEDWYRALNIFVAP 252 VVVG T K Q+++ A + R + + Q + ++Y ++ V P Sbjct: 224 NLKLVVVGDPTASSKGEKAAYQKKVLEAAKRIGDRCIMLGGQPPEKMHNYYPLADLVVVP 283 Query: 253 PLY-EGFGLTPLEAMASGIPVVASNTGVFSE 282 E F + +EAMA+G PV+AS G +E Sbjct: 284 SQVEEAFCMVAVEAMAAGKPVLASTKGGITE 314 >gnl|CDD|177747 PLN00142, PLN00142, sucrose synthase. Length = 815 Score = 46.5 bits (111), Expect = 9e-06 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%) Query: 249 FVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRN 299 FV P LYE FGLT +EAM G+P A+ G +E++ + +G + P + Sbjct: 670 FVQPALYEAFGLTVVEAMTCGLPTFATCQGGPAEII--VDGVSGFHIDPYH 718 >gnl|CDD|162700 TIGR02095, glgA, glycogen/starch synthases, ADP-glucose type. This family consists of glycogen (or starch) synthases that use ADP-glucose (EC 2.4.1.21), rather than UDP-glucose (EC 2.4.1.11) as in animals, as the glucose donor. This enzyme is found in bacteria and plants. Whether the name given is glycogen synthase or starch synthase depends on context, and therefore on substrate. Length = 473 Score = 45.7 bits (109), Expect = 2e-05 Identities = 50/243 (20%), Positives = 78/243 (32%), Gaps = 65/243 (26%) Query: 121 ITTSQKS------ARFIERPSTVIMHGVDTERFRPTS------------------NKQEA 156 I T + + I++G+DTE + P + NK+ Sbjct: 220 ILTPEFGYGLDGVLKARSGKLRGILNGIDTEVWNPATDPYLKANYSADDLAGKAENKEAL 279 Query: 157 RRHLKISEDAK--LIGCFGRIRKLKGTDLFVDCMINILPHHPGWTA--VVVGK------T 206 + L + D L G R+ + KG DL ++ LP VV+G Sbjct: 280 QEELGLPVDDDVPLFGVISRLTQQKGVDL----LLAALPELLELGGQLVVLGTGDPELEE 335 Query: 207 TLKHYLFKKNLQRRI---YANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPL 263 L+ + R+ Y L I Y + + P +E GLT L Sbjct: 336 ALRELAERYPGNVRVIIGYDEALAHLI------------YAGADFILMPSRFEPCGLTQL 383 Query: 264 EAMASGIPVVASNTG-----VFSELLDPENAKA-GVIVPPRN----LHALEKAVLYFMNS 313 AM G + TG V DPE G + + L AL +A+ + Sbjct: 384 YAMRYGTVPIVRRTGGLADTVVD--GDPEAESGTGFLFEEYDPGALLAALSRALRLYRQD 441 Query: 314 KKI 316 + Sbjct: 442 PSL 444 >gnl|CDD|178527 PLN02939, PLN02939, transferase, transferring glycosyl groups. Length = 977 Score = 42.2 bits (99), Expect = 2e-04 Identities = 48/194 (24%), Positives = 80/194 (41%), Gaps = 45/194 (23%) Query: 122 TTSQKSARFIERPSTVIMHGVDTERFRP------------------TSNKQEARRHLKIS 163 T S +F+ I++G+DT+ + P +NK R+ L +S Sbjct: 719 TLKFHSKKFVG-----ILNGIDTDTWNPSTDRFLKVQYNANDLQGKAANKAALRKQLGLS 773 Query: 164 -EDAK--LIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRR 220 DA L+GC R+ KG L + G V++G + + H +QR Sbjct: 774 SADASQPLVGCITRLVPQKGVHLIRHAIYKTA--ELGGQFVLLGSSPVPH------IQRE 825 Query: 221 I--YANGLKKR-----ILFIDEQSSIEDW-YRALNIFVAPPLYEGFGLTPLEAMASG-IP 271 A+ + IL DE ++ Y A ++F+ P ++E GLT + AM G +P Sbjct: 826 FEGIADQFQSNNNIRLILKYDE--ALSHSIYAASDMFIIPSMFEPCGLTQMIAMRYGSVP 883 Query: 272 VVASNTGVFSELLD 285 +V G+ + D Sbjct: 884 IVRKTGGLNDSVFD 897 >gnl|CDD|162870 TIGR02468, sucrsPsyn_pln, sucrose phosphate synthase/possible sucrose phosphate phosphatase, plant. Members of this family are sucrose-phosphate synthases of plants. This enzyme is known to exist in multigene families in several species of both monocots and dicots. The N-terminal domain is the glucosyltransferase domain. Members of this family also have a variable linker region and a C-terminal domain that resembles sucrose phosphate phosphatase (SPP) (EC 3.1.3.24) (see TIGR01485), the next and final enzyme of sucrose biosynthesis. The SPP-like domain likely serves a binding and not a catalytic function, as the reported SPP is always encoded by a distinct protein. Length = 1050 Score = 41.7 bits (98), Expect = 3e-04 Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 4/47 (8%) Query: 236 QSSIEDWYR----ALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG 278 QS + D YR +F+ P E FGLT +EA A G+P+VA+ G Sbjct: 558 QSDVPDIYRLAAKTKGVFINPAFIEPFGLTLIEAAAHGLPMVATKNG 604 >gnl|CDD|185101 PRK15179, PRK15179, Vi polysaccharide biosynthesis protein TviE; Provisional. Length = 694 Score = 41.6 bits (97), Expect = 3e-04 Identities = 40/177 (22%), Positives = 71/177 (40%), Gaps = 10/177 (5%) Query: 161 KISEDAKL-IGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQR 219 + DA+ +G R+ K L+V+ HP ++VG L + + + Sbjct: 511 ARTSDARFTVGTVMRVDDNKRPFLWVEAAQRFAASHPKVRFIMVGGGPLLESV--REFAQ 568 Query: 220 RIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV 279 R+ G+ +RILF + W N F+ +EG +EA SG+PVV + G Sbjct: 569 RL---GMGERILFTGLSRRVGYWLTQFNAFLLLSRFEGLPNVLIEAQFSGVPVVTTLAGG 625 Query: 280 FSELLDPENAKAGVIVPPRNLHA--LEKAVLYFMNSKKIMSDTGNRGRERAVKHFSI 334 E + E G+ +P + A + +A+ + + + A FS+ Sbjct: 626 AGEAVQ-EGV-TGLTLPADTVTAPDVAEALARIHDMCAADPGIARKAADWASARFSL 680 >gnl|CDD|185325 PRK15427, PRK15427, colanic acid biosynthesis glycosyltransferase WcaL; Provisional. Length = 406 Score = 37.4 bits (87), Expect = 0.006 Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 8/72 (11%) Query: 247 NIFVAPPL------YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNL 300 ++F+ P + EG + +EAMA GIPVV++ EL+ E K+G +VP + Sbjct: 300 DVFLLPSVTGADGDMEGIPVALMEAMAVGIPVVSTLHSGIPELV--EADKSGWLVPENDA 357 Query: 301 HALEKAVLYFMN 312 AL + + F Sbjct: 358 QALAQRLAAFSQ 369 >gnl|CDD|182148 PRK09922, PRK09922, UDP-D-galactose:(glucosyl)lipopolysaccharide-1, 6-D-galactosyltransferase; Provisional. Length = 359 Score = 34.7 bits (80), Expect = 0.036 Identities = 12/22 (54%), Positives = 18/22 (81%) Query: 255 YEGFGLTPLEAMASGIPVVASN 276 +EGF +T LEAM+ GIP ++S+ Sbjct: 267 FEGFPMTLLEAMSYGIPCISSD 288 >gnl|CDD|107050 PHA01633, PHA01633, putative glycosyl transferase group 1. Length = 335 Score = 33.8 bits (77), Expect = 0.068 Identities = 14/32 (43%), Positives = 21/32 (65%) Query: 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVV 273 +Y A++ + P EGFG+ LE+MA G PV+ Sbjct: 220 FYGAMDFTIVPSGTEGFGMPVLESMAMGTPVI 251 >gnl|CDD|131964 TIGR02918, TIGR02918, accessory Sec system glycosylation protein GtfA. Members of this protein family are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus. Members are associated with glycosylation of serine-rich glycoproteins exported by the accessory Sec system. Length = 500 Score = 33.5 bits (77), Expect = 0.077 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 21/94 (22%) Query: 195 HPGW--TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRIL--FIDE-----------QSSI 239 H W AVV K ++ F IY G +K+ L I+E ++ Sbjct: 333 HIDWLVKAVVKAKKSVPELTFD------IYGEGGEKQKLQKIINENQAQDYIHLKGHRNL 386 Query: 240 EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVV 273 + Y+ ++++ EGFGLT +EA+ SG+ ++ Sbjct: 387 SEVYKDYELYLSASTSEGFGLTLMEAVGSGLGMI 420 >gnl|CDD|184505 PRK14099, PRK14099, glycogen synthase; Provisional. Length = 485 Score = 33.2 bits (76), Expect = 0.12 Identities = 42/193 (21%), Positives = 66/193 (34%), Gaps = 74/193 (38%) Query: 138 IMHGVDTERFRPTS------------------NKQ--EARRHLKISEDAKLIGCFGRIRK 177 I++G+DT + P + NK +AR L DA L+G R+ Sbjct: 247 ILNGIDTAVWNPATDELIAATYDVETLAARAANKAALQARFGLDPDPDALLLGVISRLSW 306 Query: 178 LKGTDLFVDCMINILP------------------------HHPGWTAVVVGKTTLKHYLF 213 KG DL ++ + +L +PG VV+G + Sbjct: 307 QKGLDLLLEALPTLLGEGAQLALLGSGDAELEARFRAAAQAYPGQIGVVIG--------Y 358 Query: 214 KKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASG-IPV 272 + L I A + + P +E GLT L A+ G +PV Sbjct: 359 DEALAHLIQAGA---------------------DALLVPSRFEPCGLTQLCALRYGAVPV 397 Query: 273 VASNTGVFSELLD 285 VA G+ ++D Sbjct: 398 VARVGGLADTVVD 410 >gnl|CDD|163131 TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Length = 397 Score = 32.7 bits (75), Expect = 0.15 Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 18/78 (23%) Query: 263 LEAMASGIPVVASNTGVFSELLDPENAKAGVIVPP--RNLHALE-----KAVLYFMNSKK 315 LEAMA PVVAS PE A G+ P L A + A+L + + Sbjct: 316 LEAMAMAKPVVAS----------PE-AAEGIDALPGAELLVAADPADFAAAILALLANPA 364 Query: 316 IMSDTGNRGRERAVKHFS 333 + G R R ++H+ Sbjct: 365 EREELGQAARRRVLQHYH 382 >gnl|CDD|177950 PLN02316, PLN02316, synthase/transferase. Length = 1036 Score = 32.5 bits (74), Expect = 0.17 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 243 YRALNIFVAPPLYEGFGLTPLEAMASG-IPVVASNTGVFSELLDPENAK 290 Y + + P ++E GLT L AM G IPVV G+F + D ++ K Sbjct: 917 YAGADFILVPSIFEPCGLTQLTAMRYGSIPVVRKTGGLFDTVFDVDHDK 965 >gnl|CDD|164861 PHA01630, PHA01630, putative group 1 glycosyl transferase. Length = 331 Score = 32.1 bits (73), Expect = 0.21 Identities = 18/69 (26%), Positives = 29/69 (42%), Gaps = 4/69 (5%) Query: 219 RRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG 278 R NG+K + D I + +I P F + +EA+A G+ VV + G Sbjct: 187 RLFGLNGVKTPLPDDD----IYSLFAGCDILFYPVRGGAFEIPVIEALALGLDVVVTEKG 242 Query: 279 VFSELLDPE 287 +SE + Sbjct: 243 AWSEWVLSN 251 >gnl|CDD|167080 PRK00854, rocD, ornithine--oxo-acid transaminase; Reviewed. Length = 401 Score = 30.9 bits (70), Expect = 0.48 Identities = 27/98 (27%), Positives = 40/98 (40%), Gaps = 12/98 (12%) Query: 253 PLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPP-------RNLHALEK 305 P GF + P + + NT F L++P +AGVI+PP R L Sbjct: 164 PFTPGFRVVPFGDAEALEAAITPNTVAF--LVEPIQGEAGVIIPPAGYFTRVRELCTANN 221 Query: 306 AVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 L +I + G G+ A +H I + + IGK Sbjct: 222 VTLIL---DEIQTGLGRTGKLLAEEHEGIEADVTLIGK 256 >gnl|CDD|179083 PRK00654, glgA, glycogen synthase; Provisional. Length = 466 Score = 30.5 bits (70), Expect = 0.69 Identities = 36/133 (27%), Positives = 60/133 (45%), Gaps = 16/133 (12%) Query: 152 NKQEARRHLKIS-EDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKH 210 NK+ + + +DA L R+ + KG DL ++ + +L V++G T Sbjct: 267 NKRALQERFGLPDDDAPLFAMVSRLTEQKGLDLVLEALPELLEQGG--QLVLLG-TGDPE 323 Query: 211 Y--LFKKNLQRRIYANGLKKRILFIDEQSS--IEDWYRALNIFVAPPLYEGFGLTPLEAM 266 F+ L R Y + +I + DE + I Y ++F+ P +E GLT L A+ Sbjct: 324 LEEAFR-ALAAR-YPGKVGVQIGY-DEALAHRI---YAGADMFLMPSRFEPCGLTQLYAL 377 Query: 267 ASG-IPVVASNTG 278 G +P+V TG Sbjct: 378 RYGTLPIV-RRTG 389 >gnl|CDD|151162 pfam10649, DUF2478, Protein of unknown function (DUF2478). This is a family of hypothetical bacterial proteins found in the vicinity of Molybdenum ABC transporter ATP-binding gene-products MobA MobB and MobC. However the function could not be confirmed. This family appears to belong to the P-loop superfamily by alignment to pfam03266. However, the characteristic P-loop sequence motif appears to have diverged beyond recognition in this family. Length = 159 Score = 29.8 bits (68), Expect = 1.1 Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 5/39 (12%) Query: 256 EGFGLTPL--EAMASGIPVVASNTGVFSELLDPENAKAG 292 EG GL EA+A+GIPV+ T V L+ A AG Sbjct: 108 EGRGLRAEIAEALAAGIPVL---TAVPPRNLEAWRAFAG 143 >gnl|CDD|181401 PRK08360, PRK08360, 4-aminobutyrate aminotransferase; Provisional. Length = 443 Score = 29.7 bits (67), Expect = 1.3 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 22/79 (27%) Query: 278 GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIM-------------SDTGNRG 324 GV + +P AG+IVPP + YF KKI+ S G G Sbjct: 204 GVAALFAEPIQGDAGMIVPPED---------YFKKLKKILDEHGILLVVDEVQSGLGRTG 254 Query: 325 RERAVKHFSIVKEASDIGK 343 + A++HF + + +GK Sbjct: 255 KWFAIEHFGVEPDIITLGK 273 >gnl|CDD|131814 TIGR02767, TraG-Ti, Ti-type conjugative transfer system protien TraG. This protein is found in the Agrobacterium tumefaciens Ti plasmid tra region responsible for conjugative transfer of the entire plasmid among Agrobacterium strains. The protein is distantly related to the F-type conjugation system TraG protein. Both of these systems are examples of type IV secretion systems. Length = 623 Score = 29.5 bits (66), Expect = 1.4 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 12/58 (20%) Query: 180 GTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANG-LKKRILFI-DE 235 TD+F++ +IL +PG V++G L A+G +K R LF+ DE Sbjct: 426 KTDVFINLPASILESYPGIGRVIIGSL----------LNAMYNADGNVKGRTLFLLDE 473 >gnl|CDD|132415 TIGR03372, putres_am_tran, putrescine aminotransferase. Members of this family are putrescine aminotransferase, as found in Escherichia coli, Erwinia carotovora subsp. atroseptica, and closely related species. This pyridoxal phosphate enzyme, as characterized in E. coli, can act also on cadaverine and, more weakly, spermidine. Length = 442 Score = 29.2 bits (65), Expect = 1.5 Identities = 23/91 (25%), Positives = 38/91 (41%), Gaps = 10/91 (10%) Query: 253 PLYEGFGLTP---LEAMASGIPVVASNTG--VFSELLDPENAKAGVIVPPRN----LHAL 303 PL GF +EAM + TG V + +L+P + GVI+PP + AL Sbjct: 195 PLLPGFHHVAFGDIEAMLKALNE-CKKTGDDVAAIILEPIQGEGGVILPPEGYLPAVRAL 253 Query: 304 EKAVLYFMNSKKIMSDTGNRGRERAVKHFSI 334 + ++ + G G+ A +H + Sbjct: 254 CDEFGALLILDEVQTGMGRTGKMFACEHEGV 284 >gnl|CDD|162673 TIGR02049, gshA_ferroox, glutamate--cysteine ligase, T. ferrooxidans family. This family consists of a rare family of glutamate--cysteine ligases, demonstrated first in Thiobacillus ferrooxidans and present in a few other Proteobacteria. It is the first of two enzymes for glutathione biosynthesis. It is also called gamma-glutamylcysteine synthetase. Length = 403 Score = 28.7 bits (64), Expect = 2.7 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%) Query: 230 ILFIDEQSSIEDWYRALNIFVAPPLY-------EGFGLTPLEA 265 I +D + SIE W+RA PP Y G+ + P++ Sbjct: 2 IKILDLEKSIERWFRAQWQGHTPPFYSSVDLRNAGYKIAPVDT 44 >gnl|CDD|140319 PTZ00298, PTZ00298, mevalonate kinase; Provisional. Length = 328 Score = 28.3 bits (63), Expect = 2.9 Identities = 27/105 (25%), Positives = 42/105 (40%), Gaps = 18/105 (17%) Query: 180 GTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSI 239 G D + +C + + PG VV + + Y+ +K ++R A+ L R L ID Sbjct: 37 GIDEYTECRLELTKGVPG-LQVVDQRPAVPGYIVEKREEQRK-AHQLVLRHLNIDTSVD- 93 Query: 240 EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELL 284 L + + P L P +SGI AS+ S L Sbjct: 94 -----GLKMHLGGP------LVP----SSGIGASASDVVSLSRAL 123 >gnl|CDD|179248 PRK01212, PRK01212, homoserine kinase; Provisional. Length = 301 Score = 28.2 bits (64), Expect = 3.0 Identities = 14/61 (22%), Positives = 19/61 (31%), Gaps = 24/61 (39%) Query: 45 LPKNIP------SIGISSLLTCWKKPIGQNSRIWHA-RRNNEMLLGVMMRDVL----RMP 93 LPK + ++LL A + L G M+DVL R Sbjct: 185 LPKQYSLKDAVFNSSRAALLV-------------AALYTGDYELAGRAMKDVLHEPYRAK 231 Query: 94 L 94 L Sbjct: 232 L 232 >gnl|CDD|179736 PRK04073, rocD, ornithine--oxo-acid transaminase; Provisional. Length = 396 Score = 28.1 bits (63), Expect = 3.2 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 8/48 (16%) Query: 253 PLYEGFGLTP---LEAMASGIPVVASNTGVFSELLDPENAKAGVIVPP 297 P+ G P LEA+ + I NT F L++P +AG+ +PP Sbjct: 163 PMLPGIKKIPYGDLEALKAAI---TPNTAAF--LVEPIQGEAGINIPP 205 >gnl|CDD|172350 PRK13822, PRK13822, conjugal transfer coupling protein TraG; Provisional. Length = 641 Score = 27.6 bits (62), Expect = 5.6 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 14/59 (23%) Query: 180 GTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIY-ANG-LKKRILFI-DE 235 TD+F++ + IL +PG V++G +L +Y A+G +K R LF+ DE Sbjct: 438 NTDVFINIPLKILESYPGLARVIIG-----------SLLNAMYNADGNVKGRALFLLDE 485 >gnl|CDD|183175 PRK11522, PRK11522, putrescine--2-oxoglutarate aminotransferase; Provisional. Length = 459 Score = 27.4 bits (61), Expect = 5.9 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 4/51 (7%) Query: 253 PLYEGFGLTP---LEAMASGIPVVASN-TGVFSELLDPENAKAGVIVPPRN 299 PL GF P +EAM + + V + +L+P + GVI+PP Sbjct: 202 PLLPGFRHVPFGNIEAMRTALSECKKTGDDVAAVILEPIQGEGGVILPPEG 252 >gnl|CDD|162413 TIGR01547, phage_term_2, phage terminase, large subunit, PBSX family. This model detects members of a highly divergent family of the large subunit of phage terminase. All members are encoded by phage genomes or within prophage regions of bacterial genomes. This is a distinct family from pfam03354. Length = 396 Score = 27.0 bits (60), Expect = 7.0 Identities = 18/96 (18%), Positives = 27/96 (28%), Gaps = 17/96 (17%) Query: 151 SNKQEARRHLKISEDAKLI---GCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTT 207 K ++ +KI K G + KLK + + Sbjct: 66 FKKSKSSMEIKILNTGKKFIFKGLNDKPNKLKSGAGIAIIWFE--------------EAS 111 Query: 208 LKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWY 243 + K L R+ G KK I+F S W Sbjct: 112 QLTFEDIKELIPRLRETGGKKFIIFSSNPESPLHWV 147 >gnl|CDD|184626 PRK14329, PRK14329, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional. Length = 467 Score = 26.9 bits (60), Expect = 8.8 Identities = 9/19 (47%), Positives = 12/19 (63%) Query: 323 RGRERAVKHFSIVKEASDI 341 RGRER+ SI+ E D+ Sbjct: 191 RGRERSRDPESILNEVRDL 209 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.323 0.138 0.416 Gapped Lambda K H 0.267 0.0671 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 5,911,012 Number of extensions: 388253 Number of successful extensions: 731 Number of sequences better than 10.0: 1 Number of HSP's gapped: 723 Number of HSP's successfully gapped: 41 Length of query: 352 Length of database: 5,994,473 Length adjustment: 94 Effective length of query: 258 Effective length of database: 3,963,321 Effective search space: 1022536818 Effective search space used: 1022536818 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 58 (26.2 bits)