BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037469
(429 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q5NDE4|GTDC2_TAKRU Glycosyltransferase-like domain-containing protein 2 OS=Takifugu
rubripes GN=gtdc2 PE=2 SV=1
Length = 590
Score = 50.4 bits (119), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/258 (20%), Positives = 104/258 (40%), Gaps = 32/258 (12%)
Query: 146 NIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKKLSRYDII---DIDN 201
N+ H F D ++P + +QF + + + W + + + LS + + N
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDSDEDARLVFMEGWEEGPHFELYRLLSNKQPLLKEQLRN 221
Query: 202 QDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFRQFIRSSYSLQKPAA 251
+ CF + IGL + + N S I+ F + + ++ + A
Sbjct: 222 FGKLMCFTKSYIGLSKMTTWYQYGFVQPQGPKANILVSGNEIRHFAKVLMEKMNITRAAG 281
Query: 252 IRLRDG--EKKKPR---LLIVSRKRTRAFTN-AEEIAQMGRRLGFNVV-VAEANGNLSRF 304
G E +KP+ +++ SR TR N AE I + + VV V+ +
Sbjct: 282 GEKDQGNAEDEKPKDEYIVVFSRSTTRLILNEAELIMALAQEFQMRVVTVSLEEQSFPSI 341
Query: 305 AETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK------AM 358
+ ++ + +++HGA + +FLP AV +++ PF D P + M
Sbjct: 342 VQVISGASMLVSMHGAQLITSLFLPPGAVVVELYPFA-----VNPDQYTPYRTLASLPGM 396
Query: 359 KLRYLEYKIKLDESTLIQ 376
L Y+ ++ +E+T+
Sbjct: 397 DLHYIPWRNTEEENTVTH 414
>sp|Q5NDE3|GTDC2_TETNG Glycosyltransferase-like domain-containing protein 2 OS=Tetraodon
nigroviridis GN=gtdc2 PE=2 SV=1
Length = 579
Score = 50.1 bits (118), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/254 (19%), Positives = 105/254 (41%), Gaps = 35/254 (13%)
Query: 146 NIFHDFSDIIIPLYINSRQF-----NGQVQFVITNKQSWWINKYQNILKKLSRYDII--- 197
N+ H F D ++P + +QF + ++ F+ + W + ++ + LS +
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDLDEDARLVFM----EGWDEGPHFHLYRLLSDKQPLLKE 217
Query: 198 DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFIRSSYSLQKPA 250
+ N + CF + IGL + + + P S + R F + L +
Sbjct: 218 QLRNFGKLMCFTKSYIGLSKMTTWYQYGFVQPQGPKANILVSGNEIRHFAKV---LMEKM 274
Query: 251 AIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGFNVV-VAEANGNLSRFAETV 308
+ +G ++ +++ SR TR N AE + + + VV V+ + + + +
Sbjct: 275 NVTRAEGGQEDEYIVVFSRSSTRLILNQAELVMALAQEFQMRVVTVSLEEQSFASIVQVI 334
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK------AMKLRY 362
+ +++HGA + +FLP AV +++ PF D P + M L Y
Sbjct: 335 GAASMLVSMHGAQLITALFLPPGAVVVELFPFA-----VNPDQYTPYRTLAALPGMDLHY 389
Query: 363 LEYKIKLDESTLIQ 376
+ ++ +E+T+
Sbjct: 390 ISWRNTEEENTITH 403
>sp|Q8BW41|GTDC2_MOUSE Glycosyltransferase-like domain-containing protein 2 OS=Mus
musculus GN=Gtdc2 PE=2 SV=1
Length = 605
Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G Q + + W + ++ K
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAQEARLFFMEGWGEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
LS + + + CF +GL + + N S I+ F
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFT 268
Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
+F+ ++ A GE+ +L+ SR + R N E+ QM
Sbjct: 269 RFMTERLNVSHAGAPL---GEEY---ILVFSRTQNRLILNEAELLLELAQEFQMK----- 317
Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
V V+ + + V+ + +++HGA + +FLP A +++ P+ D
Sbjct: 318 TVTVSLEDHTFADVVRLVSNASMLVSMHGAQLVTALFLPRGATVVELFPYA-----VNPD 372
Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
+ P K M L+Y+ ++ + E+T+
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMIRENTVTH 404
>sp|Q5NDF0|GTDC2_RAT Glycosyltransferase-like domain-containing protein 2 OS=Rattus
norvegicus GN=Gtdc2 PE=2 SV=1
Length = 580
Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G Q + + W + ++ K
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAQEARLFFMEGWGEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
LS + + + CF +GL + + N S I+ F
Sbjct: 209 LSPKQPLLRSQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFT 268
Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
+F+ ++ A GE+ +L+ SR + R N E+ QM
Sbjct: 269 RFMTERLNVSHAGAPL---GEEY---ILVFSRTQNRLILNEAELLLELAQEFQMK----- 317
Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
V V+ + + V+ + +++HGA + +FLP A +++ P+ D
Sbjct: 318 TVTVSLEDHTFADVVRLVSNASMLVSMHGAQLVTALFLPRGATVVELFPYA-----VNPD 372
Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
+ P K M L+Y+ ++ + E+T+
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMIRENTVTH 404
>sp|Q5NDE9|GTDC2_CANFA Glycosyltransferase-like domain-containing protein 2 OS=Canis
familiaris GN=GTDC2 PE=2 SV=1
Length = 580
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G + + W + ++ K
Sbjct: 149 VPDVALVANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWSEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
LS + + + CF +GL + + N S I+ F
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFA 268
Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
+F+ ++ + A GE+ +L+ SR + R N E+ QM
Sbjct: 269 RFMMEKLNVSQAGAPL---GEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK----- 317
Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
V V+ + + V+ + ++VHGA + +FLP A +++ P+ D
Sbjct: 318 TVTVSLEDQAFADVVRLVSNASMLVSVHGAQLVTALFLPRGATVVELFPYA-----VNPD 372
Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
+ P K M L+Y+ ++ + E+T+
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMMPENTVTH 404
>sp|Q5NDF2|GTDC2_BOVIN Glycosyltransferase-like domain-containing protein 2 OS=Bos taurus
GN=GTDC2 PE=2 SV=1
Length = 580
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/269 (20%), Positives = 107/269 (39%), Gaps = 38/269 (14%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G + + + W + ++ K
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAREARLFFMEGWGEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
LS + + + CF +GL + + + P S + RQF
Sbjct: 209 LSPKQPLLRAQLKALGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQF- 267
Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
++ L + + G + +L+ SR + R N E+ QM V
Sbjct: 268 --AHFLMEKLNVSQAGGPLGEEYILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320
Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
V+ + + V+ + +++HGA + +FLP A +++ P+ D+
Sbjct: 321 VSLEDHAFADVVRLVSNASMLVSMHGAQLVTALFLPRGAAVVELFPYA-----VNPDHYT 375
Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
P K M L+Y+ ++ + E+T+
Sbjct: 376 PYKTLATLPGMDLQYIAWQNTMPENTVTH 404
>sp|Q8NAT1|GTDC2_HUMAN Glycosyltransferase-like domain-containing protein 2 OS=Homo
sapiens GN=GTDC2 PE=1 SV=1
Length = 580
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 105/269 (39%), Gaps = 38/269 (14%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G + + W + ++ K
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWGEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
LS + + + CF +GL + + + P S + RQF
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFA 268
Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
R + + GE+ +L+ SR + R N E+ QM V
Sbjct: 269 RFMTEKLNVSHTGVPLGEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320
Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
V+ + + V+ + +++HGA + +FLP A +++ P+ D+
Sbjct: 321 VSLEDHTFADVVRLVSNASMLVSMHGAQLVTTLFLPRGATVVELFPYA-----VNPDHYT 375
Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
P K M L+Y+ ++ + E+T+
Sbjct: 376 PYKTLAMLPGMDLQYVAWRNMMPENTVTH 404
>sp|Q5NDL3|EOGT_CHICK EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Gallus gallus GN=EOGT PE=2 SV=2
Length = 535
Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 90/205 (43%), Gaps = 17/205 (8%)
Query: 146 NIFHDFSDII---IPLYINSRQFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D + I +IN+ F+ V V+ + S+ + + + K + YDII +
Sbjct: 253 NMYHHFCDFVNLYITQHINN-SFSTDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLKT 311
Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
D CF + L R+ N + FR F S + L + I
Sbjct: 312 FDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAF--SQHVLHR-LNITQEGP 368
Query: 258 EKKKPRLLIVSRKRT-RAFTNAEEIAQMGRRLG-FNVVVAEANGNLSRFAETV---NYCD 312
+ K R+ I++R R N E+ + + V V + F+E + + D
Sbjct: 369 KDGKIRVTILARSTDYRKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSD 428
Query: 313 VFLAVHGAAMTNMIFLPENAVFIQV 337
+F+ +HGA +T+++FLP+ AV ++
Sbjct: 429 IFIGMHGAGLTHLLFLPDWAVVFEL 453
>sp|Q5NDF1|GTDC2_PANTR Glycosyltransferase-like domain-containing protein 2 OS=Pan
troglodytes GN=GTDC2 PE=2 SV=1
Length = 580
Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 105/269 (39%), Gaps = 38/269 (14%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++PL+ RQF G + + W + ++ K
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWGEGAHFDLYKL 208
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
LS + + + CF +GL + + + P S + RQF
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFA 268
Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
R + + GE+ +L+ SR + R N E+ QM V
Sbjct: 269 RFMTEKLNVSHTGVPLGEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320
Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
V+ + + V+ + +++HGA + +FLP A +++ P+ D+
Sbjct: 321 VSLEDHAFADVVRLVSNASMLVSMHGAQLVTTLFLPRGATVVELFPYA-----VNPDHYT 375
Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
P K M L+Y+ ++ + E+T+
Sbjct: 376 PYKTLAMLPGMDLQYVAWRNMMPENTVTH 404
>sp|A0JND3|EOGT_BOVIN EGF domain-specific O-linked N-acetylglucosamine transferase OS=Bos
taurus GN=EOGT PE=2 SV=1
Length = 527
Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 90/210 (42%), Gaps = 19/210 (9%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D I LYI F+ V V+ + S+ + + + + K + YD+I +
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
D CF I L R+ N + FR F S + L + I
Sbjct: 304 YDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAF--SQHVLHR-LNITQEGP 360
Query: 258 EKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETVNYC 311
+ K R+ I++R R N E+ + + F V + + G L + T N
Sbjct: 361 KGGKIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHN-T 419
Query: 312 DVFLAVHGAAMTNMIFLPENAVFIQVVPFG 341
D+F+ +HGA +T+++FLP+ A ++ G
Sbjct: 420 DIFIGMHGAGLTHLLFLPDWAAVFELYNCG 449
>sp|Q5NDE6|GTDC2_XENTR Glycosyltransferase-like domain-containing protein 2 OS=Xenopus
tropicalis GN=gtdc2 PE=2 SV=1
Length = 576
Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/270 (18%), Positives = 112/270 (41%), Gaps = 37/270 (13%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKK 190
VP + L N N+ H F D +IP++ +QF + + + + W + + K
Sbjct: 148 VPDVALIMNRFNPDNLMHVFHDDLIPIFYTIQQFADLDFESRLFFMEGWNEGLHFELYKF 207
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
+S + + + CF + +GL + + N S I+ F
Sbjct: 208 MSNKQPLLKEQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRHFA 267
Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVA 295
+F+ ++ K D + +++ SR R N AE + + + + V+
Sbjct: 268 KFMMGKLNITK-------DQNAAEAYIVLFSRSMNRLIVNEAELLLALAQEFQMKTITVS 320
Query: 296 EANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPA 355
+ + + ++ + +++HGA + +FLP+ A+ +++ P+G ++ P
Sbjct: 321 LEDHSFADIVRLISNATMLVSMHGAQLITSLFLPKGAIVVELFPYG-----VNPEHYTPY 375
Query: 356 K------AMKLRYLEYKIKLDESTLIQQYP 379
K M+L+Y+ ++ +E+T+ YP
Sbjct: 376 KTLSTLPGMELQYVAWQNTEEENTI--AYP 403
>sp|Q5NDL0|EOGT_RAT EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Rattus norvegicus GN=Eogt PE=2 SV=1
Length = 527
Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 95/209 (45%), Gaps = 25/209 (11%)
Query: 146 NIFHDFSDII---IPLYINSRQFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D + + +IN+ F+ V V+ + S+ + + + + K + YD+I +
Sbjct: 245 NMYHHFCDFLNLYLTQHINN-SFSTDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
D CF + L R+ N I+ ++ + F Q + ++ +
Sbjct: 304 YDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNISQEGP--- 360
Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
+DG K R+ I++R R N E+ + + F V V + G L + T
Sbjct: 361 KDG---KLRVTILARSTEYRKILNQNELVNALKTVSTFEVRVVDYKYRELGFLDQLRITH 417
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
N D+F+ +HGA +T+++FLP+ A ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445
>sp|Q6GQ23|EOGT_XENLA EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Xenopus laevis GN=eogt PE=2 SV=1
Length = 525
Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 86/204 (42%), Gaps = 15/204 (7%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVI-TNKQSWWINKYQNILKKLSRYDIIDIDN 201
N++H F D + LYI F+ + V+ T + + + + K + Y+I +
Sbjct: 243 NMYHHFCDFV-NLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYEITHLKA 301
Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
D+ CF + L R+ N + FR F S + L + +
Sbjct: 302 YDNKRVCFKDAVFALLPRMRYGLFYNTPLISHCHGSGLFRAF--SQHVLHRLNITQHPAT 359
Query: 258 EKKKPRLLIVSRKRTRAFTNAEEIAQMGRRL-GFNVVVAEANGNLSRFAETVNYC---DV 313
E K ++V R N +E+ Q + F V V + + F E ++ D+
Sbjct: 360 EAKIRVTILVRSTEFRKILNLDELVQALEAVPTFQVKVVDYKYRVLGFLEQLSITHNSDI 419
Query: 314 FLAVHGAAMTNMIFLPENAVFIQV 337
F+ +HGA +T+++FLP+ AV ++
Sbjct: 420 FIGMHGAGLTHLLFLPDWAVVFEL 443
>sp|Q08CY9|EOGT_XENTR EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Xenopus tropicalis GN=eogt PE=2 SV=1
Length = 525
Score = 42.7 bits (99), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 86/204 (42%), Gaps = 15/204 (7%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVI-TNKQSWWINKYQNILKKLSRYDIIDIDN 201
N++H F D + LYI F+ + V+ T + + + + K + YDI +
Sbjct: 243 NMYHHFCDFV-NLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYDITHLKA 301
Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
D+ CF + L R+ N + + FR F S + L + +
Sbjct: 302 YDNKRVCFKDAVFALLPRMRYGLFYNTPLISNCHGSGLFRAF--SQHVLHRLNITQQLPK 359
Query: 258 EKKKPRLLIVSRKRTRAFTNAEEIAQ-MGRRLGFNVVVAEANGNLSRFAETV---NYCDV 313
E K ++V R N +E+ + F V V + + F E + + D+
Sbjct: 360 EAKIRITILVRSTEFRKILNLDELVHALEAEPTFQVKVVDYKYRVLGFLEQLEITHNSDI 419
Query: 314 FLAVHGAAMTNMIFLPENAVFIQV 337
F+ +HGA +T+++FLP+ AV ++
Sbjct: 420 FIGMHGAGLTHLLFLPDWAVVFEL 443
>sp|Q8BYW9|EOGT_MOUSE EGF domain-specific O-linked N-acetylglucosamine transferase OS=Mus
musculus GN=Eogt PE=1 SV=1
Length = 527
Score = 42.4 bits (98), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 48/209 (22%), Positives = 94/209 (44%), Gaps = 25/209 (11%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D + LY+ F+ V V+ + ++ + + + + K + YD+I +
Sbjct: 245 NMYHHFCDFL-NLYLTQHVNNSFSTDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
D CF + L R+ N I+ ++ + F Q + ++ +
Sbjct: 304 YDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGP--- 360
Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
+DG K R+ I++R R N +E+ + + F V V + G L + T
Sbjct: 361 KDG---KVRVTILARSTEYRKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITH 417
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
N D+F+ +HGA +T+++FLP+ A ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445
>sp|Q5NDL1|EOGT_PANTR EGF domain-specific O-linked N-acetylglucosamine transferase OS=Pan
troglodytes GN=EOGT PE=2 SV=1
Length = 527
Score = 41.6 bits (96), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 92/209 (44%), Gaps = 25/209 (11%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D I LYI F+ V V+ + S+ + + + + + YD+I +
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
D CF + L R+ N I+ ++ + F Q + ++ +
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGP--- 360
Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
+DG K R+ I++R R N E+ + + F V + + G L + T
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITH 417
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
N D+F+ +HGA +T+++FLP+ A ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445
>sp|Q5NDL2|EOGT_HUMAN EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Homo sapiens GN=EOGT PE=2 SV=1
Length = 527
Score = 41.6 bits (96), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 92/209 (44%), Gaps = 25/209 (11%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D I LYI F+ V V+ + S+ + + + + + YD+I +
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
D CF + L R+ N I+ ++ + F Q + ++ +
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGP--- 360
Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
+DG K R+ I++R R N E+ + + F V + + G L + T
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITH 417
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
N D+F+ +HGA +T+++FLP+ A ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445
>sp|Q5NDL9|EOGT_CANFA EGF domain-specific O-linked N-acetylglucosamine transferase
OS=Canis familiaris GN=EOGT PE=2 SV=1
Length = 527
Score = 40.8 bits (94), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 48/209 (22%), Positives = 92/209 (44%), Gaps = 25/209 (11%)
Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
N++H F D + LYI F+ V V+ + S+ + + + + K + YD+I +
Sbjct: 245 NMYHHFCDFV-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303
Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
D CF + L R+ N I+ ++ + F Q + ++ +
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGP--- 360
Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
+DG K R+ I++R R N E+ + + V + + G L + T
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTLEVQIVDYKYKELGFLDQLRITH 417
Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
N D+F+ +HGA +T+++FLP+ A ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445
>sp|Q5NDE7|GTDC2_XENLA Glycosyltransferase-like domain-containing protein 2 OS=Xenopus
laevis GN=gtdc2 PE=2 SV=1
Length = 578
Score = 40.4 bits (93), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 51/270 (18%), Positives = 111/270 (41%), Gaps = 37/270 (13%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKK 190
VP + L N N+ H F D ++P++ +QF + + + + W + + K
Sbjct: 148 VPDVALIMNRFNPDNLMHVFHDDLLPIFYTIQQFPDLDFESRLFFMEGWNEGLHFELYKF 207
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
+S + + + CF + +GL + + N S I+ F
Sbjct: 208 MSNKQPLLKEQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRHFA 267
Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVA 295
+F+ ++ D + +++ SR R N AE + + + + V+
Sbjct: 268 KFMMGKLNIT-------LDQNAAEAYIVLFSRSMNRLIVNEAELLLALAQEFQMKTITVS 320
Query: 296 EANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPA 355
+ + S ++ + +++HGA + +FLP+ AV +++ P+G ++ P
Sbjct: 321 LEDHSFSDIVRLLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYG-----INPEHYTPY 375
Query: 356 K------AMKLRYLEYKIKLDESTLIQQYP 379
K M+L+Y+ ++ +E+T+ YP
Sbjct: 376 KTLSTLPGMELQYVAWQNTEEENTIT--YP 403
>sp|Q5NDE8|GTDC2_CHICK Glycosyltransferase-like domain-containing protein 2 OS=Gallus
gallus GN=GTDC2 PE=2 SV=1
Length = 577
Score = 37.7 bits (86), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 53/267 (19%), Positives = 107/267 (40%), Gaps = 31/267 (11%)
Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNG-QVQFVITNKQSWWINKYQNILKK 190
VP + L +N N+ H F D ++P+Y +QF + + + W + ++ K
Sbjct: 148 VPDVALIANRFNPDNLMHVFHDDLLPIYYTMQQFTDLDPETRLFFMEGWSEGVHFDLYKL 207
Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
LS + + + CF + +GL + + + P S + RQF
Sbjct: 208 LSNKQPLLREQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFT 267
Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVAEAN 298
+ + +QK ++ +++ SR R N AE I + + + V+
Sbjct: 268 K--FMMQKLNVSLEESSSEE--YIVVFSRTINRLILNEAELILALAQEFQMKTITVSLEE 323
Query: 299 GNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK-- 356
+ S ++ + +++HGA + +FLP A +++ P+ ++ P K
Sbjct: 324 HSFSDIVRLISNASMLVSMHGAQLVMSLFLPRGATVVELFPYA-----INPEHYTPYKTL 378
Query: 357 ----AMKLRYLEYKIKLDESTLIQQYP 379
M L+Y+ ++ E T+ YP
Sbjct: 379 ATLPGMDLQYIAWQNTAREDTVT--YP 403
>sp|Q5NDE5|GTDC2_DANRE Glycosyltransferase-like domain-containing protein 2 OS=Danio rerio
GN=gtdc2 PE=2 SV=1
Length = 578
Score = 37.4 bits (85), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 48/247 (19%), Positives = 104/247 (42%), Gaps = 25/247 (10%)
Query: 146 NIFHDFSDIIIPLYINSRQF---NGQVQFVITNKQSWWINKYQNILKKLSRYDIIDIDNQ 202
N+ H F D ++P+Y +Q+ + + + V + W + ++ + LS + D
Sbjct: 162 NLMHIFHDDLLPVYYTMQQYSDLDDEARLVFM--EGWGEGAHFDLYRLLSSKQPLLKDQL 219
Query: 203 DD---IHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFIRSSYSLQKPAAI 252
+ CF + +GL + + + P S + RQF +S+ +++
Sbjct: 220 KTFGKLMCFTKSYVGLSKMTTWYQYGFVQPQGPKANILISGNEIRQF--ASFLMERLNIT 277
Query: 253 RLRDGEKKKPRLLIVSRKRTRAFTNAEEIAQMGRRLGF---NVVVAEANGNLSRFAETVN 309
R + E ++V ++ T E + F V V+ + + ++
Sbjct: 278 REEEEEDDD--YIVVFKRTTNRLILNEAELLLALAQEFQMRTVTVSLEEQSFDNIIQIIS 335
Query: 310 YCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK--AMKLRYLEYKI 367
+ +++HGA M +FLP A +++ P+G T Y+ A M L+Y+ ++
Sbjct: 336 RAAMLVSMHGAQMITSMFLPRGAAVVELFPYGVNP-EQYTPYKTLASLPGMDLQYVAWRN 394
Query: 368 KLDESTL 374
++E+T+
Sbjct: 395 TMEENTV 401
>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=rse-1 PE=3 SV=2
Length = 1209
Score = 33.5 bits (75), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 46/114 (40%), Gaps = 5/114 (4%)
Query: 59 VCSVTKGSDFCEINNDIRIDGSSATVFIASSLADQ---TEWIIRPYARKYDHVAMKRVNE 115
V S T G+ I + S + A +LA Q + +I+ + + H+ RVNE
Sbjct: 494 VLSFTNGTLVLSIGETVEEVSDSGFLTTAPTLAVQQMGEDGLIQVHPKGIRHIVQGRVNE 553
Query: 116 WSVKSAASLLQCNQNHSVPAILFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQV 169
W S++ N + I S+G F SD + Y ++ +G V
Sbjct: 554 WPAPQHRSIVAATANENQVVIALSSGEIV--YFEMDSDGSLAEYDEKKEMSGTV 605
>sp|P47449|Y207_MYCGE Putative metallophosphoesterase MG207 OS=Mycoplasma genitalium
(strain ATCC 33530 / G-37 / NCTC 10195) GN=MG207 PE=3
SV=2
Length = 163
Score = 32.3 bits (72), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 19/103 (18%)
Query: 292 VVVAEANGNLSRFAETVNYCDVFLAVH-GAAMTNMIFLPENAVF------------IQVV 338
+V+A+ +G R+ E NY + + +H G MT F+ +NA F I++
Sbjct: 5 LVIADTHGQNQRWIELKNYHNPDVIIHAGDHMTTKQFMDQNATFWVAGNNDSIGNEIEIF 64
Query: 339 PFGGFAWLARTDYEEPAKAMKLRYLEYKIKLDESTLIQQYPLD 381
G ++ ++ P +K Y +K QQYP D
Sbjct: 65 QLGQINFVLMHGHQAPRDNLKKWYQLLVLKA------QQYPCD 101
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.137 0.410
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 151,539,964
Number of Sequences: 539616
Number of extensions: 6104139
Number of successful extensions: 18662
Number of sequences better than 100.0: 27
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 18643
Number of HSP's gapped (non-prelim): 27
length of query: 429
length of database: 191,569,459
effective HSP length: 120
effective length of query: 309
effective length of database: 126,815,539
effective search space: 39186001551
effective search space used: 39186001551
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 63 (28.9 bits)