BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037469
         (429 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q5NDE4|GTDC2_TAKRU Glycosyltransferase-like domain-containing protein 2 OS=Takifugu
           rubripes GN=gtdc2 PE=2 SV=1
          Length = 590

 Score = 50.4 bits (119), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/258 (20%), Positives = 104/258 (40%), Gaps = 32/258 (12%)

Query: 146 NIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKKLSRYDII---DIDN 201
           N+ H F D ++P +   +QF +      +   + W    +  + + LS    +    + N
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDSDEDARLVFMEGWEEGPHFELYRLLSNKQPLLKEQLRN 221

Query: 202 QDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFRQFIRSSYSLQKPAA 251
              + CF +  IGL +            +    N   S   I+ F + +    ++ + A 
Sbjct: 222 FGKLMCFTKSYIGLSKMTTWYQYGFVQPQGPKANILVSGNEIRHFAKVLMEKMNITRAAG 281

Query: 252 IRLRDG--EKKKPR---LLIVSRKRTRAFTN-AEEIAQMGRRLGFNVV-VAEANGNLSRF 304
                G  E +KP+   +++ SR  TR   N AE I  + +     VV V+    +    
Sbjct: 282 GEKDQGNAEDEKPKDEYIVVFSRSTTRLILNEAELIMALAQEFQMRVVTVSLEEQSFPSI 341

Query: 305 AETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK------AM 358
            + ++   + +++HGA +   +FLP  AV +++ PF         D   P +       M
Sbjct: 342 VQVISGASMLVSMHGAQLITSLFLPPGAVVVELYPFA-----VNPDQYTPYRTLASLPGM 396

Query: 359 KLRYLEYKIKLDESTLIQ 376
            L Y+ ++   +E+T+  
Sbjct: 397 DLHYIPWRNTEEENTVTH 414


>sp|Q5NDE3|GTDC2_TETNG Glycosyltransferase-like domain-containing protein 2 OS=Tetraodon
           nigroviridis GN=gtdc2 PE=2 SV=1
          Length = 579

 Score = 50.1 bits (118), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/254 (19%), Positives = 105/254 (41%), Gaps = 35/254 (13%)

Query: 146 NIFHDFSDIIIPLYINSRQF-----NGQVQFVITNKQSWWINKYQNILKKLSRYDII--- 197
           N+ H F D ++P +   +QF     + ++ F+    + W    + ++ + LS    +   
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDLDEDARLVFM----EGWDEGPHFHLYRLLSDKQPLLKE 217

Query: 198 DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFIRSSYSLQKPA 250
            + N   + CF +  IGL +     +      + P      S  + R F +    L +  
Sbjct: 218 QLRNFGKLMCFTKSYIGLSKMTTWYQYGFVQPQGPKANILVSGNEIRHFAKV---LMEKM 274

Query: 251 AIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGFNVV-VAEANGNLSRFAETV 308
            +   +G ++   +++ SR  TR   N AE +  + +     VV V+    + +   + +
Sbjct: 275 NVTRAEGGQEDEYIVVFSRSSTRLILNQAELVMALAQEFQMRVVTVSLEEQSFASIVQVI 334

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK------AMKLRY 362
               + +++HGA +   +FLP  AV +++ PF         D   P +       M L Y
Sbjct: 335 GAASMLVSMHGAQLITALFLPPGAVVVELFPFA-----VNPDQYTPYRTLAALPGMDLHY 389

Query: 363 LEYKIKLDESTLIQ 376
           + ++   +E+T+  
Sbjct: 390 ISWRNTEEENTITH 403


>sp|Q8BW41|GTDC2_MOUSE Glycosyltransferase-like domain-containing protein 2 OS=Mus
           musculus GN=Gtdc2 PE=2 SV=1
          Length = 605

 Score = 45.1 bits (105), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G  Q   +   + W    + ++ K 
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAQEARLFFMEGWGEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
           LS    +    +     + CF    +GL +            +    N   S   I+ F 
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFT 268

Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
           +F+    ++    A     GE+    +L+ SR + R   N  E+        QM      
Sbjct: 269 RFMTERLNVSHAGAPL---GEEY---ILVFSRTQNRLILNEAELLLELAQEFQMK----- 317

Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
            V V+  +   +     V+   + +++HGA +   +FLP  A  +++ P+         D
Sbjct: 318 TVTVSLEDHTFADVVRLVSNASMLVSMHGAQLVTALFLPRGATVVELFPYA-----VNPD 372

Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
           +  P K       M L+Y+ ++  + E+T+  
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMIRENTVTH 404


>sp|Q5NDF0|GTDC2_RAT Glycosyltransferase-like domain-containing protein 2 OS=Rattus
           norvegicus GN=Gtdc2 PE=2 SV=1
          Length = 580

 Score = 45.1 bits (105), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G  Q   +   + W    + ++ K 
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAQEARLFFMEGWGEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
           LS    +    +     + CF    +GL +            +    N   S   I+ F 
Sbjct: 209 LSPKQPLLRSQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFT 268

Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
           +F+    ++    A     GE+    +L+ SR + R   N  E+        QM      
Sbjct: 269 RFMTERLNVSHAGAPL---GEEY---ILVFSRTQNRLILNEAELLLELAQEFQMK----- 317

Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
            V V+  +   +     V+   + +++HGA +   +FLP  A  +++ P+         D
Sbjct: 318 TVTVSLEDHTFADVVRLVSNASMLVSMHGAQLVTALFLPRGATVVELFPYA-----VNPD 372

Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
           +  P K       M L+Y+ ++  + E+T+  
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMIRENTVTH 404


>sp|Q5NDE9|GTDC2_CANFA Glycosyltransferase-like domain-containing protein 2 OS=Canis
           familiaris GN=GTDC2 PE=2 SV=1
          Length = 580

 Score = 44.3 bits (103), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 107/272 (39%), Gaps = 44/272 (16%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G      +   + W    + ++ K 
Sbjct: 149 VPDVALVANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWSEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
           LS    +    +     + CF    +GL +            +    N   S   I+ F 
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQFA 268

Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGF 290
           +F+    ++ +  A     GE+    +L+ SR + R   N  E+        QM      
Sbjct: 269 RFMMEKLNVSQAGAPL---GEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK----- 317

Query: 291 NVVVAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTD 350
            V V+  +   +     V+   + ++VHGA +   +FLP  A  +++ P+         D
Sbjct: 318 TVTVSLEDQAFADVVRLVSNASMLVSVHGAQLVTALFLPRGATVVELFPYA-----VNPD 372

Query: 351 YEEPAK------AMKLRYLEYKIKLDESTLIQ 376
           +  P K       M L+Y+ ++  + E+T+  
Sbjct: 373 HYTPYKTLATLPGMDLQYVAWRNMMPENTVTH 404


>sp|Q5NDF2|GTDC2_BOVIN Glycosyltransferase-like domain-containing protein 2 OS=Bos taurus
           GN=GTDC2 PE=2 SV=1
          Length = 580

 Score = 44.3 bits (103), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/269 (20%), Positives = 107/269 (39%), Gaps = 38/269 (14%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G  +   +   + W    + ++ K 
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAREARLFFMEGWGEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
           LS    +    +     + CF    +GL +     +      + P      S  + RQF 
Sbjct: 209 LSPKQPLLRAQLKALGRLLCFSHAFVGLSKVTTWYQYGFVQPQGPKANILVSGNEIRQF- 267

Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
             ++ L +   +    G   +  +L+ SR + R   N  E+        QM       V 
Sbjct: 268 --AHFLMEKLNVSQAGGPLGEEYILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320

Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
           V+  +   +     V+   + +++HGA +   +FLP  A  +++ P+         D+  
Sbjct: 321 VSLEDHAFADVVRLVSNASMLVSMHGAQLVTALFLPRGAAVVELFPYA-----VNPDHYT 375

Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
           P K       M L+Y+ ++  + E+T+  
Sbjct: 376 PYKTLATLPGMDLQYIAWQNTMPENTVTH 404


>sp|Q8NAT1|GTDC2_HUMAN Glycosyltransferase-like domain-containing protein 2 OS=Homo
           sapiens GN=GTDC2 PE=1 SV=1
          Length = 580

 Score = 44.3 bits (103), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 105/269 (39%), Gaps = 38/269 (14%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G      +   + W    + ++ K 
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWGEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
           LS    +    +     + CF    +GL +     +      + P      S  + RQF 
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFA 268

Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
           R        +   +  GE+    +L+ SR + R   N  E+        QM       V 
Sbjct: 269 RFMTEKLNVSHTGVPLGEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320

Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
           V+  +   +     V+   + +++HGA +   +FLP  A  +++ P+         D+  
Sbjct: 321 VSLEDHTFADVVRLVSNASMLVSMHGAQLVTTLFLPRGATVVELFPYA-----VNPDHYT 375

Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
           P K       M L+Y+ ++  + E+T+  
Sbjct: 376 PYKTLAMLPGMDLQYVAWRNMMPENTVTH 404


>sp|Q5NDL3|EOGT_CHICK EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Gallus gallus GN=EOGT PE=2 SV=2
          Length = 535

 Score = 43.5 bits (101), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 90/205 (43%), Gaps = 17/205 (8%)

Query: 146 NIFHDFSDII---IPLYINSRQFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D +   I  +IN+  F+  V  V+ +  S+ + + +    K  + YDII +  
Sbjct: 253 NMYHHFCDFVNLYITQHINN-SFSTDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLKT 311

Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
            D    CF   +  L    R+    N       +    FR F  S + L +   I     
Sbjct: 312 FDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAF--SQHVLHR-LNITQEGP 368

Query: 258 EKKKPRLLIVSRKRT-RAFTNAEEIAQMGRRLG-FNVVVAEANGNLSRFAETV---NYCD 312
           +  K R+ I++R    R   N  E+    + +    V V +       F+E +   +  D
Sbjct: 369 KDGKIRVTILARSTDYRKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSD 428

Query: 313 VFLAVHGAAMTNMIFLPENAVFIQV 337
           +F+ +HGA +T+++FLP+ AV  ++
Sbjct: 429 IFIGMHGAGLTHLLFLPDWAVVFEL 453


>sp|Q5NDF1|GTDC2_PANTR Glycosyltransferase-like domain-containing protein 2 OS=Pan
           troglodytes GN=GTDC2 PE=2 SV=1
          Length = 580

 Score = 43.5 bits (101), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 105/269 (39%), Gaps = 38/269 (14%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQVQFV-ITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++PL+   RQF G      +   + W    + ++ K 
Sbjct: 149 VPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWGEGAHFDLYKL 208

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
           LS    +    +     + CF    +GL +     +      + P      S  + RQF 
Sbjct: 209 LSPKQPLLRAQLKTLGRLLCFSHAFVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFA 268

Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTNAEEIA-------QMGRRLGFNVV 293
           R        +   +  GE+    +L+ SR + R   N  E+        QM       V 
Sbjct: 269 RFMTEKLNVSHTGVPLGEEY---ILVFSRTQNRLILNEAELLLALAQEFQMK-----TVT 320

Query: 294 VAEANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEE 353
           V+  +   +     V+   + +++HGA +   +FLP  A  +++ P+         D+  
Sbjct: 321 VSLEDHAFADVVRLVSNASMLVSMHGAQLVTTLFLPRGATVVELFPYA-----VNPDHYT 375

Query: 354 PAK------AMKLRYLEYKIKLDESTLIQ 376
           P K       M L+Y+ ++  + E+T+  
Sbjct: 376 PYKTLAMLPGMDLQYVAWRNMMPENTVTH 404


>sp|A0JND3|EOGT_BOVIN EGF domain-specific O-linked N-acetylglucosamine transferase OS=Bos
           taurus GN=EOGT PE=2 SV=1
          Length = 527

 Score = 43.1 bits (100), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 90/210 (42%), Gaps = 19/210 (9%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D I  LYI       F+  V  V+ +  S+ + + + +  K  + YD+I +  
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
            D    CF   I  L    R+    N        +   FR F  S + L +   I     
Sbjct: 304 YDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAF--SQHVLHR-LNITQEGP 360

Query: 258 EKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETVNYC 311
           +  K R+ I++R    R   N  E+    + +  F V + +      G L +   T N  
Sbjct: 361 KGGKIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHN-T 419

Query: 312 DVFLAVHGAAMTNMIFLPENAVFIQVVPFG 341
           D+F+ +HGA +T+++FLP+ A   ++   G
Sbjct: 420 DIFIGMHGAGLTHLLFLPDWAAVFELYNCG 449


>sp|Q5NDE6|GTDC2_XENTR Glycosyltransferase-like domain-containing protein 2 OS=Xenopus
           tropicalis GN=gtdc2 PE=2 SV=1
          Length = 576

 Score = 43.1 bits (100), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/270 (18%), Positives = 112/270 (41%), Gaps = 37/270 (13%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKK 190
           VP + L  N     N+ H F D +IP++   +QF +   +  +   + W    +  + K 
Sbjct: 148 VPDVALIMNRFNPDNLMHVFHDDLIPIFYTIQQFADLDFESRLFFMEGWNEGLHFELYKF 207

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
           +S    +    +     + CF +  +GL +            +    N   S   I+ F 
Sbjct: 208 MSNKQPLLKEQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRHFA 267

Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVA 295
           +F+    ++ K       D    +  +++ SR   R   N AE +  + +      + V+
Sbjct: 268 KFMMGKLNITK-------DQNAAEAYIVLFSRSMNRLIVNEAELLLALAQEFQMKTITVS 320

Query: 296 EANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPA 355
             + + +     ++   + +++HGA +   +FLP+ A+ +++ P+G        ++  P 
Sbjct: 321 LEDHSFADIVRLISNATMLVSMHGAQLITSLFLPKGAIVVELFPYG-----VNPEHYTPY 375

Query: 356 K------AMKLRYLEYKIKLDESTLIQQYP 379
           K       M+L+Y+ ++   +E+T+   YP
Sbjct: 376 KTLSTLPGMELQYVAWQNTEEENTI--AYP 403


>sp|Q5NDL0|EOGT_RAT EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Rattus norvegicus GN=Eogt PE=2 SV=1
          Length = 527

 Score = 42.7 bits (99), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 95/209 (45%), Gaps = 25/209 (11%)

Query: 146 NIFHDFSDII---IPLYINSRQFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D +   +  +IN+  F+  V  V+ +  S+ + + + +  K  + YD+I +  
Sbjct: 245 NMYHHFCDFLNLYLTQHINN-SFSTDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
            D    CF   +  L    R+    N   I+  ++    + F Q +    ++ +      
Sbjct: 304 YDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNISQEGP--- 360

Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
           +DG   K R+ I++R    R   N  E+    + +  F V V +      G L +   T 
Sbjct: 361 KDG---KLRVTILARSTEYRKILNQNELVNALKTVSTFEVRVVDYKYRELGFLDQLRITH 417

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
           N  D+F+ +HGA +T+++FLP+ A   ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445


>sp|Q6GQ23|EOGT_XENLA EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Xenopus laevis GN=eogt PE=2 SV=1
          Length = 525

 Score = 42.7 bits (99), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 86/204 (42%), Gaps = 15/204 (7%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVI-TNKQSWWINKYQNILKKLSRYDIIDIDN 201
           N++H F D +  LYI       F+  +  V+ T     + + + +  K  + Y+I  +  
Sbjct: 243 NMYHHFCDFV-NLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYEITHLKA 301

Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
            D+   CF   +  L    R+    N       +    FR F  S + L +    +    
Sbjct: 302 YDNKRVCFKDAVFALLPRMRYGLFYNTPLISHCHGSGLFRAF--SQHVLHRLNITQHPAT 359

Query: 258 EKKKPRLLIVSRKRTRAFTNAEEIAQMGRRL-GFNVVVAEANGNLSRFAETVNYC---DV 313
           E K    ++V     R   N +E+ Q    +  F V V +    +  F E ++     D+
Sbjct: 360 EAKIRVTILVRSTEFRKILNLDELVQALEAVPTFQVKVVDYKYRVLGFLEQLSITHNSDI 419

Query: 314 FLAVHGAAMTNMIFLPENAVFIQV 337
           F+ +HGA +T+++FLP+ AV  ++
Sbjct: 420 FIGMHGAGLTHLLFLPDWAVVFEL 443


>sp|Q08CY9|EOGT_XENTR EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Xenopus tropicalis GN=eogt PE=2 SV=1
          Length = 525

 Score = 42.7 bits (99), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 86/204 (42%), Gaps = 15/204 (7%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVI-TNKQSWWINKYQNILKKLSRYDIIDIDN 201
           N++H F D +  LYI       F+  +  V+ T     + + + +  K  + YDI  +  
Sbjct: 243 NMYHHFCDFV-NLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYDITHLKA 301

Query: 202 QDDIH-CFPRGIIGL---KRHDRELNINSSESPYSIKDFRQFIRSSYSLQKPAAIRLRDG 257
            D+   CF   +  L    R+    N     + +    FR F  S + L +    +    
Sbjct: 302 YDNKRVCFKDAVFALLPRMRYGLFYNTPLISNCHGSGLFRAF--SQHVLHRLNITQQLPK 359

Query: 258 EKKKPRLLIVSRKRTRAFTNAEEIAQ-MGRRLGFNVVVAEANGNLSRFAETV---NYCDV 313
           E K    ++V     R   N +E+   +     F V V +    +  F E +   +  D+
Sbjct: 360 EAKIRITILVRSTEFRKILNLDELVHALEAEPTFQVKVVDYKYRVLGFLEQLEITHNSDI 419

Query: 314 FLAVHGAAMTNMIFLPENAVFIQV 337
           F+ +HGA +T+++FLP+ AV  ++
Sbjct: 420 FIGMHGAGLTHLLFLPDWAVVFEL 443


>sp|Q8BYW9|EOGT_MOUSE EGF domain-specific O-linked N-acetylglucosamine transferase OS=Mus
           musculus GN=Eogt PE=1 SV=1
          Length = 527

 Score = 42.4 bits (98), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 94/209 (44%), Gaps = 25/209 (11%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D +  LY+       F+  V  V+ +  ++ + + + +  K  + YD+I +  
Sbjct: 245 NMYHHFCDFL-NLYLTQHVNNSFSTDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
            D    CF   +  L    R+    N   I+  ++    + F Q +    ++ +      
Sbjct: 304 YDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGP--- 360

Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
           +DG   K R+ I++R    R   N +E+    + +  F V V +      G L +   T 
Sbjct: 361 KDG---KVRVTILARSTEYRKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITH 417

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
           N  D+F+ +HGA +T+++FLP+ A   ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445


>sp|Q5NDL1|EOGT_PANTR EGF domain-specific O-linked N-acetylglucosamine transferase OS=Pan
           troglodytes GN=EOGT PE=2 SV=1
          Length = 527

 Score = 41.6 bits (96), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 92/209 (44%), Gaps = 25/209 (11%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D I  LYI       F+  V  V+ +  S+ + + + +     + YD+I +  
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
            D    CF   +  L    R+    N   I+  ++    + F Q +    ++ +      
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGP--- 360

Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
           +DG   K R+ I++R    R   N  E+    + +  F V + +      G L +   T 
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITH 417

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
           N  D+F+ +HGA +T+++FLP+ A   ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445


>sp|Q5NDL2|EOGT_HUMAN EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Homo sapiens GN=EOGT PE=2 SV=1
          Length = 527

 Score = 41.6 bits (96), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 92/209 (44%), Gaps = 25/209 (11%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D I  LYI       F+  V  V+ +  S+ + + + +     + YD+I +  
Sbjct: 245 NMYHHFCDFI-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
            D    CF   +  L    R+    N   I+  ++    + F Q +    ++ +      
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGP--- 360

Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
           +DG   K R+ I++R    R   N  E+    + +  F V + +      G L +   T 
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITH 417

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
           N  D+F+ +HGA +T+++FLP+ A   ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445


>sp|Q5NDL9|EOGT_CANFA EGF domain-specific O-linked N-acetylglucosamine transferase
           OS=Canis familiaris GN=EOGT PE=2 SV=1
          Length = 527

 Score = 40.8 bits (94), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 92/209 (44%), Gaps = 25/209 (11%)

Query: 146 NIFHDFSDIIIPLYINSR---QFNGQVQFVITNKQSW-WINKYQNILKKLSRYDIIDIDN 201
           N++H F D +  LYI       F+  V  V+ +  S+ + + + +  K  + YD+I +  
Sbjct: 245 NMYHHFCDFV-NLYITQHVNNSFSTDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 303

Query: 202 QDDIH-CFPRGIIGL---KRHDRELN---INSSESPYSIKDFRQFIRSSYSLQKPAAIRL 254
            D    CF   +  L    R+    N   I+  ++    + F Q +    ++ +      
Sbjct: 304 YDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGP--- 360

Query: 255 RDGEKKKPRLLIVSRK-RTRAFTNAEEIAQMGRRLG-FNVVVAEAN----GNLSRFAETV 308
           +DG   K R+ I++R    R   N  E+    + +    V + +      G L +   T 
Sbjct: 361 KDG---KIRVTILARSTEYRKILNQNELVNALKTVSTLEVQIVDYKYKELGFLDQLRITH 417

Query: 309 NYCDVFLAVHGAAMTNMIFLPENAVFIQV 337
           N  D+F+ +HGA +T+++FLP+ A   ++
Sbjct: 418 N-TDIFIGMHGAGLTHLLFLPDWAAVFEL 445


>sp|Q5NDE7|GTDC2_XENLA Glycosyltransferase-like domain-containing protein 2 OS=Xenopus
           laevis GN=gtdc2 PE=2 SV=1
          Length = 578

 Score = 40.4 bits (93), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 51/270 (18%), Positives = 111/270 (41%), Gaps = 37/270 (13%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQF-NGQVQFVITNKQSWWINKYQNILKK 190
           VP + L  N     N+ H F D ++P++   +QF +   +  +   + W    +  + K 
Sbjct: 148 VPDVALIMNRFNPDNLMHVFHDDLLPIFYTIQQFPDLDFESRLFFMEGWNEGLHFELYKF 207

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKR----------HDRELNINSSESPYSIKDFR 237
           +S    +    +     + CF +  +GL +            +    N   S   I+ F 
Sbjct: 208 MSNKQPLLKEQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRHFA 267

Query: 238 QFIRSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVA 295
           +F+    ++         D    +  +++ SR   R   N AE +  + +      + V+
Sbjct: 268 KFMMGKLNIT-------LDQNAAEAYIVLFSRSMNRLIVNEAELLLALAQEFQMKTITVS 320

Query: 296 EANGNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPA 355
             + + S     ++   + +++HGA +   +FLP+ AV +++ P+G        ++  P 
Sbjct: 321 LEDHSFSDIVRLLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYG-----INPEHYTPY 375

Query: 356 K------AMKLRYLEYKIKLDESTLIQQYP 379
           K       M+L+Y+ ++   +E+T+   YP
Sbjct: 376 KTLSTLPGMELQYVAWQNTEEENTIT--YP 403


>sp|Q5NDE8|GTDC2_CHICK Glycosyltransferase-like domain-containing protein 2 OS=Gallus
           gallus GN=GTDC2 PE=2 SV=1
          Length = 577

 Score = 37.7 bits (86), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 53/267 (19%), Positives = 107/267 (40%), Gaps = 31/267 (11%)

Query: 133 VPAI-LFSNGGYAGNIFHDFSDIIIPLYINSRQFNG-QVQFVITNKQSWWINKYQNILKK 190
           VP + L +N     N+ H F D ++P+Y   +QF     +  +   + W    + ++ K 
Sbjct: 148 VPDVALIANRFNPDNLMHVFHDDLLPIYYTMQQFTDLDPETRLFFMEGWSEGVHFDLYKL 207

Query: 191 LSRYDII---DIDNQDDIHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFI 240
           LS    +    +     + CF +  +GL +     +      + P      S  + RQF 
Sbjct: 208 LSNKQPLLREQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFT 267

Query: 241 RSSYSLQKPAAIRLRDGEKKKPRLLIVSRKRTRAFTN-AEEIAQMGRRLGF-NVVVAEAN 298
           +  + +QK          ++   +++ SR   R   N AE I  + +      + V+   
Sbjct: 268 K--FMMQKLNVSLEESSSEE--YIVVFSRTINRLILNEAELILALAQEFQMKTITVSLEE 323

Query: 299 GNLSRFAETVNYCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK-- 356
            + S     ++   + +++HGA +   +FLP  A  +++ P+         ++  P K  
Sbjct: 324 HSFSDIVRLISNASMLVSMHGAQLVMSLFLPRGATVVELFPYA-----INPEHYTPYKTL 378

Query: 357 ----AMKLRYLEYKIKLDESTLIQQYP 379
                M L+Y+ ++    E T+   YP
Sbjct: 379 ATLPGMDLQYIAWQNTAREDTVT--YP 403


>sp|Q5NDE5|GTDC2_DANRE Glycosyltransferase-like domain-containing protein 2 OS=Danio rerio
           GN=gtdc2 PE=2 SV=1
          Length = 578

 Score = 37.4 bits (85), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 48/247 (19%), Positives = 104/247 (42%), Gaps = 25/247 (10%)

Query: 146 NIFHDFSDIIIPLYINSRQF---NGQVQFVITNKQSWWINKYQNILKKLSRYDIIDIDNQ 202
           N+ H F D ++P+Y   +Q+   + + + V    + W    + ++ + LS    +  D  
Sbjct: 162 NLMHIFHDDLLPVYYTMQQYSDLDDEARLVFM--EGWGEGAHFDLYRLLSSKQPLLKDQL 219

Query: 203 DD---IHCFPRGIIGLKRHDR--ELNINSSESP-----YSIKDFRQFIRSSYSLQKPAAI 252
                + CF +  +GL +     +      + P      S  + RQF  +S+ +++    
Sbjct: 220 KTFGKLMCFTKSYVGLSKMTTWYQYGFVQPQGPKANILISGNEIRQF--ASFLMERLNIT 277

Query: 253 RLRDGEKKKPRLLIVSRKRTRAFTNAEEIAQMGRRLGF---NVVVAEANGNLSRFAETVN 309
           R  + E      ++V ++ T      E    +     F    V V+    +     + ++
Sbjct: 278 REEEEEDDD--YIVVFKRTTNRLILNEAELLLALAQEFQMRTVTVSLEEQSFDNIIQIIS 335

Query: 310 YCDVFLAVHGAAMTNMIFLPENAVFIQVVPFGGFAWLARTDYEEPAK--AMKLRYLEYKI 367
              + +++HGA M   +FLP  A  +++ P+G       T Y+  A    M L+Y+ ++ 
Sbjct: 336 RAAMLVSMHGAQMITSMFLPRGAAVVELFPYGVNP-EQYTPYKTLASLPGMDLQYVAWRN 394

Query: 368 KLDESTL 374
            ++E+T+
Sbjct: 395 TMEENTV 401


>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
           24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
           GN=rse-1 PE=3 SV=2
          Length = 1209

 Score = 33.5 bits (75), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 46/114 (40%), Gaps = 5/114 (4%)

Query: 59  VCSVTKGSDFCEINNDIRIDGSSATVFIASSLADQ---TEWIIRPYARKYDHVAMKRVNE 115
           V S T G+    I   +     S  +  A +LA Q    + +I+ + +   H+   RVNE
Sbjct: 494 VLSFTNGTLVLSIGETVEEVSDSGFLTTAPTLAVQQMGEDGLIQVHPKGIRHIVQGRVNE 553

Query: 116 WSVKSAASLLQCNQNHSVPAILFSNGGYAGNIFHDFSDIIIPLYINSRQFNGQV 169
           W      S++    N +   I  S+G      F   SD  +  Y   ++ +G V
Sbjct: 554 WPAPQHRSIVAATANENQVVIALSSGEIV--YFEMDSDGSLAEYDEKKEMSGTV 605


>sp|P47449|Y207_MYCGE Putative metallophosphoesterase MG207 OS=Mycoplasma genitalium
           (strain ATCC 33530 / G-37 / NCTC 10195) GN=MG207 PE=3
           SV=2
          Length = 163

 Score = 32.3 bits (72), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 19/103 (18%)

Query: 292 VVVAEANGNLSRFAETVNYCDVFLAVH-GAAMTNMIFLPENAVF------------IQVV 338
           +V+A+ +G   R+ E  NY +  + +H G  MT   F+ +NA F            I++ 
Sbjct: 5   LVIADTHGQNQRWIELKNYHNPDVIIHAGDHMTTKQFMDQNATFWVAGNNDSIGNEIEIF 64

Query: 339 PFGGFAWLARTDYEEPAKAMKLRYLEYKIKLDESTLIQQYPLD 381
             G   ++    ++ P   +K  Y    +K       QQYP D
Sbjct: 65  QLGQINFVLMHGHQAPRDNLKKWYQLLVLKA------QQYPCD 101


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.323    0.137    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 151,539,964
Number of Sequences: 539616
Number of extensions: 6104139
Number of successful extensions: 18662
Number of sequences better than 100.0: 27
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 18643
Number of HSP's gapped (non-prelim): 27
length of query: 429
length of database: 191,569,459
effective HSP length: 120
effective length of query: 309
effective length of database: 126,815,539
effective search space: 39186001551
effective search space used: 39186001551
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 63 (28.9 bits)