BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy6128
         (529 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|345487243|ref|XP_001599461.2| PREDICTED: hypothetical protein LOC100114438 [Nasonia vitripennis]
          Length = 2706

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 304/424 (71%), Positives = 358/424 (84%), Gaps = 6/424 (1%)

Query: 111  FGVDPRMEEHS-DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGM 169
            F  D ++E  S D  + +Y F G+GGP  +    G WCCR GGTE PT EHL++G CQG+
Sbjct: 1409 FSRDIKLENESRDQEESNYKFRGDGGPAKMSPGVGSWCCRRGGTEQPTPEHLREGCCQGL 1468

Query: 170  RTQDEMLE---PKEPNNNEEPATVKAEDPNSK--EMLDHIERLKNNMRTEVPDCKCFASD 224
            +T+DE  E    K    NE+ A  KA + ++   ++ +H+E+LKNN+RTEVPDC CF++D
Sbjct: 1469 QTRDEFSEDSPQKSEVKNEDSAGGKASNGSTTGTKLQEHLEKLKNNVRTEVPDCDCFSAD 1528

Query: 225  KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
            K PPEPGSYY+HLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW
Sbjct: 1529 KCPPEPGSYYSHLGAAASLPDLRNDLERRTGLKGHAIRFEKVVYTGKEGKTTQGCPMAKW 1588

Query: 285  VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
            VIRR+ +EEK+L IVKHRQGH C+TAWIVV +VAWEGVP +++D +Y++L++KLN++GLP
Sbjct: 1589 VIRRSGIEEKILTIVKHRQGHKCATAWIVVAMVAWEGVPNHEADRIYSLLSHKLNRFGLP 1648

Query: 345  TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 404
            TTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ
Sbjct: 1649 TTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 1708

Query: 405  EIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAH 464
            E+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAH
Sbjct: 1709 EVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAH 1768

Query: 465  SHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
            +HRDLHNMNNGCTVVVSLTKHRS SKPDDEQLHVLPLYIMDDSDEFG+KE QE K+ +GA
Sbjct: 1769 AHRDLHNMNNGCTVVVSLTKHRSFSKPDDEQLHVLPLYIMDDSDEFGSKEGQEAKIKSGA 1828

Query: 525  IENL 528
            IE L
Sbjct: 1829 IEVL 1832


>gi|307188349|gb|EFN73124.1| Protein TET2 [Camponotus floridanus]
          Length = 1632

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 297/412 (72%), Positives = 350/412 (84%), Gaps = 16/412 (3%)

Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPKEPNNNEEP 187
           Y F G+GGP  +    G WCCR GGT+ P+ EHLKDG CQG++T+DEML      ++ + 
Sbjct: 340 YKFRGDGGPAKVSPETGSWCCRRGGTKQPSPEHLKDGCCQGLQTKDEMLA-----DSPQA 394

Query: 188 ATVKAEDPNS-----------KEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTH 236
           A +K E P+S            ++ DH+++LKNN+RTEVPDC CF +DK PPEPGSYYTH
Sbjct: 395 AELKNEGPHSPRTPASAATTTTKLQDHLDKLKNNVRTEVPDCNCFPADKCPPEPGSYYTH 454

Query: 237 LGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLL 296
           LGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW+IRR+ ++EK+L
Sbjct: 455 LGAAASLPDLRNDLERRTGLKGDAIRFEKVIYTGKEGKTTQGCPMAKWIIRRSGMDEKIL 514

Query: 297 LIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRT 356
            IVKHRQGH C+TAWIVV +VAWEGVP +++D +Y++LT+KLN++GLPTTRRC TNEPRT
Sbjct: 515 TIVKHRQGHKCATAWIVVAMVAWEGVPTHEADRIYSLLTHKLNRFGLPTTRRCGTNEPRT 574

Query: 357 CACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATT 416
           CACQGLDPD CGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT 
Sbjct: 575 CACQGLDPDNCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATL 634

Query: 417 ISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGC 476
           +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGC
Sbjct: 635 LSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGC 694

Query: 477 TVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           TVVVSLTKHR+LSKP+DEQLHVLPLYIMDD+DEFG+KE QE+KV +GA+E L
Sbjct: 695 TVVVSLTKHRALSKPEDEQLHVLPLYIMDDTDEFGSKEGQEKKVRSGALEIL 746


>gi|383857295|ref|XP_003704140.1| PREDICTED: uncharacterized protein LOC100883443 [Megachile
           rotundata]
          Length = 1646

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 297/418 (71%), Positives = 352/418 (84%), Gaps = 8/418 (1%)

Query: 114 DPRMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQD 173
           D R +E S+     Y F G+GGP  +    G WCCR GGTE PT EHL+DG CQG++T+D
Sbjct: 338 DQRSQESSN-----YKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLRDGCCQGLQTRD 392

Query: 174 EMLEP---KEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
           E+L     K    NE P + ++    + ++ DH+++LKNN+RTEVPDC CF +DK PPEP
Sbjct: 393 EILADSADKSDVKNEGPQSPRSAASTTTKLQDHLDKLKNNVRTEVPDCNCFPADKCPPEP 452

Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
           GSYYTHLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ 
Sbjct: 453 GSYYTHLGAAASLPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSG 512

Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
           LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC 
Sbjct: 513 LEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCG 572

Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
           TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+M
Sbjct: 573 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERM 632

Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
           H+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLH
Sbjct: 633 HVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLH 692

Query: 471 NMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           NMNNGCTVVV+LTKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV  G++E L
Sbjct: 693 NMNNGCTVVVTLTKHRNLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRGGSVEVL 750


>gi|380029496|ref|XP_003698406.1| PREDICTED: uncharacterized protein LOC100866593 [Apis florea]
          Length = 1865

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 292/406 (71%), Positives = 349/406 (85%), Gaps = 4/406 (0%)

Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEP---KEPNN 183
           +Y F G+GGP  +    G WCCR GGTE PT EHL++G CQG++T+DE+L     K    
Sbjct: 556 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADAADKSDVK 615

Query: 184 NEEPATVKAED-PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
           NE P + ++   P++ ++ DH+E+LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 616 NEGPQSPRSGGAPSTTKLQDHLEKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 675

Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
           LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 676 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 735

Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
           QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 736 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 795

Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
           DP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY 
Sbjct: 796 DPETCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 855

Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
           +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 856 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 915

Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           TKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV  GA+E L
Sbjct: 916 TKHRTLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRAGAVEVL 961


>gi|328780619|ref|XP_396330.4| PREDICTED: hypothetical protein LOC412878 [Apis mellifera]
          Length = 1695

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 293/414 (70%), Positives = 346/414 (83%), Gaps = 20/414 (4%)

Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLE--------- 177
           +Y F G+GGP  +    G WCCR GGTE PT EHL++G CQG++T+DE+L          
Sbjct: 386 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADGTDKSDVK 445

Query: 178 ---PKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYY 234
              P+ P N   P+T K        + DH+E+LKNN+R+EVPDC CF +DK PPEPGSYY
Sbjct: 446 NEGPQSPRNGGAPSTTK--------LQDHLEKLKNNVRSEVPDCNCFPADKCPPEPGSYY 497

Query: 235 THLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEK 294
           THLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK
Sbjct: 498 THLGAAASLPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEK 557

Query: 295 LLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEP 354
           +L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEP
Sbjct: 558 ILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEP 617

Query: 355 RTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLA 414
           RTCACQGLDP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LA
Sbjct: 618 RTCACQGLDPETCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLA 677

Query: 415 TTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNN 474
           T +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNN
Sbjct: 678 TLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNN 737

Query: 475 GCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           GCTVVV++TKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV  GA+E L
Sbjct: 738 GCTVVVTMTKHRTLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRAGAVEVL 791


>gi|340722271|ref|XP_003399531.1| PREDICTED: hypothetical protein LOC100642293 [Bombus terrestris]
          Length = 1697

 Score =  639 bits (1648), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 290/406 (71%), Positives = 345/406 (84%), Gaps = 4/406 (0%)

Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML----EPKEPN 182
           +Y F G+GGP  +    G WCCR GGTE PT EHL++G CQG++T+DE+L    E  +  
Sbjct: 391 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADSAEKSDVK 450

Query: 183 NNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
           N    +      P++ ++ DH+++LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 451 NEGTQSPRTGSVPSTTKLQDHLDKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 510

Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
           LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 511 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 570

Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
           QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 571 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 630

Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
           DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY 
Sbjct: 631 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 690

Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
           +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 691 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 750

Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           TKHRSLSKP++EQLHVLPLYIMD +DE G+KE Q+EKV  GA+E L
Sbjct: 751 TKHRSLSKPEEEQLHVLPLYIMDTTDENGSKEGQDEKVRAGAVEVL 796


>gi|350416717|ref|XP_003491069.1| PREDICTED: hypothetical protein LOC100741227 [Bombus impatiens]
          Length = 1697

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 289/406 (71%), Positives = 345/406 (84%), Gaps = 4/406 (0%)

Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML----EPKEPN 182
           +Y F G+GGP  +    G WCCR GGTE PT EHL++G CQG++T+DE+L    +  +  
Sbjct: 391 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADSADKSDVK 450

Query: 183 NNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
           N    +      P++ ++ DH+++LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 451 NEGTQSPRTGSVPSTTKLQDHLDKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 510

Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
           LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 511 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 570

Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
           QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 571 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 630

Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
           DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY 
Sbjct: 631 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 690

Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
           +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 691 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 750

Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           TKHRSLSKP++EQLHVLPLYIMD +DE G+KE Q+EKV  GA+E L
Sbjct: 751 TKHRSLSKPEEEQLHVLPLYIMDTTDENGSKEGQDEKVRAGAVEVL 796


>gi|242005152|ref|XP_002423436.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
 gi|212506514|gb|EEB10698.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
          Length = 1861

 Score =  620 bits (1599), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 294/425 (69%), Positives = 344/425 (80%), Gaps = 11/425 (2%)

Query: 116  RMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEM 175
            R EE++    + Y++ GEGGP SL  + GPWCCR GG EPPT +H K G C+G +T DE 
Sbjct: 833  RGEENNKLIAEDYLYQGEGGPISLNSTKGPWCCRMGGIEPPTDDHAKIGNCKGHKTADEF 892

Query: 176  ---LEPKEPNNNEEPATVKAEDPNSKEML--------DHIERLKNNMRTEVPDCKCFASD 224
               ++  +  N  E   VK    N+   L        +++ERLKNN++T VP CKCF  D
Sbjct: 893  SSDVKKLDVENLNEKLRVKKFYENTNLKLSPQENFQEENMERLKNNIKTNVPHCKCFPPD 952

Query: 225  KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
            K PPEPGSYYTHLGAAASL DLRKD+E R+G  GKALR EKI YTGKEGKTT+GCPLAKW
Sbjct: 953  KSPPEPGSYYTHLGAAASLSDLRKDLESRTGQTGKALRFEKICYTGKEGKTTRGCPLAKW 1012

Query: 285  VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
            VIRR+ L+EK+L+IVKHR GHTCSTAWIVV +VAW+GVP  ++D +YA+LT+KLNK+GLP
Sbjct: 1013 VIRRSGLDEKVLIIVKHRPGHTCSTAWIVVCLVAWDGVPTPEADRIYALLTHKLNKFGLP 1072

Query: 345  TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 404
            T RRCATNE RTCACQGLDP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ
Sbjct: 1073 TIRRCATNETRTCACQGLDPNTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 1132

Query: 405  EIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAH 464
            ++EE+MH+LAT +SPLY  LAP A++NQ  FEREA+ECRLGFKPGRPFSGVTAC DFCAH
Sbjct: 1133 DVEERMHVLATLLSPLYNTLAPEAYSNQTSFEREAAECRLGFKPGRPFSGVTACIDFCAH 1192

Query: 465  SHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
            +HRDLHNMNNGCTVV +LTKHR+LSKP+DEQLHVLP YI+DD+DEFG K +QEEK   G+
Sbjct: 1193 AHRDLHNMNNGCTVVFTLTKHRTLSKPEDEQLHVLPHYILDDTDEFGCKASQEEKYKNGS 1252

Query: 525  IENLN 529
            IE LN
Sbjct: 1253 IECLN 1257


>gi|328712256|ref|XP_001947546.2| PREDICTED: hypothetical protein LOC100159694 [Acyrthosiphon pisum]
          Length = 2023

 Score =  585 bits (1509), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 263/424 (62%), Positives = 339/424 (79%), Gaps = 16/424 (3%)

Query: 108  KVPFGVDPRMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQ 167
            ++PFG+DPR+ +     K  YVF G GGP  +   AGPWCCR GGT+ PT++HL DG C 
Sbjct: 1160 RLPFGIDPRVTQ-----KNGYVFCGNGGPNPIDVVAGPWCCRMGGTDTPTTKHLSDGCCH 1214

Query: 168  GMRTQDEMLEPKEPNNNEEPATVKAEDP-NSKEMLDHIE-RLKNNMRTEVPDCKCFASDK 225
            G++T DE ++P E         +K E+  N+ +  D+++ + K N++  +PDC CF +D+
Sbjct: 1215 GLKTLDEGIDPVE---------MKQENGLNNSQCSDNLDDKQKTNIKATIPDCNCFPTDQ 1265

Query: 226  LPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWV 285
             PPEPG +YTHLG+A SL +LR ++E  SG +G  +RMEK+LYTGKEGKTTQGCPLAKWV
Sbjct: 1266 APPEPGPFYTHLGSAYSLIELRTNMENMSGIRGNGIRMEKVLYTGKEGKTTQGCPLAKWV 1325

Query: 286  IRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPT 345
            IRR+S +EKLL++VK+R+GH C  +WIV+ IV+WEG+  +++D +Y +L++KLNKYG+PT
Sbjct: 1326 IRRSSTDEKLLVVVKNRRGHKCQHSWIVICIVSWEGILSDEADFLYTMLSHKLNKYGVPT 1385

Query: 346  TRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE 405
            TRRC TN+PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK VRKFRLSVR+EEQE
Sbjct: 1386 TRRCGTNDPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKDVRKFRLSVRTEEQE 1445

Query: 406  IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHS 465
            +EE++H+LAT +SPLYK+LAP ++ NQ Q ERE S+CRLG KPGRPF+ VTAC DFCAH+
Sbjct: 1446 LEERLHVLATNLSPLYKSLAPRSYNNQIQCEREGSDCRLGLKPGRPFASVTACIDFCAHA 1505

Query: 466  HRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAI 525
            HRD HNM+NGCTVVV+L KHR   KPDDEQLHVLPLY++D+SDEFG+K+AQ +K   G++
Sbjct: 1506 HRDFHNMHNGCTVVVTLNKHRGFQKPDDEQLHVLPLYVVDESDEFGDKQAQSDKFKNGSV 1565

Query: 526  ENLN 529
            E L+
Sbjct: 1566 EMLS 1569


>gi|357624916|gb|EHJ75511.1| hypothetical protein KGM_05166 [Danaus plexippus]
          Length = 2066

 Score =  558 bits (1438), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 270/452 (59%), Positives = 325/452 (71%), Gaps = 60/452 (13%)

Query: 106  DLKVPFGVDPRMEEHSDSGKK-------------SYVFAGEGGPCSLVDSAGPWCCRGGG 152
            DLK P+ ++    EHS  G K              Y+FAGEGGP +  +  G  CCR G 
Sbjct: 902  DLKPPYYIEQIKSEHSPPGHKIYKNLLYGPPRSEPYMFAGEGGPNAFRNEIGYACCRQGS 961

Query: 153  TEPPTSEHLKDGLCQGMRTQDEMLE--------------PKEPNNNEEPATVKAEDPN-S 197
             + P  EHL+DG C G++T+DE+LE              P  P ++  P T K    N S
Sbjct: 962  VKKPPPEHLRDGACAGLQTKDEILEEDPDSTDNSKTPSKPGTPISDLFPKTTKENQFNYS 1021

Query: 198  KEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYK 257
            KE LD++ERLKNN RTEVPDC CF +DK PPEPGSYYTHL                    
Sbjct: 1022 KEYLDNLERLKNNSRTEVPDCNCFPADKNPPEPGSYYTHL-------------------- 1061

Query: 258  GKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIV 317
                        GKEGKT QGCP+AKW+IRR+S  EK+L +VK R GH CST+WIVV +V
Sbjct: 1062 ------------GKEGKTAQGCPMAKWIIRRSSYTEKVLAVVKFRNGHKCSTSWIVVCLV 1109

Query: 318  AWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSW 377
            AWEG+P +++D  Y +L++KLN+YGLPTTRRCATNE RTCACQGLDP+TCGAS+SFGCSW
Sbjct: 1110 AWEGIPQSEADLDYTLLSHKLNRYGLPTTRRCATNENRTCACQGLDPETCGASYSFGCSW 1169

Query: 378  SMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFER 437
            SMYYNGCKYARSKTVRKFRLSV++EE EIEE+MH+LAT +SPLY  LAP +F NQCQFE+
Sbjct: 1170 SMYYNGCKYARSKTVRKFRLSVKTEESEIEERMHVLATLLSPLYMNLAPKSFENQCQFEK 1229

Query: 438  EASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLH 497
            EAS+CRLGFKPGRPFSGVTAC DFCAH+HRDLHNMNNGCT VV+L KHR+L+KP+DEQLH
Sbjct: 1230 EASDCRLGFKPGRPFSGVTACIDFCAHAHRDLHNMNNGCTAVVTLAKHRALTKPNDEQLH 1289

Query: 498  VLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            VLPLY++D +DEFG+KE QEEK+ +GA+E L+
Sbjct: 1290 VLPLYVLDTTDEFGSKEGQEEKIASGALEILD 1321


>gi|307213413|gb|EFN88849.1| Protein TET2 [Harpegnathos saltator]
          Length = 1214

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 246/303 (81%), Positives = 281/303 (92%)

Query: 227 PPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVI 286
           PPEPGSYYTHLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW+I
Sbjct: 8   PPEPGSYYTHLGAAASLPDLRNDLERRTGLKGDAIRFEKVIYTGKEGKTTQGCPMAKWII 67

Query: 287 RRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTT 346
           RR+ ++EK+L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L +KLN++GLPTT
Sbjct: 68  RRSGIDEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLCHKLNRFGLPTT 127

Query: 347 RRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEI 406
           RRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+
Sbjct: 128 RRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEV 187

Query: 407 EEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSH 466
           EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSH
Sbjct: 188 EERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSH 247

Query: 467 RDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
           RDLHNMNNGCTVVVSLTKHR+LSKP+DEQLHVLPLYIMDD+DEFG+KE QE+K+ +GA+E
Sbjct: 248 RDLHNMNNGCTVVVSLTKHRTLSKPEDEQLHVLPLYIMDDTDEFGSKEGQEKKIRSGAVE 307

Query: 527 NLN 529
           NL+
Sbjct: 308 NLS 310


>gi|195020981|ref|XP_001985305.1| GH14579 [Drosophila grimshawi]
 gi|193898787|gb|EDV97653.1| GH14579 [Drosophila grimshawi]
          Length = 2971

 Score =  505 bits (1301), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 249/421 (59%), Positives = 308/421 (73%), Gaps = 25/421 (5%)

Query: 128  YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRT--------QDEMLEP- 178
            Y + GEG P +     G  CCR GGT PPT+EHLKDG C G+          +DE+ E  
Sbjct: 1613 YPYLGEGKPIN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGITPKEELLDEDELAEAH 1668

Query: 179  -------KEPNNNEEP-ATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
                   K+   +E P   VK E  N   M D  +RL+   +TE+P+C+CF SDK PPEP
Sbjct: 1669 NGIKAKSKKQKQDEIPEIVVKHEKIN--PMFDTTDRLEKGNKTEIPECECFQSDKNPPEP 1726

Query: 231  GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
            G+YYTHLG A+SL +LR++ E+R    G+ LR+EKI+YTGKEGKTTQGCP+AKWVIRRA 
Sbjct: 1727 GTYYTHLGTASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTTQGCPVAKWVIRRAD 1786

Query: 291  LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
             EEK+L++VK R GH C  A+IVV +VAW+GVP  ++D  Y  L  KLNKYGLPTTRRCA
Sbjct: 1787 PEEKILVVVKKRPGHRCIAAYIVVCMVAWDGVPRLEADNAYKNLIPKLNKYGLPTTRRCA 1846

Query: 351  TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
            TNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M
Sbjct: 1847 TNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHM 1906

Query: 411  HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
            +L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLH
Sbjct: 1907 NLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLH 1966

Query: 471  NMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
            NM +GCTV V+L K  +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L
Sbjct: 1967 NMQDGCTVHVALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQML 2026

Query: 529  N 529
            +
Sbjct: 2027 D 2027


>gi|195429100|ref|XP_002062602.1| GK17629 [Drosophila willistoni]
 gi|194158687|gb|EDW73588.1| GK17629 [Drosophila willistoni]
          Length = 2132

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 240/403 (59%), Positives = 302/403 (74%), Gaps = 20/403 (4%)

Query: 147  CCRGGGTEPPTSEHLKDGLCQGMRTQ--DEMLEP---KEPNNN-------------EEPA 188
            CCR GGT PPT+EHLKDG C G+  Q  +E+L+     +P+NN             +E  
Sbjct: 884  CCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDDLADPHNNSSVKSGKSKKHKQDEIP 943

Query: 189  TVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRK 248
             +  +      M D  +RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL +LR+
Sbjct: 944  EIIVKHEKINPMFDTTDRLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMELRR 1003

Query: 249  DIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCS 308
            + EER    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C 
Sbjct: 1004 EFEERCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCI 1063

Query: 309  TAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCG 368
             A+IVV +VAW+G+P  ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP+T G
Sbjct: 1064 AAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSG 1123

Query: 369  ASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGA 428
            AS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P +
Sbjct: 1124 ASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEETAIEDHMNLIATLLAPVFKQVCPRS 1183

Query: 429  FTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HR 486
            + NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R
Sbjct: 1184 YDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPNNR 1243

Query: 487  SLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
                PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L+
Sbjct: 1244 DSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 1286


>gi|195377914|ref|XP_002047732.1| GJ11762 [Drosophila virilis]
 gi|194154890|gb|EDW70074.1| GJ11762 [Drosophila virilis]
          Length = 2228

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 246/421 (58%), Positives = 309/421 (73%), Gaps = 25/421 (5%)

Query: 128  YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--------DEMLEP- 178
            Y + GEG P +    +G  CCR GGT PPT+EHLKDG C G+  Q        DE+ E  
Sbjct: 876  YAYLGEGKPLN----SGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDELAEAH 931

Query: 179  -------KEPNNNEEP-ATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
                   K+   +E P   VK E  N   M D  +RL+   +TE+P+C+CF SDK PPEP
Sbjct: 932  NGVKAKSKKQKQDEIPEIIVKHEKIN--PMFDTTDRLEKGNKTEIPECECFQSDKNPPEP 989

Query: 231  GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
            G+YYTHLG A++L +LR++ E+R    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA 
Sbjct: 990  GTYYTHLGTASTLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRAD 1049

Query: 291  LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
             EEK+L++VK R GH C  A+IVV +VAW+G+P  ++D  Y  L  KLNKYGLPTTRRCA
Sbjct: 1050 PEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCA 1109

Query: 351  TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
            TNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M
Sbjct: 1110 TNENRTCACQGLDPESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHM 1169

Query: 411  HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
            +L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLH
Sbjct: 1170 NLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLH 1229

Query: 471  NMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
            NM +GCTV V+L K  +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L
Sbjct: 1230 NMQDGCTVHVALLKPTNRDSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQML 1289

Query: 529  N 529
            +
Sbjct: 1290 D 1290


>gi|194865212|ref|XP_001971317.1| GG14498 [Drosophila erecta]
 gi|190653100|gb|EDV50343.1| GG14498 [Drosophila erecta]
          Length = 2186

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 252/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 836  LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 891

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 892  IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 948

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 949  RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1008

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 1009 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1068

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1069 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1128

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IEE M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1129 YARSKTVRKFRLSVKSEEAAIEEHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1188

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1189 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1248

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1249 MDGTDEFESVEGQRDKHRTGAVQMLD 1274


>gi|195492879|ref|XP_002094180.1| GE21689 [Drosophila yakuba]
 gi|194180281|gb|EDW93892.1| GE21689 [Drosophila yakuba]
          Length = 2053

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 252/446 (56%), Positives = 319/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKK-SYVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G   SY + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 711  LEPKIEDMGMLGHGGSYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 766

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 767  IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 823

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 824  RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 883

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 884  IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 943

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 944  EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1003

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1004 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1063

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1064 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1123

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1124 MDGTDEFESVEGQRDKHRTGAVQMLD 1149


>gi|194749274|ref|XP_001957064.1| GF10236 [Drosophila ananassae]
 gi|190624346|gb|EDV39870.1| GF10236 [Drosophila ananassae]
          Length = 2255

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 245/443 (55%), Positives = 315/443 (71%), Gaps = 27/443 (6%)

Query: 113  VDPRMEEHSDSGKKS-YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRT 171
            ++P++E+    G    Y + G  G    +++ G  CCR GGT PPT+EHLKDG C G+  
Sbjct: 901  LEPKLEDMGMLGHGGGYTYLGGAGEGKGLNN-GFSCCRQGGTRPPTAEHLKDGTCLGLGI 959

Query: 172  Q--DEMLEPKE----------PNNNEEPATVKAEDPNS-----------KEMLDHIERLK 208
            Q  +E+L+  E          P           + P+              + D  +RL+
Sbjct: 960  QPKEELLDEDELIDSHGNGLKPGGGAAGKAKGKQKPDEIPEIVVKHEKINPLFDTTDRLE 1019

Query: 209  NNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
               +TE+P+C+CF SDK PPEPG+YYTHLG A+SL +LR++ EER    G+ LR+EKI+Y
Sbjct: 1020 KGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMELRREFEERCNLTGRQLRIEKIVY 1079

Query: 269  TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
            TGKEGKTTQGCP+AKWVIRRA +EEK+L++VK R GH C  A+IVV +VAW+G+P  ++D
Sbjct: 1080 TGKEGKTTQGCPVAKWVIRRADMEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEAD 1139

Query: 329  GVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYAR 388
              Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYAR
Sbjct: 1140 NAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYAR 1199

Query: 389  SKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKP 448
            SKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E+EAS+CRLG +P
Sbjct: 1200 SKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEQEASDCRLGLEP 1259

Query: 449  GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDD 506
            G+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY MD 
Sbjct: 1260 GKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYTMDG 1319

Query: 507  SDEFGNKEAQEEKVNTGAIENLN 529
            +DEF + E Q +K  TGA++ L+
Sbjct: 1320 TDEFESVEGQRDKHRTGAVQMLD 1342


>gi|442629819|ref|NP_001261343.1| CG43444, isoform E [Drosophila melanogaster]
 gi|440215220|gb|AGB94038.1| CG43444, isoform E [Drosophila melanogaster]
          Length = 2866

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 1519 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1574

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 1575 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1631

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 1632 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1691

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 1692 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1751

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1752 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1811

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1812 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1871

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1872 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1931

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1932 MDGTDEFESVEGQRDKHRTGAVQMLD 1957


>gi|442629821|ref|NP_001261344.1| CG43444, isoform F [Drosophila melanogaster]
 gi|440215221|gb|AGB94039.1| CG43444, isoform F [Drosophila melanogaster]
          Length = 2921

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 1574 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1629

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 1630 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1686

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 1687 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1746

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 1747 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1806

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1807 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1866

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1867 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1926

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1927 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1986

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1987 MDGTDEFESVEGQRDKHRTGAVQMLD 2012


>gi|386770417|ref|NP_001246581.1| CG43444, isoform A [Drosophila melanogaster]
 gi|383291702|gb|AFH04252.1| CG43444, isoform A [Drosophila melanogaster]
          Length = 2860

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 1513 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1568

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 1569 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1625

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 1626 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1685

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 1686 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1745

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1746 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1805

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1806 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1865

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1866 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1925

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1926 MDGTDEFESVEGQRDKHRTGAVQMLD 1951


>gi|386770419|ref|NP_001246582.1| CG43444, isoform B [Drosophila melanogaster]
 gi|383291703|gb|AFH04253.1| CG43444, isoform B [Drosophila melanogaster]
          Length = 2915

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 1568 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1623

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 1624 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1680

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 1681 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1740

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 1741 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1800

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1801 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1860

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1861 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1920

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1921 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1980

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1981 MDGTDEFESVEGQRDKHRTGAVQMLD 2006


>gi|386770421|ref|NP_647750.4| CG43444, isoform C [Drosophila melanogaster]
 gi|386770423|ref|NP_001246583.1| CG43444, isoform D [Drosophila melanogaster]
 gi|383291704|gb|AAF47691.4| CG43444, isoform C [Drosophila melanogaster]
 gi|383291705|gb|AFH04254.1| CG43444, isoform D [Drosophila melanogaster]
          Length = 2056

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113  VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
            ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 709  LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 764

Query: 171  TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
             Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 765  IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 821

Query: 206  RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 822  RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 881

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 882  IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 941

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
            ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 942  EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1001

Query: 386  YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1002 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1061

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
             +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 1062 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1121

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            MD +DEF + E Q +K  TGA++ L+
Sbjct: 1122 MDGTDEFESVEGQRDKHRTGAVQMLD 1147


>gi|195336964|ref|XP_002035103.1| GM14104 [Drosophila sechellia]
 gi|194128196|gb|EDW50239.1| GM14104 [Drosophila sechellia]
          Length = 1253

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)

Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
           ++P++E+    G    Y + G EG P +     G  CCR GGT PPT+EHLKDG C G+ 
Sbjct: 77  LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 132

Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
            Q        DE+++                  ++P+   E   VK E  N   M D  +
Sbjct: 133 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKINP--MFDTTD 189

Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
           RL+   +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER    G+ LR+EK
Sbjct: 190 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 249

Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
           I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C  A+IVV +VAW+G+P  
Sbjct: 250 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 309

Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
           ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 310 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 369

Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
           YARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 370 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 429

Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
            +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K  +R    PDDEQ HVLPLY 
Sbjct: 430 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 489

Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
           MD +DEF + E Q +K  TGA++ L+
Sbjct: 490 MDGTDEFESVEGQRDKHRTGAVQMLD 515


>gi|195129473|ref|XP_002009180.1| GI13905 [Drosophila mojavensis]
 gi|193920789|gb|EDW19656.1| GI13905 [Drosophila mojavensis]
          Length = 2290

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 243/419 (57%), Positives = 306/419 (73%), Gaps = 21/419 (5%)

Query: 128  YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--DEML---EPKEPN 182
            Y + GEG P +     G  CCR GGT PPT EHLK G C G+  Q  +E+L   E  E +
Sbjct: 941  YPYLGEGKPLN----NGFSCCRQGGTRPPTEEHLKGGTCLGLSIQPKEELLDEDELAEAH 996

Query: 183  NNEEPATVKAEDPNSKE----------MLDHIERLKNNMRTEVPDCKCFASDKLPPEPGS 232
            N  +  T K +     E          M D  +RL+   +TE+P+C+CF SDK PPEPG+
Sbjct: 997  NGVKAKTKKQKQEEIPEIIVKHEKINPMFDTTDRLEKGNKTEIPECECFQSDKNPPEPGT 1056

Query: 233  YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
            YYTHLG A++L +LR++ EER    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA  E
Sbjct: 1057 YYTHLGTASTLMELRREFEERCHLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADPE 1116

Query: 293  EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
            EK+L++VK R GH C  A+IVV +VAW+G+P  ++D  Y  L  KLNK+GLPTTRRCATN
Sbjct: 1117 EKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRKEADDAYVNLIPKLNKFGLPTTRRCATN 1176

Query: 353  EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHL 412
            E RTCACQGLDP++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M+L
Sbjct: 1177 ENRTCACQGLDPESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNL 1236

Query: 413  LATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNM 472
            +AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM
Sbjct: 1237 IATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNM 1296

Query: 473  NNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
             +GCTV V+L K  +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L+
Sbjct: 1297 QDGCTVHVALLKPSNRDTHLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 1355


>gi|198463152|ref|XP_002135446.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
 gi|198151134|gb|EDY74073.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
          Length = 2141

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/423 (56%), Positives = 304/423 (71%), Gaps = 25/423 (5%)

Query: 128  YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--------DEMLEPK 179
            Y + G+G P +     G  CCR GGT PPT+EHLKDG C G+  Q        DE+++  
Sbjct: 766  YTYLGDGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDELVDAH 821

Query: 180  EPNNNEEPATVKAEDPNS-----------KEMLDHIERLKNNMRTEVPDCKCFASDKLPP 228
                         + P+              + D  +RL+   +TE+P+C+CF SDK PP
Sbjct: 822  NGMKGGAGKAKGKQKPDEIPEIIVKHEKINPLFDTTDRLEKGNKTEIPECECFQSDKNPP 881

Query: 229  EPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRR 288
            EPG+YYTHLG A+SL +LR++ E+R    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRR
Sbjct: 882  EPGTYYTHLGTASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRR 941

Query: 289  ASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRR 348
            A LEEK+L++VK R GH C  A+IVV +VAW+G+P  ++D  Y  L  KLNKYGLPTTRR
Sbjct: 942  ADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRR 1001

Query: 349  CATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEE 408
            CATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+
Sbjct: 1002 CATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIED 1061

Query: 409  KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
             M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRD
Sbjct: 1062 HMNLIATLLAPVFKQVCPRSYDNQTKYEGEASDCRLGLEPGKPFSGVTACLDFCAHSHRD 1121

Query: 469  LHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            LHNM +GCTV V+L K  +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++
Sbjct: 1122 LHNMQDGCTVHVALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESIEGQRDKHRTGAVQ 1181

Query: 527  NLN 529
             L+
Sbjct: 1182 MLD 1184


>gi|157125426|ref|XP_001654335.1| hypothetical protein AaeL_AAEL001921 [Aedes aegypti]
 gi|108882699|gb|EAT46924.1| AAEL001921-PA [Aedes aegypti]
          Length = 1953

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 218/327 (66%), Positives = 262/327 (80%), Gaps = 3/327 (0%)

Query: 205  ERLKNNMRTEVPDCKCFASD--KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
            E+L+   + E PDC CF S   K P EPGSYYTHLGAA+SL +LR++ E R G  GK LR
Sbjct: 1069 EKLEKAHKPEAPDCDCFTSSDTKAPSEPGSYYTHLGAASSLEELRRETETRVGLSGKQLR 1128

Query: 263  MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
            +EK++YTGKEGK++QGCP+AKWVIRR   EEKLL IVK RQGH C  A+IV+ IV W+G+
Sbjct: 1129 IEKVVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFIVKRRQGHRCKAAFIVICIVVWDGI 1188

Query: 323  PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
            P  ++D VY +L+ KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYN
Sbjct: 1189 PTQEADSVYRMLSVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYN 1248

Query: 383  GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
            GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY  +AP AF NQ Q+EREA +C
Sbjct: 1249 GCKYARSKTVRKFRLSVKNEEAEIEERMNILATMLSPLYVTVAPQAFQNQVQYEREAPDC 1308

Query: 443  RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
            RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K      K DDEQLH+LPL
Sbjct: 1309 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKADDEQLHILPL 1368

Query: 502  YIMDDSDEFGNKEAQEEKVNTGAIENL 528
            Y MD +DEF ++E Q++K  TGA++ L
Sbjct: 1369 YTMDTTDEFDSEEGQKKKAETGAVQVL 1395


>gi|170047947|ref|XP_001851464.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870207|gb|EDS33590.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1872

 Score =  475 bits (1222), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 215/327 (65%), Positives = 263/327 (80%), Gaps = 3/327 (0%)

Query: 205  ERLKNNMRTEVPDCKCFAS--DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
            E+L+   + E PDC CF S  +K P EPGSYYTHLG+A++L +LR++ E R G  GK LR
Sbjct: 977  EKLEKAHKPEAPDCDCFNSTDNKAPSEPGSYYTHLGSASTLEELRRETEARVGLTGKQLR 1036

Query: 263  MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
            +EK++YTGKEGK++QGCP+AKWVIRR   EEKLL +VK RQGH C  ++IV+ IV W+G+
Sbjct: 1037 IEKVVYTGKEGKSSQGCPIAKWVIRRVDQEEKLLFVVKRRQGHRCKASFIVICIVVWDGI 1096

Query: 323  PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
            P  ++D VY +L  KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYN
Sbjct: 1097 PTQEADSVYRMLAVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYN 1156

Query: 383  GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
            GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY  +AP AF NQ Q+EREA +C
Sbjct: 1157 GCKYARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDC 1216

Query: 443  RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
            RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K      KPDDEQLH+LPL
Sbjct: 1217 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKPDDEQLHILPL 1276

Query: 502  YIMDDSDEFGNKEAQEEKVNTGAIENL 528
            Y MD +DEF ++E Q++K  TGA++ L
Sbjct: 1277 YTMDTTDEFDSEEGQKKKAETGAVQVL 1303


>gi|158286121|ref|XP_001688023.1| AGAP007180-PA [Anopheles gambiae str. PEST]
 gi|157020316|gb|EDO64672.1| AGAP007180-PA [Anopheles gambiae str. PEST]
          Length = 2328

 Score =  473 bits (1217), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 216/327 (66%), Positives = 264/327 (80%), Gaps = 3/327 (0%)

Query: 205  ERLKNNMRTEVPDCKCFAS--DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
            ++L+ + + E PDC CF+S  DK P EPGSYYTHLG AA+L DLR++ E R G  GK LR
Sbjct: 1326 DKLEKSHKPEAPDCDCFSSGTDKAPSEPGSYYTHLGCAATLEDLRRETELRVGLTGKQLR 1385

Query: 263  MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
            +EK++YTGKEGK++QGCP+AKWVIRR   EEKLL +VK RQGH C  ++IV+ IV W+G+
Sbjct: 1386 IEKVVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFVVKRRQGHRCKASFIVICIVVWDGI 1445

Query: 323  PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
            P +++D VY +L  KLNK+GLPT RRCATNE RTCACQGLDP+ CG S+SFGCSWSMYYN
Sbjct: 1446 PTHEADSVYRMLAVKLNKFGLPTVRRCATNENRTCACQGLDPELCGVSYSFGCSWSMYYN 1505

Query: 383  GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
            GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY  +AP AF NQ Q+EREA +C
Sbjct: 1506 GCKYARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDC 1565

Query: 443  RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
            RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K      KPDDEQLHVLPL
Sbjct: 1566 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKPDDEQLHVLPL 1625

Query: 502  YIMDDSDEFGNKEAQEEKVNTGAIENL 528
            Y MD +DEF ++E Q++K  TGA++ L
Sbjct: 1626 YTMDTTDEFDSEEGQKKKHETGAVQVL 1652


>gi|270007246|gb|EFA03694.1| hypothetical protein TcasGA2_TC013798 [Tribolium castaneum]
          Length = 856

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 207/324 (63%), Positives = 260/324 (80%)

Query: 205 ERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRME 264
           +R  +    E+P C C  + +   EPG++YTHLG A +L +LR D+E R+G KG+A+R+E
Sbjct: 333 KRYSSPYENEIPYCNCVRAGRGAAEPGTFYTHLGCANNLINLRHDLETRTGVKGRAIRIE 392

Query: 265 KILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPL 324
           KI YTGKEGKT QGCP+AKWVIRR+  +EK L+IVKHR GH+C +A+IVV IV W+G+P 
Sbjct: 393 KIRYTGKEGKTAQGCPIAKWVIRRSGSDEKYLIIVKHRPGHSCPSAFIVVCIVMWDGLPQ 452

Query: 325 NQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 384
             SD +Y +LT+KLNK+GL   RRCATNE +TCACQGL+PDTCGASFSFGCSWSMYYNGC
Sbjct: 453 PTSDELYTLLTSKLNKFGLANRRRCATNESKTCACQGLNPDTCGASFSFGCSWSMYYNGC 512

Query: 385 KYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRL 444
           K++RSK VRKFRL+V+ EE+ +EEK+ +LAT +SP+Y++LAP AF NQC FE    ECRL
Sbjct: 513 KFSRSKFVRKFRLNVQPEEKIVEEKLQILATYLSPIYRSLAPVAFRNQCFFEEGGRECRL 572

Query: 445 GFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIM 504
           G +PGRPFSGVTAC DFCAHSH+D HNM NGCTVVV+LTKHR   KP+DEQLHVLPLY++
Sbjct: 573 GLRPGRPFSGVTACLDFCAHSHKDSHNMVNGCTVVVTLTKHRKGEKPEDEQLHVLPLYVV 632

Query: 505 DDSDEFGNKEAQEEKVNTGAIENL 528
           + +DEF ++  QEEK+  G+IE L
Sbjct: 633 EGTDEFDSQGGQEEKIRMGSIEVL 656


>gi|405950810|gb|EKC18772.1| Putative methylcytosine dioxygenase TET2 [Crassostrea gigas]
          Length = 1231

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 195/329 (59%), Positives = 248/329 (75%), Gaps = 2/329 (0%)

Query: 202 DHIERLKNNMRTEVPDCKCFASDKLPPE--PGSYYTHLGAAASLPDLRKDIEERSGYKGK 259
           +H++RL+ N+++E+P C C   D +P E   G YYTHLGAA S+  +R+ +E+R+G KG+
Sbjct: 365 EHLDRLRKNIKSEMPRCSCRGPDYVPSEDVEGPYYTHLGAARSIQAVRELLEKRTGEKGR 424

Query: 260 ALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAW 319
           ++R+EKI YTGKEGK++QGCP+AKW+IRR+  EEK L +V+ R GH C TA I+ V+VAW
Sbjct: 425 SIRIEKIRYTGKEGKSSQGCPIAKWIIRRSGQEEKYLCVVRQRPGHFCETACIIAVLVAW 484

Query: 320 EGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSM 379
           EGVP N +D +Y  L   L   G  T RRC TNE +TCACQG+D    GASFSFGCSWSM
Sbjct: 485 EGVPQNMADDLYQYLRTTLPTNGFETERRCGTNERKTCACQGIDLVRRGASFSFGCSWSM 544

Query: 380 YYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREA 439
           YYNGCK+ARS+  RKF+L   ++E E+E K+  LAT ++PLY+ +AP A++NQ QFE  A
Sbjct: 545 YYNGCKFARSREARKFKLKDTTKEVELEGKLQDLATKMAPLYQQMAPDAYSNQTQFEDTA 604

Query: 440 SECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVL 499
             CRLG + GRPFSGVTAC DFCAHSHRDLHNMNNG TVVV+LTKHR + KPDDEQLH L
Sbjct: 605 RMCRLGNEEGRPFSGVTACVDFCAHSHRDLHNMNNGSTVVVTLTKHRGMGKPDDEQLHTL 664

Query: 500 PLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           PL++MD +DE G  EAQ EK   G++E L
Sbjct: 665 PLHVMDMTDEHGCSEAQFEKARNGSLEVL 693


>gi|427788369|gb|JAA59636.1| Putative thyroid hormone receptor-associated protein complex
           subunit [Rhipicephalus pulchellus]
          Length = 1666

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 201/335 (60%), Positives = 255/335 (76%), Gaps = 1/335 (0%)

Query: 195 PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPP-EPGSYYTHLGAAASLPDLRKDIEER 253
           P +    + +ERL++N + E P C C + +  PP +   YYTHLG+  ++  +R+ +E R
Sbjct: 568 PGADPWWERLERLRSNAKAEPPACDCLSPEDAPPLDKSPYYTHLGSGPTVAAIREMLERR 627

Query: 254 SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
               G ALR+EK+LYTGKEGKT+QGCP+AKWVIRR+S  EK+L +++HRQGH C +A+IV
Sbjct: 628 LNETGSALRIEKVLYTGKEGKTSQGCPVAKWVIRRSSPNEKVLAVLRHRQGHRCLSAYIV 687

Query: 314 VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
           + IVAWEGV  + +D +Y  + +K   +G PT RRC TNE RTCACQG D + CGASFSF
Sbjct: 688 MAIVAWEGVHADMADDLYRTVVHKTVNFGFPTQRRCGTNEQRTCACQGADSENCGASFSF 747

Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQC 433
           GCSWSMYYNGCKYARSK+VRKF+LS +SEEQE+EEK+  LAT ++PLY  +AP ++ NQ 
Sbjct: 748 GCSWSMYYNGCKYARSKSVRKFKLSEQSEEQELEEKLQQLATDMAPLYARVAPESYKNQT 807

Query: 434 QFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDD 493
           +FE E   CRLG KPGRPFSGVTAC DFCAHSH+DLHNMNNGCTVVV+LTKHR   K DD
Sbjct: 808 EFESEGISCRLGLKPGRPFSGVTACVDFCAHSHKDLHNMNNGCTVVVTLTKHRGFEKGDD 867

Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           EQLHVLPLY++D +DE+G K+   EKV  G++E L
Sbjct: 868 EQLHVLPLYVLDATDEYGKKDGFYEKVKAGSLEVL 902


>gi|195167891|ref|XP_002024766.1| GL22434 [Drosophila persimilis]
 gi|194108171|gb|EDW30214.1| GL22434 [Drosophila persimilis]
          Length = 567

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 191/292 (65%), Positives = 237/292 (81%), Gaps = 2/292 (0%)

Query: 240 AASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIV 299
           A+SL +LR++ E+R    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++V
Sbjct: 18  ASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVV 77

Query: 300 KHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCAC 359
           K R GH C  A+IVV +VAW+G+P  ++D  Y  L  KLNKYGLPTTRRCATNE RTCAC
Sbjct: 78  KKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCAC 137

Query: 360 QGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISP 419
           QGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P
Sbjct: 138 QGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAP 197

Query: 420 LYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVV 479
           ++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV 
Sbjct: 198 VFKQVCPRSYDNQTKYEGEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVH 257

Query: 480 VSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           V+L K  +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L+
Sbjct: 258 VALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESIEGQRDKHRTGAVQMLD 309


>gi|296195853|ref|XP_002745572.1| PREDICTED: methylcytosine dioxygenase TET2 [Callithrix jacchus]
          Length = 1998

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 191/321 (59%), Positives = 246/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ LN
Sbjct: 1428 EFGSIEAQEEKKRSGAIQVLN 1448


>gi|195587298|ref|XP_002083402.1| GD13372 [Drosophila simulans]
 gi|194195411|gb|EDX08987.1| GD13372 [Drosophila simulans]
          Length = 907

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 189/287 (65%), Positives = 233/287 (81%), Gaps = 2/287 (0%)

Query: 245 DLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQG 304
           DLR++ EER    G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R G
Sbjct: 2   DLRREFEERCNLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPG 61

Query: 305 HTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDP 364
           H C  A+IVV +VAW+G+P  ++D  Y  L  KLNKYGLPTTRRCATNE RTCACQGLDP
Sbjct: 62  HRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDP 121

Query: 365 DTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKAL 424
           ++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE  IE+ M+L+AT ++P++K +
Sbjct: 122 ESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQV 181

Query: 425 APGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK 484
            P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K
Sbjct: 182 CPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLK 241

Query: 485 --HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
             +R    PDDEQ HVLPLY MD +DEF + E Q +K  TGA++ L+
Sbjct: 242 PGNRDTRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 288


>gi|403275632|ref|XP_003929543.1| PREDICTED: methylcytosine dioxygenase TET2 [Saimiri boliviensis
            boliviensis]
          Length = 1999

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 190/321 (59%), Positives = 245/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1130 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1188

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1189 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1248

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1249 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1308

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1368

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1428

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1429 EFGSVEAQEEKKRSGAIQVLS 1449


>gi|297674086|ref|XP_002815070.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pongo abelii]
          Length = 2023

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1268

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1269 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469


>gi|338722529|ref|XP_001503267.3| PREDICTED: methylcytosine dioxygenase TET2 [Equus caballus]
          Length = 1933

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/321 (59%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGALTNRRCAHNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L V    EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLVDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRNGAIQVLS 1448


>gi|332216742|ref|XP_003257511.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
            [Nomascus leucogenys]
          Length = 1996

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1122 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1180

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1181 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1240

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1241 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1300

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1301 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1360

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1361 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1420

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1421 EFGSVEAQEEKKQSGAIQVLS 1441


>gi|297674088|ref|XP_002815071.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pongo abelii]
          Length = 2002

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|397519747|ref|XP_003830015.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan paniscus]
          Length = 2002

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|332819904|ref|XP_003310448.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan
            troglodytes]
 gi|410352429|gb|JAA42818.1| tet oncogene family member 2 [Pan troglodytes]
          Length = 2002

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|410213066|gb|JAA03752.1| tet oncogene family member 2 [Pan troglodytes]
 gi|410301428|gb|JAA29314.1| tet oncogene family member 2 [Pan troglodytes]
          Length = 2002

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|397519749|ref|XP_003830016.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan paniscus]
          Length = 2023

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1269 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469


>gi|426345124|ref|XP_004040272.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Gorilla gorilla
            gorilla]
          Length = 2002

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|332819906|ref|XP_526645.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan
            troglodytes]
          Length = 2023

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1269 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469


>gi|355749478|gb|EHH53877.1| hypothetical protein EGM_14586 [Macaca fascicularis]
          Length = 1999

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1186 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1245

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1246 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1305

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1306 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1365

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1366 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1425

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1426 EFGSVEAQEEKKRSGAIQVLS 1446


>gi|402870138|ref|XP_003899096.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
            [Papio anubis]
          Length = 2027

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1154 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1212

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1213 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1272

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1273 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1332

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1333 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1392

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1393 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1452

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1453 EFGSVEAQEEKKQSGAIQVLS 1473


>gi|431897123|gb|ELK06385.1| Putative methylcytosine dioxygenase TET2 [Pteropus alecto]
          Length = 2040

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1125 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1183

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+ +EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1184 KSSQGCPIAKWVVRRSCIEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1243

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1244 LTETLKKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1303

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1304 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1363

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1364 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1423

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1424 EFGSIEAQEEKKRNGAIQVLS 1444


>gi|426345126|ref|XP_004040273.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Gorilla gorilla
            gorilla]
          Length = 2023

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1269 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469


>gi|187761317|ref|NP_001120680.1| methylcytosine dioxygenase TET2 isoform a [Homo sapiens]
 gi|239938839|sp|Q6N021.3|TET2_HUMAN RecName: Full=Methylcytosine dioxygenase TET2
 gi|227806663|emb|CAX30492.1| tet oncogene family member 2 [Homo sapiens]
          Length = 2002

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>gi|355687510|gb|EHH26094.1| hypothetical protein EGK_15982 [Macaca mulatta]
          Length = 2003

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1131 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1189

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1190 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1249

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1250 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1309

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1310 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1369

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1370 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1429

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1430 EFGSVEAQEEKKRSGAIQVLS 1450


>gi|444723451|gb|ELW64107.1| Methylcytosine dioxygenase TET2 [Tupaia chinensis]
          Length = 2020

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 201/369 (54%), Positives = 261/369 (70%), Gaps = 17/369 (4%)

Query: 176  LEPKEPNNNEEPATVKAEDPNSKEMLDHIERL-----KNNMRTEV------PDCKCFASD 224
            LE +  +++E+  T +   P     L+   RL     KN + T V      P C+C    
Sbjct: 1117 LEQQAASSSEKTPTKRTAGPVLSNFLESPSRLLDTPIKNLLDTPVKTQYDFPSCRCV-EQ 1175

Query: 225  KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
             +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEGK++QGCP+AKW
Sbjct: 1176 IIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKW 1235

Query: 285  VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
            V+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL  +D +Y+ LT  L KYG  
Sbjct: 1236 VVRRSCDEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLTLADKLYSELTETLRKYGTL 1295

Query: 345  TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSE 402
            T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  RKF+L      E
Sbjct: 1296 TNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPKE 1355

Query: 403  EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
            E+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRPFSGVTAC DFC
Sbjct: 1356 EEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFC 1415

Query: 463  AHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
            AH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D DEFG+ EAQEEK
Sbjct: 1416 AHAHRDLHNMQNGSTLVCTLTREDNREVGGKPEDEQLHVLPLYKVSDVDEFGSAEAQEEK 1475

Query: 520  VNTGAIENL 528
              +GAI+ L
Sbjct: 1476 KRSGAIQVL 1484


>gi|395847439|ref|XP_003796382.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Otolemur
            garnettii]
 gi|395847441|ref|XP_003796383.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Otolemur
            garnettii]
          Length = 2014

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 190/321 (59%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL  V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1186 KSSQGCPIAKWVVRRSCSEEKLLCFVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1245

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1246 LTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1305

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1306 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1365

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1366 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRDIGGKPEDEQLHVLPLYKVSDVD 1425

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1426 EFGSVEAQEEKKRSGAIQVLS 1446


>gi|410957091|ref|XP_003985168.1| PREDICTED: methylcytosine dioxygenase TET2 [Felis catus]
          Length = 2017

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKQSGAIQVLS 1448


>gi|291401335|ref|XP_002717242.1| PREDICTED: tet oncogene family member 2 [Oryctolagus cuniculus]
          Length = 2011

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 191/320 (59%), Positives = 242/320 (75%), Gaps = 6/320 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+EK++YTGKEG
Sbjct: 1130 DFPSCRCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIEKVIYTGKEG 1188

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWVIRR+  EEKLL +V+ R GHTC  A IV++I+ WEG+P + +  +Y+ 
Sbjct: 1189 KSSQGCPIAKWVIRRSCSEEKLLCLVRERAGHTCEAAVIVILILLWEGIPQSLATELYSE 1248

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L  +G  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1249 LTETLKNHGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKVPR 1308

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P+YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLLAPIYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1368

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM+NG TVV +LTK  +R + +KPDDEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMHNGSTVVCTLTKEDNREIGAKPDDEQLHVLPLYKISDVD 1428

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+ EAQEEK   GAIE L
Sbjct: 1429 EFGSVEAQEEKKRNGAIEVL 1448


>gi|395501402|ref|XP_003755084.1| PREDICTED: methylcytosine dioxygenase TET1 [Sarcophilus harrisii]
          Length = 1578

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 192/321 (59%), Positives = 239/321 (74%), Gaps = 6/321 (1%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLG   S+  +R+ +E R G KG+A+R+E ++YTGKE
Sbjct: 826  SELPSCSCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMEARYGEKGRAIRIEVVVYTGKE 884

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y 
Sbjct: 885  GKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLYQ 944

Query: 333  ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
             LT  LNKYG PTTRRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK  
Sbjct: 945  ELTQSLNKYGCPTTRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKNP 1004

Query: 393  RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
            R+FRL      EE+ +E  +  LAT ++P+YK LAP AF NQ + E   S+CRLG K GR
Sbjct: 1005 RRFRLIADDPKEEENLESNLQTLATDVAPVYKKLAPDAFQNQVENEHLGSDCRLGRKDGR 1064

Query: 451  PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
            PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK  +RS+   P DEQLHVLPLY +  +
Sbjct: 1065 PFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDNRSVGVIPKDEQLHVLPLYKISQT 1124

Query: 508  DEFGNKEAQEEKVNTGAIENL 528
            DEFG +E  E K+ TGAI+ L
Sbjct: 1125 DEFGTREGLEAKIKTGAIQVL 1145


>gi|417406864|gb|JAA50073.1| Putative vesicle coat complex copii subunit sec31 [Desmodus rotundus]
          Length = 2036

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADQLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGALTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRNGAIQVLS 1448


>gi|380805593|gb|AFE74672.1| methylcytosine dioxygenase TET2 isoform a, partial [Macaca mulatta]
          Length = 430

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
           + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 40  DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 98

Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
           K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 99  KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 158

Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
           LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 159 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 218

Query: 394 KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
           KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 219 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 278

Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
           FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 279 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 338

Query: 509 EFGNKEAQEEKVNTGAIENLN 529
           EFG+ EAQEEK  +GAI+ L+
Sbjct: 339 EFGSVEAQEEKKRSGAIQVLS 359


>gi|344277247|ref|XP_003410414.1| PREDICTED: methylcytosine dioxygenase TET2 [Loxodonta africana]
          Length = 2013

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1130 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1188

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1189 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1248

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1249 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1308

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEDRAPECRLGLKEGRP 1368

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDMD 1428

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1429 EFGSVEAQEEKKRNGAIQVLS 1449


>gi|345795800|ref|XP_535678.3| PREDICTED: methylcytosine dioxygenase TET2 [Canis lupus familiaris]
          Length = 2018

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1139 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1197

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1198 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1257

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1258 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1317

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1318 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1377

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1378 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1437

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQE+K   GAI+ L+
Sbjct: 1438 EFGSVEAQEKKKQNGAIQVLS 1458


>gi|301782603|ref|XP_002926716.1| PREDICTED: probable methylcytosine dioxygenase TET2-like [Ailuropoda
            melanoleuca]
          Length = 2006

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1131 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1189

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1190 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1249

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1250 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1309

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1310 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1369

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1370 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1429

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQE+K   GAI+ L+
Sbjct: 1430 EFGSVEAQEKKKQNGAIQVLS 1450


>gi|426231353|ref|XP_004009704.1| PREDICTED: methylcytosine dioxygenase TET2 [Ovis aries]
          Length = 2001

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+P++ +D +Y+ 
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDL NM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1426

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1427 EFGSVEAQEEKKRNGAIQVLS 1447


>gi|326918544|ref|XP_003205548.1| PREDICTED: methylcytosine dioxygenase TET2-like [Meleagris gallopavo]
          Length = 1955

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 187/320 (58%), Positives = 242/320 (75%), Gaps = 6/320 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1102 DFPSCSCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1160

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y+ 
Sbjct: 1161 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADKLYSE 1220

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT+ L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1221 LTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1280

Query: 394  KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1281 KFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1340

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY + D D
Sbjct: 1341 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 1400

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+ E QEEK   G+I+ L
Sbjct: 1401 EFGSTEGQEEKKRNGSIQVL 1420


>gi|297466579|ref|XP_001790198.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
 gi|297475658|ref|XP_002688138.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
 gi|296486794|tpg|DAA28907.1| TPA: tet oncogene family member 2 [Bos taurus]
          Length = 2007

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+P++ +D +Y+ 
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDL NM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1426

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1427 EFGSVEAQEEKKRNGAIQVLS 1447


>gi|350587911|ref|XP_003129326.3| PREDICTED: methylcytosine dioxygenase TET2 [Sus scrofa]
          Length = 2019

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 242/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL  +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSGSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLPLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ +AQEEK   GAI+ L+
Sbjct: 1428 EFGSVDAQEEKKRNGAIQVLS 1448


>gi|449265874|gb|EMC77004.1| putative methylcytosine dioxygenase TET2, partial [Columba livia]
          Length = 1470

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)

Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
           + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 615 DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 673

Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
           K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y  
Sbjct: 674 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADKLYTE 733

Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
           LT+ L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 734 LTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 793

Query: 394 KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
           KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 794 KFKLMGDDPKEEEKLESNLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 853

Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
           FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY + D D
Sbjct: 854 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 913

Query: 509 EFGNKEAQEEKVNTGAIENL 528
           EFG+ E QEEK   G+I+ L
Sbjct: 914 EFGSTEGQEEKKRNGSIQVL 933


>gi|395542103|ref|XP_003772974.1| PREDICTED: methylcytosine dioxygenase TET2 [Sarcophilus harrisii]
          Length = 2011

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1145 DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1203

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1204 KSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1263

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1264 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1323

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1324 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYENRAPECRLGLKEGRP 1383

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY + + D
Sbjct: 1384 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGRTPEDEQLHVLPLYKVSNMD 1443

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+ EAQEEK   GAI+ L
Sbjct: 1444 EFGSVEAQEEKKRNGAIQVL 1463


>gi|224049493|ref|XP_002193886.1| PREDICTED: methylcytosine dioxygenase TET2 [Taeniopygia guttata]
          Length = 1960

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 186/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1104 DFPSCSCV-EHIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1162

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y+ 
Sbjct: 1163 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADRLYSE 1222

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT+ L KYG  T RRCA NE R CACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1223 LTDTLRKYGTLTNRRCALNEERNCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1282

Query: 394  KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1283 KFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1342

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY + D D
Sbjct: 1343 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 1402

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+ E QEEK   G+I+ L
Sbjct: 1403 EFGSTEGQEEKKRNGSIQVL 1422


>gi|334313839|ref|XP_001368961.2| PREDICTED: methylcytosine dioxygenase TET1 [Monodelphis domestica]
          Length = 2124

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/321 (59%), Positives = 238/321 (74%), Gaps = 6/321 (1%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLG   S+  +R+ +E R G KG+A+R+E ++YTGKE
Sbjct: 1373 SELPSCSCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMEARYGEKGRAIRIEVVVYTGKE 1431

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y 
Sbjct: 1432 GKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLYQ 1491

Query: 333  ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
             LT  LNKYG PTTRRCA NE RTCACQG+DP+TCGASFSFGCSWSMY+NGCK+ARSK  
Sbjct: 1492 ELTQSLNKYGCPTTRRCALNEDRTCACQGMDPETCGASFSFGCSWSMYFNGCKFARSKNP 1551

Query: 393  RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
            R+FRL      EE+ +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K GR
Sbjct: 1552 RRFRLIADDPKEEEILESNLQSLATDVAPVYKKLAPDAFRNQVENEPLGPDCRLGRKDGR 1611

Query: 451  PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLYIMDDS 507
            PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK  +RS+   P DEQLHVLPLY +  +
Sbjct: 1612 PFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDNRSVGVVPKDEQLHVLPLYKISQT 1671

Query: 508  DEFGNKEAQEEKVNTGAIENL 528
            DEFG KE  E K+ TGAI+ L
Sbjct: 1672 DEFGTKEGLEAKIKTGAIQVL 1692


>gi|334330961|ref|XP_003341431.1| PREDICTED: methylcytosine dioxygenase TET2 [Monodelphis domestica]
          Length = 2016

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/321 (58%), Positives = 241/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1148 DFPSCSCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1206

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1207 KSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1266

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1267 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1326

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1327 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1386

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY +   D
Sbjct: 1387 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREVGRTPEDEQLHVLPLYKVSSMD 1446

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK   GAI+ L+
Sbjct: 1447 EFGSVEAQEEKKRNGAIQVLS 1467


>gi|395841208|ref|XP_003793438.1| PREDICTED: methylcytosine dioxygenase TET3 [Otolemur garnettii]
          Length = 1655

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 253/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 662  LKYLDTPTKNLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 717

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 718  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 777

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PTTRRC  N+ RTCACQG DP+TCGA
Sbjct: 778  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTTRRCGLNDDRTCACQGKDPNTCGA 837

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 838  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 897

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 898  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 957

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV++GAI+ L 
Sbjct: 958  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVDSGAIQVLT 1002


>gi|351694675|gb|EHA97593.1| Putative methylcytosine dioxygenase TET2 [Heterocephalus glaber]
          Length = 1947

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 241/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+EK++YTGKEG
Sbjct: 1091 DFPSCHCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIEKVVYTGKEG 1149

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWVIRR+S EEKLL +V+ R+GHTC  A IVV+I+ WEG+PL  ++ +Y  
Sbjct: 1150 KSSQGCPIAKWVIRRSSREEKLLCLVRERRGHTCEVAVIVVLILLWEGIPLPLANRLYTE 1209

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LTN L + G  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1210 LTNTLCRNGSLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKVPR 1269

Query: 394  KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T +SP+Y+ LAP A+ NQ + E  A +CRLG K GRP
Sbjct: 1270 KFKLVGDDPKEEEKLESNLQNLSTFLSPMYQKLAPDAYNNQVELEHRAPDCRLGLKEGRP 1329

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM+NG T+V +LT+  +      P+DEQLHVLPLY + D D
Sbjct: 1330 FSGVTACLDFCAHAHRDLHNMHNGSTLVCTLTREDNREFGVVPEDEQLHVLPLYKISDVD 1389

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK+ +GAIE L 
Sbjct: 1390 EFGSAEAQEEKMRSGAIEVLT 1410


>gi|410918022|ref|XP_003972485.1| PREDICTED: methylcytosine dioxygenase TET2-like [Takifugu rubripes]
          Length = 939

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 242/321 (75%), Gaps = 6/321 (1%)

Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
           ++  C C        E G YYTHLG+A S+P +R+ +E+RSG  G A+R+EK++YTGKEG
Sbjct: 369 DIASCHCVEQISEKDE-GPYYTHLGSAPSVPGIRELMEKRSGITGSAIRIEKVVYTGKEG 427

Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
           K+TQGCP+AKWVIRR S EEK+L++V+ R GHTC+TA I+VVI+ WEG+  N +D +Y  
Sbjct: 428 KSTQGCPIAKWVIRRGSEEEKILVLVRERTGHTCNTACIIVVILVWEGILPNLADRLYHE 487

Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
           L++ L K+G  T RRCA NE RTCACQGL+P+ CGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 488 LSDTLRKHGALTQRRCAHNEERTCACQGLNPEACGASFSFGCSWSMYYNGCKFARSKNPR 547

Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
           KF+L      EE+ +E+    LAT + PLYK+LAP A+ NQ + E+   +CRLG K GRP
Sbjct: 548 KFKLLGDDMKEEERLEQNFQSLATLLGPLYKSLAPEAYGNQVEHEQRGLDCRLGHKEGRP 607

Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
           FSGVTAC DFCAH+HRDLHNM  G TVV +LTK  +R + K PDDEQLHVLPLY   ++D
Sbjct: 608 FSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTKEDNREIGKIPDDEQLHVLPLYKASNTD 667

Query: 509 EFGNKEAQEEKVNTGAIENLN 529
           EFG++E Q+EK+ +GAI+ L+
Sbjct: 668 EFGSEEGQQEKIKSGAIQVLS 688


>gi|354495922|ref|XP_003510077.1| PREDICTED: methylcytosine dioxygenase TET3 [Cricetulus griseus]
 gi|344253854|gb|EGW09958.1| putative methylcytosine dioxygenase TET3 [Cricetulus griseus]
          Length = 1668

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 676  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 732  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRGTLEEKLLCLVRHRAGHHCQN 791

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A I+++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 792  AVIIILILAWEGIPRSLGDALYRELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 852  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 912  AYQNQVTNEEVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M ++DEFG++E Q  KV++GAI+ L 
Sbjct: 972  RCVGQIPEDEQLHVLPLYKMANTDEFGSEENQNAKVSSGAIQVLT 1016


>gi|335285293|ref|XP_003125075.2| PREDICTED: methylcytosine dioxygenase TET3 [Sus scrofa]
          Length = 1660

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 666  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 721

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 722  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 781

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 782  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 841

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 842  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 901

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 902  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 961

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 962  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1006


>gi|338714173|ref|XP_001917149.2| PREDICTED: methylcytosine dioxygenase TET3 [Equus caballus]
          Length = 1664

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|395731669|ref|XP_002811944.2| PREDICTED: methylcytosine dioxygenase TET3 [Pongo abelii]
          Length = 1659

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|426226468|ref|XP_004007365.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
           [Ovis aries]
          Length = 1498

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
           +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 642 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 697

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 698 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 757

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 758 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 817

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 818 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 877

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
           A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 878 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 937

Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 938 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 982


>gi|301772240|ref|XP_002921545.1| PREDICTED: probable methylcytosine dioxygenase TET3-like [Ailuropoda
            melanoleuca]
          Length = 1695

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 706  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 761

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 762  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 821

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP TCGA
Sbjct: 822  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 881

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 882  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 941

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 942  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1001

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 1002 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1046


>gi|426336006|ref|XP_004029495.1| PREDICTED: methylcytosine dioxygenase TET3 [Gorilla gorilla gorilla]
          Length = 1662

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 670  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 726  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 786  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 845

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 846  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 906  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 966  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1010


>gi|149944516|ref|NP_659430.1| methylcytosine dioxygenase TET3 [Homo sapiens]
 gi|190358928|sp|O43151.3|TET3_HUMAN RecName: Full=Methylcytosine dioxygenase TET3
          Length = 1660

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|390474307|ref|XP_002757648.2| PREDICTED: methylcytosine dioxygenase TET3 [Callithrix jacchus]
          Length = 1660

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEVAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|119620108|gb|EAW99702.1| hCG40738 [Homo sapiens]
          Length = 1714

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 722  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 777

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 778  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 837

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 838  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 897

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 898  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 957

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 958  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 1018 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1062


>gi|410955071|ref|XP_003984182.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
            [Felis catus]
          Length = 1658

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL   +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMSNTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|397478119|ref|XP_003810404.1| PREDICTED: methylcytosine dioxygenase TET3 [Pan paniscus]
          Length = 1660

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|297266306|ref|XP_001107194.2| PREDICTED: probable methylcytosine dioxygenase TET3 [Macaca mulatta]
          Length = 1714

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 722  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 777

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 778  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 837

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 838  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 897

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 898  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 957

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 958  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 1018 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1062


>gi|291386514|ref|XP_002709671.1| PREDICTED: tet oncogene family member 3 [Oryctolagus cuniculus]
          Length = 1822

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 204/376 (54%), Positives = 263/376 (69%), Gaps = 12/376 (3%)

Query: 160  HLKDGLCQGMRTQDEM-LEPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDC 218
            H +DG  +   T+ E  L P      E P  +K  D  +K +LD   +     + E P C
Sbjct: 820  HSEDGGQEATPTKAENPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTC 874

Query: 219  KCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQG 278
             C     +  + G YYTHLG+  ++  +R+ +EER G KGKA+R+EK++YTGKEGK+++G
Sbjct: 875  DCV-EQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRG 933

Query: 279  CPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKL 338
            CP+AKWVIRR +LEEKLL +V+HR GH C  A IV++I+AWEG+P +  D +Y  LT+ L
Sbjct: 934  CPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTL 993

Query: 339  NKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS 398
             KYG PT+RRC  N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+
Sbjct: 994  RKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLA 1053

Query: 399  VRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
              +  EE+ + +    LAT ++PLYK LAP A+ NQ   E  A +CRLG K GRPFSGVT
Sbjct: 1054 GDNPKEEEVLPKSFQGLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFSGVT 1113

Query: 457  ACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNK 513
            AC DFCAH+H+D HN+ NGCTVV +LTK  +R + K P+DEQLHVLPLY M  +DEFG++
Sbjct: 1114 ACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSE 1173

Query: 514  EAQEEKVNTGAIENLN 529
            E Q  KV +GAI+ L 
Sbjct: 1174 ENQNAKVGSGAIQVLT 1189


>gi|440904536|gb|ELR55033.1| Putative methylcytosine dioxygenase TET3, partial [Bos grunniens
            mutus]
          Length = 1675

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 682  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 737

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 738  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 797

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 798  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 857

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 858  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 917

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 918  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 977

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 978  RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1022


>gi|281343070|gb|EFB18654.1| hypothetical protein PANDA_010427 [Ailuropoda melanoleuca]
          Length = 1674

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 685  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 740

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 741  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 800

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP TCGA
Sbjct: 801  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 860

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 861  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 920

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 921  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 980

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 981  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1025


>gi|355565799|gb|EHH22228.1| hypothetical protein EGK_05455, partial [Macaca mulatta]
          Length = 1693

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 703  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 758

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 759  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 818

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 819  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 878

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 879  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 938

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 939  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 999  RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1043


>gi|358414357|ref|XP_582145.4| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
          Length = 1657

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 664  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 719

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 720  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 779

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 780  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 839

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 840  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 899

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 900  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 959

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 960  RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1004


>gi|403260367|ref|XP_003922646.1| PREDICTED: methylcytosine dioxygenase TET3 [Saimiri boliviensis
            boliviensis]
          Length = 1659

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEVAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|332813444|ref|XP_515553.3| PREDICTED: methylcytosine dioxygenase TET3 [Pan troglodytes]
          Length = 1662

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 670  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 726  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 786  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 845

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 846  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 906  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 966  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1010


>gi|431920360|gb|ELK18392.1| Putative methylcytosine dioxygenase TET3 [Pteropus alecto]
          Length = 1631

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 669  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 724

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 725  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 784

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 785  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 844

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 845  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 904

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 905  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 964

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 965  RCVGKIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1009


>gi|441643103|ref|XP_003268728.2| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3,
            partial [Nomascus leucogenys]
          Length = 1787

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 796  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 851

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 852  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRTGHHCQN 911

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 912  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 971

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 972  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1031

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 1032 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1091

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 1092 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1136


>gi|402891273|ref|XP_003908876.1| PREDICTED: methylcytosine dioxygenase TET3 [Papio anubis]
          Length = 1660

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1008


>gi|344283728|ref|XP_003413623.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase
           TET3-like [Loxodonta africana]
          Length = 1582

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
           +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 592 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 647

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 648 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 707

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 708 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 767

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 768 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 827

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
           A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 828 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 887

Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 888 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 932


>gi|316990466|gb|ADU77107.1| putative methylcytosine dioxygenase [Homo sapiens]
          Length = 1795

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 803  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 858

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 859  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 918

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 919  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 978

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 979  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1038

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 1039 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1098

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 1099 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1143


>gi|355751424|gb|EHH55679.1| hypothetical protein EGM_04930, partial [Macaca fascicularis]
          Length = 1621

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 703  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 758

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 759  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 818

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 819  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 878

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 879  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 938

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 939  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 999  RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1043


>gi|417406721|gb|JAA50005.1| Putative snf2 family dna-dependent atpase [Desmodus rotundus]
          Length = 1759

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 770  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 825

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 826  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 885

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 886  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 945

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 946  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1005

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 1006 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1065

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 1066 RCVGKIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1110


>gi|348566495|ref|XP_003469037.1| PREDICTED: methylcytosine dioxygenase TET3-like [Cavia porcellus]
          Length = 1670

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 679  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 734

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 735  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 794

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 795  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 854

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 855  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 914

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 915  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 974

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 975  RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1019


>gi|345782422|ref|XP_540225.3| PREDICTED: methylcytosine dioxygenase TET3 [Canis lupus familiaris]
          Length = 1660

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 670  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 726  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP TCGA
Sbjct: 786  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 845

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 846  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 906  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DE+G++E Q  KV +GAI+ L 
Sbjct: 966  RCVGKIPEDEQLHVLPLYKMANTDEYGSEENQNAKVGSGAIQVLT 1010


>gi|390347525|ref|XP_785530.3| PREDICTED: uncharacterized protein LOC580376 [Strongylocentrotus
           purpuratus]
          Length = 1458

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 179/334 (53%), Positives = 238/334 (71%), Gaps = 8/334 (2%)

Query: 201 LDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKA 260
           + H++ +  + R E P+C C  +D    +   YYTHLG   +LP +R+ +E RSG++G  
Sbjct: 491 MKHMQLISEDARIEAPNCGCLENDM---DEAPYYTHLGTGPNLPAIRELVEIRSGFQGSQ 547

Query: 261 LRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWE 320
           +R+EK++Y+GKEGK++ GCP+AKW+IRR+S +EK+L++V+HR GH C T++I++ IVAWE
Sbjct: 548 VRIEKVVYSGKEGKSSTGCPIAKWIIRRSSTDEKILVLVRHRPGHRCDTSYIIIAIVAWE 607

Query: 321 GVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMY 380
           GV    +D  Y +L   L    +PT RRC TNE +TCACQG  PD+CGASF+FGCSWSMY
Sbjct: 608 GVNNYVADDTYEMLRTTLPNGAIPTVRRCGTNEDKTCACQGFSPDSCGASFTFGCSWSMY 667

Query: 381 YNGCKYARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFERE 438
           YN CK+ARS+T RKF+L   + E E  + ++   +AT + PLYK LAP +F N   FE E
Sbjct: 668 YNTCKFARSRTPRKFKLLEANPEVEDVLSDRFQNMATDLGPLYKRLAPESFNNMVVFEEE 727

Query: 439 ASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSK---PDDEQ 495
             ECRLG + GRPF+GVTAC DFCAH+H+D HNMNNGCTVVV+LTK    +K   P DEQ
Sbjct: 728 GKECRLGKETGRPFAGVTACMDFCAHAHKDQHNMNNGCTVVVTLTKDDIRNKRPSPGDEQ 787

Query: 496 LHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           LHVLPLY +D +DEFG  E Q+ KV  G+IE L 
Sbjct: 788 LHVLPLYYLDSTDEFGTAEGQQNKVRNGSIEVLT 821


>gi|432108066|gb|ELK33047.1| Methylcytosine dioxygenase TET3 [Myotis davidii]
          Length = 1772

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 746  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 801

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 802  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 861

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 862  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 921

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 922  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 981

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 982  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1041

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 1042 RCVGRIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1086


>gi|363742165|ref|XP_003642602.1| PREDICTED: methylcytosine dioxygenase TET3-like [Gallus gallus]
          Length = 1308

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 201/370 (54%), Positives = 259/370 (70%), Gaps = 12/370 (3%)

Query: 166 CQGMRTQDEM-LEPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASD 224
            +G  T+DE+ L P      E P  +K  D  +K +LD   +     + E P C C    
Sbjct: 308 AEGTPTKDEVPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQ 361

Query: 225 KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
            +  + G YYTHLG+  ++  +R+ +EER G KGKA+R+EK++YTGKEGK+++GCP+AKW
Sbjct: 362 IVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKW 421

Query: 285 VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
           VIRR + EEKLL +V+HR GH C  A I+++I+AWEG+P    D +Y  LT+ L KYG P
Sbjct: 422 VIRRHNQEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRTLGDTLYQELTDTLTKYGNP 481

Query: 345 TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--E 402
           T+RRC  N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL   +  E
Sbjct: 482 TSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLVGDNPKE 541

Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
           E+ + +    LAT ++PLYK LAP A+ NQ   E  A +CRLG K GRPFSGVTAC DFC
Sbjct: 542 EELLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEDIAIDCRLGLKEGRPFSGVTACMDFC 601

Query: 463 AHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
           AH+H+D HN+ NGCTVV +LTK  +R + K P+DEQLHVLPLY M  +DEFG++E Q  K
Sbjct: 602 AHAHKDQHNLYNGCTVVCTLTKEDNRVVGKIPEDEQLHVLPLYKMSSTDEFGSEENQNAK 661

Query: 520 VNTGAIENLN 529
           V +GAI+ L 
Sbjct: 662 VGSGAIQVLT 671


>gi|444725167|gb|ELW65745.1| Methylcytosine dioxygenase TET1 [Tupaia chinensis]
          Length = 1472

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 187/322 (58%), Positives = 238/322 (73%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            TEVP+C C     +  + G YYTHLGA  ++  +R+ +E R G KGKA+R+E ++YTGKE
Sbjct: 770  TEVPECDCL-DRAIQKDKGPYYTHLGAGPTVAAVREIMENRYGQKGKAIRIETVVYTGKE 828

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWVIRR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL  +D +Y 
Sbjct: 829  GKSSHGCPVAKWVIRRSSEEEKVLCLVRKRAGHHCSTAVIVVLIMVWEGIPLPMADQLYK 888

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 889  ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 948

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 949  PRRFRIDPSSPLHEKNLEDNLQNLATQLAPIYKQFAPDAYKNQVEYEHVARECRLGSKEG 1008

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1009 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLAD 1068

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1069 TDEFGSQEGMEAKIKSGAIEVL 1090


>gi|291404265|ref|XP_002718498.1| PREDICTED: CXXC finger 5-like [Oryctolagus cuniculus]
          Length = 2112

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 191/326 (58%), Positives = 241/326 (73%), Gaps = 15/326 (4%)

Query: 213  TEVPDCKCF----ASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
            TEVP C C       DK     G YYTHLGA  S+  +R+ +E R G KGKA+R+E+++Y
Sbjct: 1405 TEVPSCNCLDRGTQKDK-----GPYYTHLGAGPSVAAVREIMENRYGQKGKAVRIEEVVY 1459

Query: 269  TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
            TGKEGK+++GCP+AKWV+RR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL  +D
Sbjct: 1460 TGKEGKSSRGCPVAKWVLRRSSEEEKVLCLVRKRPGHHCSTAVIVVLIMIWEGIPLPMAD 1519

Query: 329  GVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYA 387
             +Y+ LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ 
Sbjct: 1520 RLYSELTENLRSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFG 1579

Query: 388  RSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            RS + R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG
Sbjct: 1580 RSPSPRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPVAYQNQVEYEHVARECRLG 1639

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLY 502
             K GRPFSGVTAC DFCAHSHRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY
Sbjct: 1640 RKEGRPFSGVTACLDFCAHSHRDIHNMNNGSTVVCTLTREDNRSLGVVPQDEQLHVLPLY 1699

Query: 503  IMDDSDEFGNKEAQEEKVNTGAIENL 528
             + D+DEFG+KE  E K+ +GAIE L
Sbjct: 1700 KLADTDEFGSKEGMERKIKSGAIEVL 1725


>gi|148666664|gb|EDK99080.1| mCG133587 [Mus musculus]
          Length = 1707

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     ++E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 715  LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 770

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 771  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 830

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 831  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 890

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 891  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 950

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 951  AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1010

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 1011 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1055


>gi|256773243|ref|NP_898961.2| methylcytosine dioxygenase TET3 [Mus musculus]
 gi|239938841|sp|Q8BG87.3|TET3_MOUSE RecName: Full=Methylcytosine dioxygenase TET3
          Length = 1668

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     ++E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 676  LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 732  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 791

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 792  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 852  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 912  AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 972  RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1016


>gi|313493537|gb|ADR57138.1| TET3 isoform 2 [Mus musculus]
          Length = 1784

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     ++E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 792  LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 847

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 848  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 907

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 908  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 967

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 968  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 1027

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 1028 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1087

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 1088 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1132


>gi|313493535|gb|ADR57137.1| TET3 isoform 1 [Mus musculus]
 gi|432138979|gb|AGB05430.1| Tet methylcytosine deoxygenase 3 isoform [Mus musculus]
          Length = 1803

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     ++E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 811  LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 866

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 867  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 926

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 927  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 986

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 987  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 1046

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 1047 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1106

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 1107 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1151


>gi|432847164|ref|XP_004065962.1| PREDICTED: methylcytosine dioxygenase TET2-like [Oryzias latipes]
          Length = 1755

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 187/330 (56%), Positives = 248/330 (75%), Gaps = 8/330 (2%)

Query: 207  LKNNMRTEVPDCKCFASDKL-PPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
            L   ++T+     C   D++   + G YYTHLG+A ++P +R+ +E+RSG  G+A+R+EK
Sbjct: 915  LDTPLKTQYDIASCHCVDQIVEKDEGPYYTHLGSAPTVPGIREMMEKRSGLTGRAIRIEK 974

Query: 266  ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
            ++YTGKEGK+TQGCP+AKWVIRR+S+EEKLL++V+ R GH C TA I+VVI+ WEG+  +
Sbjct: 975  VIYTGKEGKSTQGCPIAKWVIRRSSVEEKLLVLVRERTGHRCETACIIVVILVWEGIQAS 1034

Query: 326  QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
             +D +Y  L+  L K G  T RRCA NE RTCACQGL+P+  GASFSFGCSWSMYYNGCK
Sbjct: 1035 LADRLYLELSETLKKNGAHTQRRCAFNEERTCACQGLNPEESGASFSFGCSWSMYYNGCK 1094

Query: 386  YARSKTVRKFRL---SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
            +ARSK  RKF+L    VR EE+++E     LAT ++PLYKA+AP A+ NQ + E  A +C
Sbjct: 1095 FARSKIPRKFKLLGDDVR-EEEKVERNFQNLATLLAPLYKAMAPEAYGNQVEHEHRAPDC 1153

Query: 443  RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVL 499
            RLG K GRPFSGVTAC DFCAH+HRDLHNM  G TVV +LT+  +R + + P+DEQLHVL
Sbjct: 1154 RLGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTREDNREIGRIPEDEQLHVL 1213

Query: 500  PLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            PLY   ++DEFG++E Q+EK+ +GAI+ L+
Sbjct: 1214 PLYKASNTDEFGSEEGQQEKMKSGAIQVLS 1243


>gi|293346889|ref|XP_002726470.1| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
 gi|293358777|ref|XP_001057850.2| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
 gi|149036522|gb|EDL91140.1| rCG56357 [Rattus norvegicus]
          Length = 1667

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 251/345 (72%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 675  LKYLDTPTKSLLDTPAK---KAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 730

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 731  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 790

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 791  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 850

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 851  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 910

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 911  AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 970

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 971  RCVGQIPEDEQLHVLPLYKMATTDEFGSEENQNAKVSSGAIQVLT 1015


>gi|327283432|ref|XP_003226445.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2-like
            [Anolis carolinensis]
          Length = 1631

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 184/320 (57%), Positives = 237/320 (74%), Gaps = 6/320 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G +YTHLGA  ++  +R+ +EER   KGKA+R+E+I+YTGKEG
Sbjct: 786  DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIRQIMEERYEQKGKAIRIERIVYTGKEG 844

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K+ QGCP+AKWVIRR S EEKLL +V+ R GH+C TA IVV+I+ WEG+P + +D +Y+ 
Sbjct: 845  KSAQGCPIAKWVIRRGSTEEKLLCLVRERAGHSCETAVIVVLILVWEGIPQSLADKLYSD 904

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            L+  L KYG  T RRCA NE RTCACQGLD ++CGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 905  LSETLRKYGTLTNRRCALNEERTCACQGLDTESCGASFSFGCSWSMYYNGCKFARSKIPR 964

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 965  KFKLLGDDPKEEEKLETSLQTLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1024

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R + + P+DEQLHVLPLY + + D
Sbjct: 1025 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKISNID 1084

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+ E QEEK   G+I+ L
Sbjct: 1085 EFGSTEGQEEKKRNGSIQVL 1104


>gi|410922577|ref|XP_003974759.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
          Length = 2020

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/324 (57%), Positives = 238/324 (73%), Gaps = 6/324 (1%)

Query: 210  NMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYT 269
            +++ E P C C     L  + G YY HLGA  ++  +R  +E R+G KG A+R+EK++YT
Sbjct: 1049 DLQAEFPTCTCV-EQILEKDEGPYYNHLGAGPTVAAVRDLMERRTGLKGDAIRLEKVVYT 1107

Query: 270  GKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDG 329
            G+EGK++QGCP+AKWVIRR +  EKLL +V+ R GH C  A I++VI+AWEGVP   +D 
Sbjct: 1108 GREGKSSQGCPIAKWVIRRGNETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRAMADM 1167

Query: 330  VYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
            +Y  L++ L KYG PT+RRC  N+ RTCACQG DP+ CGASFSFGCSWSMY+NGCKYARS
Sbjct: 1168 LYRDLSDSLTKYGNPTSRRCGFNDDRTCACQGKDPEKCGASFSFGCSWSMYFNGCKYARS 1227

Query: 390  KTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
            K  RKFRL      EE ++ ++   LAT ++PLYK LAP A++NQCQ E +A +CRLG K
Sbjct: 1228 KMPRKFRLQGERPEEEDKVGDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCRLGLK 1287

Query: 448  PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIM 504
             GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + K PDDEQLHVLPLY +
Sbjct: 1288 EGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNREVKKIPDDEQLHVLPLYKV 1347

Query: 505  DDSDEFGNKEAQEEKVNTGAIENL 528
              +DEFG +E Q  K+ TGAI+ L
Sbjct: 1348 SLTDEFGREEGQRLKMKTGAIQVL 1371


>gi|449504705|ref|XP_002190919.2| PREDICTED: methylcytosine dioxygenase TET1 [Taeniopygia guttata]
          Length = 2187

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            ++E+P C C     +  + G YYTHLG   S+  +R+ +E R G KG A+R+E ++YTGK
Sbjct: 1427 QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 1485

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y
Sbjct: 1486 EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 1545

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              LT  L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK 
Sbjct: 1546 KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 1605

Query: 392  VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL     +QE  +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K G
Sbjct: 1606 PRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGCKDG 1665

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK    R    P DEQLHVLPLY +  
Sbjct: 1666 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1725

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG +E  E K+  GAI+ L
Sbjct: 1726 TDEFGTEEGLEAKIKAGAIQVL 1747


>gi|363735173|ref|XP_421571.3| PREDICTED: methylcytosine dioxygenase TET1 [Gallus gallus]
          Length = 1541

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            ++E+P C C     +  + G YYTHLG   S+  +R+ +E R G KG A+R+E ++YTGK
Sbjct: 781  QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 839

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y
Sbjct: 840  EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 899

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              LT  L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK 
Sbjct: 900  KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 959

Query: 392  VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL     +QE  +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K G
Sbjct: 960  PRKFRLLTDDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGSKDG 1019

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK    R    P DEQLHVLPLY +  
Sbjct: 1020 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1079

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG +E  E K+  GAI+ L
Sbjct: 1080 TDEFGTEEGLEAKIKAGAIQVL 1101


>gi|449268998|gb|EMC79810.1| Methylcytosine dioxygenase TET1, partial [Columba livia]
          Length = 1186

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            ++E+P C C     +  + G YYTHLG   S+  +R+ +E R G KG A+R+E ++YTGK
Sbjct: 866  QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 924

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y
Sbjct: 925  EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 984

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              LT  L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK 
Sbjct: 985  KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 1044

Query: 392  VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL     +QE  +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K G
Sbjct: 1045 PRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGCKDG 1104

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK    R    P DEQLHVLPLY +  
Sbjct: 1105 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1164

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG +E  E K+  GAI+ L
Sbjct: 1165 TDEFGTEEGLEAKIKAGAIQVL 1186


>gi|47227721|emb|CAG09718.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2294

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/325 (57%), Positives = 236/325 (72%), Gaps = 6/325 (1%)

Query: 210  NMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYT 269
            +++ E P C C     L  + G YY HLGA  ++  +R  +E R+G KG A+R+EK++YT
Sbjct: 1478 DLQAEFPTCTCV-EQILEKDEGPYYNHLGAGPTVAAVRDLMERRTGLKGDAIRLEKVVYT 1536

Query: 270  GKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDG 329
            G+EGK++QGCP+AKWVIRR S  EKLL +V+ R GH C  A I++VI+AWEGVP   +D 
Sbjct: 1537 GREGKSSQGCPIAKWVIRRGSETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRAMADM 1596

Query: 330  VYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
            +Y  L++ L KYG PT RRC  N+ RTCACQG DP+  GASFSFGCSWSMY+NGCKYARS
Sbjct: 1597 LYRDLSDSLTKYGNPTNRRCGFNDDRTCACQGKDPEKSGASFSFGCSWSMYFNGCKYARS 1656

Query: 390  KTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
            K  RKFRL      EE ++ ++   LAT ++PLYK LAP A++NQCQ E +A +CRLG K
Sbjct: 1657 KMPRKFRLQGDRPEEEDKVRDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCRLGLK 1716

Query: 448  PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIM 504
             GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + K PDDEQLHVLPLY +
Sbjct: 1717 EGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNREVQKIPDDEQLHVLPLYKV 1776

Query: 505  DDSDEFGNKEAQEEKVNTGAIENLN 529
              +DEFG +E Q  K+ TGAI+ L 
Sbjct: 1777 SPTDEFGREEGQRLKMKTGAIQVLQ 1801


>gi|326667684|ref|XP_003198655.1| PREDICTED: methylcytosine dioxygenase TET3 [Danio rerio]
          Length = 1799

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 188/340 (55%), Positives = 247/340 (72%), Gaps = 9/340 (2%)

Query: 194  DPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEER 253
            D  +K +LD   +   +++ + P C C     L  + G YY HLG+   +P +R+ +E+R
Sbjct: 833  DTPTKNLLDTPGK---DVQPDFPICDCV-DQVLEKDEGPYYNHLGSGRDIPSVRQLMEDR 888

Query: 254  SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
             G KG+A+R+EK++YTG+EGK++QGCP+AKWV+RR+S +EK+L +VK R GH C+   IV
Sbjct: 889  YGEKGEAVRIEKVVYTGREGKSSQGCPIAKWVLRRSSEKEKVLCVVKQRPGHHCANTVIV 948

Query: 314  VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
            VVI+AWEGVP    D +Y  +T  + KYG PT+RRC  NE RTCACQG DP+TCGASFSF
Sbjct: 949  VVILAWEGVPRALGDKLYREVTETITKYGNPTSRRCGLNEDRTCACQGKDPETCGASFSF 1008

Query: 374  GCSWSMYYNGCKYARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
            GCSWSMY+NGCKYARSK  RKFRL      EE  + +    LAT ++PLYK LAP A++N
Sbjct: 1009 GCSWSMYFNGCKYARSKVPRKFRLQGEHPKEEDNLRDNFQALATHVAPLYKKLAPQAYSN 1068

Query: 432  QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS 489
            QC  E  AS+CRLG K GRPFSG+TAC DFCAH+H+D HN++NGCTVV +LTK  +R++ 
Sbjct: 1069 QCLHEDVASDCRLGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDNRTVG 1128

Query: 490  K-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
              P+DEQLHVLPLY +  +DEFG++E Q  K+ TGAI+ L
Sbjct: 1129 TIPEDEQLHVLPLYKLATTDEFGSEENQRLKMQTGAIQVL 1168


>gi|432875799|ref|XP_004072913.1| PREDICTED: methylcytosine dioxygenase TET3-like [Oryzias latipes]
          Length = 2014

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/340 (55%), Positives = 246/340 (72%), Gaps = 9/340 (2%)

Query: 194  DPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEER 253
            D  +K +LD   +   + + + P C C     L  + G YY HLG+  ++  +R  +E R
Sbjct: 1022 DTPTKSLLDTPSK---DPQLDFPTCTCV-EQILEKDEGPYYNHLGSGPTVASIRTLMEAR 1077

Query: 254  SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
             G KG A+R+EK++YTGKEGK++ GCP+AKWVIRR S +EK+L +V+HR GH C  A I+
Sbjct: 1078 FGEKGDAVRIEKVVYTGKEGKSSHGCPIAKWVIRRGSEKEKVLCLVRHRAGHHCENAVII 1137

Query: 314  VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
            ++I+AWEGVP   +D +Y  +T+ L KYG PT+RRC  N+ RTCACQG DP+TCGASFSF
Sbjct: 1138 ILILAWEGVPKALADKLYREVTDTLTKYGNPTSRRCGLNDDRTCACQGKDPETCGASFSF 1197

Query: 374  GCSWSMYYNGCKYARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
            GCSWSMY+NGCKYARSK  RKFRL      EE+++ +    LAT ++PLYK LAP A++N
Sbjct: 1198 GCSWSMYFNGCKYARSKMPRKFRLQGDHPEEEEKLRDNFQNLATEVAPLYKRLAPQAYSN 1257

Query: 432  QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS 489
            QC  E +AS+CRLG K GRPFSG+TAC DFCAH+H+D HN++NGCTVV +LTK  +R + 
Sbjct: 1258 QCLSEDKASDCRLGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDNRKVG 1317

Query: 490  K-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
              P+DEQLHVLPLY +  +DEFG+ EAQ  K+ TGAI+ L
Sbjct: 1318 GIPEDEQLHVLPLYTVSHTDEFGSAEAQRIKMQTGAIQAL 1357


>gi|410901250|ref|XP_003964109.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
          Length = 1134

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 188/322 (58%), Positives = 238/322 (73%), Gaps = 6/322 (1%)

Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
           +++P C+C     +  E G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 592 SDLPSCQCM-DQIIEKEEGPYYTHLGAGPSIAAVREMMENRYGAKGNAVRIEAVVYTGKE 650

Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
           GK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V++I+AWEG+    +DG+Y 
Sbjct: 651 GKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVILILAWEGISRPVADGLYQ 710

Query: 333 ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
            LT  L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK+ARSK  
Sbjct: 711 ELTTTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCKFARSKVP 770

Query: 393 RKFRLS--VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
           RKFRL      EE+++E  +  LAT ++PLYK LAP AF NQ + E    +CRLG + GR
Sbjct: 771 RKFRLQGDYPEEEEKLETHLQGLATDLAPLYKRLAPEAFQNQVENEDGGGDCRLGQREGR 830

Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
           PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK  +R++   P+DEQLHVLPLY + D 
Sbjct: 831 PFSGVTACVDFCAHAHKDTHNMNNGSTVVCTLTKEDNRAVRNVPEDEQLHVLPLYRISDR 890

Query: 508 DEFGNKEAQEEKVNTGAIENLN 529
           DEFG  E Q  K+ +G ++ L+
Sbjct: 891 DEFGQVEGQWAKIRSGGLQVLS 912


>gi|296482779|tpg|DAA24894.1| TPA: hypothetical protein BOS_11388 [Bos taurus]
          Length = 964

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/304 (61%), Positives = 235/304 (77%), Gaps = 5/304 (1%)

Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
           G YYTHLG+  ++  +R+ +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +
Sbjct: 8   GPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHT 67

Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
           LEEKLL +V+HR GH C  A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC 
Sbjct: 68  LEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCG 127

Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEE 408
            N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +
Sbjct: 128 LNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRK 187

Query: 409 KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
               LAT ++PLYK LAP A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D
Sbjct: 188 SFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKD 247

Query: 469 LHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAI 525
            HN+ NGCTVV +LTK  +R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI
Sbjct: 248 QHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAI 307

Query: 526 ENLN 529
           + L 
Sbjct: 308 QVLT 311


>gi|355723845|gb|AES08024.1| tet oncoprotein family member 2 [Mustela putorius furo]
          Length = 870

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/311 (59%), Positives = 236/311 (75%), Gaps = 12/311 (3%)

Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGC-------PLAK 283
           G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEGK++QGC       P+AK
Sbjct: 8   GPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCGKSSQGCPIAK 67

Query: 284 WVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGL 343
           WV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ LT  L KYG 
Sbjct: 68  WVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGT 127

Query: 344 PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRS 401
            T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  RKF+L      
Sbjct: 128 LTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPK 187

Query: 402 EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
           EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRPFSGVTAC DF
Sbjct: 188 EEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDF 247

Query: 462 CAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
           CAH+HRDLHNM NG T+V +LT+  +R +  KP+DEQLHVLPLY + D DEFG+ EAQE+
Sbjct: 248 CAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEK 307

Query: 519 KVNTGAIENLN 529
           K   GAI+ L+
Sbjct: 308 KKQNGAIQVLS 318


>gi|432951908|ref|XP_004084919.1| PREDICTED: methylcytosine dioxygenase TET1-like [Oryzias latipes]
          Length = 1530

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/321 (58%), Positives = 240/321 (74%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            ++P C+C     +  E G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKEG
Sbjct: 829  DLPSCQCV-DQIIEKEEGPYYTHLGAGPSVAAVREMMENRYGAKGNAIRVEVVVYTGKEG 887

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            +++QGCP+AKWVIRR S EEKLL +V+ R GH+C +A +V++I+AWEG+P   +D +Y  
Sbjct: 888  RSSQGCPIAKWVIRRGSEEEKLLCLVRQRPGHSCDSAVLVILILAWEGIPRPVADHLYRE 947

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT+ L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK+ARSK  R
Sbjct: 948  LTDTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCKFARSKVPR 1007

Query: 394  KFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KFRL     EQE  IE  +  LA+ ++PLYK LAP AF NQ + E   S+CRLG + GRP
Sbjct: 1008 KFRLHGDFPEQEEKIENNLQNLASDLAPLYKKLAPQAFQNQVEHEVAGSDCRLGREEGRP 1067

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+H+D  NMNNG TVV +LTK  +R++   P+DEQLHVLPLY + D+D
Sbjct: 1068 FSGVTACVDFCAHAHKDTSNMNNGSTVVCTLTKEDNRAVRNIPEDEQLHVLPLYKVSDTD 1127

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG  E Q  K+ +GA++ L+
Sbjct: 1128 EFGMVEGQWAKIQSGALQILS 1148


>gi|351702491|gb|EHB05410.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
          Length = 2011

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/321 (57%), Positives = 237/321 (73%), Gaps = 7/321 (2%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            EVP C C     +  + G YYTHLGA  S+  +R+ +E R G KGKA+R+EK++YTGKEG
Sbjct: 1332 EVPACNC-PDRGIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGKAIRIEKVVYTGKEG 1390

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWVIRR+S EEK+L +V+ R GH C TA IVV+I+ W+G+PL  +D +Y  
Sbjct: 1391 KSSQGCPVAKWVIRRSSEEEKVLCLVRQRPGHQCETAVIVVLIMLWDGIPLPMADRLYTE 1450

Query: 334  LTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
            LT  L  Y G PT RRC  NE RTC CQG+DP+ CGASFSFGCSWSMY+NGCK+ RS + 
Sbjct: 1451 LTENLKSYSGHPTDRRCTLNENRTCTCQGIDPERCGASFSFGCSWSMYFNGCKFGRSPSP 1510

Query: 393  RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
            R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K GR
Sbjct: 1511 RRFRIDPSSPLHEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVARECRLGRKEGR 1570

Query: 451  PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
            PFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P+DEQLHVLPLY + D+
Sbjct: 1571 PFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPEDEQLHVLPLYRLSDT 1630

Query: 508  DEFGNKEAQEEKVNTGAIENL 528
            DEFG+KE  E K+ +GA++ L
Sbjct: 1631 DEFGSKEGMEAKIQSGAVQVL 1651


>gi|334313524|ref|XP_003339916.1| PREDICTED: methylcytosine dioxygenase TET3-like [Monodelphis
           domestica]
          Length = 1614

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 191/345 (55%), Positives = 247/345 (71%), Gaps = 9/345 (2%)

Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
           +K  D  +K +LD   +     + E P C+C     +  + G YYTHLGA  S+  +R+ 
Sbjct: 617 LKYLDTPTKNLLDTPSK---RAQAEFPVCECV-EQIVEKDEGPYYTHLGAGPSVAAIREL 672

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C  
Sbjct: 673 MEDRYGEKGKAIRIEKVVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQ 732

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A I+++I+ WEG+     D +Y  LT  L  YG PTTRRC  N+ RTCACQG DP TCGA
Sbjct: 733 AVIIILILVWEGISSELGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGA 792

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHL--LATTISPLYKALAPG 427
           SFSFGCSWSMY+NGCKYARSK  RKFRL+  + E+E   + H   LAT ++PLYK LAP 
Sbjct: 793 SFSFGCSWSMYFNGCKYARSKFPRKFRLTGDNPEEEENLRKHFQNLATQVAPLYKKLAPQ 852

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
           A+ NQ + E EA +CRLG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 853 AYQNQVKDEEEAIDCRLGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 912

Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           R + + P+DEQLHVLPLY +D +DEFG++E Q  K+ +GAI+ L 
Sbjct: 913 RCVGQIPEDEQLHVLPLYKIDSTDEFGSEENQRAKMASGAIQVLT 957


>gi|297686810|ref|XP_002820931.1| PREDICTED: methylcytosine dioxygenase TET1 [Pongo abelii]
          Length = 2136

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737


>gi|355782886|gb|EHH64807.1| hypothetical protein EGM_18120, partial [Macaca fascicularis]
          Length = 1479

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 760  SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 818

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 819  GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 878

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 879  ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 938

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 939  PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 998

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 999  RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1058

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1059 TDEFGSKEGMEAKIKSGAIEVL 1080


>gi|327287144|ref|XP_003228289.1| PREDICTED: methylcytosine dioxygenase TET3-like [Anolis carolinensis]
          Length = 1795

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 194/371 (52%), Positives = 256/371 (69%), Gaps = 14/371 (3%)

Query: 166  CQGMRTQDEMLEPKEPNNN---EEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFA 222
             +G  T++E+  P  P  +   E P  +K  D  +K +LD   +     + E P C C  
Sbjct: 781  VEGTPTKEEVPPPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV- 834

Query: 223  SDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLA 282
               +  + G YYTHLG+  ++  +R+ +EER G KG A+R+EK++YTGKEGK+++GCP+A
Sbjct: 835  EQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGDAIRIEKVIYTGKEGKSSRGCPIA 894

Query: 283  KWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYG 342
            KWVIRR +LEEKLL +V+HR GH C  A I+++I+AWEG+P    D +Y  L++ L KYG
Sbjct: 895  KWVIRRHNLEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRTLGDTLYQELSDILTKYG 954

Query: 343  LPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVR-- 400
             PTTRRC  N+ RTCACQG DP++CGASFSFGCSWSMY+NGCKYARSK  RKFRL     
Sbjct: 955  NPTTRRCGLNDDRTCACQGKDPNSCGASFSFGCSWSMYFNGCKYARSKMPRKFRLQGYNP 1014

Query: 401  SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFD 460
            +EE  + +    LAT ++PLY+ LAP A+ NQ   E  A +CRLG K GRPFSGVTAC D
Sbjct: 1015 NEEDVLRKNFQDLATEVAPLYQRLAPQAYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMD 1074

Query: 461  FCAHSHRDLHNMNNGCTVVVSLTKHRSLSK---PDDEQLHVLPLYIMDDSDEFGNKEAQE 517
            FCAH+H+D HN+ NGCTVV +LTK  + +    P+DEQLHVLPLY M  +DEFG++E Q 
Sbjct: 1075 FCAHAHKDQHNLYNGCTVVCTLTKEDNRTTGQVPEDEQLHVLPLYKMSPTDEFGSEERQA 1134

Query: 518  EKVNTGAIENL 528
             K+ +GAI+ L
Sbjct: 1135 AKMGSGAIQVL 1145


>gi|156139122|ref|NP_085128.2| methylcytosine dioxygenase TET1 [Homo sapiens]
 gi|115502139|sp|Q8NFU7.2|TET1_HUMAN RecName: Full=Methylcytosine dioxygenase TET1; AltName:
            Full=CXXC-type zinc finger protein 6; AltName:
            Full=Leukemia-associated protein with a CXXC domain;
            AltName: Full=Ten-eleven translocation 1 gene protein
 gi|225000490|gb|AAI72365.1| Tet oncogene 1 [synthetic construct]
          Length = 2136

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737


>gi|119574684|gb|EAW54299.1| CXXC finger 6 [Homo sapiens]
          Length = 2150

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1431 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1489

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1490 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1549

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1550 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1609

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1610 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1669

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1670 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1729

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1730 TDEFGSKEGMEAKIKSGAIEVL 1751


>gi|22001093|gb|AAM88301.1|AF430147_1 leukemia-associated protein with a CXXC domain [Homo sapiens]
          Length = 2136

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737


>gi|402880644|ref|XP_003903908.1| PREDICTED: methylcytosine dioxygenase TET1 [Papio anubis]
          Length = 2132

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1413 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1471

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1472 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1531

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1532 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1591

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1592 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1651

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1652 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1711

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1712 TDEFGSKEGMEAKIKSGAIEVL 1733


>gi|296220536|ref|XP_002756350.1| PREDICTED: methylcytosine dioxygenase TET1 [Callithrix jacchus]
          Length = 2134

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1415 SELPTCNCI-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGEKGNAIRIEIVVYTGKE 1473

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1474 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1533

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENIARECRLGSKEG 1653

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1654 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1713

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1714 TDEFGSKEGMEAKIQSGAIEVL 1735


>gi|397489915|ref|XP_003846089.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1 [Pan
            paniscus]
          Length = 2136

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737


>gi|410043931|ref|XP_507822.3| PREDICTED: methylcytosine dioxygenase TET1 [Pan troglodytes]
          Length = 2220

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1501 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1559

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1560 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1619

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1620 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1679

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1680 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1739

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1740 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1799

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1800 TDEFGSKEGMEAKIKSGAIEVL 1821


>gi|426364946|ref|XP_004049552.1| PREDICTED: methylcytosine dioxygenase TET1 [Gorilla gorilla gorilla]
          Length = 2136

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1716 TDEFGSREGMEAKIKSGAIEVL 1737


>gi|348575712|ref|XP_003473632.1| PREDICTED: methylcytosine dioxygenase TET1-like [Cavia porcellus]
          Length = 2168

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 185/326 (56%), Positives = 237/326 (72%), Gaps = 15/326 (4%)

Query: 213  TEVPDCKC----FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
            TEVP C C       DK     G YYTHLGA  S+  +R+ +E R G+KGKA+R+EK++Y
Sbjct: 1398 TEVPSCDCPDRGIQKDK-----GPYYTHLGAGPSVAAVREIMETRCGHKGKAVRIEKLVY 1452

Query: 269  TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
            TGKEGK++QGCP+AK VIRR+S EE++L +V+ R GH C TA +V++IV W+G+P   +D
Sbjct: 1453 TGKEGKSSQGCPVAKKVIRRSSEEEEVLCLVRERPGHQCQTAVMVMLIVVWDGIPRPMAD 1512

Query: 329  GVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYA 387
             +Y  LT  L  Y G PT RRC  NE RTC CQG DP+TCGASFSFGCSWSMY+NGCK+ 
Sbjct: 1513 RLYTELTESLKSYNGHPTDRRCTLNENRTCTCQGTDPETCGASFSFGCSWSMYFNGCKFG 1572

Query: 388  RSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
            RS + R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG
Sbjct: 1573 RSPSPRRFRIDPSSPLNEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVARECRLG 1632

Query: 446  FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLY 502
             K GRPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P+DEQLHVLPLY
Sbjct: 1633 RKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVVPEDEQLHVLPLY 1692

Query: 503  IMDDSDEFGNKEAQEEKVNTGAIENL 528
             + D+DEFG+KE  E K+ +GA++ L
Sbjct: 1693 KLSDTDEFGSKEGMEAKIRSGAVQVL 1718



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/38 (60%), Positives = 30/38 (78%)

Query: 491  PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
            P DEQLHVLPLY + D+DEFG+KE  E K+ +GA++ L
Sbjct: 1794 PCDEQLHVLPLYKLSDTDEFGSKEGMEAKIRSGAVQVL 1831


>gi|73953303|ref|XP_536371.2| PREDICTED: methylcytosine dioxygenase TET1 [Canis lupus familiaris]
          Length = 2137

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 180/322 (55%), Positives = 234/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +++P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1418 SDLPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1476

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1477 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1536

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1537 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1596

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1597 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1656

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1657 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1716

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1717 TDEFGSREGMEAKIKSGAIEVL 1738


>gi|281346965|gb|EFB22549.1| hypothetical protein PANDA_001619 [Ailuropoda melanoleuca]
          Length = 2136

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/322 (56%), Positives = 233/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1416 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1474

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1475 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1534

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1535 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1594

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1595 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1654

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1655 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1714

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+ E  E K+ +GAIE L
Sbjct: 1715 TDEFGSSEGMEAKIQSGAIEVL 1736


>gi|338716538|ref|XP_003363468.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
            [Equus caballus]
          Length = 1811

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1091 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1149

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1150 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1209

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1210 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1269

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1270 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1329

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1330 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1389

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1390 TDEFGSREGMEAKIKSGAIEVL 1411


>gi|301755888|ref|XP_002913781.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Ailuropoda
            melanoleuca]
          Length = 2143

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 181/322 (56%), Positives = 233/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1423 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1481

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1482 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1541

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1542 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1601

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1602 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1661

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1662 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1721

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+ E  E K+ +GAIE L
Sbjct: 1722 TDEFGSSEGMEAKIQSGAIEVL 1743


>gi|260781795|ref|XP_002585985.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
 gi|229271061|gb|EEN41996.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
          Length = 326

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 185/323 (57%), Positives = 241/323 (74%), Gaps = 10/323 (3%)

Query: 214 EVPDCKC--FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
           E+P C C  F S+K     G YYTHLG   ++  +R+ +E+R G  GKALR+EKI+YTGK
Sbjct: 1   ELPTCNCVDFVSEKAE---GPYYTHLGTGPTIQAIRELMEKRFGQSGKALRIEKIIYTGK 57

Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
           EGK++QGCP+AKW++RR+S EEK+L +V+HR GH C++++I++ IVAWEG+   ++D +Y
Sbjct: 58  EGKSSQGCPIAKWIVRRSSEEEKVLTLVRHRPGHRCNSSYIIICIVAWEGIQRARADELY 117

Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             L+  L+K GLPTTRRC  N+ +TCACQG+D + CGASFSFGCSWSMYYNGCK+ARS+ 
Sbjct: 118 DYLSGTLSKAGLPTTRRCGVNDTKTCACQGVDDNNCGASFSFGCSWSMYYNGCKFARSRV 177

Query: 392 VRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG-- 449
            +KF+L   SEE  IE+    LA  + P+Y+ LAP AF NQ ++   AS+CRLGF P   
Sbjct: 178 PKKFKLEDPSEEAIIEDHFQRLAGEVGPVYEQLAPDAFRNQTEYSEVASDCRLGFGPDNT 237

Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT--KHRSLS-KPDDEQLHVLPLYIMDD 506
           RPFSGVTAC DFCAH+HRD HNMNNG T+V +LT  ++R L   P+DEQLHVLPLY M  
Sbjct: 238 RPFSGVTACVDFCAHAHRDQHNMNNGSTIVCTLTCPENRKLGPPPEDEQLHVLPLYKMAP 297

Query: 507 SDEFGNKEAQEEKVNTGAIENLN 529
           +DEF ++E QEEKV TGA+E L 
Sbjct: 298 TDEFDSEEGQEEKVRTGALEMLT 320


>gi|410975237|ref|XP_003994040.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1,
            partial [Felis catus]
          Length = 2153

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 180/322 (55%), Positives = 233/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +++P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1432 SDLPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAIREIMENRYGQKGNAIRIEIVVYTGKE 1490

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1491 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1550

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1551 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1610

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1611 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1670

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1671 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1730

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+ E  E K+ +GAIE L
Sbjct: 1731 TDEFGSSEGMEAKIKSGAIEVL 1752


>gi|12697897|dbj|BAB21767.1| KIAA1676 protein [Homo sapiens]
          Length = 735

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
           +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 16  SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 74

Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
           GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 75  GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 134

Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
            LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 135 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 194

Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
            R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 195 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 254

Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
           RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 255 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 314

Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
           +DEFG+KE  E K+ +GAIE L
Sbjct: 315 TDEFGSKEGMEAKIKSGAIEVL 336


>gi|119626584|gb|EAX06179.1| hCG21336 [Homo sapiens]
          Length = 839

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 178/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +EER G KGKA+R+E+++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  
Sbjct: 1   MEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEA 60

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A IV++I+ WEG+PL+ +D +Y+ LT  L KYG  T RRCA NE RTCACQGLDP+TCGA
Sbjct: 61  AVIVILILVWEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGA 120

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSMYYNGCK+ARSK  RKF+L      EE+++E  +  L+T ++P YK LAP 
Sbjct: 121 SFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPD 180

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRS 487
           A+ NQ ++E  A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +
Sbjct: 181 AYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDN 240

Query: 488 L---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
                KP+DEQLHVLPLY + D DEFG+ EAQEEK  +GAI+ L+
Sbjct: 241 REFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEEKKRSGAIQVLS 285


>gi|395508956|ref|XP_003758773.1| PREDICTED: methylcytosine dioxygenase TET3 [Sarcophilus harrisii]
          Length = 1685

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 190/345 (55%), Positives = 245/345 (71%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C+C     +  + G YYTHLGA  S+  +R+ 
Sbjct: 667  LKYLDTPTKNLLDTPSK---RAQAEFPVCECV-EQIVEKDEGPYYTHLGAGPSVAAIREL 722

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C  
Sbjct: 723  MEDRYGEKGKAIRIEKVVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQ 782

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A I+++I+ WEG+     D +Y  LT  L  YG PTTRRC  N+ RTCACQG DP TCGA
Sbjct: 783  AVIIILIMVWEGIGPELGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGA 842

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSK  RKFRLS  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 843  SFSFGCSWSMYFNGCKYARSKYPRKFRLSGDNPVEEENLRKHFQNLATQVAPLYKKLAPQ 902

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E EA +CRLG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 903  AYQNQVNNEEEAIDCRLGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 962

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY + ++DEFG++E Q  K+  GAI+ L 
Sbjct: 963  RLVGQIPEDEQLHVLPLYKIANTDEFGSEENQRAKMANGAIQVLT 1007


>gi|344275087|ref|XP_003409345.1| PREDICTED: methylcytosine dioxygenase TET1 [Loxodonta africana]
          Length = 2139

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 180/322 (55%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +++P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1419 SDLPSCSCV-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1477

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1478 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1537

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1538 ELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1597

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1598 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPIAYQNQVEYEHVARECRLGSKEG 1657

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1658 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1717

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1718 TDEFGSREGLEAKIKSGAIEVL 1739


>gi|316990462|gb|ADU77105.1| putative methylcytosine dioxygenase isoform 1 [Xenopus laevis]
          Length = 1924

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 244/337 (72%), Gaps = 9/337 (2%)

Query: 197  SKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGY 256
            +K ++D   ++    + E P C C        E G YYTHLG+  ++  +R+ +E+R G 
Sbjct: 963  TKSLIDTPAKM---AQAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEDRFGE 1018

Query: 257  KGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVI 316
            KG+A+R+EK++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C  A I+++I
Sbjct: 1019 KGEAIRIEKVIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILI 1078

Query: 317  VAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCS 376
            +AWEG+P    D +Y+ +T  + KYG PT+RRC  N+ RTCACQG DP+TCGASFSFGCS
Sbjct: 1079 MAWEGIPRALGDSLYSDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCS 1138

Query: 377  WSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
            WSMY+NGCKYARSKT RKFRL   +  EE+ + +    LAT ++P+Y+ LAP ++ NQ  
Sbjct: 1139 WSMYFNGCKYARSKTPRKFRLIGDNPKEEEFLNDNFQDLATKVAPVYQMLAPQSYENQVN 1198

Query: 435  FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
             E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R++ + P
Sbjct: 1199 NEEVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRTIGRIP 1258

Query: 492  DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
            +DEQLHVLPLY +  +DEFG+++ Q EK+  G I+ L
Sbjct: 1259 EDEQLHVLPLYKVSSTDEFGSEDGQAEKIRKGGIQVL 1295


>gi|426256086|ref|XP_004021676.1| PREDICTED: methylcytosine dioxygenase TET1, partial [Ovis aries]
          Length = 2146

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 177/319 (55%), Positives = 230/319 (72%), Gaps = 7/319 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1427 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1485

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y+
Sbjct: 1486 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1545

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1546 QLTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1605

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ   E  A ECRLG K G
Sbjct: 1606 PRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIARECRLGKKEG 1665

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  + S    P DEQLHVLPLY + D
Sbjct: 1666 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSD 1725

Query: 507  SDEFGNKEAQEEKVNTGAI 525
            +DEFG++E  E K+ +GAI
Sbjct: 1726 TDEFGSREGMEAKIKSGAI 1744


>gi|148237918|ref|NP_001090656.1| tet methylcytosine dioxygenase 3 [Xenopus (Silurana) tropicalis]
 gi|117558065|gb|AAI27290.1| LOC100036628 protein [Xenopus (Silurana) tropicalis]
          Length = 1901

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 179/322 (55%), Positives = 233/322 (72%), Gaps = 6/322 (1%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            + E P C C        E G YYTHLG+  ++  +R+ +EER G KG A+R+EK++YTGK
Sbjct: 951  QAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEERFGQKGDAIRIEKVIYTGK 1009

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C  A I+++I+AWEG+P +  D +Y
Sbjct: 1010 EGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILIMAWEGIPRSLGDSLY 1069

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              +T  + KYG PT+RRC  N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT
Sbjct: 1070 NDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKT 1129

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL   +  EE  +++    LAT ++P+YK LAP A+ NQ   E  A +CRLG K G
Sbjct: 1130 PRKFRLIGENPKEEDGLKDNFQNLATKVAPVYKMLAPQAYQNQVNNEDIAIDCRLGLKEG 1189

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + +  +DEQLHVLPLY +  
Sbjct: 1190 RPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRMIGRVAEDEQLHVLPLYKVST 1249

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E Q EK+  G I  L
Sbjct: 1250 TDEFGSEEGQLEKIKKGGIHVL 1271


>gi|316990464|gb|ADU77106.1| putative methylcytosine dioxygenase isoform 2 [Xenopus laevis]
          Length = 1915

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/338 (53%), Positives = 243/338 (71%), Gaps = 9/338 (2%)

Query: 197  SKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGY 256
            +K ++D   ++    + E P C C        E G YYTHLG+  ++  +R+ +EER G 
Sbjct: 956  TKSLIDTPAKM---AQAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEERFGE 1011

Query: 257  KGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVI 316
            KG+A+R+EK++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C  A I+++I
Sbjct: 1012 KGEAIRIEKVIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILI 1071

Query: 317  VAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCS 376
            +AWEG+P    D +Y  ++  + KYG PT+RRC  N+ RTCACQG DP+TCGASFSFGCS
Sbjct: 1072 MAWEGIPRALGDSLYDDISGTITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCS 1131

Query: 377  WSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
            WSMY+NGCKYARSKT RKFRL   +  EE+ +++    LAT ++P+YK LAP A+ NQ  
Sbjct: 1132 WSMYFNGCKYARSKTPRKFRLIGDNPKEEEFLKDSFQDLATKVAPVYKMLAPQAYQNQAN 1191

Query: 435  FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
             E  A +CRLG + GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + K  
Sbjct: 1192 NEDVAIDCRLGLEEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRMIGKIA 1251

Query: 492  DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            +DEQLHVLPLY +  +DEFG++E Q EK+  G I+ L+
Sbjct: 1252 EDEQLHVLPLYKVSTTDEFGSEERQLEKIRKGGIQVLS 1289


>gi|293345707|ref|XP_001077411.2| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
 gi|293357583|ref|XP_227694.5| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
          Length = 1920

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 182/335 (54%), Positives = 235/335 (70%), Gaps = 5/335 (1%)

Query: 200  MLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGK 259
            +L  +    ++  T V D  C A      + G YYTHLGA  ++  +R  +EER G KGK
Sbjct: 1041 VLTDVSESPSDSDTPVEDISCEACKNAEKDEGPYYTHLGAGPNVAAIRTIMEERFGEKGK 1100

Query: 260  ALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAW 319
            A+R+E+++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R  HTC TA IV+VI+ W
Sbjct: 1101 AIRIERVIYTGKEGKSSQGCPIAKWVYRRSSTEEKLLCLVRVRAKHTCDTAVIVIVILLW 1160

Query: 320  EGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSM 379
            +G+P   +  +Y+ LT  L+  G+ T RRCA NE R C CQG +P+TCGASFS+GCSWSM
Sbjct: 1161 DGIPKPLASELYSELTEILSNRGICTNRRCAQNENRNCCCQGENPETCGASFSYGCSWSM 1220

Query: 380  YYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFER 437
            YYNGCK+ARSK  RKFRL      EE+++   +  LAT I+P+YK LAP A+ NQ +FE 
Sbjct: 1221 YYNGCKFARSKNPRKFRLHGDEPKEEEKLGSHLQNLATVIAPIYKKLAPDAYRNQVEFEH 1280

Query: 438  EASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDE 494
             A ECRLG K GRPFSGVTAC DF AH+HRD  NM NG TVVV+LT+  +     +P+DE
Sbjct: 1281 RAIECRLGLKEGRPFSGVTACLDFSAHAHRDQQNMANGSTVVVTLTREDNREVGGQPEDE 1340

Query: 495  QLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            QLHVLPLY +   DEFG+ E QEEK+  G+I+ L+
Sbjct: 1341 QLHVLPLYTIATEDEFGSTEGQEEKILQGSIQVLH 1375


>gi|351698810|gb|EHB01729.1| Putative methylcytosine dioxygenase TET3 [Heterocephalus glaber]
          Length = 1721

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 188/345 (54%), Positives = 241/345 (69%), Gaps = 29/345 (8%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C  +                      +R+ 
Sbjct: 750  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCVVAS---------------------IREL 785

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 786  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 845

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 846  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 905

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRLS  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 906  SFSFGCSWSMYFNGCKYARSKTPRKFRLSGDNPKEEEVLRKSFQDLATEVAPLYKQLAPQ 965

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 966  AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1025

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 1026 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1070


>gi|444723357|gb|ELW64014.1| Methylcytosine dioxygenase TET3 [Tupaia chinensis]
          Length = 2326

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 195/386 (50%), Positives = 250/386 (64%), Gaps = 50/386 (12%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 1245 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 1300

Query: 250  IEERS-----------------------------------------GYKGKALRMEKILY 268
            +EER                                          G KGKA+R+EK++Y
Sbjct: 1301 MEERGDDDESAHVRECRESDRNWCTEHTVLAVSTEADSRLPHIHMYGEKGKAIRIEKVIY 1360

Query: 269  TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
            TGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  A IV++I+AWEG+P +  D
Sbjct: 1361 TGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGD 1420

Query: 329  GVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYAR 388
             +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYAR
Sbjct: 1421 TLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYAR 1480

Query: 389  SKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
            SKT RKFRL   +  EE+ + +    LAT ++PLYK LAP A+ NQ   E  A +CRLG 
Sbjct: 1481 SKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGL 1540

Query: 447  KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYI 503
            K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + K P+DEQLHVLPLY 
Sbjct: 1541 KEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYK 1600

Query: 504  MDDSDEFGNKEAQEEKVNTGAIENLN 529
            M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 1601 MASTDEFGSEENQNAKVGSGAIQVLT 1626


>gi|431904167|gb|ELK09589.1| Methylcytosine dioxygenase TET1 [Pteropus alecto]
          Length = 2135

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 180/322 (55%), Positives = 232/322 (72%), Gaps = 9/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W G+PL   D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRERTGHHCPTAVMVVLIMVWAGLPL--PDKLYT 1533

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATQLAPVYKQYAPVAYQNQVEYEHVARECRLGSKEG 1653

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +R L   P DEQLHVLPLY + D
Sbjct: 1654 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRLLGVIPQDEQLHVLPLYKLSD 1713

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1714 TDEFGSREGMEAKIRSGAIEVL 1735


>gi|281346571|gb|EFB22155.1| hypothetical protein PANDA_016408 [Ailuropoda melanoleuca]
          Length = 830

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 5/282 (1%)

Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           R G KGKA+R+E+++YTGKEGK++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A I
Sbjct: 1   RFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVI 60

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
           V++I+ WEG+PL+ +D +Y+ LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFS
Sbjct: 61  VILILVWEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFS 120

Query: 373 FGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFT 430
           FGCSWSMYYNGCK+ARSK  RKF+L      EE+++E  +  L+T ++P YK LAP A+ 
Sbjct: 121 FGCSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYN 180

Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL 488
           NQ ++E  A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +R +
Sbjct: 181 NQIEYEHRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREI 240

Query: 489 -SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
             KP+DEQLHVLPLY + D DEFG+ EAQE+K   GAI+ L+
Sbjct: 241 GGKPEDEQLHVLPLYKVSDVDEFGSVEAQEKKKQNGAIQVLS 282


>gi|359069958|ref|XP_002691249.2| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
          Length = 938

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 178/285 (62%), Positives = 222/285 (77%), Gaps = 5/285 (1%)

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 1   MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 60

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 61  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 120

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 121 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 180

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
           A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 181 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240

Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 241 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 285


>gi|241896976|ref|NP_081660.1| methylcytosine dioxygenase TET1 isoform 2 [Mus musculus]
 gi|239977645|sp|Q3URK3.2|TET1_MOUSE RecName: Full=Methylcytosine dioxygenase TET1; AltName:
            Full=CXXC-type zinc finger protein 6; AltName:
            Full=Ten-eleven translocation 1 gene protein homolog
          Length = 2007

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/350 (51%), Positives = 242/350 (69%), Gaps = 12/350 (3%)

Query: 187  PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
            P T  A+    + ++D + +   N+       E   C C    +   E G YYTHLGA  
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392

Query: 242  SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
            S+  +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+  EEKL+ +V+ 
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452

Query: 302  RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
            R  H CSTA IVV+I+ WEG+P   +D +Y  LT  L  Y G PT RRC  N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512

Query: 361  GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTIS 418
            G+DP TCGASFSFGCSWSMY+NGCK+ RS+  RKFRL+      E+++E+ +  LAT ++
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLA 1572

Query: 419  PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
            PLYK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 1573 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 1632

Query: 479  VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            V +L +   R  + P+DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 1633 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 1682


>gi|18490118|gb|AAH22243.1| TET3 protein [Homo sapiens]
 gi|62702130|gb|AAX93057.1| unknown [Homo sapiens]
 gi|168272980|dbj|BAG10329.1| KIAA0401 protein [synthetic construct]
 gi|313882564|gb|ADR82768.1| Unknown protein [synthetic construct]
          Length = 937

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 177/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 1   MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 60

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
           A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 61  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 120

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 121 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 180

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
           A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 181 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240

Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
           R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 241 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 285


>gi|392338377|ref|XP_003753514.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 2 [Rattus
            norvegicus]
          Length = 2008

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 177/319 (55%), Positives = 229/319 (71%), Gaps = 6/319 (1%)

Query: 214  EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            E   C+C   D  P  + G YYTHLGA  S+  +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1364 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1423

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++QGCP+AKWVIRR+  EEK++ +V+ R  H CSTA IVV+I+ WEG+P   +D +Y 
Sbjct: 1424 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1483

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS  
Sbjct: 1484 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1543

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL+      E+++EE +  LAT ++P+YK +AP A+ NQ ++E  A +CRLG + G
Sbjct: 1544 PRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAPVAYQNQVEYEDIAGDCRLGNEEG 1603

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMDDS 507
            RPFSGVT C DFCAHSH+D+HNMNNG TVV +L +   R  S P DEQLHVLPLY + D+
Sbjct: 1604 RPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTLIREDGRDRSVPGDEQLHVLPLYRLADT 1663

Query: 508  DEFGNKEAQEEKVNTGAIE 526
            DEFG+ E  + K+ +GAI+
Sbjct: 1664 DEFGSVEGMKAKIQSGAIQ 1682


>gi|157057152|ref|NP_001035490.2| methylcytosine dioxygenase TET2 [Mus musculus]
 gi|239938840|sp|Q4JK59.3|TET2_MOUSE RecName: Full=Methylcytosine dioxygenase TET2; AltName: Full=Protein
            Ayu17-449
          Length = 1912

 Score =  367 bits (942), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)

Query: 233  YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
            YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119

Query: 293  EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
            EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1179

Query: 353  EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
            E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE+ +   +
Sbjct: 1180 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 1239

Query: 411  HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
              LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD  
Sbjct: 1240 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 1299

Query: 471  NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
            NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE 
Sbjct: 1300 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 1359

Query: 528  L 528
            L
Sbjct: 1360 L 1360


>gi|148700127|gb|EDL32074.1| mCG11334 [Mus musculus]
          Length = 630

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 173/303 (57%), Positives = 226/303 (74%), Gaps = 5/303 (1%)

Query: 229 EPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRR 288
           E G YYTHLGA  S+  +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR
Sbjct: 3   EKGPYYTHLGAGPSVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRR 62

Query: 289 ASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTR 347
           +  EEKL+ +V+ R  H CSTA IVV+I+ WEG+P   +D +Y  LT  L  Y G PT R
Sbjct: 63  SGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDR 122

Query: 348 RCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQE 405
           RC  N+ RTC CQG+DP TCGASFSFGCSWSMY+NGCK+ RS+  RKFRL+      E++
Sbjct: 123 RCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQ 182

Query: 406 IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHS 465
           +E+ +  LAT ++PLYK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHS
Sbjct: 183 LEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHS 242

Query: 466 HRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTG 523
           H+D+HNM+NG TVV +L +   R  + P+DEQLHVLPLY + D+DEFG+ E  + K+ +G
Sbjct: 243 HKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSG 302

Query: 524 AIE 526
           AI+
Sbjct: 303 AIQ 305


>gi|359718960|ref|NP_001240786.1| methylcytosine dioxygenase TET1 isoform 1 [Mus musculus]
          Length = 2039

 Score =  364 bits (935), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 182/382 (47%), Positives = 244/382 (63%), Gaps = 44/382 (11%)

Query: 187  PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
            P T  A+    + ++D + +   N+       E   C C    +   E G YYTHLGA  
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392

Query: 242  SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
            S+  +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+  EEKL+ +V+ 
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452

Query: 302  RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
            R  H CSTA IVV+I+ WEG+P   +D +Y  LT  L  Y G PT RRC  N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512

Query: 361  GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS---------------------- 398
            G+DP TCGASFSFGCSWSMY+NGCK+ RS+  RKFRL+                      
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDV 1572

Query: 399  ------------VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
                        +  EE+++E+ +  LAT ++PLYK +AP A+ NQ ++E  A +CRLG 
Sbjct: 1573 KTGWIIPDRKTLISREEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGN 1632

Query: 447  KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIM 504
            + GRPFSGVT C DFCAHSH+D+HNM+NG TVV +L +   R  + P+DEQLHVLPLY +
Sbjct: 1633 EEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRL 1692

Query: 505  DDSDEFGNKEAQEEKVNTGAIE 526
             D+DEFG+ E  + K+ +GAI+
Sbjct: 1693 ADTDEFGSVEGMKAKIKSGAIQ 1714


>gi|262225296|gb|ACY38291.1| tet oncogene 1 [Mus musculus]
          Length = 2039

 Score =  364 bits (935), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 182/382 (47%), Positives = 244/382 (63%), Gaps = 44/382 (11%)

Query: 187  PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
            P T  A+    + ++D + +   N+       E   C C    +   E G YYTHLGA  
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392

Query: 242  SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
            S+  +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+  EEKL+ +V+ 
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452

Query: 302  RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
            R  H CSTA IVV+I+ WEG+P   +D +Y  LT  L  Y G PT RRC  N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512

Query: 361  GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS---------------------- 398
            G+DP TCGASFSFGCSWSMY+NGCK+ RS+  RKFRL+                      
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDV 1572

Query: 399  ------------VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
                        +  EE+++E+ +  LAT ++PLYK +AP A+ NQ ++E  A +CRLG 
Sbjct: 1573 KTGWIIPDRKTLISREEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGN 1632

Query: 447  KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIM 504
            + GRPFSGVT C DFCAHSH+D+HNM+NG TVV +L +   R  + P+DEQLHVLPLY +
Sbjct: 1633 EEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRL 1692

Query: 505  DDSDEFGNKEAQEEKVNTGAIE 526
             D+DEFG+ E  + K+ +GAI+
Sbjct: 1693 ADTDEFGSVEGMKAKIKSGAIQ 1714


>gi|74140016|dbj|BAE31842.1| unnamed protein product [Mus musculus]
 gi|74151946|dbj|BAE32012.1| unnamed protein product [Mus musculus]
          Length = 991

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)

Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
           YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198

Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
           EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258

Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
           E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE+ +   +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318

Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
             LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD  
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378

Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
           NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE 
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438

Query: 528 L 528
           L
Sbjct: 439 L 439


>gi|392338375|ref|XP_003753513.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 1 [Rattus
            norvegicus]
          Length = 2040

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 178/351 (50%), Positives = 231/351 (65%), Gaps = 38/351 (10%)

Query: 214  EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            E   C+C   D  P  + G YYTHLGA  S+  +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1364 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1423

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++QGCP+AKWVIRR+  EEK++ +V+ R  H CSTA IVV+I+ WEG+P   +D +Y 
Sbjct: 1424 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1483

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS  
Sbjct: 1484 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1543

Query: 392  VRKFRLS----------------------------------VRSEEQEIEEKMHLLATTI 417
             RKFRL+                                  +  EE+++EE +  LAT +
Sbjct: 1544 PRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENLQDLATVL 1603

Query: 418  SPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCT 477
            +P+YK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG T
Sbjct: 1604 APVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGST 1663

Query: 478  VVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            VV +L +   R  S P DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 1664 VVCTLIREDGRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 1714


>gi|392355330|ref|XP_003752007.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1
            [Rattus norvegicus]
          Length = 2038

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 178/351 (50%), Positives = 231/351 (65%), Gaps = 38/351 (10%)

Query: 214  EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            E   C+C   D  P  + G YYTHLGA  S+  +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1362 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1421

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++QGCP+AKWVIRR+  EEK++ +V+ R  H CSTA IVV+I+ WEG+P   +D +Y 
Sbjct: 1422 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1481

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS  
Sbjct: 1482 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1541

Query: 392  VRKFRLS----------------------------------VRSEEQEIEEKMHLLATTI 417
             RKFRL+                                  +  EE+++EE +  LAT +
Sbjct: 1542 PRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENLQDLATVL 1601

Query: 418  SPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCT 477
            +P+YK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG T
Sbjct: 1602 APVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGST 1661

Query: 478  VVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            VV +L +   R  S P DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 1662 VVCTLIREDGRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 1712


>gi|262225298|gb|ACY38292.1| tet oncogene 2 [Mus musculus]
          Length = 1921

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 174/308 (56%), Positives = 222/308 (72%), Gaps = 12/308 (3%)

Query: 233  YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
            YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1062 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1121

Query: 293  EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
            EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1122 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1181

Query: 353  E-------PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEE 403
            E       PR C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE
Sbjct: 1182 ETKKKQSPPRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEE 1241

Query: 404  QEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCA 463
            + +   +  LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF A
Sbjct: 1242 ERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSA 1301

Query: 464  HSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKV 520
            HSHRD  NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+
Sbjct: 1302 HSHRDQQNMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKI 1361

Query: 521  NTGAIENL 528
              G+IE L
Sbjct: 1362 RMGSIEVL 1369


>gi|74191515|dbj|BAE30334.1| unnamed protein product [Mus musculus]
          Length = 992

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)

Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
           YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198

Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
           EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258

Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
           E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE+ +   +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318

Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
             LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD  
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378

Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
           NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE 
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438

Query: 528 L 528
           L
Sbjct: 439 L 439


>gi|74142256|dbj|BAE31892.1| unnamed protein product [Mus musculus]
 gi|74214512|dbj|BAE31106.1| unnamed protein product [Mus musculus]
          Length = 991

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)

Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
           YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198

Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
           EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258

Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
           E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE+ +   +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318

Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
             LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD  
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378

Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
           NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE 
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438

Query: 528 L 528
           L
Sbjct: 439 L 439


>gi|443702254|gb|ELU00383.1| hypothetical protein CAPTEDRAFT_102094, partial [Capitella teleta]
          Length = 316

 Score =  361 bits (926), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 218/301 (72%), Gaps = 2/301 (0%)

Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
           G YYTHLGA  ++P +R+ +E+R    G ALR+EK++YTG+EGK+ QGCP+AKW++RR+S
Sbjct: 12  GPYYTHLGAGPTVPAIRELMEKRMNITGDALRIEKVIYTGREGKSPQGCPVAKWILRRSS 71

Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
            +EK ++IV+ R GHTC TA +V+ IV W+G+P  Q+ G+Y  L + L   G  T RRC 
Sbjct: 72  KDEKCMVIVRQRPGHTCPTAIMVIAIVVWDGIPETQATGLYDYLRHTLPDNGHETERRCG 131

Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS--KTVRKFRLSVRSEEQEIEE 408
           TNE RTCACQG      GASF+FGCSWSMY+NGCKYA+S    V +FRL    EE  +E 
Sbjct: 132 TNEKRTCACQGWSDAVGGASFTFGCSWSMYFNGCKYAKSSDSKVHRFRLRDPMEEPILER 191

Query: 409 KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
            +  LAT I PLYK +AP ++ N    E EA++CRLG++ GRPF GVTA  DFCAH+H+D
Sbjct: 192 HLQTLATDIGPLYKMVAPDSYANMTALEDEATDCRLGYRRGRPFGGVTAVVDFCAHAHKD 251

Query: 469 LHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
            HNMNNGCTVV +LTKHR L KP+DEQLHVLPL +++  DEFG+ + Q  K+ +GAIE L
Sbjct: 252 QHNMNNGCTVVATLTKHRGLEKPEDEQLHVLPLCVLESKDEFGSVDNQFAKIRSGAIEWL 311

Query: 529 N 529
            
Sbjct: 312 T 312


>gi|380803039|gb|AFE73395.1| methylcytosine dioxygenase TET1, partial [Macaca mulatta]
          Length = 680

 Score =  360 bits (925), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 169/288 (58%), Positives = 216/288 (75%), Gaps = 6/288 (2%)

Query: 247 RKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHT 306
           R+ +E R G KG A+R+E ++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH 
Sbjct: 1   REIMENRYGQKGNAIRIEIVVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHH 60

Query: 307 CSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPD 365
           C TA +VV+I+ W+G+PL  +D +Y  LT  L  Y G PT RRC  NE RTC CQG+DP+
Sbjct: 61  CPTAVMVVLIMVWDGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPE 120

Query: 366 TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKA 423
           TCGASFSFGCSWSMY+NGCK+ RS + R+FR+   S   E+ +E+ +  LAT ++P+YK 
Sbjct: 121 TCGASFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQ 180

Query: 424 LAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT 483
            AP A+ NQ ++E  A ECRLG K GRPFSGVTAC DFCAH HRD+HNMNNG TVV +LT
Sbjct: 181 YAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLT 240

Query: 484 K--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           +  +RSL   P DEQLHVLPLY + D+DEFG+KE  E K+ +GAIE L
Sbjct: 241 REDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGAIEVL 288


>gi|345322870|ref|XP_003430647.1| PREDICTED: methylcytosine dioxygenase TET2 [Ornithorhynchus anatinus]
          Length = 1462

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 166/290 (57%), Positives = 216/290 (74%), Gaps = 9/290 (3%)

Query: 207  LKNNMRTEV------PDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKA 260
            +KN M T V      P C C     +  + G +YTHLGA  ++  +R+ +EER G KGKA
Sbjct: 1109 IKNLMDTPVKTQYDFPSCSCV-EHIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKA 1167

Query: 261  LRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWE 320
            +R+E+++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA +V++I+ WE
Sbjct: 1168 IRIERVIYTGKEGKSSQGCPIAKWVVRRSSDEEKLLCLVRERAGHTCETAVVVILILVWE 1227

Query: 321  GVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMY 380
            G+PL+ +D +Y+ LT  L KYG  T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY
Sbjct: 1228 GIPLSLADRLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMY 1287

Query: 381  YNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFERE 438
            YNGCK+ARSK  RKF+L      EE+++E  +  L+T ++P+YK LAP A+ NQ ++E  
Sbjct: 1288 YNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPIYKKLAPDAYNNQIEYEHR 1347

Query: 439  ASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL 488
            A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T++ + T+ + +
Sbjct: 1348 APECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLLGAATEFKDV 1397


>gi|37360506|dbj|BAC98231.1| mKIAA1676 protein [Mus musculus]
          Length = 625

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 177/350 (50%), Positives = 232/350 (66%), Gaps = 39/350 (11%)

Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
           E   C C    +   E G YYTHLGA  S+  +R+ +E R G KGKA+R+EKI++TGKEG
Sbjct: 17  EAAPCDCDGGTQ--KEKGPYYTHLGAGPSVAAVRELMETRFGQKGKAIRIEKIVFTGKEG 74

Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
           K++QGCP+AKWVIRR+  EEKL+ +V+ R  H CSTA IVV+I+ WEG+P   +D +Y  
Sbjct: 75  KSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRLMADRLYKE 134

Query: 334 LTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
           LT  L  Y G PT RRC  N+ RTC CQG+DP TCGASFSFGCSWSMY+NGCK+ RS+  
Sbjct: 135 LTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGCKFGRSENP 194

Query: 393 RKFRLS----------------------------------VRSEEQEIEEKMHLLATTIS 418
           RKFRL+                                  +  EE+++E+ +  LAT ++
Sbjct: 195 RKFRLAPNYPLHNYYKRITGMSSEGSDVKTGWIIPDRKTLISREEKQLEKNLQELATVLA 254

Query: 419 PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
           PLYK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 255 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 314

Query: 479 VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
           V +L +   R  + P+DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 315 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 364


>gi|291230173|ref|XP_002735044.1| PREDICTED: CXXC finger 5-like [Saccoglossus kowalevskii]
          Length = 1354

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 173/343 (50%), Positives = 245/343 (71%), Gaps = 11/343 (3%)

Query: 196 NSKEMLDHIERLKNNMRTE---VPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEE 252
           N +E   H   +K++   E   +P+C C   +    E G YYT LGA  ++ ++R+ +E+
Sbjct: 421 NMEETPTHQLTIKDDKTEEKIVIPNCGCV-DNPNEKEEGPYYTQLGAGRTIAEIREIMEK 479

Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           R G  GKA+R+E+++YTGKEGK + GCP+AKWVIRR+S EEK+L++V+HR  H C+TA I
Sbjct: 480 RYGDTGKAIRIEQVIYTGKEGKGSMGCPIAKWVIRRSSSEEKVLVVVRHRVNHHCATAVI 539

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
           V+ IVAWE +  ++++  Y  L   L+K+G PT RRC TNE ++CACQG D +  GASFS
Sbjct: 540 VIAIVAWEALSSDKTNDAYDWLRTTLSKHGNPTVRRCGTNEEKSCACQGYDSEKSGASFS 599

Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ---EIEEKMHLLATTISPLYKALAPGAF 429
           FGCSWSMYYNGCK+ARSKT +KF+L   ++ +   ++E ++  LAT I+P+YK +AP ++
Sbjct: 600 FGCSWSMYYNGCKFARSKTPKKFKLGNNADSRKDVKLEHRLQTLATLIAPIYKKMAPESY 659

Query: 430 TNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RS 487
            NQ   E+E+  CRLG++ GRPFSGVTAC DFCAH+H+D HNMN GCT +++LT    R+
Sbjct: 660 ANQSAHEQESLPCRLGYEEGRPFSGVTACVDFCAHAHKDQHNMNTGCTTLLTLTGEEIRT 719

Query: 488 LSKP--DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           ++KP   DEQLHVLPLY +   DE+G+ E Q+EK+  G++E L
Sbjct: 720 IAKPRGADEQLHVLPLYKISPVDEYGSFEGQQEKIKNGSLEIL 762


>gi|449488387|ref|XP_002188340.2| PREDICTED: methylcytosine dioxygenase TET3-like [Taeniopygia
           guttata]
          Length = 1419

 Score =  353 bits (907), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 171/282 (60%), Positives = 211/282 (74%), Gaps = 7/282 (2%)

Query: 255 GYKGKALRMEKILYTGKEGKTTQGCPLAKW--VIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           G KGKA+R+EK++Y GKEGK+ +GC +AKW  VIRR + EEKLL +V+HR GH C  A I
Sbjct: 503 GRKGKAIRIEKVMYAGKEGKSFRGCTIAKWMSVIRRHNQEEKLLCLVRHRAGHHCQNAVI 562

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
           +++I+AWEG+P    D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGASFS
Sbjct: 563 IILILAWEGIPRTLGDTLYQELTDTLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFS 622

Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFT 430
           FGCSWSMY+NGCKYARSKT RKFRL   +  EE+ +      LAT ++PLYK LAP A+ 
Sbjct: 623 FGCSWSMYFNGCKYARSKTPRKFRLVGDNPKEEELLRRSFQDLATEVAPLYKRLAPQAYQ 682

Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL 488
           NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R +
Sbjct: 683 NQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRVV 742

Query: 489 SK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L 
Sbjct: 743 GKIPEDEQLHVLPLYKMSSTDEFGSEENQNAKVGSGAIQVLT 784


>gi|440910400|gb|ELR60199.1| Putative methylcytosine dioxygenase TET2, partial [Bos grunniens
            mutus]
          Length = 1394

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 158/268 (58%), Positives = 204/268 (76%), Gaps = 3/268 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+  EEKLL +V+ R GHTC  A IV++I+ WEG+P++ +D +Y+ 
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVV 479
            FSGVTAC DFCAH+HRDL NM NG T+V
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLV 1394


>gi|297293151|ref|XP_001082840.2| PREDICTED: probable methylcytosine dioxygenase TET2-like [Macaca
            mulatta]
          Length = 1973

 Score =  348 bits (894), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 171/321 (53%), Positives = 219/321 (68%), Gaps = 32/321 (9%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A I + IV    +P          
Sbjct: 1186 KSSQGCPIAKWVVRRSSSEEKLLCLVRERGGHTCEAAVISIGIVLCVVMP---------- 1235

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
                              N  RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1236 ----------------RLNTERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1279

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1280 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1339

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1340 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1399

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1400 EFGSVEAQEEKKRSGAIQVLS 1420


>gi|321462649|gb|EFX73671.1| hypothetical protein DAPPUDRAFT_200491 [Daphnia pulex]
          Length = 401

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 153/238 (64%), Positives = 195/238 (81%), Gaps = 4/238 (1%)

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +E+R+G  G+A+R EKI+YTGKEGKT QGCP+AKW+IRR+SLEEK+L ++K R+GH C T
Sbjct: 1   MEQRTGLAGRAIRFEKIIYTGKEGKTAQGCPIAKWIIRRSSLEEKVLCLIKERRGHRCQT 60

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            W++V+ VAWEG+ L  SD +Y  L  +LN +G+ T RRCATNE RTCACQGLDPDTCGA
Sbjct: 61  TWLIVISVAWEGLALRDSDYLYGELVYRLNAHGVATNRRCATNEDRTCACQGLDPDTCGA 120

Query: 370 SFSFGCSWSMYYNGCKYARSK--TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPG 427
           SFSFGCSWSM++NGCK+ARSK  TVRKFRL+  S+E ++ +++   AT I+PLYK +AP 
Sbjct: 121 SFSFGCSWSMFFNGCKFARSKQQTVRKFRLTDESQEADMGDRLQRFATAIAPLYKRIAPD 180

Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH 485
           A+ NQ QFE +A +CRLG  PGRPF+GVTACFDFCAHSH+D+H+MNNGCT  V+L +H
Sbjct: 181 AYANQVQFEGKAVDCRLGLAPGRPFAGVTACFDFCAHSHKDIHDMNNGCT--VNLLRH 236


>gi|345321271|ref|XP_001520561.2| PREDICTED: methylcytosine dioxygenase TET1-like [Ornithorhynchus
            anatinus]
          Length = 2358

 Score =  344 bits (883), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 160/270 (59%), Positives = 200/270 (74%), Gaps = 3/270 (1%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            + E P C C     +  + G YYTHLG   S+  +R+ +E R G KG+A+R+E ++YTGK
Sbjct: 2076 QAEFPTCNCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMETRYGAKGRAIRIEVVVYTGK 2134

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y
Sbjct: 2135 EGKSSQGCPIAKWVIRRSSNEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 2194

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              LT  L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK 
Sbjct: 2195 QELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 2254

Query: 392  VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FRL     +QE  +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K G
Sbjct: 2255 PRRFRLLTDDPKQEESLENNLQNLATDVAPVYKKLAPDAFQNQVENEHLGPDCRLGCKDG 2314

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVV 479
            RPFSGVTAC DFCAH+H+D HNM+NG TVV
Sbjct: 2315 RPFSGVTACIDFCAHAHKDTHNMHNGSTVV 2344


>gi|149043923|gb|EDL97374.1| CXXC finger 6 (predicted) [Rattus norvegicus]
          Length = 608

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 162/282 (57%), Positives = 209/282 (74%), Gaps = 5/282 (1%)

Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
           +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+  EEK++ +V+ R  H CST
Sbjct: 1   METRYGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCST 60

Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCG 368
           A IVV+I+ WEG+P   +D +Y  LT  L  Y G PT RRC  N+ RTC CQG +P TCG
Sbjct: 61  AVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCG 120

Query: 369 ASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAP 426
           ASFSFGCSWSMY+NGCK+ RS   RKFRL+      E+++EE +  LAT ++P+YK +AP
Sbjct: 121 ASFSFGCSWSMYFNGCKFGRSANPRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAP 180

Query: 427 GAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH- 485
            A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG TVV +L +  
Sbjct: 181 VAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTLIRED 240

Query: 486 -RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            R  S P DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 241 GRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 282


>gi|198433354|ref|XP_002125458.1| PREDICTED: similar to Protein TET2 [Ciona intestinalis]
          Length = 1706

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 171/328 (52%), Positives = 220/328 (67%), Gaps = 17/328 (5%)

Query: 215  VPDCKCFAS----DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTG 270
             P C C       ++LPP    YYTH+GA+ S+  +RK  EER G+ G+ALR+EK+ YTG
Sbjct: 767  FPRCTCIPGSDGLEELPP----YYTHIGASHSIQGIRKLFEERCGFTGRALRIEKVCYTG 822

Query: 271  KEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGV 330
            KEGKT++GCP+AKWV+RR+S +EK++++ + R GH C TA +VVVI+ WEGV    +D  
Sbjct: 823  KEGKTSRGCPIAKWVLRRSSEQEKIMVVCRQRPGHRCITAVMVVVIMLWEGVSRPLADFS 882

Query: 331  YAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
            Y   T  +   G  T RRC TNE RTCACQG DP+  GAS+SFGCSWSMYYNGCK+ARS 
Sbjct: 883  YNKCTQLIPTNGTATERRCGTNEERTCACQGFDPEKGGASYSFGCSWSMYYNGCKFARST 942

Query: 391  TVRKFRLSVRSE---EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
               KF+L+   +   E  + +    LA+ +S LYK  AP A  NQ + E E  ECRLG+ 
Sbjct: 943  KPNKFKLNGTKDSNAESCVADFCQRLASAMSVLYKTAAPDAHMNQIERECEGQECRLGYN 1002

Query: 448  P---GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLS-KPDDEQLHVLPL 501
            P   GRPFSGVT C DFCAH+H+D HNM NG T+V++LTK   R +  KP DEQLHVLPL
Sbjct: 1003 PPNEGRPFSGVTCCMDFCAHAHKDQHNMENGTTLVLTLTKPELRVIGQKPPDEQLHVLPL 1062

Query: 502  YIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            Y +D ++E G  E   +K+  G+IE LN
Sbjct: 1063 YKLDLTNEEGTFEGVGQKIREGSIEILN 1090


>gi|350597142|ref|XP_003484366.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Sus scrofa]
          Length = 1048

 Score =  336 bits (862), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 157/284 (55%), Positives = 202/284 (71%), Gaps = 4/284 (1%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 763  SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 821

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y+
Sbjct: 822  GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRAGHHCPTAVMVVLIMVWDGIPLPLADRLYS 881

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQGLDP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 882  ELTESLKSYNGHPTDRRCTLNENRTCTCQGLDPETCGASFSFGCSWSMYFNGCKFGRSPS 941

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ IE+ +  LAT ++P+YK  AP A+ NQ  FE  A ECRLG K G
Sbjct: 942  PRRFRIDPSSPLHEKNIEDNLQTLATELAPIYKQYAPVAYENQVAFEHVARECRLGKKEG 1001

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDD 493
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +     S + P +
Sbjct: 1002 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTWVAFDSKAPPQN 1045


>gi|117167823|gb|AAI10511.2| TET2 protein [Homo sapiens]
 gi|117167991|gb|AAI10510.1| TET2 protein [Homo sapiens]
          Length = 805

 Score =  330 bits (845), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 155/251 (61%), Positives = 191/251 (76%), Gaps = 5/251 (1%)

Query: 284 WVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGL 343
           WV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ LT  L KYG 
Sbjct: 1   WVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGT 60

Query: 344 PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRS 401
            T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  RKF+L      
Sbjct: 61  LTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPK 120

Query: 402 EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
           EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRPFSGVTAC DF
Sbjct: 121 EEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDF 180

Query: 462 CAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
           CAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D DEFG+ EAQEE
Sbjct: 181 CAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEE 240

Query: 519 KVNTGAIENLN 529
           K  +GAI+ L+
Sbjct: 241 KKRSGAIQVLS 251


>gi|354475486|ref|XP_003499959.1| PREDICTED: methylcytosine dioxygenase TET1 [Cricetulus griseus]
          Length = 1956

 Score =  327 bits (839), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 164/321 (51%), Positives = 220/321 (68%), Gaps = 13/321 (4%)

Query: 214  EVPDCKC---FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTG 270
            E P C C   + +DK     G YYTHLGA  S+  +R+ +E R   KGKA+R+EKI Y G
Sbjct: 1329 EGPPCDCKGEYQTDK-----GPYYTHLGAGPSVAAIRELMETRYCEKGKAIRIEKIEYMG 1383

Query: 271  KEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGV 330
            KE K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ +    +D +
Sbjct: 1384 KESKSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPPLADHL 1443

Query: 331  YAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
            Y  +T+ L  Y G PT RRC  NE RTC CQGL+P TCGASFSFGCSWSMY NGCK+ RS
Sbjct: 1444 YDEITDNLRSYSGHPTDRRCTFNEKRTCTCQGLNPRTCGASFSFGCSWSMYLNGCKFGRS 1503

Query: 390  KTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
               RKF+L+      E++IE  ++ +A T++P+YK +AP A+ NQ ++E  A++CRLG K
Sbjct: 1504 PNPRKFKLAPNYPLNEKKIEGILNKVADTLAPIYKQMAPVAYQNQVKYEDVAADCRLGTK 1563

Query: 448  PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMD 505
             GRPFSGVT C DFCAHSH+D HNM NG TVV++L +   R  +   DEQ HVLPL+ + 
Sbjct: 1564 KGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLLRKDARDRNNLQDEQFHVLPLHRLA 1623

Query: 506  DSDEFGNKEAQEEKVNTGAIE 526
            D+DEFG++E  E K+ +GAIE
Sbjct: 1624 DTDEFGSREGMEAKIRSGAIE 1644


>gi|47223312|emb|CAF98696.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1615

 Score =  324 bits (830), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 157/278 (56%), Positives = 193/278 (69%), Gaps = 31/278 (11%)

Query: 255  GYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVV 314
            G KG A+R+E ++YTGKEGK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V+
Sbjct: 970  GAKGNAVRVEVVVYTGKEGKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVI 1029

Query: 315  VIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFG 374
            +I+AWEG+    +DG+Y  LT  L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFG
Sbjct: 1030 LILAWEGISRPVADGLYQELTRTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFG 1089

Query: 375  CSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
            CSWSMY+NGCK+ARSK  RKFRL                             G +  + +
Sbjct: 1090 CSWSMYFNGCKFARSKVPRKFRLQ----------------------------GDYPEEVE 1121

Query: 435  FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
             E    +CRLG + GRPFSGVTAC DFCAH+HRD  NMNNG TVV +LTK  +R++   P
Sbjct: 1122 NEEAGRDCRLGQREGRPFSGVTACVDFCAHAHRDTQNMNNGSTVVCTLTKEDNRAVRNVP 1181

Query: 492  DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            +DEQLHVLPLY + D DEFG  E Q  K+ +GA++ L+
Sbjct: 1182 EDEQLHVLPLYRISDRDEFGQVEGQWAKIRSGALQVLS 1219


>gi|68342456|gb|AAY90126.1| Ayu17-449 [Mus musculus]
          Length = 1919

 Score =  321 bits (822), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 166/310 (53%), Positives = 213/310 (68%), Gaps = 16/310 (5%)

Query: 233  YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
            YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119

Query: 293  EKLLLIVKHRQGHTCSTAWIVVVIVAW---EGVPLNQSDGVYAILTN--KLNKYGLPT-- 345
            EKLL +V+ R  HTC TA +V+  V     +   +      Y  L     +++  L +  
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIASVVGRNPKATRIRTLLRTYRYLGQVWHMHQPSLFSDE 1179

Query: 346  TRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE 405
            T++  +   R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  R  E +
Sbjct: 1180 TKKKQSPPSRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRL--RGAEPK 1237

Query: 406  IEEKM--HL--LATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
             EE++  HL  LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF
Sbjct: 1238 EEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDF 1297

Query: 462  CAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
             AHSHRD  NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+
Sbjct: 1298 SAHSHRDQQNMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEK 1357

Query: 519  KVNTGAIENL 528
            K+  G+IE L
Sbjct: 1358 KIRMGSIEVL 1367


>gi|326923418|ref|XP_003207933.1| PREDICTED: methylcytosine dioxygenase TET1-like [Meleagris gallopavo]
          Length = 1500

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 164/322 (50%), Positives = 199/322 (61%), Gaps = 45/322 (13%)

Query: 212  RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            ++E+P C C     +  + G YYTHLG                                 
Sbjct: 779  QSELPTCDCV-EQIIEKDEGPYYTHLGTG------------------------------- 806

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
                    P    VIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P   +D +Y
Sbjct: 807  --------PSVAAVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 858

Query: 332  AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
              LT  L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK 
Sbjct: 859  KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 918

Query: 392  VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             RKFRL     +QE  +E  +  LAT ++P+YK LAP AF NQ + E    +CRLG K G
Sbjct: 919  PRKFRLLTDDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGSKDG 978

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK    R    P DEQLHVLPLY +  
Sbjct: 979  RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1038

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG +E  E K+  GAI+ L
Sbjct: 1039 TDEFGTEEGLEAKIKAGAIQVL 1060


>gi|47219959|emb|CAG11492.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 400

 Score =  312 bits (799), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 146/228 (64%), Positives = 177/228 (77%), Gaps = 2/228 (0%)

Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           RSG  G A+R+EK++YTGKEGK+TQGCP+AKWVIRR S +EKLL++V+ R GHTC+TA I
Sbjct: 1   RSGITGSAIRIEKVVYTGKEGKSTQGCPIAKWVIRRGSEKEKLLVLVRERTGHTCNTACI 60

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
           +VVI+ WEG+  + +D +Y  L+  L K+G  T RRCA NE RTCACQGLDP+ CGASFS
Sbjct: 61  IVVILVWEGILPSLADRLYNELSETLRKHGALTQRRCAHNEERTCACQGLDPEACGASFS 120

Query: 373 FGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFT 430
           FGCSWSMYYNGCK+ARSK  RKF+L      EE+ IE+    LAT ++PLYK LAP A+ 
Sbjct: 121 FGCSWSMYYNGCKFARSKNPRKFKLLGDDMREEERIEQNFQGLATLLAPLYKTLAPEAYG 180

Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
           NQ + E+ A +CRLG K GRPFSGVTAC DFCAH+HRDLHNM  G TV
Sbjct: 181 NQVEHEQRALDCRLGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTV 228


>gi|449664940|ref|XP_002161163.2| PREDICTED: uncharacterized protein LOC100213294 [Hydra
           magnipapillata]
          Length = 1336

 Score =  311 bits (798), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 143/291 (49%), Positives = 197/291 (67%), Gaps = 4/291 (1%)

Query: 217 DCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTT 276
           +C C  SD    + G +Y HLG+  SL +LR  + +R   +  AL ++ + +T  EGK  
Sbjct: 325 ECGCAVSD--TSDSGPFYNHLGSGYSLNELRNTLLDRFSIQNSALNLQLVKHTSVEGKNG 382

Query: 277 QGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTN 336
            GCPLAKW+IRR S EEK L++V+H +GHTCS+ + V+VIVAWEG+    +D +Y  LT 
Sbjct: 383 DGCPLAKWIIRRTSDEEKYLVVVRHHEGHTCSSTFTVIVIVAWEGISKQYADDMYRYLTK 442

Query: 337 KLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFR 396
            LN+ G  T RRC+ NE +TC CQG   ++ GASFSFGCSWSM+++GCK+ +S   RKF+
Sbjct: 443 TLNESGFRTRRRCSANESKTCLCQGEVEESQGASFSFGCSWSMFFDGCKFTKSTNARKFK 502

Query: 397 LSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
           +    +E+EIE+ +  + T +SPL K  AP  + N   FE  A +CR+G   GRPFSGVT
Sbjct: 503 MQDPVKEEEIEKVLQEMTTQVSPLLKIWAPKCYENMTHFEEIADKCRIGLNKGRPFSGVT 562

Query: 457 ACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLY-IMDD 506
            C DFCAHSHRD+H++NNG T+V +L K  + ++  DEQLHVLPLY ++DD
Sbjct: 563 CCLDFCAHSHRDIHDLNNGTTMVCTLLK-PNYNERTDEQLHVLPLYQLLDD 612


>gi|301610531|ref|XP_002934823.1| PREDICTED: probable methylcytosine dioxygenase TET2 [Xenopus
            (Silurana) tropicalis]
          Length = 1737

 Score =  302 bits (773), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 153/321 (47%), Positives = 209/321 (65%), Gaps = 31/321 (9%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C C     +  + G YYTHLGA  ++  +R+ +EER G KG A+R+E+++YTGKEG
Sbjct: 929  DFPSCSC-VDQIIEKDEGPYYTHLGAGPNVAAIREMMEERFGQKGNAIRIERVVYTGKEG 987

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K+ QGCP+AKWVIRR+  +EK+L +V+ R GH+C TA IV++I+ WEG+  + +D +Y+ 
Sbjct: 988  KSAQGCPIAKWVIRRSGTDEKMLCLVRERAGHSCETAVIVILILVWEGISFSLADRLYSE 1047

Query: 334  LTNKLNKYGLPTTRRCATNE---PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
            LT  LNKYG  T RRCA NE     +   +G+   T G +++F                 
Sbjct: 1048 LTETLNKYGTLTNRRCARNEEVWEESGVLRGISGIT-GRTYTF----------------- 1089

Query: 391  TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
                  L+  S E+++E  +  L+T ++P+YK LAP A+ NQ + E  A +CRLG K GR
Sbjct: 1090 ------LADSSLEEKLEANLQHLSTLMAPIYKKLAPDAYHNQIEHEHRAPDCRLGLKEGR 1143

Query: 451  PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
            PFSGVTAC DFCAHSHRDLHNM NG T+V +LT+  +R   K P DEQLHVLPLY + + 
Sbjct: 1144 PFSGVTACLDFCAHSHRDLHNMQNGSTLVCTLTREDNRENGKIPQDEQLHVLPLYKVSNV 1203

Query: 508  DEFGNKEAQEEKVNTGAIENL 528
            DEFG+ E+QEEK  TGAI+ L
Sbjct: 1204 DEFGSSESQEEKKRTGAIQVL 1224


>gi|348564585|ref|XP_003468085.1| PREDICTED: methylcytosine dioxygenase TET2-like [Cavia porcellus]
          Length = 1937

 Score =  301 bits (770), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 158/341 (46%), Positives = 211/341 (61%), Gaps = 57/341 (16%)

Query: 194  DPNSKEMLDHIERLKNNMRT--EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIE 251
            DP+ K +L+        M+T  E P C+C     +  + G +YTHLGA  ++  +R+ +E
Sbjct: 1108 DPSIKNLLE------TTMKTQYEFPSCRCV-EQIIEKDEGPFYTHLGAGPNVAAIREIME 1160

Query: 252  ERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAW 311
            ER G KGKA+R+EKI+YTGKEGK++QGCP+AKWV RR+S +EKLL +V+ R GHTCS A 
Sbjct: 1161 ERFGQKGKAIRIEKIIYTGKEGKSSQGCPIAKWVFRRSSSKEKLLCLVRERTGHTCSAAV 1220

Query: 312  IVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASF 371
            I+V+I+ W+ +P + +D +Y  L   L+K+G  T RRCA NE                  
Sbjct: 1221 ILVMIMVWDAIPRSLADQLYTELRETLHKHGTLTNRRCALNE------------------ 1262

Query: 372  SFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
                SW                         E+++E  +  LAT I+P+YK LAP A+ N
Sbjct: 1263 --ETSW-------------------------EEKLESHLQNLATLIAPIYKKLAPDAYNN 1295

Query: 432  QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL--- 488
            Q ++E  A +CRLG K GRPFSGVTAC DFCAH+HRDLHNM NG TVV +LT+  +    
Sbjct: 1296 QVEYEHRAPDCRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTVVCTLTREDNRDPD 1355

Query: 489  SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            S P+DEQLHVLPLY + D DEFG+ EAQEEK  +GAI+ L+
Sbjct: 1356 STPEDEQLHVLPLYKISDVDEFGSAEAQEEKKRSGAIQVLS 1396


>gi|441614500|ref|XP_004088220.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
            [Nomascus leucogenys]
          Length = 1989

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 200/323 (61%), Gaps = 10/323 (3%)

Query: 213  TEVPDCKCFASDKLPPE-PGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
            +E+P C C   D++  +  GSYY HL A   +  +R+ +    G KG  +R+E +++TG 
Sbjct: 1267 SELPTCNCL--DRVTQKIKGSYYIHLXAGPGVAAVREIMVNMYGKKGNTIRIETVVHTGN 1324

Query: 272  EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
            EGK++  CP+ KWV+ R+S  EK  L V  R GH C TA IV++I+ W+G     +D +Y
Sbjct: 1325 EGKSSNRCPIIKWVLTRSSDTEKAXL-VXQRTGHYCPTAVIVMLIMVWDGNHFPVADWLY 1383

Query: 332  AILTNKLNKYGL-PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
              LT  L    + PT RRC  +  RTC C G+DP+TCGASFSFGCSWSMY+N CK+ R  
Sbjct: 1384 TELTENLRSXNMHPTNRRCTLHXNRTCTCXGIDPETCGASFSFGCSWSMYFNDCKFGRGP 1443

Query: 391  TVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKP 448
            + R+FR+   S   E+ +E+ +  LAT + P+YK  AP A+ NQ + E  A EC LG K 
Sbjct: 1444 SCRRFRIDSSSLLHEKNLEDNLQSLATQLVPIYKQHAPLAYQNQVEHENVAXECXLGSKD 1503

Query: 449  GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMD 505
               FSG+ AC DF A  HRD+HNMNNG TVV +L +  + S    P D+QLHVL LY + 
Sbjct: 1504 SFSFSGIIACLDFSAQPHRDIHNMNNGSTVVCTLIQEDNFSLSVIPQDKQLHVLILYTLS 1563

Query: 506  DSDEFGNKEAQEEKVNTGAIENL 528
            D+DEFG +E  E K+ +G  E L
Sbjct: 1564 DTDEFGLREGMEAKIKSGTTEVL 1586


>gi|359080787|ref|XP_003588047.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
          Length = 2105

 Score =  290 bits (743), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 147/317 (46%), Positives = 195/317 (61%), Gaps = 35/317 (11%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y+
Sbjct: 1476 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE                       W +            
Sbjct: 1536 QLTESLKSYNGHPTDRRCTLNE----------------------KWVVV--------GTD 1565

Query: 392  VRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            V      +R  E+ +E+ +  LAT ++P+YK  AP A+ NQ   E  A ECRLG K GRP
Sbjct: 1566 VEMMTREIRYREKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIARECRLGKKEGRP 1625

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH HRD+HNMNNG TVV +LT+  + S    P DEQLHVLPLY + D+D
Sbjct: 1626 FSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSDTD 1685

Query: 509  EFGNKEAQEEKVNTGAI 525
            EFG++E  E K+ +GAI
Sbjct: 1686 EFGSREGMEAKIKSGAI 1702


>gi|395820931|ref|XP_003783809.1| PREDICTED: methylcytosine dioxygenase TET1 [Otolemur garnettii]
          Length = 2169

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 202/322 (62%), Gaps = 43/322 (13%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G  YTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1489 SELPTCNCI-DRVIQKDKGPNYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1547

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCPLAKWVIRR+S EEK+L +V+ R GH C  A +VV+I+ WEG+PL  +D +Y 
Sbjct: 1548 GKSSHGCPLAKWVIRRSSKEEKVLCLVRKRIGHRCPAAVMVVLIMVWEGIPLPMADRLYT 1607

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1608 ELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1667

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT + PLY+  AP A+ NQ  FE             
Sbjct: 1668 PRRFRIDPSSPVHEKNLEDNLQGLATVLGPLYQQYAPVAYQNQVHFE------------- 1714

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
                               LH+     ++V +LT+  +R+L   P DEQLHVLPLY + D
Sbjct: 1715 ------------------TLHS-----SLVCTLTREDNRTLGVIPQDEQLHVLPLYKLAD 1751

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG++E  E K+ +GAIE L
Sbjct: 1752 TDEFGSREGMEAKIRSGAIEVL 1773


>gi|380805809|gb|AFE74780.1| methylcytosine dioxygenase TET3, partial [Macaca mulatta]
          Length = 348

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 140/230 (60%), Positives = 173/230 (75%), Gaps = 5/230 (2%)

Query: 304 GHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLD 363
           GH C  A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG D
Sbjct: 1   GHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKD 60

Query: 364 PDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLY 421
           P+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLY
Sbjct: 61  PNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLY 120

Query: 422 KALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVS 481
           K LAP A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +
Sbjct: 121 KRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCT 180

Query: 482 LTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           LTK  +R + K P+DEQLHVLPLY M  +DEFG++E Q  KV +GAI+ L
Sbjct: 181 LTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVL 230


>gi|156389231|ref|XP_001634895.1| predicted protein [Nematostella vectensis]
 gi|156221983|gb|EDO42832.1| predicted protein [Nematostella vectensis]
          Length = 256

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 137/257 (53%), Positives = 181/257 (70%), Gaps = 2/257 (0%)

Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           R G KGKALR+E I+YT KEG+  QGCP+A+WVIRR+  +EK+L++V+ R GH CS A +
Sbjct: 1   RFGIKGKALRIELIIYTNKEGRNAQGCPIARWVIRRSGNDEKVLVLVRKRPGHHCSMALV 60

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
           V  +V WEG+   +   +Y  L+  + +   PT RRC  N+ ++CACQG+  DTCGASFS
Sbjct: 61  VTSVVIWEGISEERGHSLYKELSGLIPENAAPTIRRCGLNDSKSCACQGVGEDTCGASFS 120

Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQ 432
           FGCSW+MY+NGCK+ARSK+ RK++L   S+E+ +E  +  +AT I+P+Y   AP AF NQ
Sbjct: 121 FGCSWNMYFNGCKFARSKSPRKYKLLDSSKEETLERILEGIATEIAPVYSKAAPVAFANQ 180

Query: 433 CQFEREASECRLGFKP-GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKP 491
            + ER   ECR+G    GRPFSGVT C DFCAHSHRD  NM+ G TVV +L K    ++P
Sbjct: 181 TREERNGHECRIGHSAVGRPFSGVTCCMDFCAHSHRDKQNMDGGATVVCTLLKP-GCAQP 239

Query: 492 DDEQLHVLPLYIMDDSD 508
           +DEQLHVLPLY +   D
Sbjct: 240 EDEQLHVLPLYQLLSKD 256


>gi|403274103|ref|XP_003928828.1| PREDICTED: methylcytosine dioxygenase TET1 [Saimiri boliviensis
            boliviensis]
          Length = 2088

 Score =  288 bits (738), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 148/320 (46%), Positives = 195/320 (60%), Gaps = 49/320 (15%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1415 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGEKGNAIRIEIVVYTGKE 1473

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1474 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1533

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQ-CQFEREASECRLGFKP 448
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ C   RE          
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVCTLTRED--------- 1644

Query: 449  GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSD 508
                                    N    VV           P DEQLHVLPLY + D+D
Sbjct: 1645 ------------------------NRSLGVV-----------PQDEQLHVLPLYKLSDTD 1669

Query: 509  EFGNKEAQEEKVNTGAIENL 528
            EFG+KE  E K+ +GAIE L
Sbjct: 1670 EFGSKEGMEAKIKSGAIEVL 1689


>gi|340371755|ref|XP_003384410.1| PREDICTED: methylcytosine dioxygenase TET1-like [Amphimedon
           queenslandica]
          Length = 1077

 Score =  275 bits (702), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 132/282 (46%), Positives = 183/282 (64%), Gaps = 6/282 (2%)

Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
           G +YTHLGA ++   LR+ +E+R    G  LRM +I YTG E KT++GCP A+WV+RR S
Sbjct: 181 GIFYTHLGAGSTPETLRETLEKRFNVTGIELRMLEITYTGIEAKTSEGCPTAEWVVRRKS 240

Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
            EEK L++ +H  GH+C   + VV IV W+ +   ++   Y  L   L + G PT R+C 
Sbjct: 241 KEEKFLVLYRHHIGHSCDEQYTVVSIVYWDALTPERAGYTYNKLVEILPQNGFPTPRKCE 300

Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
            N+ +TC+CQG D    GAS+SFGCSWS+YY+GCK+ +SK  RKF+L V  +E E+E  +
Sbjct: 301 FNDSKTCSCQGDDKTVHGASYSFGCSWSVYYDGCKFGKSKIPRKFKLQVPEKEPELEGNV 360

Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
             LAT ++PLYK LAP A++NQ   +    ECR+G  P +PFSG+T C D+CAHSH D H
Sbjct: 361 DELATYLAPLYKRLAPKAYSNQVATQASGEECRIGLGPEKPFSGMTCCMDYCAHSHYDKH 420

Query: 471 NM-NNGCTVVVSLTKH-----RSLSKPDDEQLHVLPLYIMDD 506
           NM + G TVVV++ K      + + +   EQ+H LPLY + D
Sbjct: 421 NMPDGGATVVVTILKEGVYPDQYVKEDTGEQIHCLPLYRLKD 462


>gi|358419451|ref|XP_003584239.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
          Length = 2131

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 195/318 (61%), Gaps = 11/318 (3%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y+
Sbjct: 1476 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEP-RTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
             LT  L  Y G PT RRC  NE  +         +    ++ +       +NGCK+ RS 
Sbjct: 1536 QLTESLKSYNGHPTDRRCTLNENCKLLVLNNTSENEVQYNYQYNYQNQYVFNGCKFXRSP 1595

Query: 391  TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
            + R+FR+       +       L        +  A  A  ++   E  A ECRLG K GR
Sbjct: 1596 SPRRFRIDPSLPYMKKHSSFPELRKD-----QCEAQQARESEVALEHIARECRLGKKEGR 1650

Query: 451  PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDS 507
            PFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  + S    P DEQLHVLPLY + D+
Sbjct: 1651 PFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSDT 1710

Query: 508  DEFGNKEAQEEKVNTGAI 525
            DEFG++E  E K+ +GAI
Sbjct: 1711 DEFGSREGMEAKIKSGAI 1728


>gi|297301263|ref|XP_002805756.1| PREDICTED: methylcytosine dioxygenase TET1-like [Macaca mulatta]
          Length = 1972

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 117/224 (52%), Positives = 159/224 (70%), Gaps = 4/224 (1%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1413 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1471

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1472 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1531

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1532 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1591

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQC 433
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ 
Sbjct: 1592 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQV 1635


>gi|349604556|gb|AEQ00074.1| Methylcytosine dioxygenase TET1-like protein, partial [Equus
           caballus]
          Length = 375

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 106/222 (47%), Positives = 141/222 (63%), Gaps = 32/222 (14%)

Query: 255 GYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVV 314
           G KG A+R+E ++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV
Sbjct: 154 GQKGNAVRIEIVVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVV 213

Query: 315 VIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
           +I   +G+PL  +D +Y  LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSF
Sbjct: 214 LIWYGDGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSF 273

Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVRS-------------------------------E 402
           GCSWSMY+NGCK+ RS + R+FR+   S                                
Sbjct: 274 GCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHNYYERITKGRNPERRYMKPEPICPGHEAM 333

Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRL 444
           E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRL
Sbjct: 334 EKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRL 375


>gi|26350989|dbj|BAC39131.1| unnamed protein product [Mus musculus]
          Length = 267

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 92/155 (59%), Positives = 115/155 (74%), Gaps = 5/155 (3%)

Query: 379 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 436
           MY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP A+ NQ   E
Sbjct: 1   MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60

Query: 437 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDD 493
             A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + + P+D
Sbjct: 61  DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGQIPED 120

Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           EQLHVLPLY M  +DEFG++E Q  KV++GAI+ L
Sbjct: 121 EQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVL 155


>gi|66396578|gb|AAH96437.1| Tet3 protein [Mus musculus]
          Length = 695

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 91/155 (58%), Positives = 114/155 (73%), Gaps = 5/155 (3%)

Query: 379 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 436
           MY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP A+ NQ   E
Sbjct: 1   MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60

Query: 437 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDD 493
             A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCT V +LTK  +R + + P+D
Sbjct: 61  DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTAVCTLTKEDNRCVGQIPED 120

Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           EQLHVLPLY M  +DEFG++E Q  KV++GAI+ L
Sbjct: 121 EQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVL 155


>gi|441657124|ref|XP_003258249.2| PREDICTED: methylcytosine dioxygenase TET1-like [Nomascus
           leucogenys]
          Length = 583

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/184 (48%), Positives = 114/184 (61%), Gaps = 34/184 (18%)

Query: 379 MYYNGCKYARSKTVRKFRLSVRSE-------------------------------EQEIE 407
           MY+NGCK+ RS + R+FR+   S                                E+ +E
Sbjct: 1   MYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPGHEAMEKNLE 60

Query: 408 EKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHR 467
           + +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K GRPFSGVTAC DFCAH HR
Sbjct: 61  DNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHR 120

Query: 468 DLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
           D+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D+DEFG+KE  E K+ +GA
Sbjct: 121 DIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGA 180

Query: 525 IENL 528
           IE L
Sbjct: 181 IEVL 184


>gi|195998193|ref|XP_002108965.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
 gi|190589741|gb|EDV29763.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
          Length = 687

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 152/290 (52%), Gaps = 40/290 (13%)

Query: 217 DCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTT 276
           +C+C   D  P     +YT LG A S  +LR+ + +R       LR+ ++ YTG E KT+
Sbjct: 179 NCQCQDEDGAP-----FYTQLGVAGSTEELREMLRDRFRIDESKLRVIEVEYTGVESKTS 233

Query: 277 QGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTN 336
            GCP A+W IRR S  EKLL +V  R+GHTC  + +++ IVAW+G+  +++         
Sbjct: 234 DGCPRAEWAIRRISKSEKLLALVHRRRGHTCKASVVLMAIVAWDGIHPDRA--------- 284

Query: 337 KLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFR 396
                          N  + C CQG D    GA+F+ G  +    +G K   + +   ++
Sbjct: 285 ---------------NVLQDCHCQGTDNQREGAAFTLGNMYQTEDDGLKIILNASA--YQ 327

Query: 397 LSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
           L+  ++EQE+ E +  LA  ++P+YK  AP A+ NQ +++       +  +   PFSGV 
Sbjct: 328 LADSAKEQELAEALESLAADLAPVYKKFAPWAYNNQIKYQENCVGKSINEEKNGPFSGVI 387

Query: 457 ACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDE----QLHVLPLY 502
              DFCAH+H +   +++G ++V +L     L  P+DE    QLH+ P+Y
Sbjct: 388 CSLDFCAHNHVNTEGLDDGASMVCTL-----LKDPEDENVKNQLHIYPMY 432


>gi|10047157|dbj|BAB13372.1| KIAA1546 protein [Homo sapiens]
          Length = 684

 Score =  161 bits (408), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 76/130 (58%), Positives = 97/130 (74%), Gaps = 3/130 (2%)

Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
           E+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRPFSGVTAC DFC
Sbjct: 1   EEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFC 60

Query: 463 AHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
           AH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D DEFG+ EAQEEK
Sbjct: 61  AHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEEK 120

Query: 520 VNTGAIENLN 529
             +GAI+ L+
Sbjct: 121 KRSGAIQVLS 130


>gi|432106709|gb|ELK32361.1| Methylcytosine dioxygenase TET1 [Myotis davidii]
          Length = 2018

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 73/129 (56%), Positives = 95/129 (73%), Gaps = 3/129 (2%)

Query: 403  EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
            E+ +E+ +  LAT ++P+Y+  AP A+ NQ QFE  A ECRLG K GRPFSGVTAC DFC
Sbjct: 1515 EKNLEDNLQSLATQLAPIYRQYAPVAYQNQIQFEHIARECRLGNKEGRPFSGVTACVDFC 1574

Query: 463  AHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
             H+HRD+HNMNNG TVV +LT+  + S    P DEQLHVLPLY + D+DEFG++E  E K
Sbjct: 1575 THAHRDIHNMNNGSTVVCTLTREDNRSFGIVPQDEQLHVLPLYKLSDTDEFGSREGMEAK 1634

Query: 520  VNTGAIENL 528
            + +GA++ L
Sbjct: 1635 IRSGAVDVL 1643



 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/72 (48%), Positives = 50/72 (69%), Gaps = 1/72 (1%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  E G YYTHLGA  ++  +R+ +E R G KG A+R+EK++YTGKE
Sbjct: 1444 SELPSCNCL-DRVIQKEKGPYYTHLGAGPNVAAVREIMETRYGQKGSAVRIEKVIYTGKE 1502

Query: 273  GKTTQGCPLAKW 284
             K++ GCP+AKW
Sbjct: 1503 AKSSHGCPVAKW 1514


>gi|355723851|gb|AES08026.1| tet oncoprotein family member 3 [Mustela putorius furo]
          Length = 104

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 67/99 (67%), Positives = 79/99 (79%)

Query: 300 KHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCAC 359
           +HR GH C  A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCAC
Sbjct: 1   RHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCAC 60

Query: 360 QGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS 398
           QG DP TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+
Sbjct: 61  QGKDPSTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLA 99


>gi|241849264|ref|XP_002415674.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
 gi|215509888|gb|EEC19341.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
          Length = 750

 Score =  148 bits (374), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/161 (45%), Positives = 104/161 (64%), Gaps = 12/161 (7%)

Query: 195 PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYT--HLGAAASLPDLRKDIEE 252
           P +    + +ERL++N + E P C C+ +D    E   Y T      A+SLP     + E
Sbjct: 599 PGADPWWERLERLRSNAKAEPPACDCYGAD----ETREYRTPRSPSLASSLPAAM--LNE 652

Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
           R    G ALR+EK+LY+GKEGKT+QGCP+AKW+IRR+   EK+L +++HR GH C +A+I
Sbjct: 653 R----GPALRIEKVLYSGKEGKTSQGCPVAKWIIRRSGPSEKVLAVLRHRPGHRCLSAYI 708

Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNE 353
           V+ IVAWEGV  + +D +Y  +T+K   +G PT RRC TNE
Sbjct: 709 VMAIVAWEGVQADMADDLYRTVTHKTVNFGFPTQRRCGTNE 749


>gi|351712963|gb|EHB15882.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
          Length = 561

 Score =  138 bits (348), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 67/129 (51%), Positives = 90/129 (69%), Gaps = 3/129 (2%)

Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
           E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G PFSGVTAC DF 
Sbjct: 195 EKNLEDNLQNLATELAPIYKQYAPAAYQNQVEYEHVAQECRLGAKEGHPFSGVTACLDFS 254

Query: 463 AHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
           AH H D+HNMN+  TVV +L +  +RSL   P+D+ LHVL LY + D DEFG+KE  E K
Sbjct: 255 AHLHWDIHNMNHRNTVVSTLAREDNRSLGVVPEDKHLHVLLLYRLSDKDEFGSKEGMEAK 314

Query: 520 VNTGAIENL 528
           + +GA++ L
Sbjct: 315 IQSGAVQVL 323


>gi|344237690|gb|EGV93793.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
          Length = 337

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 59/105 (56%), Positives = 77/105 (73%), Gaps = 2/105 (1%)

Query: 424 LAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT 483
           +AP A+ NQ ++E  A++CRLG K GRPFSGVT C DFCAHSH+D HNM NG TVV++L 
Sbjct: 1   MAPVAYQNQVKYEDVAADCRLGTKKGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLL 60

Query: 484 KH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
           +   R  +   DEQ HVLPL+ + D+DEFG++E  E K+ +GAIE
Sbjct: 61  RKDARDRNNLQDEQFHVLPLHRLADTDEFGSREGMEAKIRSGAIE 105


>gi|355723854|gb|AES08027.1| tet oncoprotein family member 3 [Mustela putorius furo]
          Length = 91

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 58/89 (65%), Positives = 71/89 (79%), Gaps = 3/89 (3%)

Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVL 499
           RLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +R + K P+DEQLHVL
Sbjct: 1   RLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVL 60

Query: 500 PLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           PLY M ++DEFG++E Q  KV +GAI+ L
Sbjct: 61  PLYKMANTDEFGSEENQNAKVGSGAIQVL 89


>gi|344237691|gb|EGV93794.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
          Length = 1466

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 88/141 (62%), Gaps = 5/141 (3%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            E P C C    K   + G YYTHLGA  S+  +R+ +E R   KGKA+R+EKI Y GKE 
Sbjct: 1329 EGPPCDC----KDQTDKGPYYTHLGAGPSVAAIRELMETRYCEKGKAIRIEKIEYMGKES 1384

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ +    +D +Y  
Sbjct: 1385 KSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPPLADHLYDE 1444

Query: 334  LTNKLNKY-GLPTTRRCATNE 353
            +T+ L  Y G PT RRC  NE
Sbjct: 1445 ITDNLRSYSGHPTDRRCTFNE 1465


>gi|332025525|gb|EGI65688.1| hypothetical protein G5I_05788 [Acromyrmex echinatior]
          Length = 1048

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/131 (43%), Positives = 80/131 (61%), Gaps = 26/131 (19%)

Query: 126  KSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPKEPNNNE 185
            + Y F G+GGP  +    G WCCR GGTE PT EHL+DG CQG++T+DE+L+     ++ 
Sbjct: 927  RDYKFRGDGGPAKVSPGTGSWCCRRGGTEQPTPEHLRDGCCQGLQTKDEILD-----DSM 981

Query: 186  EPATVKAEDPNS----------KEMLDHIERLKNNMRTEVPDCKCFASDK------LPPE 229
            E A +K E P+S           ++ DH+++LKNN+RTEVPDC CF +DK      +P  
Sbjct: 982  EKAELKNEGPHSPHTPTTTTVTTKLQDHLDKLKNNVRTEVPDCNCFPADKCELQLRIP-- 1039

Query: 230  PGSYYTHLGAA 240
               YYT++  A
Sbjct: 1040 ---YYTNIEKA 1047


>gi|307213412|gb|EFN88848.1| hypothetical protein EAI_08435 [Harpegnathos saltator]
          Length = 685

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 53/115 (46%), Positives = 69/115 (60%), Gaps = 7/115 (6%)

Query: 118 EEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML- 176
           EE S      Y F G+GGP  +  + G WCCR GGTE PT EHL+DG CQG++T+DEML 
Sbjct: 554 EEESQKIVPDYKFRGDGGPAKVSPATGSWCCRRGGTEQPTPEHLRDGCCQGLQTKDEMLA 613

Query: 177 ------EPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDK 225
                 E K       P T       + ++ DH+++LKNN+RTEVP+C CF +DK
Sbjct: 614 DSPQRDELKSEGGPHSPRTPSTATTTTTKLQDHLDKLKNNVRTEVPNCNCFPADK 668


>gi|355723824|gb|AES08017.1| tet oncoprotein 1 [Mustela putorius furo]
          Length = 70

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 45/70 (64%), Positives = 53/70 (75%), Gaps = 1/70 (1%)

Query: 320 EGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWS 378
           +G+PL  +D +Y  LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWS
Sbjct: 1   DGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWS 60

Query: 379 MYYNGCKYAR 388
           MY+NGCK+ R
Sbjct: 61  MYFNGCKFGR 70


>gi|68161848|emb|CAD28467.3| hypothetical protein [Homo sapiens]
          Length = 414

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/73 (64%), Positives = 56/73 (76%), Gaps = 3/73 (4%)

Query: 459 FDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEA 515
            DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D+DEFG+KE 
Sbjct: 1   LDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEG 60

Query: 516 QEEKVNTGAIENL 528
            E K+ +GAIE L
Sbjct: 61  MEAKIKSGAIEVL 73


>gi|355723821|gb|AES08016.1| tet oncoprotein 1 [Mustela putorius furo]
          Length = 218

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 55/77 (71%), Gaps = 1/77 (1%)

Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
           +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 143 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 201

Query: 273 GKTTQGCPLAKWVIRRA 289
           GK++QGCP+AKWV+RR 
Sbjct: 202 GKSSQGCPIAKWVLRRG 218


>gi|21410433|gb|AAH31159.1| Tet2 protein [Mus musculus]
 gi|26251882|gb|AAH40785.1| Tet2 protein [Mus musculus]
 gi|148680233|gb|EDL12180.1| mCG123956 [Mus musculus]
          Length = 612

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/60 (53%), Positives = 44/60 (73%), Gaps = 3/60 (5%)

Query: 472 MNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           M NG TVVV+L +  +R + +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE L
Sbjct: 1   MPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEVL 60


>gi|351715495|gb|EHB18414.1| Transmembrane protease, serine 11A [Heterocephalus glaber]
          Length = 588

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 41/62 (66%), Gaps = 1/62 (1%)

Query: 468 DLHNMNN-GCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
           DLH ++N G  +   +   + +S+  DEQLH+LPLY + D DEFG+KE  E K+ +GA++
Sbjct: 128 DLHIISNSGQKITCQIKDLQEMSENLDEQLHILPLYRLSDKDEFGSKEGMEAKIQSGAVQ 187

Query: 527 NL 528
            L
Sbjct: 188 VL 189


>gi|440895445|gb|ELR47628.1| hypothetical protein M91_16421, partial [Bos grunniens mutus]
          Length = 614

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/39 (69%), Positives = 31/39 (79%)

Query: 490 KPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           KP+DEQLHVLPLY + D DEFG+ EAQEEK   GAI+ L
Sbjct: 15  KPEDEQLHVLPLYKVSDVDEFGSVEAQEEKKRNGAIQVL 53


>gi|149025993|gb|EDL82236.1| similar to KIAA1546 protein (predicted), isoform CRA_a [Rattus
           norvegicus]
          Length = 644

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 40/57 (70%), Gaps = 3/57 (5%)

Query: 476 CTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            T+VV+LT+  +     +P+DEQLHVLPLY +   DEFG+ E QEEK+  G+I+ L+
Sbjct: 43  VTLVVTLTREDNREVGGQPEDEQLHVLPLYTIATEDEFGSTEGQEEKILQGSIQVLH 99


>gi|37360442|dbj|BAC98199.1| mKIAA1546 protein [Mus musculus]
          Length = 614

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/40 (57%), Positives = 31/40 (77%)

Query: 489 SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE L
Sbjct: 23  AKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEVL 62


>gi|335309464|ref|XP_003361647.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Sus
           scrofa]
          Length = 453

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/38 (60%), Positives = 30/38 (78%)

Query: 491 PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
           P DEQLHVLPLY + D+DEFG++E  E K+ +GAI+ L
Sbjct: 16  PQDEQLHVLPLYKLSDTDEFGSREGIEAKIKSGAIKVL 53


>gi|193227751|emb|CAQ60121.1| hypothetical protein [Homo sapiens]
          Length = 96

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 16/103 (15%)

Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM---------HLLATTISPL 420
           SFSFGCSWSMY+NGCK+ RS + R+FR+   S      E++         ++    ISP 
Sbjct: 1   SFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPG 60

Query: 421 YKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCA 463
           ++A+        C+ E       LG     P +    CF  CA
Sbjct: 61  HEAM------EDCEAENVWEMGGLGILTSVPITPRVVCF-LCA 96


>gi|195192277|ref|XP_002029595.1| GL24721 [Drosophila persimilis]
 gi|194104035|gb|EDW26078.1| GL24721 [Drosophila persimilis]
          Length = 209

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 6/55 (10%)

Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--DEMLEPKE 180
           Y + G+G P +     G  CCR GGT PPT+EHLKDG C G+  Q  +E+L+  E
Sbjct: 107 YTYLGDGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDE 157


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.134    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,425,253,563
Number of Sequences: 23463169
Number of extensions: 427106682
Number of successful extensions: 858874
Number of sequences better than 100.0: 236
Number of HSP's better than 100.0 without gapping: 229
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 858134
Number of HSP's gapped (non-prelim): 307
length of query: 529
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 382
effective length of database: 8,910,109,524
effective search space: 3403661838168
effective search space used: 3403661838168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)