BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6128
(529 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|345487243|ref|XP_001599461.2| PREDICTED: hypothetical protein LOC100114438 [Nasonia vitripennis]
Length = 2706
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/424 (71%), Positives = 358/424 (84%), Gaps = 6/424 (1%)
Query: 111 FGVDPRMEEHS-DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGM 169
F D ++E S D + +Y F G+GGP + G WCCR GGTE PT EHL++G CQG+
Sbjct: 1409 FSRDIKLENESRDQEESNYKFRGDGGPAKMSPGVGSWCCRRGGTEQPTPEHLREGCCQGL 1468
Query: 170 RTQDEMLE---PKEPNNNEEPATVKAEDPNSK--EMLDHIERLKNNMRTEVPDCKCFASD 224
+T+DE E K NE+ A KA + ++ ++ +H+E+LKNN+RTEVPDC CF++D
Sbjct: 1469 QTRDEFSEDSPQKSEVKNEDSAGGKASNGSTTGTKLQEHLEKLKNNVRTEVPDCDCFSAD 1528
Query: 225 KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
K PPEPGSYY+HLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW
Sbjct: 1529 KCPPEPGSYYSHLGAAASLPDLRNDLERRTGLKGHAIRFEKVVYTGKEGKTTQGCPMAKW 1588
Query: 285 VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
VIRR+ +EEK+L IVKHRQGH C+TAWIVV +VAWEGVP +++D +Y++L++KLN++GLP
Sbjct: 1589 VIRRSGIEEKILTIVKHRQGHKCATAWIVVAMVAWEGVPNHEADRIYSLLSHKLNRFGLP 1648
Query: 345 TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 404
TTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ
Sbjct: 1649 TTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 1708
Query: 405 EIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAH 464
E+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAH
Sbjct: 1709 EVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAH 1768
Query: 465 SHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
+HRDLHNMNNGCTVVVSLTKHRS SKPDDEQLHVLPLYIMDDSDEFG+KE QE K+ +GA
Sbjct: 1769 AHRDLHNMNNGCTVVVSLTKHRSFSKPDDEQLHVLPLYIMDDSDEFGSKEGQEAKIKSGA 1828
Query: 525 IENL 528
IE L
Sbjct: 1829 IEVL 1832
>gi|307188349|gb|EFN73124.1| Protein TET2 [Camponotus floridanus]
Length = 1632
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 297/412 (72%), Positives = 350/412 (84%), Gaps = 16/412 (3%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPKEPNNNEEP 187
Y F G+GGP + G WCCR GGT+ P+ EHLKDG CQG++T+DEML ++ +
Sbjct: 340 YKFRGDGGPAKVSPETGSWCCRRGGTKQPSPEHLKDGCCQGLQTKDEMLA-----DSPQA 394
Query: 188 ATVKAEDPNS-----------KEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTH 236
A +K E P+S ++ DH+++LKNN+RTEVPDC CF +DK PPEPGSYYTH
Sbjct: 395 AELKNEGPHSPRTPASAATTTTKLQDHLDKLKNNVRTEVPDCNCFPADKCPPEPGSYYTH 454
Query: 237 LGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLL 296
LGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW+IRR+ ++EK+L
Sbjct: 455 LGAAASLPDLRNDLERRTGLKGDAIRFEKVIYTGKEGKTTQGCPMAKWIIRRSGMDEKIL 514
Query: 297 LIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRT 356
IVKHRQGH C+TAWIVV +VAWEGVP +++D +Y++LT+KLN++GLPTTRRC TNEPRT
Sbjct: 515 TIVKHRQGHKCATAWIVVAMVAWEGVPTHEADRIYSLLTHKLNRFGLPTTRRCGTNEPRT 574
Query: 357 CACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATT 416
CACQGLDPD CGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT
Sbjct: 575 CACQGLDPDNCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATL 634
Query: 417 ISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGC 476
+SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGC
Sbjct: 635 LSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGC 694
Query: 477 TVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
TVVVSLTKHR+LSKP+DEQLHVLPLYIMDD+DEFG+KE QE+KV +GA+E L
Sbjct: 695 TVVVSLTKHRALSKPEDEQLHVLPLYIMDDTDEFGSKEGQEKKVRSGALEIL 746
>gi|383857295|ref|XP_003704140.1| PREDICTED: uncharacterized protein LOC100883443 [Megachile
rotundata]
Length = 1646
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 297/418 (71%), Positives = 352/418 (84%), Gaps = 8/418 (1%)
Query: 114 DPRMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQD 173
D R +E S+ Y F G+GGP + G WCCR GGTE PT EHL+DG CQG++T+D
Sbjct: 338 DQRSQESSN-----YKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLRDGCCQGLQTRD 392
Query: 174 EMLEP---KEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
E+L K NE P + ++ + ++ DH+++LKNN+RTEVPDC CF +DK PPEP
Sbjct: 393 EILADSADKSDVKNEGPQSPRSAASTTTKLQDHLDKLKNNVRTEVPDCNCFPADKCPPEP 452
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
GSYYTHLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+
Sbjct: 453 GSYYTHLGAAASLPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSG 512
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC
Sbjct: 513 LEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCG 572
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+M
Sbjct: 573 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERM 632
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
H+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLH
Sbjct: 633 HVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLH 692
Query: 471 NMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
NMNNGCTVVV+LTKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV G++E L
Sbjct: 693 NMNNGCTVVVTLTKHRNLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRGGSVEVL 750
>gi|380029496|ref|XP_003698406.1| PREDICTED: uncharacterized protein LOC100866593 [Apis florea]
Length = 1865
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 292/406 (71%), Positives = 349/406 (85%), Gaps = 4/406 (0%)
Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEP---KEPNN 183
+Y F G+GGP + G WCCR GGTE PT EHL++G CQG++T+DE+L K
Sbjct: 556 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADAADKSDVK 615
Query: 184 NEEPATVKAED-PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
NE P + ++ P++ ++ DH+E+LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 616 NEGPQSPRSGGAPSTTKLQDHLEKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 675
Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 676 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 735
Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 736 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 795
Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
DP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY
Sbjct: 796 DPETCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 855
Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
+LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 856 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 915
Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
TKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV GA+E L
Sbjct: 916 TKHRTLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRAGAVEVL 961
>gi|328780619|ref|XP_396330.4| PREDICTED: hypothetical protein LOC412878 [Apis mellifera]
Length = 1695
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 293/414 (70%), Positives = 346/414 (83%), Gaps = 20/414 (4%)
Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLE--------- 177
+Y F G+GGP + G WCCR GGTE PT EHL++G CQG++T+DE+L
Sbjct: 386 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADGTDKSDVK 445
Query: 178 ---PKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYY 234
P+ P N P+T K + DH+E+LKNN+R+EVPDC CF +DK PPEPGSYY
Sbjct: 446 NEGPQSPRNGGAPSTTK--------LQDHLEKLKNNVRSEVPDCNCFPADKCPPEPGSYY 497
Query: 235 THLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEK 294
THLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK
Sbjct: 498 THLGAAASLPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEK 557
Query: 295 LLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEP 354
+L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEP
Sbjct: 558 ILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEP 617
Query: 355 RTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLA 414
RTCACQGLDP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LA
Sbjct: 618 RTCACQGLDPETCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLA 677
Query: 415 TTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNN 474
T +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNN
Sbjct: 678 TLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNN 737
Query: 475 GCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
GCTVVV++TKHR+LSKP+DEQLHVLPLYIMD +DE+G+KE Q+EKV GA+E L
Sbjct: 738 GCTVVVTMTKHRTLSKPEDEQLHVLPLYIMDTTDEYGSKEGQDEKVRAGAVEVL 791
>gi|340722271|ref|XP_003399531.1| PREDICTED: hypothetical protein LOC100642293 [Bombus terrestris]
Length = 1697
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 290/406 (71%), Positives = 345/406 (84%), Gaps = 4/406 (0%)
Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML----EPKEPN 182
+Y F G+GGP + G WCCR GGTE PT EHL++G CQG++T+DE+L E +
Sbjct: 391 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADSAEKSDVK 450
Query: 183 NNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
N + P++ ++ DH+++LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 451 NEGTQSPRTGSVPSTTKLQDHLDKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 510
Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 511 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 570
Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 571 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 630
Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY
Sbjct: 631 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 690
Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
+LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 691 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 750
Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
TKHRSLSKP++EQLHVLPLYIMD +DE G+KE Q+EKV GA+E L
Sbjct: 751 TKHRSLSKPEEEQLHVLPLYIMDTTDENGSKEGQDEKVRAGAVEVL 796
>gi|350416717|ref|XP_003491069.1| PREDICTED: hypothetical protein LOC100741227 [Bombus impatiens]
Length = 1697
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 289/406 (71%), Positives = 345/406 (84%), Gaps = 4/406 (0%)
Query: 127 SYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML----EPKEPN 182
+Y F G+GGP + G WCCR GGTE PT EHL++G CQG++T+DE+L + +
Sbjct: 391 NYKFRGDGGPAKVSPGVGSWCCRRGGTEQPTPEHLREGCCQGLQTRDEILADSADKSDVK 450
Query: 183 NNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAAS 242
N + P++ ++ DH+++LKNN+R+EVPDC CF +DK PPEPGSYYTHLGAAAS
Sbjct: 451 NEGTQSPRTGSVPSTTKLQDHLDKLKNNVRSEVPDCNCFPADKCPPEPGSYYTHLGAAAS 510
Query: 243 LPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHR 302
LPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHR
Sbjct: 511 LPDLRNDLERRTGLKGNAIRFEKVIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHR 570
Query: 303 QGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGL 362
QGH C TAWIVV +VAWEGVP +++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGL
Sbjct: 571 QGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGL 630
Query: 363 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYK 422
DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY
Sbjct: 631 DPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYL 690
Query: 423 ALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSL 482
+LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTVVV++
Sbjct: 691 SLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTM 750
Query: 483 TKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
TKHRSLSKP++EQLHVLPLYIMD +DE G+KE Q+EKV GA+E L
Sbjct: 751 TKHRSLSKPEEEQLHVLPLYIMDTTDENGSKEGQDEKVRAGAVEVL 796
>gi|242005152|ref|XP_002423436.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
gi|212506514|gb|EEB10698.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
Length = 1861
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 294/425 (69%), Positives = 344/425 (80%), Gaps = 11/425 (2%)
Query: 116 RMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEM 175
R EE++ + Y++ GEGGP SL + GPWCCR GG EPPT +H K G C+G +T DE
Sbjct: 833 RGEENNKLIAEDYLYQGEGGPISLNSTKGPWCCRMGGIEPPTDDHAKIGNCKGHKTADEF 892
Query: 176 ---LEPKEPNNNEEPATVKAEDPNSKEML--------DHIERLKNNMRTEVPDCKCFASD 224
++ + N E VK N+ L +++ERLKNN++T VP CKCF D
Sbjct: 893 SSDVKKLDVENLNEKLRVKKFYENTNLKLSPQENFQEENMERLKNNIKTNVPHCKCFPPD 952
Query: 225 KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
K PPEPGSYYTHLGAAASL DLRKD+E R+G GKALR EKI YTGKEGKTT+GCPLAKW
Sbjct: 953 KSPPEPGSYYTHLGAAASLSDLRKDLESRTGQTGKALRFEKICYTGKEGKTTRGCPLAKW 1012
Query: 285 VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
VIRR+ L+EK+L+IVKHR GHTCSTAWIVV +VAW+GVP ++D +YA+LT+KLNK+GLP
Sbjct: 1013 VIRRSGLDEKVLIIVKHRPGHTCSTAWIVVCLVAWDGVPTPEADRIYALLTHKLNKFGLP 1072
Query: 345 TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 404
T RRCATNE RTCACQGLDP+TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ
Sbjct: 1073 TIRRCATNETRTCACQGLDPNTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ 1132
Query: 405 EIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAH 464
++EE+MH+LAT +SPLY LAP A++NQ FEREA+ECRLGFKPGRPFSGVTAC DFCAH
Sbjct: 1133 DVEERMHVLATLLSPLYNTLAPEAYSNQTSFEREAAECRLGFKPGRPFSGVTACIDFCAH 1192
Query: 465 SHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
+HRDLHNMNNGCTVV +LTKHR+LSKP+DEQLHVLP YI+DD+DEFG K +QEEK G+
Sbjct: 1193 AHRDLHNMNNGCTVVFTLTKHRTLSKPEDEQLHVLPHYILDDTDEFGCKASQEEKYKNGS 1252
Query: 525 IENLN 529
IE LN
Sbjct: 1253 IECLN 1257
>gi|328712256|ref|XP_001947546.2| PREDICTED: hypothetical protein LOC100159694 [Acyrthosiphon pisum]
Length = 2023
Score = 585 bits (1509), Expect = e-164, Method: Compositional matrix adjust.
Identities = 263/424 (62%), Positives = 339/424 (79%), Gaps = 16/424 (3%)
Query: 108 KVPFGVDPRMEEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQ 167
++PFG+DPR+ + K YVF G GGP + AGPWCCR GGT+ PT++HL DG C
Sbjct: 1160 RLPFGIDPRVTQ-----KNGYVFCGNGGPNPIDVVAGPWCCRMGGTDTPTTKHLSDGCCH 1214
Query: 168 GMRTQDEMLEPKEPNNNEEPATVKAEDP-NSKEMLDHIE-RLKNNMRTEVPDCKCFASDK 225
G++T DE ++P E +K E+ N+ + D+++ + K N++ +PDC CF +D+
Sbjct: 1215 GLKTLDEGIDPVE---------MKQENGLNNSQCSDNLDDKQKTNIKATIPDCNCFPTDQ 1265
Query: 226 LPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWV 285
PPEPG +YTHLG+A SL +LR ++E SG +G +RMEK+LYTGKEGKTTQGCPLAKWV
Sbjct: 1266 APPEPGPFYTHLGSAYSLIELRTNMENMSGIRGNGIRMEKVLYTGKEGKTTQGCPLAKWV 1325
Query: 286 IRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPT 345
IRR+S +EKLL++VK+R+GH C +WIV+ IV+WEG+ +++D +Y +L++KLNKYG+PT
Sbjct: 1326 IRRSSTDEKLLVVVKNRRGHKCQHSWIVICIVSWEGILSDEADFLYTMLSHKLNKYGVPT 1385
Query: 346 TRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE 405
TRRC TN+PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK VRKFRLSVR+EEQE
Sbjct: 1386 TRRCGTNDPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKDVRKFRLSVRTEEQE 1445
Query: 406 IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHS 465
+EE++H+LAT +SPLYK+LAP ++ NQ Q ERE S+CRLG KPGRPF+ VTAC DFCAH+
Sbjct: 1446 LEERLHVLATNLSPLYKSLAPRSYNNQIQCEREGSDCRLGLKPGRPFASVTACIDFCAHA 1505
Query: 466 HRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAI 525
HRD HNM+NGCTVVV+L KHR KPDDEQLHVLPLY++D+SDEFG+K+AQ +K G++
Sbjct: 1506 HRDFHNMHNGCTVVVTLNKHRGFQKPDDEQLHVLPLYVVDESDEFGDKQAQSDKFKNGSV 1565
Query: 526 ENLN 529
E L+
Sbjct: 1566 EMLS 1569
>gi|357624916|gb|EHJ75511.1| hypothetical protein KGM_05166 [Danaus plexippus]
Length = 2066
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 270/452 (59%), Positives = 325/452 (71%), Gaps = 60/452 (13%)
Query: 106 DLKVPFGVDPRMEEHSDSGKK-------------SYVFAGEGGPCSLVDSAGPWCCRGGG 152
DLK P+ ++ EHS G K Y+FAGEGGP + + G CCR G
Sbjct: 902 DLKPPYYIEQIKSEHSPPGHKIYKNLLYGPPRSEPYMFAGEGGPNAFRNEIGYACCRQGS 961
Query: 153 TEPPTSEHLKDGLCQGMRTQDEMLE--------------PKEPNNNEEPATVKAEDPN-S 197
+ P EHL+DG C G++T+DE+LE P P ++ P T K N S
Sbjct: 962 VKKPPPEHLRDGACAGLQTKDEILEEDPDSTDNSKTPSKPGTPISDLFPKTTKENQFNYS 1021
Query: 198 KEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYK 257
KE LD++ERLKNN RTEVPDC CF +DK PPEPGSYYTHL
Sbjct: 1022 KEYLDNLERLKNNSRTEVPDCNCFPADKNPPEPGSYYTHL-------------------- 1061
Query: 258 GKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIV 317
GKEGKT QGCP+AKW+IRR+S EK+L +VK R GH CST+WIVV +V
Sbjct: 1062 ------------GKEGKTAQGCPMAKWIIRRSSYTEKVLAVVKFRNGHKCSTSWIVVCLV 1109
Query: 318 AWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSW 377
AWEG+P +++D Y +L++KLN+YGLPTTRRCATNE RTCACQGLDP+TCGAS+SFGCSW
Sbjct: 1110 AWEGIPQSEADLDYTLLSHKLNRYGLPTTRRCATNENRTCACQGLDPETCGASYSFGCSW 1169
Query: 378 SMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFER 437
SMYYNGCKYARSKTVRKFRLSV++EE EIEE+MH+LAT +SPLY LAP +F NQCQFE+
Sbjct: 1170 SMYYNGCKYARSKTVRKFRLSVKTEESEIEERMHVLATLLSPLYMNLAPKSFENQCQFEK 1229
Query: 438 EASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLH 497
EAS+CRLGFKPGRPFSGVTAC DFCAH+HRDLHNMNNGCT VV+L KHR+L+KP+DEQLH
Sbjct: 1230 EASDCRLGFKPGRPFSGVTACIDFCAHAHRDLHNMNNGCTAVVTLAKHRALTKPNDEQLH 1289
Query: 498 VLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
VLPLY++D +DEFG+KE QEEK+ +GA+E L+
Sbjct: 1290 VLPLYVLDTTDEFGSKEGQEEKIASGALEILD 1321
>gi|307213413|gb|EFN88849.1| Protein TET2 [Harpegnathos saltator]
Length = 1214
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 246/303 (81%), Positives = 281/303 (92%)
Query: 227 PPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVI 286
PPEPGSYYTHLGAAASLPDLR D+E R+G KG A+R EK++YTGKEGKTTQGCP+AKW+I
Sbjct: 8 PPEPGSYYTHLGAAASLPDLRNDLERRTGLKGDAIRFEKVIYTGKEGKTTQGCPMAKWII 67
Query: 287 RRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTT 346
RR+ ++EK+L IVKHRQGH C TAWIVV +VAWEGVP +++D +Y++L +KLN++GLPTT
Sbjct: 68 RRSGIDEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTHEADRIYSLLCHKLNRFGLPTT 127
Query: 347 RRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEI 406
RRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE+
Sbjct: 128 RRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEV 187
Query: 407 EEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSH 466
EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLGFKPGRPFSGVTAC DFCAHSH
Sbjct: 188 EERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLGFKPGRPFSGVTACIDFCAHSH 247
Query: 467 RDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
RDLHNMNNGCTVVVSLTKHR+LSKP+DEQLHVLPLYIMDD+DEFG+KE QE+K+ +GA+E
Sbjct: 248 RDLHNMNNGCTVVVSLTKHRTLSKPEDEQLHVLPLYIMDDTDEFGSKEGQEKKIRSGAVE 307
Query: 527 NLN 529
NL+
Sbjct: 308 NLS 310
>gi|195020981|ref|XP_001985305.1| GH14579 [Drosophila grimshawi]
gi|193898787|gb|EDV97653.1| GH14579 [Drosophila grimshawi]
Length = 2971
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 249/421 (59%), Positives = 308/421 (73%), Gaps = 25/421 (5%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRT--------QDEMLEP- 178
Y + GEG P + G CCR GGT PPT+EHLKDG C G+ +DE+ E
Sbjct: 1613 YPYLGEGKPIN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGITPKEELLDEDELAEAH 1668
Query: 179 -------KEPNNNEEP-ATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
K+ +E P VK E N M D +RL+ +TE+P+C+CF SDK PPEP
Sbjct: 1669 NGIKAKSKKQKQDEIPEIVVKHEKIN--PMFDTTDRLEKGNKTEIPECECFQSDKNPPEP 1726
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
G+YYTHLG A+SL +LR++ E+R G+ LR+EKI+YTGKEGKTTQGCP+AKWVIRRA
Sbjct: 1727 GTYYTHLGTASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTTQGCPVAKWVIRRAD 1786
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
EEK+L++VK R GH C A+IVV +VAW+GVP ++D Y L KLNKYGLPTTRRCA
Sbjct: 1787 PEEKILVVVKKRPGHRCIAAYIVVCMVAWDGVPRLEADNAYKNLIPKLNKYGLPTTRRCA 1846
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
TNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M
Sbjct: 1847 TNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHM 1906
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
+L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLH
Sbjct: 1907 NLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLH 1966
Query: 471 NMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
NM +GCTV V+L K +R PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L
Sbjct: 1967 NMQDGCTVHVALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQML 2026
Query: 529 N 529
+
Sbjct: 2027 D 2027
>gi|195429100|ref|XP_002062602.1| GK17629 [Drosophila willistoni]
gi|194158687|gb|EDW73588.1| GK17629 [Drosophila willistoni]
Length = 2132
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 240/403 (59%), Positives = 302/403 (74%), Gaps = 20/403 (4%)
Query: 147 CCRGGGTEPPTSEHLKDGLCQGMRTQ--DEMLEP---KEPNNN-------------EEPA 188
CCR GGT PPT+EHLKDG C G+ Q +E+L+ +P+NN +E
Sbjct: 884 CCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDDLADPHNNSSVKSGKSKKHKQDEIP 943
Query: 189 TVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRK 248
+ + M D +RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL +LR+
Sbjct: 944 EIIVKHEKINPMFDTTDRLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMELRR 1003
Query: 249 DIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCS 308
+ EER G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C
Sbjct: 1004 EFEERCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCI 1063
Query: 309 TAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCG 368
A+IVV +VAW+G+P ++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T G
Sbjct: 1064 AAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSG 1123
Query: 369 ASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGA 428
AS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P +
Sbjct: 1124 ASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEETAIEDHMNLIATLLAPVFKQVCPRS 1183
Query: 429 FTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HR 486
+ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R
Sbjct: 1184 YDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPNNR 1243
Query: 487 SLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L+
Sbjct: 1244 DSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 1286
>gi|195377914|ref|XP_002047732.1| GJ11762 [Drosophila virilis]
gi|194154890|gb|EDW70074.1| GJ11762 [Drosophila virilis]
Length = 2228
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 246/421 (58%), Positives = 309/421 (73%), Gaps = 25/421 (5%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--------DEMLEP- 178
Y + GEG P + +G CCR GGT PPT+EHLKDG C G+ Q DE+ E
Sbjct: 876 YAYLGEGKPLN----SGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDELAEAH 931
Query: 179 -------KEPNNNEEP-ATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEP 230
K+ +E P VK E N M D +RL+ +TE+P+C+CF SDK PPEP
Sbjct: 932 NGVKAKSKKQKQDEIPEIIVKHEKIN--PMFDTTDRLEKGNKTEIPECECFQSDKNPPEP 989
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
G+YYTHLG A++L +LR++ E+R G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA
Sbjct: 990 GTYYTHLGTASTLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRAD 1049
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
EEK+L++VK R GH C A+IVV +VAW+G+P ++D Y L KLNKYGLPTTRRCA
Sbjct: 1050 PEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCA 1109
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
TNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M
Sbjct: 1110 TNENRTCACQGLDPESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHM 1169
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
+L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLH
Sbjct: 1170 NLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLH 1229
Query: 471 NMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
NM +GCTV V+L K +R PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L
Sbjct: 1230 NMQDGCTVHVALLKPTNRDSRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQML 1289
Query: 529 N 529
+
Sbjct: 1290 D 1290
>gi|194865212|ref|XP_001971317.1| GG14498 [Drosophila erecta]
gi|190653100|gb|EDV50343.1| GG14498 [Drosophila erecta]
Length = 2186
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 252/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 836 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 891
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 892 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 948
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 949 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1008
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1009 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1068
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1069 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1128
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IEE M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1129 YARSKTVRKFRLSVKSEEAAIEEHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1188
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1189 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1248
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1249 MDGTDEFESVEGQRDKHRTGAVQMLD 1274
>gi|195492879|ref|XP_002094180.1| GE21689 [Drosophila yakuba]
gi|194180281|gb|EDW93892.1| GE21689 [Drosophila yakuba]
Length = 2053
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 252/446 (56%), Positives = 319/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKK-SYVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G SY + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 711 LEPKIEDMGMLGHGGSYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 766
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 767 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 823
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 824 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 883
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 884 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 943
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 944 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1003
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1004 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1063
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1064 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1123
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1124 MDGTDEFESVEGQRDKHRTGAVQMLD 1149
>gi|194749274|ref|XP_001957064.1| GF10236 [Drosophila ananassae]
gi|190624346|gb|EDV39870.1| GF10236 [Drosophila ananassae]
Length = 2255
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 245/443 (55%), Positives = 315/443 (71%), Gaps = 27/443 (6%)
Query: 113 VDPRMEEHSDSGKKS-YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRT 171
++P++E+ G Y + G G +++ G CCR GGT PPT+EHLKDG C G+
Sbjct: 901 LEPKLEDMGMLGHGGGYTYLGGAGEGKGLNN-GFSCCRQGGTRPPTAEHLKDGTCLGLGI 959
Query: 172 Q--DEMLEPKE----------PNNNEEPATVKAEDPNS-----------KEMLDHIERLK 208
Q +E+L+ E P + P+ + D +RL+
Sbjct: 960 QPKEELLDEDELIDSHGNGLKPGGGAAGKAKGKQKPDEIPEIVVKHEKINPLFDTTDRLE 1019
Query: 209 NNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
+TE+P+C+CF SDK PPEPG+YYTHLG A+SL +LR++ EER G+ LR+EKI+Y
Sbjct: 1020 KGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMELRREFEERCNLTGRQLRIEKIVY 1079
Query: 269 TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
TGKEGKTTQGCP+AKWVIRRA +EEK+L++VK R GH C A+IVV +VAW+G+P ++D
Sbjct: 1080 TGKEGKTTQGCPVAKWVIRRADMEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEAD 1139
Query: 329 GVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYAR 388
Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYAR
Sbjct: 1140 NAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYAR 1199
Query: 389 SKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKP 448
SKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E+EAS+CRLG +P
Sbjct: 1200 SKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEQEASDCRLGLEP 1259
Query: 449 GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDD 506
G+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY MD
Sbjct: 1260 GKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYTMDG 1319
Query: 507 SDEFGNKEAQEEKVNTGAIENLN 529
+DEF + E Q +K TGA++ L+
Sbjct: 1320 TDEFESVEGQRDKHRTGAVQMLD 1342
>gi|442629819|ref|NP_001261343.1| CG43444, isoform E [Drosophila melanogaster]
gi|440215220|gb|AGB94038.1| CG43444, isoform E [Drosophila melanogaster]
Length = 2866
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 1519 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1574
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 1575 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1631
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 1632 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1691
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1692 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1751
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1752 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1811
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1812 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1871
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1872 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1931
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1932 MDGTDEFESVEGQRDKHRTGAVQMLD 1957
>gi|442629821|ref|NP_001261344.1| CG43444, isoform F [Drosophila melanogaster]
gi|440215221|gb|AGB94039.1| CG43444, isoform F [Drosophila melanogaster]
Length = 2921
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 1574 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1629
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 1630 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1686
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 1687 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1746
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1747 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1806
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1807 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1866
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1867 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1926
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1927 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1986
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1987 MDGTDEFESVEGQRDKHRTGAVQMLD 2012
>gi|386770417|ref|NP_001246581.1| CG43444, isoform A [Drosophila melanogaster]
gi|383291702|gb|AFH04252.1| CG43444, isoform A [Drosophila melanogaster]
Length = 2860
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 1513 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1568
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 1569 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1625
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 1626 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1685
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1686 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1745
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1746 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1805
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1806 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1865
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1866 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1925
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1926 MDGTDEFESVEGQRDKHRTGAVQMLD 1951
>gi|386770419|ref|NP_001246582.1| CG43444, isoform B [Drosophila melanogaster]
gi|383291703|gb|AFH04253.1| CG43444, isoform B [Drosophila melanogaster]
Length = 2915
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 1568 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 1623
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 1624 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 1680
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 1681 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 1740
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1741 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1800
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1801 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1860
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1861 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1920
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1921 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1980
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1981 MDGTDEFESVEGQRDKHRTGAVQMLD 2006
>gi|386770421|ref|NP_647750.4| CG43444, isoform C [Drosophila melanogaster]
gi|386770423|ref|NP_001246583.1| CG43444, isoform D [Drosophila melanogaster]
gi|383291704|gb|AAF47691.4| CG43444, isoform C [Drosophila melanogaster]
gi|383291705|gb|AFH04254.1| CG43444, isoform D [Drosophila melanogaster]
Length = 2056
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 709 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 764
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 765 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKIN--PMFDTTD 821
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 822 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 881
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 882 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 941
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 942 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1001
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1002 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1061
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 1062 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 1121
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 1122 MDGTDEFESVEGQRDKHRTGAVQMLD 1147
>gi|195336964|ref|XP_002035103.1| GM14104 [Drosophila sechellia]
gi|194128196|gb|EDW50239.1| GM14104 [Drosophila sechellia]
Length = 1253
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 251/446 (56%), Positives = 318/446 (71%), Gaps = 36/446 (8%)
Query: 113 VDPRMEEHSDSGKKS-YVFAG-EGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMR 170
++P++E+ G Y + G EG P + G CCR GGT PPT+EHLKDG C G+
Sbjct: 77 LEPKIEDMGMLGHGGGYAYLGSEGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLG 132
Query: 171 TQ--------DEMLEP-----------------KEPNNNEEPATVKAEDPNSKEMLDHIE 205
Q DE+++ ++P+ E VK E N M D +
Sbjct: 133 IQPKEELIDEDELIDTHGNGLKPIGGVGKAKGKQKPDEIPE-IVVKHEKINP--MFDTTD 189
Query: 206 RLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
RL+ +TE+P+C+CF SDK PPEPG+YYTHLG A+SL DLR++ EER G+ LR+EK
Sbjct: 190 RLEKGNKTEIPECECFQSDKNPPEPGTYYTHLGTASSLMDLRREFEERCNLTGRQLRIEK 249
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 250 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 309
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 310 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 369
Query: 386 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 370 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 429
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYI 503
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K +R PDDEQ HVLPLY
Sbjct: 430 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPGNRDTRLPDDEQFHVLPLYT 489
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
MD +DEF + E Q +K TGA++ L+
Sbjct: 490 MDGTDEFESVEGQRDKHRTGAVQMLD 515
>gi|195129473|ref|XP_002009180.1| GI13905 [Drosophila mojavensis]
gi|193920789|gb|EDW19656.1| GI13905 [Drosophila mojavensis]
Length = 2290
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 243/419 (57%), Positives = 306/419 (73%), Gaps = 21/419 (5%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--DEML---EPKEPN 182
Y + GEG P + G CCR GGT PPT EHLK G C G+ Q +E+L E E +
Sbjct: 941 YPYLGEGKPLN----NGFSCCRQGGTRPPTEEHLKGGTCLGLSIQPKEELLDEDELAEAH 996
Query: 183 NNEEPATVKAEDPNSKE----------MLDHIERLKNNMRTEVPDCKCFASDKLPPEPGS 232
N + T K + E M D +RL+ +TE+P+C+CF SDK PPEPG+
Sbjct: 997 NGVKAKTKKQKQEEIPEIIVKHEKINPMFDTTDRLEKGNKTEIPECECFQSDKNPPEPGT 1056
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLG A++L +LR++ EER G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA E
Sbjct: 1057 YYTHLGTASTLMELRREFEERCHLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADPE 1116
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EK+L++VK R GH C A+IVV +VAW+G+P ++D Y L KLNK+GLPTTRRCATN
Sbjct: 1117 EKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRKEADDAYVNLIPKLNKFGLPTTRRCATN 1176
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHL 412
E RTCACQGLDP++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M+L
Sbjct: 1177 ENRTCACQGLDPESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNL 1236
Query: 413 LATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNM 472
+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM
Sbjct: 1237 IATLLAPVFKQVCPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNM 1296
Query: 473 NNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
+GCTV V+L K +R PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L+
Sbjct: 1297 QDGCTVHVALLKPSNRDTHLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 1355
>gi|198463152|ref|XP_002135446.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
gi|198151134|gb|EDY74073.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
Length = 2141
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/423 (56%), Positives = 304/423 (71%), Gaps = 25/423 (5%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--------DEMLEPK 179
Y + G+G P + G CCR GGT PPT+EHLKDG C G+ Q DE+++
Sbjct: 766 YTYLGDGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDELVDAH 821
Query: 180 EPNNNEEPATVKAEDPNS-----------KEMLDHIERLKNNMRTEVPDCKCFASDKLPP 228
+ P+ + D +RL+ +TE+P+C+CF SDK PP
Sbjct: 822 NGMKGGAGKAKGKQKPDEIPEIIVKHEKINPLFDTTDRLEKGNKTEIPECECFQSDKNPP 881
Query: 229 EPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRR 288
EPG+YYTHLG A+SL +LR++ E+R G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRR
Sbjct: 882 EPGTYYTHLGTASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRR 941
Query: 289 ASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRR 348
A LEEK+L++VK R GH C A+IVV +VAW+G+P ++D Y L KLNKYGLPTTRR
Sbjct: 942 ADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRR 1001
Query: 349 CATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEE 408
CATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+
Sbjct: 1002 CATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIED 1061
Query: 409 KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRD
Sbjct: 1062 HMNLIATLLAPVFKQVCPRSYDNQTKYEGEASDCRLGLEPGKPFSGVTACLDFCAHSHRD 1121
Query: 469 LHNMNNGCTVVVSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
LHNM +GCTV V+L K +R PDDEQ HVLPLY MD +DEF + E Q +K TGA++
Sbjct: 1122 LHNMQDGCTVHVALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESIEGQRDKHRTGAVQ 1181
Query: 527 NLN 529
L+
Sbjct: 1182 MLD 1184
>gi|157125426|ref|XP_001654335.1| hypothetical protein AaeL_AAEL001921 [Aedes aegypti]
gi|108882699|gb|EAT46924.1| AAEL001921-PA [Aedes aegypti]
Length = 1953
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 218/327 (66%), Positives = 262/327 (80%), Gaps = 3/327 (0%)
Query: 205 ERLKNNMRTEVPDCKCFASD--KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
E+L+ + E PDC CF S K P EPGSYYTHLGAA+SL +LR++ E R G GK LR
Sbjct: 1069 EKLEKAHKPEAPDCDCFTSSDTKAPSEPGSYYTHLGAASSLEELRRETETRVGLSGKQLR 1128
Query: 263 MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
+EK++YTGKEGK++QGCP+AKWVIRR EEKLL IVK RQGH C A+IV+ IV W+G+
Sbjct: 1129 IEKVVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFIVKRRQGHRCKAAFIVICIVVWDGI 1188
Query: 323 PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
P ++D VY +L+ KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYN
Sbjct: 1189 PTQEADSVYRMLSVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYN 1248
Query: 383 GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +C
Sbjct: 1249 GCKYARSKTVRKFRLSVKNEEAEIEERMNILATMLSPLYVTVAPQAFQNQVQYEREAPDC 1308
Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K K DDEQLH+LPL
Sbjct: 1309 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKADDEQLHILPL 1368
Query: 502 YIMDDSDEFGNKEAQEEKVNTGAIENL 528
Y MD +DEF ++E Q++K TGA++ L
Sbjct: 1369 YTMDTTDEFDSEEGQKKKAETGAVQVL 1395
>gi|170047947|ref|XP_001851464.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870207|gb|EDS33590.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1872
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 215/327 (65%), Positives = 263/327 (80%), Gaps = 3/327 (0%)
Query: 205 ERLKNNMRTEVPDCKCFAS--DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
E+L+ + E PDC CF S +K P EPGSYYTHLG+A++L +LR++ E R G GK LR
Sbjct: 977 EKLEKAHKPEAPDCDCFNSTDNKAPSEPGSYYTHLGSASTLEELRRETEARVGLTGKQLR 1036
Query: 263 MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
+EK++YTGKEGK++QGCP+AKWVIRR EEKLL +VK RQGH C ++IV+ IV W+G+
Sbjct: 1037 IEKVVYTGKEGKSSQGCPIAKWVIRRVDQEEKLLFVVKRRQGHRCKASFIVICIVVWDGI 1096
Query: 323 PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
P ++D VY +L KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYN
Sbjct: 1097 PTQEADSVYRMLAVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYN 1156
Query: 383 GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +C
Sbjct: 1157 GCKYARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDC 1216
Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K KPDDEQLH+LPL
Sbjct: 1217 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKPDDEQLHILPL 1276
Query: 502 YIMDDSDEFGNKEAQEEKVNTGAIENL 528
Y MD +DEF ++E Q++K TGA++ L
Sbjct: 1277 YTMDTTDEFDSEEGQKKKAETGAVQVL 1303
>gi|158286121|ref|XP_001688023.1| AGAP007180-PA [Anopheles gambiae str. PEST]
gi|157020316|gb|EDO64672.1| AGAP007180-PA [Anopheles gambiae str. PEST]
Length = 2328
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 216/327 (66%), Positives = 264/327 (80%), Gaps = 3/327 (0%)
Query: 205 ERLKNNMRTEVPDCKCFAS--DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALR 262
++L+ + + E PDC CF+S DK P EPGSYYTHLG AA+L DLR++ E R G GK LR
Sbjct: 1326 DKLEKSHKPEAPDCDCFSSGTDKAPSEPGSYYTHLGCAATLEDLRRETELRVGLTGKQLR 1385
Query: 263 MEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGV 322
+EK++YTGKEGK++QGCP+AKWVIRR EEKLL +VK RQGH C ++IV+ IV W+G+
Sbjct: 1386 IEKVVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFVVKRRQGHRCKASFIVICIVVWDGI 1445
Query: 323 PLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYN 382
P +++D VY +L KLNK+GLPT RRCATNE RTCACQGLDP+ CG S+SFGCSWSMYYN
Sbjct: 1446 PTHEADSVYRMLAVKLNKFGLPTVRRCATNENRTCACQGLDPELCGVSYSFGCSWSMYYN 1505
Query: 383 GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
GCKYARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +C
Sbjct: 1506 GCKYARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDC 1565
Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS-KPDDEQLHVLPL 501
RLG KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V+L K KPDDEQLHVLPL
Sbjct: 1566 RLGLKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKPLPPGVKPDDEQLHVLPL 1625
Query: 502 YIMDDSDEFGNKEAQEEKVNTGAIENL 528
Y MD +DEF ++E Q++K TGA++ L
Sbjct: 1626 YTMDTTDEFDSEEGQKKKHETGAVQVL 1652
>gi|270007246|gb|EFA03694.1| hypothetical protein TcasGA2_TC013798 [Tribolium castaneum]
Length = 856
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 207/324 (63%), Positives = 260/324 (80%)
Query: 205 ERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRME 264
+R + E+P C C + + EPG++YTHLG A +L +LR D+E R+G KG+A+R+E
Sbjct: 333 KRYSSPYENEIPYCNCVRAGRGAAEPGTFYTHLGCANNLINLRHDLETRTGVKGRAIRIE 392
Query: 265 KILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPL 324
KI YTGKEGKT QGCP+AKWVIRR+ +EK L+IVKHR GH+C +A+IVV IV W+G+P
Sbjct: 393 KIRYTGKEGKTAQGCPIAKWVIRRSGSDEKYLIIVKHRPGHSCPSAFIVVCIVMWDGLPQ 452
Query: 325 NQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 384
SD +Y +LT+KLNK+GL RRCATNE +TCACQGL+PDTCGASFSFGCSWSMYYNGC
Sbjct: 453 PTSDELYTLLTSKLNKFGLANRRRCATNESKTCACQGLNPDTCGASFSFGCSWSMYYNGC 512
Query: 385 KYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRL 444
K++RSK VRKFRL+V+ EE+ +EEK+ +LAT +SP+Y++LAP AF NQC FE ECRL
Sbjct: 513 KFSRSKFVRKFRLNVQPEEKIVEEKLQILATYLSPIYRSLAPVAFRNQCFFEEGGRECRL 572
Query: 445 GFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIM 504
G +PGRPFSGVTAC DFCAHSH+D HNM NGCTVVV+LTKHR KP+DEQLHVLPLY++
Sbjct: 573 GLRPGRPFSGVTACLDFCAHSHKDSHNMVNGCTVVVTLTKHRKGEKPEDEQLHVLPLYVV 632
Query: 505 DDSDEFGNKEAQEEKVNTGAIENL 528
+ +DEF ++ QEEK+ G+IE L
Sbjct: 633 EGTDEFDSQGGQEEKIRMGSIEVL 656
>gi|405950810|gb|EKC18772.1| Putative methylcytosine dioxygenase TET2 [Crassostrea gigas]
Length = 1231
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 195/329 (59%), Positives = 248/329 (75%), Gaps = 2/329 (0%)
Query: 202 DHIERLKNNMRTEVPDCKCFASDKLPPE--PGSYYTHLGAAASLPDLRKDIEERSGYKGK 259
+H++RL+ N+++E+P C C D +P E G YYTHLGAA S+ +R+ +E+R+G KG+
Sbjct: 365 EHLDRLRKNIKSEMPRCSCRGPDYVPSEDVEGPYYTHLGAARSIQAVRELLEKRTGEKGR 424
Query: 260 ALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAW 319
++R+EKI YTGKEGK++QGCP+AKW+IRR+ EEK L +V+ R GH C TA I+ V+VAW
Sbjct: 425 SIRIEKIRYTGKEGKSSQGCPIAKWIIRRSGQEEKYLCVVRQRPGHFCETACIIAVLVAW 484
Query: 320 EGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSM 379
EGVP N +D +Y L L G T RRC TNE +TCACQG+D GASFSFGCSWSM
Sbjct: 485 EGVPQNMADDLYQYLRTTLPTNGFETERRCGTNERKTCACQGIDLVRRGASFSFGCSWSM 544
Query: 380 YYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREA 439
YYNGCK+ARS+ RKF+L ++E E+E K+ LAT ++PLY+ +AP A++NQ QFE A
Sbjct: 545 YYNGCKFARSREARKFKLKDTTKEVELEGKLQDLATKMAPLYQQMAPDAYSNQTQFEDTA 604
Query: 440 SECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVL 499
CRLG + GRPFSGVTAC DFCAHSHRDLHNMNNG TVVV+LTKHR + KPDDEQLH L
Sbjct: 605 RMCRLGNEEGRPFSGVTACVDFCAHSHRDLHNMNNGSTVVVTLTKHRGMGKPDDEQLHTL 664
Query: 500 PLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
PL++MD +DE G EAQ EK G++E L
Sbjct: 665 PLHVMDMTDEHGCSEAQFEKARNGSLEVL 693
>gi|427788369|gb|JAA59636.1| Putative thyroid hormone receptor-associated protein complex
subunit [Rhipicephalus pulchellus]
Length = 1666
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 201/335 (60%), Positives = 255/335 (76%), Gaps = 1/335 (0%)
Query: 195 PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPP-EPGSYYTHLGAAASLPDLRKDIEER 253
P + + +ERL++N + E P C C + + PP + YYTHLG+ ++ +R+ +E R
Sbjct: 568 PGADPWWERLERLRSNAKAEPPACDCLSPEDAPPLDKSPYYTHLGSGPTVAAIREMLERR 627
Query: 254 SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
G ALR+EK+LYTGKEGKT+QGCP+AKWVIRR+S EK+L +++HRQGH C +A+IV
Sbjct: 628 LNETGSALRIEKVLYTGKEGKTSQGCPVAKWVIRRSSPNEKVLAVLRHRQGHRCLSAYIV 687
Query: 314 VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
+ IVAWEGV + +D +Y + +K +G PT RRC TNE RTCACQG D + CGASFSF
Sbjct: 688 MAIVAWEGVHADMADDLYRTVVHKTVNFGFPTQRRCGTNEQRTCACQGADSENCGASFSF 747
Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQC 433
GCSWSMYYNGCKYARSK+VRKF+LS +SEEQE+EEK+ LAT ++PLY +AP ++ NQ
Sbjct: 748 GCSWSMYYNGCKYARSKSVRKFKLSEQSEEQELEEKLQQLATDMAPLYARVAPESYKNQT 807
Query: 434 QFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDD 493
+FE E CRLG KPGRPFSGVTAC DFCAHSH+DLHNMNNGCTVVV+LTKHR K DD
Sbjct: 808 EFESEGISCRLGLKPGRPFSGVTACVDFCAHSHKDLHNMNNGCTVVVTLTKHRGFEKGDD 867
Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
EQLHVLPLY++D +DE+G K+ EKV G++E L
Sbjct: 868 EQLHVLPLYVLDATDEYGKKDGFYEKVKAGSLEVL 902
>gi|195167891|ref|XP_002024766.1| GL22434 [Drosophila persimilis]
gi|194108171|gb|EDW30214.1| GL22434 [Drosophila persimilis]
Length = 567
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 191/292 (65%), Positives = 237/292 (81%), Gaps = 2/292 (0%)
Query: 240 AASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIV 299
A+SL +LR++ E+R G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++V
Sbjct: 18 ASSLMELRREFEDRCQLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVV 77
Query: 300 KHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCAC 359
K R GH C A+IVV +VAW+G+P ++D Y L KLNKYGLPTTRRCATNE RTCAC
Sbjct: 78 KKRPGHRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCAC 137
Query: 360 QGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISP 419
QGLDP+T GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M+L+AT ++P
Sbjct: 138 QGLDPETSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAP 197
Query: 420 LYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVV 479
++K + P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV
Sbjct: 198 VFKQVCPRSYDNQTKYEGEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVH 257
Query: 480 VSLTK--HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
V+L K +R PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L+
Sbjct: 258 VALLKPGNRDSRLPDDEQFHVLPLYTMDGTDEFESIEGQRDKHRTGAVQMLD 309
>gi|296195853|ref|XP_002745572.1| PREDICTED: methylcytosine dioxygenase TET2 [Callithrix jacchus]
Length = 1998
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 191/321 (59%), Positives = 246/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ LN
Sbjct: 1428 EFGSIEAQEEKKRSGAIQVLN 1448
>gi|195587298|ref|XP_002083402.1| GD13372 [Drosophila simulans]
gi|194195411|gb|EDX08987.1| GD13372 [Drosophila simulans]
Length = 907
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 189/287 (65%), Positives = 233/287 (81%), Gaps = 2/287 (0%)
Query: 245 DLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQG 304
DLR++ EER G+ LR+EKI+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R G
Sbjct: 2 DLRREFEERCNLTGRQLRIEKIVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPG 61
Query: 305 HTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDP 364
H C A+IVV +VAW+G+P ++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP
Sbjct: 62 HRCIAAYIVVCMVAWDGMPRLEADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDP 121
Query: 365 DTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKAL 424
++ GAS+SFGCSWSMYYNGCKYARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K +
Sbjct: 122 ESSGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQV 181
Query: 425 APGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK 484
P ++ NQ ++E EAS+CRLG +PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V+L K
Sbjct: 182 CPRSYDNQTKYEHEASDCRLGLEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLK 241
Query: 485 --HRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
+R PDDEQ HVLPLY MD +DEF + E Q +K TGA++ L+
Sbjct: 242 PGNRDTRLPDDEQFHVLPLYTMDGTDEFESVEGQRDKHRTGAVQMLD 288
>gi|403275632|ref|XP_003929543.1| PREDICTED: methylcytosine dioxygenase TET2 [Saimiri boliviensis
boliviensis]
Length = 1999
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 190/321 (59%), Positives = 245/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1130 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1188
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1189 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1248
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1249 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1308
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1368
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1428
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1429 EFGSVEAQEEKKRSGAIQVLS 1449
>gi|297674086|ref|XP_002815070.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pongo abelii]
Length = 2023
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1268
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1269 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469
>gi|338722529|ref|XP_001503267.3| PREDICTED: methylcytosine dioxygenase TET2 [Equus caballus]
Length = 1933
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/321 (59%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGALTNRRCAHNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L V EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLVDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRNGAIQVLS 1448
>gi|332216742|ref|XP_003257511.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
[Nomascus leucogenys]
Length = 1996
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1122 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1180
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1181 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1240
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1241 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1300
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1301 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1360
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1361 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1420
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1421 EFGSVEAQEEKKQSGAIQVLS 1441
>gi|297674088|ref|XP_002815071.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pongo abelii]
Length = 2002
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|397519747|ref|XP_003830015.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan paniscus]
Length = 2002
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|332819904|ref|XP_003310448.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan
troglodytes]
gi|410352429|gb|JAA42818.1| tet oncogene family member 2 [Pan troglodytes]
Length = 2002
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|410213066|gb|JAA03752.1| tet oncogene family member 2 [Pan troglodytes]
gi|410301428|gb|JAA29314.1| tet oncogene family member 2 [Pan troglodytes]
Length = 2002
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|397519749|ref|XP_003830016.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan paniscus]
Length = 2023
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1269 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469
>gi|426345124|ref|XP_004040272.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Gorilla gorilla
gorilla]
Length = 2002
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|332819906|ref|XP_526645.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan
troglodytes]
Length = 2023
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1269 LTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469
>gi|355749478|gb|EHH53877.1| hypothetical protein EGM_14586 [Macaca fascicularis]
Length = 1999
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1186 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1245
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1246 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1305
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1306 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1365
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1366 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1425
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1426 EFGSVEAQEEKKRSGAIQVLS 1446
>gi|402870138|ref|XP_003899096.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
[Papio anubis]
Length = 2027
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1154 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1212
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1213 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1272
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1273 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1332
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1333 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1392
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1393 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1452
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1453 EFGSVEAQEEKKQSGAIQVLS 1473
>gi|431897123|gb|ELK06385.1| Putative methylcytosine dioxygenase TET2 [Pteropus alecto]
Length = 2040
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1125 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1183
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ +EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1184 KSSQGCPIAKWVVRRSCIEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1243
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1244 LTETLKKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1303
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1304 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1363
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1364 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1423
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1424 EFGSIEAQEEKKRNGAIQVLS 1444
>gi|426345126|ref|XP_004040273.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Gorilla gorilla
gorilla]
Length = 2023
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1150 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1208
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1209 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1268
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1269 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1328
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1329 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1388
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1389 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1448
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1449 EFGSVEAQEEKKRSGAIQVLS 1469
>gi|187761317|ref|NP_001120680.1| methylcytosine dioxygenase TET2 isoform a [Homo sapiens]
gi|239938839|sp|Q6N021.3|TET2_HUMAN RecName: Full=Methylcytosine dioxygenase TET2
gi|227806663|emb|CAX30492.1| tet oncogene family member 2 [Homo sapiens]
Length = 2002
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>gi|355687510|gb|EHH26094.1| hypothetical protein EGK_15982 [Macaca mulatta]
Length = 2003
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1131 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1189
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1190 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1249
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1250 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1309
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1310 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1369
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1370 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1429
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1430 EFGSVEAQEEKKRSGAIQVLS 1450
>gi|444723451|gb|ELW64107.1| Methylcytosine dioxygenase TET2 [Tupaia chinensis]
Length = 2020
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/369 (54%), Positives = 261/369 (70%), Gaps = 17/369 (4%)
Query: 176 LEPKEPNNNEEPATVKAEDPNSKEMLDHIERL-----KNNMRTEV------PDCKCFASD 224
LE + +++E+ T + P L+ RL KN + T V P C+C
Sbjct: 1117 LEQQAASSSEKTPTKRTAGPVLSNFLESPSRLLDTPIKNLLDTPVKTQYDFPSCRCV-EQ 1175
Query: 225 KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
+ + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEGK++QGCP+AKW
Sbjct: 1176 IIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKW 1235
Query: 285 VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
V+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL +D +Y+ LT L KYG
Sbjct: 1236 VVRRSCDEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLTLADKLYSELTETLRKYGTL 1295
Query: 345 TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSE 402
T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK RKF+L E
Sbjct: 1296 TNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPKE 1355
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
E+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DFC
Sbjct: 1356 EEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFC 1415
Query: 463 AHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
AH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D DEFG+ EAQEEK
Sbjct: 1416 AHAHRDLHNMQNGSTLVCTLTREDNREVGGKPEDEQLHVLPLYKVSDVDEFGSAEAQEEK 1475
Query: 520 VNTGAIENL 528
+GAI+ L
Sbjct: 1476 KRSGAIQVL 1484
>gi|395847439|ref|XP_003796382.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Otolemur
garnettii]
gi|395847441|ref|XP_003796383.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Otolemur
garnettii]
Length = 2014
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 190/321 (59%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1186 KSSQGCPIAKWVVRRSCSEEKLLCFVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1245
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1246 LTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1305
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1306 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1365
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1366 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRDIGGKPEDEQLHVLPLYKVSDVD 1425
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1426 EFGSVEAQEEKKRSGAIQVLS 1446
>gi|410957091|ref|XP_003985168.1| PREDICTED: methylcytosine dioxygenase TET2 [Felis catus]
Length = 2017
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 244/321 (76%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKQSGAIQVLS 1448
>gi|291401335|ref|XP_002717242.1| PREDICTED: tet oncogene family member 2 [Oryctolagus cuniculus]
Length = 2011
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 191/320 (59%), Positives = 242/320 (75%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+EK++YTGKEG
Sbjct: 1130 DFPSCRCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIEKVIYTGKEG 1188
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWVIRR+ EEKLL +V+ R GHTC A IV++I+ WEG+P + + +Y+
Sbjct: 1189 KSSQGCPIAKWVIRRSCSEEKLLCLVRERAGHTCEAAVIVILILLWEGIPQSLATELYSE 1248
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L +G T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1249 LTETLKNHGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKVPR 1308
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P+YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLLAPIYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1368
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM+NG TVV +LTK +R + +KPDDEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMHNGSTVVCTLTKEDNREIGAKPDDEQLHVLPLYKISDVD 1428
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ EAQEEK GAIE L
Sbjct: 1429 EFGSVEAQEEKKRNGAIEVL 1448
>gi|395501402|ref|XP_003755084.1| PREDICTED: methylcytosine dioxygenase TET1 [Sarcophilus harrisii]
Length = 1578
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 192/321 (59%), Positives = 239/321 (74%), Gaps = 6/321 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLG S+ +R+ +E R G KG+A+R+E ++YTGKE
Sbjct: 826 SELPSCSCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMEARYGEKGRAIRIEVVVYTGKE 884
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 885 GKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLYQ 944
Query: 333 ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
LT LNKYG PTTRRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 945 ELTQSLNKYGCPTTRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKNP 1004
Query: 393 RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
R+FRL EE+ +E + LAT ++P+YK LAP AF NQ + E S+CRLG K GR
Sbjct: 1005 RRFRLIADDPKEEENLESNLQTLATDVAPVYKKLAPDAFQNQVENEHLGSDCRLGRKDGR 1064
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK +RS+ P DEQLHVLPLY + +
Sbjct: 1065 PFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDNRSVGVIPKDEQLHVLPLYKISQT 1124
Query: 508 DEFGNKEAQEEKVNTGAIENL 528
DEFG +E E K+ TGAI+ L
Sbjct: 1125 DEFGTREGLEAKIKTGAIQVL 1145
>gi|417406864|gb|JAA50073.1| Putative vesicle coat complex copii subunit sec31 [Desmodus rotundus]
Length = 2036
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADQLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGALTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRNGAIQVLS 1448
>gi|380805593|gb|AFE74672.1| methylcytosine dioxygenase TET2 isoform a, partial [Macaca mulatta]
Length = 430
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 40 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 98
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 99 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 158
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 159 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 218
Query: 394 KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 219 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 278
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 279 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 338
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 339 EFGSVEAQEEKKRSGAIQVLS 359
>gi|344277247|ref|XP_003410414.1| PREDICTED: methylcytosine dioxygenase TET2 [Loxodonta africana]
Length = 2013
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1130 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1188
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1189 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1248
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1249 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1308
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1309 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEDRAPECRLGLKEGRP 1368
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1369 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDMD 1428
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1429 EFGSVEAQEEKKRNGAIQVLS 1449
>gi|345795800|ref|XP_535678.3| PREDICTED: methylcytosine dioxygenase TET2 [Canis lupus familiaris]
Length = 2018
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1139 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1197
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1198 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1257
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1258 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1317
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1318 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1377
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1378 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1437
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQE+K GAI+ L+
Sbjct: 1438 EFGSVEAQEKKKQNGAIQVLS 1458
>gi|301782603|ref|XP_002926716.1| PREDICTED: probable methylcytosine dioxygenase TET2-like [Ailuropoda
melanoleuca]
Length = 2006
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1131 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1189
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1190 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1249
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1250 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1309
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1310 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1369
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1370 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1429
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQE+K GAI+ L+
Sbjct: 1430 EFGSVEAQEKKKQNGAIQVLS 1450
>gi|426231353|ref|XP_004009704.1| PREDICTED: methylcytosine dioxygenase TET2 [Ovis aries]
Length = 2001
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++ +D +Y+
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDL NM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1426
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1427 EFGSVEAQEEKKRNGAIQVLS 1447
>gi|326918544|ref|XP_003205548.1| PREDICTED: methylcytosine dioxygenase TET2-like [Meleagris gallopavo]
Length = 1955
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 187/320 (58%), Positives = 242/320 (75%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1102 DFPSCSCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1160
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y+
Sbjct: 1161 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADKLYSE 1220
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT+ L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1221 LTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1280
Query: 394 KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1281 KFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1340
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + D D
Sbjct: 1341 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 1400
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ E QEEK G+I+ L
Sbjct: 1401 EFGSTEGQEEKKRNGSIQVL 1420
>gi|297466579|ref|XP_001790198.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
gi|297475658|ref|XP_002688138.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
gi|296486794|tpg|DAA28907.1| TPA: tet oncogene family member 2 [Bos taurus]
Length = 2007
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++ +D +Y+
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDL NM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1426
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1427 EFGSVEAQEEKKRNGAIQVLS 1447
>gi|350587911|ref|XP_003129326.3| PREDICTED: methylcytosine dioxygenase TET2 [Sus scrofa]
Length = 2019
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 242/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSGSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLPLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ +AQEEK GAI+ L+
Sbjct: 1428 EFGSVDAQEEKKRNGAIQVLS 1448
>gi|449265874|gb|EMC77004.1| putative methylcytosine dioxygenase TET2, partial [Columba livia]
Length = 1470
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 615 DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 673
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y
Sbjct: 674 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADKLYTE 733
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT+ L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 734 LTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 793
Query: 394 KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 794 KFKLMGDDPKEEEKLESNLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 853
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + D D
Sbjct: 854 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 913
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ E QEEK G+I+ L
Sbjct: 914 EFGSTEGQEEKKRNGSIQVL 933
>gi|395542103|ref|XP_003772974.1| PREDICTED: methylcytosine dioxygenase TET2 [Sarcophilus harrisii]
Length = 2011
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1145 DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1203
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1204 KSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1263
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1264 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1323
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1324 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYENRAPECRLGLKEGRP 1383
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + + D
Sbjct: 1384 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGRTPEDEQLHVLPLYKVSNMD 1443
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ EAQEEK GAI+ L
Sbjct: 1444 EFGSVEAQEEKKRNGAIQVL 1463
>gi|224049493|ref|XP_002193886.1| PREDICTED: methylcytosine dioxygenase TET2 [Taeniopygia guttata]
Length = 1960
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 186/320 (58%), Positives = 241/320 (75%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1104 DFPSCSCV-EHIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVVYTGKEG 1162
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P + +D +Y+
Sbjct: 1163 KSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTSLADRLYSE 1222
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT+ L KYG T RRCA NE R CACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1223 LTDTLRKYGTLTNRRCALNEERNCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1282
Query: 394 KFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1283 KFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1342
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + D D
Sbjct: 1343 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKVSDVD 1402
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ E QEEK G+I+ L
Sbjct: 1403 EFGSTEGQEEKKRNGSIQVL 1422
>gi|334313839|ref|XP_001368961.2| PREDICTED: methylcytosine dioxygenase TET1 [Monodelphis domestica]
Length = 2124
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/321 (59%), Positives = 238/321 (74%), Gaps = 6/321 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLG S+ +R+ +E R G KG+A+R+E ++YTGKE
Sbjct: 1373 SELPSCSCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMEARYGEKGRAIRIEVVVYTGKE 1431
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 1432 GKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLYQ 1491
Query: 333 ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
LT LNKYG PTTRRCA NE RTCACQG+DP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 1492 ELTQSLNKYGCPTTRRCALNEDRTCACQGMDPETCGASFSFGCSWSMYFNGCKFARSKNP 1551
Query: 393 RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
R+FRL EE+ +E + LAT ++P+YK LAP AF NQ + E +CRLG K GR
Sbjct: 1552 RRFRLIADDPKEEEILESNLQSLATDVAPVYKKLAPDAFRNQVENEPLGPDCRLGRKDGR 1611
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK +RS+ P DEQLHVLPLY + +
Sbjct: 1612 PFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDNRSVGVVPKDEQLHVLPLYKISQT 1671
Query: 508 DEFGNKEAQEEKVNTGAIENL 528
DEFG KE E K+ TGAI+ L
Sbjct: 1672 DEFGTKEGLEAKIKTGAIQVL 1692
>gi|334330961|ref|XP_003341431.1| PREDICTED: methylcytosine dioxygenase TET2 [Monodelphis domestica]
Length = 2016
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/321 (58%), Positives = 241/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1148 DFPSCSCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1206
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1207 KSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADRLYSE 1266
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1267 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1326
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1327 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1386
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + D
Sbjct: 1387 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREVGRTPEDEQLHVLPLYKVSSMD 1446
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK GAI+ L+
Sbjct: 1447 EFGSVEAQEEKKRNGAIQVLS 1467
>gi|395841208|ref|XP_003793438.1| PREDICTED: methylcytosine dioxygenase TET3 [Otolemur garnettii]
Length = 1655
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 253/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 662 LKYLDTPTKNLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 717
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 718 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 777
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PTTRRC N+ RTCACQG DP+TCGA
Sbjct: 778 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTTRRCGLNDDRTCACQGKDPNTCGA 837
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 838 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 897
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 898 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 957
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV++GAI+ L
Sbjct: 958 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVDSGAIQVLT 1002
>gi|351694675|gb|EHA97593.1| Putative methylcytosine dioxygenase TET2 [Heterocephalus glaber]
Length = 1947
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 241/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER G KGKA+R+EK++YTGKEG
Sbjct: 1091 DFPSCHCV-EQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIEKVVYTGKEG 1149
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWVIRR+S EEKLL +V+ R+GHTC A IVV+I+ WEG+PL ++ +Y
Sbjct: 1150 KSSQGCPIAKWVIRRSSREEKLLCLVRERRGHTCEVAVIVVLILLWEGIPLPLANRLYTE 1209
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LTN L + G T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1210 LTNTLCRNGSLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKVPR 1269
Query: 394 KFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T +SP+Y+ LAP A+ NQ + E A +CRLG K GRP
Sbjct: 1270 KFKLVGDDPKEEEKLESNLQNLSTFLSPMYQKLAPDAYNNQVELEHRAPDCRLGLKEGRP 1329
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM+NG T+V +LT+ + P+DEQLHVLPLY + D D
Sbjct: 1330 FSGVTACLDFCAHAHRDLHNMHNGSTLVCTLTREDNREFGVVPEDEQLHVLPLYKISDVD 1389
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK+ +GAIE L
Sbjct: 1390 EFGSAEAQEEKMRSGAIEVLT 1410
>gi|410918022|ref|XP_003972485.1| PREDICTED: methylcytosine dioxygenase TET2-like [Takifugu rubripes]
Length = 939
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 242/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
++ C C E G YYTHLG+A S+P +R+ +E+RSG G A+R+EK++YTGKEG
Sbjct: 369 DIASCHCVEQISEKDE-GPYYTHLGSAPSVPGIRELMEKRSGITGSAIRIEKVVYTGKEG 427
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K+TQGCP+AKWVIRR S EEK+L++V+ R GHTC+TA I+VVI+ WEG+ N +D +Y
Sbjct: 428 KSTQGCPIAKWVIRRGSEEEKILVLVRERTGHTCNTACIIVVILVWEGILPNLADRLYHE 487
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
L++ L K+G T RRCA NE RTCACQGL+P+ CGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 488 LSDTLRKHGALTQRRCAHNEERTCACQGLNPEACGASFSFGCSWSMYYNGCKFARSKNPR 547
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+ +E+ LAT + PLYK+LAP A+ NQ + E+ +CRLG K GRP
Sbjct: 548 KFKLLGDDMKEEERLEQNFQSLATLLGPLYKSLAPEAYGNQVEHEQRGLDCRLGHKEGRP 607
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM G TVV +LTK +R + K PDDEQLHVLPLY ++D
Sbjct: 608 FSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTKEDNREIGKIPDDEQLHVLPLYKASNTD 667
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG++E Q+EK+ +GAI+ L+
Sbjct: 668 EFGSEEGQQEKIKSGAIQVLS 688
>gi|354495922|ref|XP_003510077.1| PREDICTED: methylcytosine dioxygenase TET3 [Cricetulus griseus]
gi|344253854|gb|EGW09958.1| putative methylcytosine dioxygenase TET3 [Cricetulus griseus]
Length = 1668
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 676 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 732 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRGTLEEKLLCLVRHRAGHHCQN 791
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A I+++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 792 AVIIILILAWEGIPRSLGDALYRELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 852 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 912 AYQNQVTNEEVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M ++DEFG++E Q KV++GAI+ L
Sbjct: 972 RCVGQIPEDEQLHVLPLYKMANTDEFGSEENQNAKVSSGAIQVLT 1016
>gi|335285293|ref|XP_003125075.2| PREDICTED: methylcytosine dioxygenase TET3 [Sus scrofa]
Length = 1660
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 666 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 721
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 722 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 781
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 782 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 841
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 842 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 901
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 902 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 961
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 962 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1006
>gi|338714173|ref|XP_001917149.2| PREDICTED: methylcytosine dioxygenase TET3 [Equus caballus]
Length = 1664
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|395731669|ref|XP_002811944.2| PREDICTED: methylcytosine dioxygenase TET3 [Pongo abelii]
Length = 1659
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|426226468|ref|XP_004007365.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
[Ovis aries]
Length = 1498
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 642 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 697
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 698 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 757
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 758 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 817
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 818 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 877
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 878 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 937
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 938 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 982
>gi|301772240|ref|XP_002921545.1| PREDICTED: probable methylcytosine dioxygenase TET3-like [Ailuropoda
melanoleuca]
Length = 1695
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 706 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 761
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 762 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 821
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGA
Sbjct: 822 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 881
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 882 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 941
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 942 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1001
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 1002 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1046
>gi|426336006|ref|XP_004029495.1| PREDICTED: methylcytosine dioxygenase TET3 [Gorilla gorilla gorilla]
Length = 1662
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 670 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 726 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 786 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 845
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 846 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 906 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 966 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1010
>gi|149944516|ref|NP_659430.1| methylcytosine dioxygenase TET3 [Homo sapiens]
gi|190358928|sp|O43151.3|TET3_HUMAN RecName: Full=Methylcytosine dioxygenase TET3
Length = 1660
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|390474307|ref|XP_002757648.2| PREDICTED: methylcytosine dioxygenase TET3 [Callithrix jacchus]
Length = 1660
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEVAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|119620108|gb|EAW99702.1| hCG40738 [Homo sapiens]
Length = 1714
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 722 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 777
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 778 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 837
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 838 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 897
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 898 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 957
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 958 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 1018 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1062
>gi|410955071|ref|XP_003984182.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
[Felis catus]
Length = 1658
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMSNTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|397478119|ref|XP_003810404.1| PREDICTED: methylcytosine dioxygenase TET3 [Pan paniscus]
Length = 1660
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|297266306|ref|XP_001107194.2| PREDICTED: probable methylcytosine dioxygenase TET3 [Macaca mulatta]
Length = 1714
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 722 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 777
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 778 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 837
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 838 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 897
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 898 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 957
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 958 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 1018 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1062
>gi|291386514|ref|XP_002709671.1| PREDICTED: tet oncogene family member 3 [Oryctolagus cuniculus]
Length = 1822
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 204/376 (54%), Positives = 263/376 (69%), Gaps = 12/376 (3%)
Query: 160 HLKDGLCQGMRTQDEM-LEPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDC 218
H +DG + T+ E L P E P +K D +K +LD + + E P C
Sbjct: 820 HSEDGGQEATPTKAENPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTC 874
Query: 219 KCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQG 278
C + + G YYTHLG+ ++ +R+ +EER G KGKA+R+EK++YTGKEGK+++G
Sbjct: 875 DCV-EQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRG 933
Query: 279 CPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKL 338
CP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P + D +Y LT+ L
Sbjct: 934 CPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTL 993
Query: 339 NKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS 398
KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+
Sbjct: 994 RKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLA 1053
Query: 399 VRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
+ EE+ + + LAT ++PLYK LAP A+ NQ E A +CRLG K GRPFSGVT
Sbjct: 1054 GDNPKEEEVLPKSFQGLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFSGVT 1113
Query: 457 ACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNK 513
AC DFCAH+H+D HN+ NGCTVV +LTK +R + K P+DEQLHVLPLY M +DEFG++
Sbjct: 1114 ACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSE 1173
Query: 514 EAQEEKVNTGAIENLN 529
E Q KV +GAI+ L
Sbjct: 1174 ENQNAKVGSGAIQVLT 1189
>gi|440904536|gb|ELR55033.1| Putative methylcytosine dioxygenase TET3, partial [Bos grunniens
mutus]
Length = 1675
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 682 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 737
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 738 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 797
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 798 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 857
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 858 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 917
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 918 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 977
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 978 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1022
>gi|281343070|gb|EFB18654.1| hypothetical protein PANDA_010427 [Ailuropoda melanoleuca]
Length = 1674
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 685 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 740
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 741 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 800
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGA
Sbjct: 801 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 860
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 861 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 920
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 921 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 980
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 981 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1025
>gi|355565799|gb|EHH22228.1| hypothetical protein EGK_05455, partial [Macaca mulatta]
Length = 1693
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 703 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 758
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 759 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 818
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 819 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 878
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 879 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 938
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 939 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 999 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1043
>gi|358414357|ref|XP_582145.4| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
Length = 1657
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 664 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 719
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 720 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 779
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 780 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 839
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 840 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 899
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 900 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 959
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 960 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1004
>gi|403260367|ref|XP_003922646.1| PREDICTED: methylcytosine dioxygenase TET3 [Saimiri boliviensis
boliviensis]
Length = 1659
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEVAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|332813444|ref|XP_515553.3| PREDICTED: methylcytosine dioxygenase TET3 [Pan troglodytes]
Length = 1662
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 670 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 726 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 786 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 845
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 846 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 906 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 966 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1010
>gi|431920360|gb|ELK18392.1| Putative methylcytosine dioxygenase TET3 [Pteropus alecto]
Length = 1631
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 669 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 724
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 725 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 784
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 785 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 844
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 845 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 904
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 905 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 964
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 965 RCVGKIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1009
>gi|441643103|ref|XP_003268728.2| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3,
partial [Nomascus leucogenys]
Length = 1787
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 796 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 851
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 852 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRTGHHCQN 911
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 912 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 971
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 972 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1031
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 1032 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1091
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 1092 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1136
>gi|402891273|ref|XP_003908876.1| PREDICTED: methylcytosine dioxygenase TET3 [Papio anubis]
Length = 1660
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1008
>gi|344283728|ref|XP_003413623.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase
TET3-like [Loxodonta africana]
Length = 1582
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 592 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 647
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 648 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 707
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 708 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 767
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 768 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 827
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 828 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 887
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 888 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 932
>gi|316990466|gb|ADU77107.1| putative methylcytosine dioxygenase [Homo sapiens]
Length = 1795
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 803 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 858
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 859 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 918
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 919 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 978
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 979 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1038
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 1039 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1098
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 1099 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1143
>gi|355751424|gb|EHH55679.1| hypothetical protein EGM_04930, partial [Macaca fascicularis]
Length = 1621
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 703 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 758
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 759 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 818
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 819 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 878
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 879 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 938
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 939 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 999 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 1043
>gi|417406721|gb|JAA50005.1| Putative snf2 family dna-dependent atpase [Desmodus rotundus]
Length = 1759
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 770 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 825
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 826 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 885
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 886 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 945
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 946 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 1005
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 1006 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1065
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 1066 RCVGKIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1110
>gi|348566495|ref|XP_003469037.1| PREDICTED: methylcytosine dioxygenase TET3-like [Cavia porcellus]
Length = 1670
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 679 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 734
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 735 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 794
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 795 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 854
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 855 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 914
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 915 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 974
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 975 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1019
>gi|345782422|ref|XP_540225.3| PREDICTED: methylcytosine dioxygenase TET3 [Canis lupus familiaris]
Length = 1660
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 670 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 725
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 726 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 785
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGA
Sbjct: 786 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGA 845
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 846 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 905
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 906 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DE+G++E Q KV +GAI+ L
Sbjct: 966 RCVGKIPEDEQLHVLPLYKMANTDEYGSEENQNAKVGSGAIQVLT 1010
>gi|390347525|ref|XP_785530.3| PREDICTED: uncharacterized protein LOC580376 [Strongylocentrotus
purpuratus]
Length = 1458
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 179/334 (53%), Positives = 238/334 (71%), Gaps = 8/334 (2%)
Query: 201 LDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKA 260
+ H++ + + R E P+C C +D + YYTHLG +LP +R+ +E RSG++G
Sbjct: 491 MKHMQLISEDARIEAPNCGCLENDM---DEAPYYTHLGTGPNLPAIRELVEIRSGFQGSQ 547
Query: 261 LRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWE 320
+R+EK++Y+GKEGK++ GCP+AKW+IRR+S +EK+L++V+HR GH C T++I++ IVAWE
Sbjct: 548 VRIEKVVYSGKEGKSSTGCPIAKWIIRRSSTDEKILVLVRHRPGHRCDTSYIIIAIVAWE 607
Query: 321 GVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMY 380
GV +D Y +L L +PT RRC TNE +TCACQG PD+CGASF+FGCSWSMY
Sbjct: 608 GVNNYVADDTYEMLRTTLPNGAIPTVRRCGTNEDKTCACQGFSPDSCGASFTFGCSWSMY 667
Query: 381 YNGCKYARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFERE 438
YN CK+ARS+T RKF+L + E E + ++ +AT + PLYK LAP +F N FE E
Sbjct: 668 YNTCKFARSRTPRKFKLLEANPEVEDVLSDRFQNMATDLGPLYKRLAPESFNNMVVFEEE 727
Query: 439 ASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSK---PDDEQ 495
ECRLG + GRPF+GVTAC DFCAH+H+D HNMNNGCTVVV+LTK +K P DEQ
Sbjct: 728 GKECRLGKETGRPFAGVTACMDFCAHAHKDQHNMNNGCTVVVTLTKDDIRNKRPSPGDEQ 787
Query: 496 LHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
LHVLPLY +D +DEFG E Q+ KV G+IE L
Sbjct: 788 LHVLPLYYLDSTDEFGTAEGQQNKVRNGSIEVLT 821
>gi|432108066|gb|ELK33047.1| Methylcytosine dioxygenase TET3 [Myotis davidii]
Length = 1772
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 746 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 801
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 802 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 861
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 862 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 921
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 922 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 981
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 982 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1041
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 1042 RCVGRIPEDEQLHVLPLYKMATTDEFGSEENQNAKVGSGAIQVLT 1086
>gi|363742165|ref|XP_003642602.1| PREDICTED: methylcytosine dioxygenase TET3-like [Gallus gallus]
Length = 1308
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 201/370 (54%), Positives = 259/370 (70%), Gaps = 12/370 (3%)
Query: 166 CQGMRTQDEM-LEPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASD 224
+G T+DE+ L P E P +K D +K +LD + + E P C C
Sbjct: 308 AEGTPTKDEVPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQ 361
Query: 225 KLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKW 284
+ + G YYTHLG+ ++ +R+ +EER G KGKA+R+EK++YTGKEGK+++GCP+AKW
Sbjct: 362 IVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKW 421
Query: 285 VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLP 344
VIRR + EEKLL +V+HR GH C A I+++I+AWEG+P D +Y LT+ L KYG P
Sbjct: 422 VIRRHNQEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRTLGDTLYQELTDTLTKYGNP 481
Query: 345 TTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--E 402
T+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL + E
Sbjct: 482 TSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLVGDNPKE 541
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
E+ + + LAT ++PLYK LAP A+ NQ E A +CRLG K GRPFSGVTAC DFC
Sbjct: 542 EELLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEDIAIDCRLGLKEGRPFSGVTACMDFC 601
Query: 463 AHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
AH+H+D HN+ NGCTVV +LTK +R + K P+DEQLHVLPLY M +DEFG++E Q K
Sbjct: 602 AHAHKDQHNLYNGCTVVCTLTKEDNRVVGKIPEDEQLHVLPLYKMSSTDEFGSEENQNAK 661
Query: 520 VNTGAIENLN 529
V +GAI+ L
Sbjct: 662 VGSGAIQVLT 671
>gi|444725167|gb|ELW65745.1| Methylcytosine dioxygenase TET1 [Tupaia chinensis]
Length = 1472
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/322 (58%), Positives = 238/322 (73%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
TEVP+C C + + G YYTHLGA ++ +R+ +E R G KGKA+R+E ++YTGKE
Sbjct: 770 TEVPECDCL-DRAIQKDKGPYYTHLGAGPTVAAVREIMENRYGQKGKAIRIETVVYTGKE 828
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWVIRR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL +D +Y
Sbjct: 829 GKSSHGCPVAKWVIRRSSEEEKVLCLVRKRAGHHCSTAVIVVLIMVWEGIPLPMADQLYK 888
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 889 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 948
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 949 PRRFRIDPSSPLHEKNLEDNLQNLATQLAPIYKQFAPDAYKNQVEYEHVARECRLGSKEG 1008
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1009 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLAD 1068
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1069 TDEFGSQEGMEAKIKSGAIEVL 1090
>gi|291404265|ref|XP_002718498.1| PREDICTED: CXXC finger 5-like [Oryctolagus cuniculus]
Length = 2112
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/326 (58%), Positives = 241/326 (73%), Gaps = 15/326 (4%)
Query: 213 TEVPDCKCF----ASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
TEVP C C DK G YYTHLGA S+ +R+ +E R G KGKA+R+E+++Y
Sbjct: 1405 TEVPSCNCLDRGTQKDK-----GPYYTHLGAGPSVAAVREIMENRYGQKGKAVRIEEVVY 1459
Query: 269 TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
TGKEGK+++GCP+AKWV+RR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL +D
Sbjct: 1460 TGKEGKSSRGCPVAKWVLRRSSEEEKVLCLVRKRPGHHCSTAVIVVLIMIWEGIPLPMAD 1519
Query: 329 GVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYA 387
+Y+ LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+
Sbjct: 1520 RLYSELTENLRSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFG 1579
Query: 388 RSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG
Sbjct: 1580 RSPSPRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPVAYQNQVEYEHVARECRLG 1639
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLY 502
K GRPFSGVTAC DFCAHSHRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY
Sbjct: 1640 RKEGRPFSGVTACLDFCAHSHRDIHNMNNGSTVVCTLTREDNRSLGVVPQDEQLHVLPLY 1699
Query: 503 IMDDSDEFGNKEAQEEKVNTGAIENL 528
+ D+DEFG+KE E K+ +GAIE L
Sbjct: 1700 KLADTDEFGSKEGMERKIKSGAIEVL 1725
>gi|148666664|gb|EDK99080.1| mCG133587 [Mus musculus]
Length = 1707
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + ++E P C C + + G YYTHLG+ ++ +R+
Sbjct: 715 LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 770
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 771 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 830
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 831 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 890
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 891 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 950
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 951 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1010
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 1011 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1055
>gi|256773243|ref|NP_898961.2| methylcytosine dioxygenase TET3 [Mus musculus]
gi|239938841|sp|Q8BG87.3|TET3_MOUSE RecName: Full=Methylcytosine dioxygenase TET3
Length = 1668
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + ++E P C C + + G YYTHLG+ ++ +R+
Sbjct: 676 LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 732 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 791
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 792 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 852 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 912 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 972 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1016
>gi|313493537|gb|ADR57138.1| TET3 isoform 2 [Mus musculus]
Length = 1784
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + ++E P C C + + G YYTHLG+ ++ +R+
Sbjct: 792 LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 847
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 848 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 907
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 908 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 967
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 968 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 1027
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 1028 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1087
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 1088 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1132
>gi|313493535|gb|ADR57137.1| TET3 isoform 1 [Mus musculus]
gi|432138979|gb|AGB05430.1| Tet methylcytosine deoxygenase 3 isoform [Mus musculus]
Length = 1803
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + ++E P C C + + G YYTHLG+ ++ +R+
Sbjct: 811 LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 866
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 867 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 926
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 927 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 986
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 987 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 1046
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 1047 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1106
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 1107 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1151
>gi|432847164|ref|XP_004065962.1| PREDICTED: methylcytosine dioxygenase TET2-like [Oryzias latipes]
Length = 1755
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/330 (56%), Positives = 248/330 (75%), Gaps = 8/330 (2%)
Query: 207 LKNNMRTEVPDCKCFASDKL-PPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEK 265
L ++T+ C D++ + G YYTHLG+A ++P +R+ +E+RSG G+A+R+EK
Sbjct: 915 LDTPLKTQYDIASCHCVDQIVEKDEGPYYTHLGSAPTVPGIREMMEKRSGLTGRAIRIEK 974
Query: 266 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 325
++YTGKEGK+TQGCP+AKWVIRR+S+EEKLL++V+ R GH C TA I+VVI+ WEG+ +
Sbjct: 975 VIYTGKEGKSTQGCPIAKWVIRRSSVEEKLLVLVRERTGHRCETACIIVVILVWEGIQAS 1034
Query: 326 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 385
+D +Y L+ L K G T RRCA NE RTCACQGL+P+ GASFSFGCSWSMYYNGCK
Sbjct: 1035 LADRLYLELSETLKKNGAHTQRRCAFNEERTCACQGLNPEESGASFSFGCSWSMYYNGCK 1094
Query: 386 YARSKTVRKFRL---SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 442
+ARSK RKF+L VR EE+++E LAT ++PLYKA+AP A+ NQ + E A +C
Sbjct: 1095 FARSKIPRKFKLLGDDVR-EEEKVERNFQNLATLLAPLYKAMAPEAYGNQVEHEHRAPDC 1153
Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVL 499
RLG K GRPFSGVTAC DFCAH+HRDLHNM G TVV +LT+ +R + + P+DEQLHVL
Sbjct: 1154 RLGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTREDNREIGRIPEDEQLHVL 1213
Query: 500 PLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
PLY ++DEFG++E Q+EK+ +GAI+ L+
Sbjct: 1214 PLYKASNTDEFGSEEGQQEKMKSGAIQVLS 1243
>gi|293346889|ref|XP_002726470.1| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
gi|293358777|ref|XP_001057850.2| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
gi|149036522|gb|EDL91140.1| rCG56357 [Rattus norvegicus]
Length = 1667
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 251/345 (72%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 675 LKYLDTPTKSLLDTPAK---KAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 730
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 731 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 790
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 791 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 850
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 851 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 910
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 911 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 970
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 971 RCVGQIPEDEQLHVLPLYKMATTDEFGSEENQNAKVSSGAIQVLT 1015
>gi|327283432|ref|XP_003226445.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2-like
[Anolis carolinensis]
Length = 1631
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 184/320 (57%), Positives = 237/320 (74%), Gaps = 6/320 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G +YTHLGA ++ +R+ +EER KGKA+R+E+I+YTGKEG
Sbjct: 786 DFPSCSC-VEQIIEKDEGPFYTHLGAGPNVAAIRQIMEERYEQKGKAIRIERIVYTGKEG 844
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K+ QGCP+AKWVIRR S EEKLL +V+ R GH+C TA IVV+I+ WEG+P + +D +Y+
Sbjct: 845 KSAQGCPIAKWVIRRGSTEEKLLCLVRERAGHSCETAVIVVLILVWEGIPQSLADKLYSD 904
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
L+ L KYG T RRCA NE RTCACQGLD ++CGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 905 LSETLRKYGTLTNRRCALNEERTCACQGLDTESCGASFSFGCSWSMYYNGCKFARSKIPR 964
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 965 KFKLLGDDPKEEEKLETSLQTLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1024
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R + + P+DEQLHVLPLY + + D
Sbjct: 1025 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREIGQTPEDEQLHVLPLYKISNID 1084
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+ E QEEK G+I+ L
Sbjct: 1085 EFGSTEGQEEKKRNGSIQVL 1104
>gi|410922577|ref|XP_003974759.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
Length = 2020
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/324 (57%), Positives = 238/324 (73%), Gaps = 6/324 (1%)
Query: 210 NMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYT 269
+++ E P C C L + G YY HLGA ++ +R +E R+G KG A+R+EK++YT
Sbjct: 1049 DLQAEFPTCTCV-EQILEKDEGPYYNHLGAGPTVAAVRDLMERRTGLKGDAIRLEKVVYT 1107
Query: 270 GKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDG 329
G+EGK++QGCP+AKWVIRR + EKLL +V+ R GH C A I++VI+AWEGVP +D
Sbjct: 1108 GREGKSSQGCPIAKWVIRRGNETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRAMADM 1167
Query: 330 VYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
+Y L++ L KYG PT+RRC N+ RTCACQG DP+ CGASFSFGCSWSMY+NGCKYARS
Sbjct: 1168 LYRDLSDSLTKYGNPTSRRCGFNDDRTCACQGKDPEKCGASFSFGCSWSMYFNGCKYARS 1227
Query: 390 KTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
K RKFRL EE ++ ++ LAT ++PLYK LAP A++NQCQ E +A +CRLG K
Sbjct: 1228 KMPRKFRLQGERPEEEDKVGDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCRLGLK 1287
Query: 448 PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIM 504
GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + K PDDEQLHVLPLY +
Sbjct: 1288 EGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNREVKKIPDDEQLHVLPLYKV 1347
Query: 505 DDSDEFGNKEAQEEKVNTGAIENL 528
+DEFG +E Q K+ TGAI+ L
Sbjct: 1348 SLTDEFGREEGQRLKMKTGAIQVL 1371
>gi|449504705|ref|XP_002190919.2| PREDICTED: methylcytosine dioxygenase TET1 [Taeniopygia guttata]
Length = 2187
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
++E+P C C + + G YYTHLG S+ +R+ +E R G KG A+R+E ++YTGK
Sbjct: 1427 QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 1485
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 1486 EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 1545
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 1546 KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 1605
Query: 392 VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K G
Sbjct: 1606 PRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGCKDG 1665
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK R P DEQLHVLPLY +
Sbjct: 1666 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1725
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG +E E K+ GAI+ L
Sbjct: 1726 TDEFGTEEGLEAKIKAGAIQVL 1747
>gi|363735173|ref|XP_421571.3| PREDICTED: methylcytosine dioxygenase TET1 [Gallus gallus]
Length = 1541
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
++E+P C C + + G YYTHLG S+ +R+ +E R G KG A+R+E ++YTGK
Sbjct: 781 QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 839
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 840 EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 899
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 900 KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 959
Query: 392 VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K G
Sbjct: 960 PRKFRLLTDDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGSKDG 1019
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK R P DEQLHVLPLY +
Sbjct: 1020 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1079
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG +E E K+ GAI+ L
Sbjct: 1080 TDEFGTEEGLEAKIKAGAIQVL 1101
>gi|449268998|gb|EMC79810.1| Methylcytosine dioxygenase TET1, partial [Columba livia]
Length = 1186
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/322 (58%), Positives = 233/322 (72%), Gaps = 6/322 (1%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
++E+P C C + + G YYTHLG S+ +R+ +E R G KG A+R+E ++YTGK
Sbjct: 866 QSELPTCDCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMENRYGAKGSAVRIEVVVYTGK 924
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 925 EGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 984
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 985 KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 1044
Query: 392 VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K G
Sbjct: 1045 PRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGCKDG 1104
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK R P DEQLHVLPLY +
Sbjct: 1105 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1164
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG +E E K+ GAI+ L
Sbjct: 1165 TDEFGTEEGLEAKIKAGAIQVL 1186
>gi|47227721|emb|CAG09718.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2294
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/325 (57%), Positives = 236/325 (72%), Gaps = 6/325 (1%)
Query: 210 NMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYT 269
+++ E P C C L + G YY HLGA ++ +R +E R+G KG A+R+EK++YT
Sbjct: 1478 DLQAEFPTCTCV-EQILEKDEGPYYNHLGAGPTVAAVRDLMERRTGLKGDAIRLEKVVYT 1536
Query: 270 GKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDG 329
G+EGK++QGCP+AKWVIRR S EKLL +V+ R GH C A I++VI+AWEGVP +D
Sbjct: 1537 GREGKSSQGCPIAKWVIRRGSETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRAMADM 1596
Query: 330 VYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
+Y L++ L KYG PT RRC N+ RTCACQG DP+ GASFSFGCSWSMY+NGCKYARS
Sbjct: 1597 LYRDLSDSLTKYGNPTNRRCGFNDDRTCACQGKDPEKSGASFSFGCSWSMYFNGCKYARS 1656
Query: 390 KTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
K RKFRL EE ++ ++ LAT ++PLYK LAP A++NQCQ E +A +CRLG K
Sbjct: 1657 KMPRKFRLQGDRPEEEDKVRDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCRLGLK 1716
Query: 448 PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIM 504
GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + K PDDEQLHVLPLY +
Sbjct: 1717 EGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNREVQKIPDDEQLHVLPLYKV 1776
Query: 505 DDSDEFGNKEAQEEKVNTGAIENLN 529
+DEFG +E Q K+ TGAI+ L
Sbjct: 1777 SPTDEFGREEGQRLKMKTGAIQVLQ 1801
>gi|326667684|ref|XP_003198655.1| PREDICTED: methylcytosine dioxygenase TET3 [Danio rerio]
Length = 1799
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/340 (55%), Positives = 247/340 (72%), Gaps = 9/340 (2%)
Query: 194 DPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEER 253
D +K +LD + +++ + P C C L + G YY HLG+ +P +R+ +E+R
Sbjct: 833 DTPTKNLLDTPGK---DVQPDFPICDCV-DQVLEKDEGPYYNHLGSGRDIPSVRQLMEDR 888
Query: 254 SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
G KG+A+R+EK++YTG+EGK++QGCP+AKWV+RR+S +EK+L +VK R GH C+ IV
Sbjct: 889 YGEKGEAVRIEKVVYTGREGKSSQGCPIAKWVLRRSSEKEKVLCVVKQRPGHHCANTVIV 948
Query: 314 VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
VVI+AWEGVP D +Y +T + KYG PT+RRC NE RTCACQG DP+TCGASFSF
Sbjct: 949 VVILAWEGVPRALGDKLYREVTETITKYGNPTSRRCGLNEDRTCACQGKDPETCGASFSF 1008
Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
GCSWSMY+NGCKYARSK RKFRL EE + + LAT ++PLYK LAP A++N
Sbjct: 1009 GCSWSMYFNGCKYARSKVPRKFRLQGEHPKEEDNLRDNFQALATHVAPLYKKLAPQAYSN 1068
Query: 432 QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS 489
QC E AS+CRLG K GRPFSG+TAC DFCAH+H+D HN++NGCTVV +LTK +R++
Sbjct: 1069 QCLHEDVASDCRLGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDNRTVG 1128
Query: 490 K-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
P+DEQLHVLPLY + +DEFG++E Q K+ TGAI+ L
Sbjct: 1129 TIPEDEQLHVLPLYKLATTDEFGSEENQRLKMQTGAIQVL 1168
>gi|432875799|ref|XP_004072913.1| PREDICTED: methylcytosine dioxygenase TET3-like [Oryzias latipes]
Length = 2014
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/340 (55%), Positives = 246/340 (72%), Gaps = 9/340 (2%)
Query: 194 DPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEER 253
D +K +LD + + + + P C C L + G YY HLG+ ++ +R +E R
Sbjct: 1022 DTPTKSLLDTPSK---DPQLDFPTCTCV-EQILEKDEGPYYNHLGSGPTVASIRTLMEAR 1077
Query: 254 SGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIV 313
G KG A+R+EK++YTGKEGK++ GCP+AKWVIRR S +EK+L +V+HR GH C A I+
Sbjct: 1078 FGEKGDAVRIEKVVYTGKEGKSSHGCPIAKWVIRRGSEKEKVLCLVRHRAGHHCENAVII 1137
Query: 314 VVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
++I+AWEGVP +D +Y +T+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSF
Sbjct: 1138 ILILAWEGVPKALADKLYREVTDTLTKYGNPTSRRCGLNDDRTCACQGKDPETCGASFSF 1197
Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
GCSWSMY+NGCKYARSK RKFRL EE+++ + LAT ++PLYK LAP A++N
Sbjct: 1198 GCSWSMYFNGCKYARSKMPRKFRLQGDHPEEEEKLRDNFQNLATEVAPLYKRLAPQAYSN 1257
Query: 432 QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS 489
QC E +AS+CRLG K GRPFSG+TAC DFCAH+H+D HN++NGCTVV +LTK +R +
Sbjct: 1258 QCLSEDKASDCRLGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDNRKVG 1317
Query: 490 K-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
P+DEQLHVLPLY + +DEFG+ EAQ K+ TGAI+ L
Sbjct: 1318 GIPEDEQLHVLPLYTVSHTDEFGSAEAQRIKMQTGAIQAL 1357
>gi|410901250|ref|XP_003964109.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
Length = 1134
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/322 (58%), Positives = 238/322 (73%), Gaps = 6/322 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+++P C+C + E G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 592 SDLPSCQCM-DQIIEKEEGPYYTHLGAGPSIAAVREMMENRYGAKGNAVRIEAVVYTGKE 650
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V++I+AWEG+ +DG+Y
Sbjct: 651 GKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVILILAWEGISRPVADGLYQ 710
Query: 333 ILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
LT L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 711 ELTTTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCKFARSKVP 770
Query: 393 RKFRLS--VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
RKFRL EE+++E + LAT ++PLYK LAP AF NQ + E +CRLG + GR
Sbjct: 771 RKFRLQGDYPEEEEKLETHLQGLATDLAPLYKRLAPEAFQNQVENEDGGGDCRLGQREGR 830
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAH+H+D HNMNNG TVV +LTK +R++ P+DEQLHVLPLY + D
Sbjct: 831 PFSGVTACVDFCAHAHKDTHNMNNGSTVVCTLTKEDNRAVRNVPEDEQLHVLPLYRISDR 890
Query: 508 DEFGNKEAQEEKVNTGAIENLN 529
DEFG E Q K+ +G ++ L+
Sbjct: 891 DEFGQVEGQWAKIRSGGLQVLS 912
>gi|296482779|tpg|DAA24894.1| TPA: hypothetical protein BOS_11388 [Bos taurus]
Length = 964
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/304 (61%), Positives = 235/304 (77%), Gaps = 5/304 (1%)
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
G YYTHLG+ ++ +R+ +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +
Sbjct: 8 GPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHT 67
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
LEEKLL +V+HR GH C A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC
Sbjct: 68 LEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCG 127
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEE 408
N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + +
Sbjct: 128 LNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRK 187
Query: 409 KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
LAT ++PLYK LAP A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D
Sbjct: 188 SFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKD 247
Query: 469 LHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAI 525
HN+ NGCTVV +LTK +R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI
Sbjct: 248 QHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAI 307
Query: 526 ENLN 529
+ L
Sbjct: 308 QVLT 311
>gi|355723845|gb|AES08024.1| tet oncoprotein family member 2 [Mustela putorius furo]
Length = 870
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/311 (59%), Positives = 236/311 (75%), Gaps = 12/311 (3%)
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGC-------PLAK 283
G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEGK++QGC P+AK
Sbjct: 8 GPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCGKSSQGCPIAK 67
Query: 284 WVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGL 343
WV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+ LT L KYG
Sbjct: 68 WVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGT 127
Query: 344 PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRS 401
T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK RKF+L
Sbjct: 128 LTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPK 187
Query: 402 EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DF
Sbjct: 188 EEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDF 247
Query: 462 CAHSHRDLHNMNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
CAH+HRDLHNM NG T+V +LT+ +R + KP+DEQLHVLPLY + D DEFG+ EAQE+
Sbjct: 248 CAHAHRDLHNMQNGSTLVCTLTREDNREIGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEK 307
Query: 519 KVNTGAIENLN 529
K GAI+ L+
Sbjct: 308 KKQNGAIQVLS 318
>gi|432951908|ref|XP_004084919.1| PREDICTED: methylcytosine dioxygenase TET1-like [Oryzias latipes]
Length = 1530
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/321 (58%), Positives = 240/321 (74%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
++P C+C + E G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKEG
Sbjct: 829 DLPSCQCV-DQIIEKEEGPYYTHLGAGPSVAAVREMMENRYGAKGNAIRVEVVVYTGKEG 887
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
+++QGCP+AKWVIRR S EEKLL +V+ R GH+C +A +V++I+AWEG+P +D +Y
Sbjct: 888 RSSQGCPIAKWVIRRGSEEEKLLCLVRQRPGHSCDSAVLVILILAWEGIPRPVADHLYRE 947
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT+ L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK+ARSK R
Sbjct: 948 LTDTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCKFARSKVPR 1007
Query: 394 KFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KFRL EQE IE + LA+ ++PLYK LAP AF NQ + E S+CRLG + GRP
Sbjct: 1008 KFRLHGDFPEQEEKIENNLQNLASDLAPLYKKLAPQAFQNQVEHEVAGSDCRLGREEGRP 1067
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+H+D NMNNG TVV +LTK +R++ P+DEQLHVLPLY + D+D
Sbjct: 1068 FSGVTACVDFCAHAHKDTSNMNNGSTVVCTLTKEDNRAVRNIPEDEQLHVLPLYKVSDTD 1127
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG E Q K+ +GA++ L+
Sbjct: 1128 EFGMVEGQWAKIQSGALQILS 1148
>gi|351702491|gb|EHB05410.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
Length = 2011
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/321 (57%), Positives = 237/321 (73%), Gaps = 7/321 (2%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
EVP C C + + G YYTHLGA S+ +R+ +E R G KGKA+R+EK++YTGKEG
Sbjct: 1332 EVPACNC-PDRGIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGKAIRIEKVVYTGKEG 1390
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWVIRR+S EEK+L +V+ R GH C TA IVV+I+ W+G+PL +D +Y
Sbjct: 1391 KSSQGCPVAKWVIRRSSEEEKVLCLVRQRPGHQCETAVIVVLIMLWDGIPLPMADRLYTE 1450
Query: 334 LTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
LT L Y G PT RRC NE RTC CQG+DP+ CGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1451 LTENLKSYSGHPTDRRCTLNENRTCTCQGIDPERCGASFSFGCSWSMYFNGCKFGRSPSP 1510
Query: 393 RKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K GR
Sbjct: 1511 RRFRIDPSSPLHEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVARECRLGRKEGR 1570
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P+DEQLHVLPLY + D+
Sbjct: 1571 PFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPEDEQLHVLPLYRLSDT 1630
Query: 508 DEFGNKEAQEEKVNTGAIENL 528
DEFG+KE E K+ +GA++ L
Sbjct: 1631 DEFGSKEGMEAKIQSGAVQVL 1651
>gi|334313524|ref|XP_003339916.1| PREDICTED: methylcytosine dioxygenase TET3-like [Monodelphis
domestica]
Length = 1614
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/345 (55%), Positives = 247/345 (71%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C+C + + G YYTHLGA S+ +R+
Sbjct: 617 LKYLDTPTKNLLDTPSK---RAQAEFPVCECV-EQIVEKDEGPYYTHLGAGPSVAAIREL 672
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C
Sbjct: 673 MEDRYGEKGKAIRIEKVVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQ 732
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A I+++I+ WEG+ D +Y LT L YG PTTRRC N+ RTCACQG DP TCGA
Sbjct: 733 AVIIILILVWEGISSELGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGA 792
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHL--LATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSK RKFRL+ + E+E + H LAT ++PLYK LAP
Sbjct: 793 SFSFGCSWSMYFNGCKYARSKFPRKFRLTGDNPEEEENLRKHFQNLATQVAPLYKKLAPQ 852
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ + E EA +CRLG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 853 AYQNQVKDEEEAIDCRLGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 912
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY +D +DEFG++E Q K+ +GAI+ L
Sbjct: 913 RCVGQIPEDEQLHVLPLYKIDSTDEFGSEENQRAKMASGAIQVLT 957
>gi|297686810|ref|XP_002820931.1| PREDICTED: methylcytosine dioxygenase TET1 [Pongo abelii]
Length = 2136
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737
>gi|355782886|gb|EHH64807.1| hypothetical protein EGM_18120, partial [Macaca fascicularis]
Length = 1479
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 760 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 818
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 819 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 878
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 879 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 938
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 939 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 998
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 999 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1058
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1059 TDEFGSKEGMEAKIKSGAIEVL 1080
>gi|327287144|ref|XP_003228289.1| PREDICTED: methylcytosine dioxygenase TET3-like [Anolis carolinensis]
Length = 1795
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 194/371 (52%), Positives = 256/371 (69%), Gaps = 14/371 (3%)
Query: 166 CQGMRTQDEMLEPKEPNNN---EEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFA 222
+G T++E+ P P + E P +K D +K +LD + + E P C C
Sbjct: 781 VEGTPTKEEVPPPLTPTLSGFLESP--LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV- 834
Query: 223 SDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLA 282
+ + G YYTHLG+ ++ +R+ +EER G KG A+R+EK++YTGKEGK+++GCP+A
Sbjct: 835 EQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGDAIRIEKVIYTGKEGKSSRGCPIA 894
Query: 283 KWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYG 342
KWVIRR +LEEKLL +V+HR GH C A I+++I+AWEG+P D +Y L++ L KYG
Sbjct: 895 KWVIRRHNLEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRTLGDTLYQELSDILTKYG 954
Query: 343 LPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVR-- 400
PTTRRC N+ RTCACQG DP++CGASFSFGCSWSMY+NGCKYARSK RKFRL
Sbjct: 955 NPTTRRCGLNDDRTCACQGKDPNSCGASFSFGCSWSMYFNGCKYARSKMPRKFRLQGYNP 1014
Query: 401 SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFD 460
+EE + + LAT ++PLY+ LAP A+ NQ E A +CRLG K GRPFSGVTAC D
Sbjct: 1015 NEEDVLRKNFQDLATEVAPLYQRLAPQAYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMD 1074
Query: 461 FCAHSHRDLHNMNNGCTVVVSLTKHRSLSK---PDDEQLHVLPLYIMDDSDEFGNKEAQE 517
FCAH+H+D HN+ NGCTVV +LTK + + P+DEQLHVLPLY M +DEFG++E Q
Sbjct: 1075 FCAHAHKDQHNLYNGCTVVCTLTKEDNRTTGQVPEDEQLHVLPLYKMSPTDEFGSEERQA 1134
Query: 518 EKVNTGAIENL 528
K+ +GAI+ L
Sbjct: 1135 AKMGSGAIQVL 1145
>gi|156139122|ref|NP_085128.2| methylcytosine dioxygenase TET1 [Homo sapiens]
gi|115502139|sp|Q8NFU7.2|TET1_HUMAN RecName: Full=Methylcytosine dioxygenase TET1; AltName:
Full=CXXC-type zinc finger protein 6; AltName:
Full=Leukemia-associated protein with a CXXC domain;
AltName: Full=Ten-eleven translocation 1 gene protein
gi|225000490|gb|AAI72365.1| Tet oncogene 1 [synthetic construct]
Length = 2136
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737
>gi|119574684|gb|EAW54299.1| CXXC finger 6 [Homo sapiens]
Length = 2150
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1431 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1489
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1490 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1549
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1550 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1609
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1610 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1669
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1670 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1729
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1730 TDEFGSKEGMEAKIKSGAIEVL 1751
>gi|22001093|gb|AAM88301.1|AF430147_1 leukemia-associated protein with a CXXC domain [Homo sapiens]
Length = 2136
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737
>gi|402880644|ref|XP_003903908.1| PREDICTED: methylcytosine dioxygenase TET1 [Papio anubis]
Length = 2132
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1413 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1471
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1472 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1531
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1532 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1591
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1592 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1651
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1652 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1711
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1712 TDEFGSKEGMEAKIKSGAIEVL 1733
>gi|296220536|ref|XP_002756350.1| PREDICTED: methylcytosine dioxygenase TET1 [Callithrix jacchus]
Length = 2134
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1415 SELPTCNCI-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGEKGNAIRIEIVVYTGKE 1473
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1474 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1533
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENIARECRLGSKEG 1653
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1654 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1713
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1714 TDEFGSKEGMEAKIQSGAIEVL 1735
>gi|397489915|ref|XP_003846089.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1 [Pan
paniscus]
Length = 2136
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737
>gi|410043931|ref|XP_507822.3| PREDICTED: methylcytosine dioxygenase TET1 [Pan troglodytes]
Length = 2220
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1501 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1559
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1560 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1619
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1620 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1679
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1680 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1739
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1740 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1799
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1800 TDEFGSKEGMEAKIKSGAIEVL 1821
>gi|426364946|ref|XP_004049552.1| PREDICTED: methylcytosine dioxygenase TET1 [Gorilla gorilla gorilla]
Length = 2136
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1716 TDEFGSREGMEAKIKSGAIEVL 1737
>gi|348575712|ref|XP_003473632.1| PREDICTED: methylcytosine dioxygenase TET1-like [Cavia porcellus]
Length = 2168
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/326 (56%), Positives = 237/326 (72%), Gaps = 15/326 (4%)
Query: 213 TEVPDCKC----FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILY 268
TEVP C C DK G YYTHLGA S+ +R+ +E R G+KGKA+R+EK++Y
Sbjct: 1398 TEVPSCDCPDRGIQKDK-----GPYYTHLGAGPSVAAVREIMETRCGHKGKAVRIEKLVY 1452
Query: 269 TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
TGKEGK++QGCP+AK VIRR+S EE++L +V+ R GH C TA +V++IV W+G+P +D
Sbjct: 1453 TGKEGKSSQGCPVAKKVIRRSSEEEEVLCLVRERPGHQCQTAVMVMLIVVWDGIPRPMAD 1512
Query: 329 GVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYA 387
+Y LT L Y G PT RRC NE RTC CQG DP+TCGASFSFGCSWSMY+NGCK+
Sbjct: 1513 RLYTELTESLKSYNGHPTDRRCTLNENRTCTCQGTDPETCGASFSFGCSWSMYFNGCKFG 1572
Query: 388 RSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 445
RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG
Sbjct: 1573 RSPSPRRFRIDPSSPLNEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVARECRLG 1632
Query: 446 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLY 502
K GRPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P+DEQLHVLPLY
Sbjct: 1633 RKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVVPEDEQLHVLPLY 1692
Query: 503 IMDDSDEFGNKEAQEEKVNTGAIENL 528
+ D+DEFG+KE E K+ +GA++ L
Sbjct: 1693 KLSDTDEFGSKEGMEAKIRSGAVQVL 1718
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 491 PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
P DEQLHVLPLY + D+DEFG+KE E K+ +GA++ L
Sbjct: 1794 PCDEQLHVLPLYKLSDTDEFGSKEGMEAKIRSGAVQVL 1831
>gi|73953303|ref|XP_536371.2| PREDICTED: methylcytosine dioxygenase TET1 [Canis lupus familiaris]
Length = 2137
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 234/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+++P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1418 SDLPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1476
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1477 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1536
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1537 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1596
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1597 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1656
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1657 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1716
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1717 TDEFGSREGMEAKIKSGAIEVL 1738
>gi|281346965|gb|EFB22549.1| hypothetical protein PANDA_001619 [Ailuropoda melanoleuca]
Length = 2136
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 233/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1416 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1474
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1475 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1534
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1535 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1594
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1595 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1654
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1655 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1714
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+ E E K+ +GAIE L
Sbjct: 1715 TDEFGSSEGMEAKIQSGAIEVL 1736
>gi|338716538|ref|XP_003363468.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
[Equus caballus]
Length = 1811
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1091 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1149
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1150 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1209
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1210 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1269
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1270 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1329
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1330 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1389
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1390 TDEFGSREGMEAKIKSGAIEVL 1411
>gi|301755888|ref|XP_002913781.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Ailuropoda
melanoleuca]
Length = 2143
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 233/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1423 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1481
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1482 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1541
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1542 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1601
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1602 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1661
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1662 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1721
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+ E E K+ +GAIE L
Sbjct: 1722 TDEFGSSEGMEAKIQSGAIEVL 1743
>gi|260781795|ref|XP_002585985.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
gi|229271061|gb|EEN41996.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
Length = 326
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/323 (57%), Positives = 241/323 (74%), Gaps = 10/323 (3%)
Query: 214 EVPDCKC--FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
E+P C C F S+K G YYTHLG ++ +R+ +E+R G GKALR+EKI+YTGK
Sbjct: 1 ELPTCNCVDFVSEKAE---GPYYTHLGTGPTIQAIRELMEKRFGQSGKALRIEKIIYTGK 57
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++QGCP+AKW++RR+S EEK+L +V+HR GH C++++I++ IVAWEG+ ++D +Y
Sbjct: 58 EGKSSQGCPIAKWIVRRSSEEEKVLTLVRHRPGHRCNSSYIIICIVAWEGIQRARADELY 117
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
L+ L+K GLPTTRRC N+ +TCACQG+D + CGASFSFGCSWSMYYNGCK+ARS+
Sbjct: 118 DYLSGTLSKAGLPTTRRCGVNDTKTCACQGVDDNNCGASFSFGCSWSMYYNGCKFARSRV 177
Query: 392 VRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG-- 449
+KF+L SEE IE+ LA + P+Y+ LAP AF NQ ++ AS+CRLGF P
Sbjct: 178 PKKFKLEDPSEEAIIEDHFQRLAGEVGPVYEQLAPDAFRNQTEYSEVASDCRLGFGPDNT 237
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT--KHRSLS-KPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+HRD HNMNNG T+V +LT ++R L P+DEQLHVLPLY M
Sbjct: 238 RPFSGVTACVDFCAHAHRDQHNMNNGSTIVCTLTCPENRKLGPPPEDEQLHVLPLYKMAP 297
Query: 507 SDEFGNKEAQEEKVNTGAIENLN 529
+DEF ++E QEEKV TGA+E L
Sbjct: 298 TDEFDSEEGQEEKVRTGALEMLT 320
>gi|410975237|ref|XP_003994040.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1,
partial [Felis catus]
Length = 2153
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 233/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+++P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1432 SDLPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAIREIMENRYGQKGNAIRIEIVVYTGKE 1490
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1491 GKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1550
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1551 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1610
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1611 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRLGSKEG 1670
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1671 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1730
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+ E E K+ +GAIE L
Sbjct: 1731 TDEFGSSEGMEAKIKSGAIEVL 1752
>gi|12697897|dbj|BAB21767.1| KIAA1676 protein [Homo sapiens]
Length = 735
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 16 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 74
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 75 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 134
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 135 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 194
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 195 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 254
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 255 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 314
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 315 TDEFGSKEGMEAKIKSGAIEVL 336
>gi|119626584|gb|EAX06179.1| hCG21336 [Homo sapiens]
Length = 839
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+E+++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC
Sbjct: 1 MEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEA 60
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+ WEG+PL+ +D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGA
Sbjct: 61 AVIVILILVWEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGA 120
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMYYNGCK+ARSK RKF+L EE+++E + L+T ++P YK LAP
Sbjct: 121 SFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPD 180
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRS 487
A+ NQ ++E A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +
Sbjct: 181 AYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDN 240
Query: 488 L---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
KP+DEQLHVLPLY + D DEFG+ EAQEEK +GAI+ L+
Sbjct: 241 REFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEEKKRSGAIQVLS 285
>gi|395508956|ref|XP_003758773.1| PREDICTED: methylcytosine dioxygenase TET3 [Sarcophilus harrisii]
Length = 1685
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/345 (55%), Positives = 245/345 (71%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C+C + + G YYTHLGA S+ +R+
Sbjct: 667 LKYLDTPTKNLLDTPSK---RAQAEFPVCECV-EQIVEKDEGPYYTHLGAGPSVAAIREL 722
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C
Sbjct: 723 MEDRYGEKGKAIRIEKVVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQ 782
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A I+++I+ WEG+ D +Y LT L YG PTTRRC N+ RTCACQG DP TCGA
Sbjct: 783 AVIIILIMVWEGIGPELGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGA 842
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSK RKFRLS + EE+ + + LAT ++PLYK LAP
Sbjct: 843 SFSFGCSWSMYFNGCKYARSKYPRKFRLSGDNPVEEENLRKHFQNLATQVAPLYKKLAPQ 902
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E EA +CRLG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 903 AYQNQVNNEEEAIDCRLGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 962
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY + ++DEFG++E Q K+ GAI+ L
Sbjct: 963 RLVGQIPEDEQLHVLPLYKIANTDEFGSEENQRAKMANGAIQVLT 1007
>gi|344275087|ref|XP_003409345.1| PREDICTED: methylcytosine dioxygenase TET1 [Loxodonta africana]
Length = 2139
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+++P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1419 SDLPSCSCV-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1477
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1478 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1537
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1538 ELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1597
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1598 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPIAYQNQVEYEHVARECRLGSKEG 1657
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1658 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1717
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1718 TDEFGSREGLEAKIKSGAIEVL 1739
>gi|316990462|gb|ADU77105.1| putative methylcytosine dioxygenase isoform 1 [Xenopus laevis]
Length = 1924
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 244/337 (72%), Gaps = 9/337 (2%)
Query: 197 SKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGY 256
+K ++D ++ + E P C C E G YYTHLG+ ++ +R+ +E+R G
Sbjct: 963 TKSLIDTPAKM---AQAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEDRFGE 1018
Query: 257 KGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVI 316
KG+A+R+EK++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I
Sbjct: 1019 KGEAIRIEKVIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILI 1078
Query: 317 VAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCS 376
+AWEG+P D +Y+ +T + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCS
Sbjct: 1079 MAWEGIPRALGDSLYSDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCS 1138
Query: 377 WSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
WSMY+NGCKYARSKT RKFRL + EE+ + + LAT ++P+Y+ LAP ++ NQ
Sbjct: 1139 WSMYFNGCKYARSKTPRKFRLIGDNPKEEEFLNDNFQDLATKVAPVYQMLAPQSYENQVN 1198
Query: 435 FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R++ + P
Sbjct: 1199 NEEVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRTIGRIP 1258
Query: 492 DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
+DEQLHVLPLY + +DEFG+++ Q EK+ G I+ L
Sbjct: 1259 EDEQLHVLPLYKVSSTDEFGSEDGQAEKIRKGGIQVL 1295
>gi|426256086|ref|XP_004021676.1| PREDICTED: methylcytosine dioxygenase TET1, partial [Ovis aries]
Length = 2146
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 177/319 (55%), Positives = 230/319 (72%), Gaps = 7/319 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1427 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1485
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y+
Sbjct: 1486 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1545
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1546 QLTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1605
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ E A ECRLG K G
Sbjct: 1606 PRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIARECRLGKKEG 1665
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ + S P DEQLHVLPLY + D
Sbjct: 1666 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSD 1725
Query: 507 SDEFGNKEAQEEKVNTGAI 525
+DEFG++E E K+ +GAI
Sbjct: 1726 TDEFGSREGMEAKIKSGAI 1744
>gi|148237918|ref|NP_001090656.1| tet methylcytosine dioxygenase 3 [Xenopus (Silurana) tropicalis]
gi|117558065|gb|AAI27290.1| LOC100036628 protein [Xenopus (Silurana) tropicalis]
Length = 1901
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/322 (55%), Positives = 233/322 (72%), Gaps = 6/322 (1%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
+ E P C C E G YYTHLG+ ++ +R+ +EER G KG A+R+EK++YTGK
Sbjct: 951 QAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEERFGQKGDAIRIEKVIYTGK 1009
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I+AWEG+P + D +Y
Sbjct: 1010 EGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILIMAWEGIPRSLGDSLY 1069
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
+T + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYARSKT
Sbjct: 1070 NDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKT 1129
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL + EE +++ LAT ++P+YK LAP A+ NQ E A +CRLG K G
Sbjct: 1130 PRKFRLIGENPKEEDGLKDNFQNLATKVAPVYKMLAPQAYQNQVNNEDIAIDCRLGLKEG 1189
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + + +DEQLHVLPLY +
Sbjct: 1190 RPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRMIGRVAEDEQLHVLPLYKVST 1249
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E Q EK+ G I L
Sbjct: 1250 TDEFGSEEGQLEKIKKGGIHVL 1271
>gi|316990464|gb|ADU77106.1| putative methylcytosine dioxygenase isoform 2 [Xenopus laevis]
Length = 1915
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/338 (53%), Positives = 243/338 (71%), Gaps = 9/338 (2%)
Query: 197 SKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGY 256
+K ++D ++ + E P C C E G YYTHLG+ ++ +R+ +EER G
Sbjct: 956 TKSLIDTPAKM---AQAEFPTCDCVEQINEKDE-GPYYTHLGSGPTVASIRELMEERFGE 1011
Query: 257 KGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVI 316
KG+A+R+EK++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I
Sbjct: 1012 KGEAIRIEKVIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILI 1071
Query: 317 VAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCS 376
+AWEG+P D +Y ++ + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCS
Sbjct: 1072 MAWEGIPRALGDSLYDDISGTITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCS 1131
Query: 377 WSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
WSMY+NGCKYARSKT RKFRL + EE+ +++ LAT ++P+YK LAP A+ NQ
Sbjct: 1132 WSMYFNGCKYARSKTPRKFRLIGDNPKEEEFLKDSFQDLATKVAPVYKMLAPQAYQNQAN 1191
Query: 435 FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
E A +CRLG + GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + K
Sbjct: 1192 NEDVAIDCRLGLEEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRMIGKIA 1251
Query: 492 DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
+DEQLHVLPLY + +DEFG++E Q EK+ G I+ L+
Sbjct: 1252 EDEQLHVLPLYKVSTTDEFGSEERQLEKIRKGGIQVLS 1289
>gi|293345707|ref|XP_001077411.2| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
gi|293357583|ref|XP_227694.5| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
Length = 1920
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/335 (54%), Positives = 235/335 (70%), Gaps = 5/335 (1%)
Query: 200 MLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGK 259
+L + ++ T V D C A + G YYTHLGA ++ +R +EER G KGK
Sbjct: 1041 VLTDVSESPSDSDTPVEDISCEACKNAEKDEGPYYTHLGAGPNVAAIRTIMEERFGEKGK 1100
Query: 260 ALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAW 319
A+R+E+++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA IV+VI+ W
Sbjct: 1101 AIRIERVIYTGKEGKSSQGCPIAKWVYRRSSTEEKLLCLVRVRAKHTCDTAVIVIVILLW 1160
Query: 320 EGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSM 379
+G+P + +Y+ LT L+ G+ T RRCA NE R C CQG +P+TCGASFS+GCSWSM
Sbjct: 1161 DGIPKPLASELYSELTEILSNRGICTNRRCAQNENRNCCCQGENPETCGASFSYGCSWSM 1220
Query: 380 YYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFER 437
YYNGCK+ARSK RKFRL EE+++ + LAT I+P+YK LAP A+ NQ +FE
Sbjct: 1221 YYNGCKFARSKNPRKFRLHGDEPKEEEKLGSHLQNLATVIAPIYKKLAPDAYRNQVEFEH 1280
Query: 438 EASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDE 494
A ECRLG K GRPFSGVTAC DF AH+HRD NM NG TVVV+LT+ + +P+DE
Sbjct: 1281 RAIECRLGLKEGRPFSGVTACLDFSAHAHRDQQNMANGSTVVVTLTREDNREVGGQPEDE 1340
Query: 495 QLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
QLHVLPLY + DEFG+ E QEEK+ G+I+ L+
Sbjct: 1341 QLHVLPLYTIATEDEFGSTEGQEEKILQGSIQVLH 1375
>gi|351698810|gb|EHB01729.1| Putative methylcytosine dioxygenase TET3 [Heterocephalus glaber]
Length = 1721
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/345 (54%), Positives = 241/345 (69%), Gaps = 29/345 (8%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + +R+
Sbjct: 750 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCVVAS---------------------IREL 785
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 786 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 845
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 846 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 905
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRLS + EE+ + + LAT ++PLYK LAP
Sbjct: 906 SFSFGCSWSMYFNGCKYARSKTPRKFRLSGDNPKEEEVLRKSFQDLATEVAPLYKQLAPQ 965
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 966 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1025
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 1026 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1070
>gi|444723357|gb|ELW64014.1| Methylcytosine dioxygenase TET3 [Tupaia chinensis]
Length = 2326
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/386 (50%), Positives = 250/386 (64%), Gaps = 50/386 (12%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 1245 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 1300
Query: 250 IEERS-----------------------------------------GYKGKALRMEKILY 268
+EER G KGKA+R+EK++Y
Sbjct: 1301 MEERGDDDESAHVRECRESDRNWCTEHTVLAVSTEADSRLPHIHMYGEKGKAIRIEKVIY 1360
Query: 269 TGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSD 328
TGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P + D
Sbjct: 1361 TGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGD 1420
Query: 329 GVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYAR 388
+Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCKYAR
Sbjct: 1421 TLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYAR 1480
Query: 389 SKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
SKT RKFRL + EE+ + + LAT ++PLYK LAP A+ NQ E A +CRLG
Sbjct: 1481 SKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGL 1540
Query: 447 KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYI 503
K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + K P+DEQLHVLPLY
Sbjct: 1541 KEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYK 1600
Query: 504 MDDSDEFGNKEAQEEKVNTGAIENLN 529
M +DEFG++E Q KV +GAI+ L
Sbjct: 1601 MASTDEFGSEENQNAKVGSGAIQVLT 1626
>gi|431904167|gb|ELK09589.1| Methylcytosine dioxygenase TET1 [Pteropus alecto]
Length = 2135
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 232/322 (72%), Gaps = 9/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W G+PL D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRERTGHHCPTAVMVVLIMVWAGLPL--PDKLYT 1533
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATQLAPVYKQYAPVAYQNQVEYEHVARECRLGSKEG 1653
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +R L P DEQLHVLPLY + D
Sbjct: 1654 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRLLGVIPQDEQLHVLPLYKLSD 1713
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1714 TDEFGSREGMEAKIRSGAIEVL 1735
>gi|281346571|gb|EFB22155.1| hypothetical protein PANDA_016408 [Ailuropoda melanoleuca]
Length = 830
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 5/282 (1%)
Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
R G KGKA+R+E+++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A I
Sbjct: 1 RFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVI 60
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
V++I+ WEG+PL+ +D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFS
Sbjct: 61 VILILVWEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFS 120
Query: 373 FGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFT 430
FGCSWSMYYNGCK+ARSK RKF+L EE+++E + L+T ++P YK LAP A+
Sbjct: 121 FGCSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYN 180
Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL 488
NQ ++E A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T+V +LT+ +R +
Sbjct: 181 NQIEYEHRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREI 240
Query: 489 -SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
KP+DEQLHVLPLY + D DEFG+ EAQE+K GAI+ L+
Sbjct: 241 GGKPEDEQLHVLPLYKVSDVDEFGSVEAQEKKKQNGAIQVLS 282
>gi|359069958|ref|XP_002691249.2| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
Length = 938
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/285 (62%), Positives = 222/285 (77%), Gaps = 5/285 (1%)
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 1 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 60
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 61 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 120
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 121 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 180
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 181 AYQNQVTNEEIAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 241 RCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVLT 285
>gi|241896976|ref|NP_081660.1| methylcytosine dioxygenase TET1 isoform 2 [Mus musculus]
gi|239977645|sp|Q3URK3.2|TET1_MOUSE RecName: Full=Methylcytosine dioxygenase TET1; AltName:
Full=CXXC-type zinc finger protein 6; AltName:
Full=Ten-eleven translocation 1 gene protein homolog
Length = 2007
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 242/350 (69%), Gaps = 12/350 (3%)
Query: 187 PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
P T A+ + ++D + + N+ E C C + E G YYTHLGA
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392
Query: 242 SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
S+ +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452
Query: 302 RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
R H CSTA IVV+I+ WEG+P +D +Y LT L Y G PT RRC N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512
Query: 361 GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTIS 418
G+DP TCGASFSFGCSWSMY+NGCK+ RS+ RKFRL+ E+++E+ + LAT ++
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLA 1572
Query: 419 PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 1573 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 1632
Query: 479 VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
V +L + R + P+DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 1633 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 1682
>gi|18490118|gb|AAH22243.1| TET3 protein [Homo sapiens]
gi|62702130|gb|AAX93057.1| unknown [Homo sapiens]
gi|168272980|dbj|BAG10329.1| KIAA0401 protein [synthetic construct]
gi|313882564|gb|ADR82768.1| Unknown protein [synthetic construct]
Length = 937
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 1 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 60
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 61 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 120
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 121 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 180
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 181 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 241 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 285
>gi|392338377|ref|XP_003753514.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 2 [Rattus
norvegicus]
Length = 2008
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/319 (55%), Positives = 229/319 (71%), Gaps = 6/319 (1%)
Query: 214 EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
E C+C D P + G YYTHLGA S+ +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1364 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1423
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P +D +Y
Sbjct: 1424 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1483
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS
Sbjct: 1484 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1543
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL+ E+++EE + LAT ++P+YK +AP A+ NQ ++E A +CRLG + G
Sbjct: 1544 PRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAPVAYQNQVEYEDIAGDCRLGNEEG 1603
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMDDS 507
RPFSGVT C DFCAHSH+D+HNMNNG TVV +L + R S P DEQLHVLPLY + D+
Sbjct: 1604 RPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTLIREDGRDRSVPGDEQLHVLPLYRLADT 1663
Query: 508 DEFGNKEAQEEKVNTGAIE 526
DEFG+ E + K+ +GAI+
Sbjct: 1664 DEFGSVEGMKAKIQSGAIQ 1682
>gi|157057152|ref|NP_001035490.2| methylcytosine dioxygenase TET2 [Mus musculus]
gi|239938840|sp|Q4JK59.3|TET2_MOUSE RecName: Full=Methylcytosine dioxygenase TET2; AltName: Full=Protein
Ayu17-449
Length = 1912
Score = 367 bits (942), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1179
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE+ + +
Sbjct: 1180 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 1239
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD
Sbjct: 1240 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 1299
Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE
Sbjct: 1300 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 1359
Query: 528 L 528
L
Sbjct: 1360 L 1360
>gi|148700127|gb|EDL32074.1| mCG11334 [Mus musculus]
Length = 630
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 173/303 (57%), Positives = 226/303 (74%), Gaps = 5/303 (1%)
Query: 229 EPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRR 288
E G YYTHLGA S+ +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR
Sbjct: 3 EKGPYYTHLGAGPSVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRR 62
Query: 289 ASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTR 347
+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P +D +Y LT L Y G PT R
Sbjct: 63 SGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDR 122
Query: 348 RCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQE 405
RC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGCK+ RS+ RKFRL+ E++
Sbjct: 123 RCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQ 182
Query: 406 IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHS 465
+E+ + LAT ++PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHS
Sbjct: 183 LEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHS 242
Query: 466 HRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTG 523
H+D+HNM+NG TVV +L + R + P+DEQLHVLPLY + D+DEFG+ E + K+ +G
Sbjct: 243 HKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSG 302
Query: 524 AIE 526
AI+
Sbjct: 303 AIQ 305
>gi|359718960|ref|NP_001240786.1| methylcytosine dioxygenase TET1 isoform 1 [Mus musculus]
Length = 2039
Score = 364 bits (935), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 182/382 (47%), Positives = 244/382 (63%), Gaps = 44/382 (11%)
Query: 187 PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
P T A+ + ++D + + N+ E C C + E G YYTHLGA
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392
Query: 242 SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
S+ +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452
Query: 302 RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
R H CSTA IVV+I+ WEG+P +D +Y LT L Y G PT RRC N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512
Query: 361 GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS---------------------- 398
G+DP TCGASFSFGCSWSMY+NGCK+ RS+ RKFRL+
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDV 1572
Query: 399 ------------VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
+ EE+++E+ + LAT ++PLYK +AP A+ NQ ++E A +CRLG
Sbjct: 1573 KTGWIIPDRKTLISREEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGN 1632
Query: 447 KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIM 504
+ GRPFSGVT C DFCAHSH+D+HNM+NG TVV +L + R + P+DEQLHVLPLY +
Sbjct: 1633 EEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRL 1692
Query: 505 DDSDEFGNKEAQEEKVNTGAIE 526
D+DEFG+ E + K+ +GAI+
Sbjct: 1693 ADTDEFGSVEGMKAKIKSGAIQ 1714
>gi|262225296|gb|ACY38291.1| tet oncogene 1 [Mus musculus]
Length = 2039
Score = 364 bits (935), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 182/382 (47%), Positives = 244/382 (63%), Gaps = 44/382 (11%)
Query: 187 PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
P T A+ + ++D + + N+ E C C + E G YYTHLGA
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392
Query: 242 SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
S+ +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452
Query: 302 RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
R H CSTA IVV+I+ WEG+P +D +Y LT L Y G PT RRC N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512
Query: 361 GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS---------------------- 398
G+DP TCGASFSFGCSWSMY+NGCK+ RS+ RKFRL+
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDV 1572
Query: 399 ------------VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGF 446
+ EE+++E+ + LAT ++PLYK +AP A+ NQ ++E A +CRLG
Sbjct: 1573 KTGWIIPDRKTLISREEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGN 1632
Query: 447 KPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIM 504
+ GRPFSGVT C DFCAHSH+D+HNM+NG TVV +L + R + P+DEQLHVLPLY +
Sbjct: 1633 EEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTLIRADGRDTNCPEDEQLHVLPLYRL 1692
Query: 505 DDSDEFGNKEAQEEKVNTGAIE 526
D+DEFG+ E + K+ +GAI+
Sbjct: 1693 ADTDEFGSVEGMKAKIKSGAIQ 1714
>gi|74140016|dbj|BAE31842.1| unnamed protein product [Mus musculus]
gi|74151946|dbj|BAE32012.1| unnamed protein product [Mus musculus]
Length = 991
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE+ + +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378
Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438
Query: 528 L 528
L
Sbjct: 439 L 439
>gi|392338375|ref|XP_003753513.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 1 [Rattus
norvegicus]
Length = 2040
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 178/351 (50%), Positives = 231/351 (65%), Gaps = 38/351 (10%)
Query: 214 EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
E C+C D P + G YYTHLGA S+ +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1364 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1423
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P +D +Y
Sbjct: 1424 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1483
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS
Sbjct: 1484 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1543
Query: 392 VRKFRLS----------------------------------VRSEEQEIEEKMHLLATTI 417
RKFRL+ + EE+++EE + LAT +
Sbjct: 1544 PRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENLQDLATVL 1603
Query: 418 SPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCT 477
+P+YK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG T
Sbjct: 1604 APVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGST 1663
Query: 478 VVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
VV +L + R S P DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 1664 VVCTLIREDGRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 1714
>gi|392355330|ref|XP_003752007.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1
[Rattus norvegicus]
Length = 2038
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 178/351 (50%), Positives = 231/351 (65%), Gaps = 38/351 (10%)
Query: 214 EVPDCKCFASDKLP-PEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
E C+C D P + G YYTHLGA S+ +R+ +E R G KGKA+R+EKI++TGKE
Sbjct: 1362 EAATCQCARPDGGPQKDKGPYYTHLGAGPSVAAVRELMETRYGQKGKAIRIEKIVFTGKE 1421
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P +D +Y
Sbjct: 1422 GKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRLMADRLYK 1481
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGCK+ RS
Sbjct: 1482 ELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGCKFGRSAN 1541
Query: 392 VRKFRLS----------------------------------VRSEEQEIEEKMHLLATTI 417
RKFRL+ + EE+++EE + LAT +
Sbjct: 1542 PRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENLQDLATVL 1601
Query: 418 SPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCT 477
+P+YK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG T
Sbjct: 1602 APVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGST 1661
Query: 478 VVVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
VV +L + R S P DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 1662 VVCTLIREDGRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 1712
>gi|262225298|gb|ACY38292.1| tet oncogene 2 [Mus musculus]
Length = 1921
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 174/308 (56%), Positives = 222/308 (72%), Gaps = 12/308 (3%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1062 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1121
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1122 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1181
Query: 353 E-------PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEE 403
E PR C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE
Sbjct: 1182 ETKKKQSPPRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEE 1241
Query: 404 QEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCA 463
+ + + LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF A
Sbjct: 1242 ERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSA 1301
Query: 464 HSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKV 520
HSHRD NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+
Sbjct: 1302 HSHRDQQNMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKI 1361
Query: 521 NTGAIENL 528
G+IE L
Sbjct: 1362 RMGSIEVL 1369
>gi|74191515|dbj|BAE30334.1| unnamed protein product [Mus musculus]
Length = 992
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE+ + +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378
Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438
Query: 528 L 528
L
Sbjct: 439 L 439
>gi|74142256|dbj|BAE31892.1| unnamed protein product [Mus musculus]
gi|74214512|dbj|BAE31106.1| unnamed protein product [Mus musculus]
Length = 991
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 139 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 198
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 199 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 258
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE+ + +
Sbjct: 259 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 318
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD
Sbjct: 319 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 378
Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE
Sbjct: 379 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 438
Query: 528 L 528
L
Sbjct: 439 L 439
>gi|443702254|gb|ELU00383.1| hypothetical protein CAPTEDRAFT_102094, partial [Capitella teleta]
Length = 316
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 218/301 (72%), Gaps = 2/301 (0%)
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
G YYTHLGA ++P +R+ +E+R G ALR+EK++YTG+EGK+ QGCP+AKW++RR+S
Sbjct: 12 GPYYTHLGAGPTVPAIRELMEKRMNITGDALRIEKVIYTGREGKSPQGCPVAKWILRRSS 71
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
+EK ++IV+ R GHTC TA +V+ IV W+G+P Q+ G+Y L + L G T RRC
Sbjct: 72 KDEKCMVIVRQRPGHTCPTAIMVIAIVVWDGIPETQATGLYDYLRHTLPDNGHETERRCG 131
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS--KTVRKFRLSVRSEEQEIEE 408
TNE RTCACQG GASF+FGCSWSMY+NGCKYA+S V +FRL EE +E
Sbjct: 132 TNEKRTCACQGWSDAVGGASFTFGCSWSMYFNGCKYAKSSDSKVHRFRLRDPMEEPILER 191
Query: 409 KMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRD 468
+ LAT I PLYK +AP ++ N E EA++CRLG++ GRPF GVTA DFCAH+H+D
Sbjct: 192 HLQTLATDIGPLYKMVAPDSYANMTALEDEATDCRLGYRRGRPFGGVTAVVDFCAHAHKD 251
Query: 469 LHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
HNMNNGCTVV +LTKHR L KP+DEQLHVLPL +++ DEFG+ + Q K+ +GAIE L
Sbjct: 252 QHNMNNGCTVVATLTKHRGLEKPEDEQLHVLPLCVLESKDEFGSVDNQFAKIRSGAIEWL 311
Query: 529 N 529
Sbjct: 312 T 312
>gi|380803039|gb|AFE73395.1| methylcytosine dioxygenase TET1, partial [Macaca mulatta]
Length = 680
Score = 360 bits (925), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 169/288 (58%), Positives = 216/288 (75%), Gaps = 6/288 (2%)
Query: 247 RKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHT 306
R+ +E R G KG A+R+E ++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH
Sbjct: 1 REIMENRYGQKGNAIRIEIVVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHH 60
Query: 307 CSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPD 365
C TA +VV+I+ W+G+PL +D +Y LT L Y G PT RRC NE RTC CQG+DP+
Sbjct: 61 CPTAVMVVLIMVWDGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPE 120
Query: 366 TCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKA 423
TCGASFSFGCSWSMY+NGCK+ RS + R+FR+ S E+ +E+ + LAT ++P+YK
Sbjct: 121 TCGASFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQ 180
Query: 424 LAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT 483
AP A+ NQ ++E A ECRLG K GRPFSGVTAC DFCAH HRD+HNMNNG TVV +LT
Sbjct: 181 YAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLT 240
Query: 484 K--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
+ +RSL P DEQLHVLPLY + D+DEFG+KE E K+ +GAIE L
Sbjct: 241 REDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGAIEVL 288
>gi|345322870|ref|XP_003430647.1| PREDICTED: methylcytosine dioxygenase TET2 [Ornithorhynchus anatinus]
Length = 1462
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 166/290 (57%), Positives = 216/290 (74%), Gaps = 9/290 (3%)
Query: 207 LKNNMRTEV------PDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKA 260
+KN M T V P C C + + G +YTHLGA ++ +R+ +EER G KGKA
Sbjct: 1109 IKNLMDTPVKTQYDFPSCSCV-EHIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKA 1167
Query: 261 LRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWE 320
+R+E+++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA +V++I+ WE
Sbjct: 1168 IRIERVIYTGKEGKSSQGCPIAKWVVRRSSDEEKLLCLVRERAGHTCETAVVVILILVWE 1227
Query: 321 GVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMY 380
G+PL+ +D +Y+ LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY
Sbjct: 1228 GIPLSLADRLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMY 1287
Query: 381 YNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFERE 438
YNGCK+ARSK RKF+L EE+++E + L+T ++P+YK LAP A+ NQ ++E
Sbjct: 1288 YNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPIYKKLAPDAYNNQIEYEHR 1347
Query: 439 ASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL 488
A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T++ + T+ + +
Sbjct: 1348 APECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLLGAATEFKDV 1397
>gi|37360506|dbj|BAC98231.1| mKIAA1676 protein [Mus musculus]
Length = 625
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 177/350 (50%), Positives = 232/350 (66%), Gaps = 39/350 (11%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
E C C + E G YYTHLGA S+ +R+ +E R G KGKA+R+EKI++TGKEG
Sbjct: 17 EAAPCDCDGGTQ--KEKGPYYTHLGAGPSVAAVRELMETRFGQKGKAIRIEKIVFTGKEG 74
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P +D +Y
Sbjct: 75 KSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRLMADRLYKE 134
Query: 334 LTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTV 392
LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGCK+ RS+
Sbjct: 135 LTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGCKFGRSENP 194
Query: 393 RKFRLS----------------------------------VRSEEQEIEEKMHLLATTIS 418
RKFRL+ + EE+++E+ + LAT ++
Sbjct: 195 RKFRLAPNYPLHNYYKRITGMSSEGSDVKTGWIIPDRKTLISREEKQLEKNLQELATVLA 254
Query: 419 PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 255 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 314
Query: 479 VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
V +L + R + P+DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 315 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 364
>gi|291230173|ref|XP_002735044.1| PREDICTED: CXXC finger 5-like [Saccoglossus kowalevskii]
Length = 1354
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 173/343 (50%), Positives = 245/343 (71%), Gaps = 11/343 (3%)
Query: 196 NSKEMLDHIERLKNNMRTE---VPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEE 252
N +E H +K++ E +P+C C + E G YYT LGA ++ ++R+ +E+
Sbjct: 421 NMEETPTHQLTIKDDKTEEKIVIPNCGCV-DNPNEKEEGPYYTQLGAGRTIAEIREIMEK 479
Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
R G GKA+R+E+++YTGKEGK + GCP+AKWVIRR+S EEK+L++V+HR H C+TA I
Sbjct: 480 RYGDTGKAIRIEQVIYTGKEGKGSMGCPIAKWVIRRSSSEEKVLVVVRHRVNHHCATAVI 539
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
V+ IVAWE + ++++ Y L L+K+G PT RRC TNE ++CACQG D + GASFS
Sbjct: 540 VIAIVAWEALSSDKTNDAYDWLRTTLSKHGNPTVRRCGTNEEKSCACQGYDSEKSGASFS 599
Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQ---EIEEKMHLLATTISPLYKALAPGAF 429
FGCSWSMYYNGCK+ARSKT +KF+L ++ + ++E ++ LAT I+P+YK +AP ++
Sbjct: 600 FGCSWSMYYNGCKFARSKTPKKFKLGNNADSRKDVKLEHRLQTLATLIAPIYKKMAPESY 659
Query: 430 TNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RS 487
NQ E+E+ CRLG++ GRPFSGVTAC DFCAH+H+D HNMN GCT +++LT R+
Sbjct: 660 ANQSAHEQESLPCRLGYEEGRPFSGVTACVDFCAHAHKDQHNMNTGCTTLLTLTGEEIRT 719
Query: 488 LSKP--DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
++KP DEQLHVLPLY + DE+G+ E Q+EK+ G++E L
Sbjct: 720 IAKPRGADEQLHVLPLYKISPVDEYGSFEGQQEKIKNGSLEIL 762
>gi|449488387|ref|XP_002188340.2| PREDICTED: methylcytosine dioxygenase TET3-like [Taeniopygia
guttata]
Length = 1419
Score = 353 bits (907), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 171/282 (60%), Positives = 211/282 (74%), Gaps = 7/282 (2%)
Query: 255 GYKGKALRMEKILYTGKEGKTTQGCPLAKW--VIRRASLEEKLLLIVKHRQGHTCSTAWI 312
G KGKA+R+EK++Y GKEGK+ +GC +AKW VIRR + EEKLL +V+HR GH C A I
Sbjct: 503 GRKGKAIRIEKVMYAGKEGKSFRGCTIAKWMSVIRRHNQEEKLLCLVRHRAGHHCQNAVI 562
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
+++I+AWEG+P D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFS
Sbjct: 563 IILILAWEGIPRTLGDTLYQELTDTLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFS 622
Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFT 430
FGCSWSMY+NGCKYARSKT RKFRL + EE+ + LAT ++PLYK LAP A+
Sbjct: 623 FGCSWSMYFNGCKYARSKTPRKFRLVGDNPKEEELLRRSFQDLATEVAPLYKRLAPQAYQ 682
Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSL 488
NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R +
Sbjct: 683 NQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRVV 742
Query: 489 SK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 743 GKIPEDEQLHVLPLYKMSSTDEFGSEENQNAKVGSGAIQVLT 784
>gi|440910400|gb|ELR60199.1| Putative methylcytosine dioxygenase TET2, partial [Bos grunniens
mutus]
Length = 1394
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 158/268 (58%), Positives = 204/268 (76%), Gaps = 3/268 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1128 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1186
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++ +D +Y+
Sbjct: 1187 KSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVSLADKLYSE 1246
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1247 LTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCKFARSKIPR 1306
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1307 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1366
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVV 479
FSGVTAC DFCAH+HRDL NM NG T+V
Sbjct: 1367 FSGVTACLDFCAHAHRDLQNMQNGSTLV 1394
>gi|297293151|ref|XP_001082840.2| PREDICTED: probable methylcytosine dioxygenase TET2-like [Macaca
mulatta]
Length = 1973
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 171/321 (53%), Positives = 219/321 (68%), Gaps = 32/321 (9%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1127 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1185
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A I + IV +P
Sbjct: 1186 KSSQGCPIAKWVVRRSSSEEKLLCLVRERGGHTCEAAVISIGIVLCVVMP---------- 1235
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
N RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1236 ----------------RLNTERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1279
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1280 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1339
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1340 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1399
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1400 EFGSVEAQEEKKRSGAIQVLS 1420
>gi|321462649|gb|EFX73671.1| hypothetical protein DAPPUDRAFT_200491 [Daphnia pulex]
Length = 401
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 153/238 (64%), Positives = 195/238 (81%), Gaps = 4/238 (1%)
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R+G G+A+R EKI+YTGKEGKT QGCP+AKW+IRR+SLEEK+L ++K R+GH C T
Sbjct: 1 MEQRTGLAGRAIRFEKIIYTGKEGKTAQGCPIAKWIIRRSSLEEKVLCLIKERRGHRCQT 60
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
W++V+ VAWEG+ L SD +Y L +LN +G+ T RRCATNE RTCACQGLDPDTCGA
Sbjct: 61 TWLIVISVAWEGLALRDSDYLYGELVYRLNAHGVATNRRCATNEDRTCACQGLDPDTCGA 120
Query: 370 SFSFGCSWSMYYNGCKYARSK--TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSM++NGCK+ARSK TVRKFRL+ S+E ++ +++ AT I+PLYK +AP
Sbjct: 121 SFSFGCSWSMFFNGCKFARSKQQTVRKFRLTDESQEADMGDRLQRFATAIAPLYKRIAPD 180
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH 485
A+ NQ QFE +A +CRLG PGRPF+GVTACFDFCAHSH+D+H+MNNGCT V+L +H
Sbjct: 181 AYANQVQFEGKAVDCRLGLAPGRPFAGVTACFDFCAHSHKDIHDMNNGCT--VNLLRH 236
>gi|345321271|ref|XP_001520561.2| PREDICTED: methylcytosine dioxygenase TET1-like [Ornithorhynchus
anatinus]
Length = 2358
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 160/270 (59%), Positives = 200/270 (74%), Gaps = 3/270 (1%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
+ E P C C + + G YYTHLG S+ +R+ +E R G KG+A+R+E ++YTGK
Sbjct: 2076 QAEFPTCNCV-EQIIEKDEGPYYTHLGTGPSVAAVREIMETRYGAKGRAIRIEVVVYTGK 2134
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 2135 EGKSSQGCPIAKWVIRRSSNEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 2194
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 2195 QELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 2254
Query: 392 VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FRL +QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K G
Sbjct: 2255 PRRFRLLTDDPKQEESLENNLQNLATDVAPVYKKLAPDAFQNQVENEHLGPDCRLGCKDG 2314
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVV 479
RPFSGVTAC DFCAH+H+D HNM+NG TVV
Sbjct: 2315 RPFSGVTACIDFCAHAHKDTHNMHNGSTVV 2344
>gi|149043923|gb|EDL97374.1| CXXC finger 6 (predicted) [Rattus norvegicus]
Length = 608
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 162/282 (57%), Positives = 209/282 (74%), Gaps = 5/282 (1%)
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+ EEK++ +V+ R H CST
Sbjct: 1 METRYGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCST 60
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCG 368
A IVV+I+ WEG+P +D +Y LT L Y G PT RRC N+ RTC CQG +P TCG
Sbjct: 61 AVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCG 120
Query: 369 ASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAP 426
ASFSFGCSWSMY+NGCK+ RS RKFRL+ E+++EE + LAT ++P+YK +AP
Sbjct: 121 ASFSFGCSWSMYFNGCKFGRSANPRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAP 180
Query: 427 GAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH- 485
A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNMNNG TVV +L +
Sbjct: 181 VAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTLIRED 240
Query: 486 -RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
R S P DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 241 GRDRSVPGDEQLHVLPLYRLADTDEFGSVEGMKAKIQSGAIQ 282
>gi|198433354|ref|XP_002125458.1| PREDICTED: similar to Protein TET2 [Ciona intestinalis]
Length = 1706
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 171/328 (52%), Positives = 220/328 (67%), Gaps = 17/328 (5%)
Query: 215 VPDCKCFAS----DKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTG 270
P C C ++LPP YYTH+GA+ S+ +RK EER G+ G+ALR+EK+ YTG
Sbjct: 767 FPRCTCIPGSDGLEELPP----YYTHIGASHSIQGIRKLFEERCGFTGRALRIEKVCYTG 822
Query: 271 KEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGV 330
KEGKT++GCP+AKWV+RR+S +EK++++ + R GH C TA +VVVI+ WEGV +D
Sbjct: 823 KEGKTSRGCPIAKWVLRRSSEQEKIMVVCRQRPGHRCITAVMVVVIMLWEGVSRPLADFS 882
Query: 331 YAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
Y T + G T RRC TNE RTCACQG DP+ GAS+SFGCSWSMYYNGCK+ARS
Sbjct: 883 YNKCTQLIPTNGTATERRCGTNEERTCACQGFDPEKGGASYSFGCSWSMYYNGCKFARST 942
Query: 391 TVRKFRLSVRSE---EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
KF+L+ + E + + LA+ +S LYK AP A NQ + E E ECRLG+
Sbjct: 943 KPNKFKLNGTKDSNAESCVADFCQRLASAMSVLYKTAAPDAHMNQIERECEGQECRLGYN 1002
Query: 448 P---GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLS-KPDDEQLHVLPL 501
P GRPFSGVT C DFCAH+H+D HNM NG T+V++LTK R + KP DEQLHVLPL
Sbjct: 1003 PPNEGRPFSGVTCCMDFCAHAHKDQHNMENGTTLVLTLTKPELRVIGQKPPDEQLHVLPL 1062
Query: 502 YIMDDSDEFGNKEAQEEKVNTGAIENLN 529
Y +D ++E G E +K+ G+IE LN
Sbjct: 1063 YKLDLTNEEGTFEGVGQKIREGSIEILN 1090
>gi|350597142|ref|XP_003484366.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Sus scrofa]
Length = 1048
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 202/284 (71%), Gaps = 4/284 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 763 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 821
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y+
Sbjct: 822 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRAGHHCPTAVMVVLIMVWDGIPLPLADRLYS 881
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQGLDP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 882 ELTESLKSYNGHPTDRRCTLNENRTCTCQGLDPETCGASFSFGCSWSMYFNGCKFGRSPS 941
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ IE+ + LAT ++P+YK AP A+ NQ FE A ECRLG K G
Sbjct: 942 PRRFRIDPSSPLHEKNIEDNLQTLATELAPIYKQYAPVAYENQVAFEHVARECRLGKKEG 1001
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDD 493
RPFSGVTAC DFCAH HRD+HNMNNG TVV + S + P +
Sbjct: 1002 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTWVAFDSKAPPQN 1045
>gi|117167823|gb|AAI10511.2| TET2 protein [Homo sapiens]
gi|117167991|gb|AAI10510.1| TET2 protein [Homo sapiens]
Length = 805
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 155/251 (61%), Positives = 191/251 (76%), Gaps = 5/251 (1%)
Query: 284 WVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGL 343
WV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+ LT L KYG
Sbjct: 1 WVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGT 60
Query: 344 PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRS 401
T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK RKF+L
Sbjct: 61 LTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPK 120
Query: 402 EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DF
Sbjct: 121 EEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDF 180
Query: 462 CAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
CAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D DEFG+ EAQEE
Sbjct: 181 CAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEE 240
Query: 519 KVNTGAIENLN 529
K +GAI+ L+
Sbjct: 241 KKRSGAIQVLS 251
>gi|354475486|ref|XP_003499959.1| PREDICTED: methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 1956
Score = 327 bits (839), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 220/321 (68%), Gaps = 13/321 (4%)
Query: 214 EVPDCKC---FASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTG 270
E P C C + +DK G YYTHLGA S+ +R+ +E R KGKA+R+EKI Y G
Sbjct: 1329 EGPPCDCKGEYQTDK-----GPYYTHLGAGPSVAAIRELMETRYCEKGKAIRIEKIEYMG 1383
Query: 271 KEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGV 330
KE K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ + +D +
Sbjct: 1384 KESKSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPPLADHL 1443
Query: 331 YAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 389
Y +T+ L Y G PT RRC NE RTC CQGL+P TCGASFSFGCSWSMY NGCK+ RS
Sbjct: 1444 YDEITDNLRSYSGHPTDRRCTFNEKRTCTCQGLNPRTCGASFSFGCSWSMYLNGCKFGRS 1503
Query: 390 KTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFK 447
RKF+L+ E++IE ++ +A T++P+YK +AP A+ NQ ++E A++CRLG K
Sbjct: 1504 PNPRKFKLAPNYPLNEKKIEGILNKVADTLAPIYKQMAPVAYQNQVKYEDVAADCRLGTK 1563
Query: 448 PGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH--RSLSKPDDEQLHVLPLYIMD 505
GRPFSGVT C DFCAHSH+D HNM NG TVV++L + R + DEQ HVLPL+ +
Sbjct: 1564 KGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLLRKDARDRNNLQDEQFHVLPLHRLA 1623
Query: 506 DSDEFGNKEAQEEKVNTGAIE 526
D+DEFG++E E K+ +GAIE
Sbjct: 1624 DTDEFGSREGMEAKIRSGAIE 1644
>gi|47223312|emb|CAF98696.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1615
Score = 324 bits (830), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 157/278 (56%), Positives = 193/278 (69%), Gaps = 31/278 (11%)
Query: 255 GYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVV 314
G KG A+R+E ++YTGKEGK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V+
Sbjct: 970 GAKGNAVRVEVVVYTGKEGKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVI 1029
Query: 315 VIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFG 374
+I+AWEG+ +DG+Y LT L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFG
Sbjct: 1030 LILAWEGISRPVADGLYQELTRTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFG 1089
Query: 375 CSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQ 434
CSWSMY+NGCK+ARSK RKFRL G + + +
Sbjct: 1090 CSWSMYFNGCKFARSKVPRKFRLQ----------------------------GDYPEEVE 1121
Query: 435 FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-P 491
E +CRLG + GRPFSGVTAC DFCAH+HRD NMNNG TVV +LTK +R++ P
Sbjct: 1122 NEEAGRDCRLGQREGRPFSGVTACVDFCAHAHRDTQNMNNGSTVVCTLTKEDNRAVRNVP 1181
Query: 492 DDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
+DEQLHVLPLY + D DEFG E Q K+ +GA++ L+
Sbjct: 1182 EDEQLHVLPLYRISDRDEFGQVEGQWAKIRSGALQVLS 1219
>gi|68342456|gb|AAY90126.1| Ayu17-449 [Mus musculus]
Length = 1919
Score = 321 bits (822), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 166/310 (53%), Positives = 213/310 (68%), Gaps = 16/310 (5%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAW---EGVPLNQSDGVYAILTN--KLNKYGLPT-- 345
EKLL +V+ R HTC TA +V+ V + + Y L +++ L +
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIASVVGRNPKATRIRTLLRTYRYLGQVWHMHQPSLFSDE 1179
Query: 346 TRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQE 405
T++ + R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL R E +
Sbjct: 1180 TKKKQSPPSRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRL--RGAEPK 1237
Query: 406 IEEKM--HL--LATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 461
EE++ HL LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF
Sbjct: 1238 EEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDF 1297
Query: 462 CAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEE 518
AHSHRD NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+
Sbjct: 1298 SAHSHRDQQNMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEK 1357
Query: 519 KVNTGAIENL 528
K+ G+IE L
Sbjct: 1358 KIRMGSIEVL 1367
>gi|326923418|ref|XP_003207933.1| PREDICTED: methylcytosine dioxygenase TET1-like [Meleagris gallopavo]
Length = 1500
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/322 (50%), Positives = 199/322 (61%), Gaps = 45/322 (13%)
Query: 212 RTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
++E+P C C + + G YYTHLG
Sbjct: 779 QSELPTCDCV-EQIIEKDEGPYYTHLGTG------------------------------- 806
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
P VIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y
Sbjct: 807 --------PSVAAVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLY 858
Query: 332 AILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK
Sbjct: 859 KELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKN 918
Query: 392 VRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K G
Sbjct: 919 PRKFRLLTDDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGSKDG 978
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKH---RSLSKPDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH+H+D HNM+NG TVV +LTK R P DEQLHVLPLY +
Sbjct: 979 RPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDNRRVGVIPSDEQLHVLPLYKISQ 1038
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG +E E K+ GAI+ L
Sbjct: 1039 TDEFGTEEGLEAKIKAGAIQVL 1060
>gi|47219959|emb|CAG11492.1| unnamed protein product [Tetraodon nigroviridis]
Length = 400
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 146/228 (64%), Positives = 177/228 (77%), Gaps = 2/228 (0%)
Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
RSG G A+R+EK++YTGKEGK+TQGCP+AKWVIRR S +EKLL++V+ R GHTC+TA I
Sbjct: 1 RSGITGSAIRIEKVVYTGKEGKSTQGCPIAKWVIRRGSEKEKLLVLVRERTGHTCNTACI 60
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
+VVI+ WEG+ + +D +Y L+ L K+G T RRCA NE RTCACQGLDP+ CGASFS
Sbjct: 61 IVVILVWEGILPSLADRLYNELSETLRKHGALTQRRCAHNEERTCACQGLDPEACGASFS 120
Query: 373 FGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFT 430
FGCSWSMYYNGCK+ARSK RKF+L EE+ IE+ LAT ++PLYK LAP A+
Sbjct: 121 FGCSWSMYYNGCKFARSKNPRKFKLLGDDMREEERIEQNFQGLATLLAPLYKTLAPEAYG 180
Query: 431 NQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
NQ + E+ A +CRLG K GRPFSGVTAC DFCAH+HRDLHNM G TV
Sbjct: 181 NQVEHEQRALDCRLGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTV 228
>gi|449664940|ref|XP_002161163.2| PREDICTED: uncharacterized protein LOC100213294 [Hydra
magnipapillata]
Length = 1336
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 143/291 (49%), Positives = 197/291 (67%), Gaps = 4/291 (1%)
Query: 217 DCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTT 276
+C C SD + G +Y HLG+ SL +LR + +R + AL ++ + +T EGK
Sbjct: 325 ECGCAVSD--TSDSGPFYNHLGSGYSLNELRNTLLDRFSIQNSALNLQLVKHTSVEGKNG 382
Query: 277 QGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTN 336
GCPLAKW+IRR S EEK L++V+H +GHTCS+ + V+VIVAWEG+ +D +Y LT
Sbjct: 383 DGCPLAKWIIRRTSDEEKYLVVVRHHEGHTCSSTFTVIVIVAWEGISKQYADDMYRYLTK 442
Query: 337 KLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFR 396
LN+ G T RRC+ NE +TC CQG ++ GASFSFGCSWSM+++GCK+ +S RKF+
Sbjct: 443 TLNESGFRTRRRCSANESKTCLCQGEVEESQGASFSFGCSWSMFFDGCKFTKSTNARKFK 502
Query: 397 LSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
+ +E+EIE+ + + T +SPL K AP + N FE A +CR+G GRPFSGVT
Sbjct: 503 MQDPVKEEEIEKVLQEMTTQVSPLLKIWAPKCYENMTHFEEIADKCRIGLNKGRPFSGVT 562
Query: 457 ACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLY-IMDD 506
C DFCAHSHRD+H++NNG T+V +L K + ++ DEQLHVLPLY ++DD
Sbjct: 563 CCLDFCAHSHRDIHDLNNGTTMVCTLLK-PNYNERTDEQLHVLPLYQLLDD 612
>gi|301610531|ref|XP_002934823.1| PREDICTED: probable methylcytosine dioxygenase TET2 [Xenopus
(Silurana) tropicalis]
Length = 1737
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 209/321 (65%), Gaps = 31/321 (9%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C C + + G YYTHLGA ++ +R+ +EER G KG A+R+E+++YTGKEG
Sbjct: 929 DFPSCSC-VDQIIEKDEGPYYTHLGAGPNVAAIREMMEERFGQKGNAIRIERVVYTGKEG 987
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K+ QGCP+AKWVIRR+ +EK+L +V+ R GH+C TA IV++I+ WEG+ + +D +Y+
Sbjct: 988 KSAQGCPIAKWVIRRSGTDEKMLCLVRERAGHSCETAVIVILILVWEGISFSLADRLYSE 1047
Query: 334 LTNKLNKYGLPTTRRCATNE---PRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
LT LNKYG T RRCA NE + +G+ T G +++F
Sbjct: 1048 LTETLNKYGTLTNRRCARNEEVWEESGVLRGISGIT-GRTYTF----------------- 1089
Query: 391 TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
L+ S E+++E + L+T ++P+YK LAP A+ NQ + E A +CRLG K GR
Sbjct: 1090 ------LADSSLEEKLEANLQHLSTLMAPIYKKLAPDAYHNQIEHEHRAPDCRLGLKEGR 1143
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAHSHRDLHNM NG T+V +LT+ +R K P DEQLHVLPLY + +
Sbjct: 1144 PFSGVTACLDFCAHSHRDLHNMQNGSTLVCTLTREDNRENGKIPQDEQLHVLPLYKVSNV 1203
Query: 508 DEFGNKEAQEEKVNTGAIENL 528
DEFG+ E+QEEK TGAI+ L
Sbjct: 1204 DEFGSSESQEEKKRTGAIQVL 1224
>gi|348564585|ref|XP_003468085.1| PREDICTED: methylcytosine dioxygenase TET2-like [Cavia porcellus]
Length = 1937
Score = 301 bits (770), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 158/341 (46%), Positives = 211/341 (61%), Gaps = 57/341 (16%)
Query: 194 DPNSKEMLDHIERLKNNMRT--EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIE 251
DP+ K +L+ M+T E P C+C + + G +YTHLGA ++ +R+ +E
Sbjct: 1108 DPSIKNLLE------TTMKTQYEFPSCRCV-EQIIEKDEGPFYTHLGAGPNVAAIREIME 1160
Query: 252 ERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAW 311
ER G KGKA+R+EKI+YTGKEGK++QGCP+AKWV RR+S +EKLL +V+ R GHTCS A
Sbjct: 1161 ERFGQKGKAIRIEKIIYTGKEGKSSQGCPIAKWVFRRSSSKEKLLCLVRERTGHTCSAAV 1220
Query: 312 IVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASF 371
I+V+I+ W+ +P + +D +Y L L+K+G T RRCA NE
Sbjct: 1221 ILVMIMVWDAIPRSLADQLYTELRETLHKHGTLTNRRCALNE------------------ 1262
Query: 372 SFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTN 431
SW E+++E + LAT I+P+YK LAP A+ N
Sbjct: 1263 --ETSW-------------------------EEKLESHLQNLATLIAPIYKKLAPDAYNN 1295
Query: 432 QCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL--- 488
Q ++E A +CRLG K GRPFSGVTAC DFCAH+HRDLHNM NG TVV +LT+ +
Sbjct: 1296 QVEYEHRAPDCRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTVVCTLTREDNRDPD 1355
Query: 489 SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
S P+DEQLHVLPLY + D DEFG+ EAQEEK +GAI+ L+
Sbjct: 1356 STPEDEQLHVLPLYKISDVDEFGSAEAQEEKKRSGAIQVLS 1396
>gi|441614500|ref|XP_004088220.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
[Nomascus leucogenys]
Length = 1989
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 200/323 (61%), Gaps = 10/323 (3%)
Query: 213 TEVPDCKCFASDKLPPE-PGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGK 271
+E+P C C D++ + GSYY HL A + +R+ + G KG +R+E +++TG
Sbjct: 1267 SELPTCNCL--DRVTQKIKGSYYIHLXAGPGVAAVREIMVNMYGKKGNTIRIETVVHTGN 1324
Query: 272 EGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVY 331
EGK++ CP+ KWV+ R+S EK L V R GH C TA IV++I+ W+G +D +Y
Sbjct: 1325 EGKSSNRCPIIKWVLTRSSDTEKAXL-VXQRTGHYCPTAVIVMLIMVWDGNHFPVADWLY 1383
Query: 332 AILTNKLNKYGL-PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
LT L + PT RRC + RTC C G+DP+TCGASFSFGCSWSMY+N CK+ R
Sbjct: 1384 TELTENLRSXNMHPTNRRCTLHXNRTCTCXGIDPETCGASFSFGCSWSMYFNDCKFGRGP 1443
Query: 391 TVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKP 448
+ R+FR+ S E+ +E+ + LAT + P+YK AP A+ NQ + E A EC LG K
Sbjct: 1444 SCRRFRIDSSSLLHEKNLEDNLQSLATQLVPIYKQHAPLAYQNQVEHENVAXECXLGSKD 1503
Query: 449 GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMD 505
FSG+ AC DF A HRD+HNMNNG TVV +L + + S P D+QLHVL LY +
Sbjct: 1504 SFSFSGIIACLDFSAQPHRDIHNMNNGSTVVCTLIQEDNFSLSVIPQDKQLHVLILYTLS 1563
Query: 506 DSDEFGNKEAQEEKVNTGAIENL 528
D+DEFG +E E K+ +G E L
Sbjct: 1564 DTDEFGLREGMEAKIKSGTTEVL 1586
>gi|359080787|ref|XP_003588047.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
Length = 2105
Score = 290 bits (743), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 195/317 (61%), Gaps = 35/317 (11%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y+
Sbjct: 1476 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE W +
Sbjct: 1536 QLTESLKSYNGHPTDRRCTLNE----------------------KWVVV--------GTD 1565
Query: 392 VRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
V +R E+ +E+ + LAT ++P+YK AP A+ NQ E A ECRLG K GRP
Sbjct: 1566 VEMMTREIRYREKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIARECRLGKKEGRP 1625
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH HRD+HNMNNG TVV +LT+ + S P DEQLHVLPLY + D+D
Sbjct: 1626 FSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSDTD 1685
Query: 509 EFGNKEAQEEKVNTGAI 525
EFG++E E K+ +GAI
Sbjct: 1686 EFGSREGMEAKIKSGAI 1702
>gi|395820931|ref|XP_003783809.1| PREDICTED: methylcytosine dioxygenase TET1 [Otolemur garnettii]
Length = 2169
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 202/322 (62%), Gaps = 43/322 (13%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1489 SELPTCNCI-DRVIQKDKGPNYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1547
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCPLAKWVIRR+S EEK+L +V+ R GH C A +VV+I+ WEG+PL +D +Y
Sbjct: 1548 GKSSHGCPLAKWVIRRSSKEEKVLCLVRKRIGHRCPAAVMVVLIMVWEGIPLPMADRLYT 1607
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1608 ELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1667
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT + PLY+ AP A+ NQ FE
Sbjct: 1668 PRRFRIDPSSPVHEKNLEDNLQGLATVLGPLYQQYAPVAYQNQVHFE------------- 1714
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
LH+ ++V +LT+ +R+L P DEQLHVLPLY + D
Sbjct: 1715 ------------------TLHS-----SLVCTLTREDNRTLGVIPQDEQLHVLPLYKLAD 1751
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG++E E K+ +GAIE L
Sbjct: 1752 TDEFGSREGMEAKIRSGAIEVL 1773
>gi|380805809|gb|AFE74780.1| methylcytosine dioxygenase TET3, partial [Macaca mulatta]
Length = 348
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 140/230 (60%), Positives = 173/230 (75%), Gaps = 5/230 (2%)
Query: 304 GHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLD 363
GH C A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG D
Sbjct: 1 GHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKD 60
Query: 364 PDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLY 421
P+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLY
Sbjct: 61 PNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLY 120
Query: 422 KALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVS 481
K LAP A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +
Sbjct: 121 KRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCT 180
Query: 482 LTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
LTK +R + K P+DEQLHVLPLY M +DEFG++E Q KV +GAI+ L
Sbjct: 181 LTKEDNRCVGKIPEDEQLHVLPLYKMASTDEFGSEENQNAKVGSGAIQVL 230
>gi|156389231|ref|XP_001634895.1| predicted protein [Nematostella vectensis]
gi|156221983|gb|EDO42832.1| predicted protein [Nematostella vectensis]
Length = 256
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 137/257 (53%), Positives = 181/257 (70%), Gaps = 2/257 (0%)
Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
R G KGKALR+E I+YT KEG+ QGCP+A+WVIRR+ +EK+L++V+ R GH CS A +
Sbjct: 1 RFGIKGKALRIELIIYTNKEGRNAQGCPIARWVIRRSGNDEKVLVLVRKRPGHHCSMALV 60
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFS 372
V +V WEG+ + +Y L+ + + PT RRC N+ ++CACQG+ DTCGASFS
Sbjct: 61 VTSVVIWEGISEERGHSLYKELSGLIPENAAPTIRRCGLNDSKSCACQGVGEDTCGASFS 120
Query: 373 FGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQ 432
FGCSW+MY+NGCK+ARSK+ RK++L S+E+ +E + +AT I+P+Y AP AF NQ
Sbjct: 121 FGCSWNMYFNGCKFARSKSPRKYKLLDSSKEETLERILEGIATEIAPVYSKAAPVAFANQ 180
Query: 433 CQFEREASECRLGFKP-GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKP 491
+ ER ECR+G GRPFSGVT C DFCAHSHRD NM+ G TVV +L K ++P
Sbjct: 181 TREERNGHECRIGHSAVGRPFSGVTCCMDFCAHSHRDKQNMDGGATVVCTLLKP-GCAQP 239
Query: 492 DDEQLHVLPLYIMDDSD 508
+DEQLHVLPLY + D
Sbjct: 240 EDEQLHVLPLYQLLSKD 256
>gi|403274103|ref|XP_003928828.1| PREDICTED: methylcytosine dioxygenase TET1 [Saimiri boliviensis
boliviensis]
Length = 2088
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 148/320 (46%), Positives = 195/320 (60%), Gaps = 49/320 (15%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1415 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGEKGNAIRIEIVVYTGKE 1473
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1474 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1533
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1534 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1593
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQ-CQFEREASECRLGFKP 448
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ C RE
Sbjct: 1594 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVCTLTRED--------- 1644
Query: 449 GRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSD 508
N VV P DEQLHVLPLY + D+D
Sbjct: 1645 ------------------------NRSLGVV-----------PQDEQLHVLPLYKLSDTD 1669
Query: 509 EFGNKEAQEEKVNTGAIENL 528
EFG+KE E K+ +GAIE L
Sbjct: 1670 EFGSKEGMEAKIKSGAIEVL 1689
>gi|340371755|ref|XP_003384410.1| PREDICTED: methylcytosine dioxygenase TET1-like [Amphimedon
queenslandica]
Length = 1077
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 183/282 (64%), Gaps = 6/282 (2%)
Query: 231 GSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRAS 290
G +YTHLGA ++ LR+ +E+R G LRM +I YTG E KT++GCP A+WV+RR S
Sbjct: 181 GIFYTHLGAGSTPETLRETLEKRFNVTGIELRMLEITYTGIEAKTSEGCPTAEWVVRRKS 240
Query: 291 LEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCA 350
EEK L++ +H GH+C + VV IV W+ + ++ Y L L + G PT R+C
Sbjct: 241 KEEKFLVLYRHHIGHSCDEQYTVVSIVYWDALTPERAGYTYNKLVEILPQNGFPTPRKCE 300
Query: 351 TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM 410
N+ +TC+CQG D GAS+SFGCSWS+YY+GCK+ +SK RKF+L V +E E+E +
Sbjct: 301 FNDSKTCSCQGDDKTVHGASYSFGCSWSVYYDGCKFGKSKIPRKFKLQVPEKEPELEGNV 360
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT ++PLYK LAP A++NQ + ECR+G P +PFSG+T C D+CAHSH D H
Sbjct: 361 DELATYLAPLYKRLAPKAYSNQVATQASGEECRIGLGPEKPFSGMTCCMDYCAHSHYDKH 420
Query: 471 NM-NNGCTVVVSLTKH-----RSLSKPDDEQLHVLPLYIMDD 506
NM + G TVVV++ K + + + EQ+H LPLY + D
Sbjct: 421 NMPDGGATVVVTILKEGVYPDQYVKEDTGEQIHCLPLYRLKD 462
>gi|358419451|ref|XP_003584239.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
Length = 2131
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 195/318 (61%), Gaps = 11/318 (3%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y+
Sbjct: 1476 GKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADKLYS 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEP-RTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSK 390
LT L Y G PT RRC NE + + ++ + +NGCK+ RS
Sbjct: 1536 QLTESLKSYNGHPTDRRCTLNENCKLLVLNNTSENEVQYNYQYNYQNQYVFNGCKFXRSP 1595
Query: 391 TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGR 450
+ R+FR+ + L + A A ++ E A ECRLG K GR
Sbjct: 1596 SPRRFRIDPSLPYMKKHSSFPELRKD-----QCEAQQARESEVALEHIARECRLGKKEGR 1650
Query: 451 PFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDS 507
PFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ + S P DEQLHVLPLY + D+
Sbjct: 1651 PFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSFGVIPQDEQLHVLPLYKLSDT 1710
Query: 508 DEFGNKEAQEEKVNTGAI 525
DEFG++E E K+ +GAI
Sbjct: 1711 DEFGSREGMEAKIKSGAI 1728
>gi|297301263|ref|XP_002805756.1| PREDICTED: methylcytosine dioxygenase TET1-like [Macaca mulatta]
Length = 1972
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 117/224 (52%), Positives = 159/224 (70%), Gaps = 4/224 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1413 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1471
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1472 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1531
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1532 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1591
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQC 433
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ
Sbjct: 1592 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQV 1635
>gi|349604556|gb|AEQ00074.1| Methylcytosine dioxygenase TET1-like protein, partial [Equus
caballus]
Length = 375
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 141/222 (63%), Gaps = 32/222 (14%)
Query: 255 GYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVV 314
G KG A+R+E ++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV
Sbjct: 154 GQKGNAVRIEIVVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVV 213
Query: 315 VIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSF 373
+I +G+PL +D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSF
Sbjct: 214 LIWYGDGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSF 273
Query: 374 GCSWSMYYNGCKYARSKTVRKFRLSVRS-------------------------------E 402
GCSWSMY+NGCK+ RS + R+FR+ S
Sbjct: 274 GCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHNYYERITKGRNPERRYMKPEPICPGHEAM 333
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRL 444
E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRL
Sbjct: 334 EKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVARECRL 375
>gi|26350989|dbj|BAC39131.1| unnamed protein product [Mus musculus]
Length = 267
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/155 (59%), Positives = 115/155 (74%), Gaps = 5/155 (3%)
Query: 379 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 436
MY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E
Sbjct: 1 MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60
Query: 437 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDD 493
A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + + P+D
Sbjct: 61 DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGQIPED 120
Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
EQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 121 EQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVL 155
>gi|66396578|gb|AAH96437.1| Tet3 protein [Mus musculus]
Length = 695
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 91/155 (58%), Positives = 114/155 (73%), Gaps = 5/155 (3%)
Query: 379 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 436
MY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E
Sbjct: 1 MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60
Query: 437 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDD 493
A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCT V +LTK +R + + P+D
Sbjct: 61 DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTAVCTLTKEDNRCVGQIPED 120
Query: 494 EQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
EQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 121 EQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVL 155
>gi|441657124|ref|XP_003258249.2| PREDICTED: methylcytosine dioxygenase TET1-like [Nomascus
leucogenys]
Length = 583
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/184 (48%), Positives = 114/184 (61%), Gaps = 34/184 (18%)
Query: 379 MYYNGCKYARSKTVRKFRLSVRSE-------------------------------EQEIE 407
MY+NGCK+ RS + R+FR+ S E+ +E
Sbjct: 1 MYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPGHEAMEKNLE 60
Query: 408 EKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHR 467
+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K GRPFSGVTAC DFCAH HR
Sbjct: 61 DNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHR 120
Query: 468 DLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGA 524
D+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D+DEFG+KE E K+ +GA
Sbjct: 121 DIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGA 180
Query: 525 IENL 528
IE L
Sbjct: 181 IEVL 184
>gi|195998193|ref|XP_002108965.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
gi|190589741|gb|EDV29763.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
Length = 687
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 152/290 (52%), Gaps = 40/290 (13%)
Query: 217 DCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTT 276
+C+C D P +YT LG A S +LR+ + +R LR+ ++ YTG E KT+
Sbjct: 179 NCQCQDEDGAP-----FYTQLGVAGSTEELREMLRDRFRIDESKLRVIEVEYTGVESKTS 233
Query: 277 QGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTN 336
GCP A+W IRR S EKLL +V R+GHTC + +++ IVAW+G+ +++
Sbjct: 234 DGCPRAEWAIRRISKSEKLLALVHRRRGHTCKASVVLMAIVAWDGIHPDRA--------- 284
Query: 337 KLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFR 396
N + C CQG D GA+F+ G + +G K + + ++
Sbjct: 285 ---------------NVLQDCHCQGTDNQREGAAFTLGNMYQTEDDGLKIILNASA--YQ 327
Query: 397 LSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVT 456
L+ ++EQE+ E + LA ++P+YK AP A+ NQ +++ + + PFSGV
Sbjct: 328 LADSAKEQELAEALESLAADLAPVYKKFAPWAYNNQIKYQENCVGKSINEEKNGPFSGVI 387
Query: 457 ACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSLSKPDDE----QLHVLPLY 502
DFCAH+H + +++G ++V +L L P+DE QLH+ P+Y
Sbjct: 388 CSLDFCAHNHVNTEGLDDGASMVCTL-----LKDPEDENVKNQLHIYPMY 432
>gi|10047157|dbj|BAB13372.1| KIAA1546 protein [Homo sapiens]
Length = 684
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 76/130 (58%), Positives = 97/130 (74%), Gaps = 3/130 (2%)
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
E+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DFC
Sbjct: 1 EEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFC 60
Query: 463 AHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
AH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D DEFG+ EAQEEK
Sbjct: 61 AHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEEK 120
Query: 520 VNTGAIENLN 529
+GAI+ L+
Sbjct: 121 KRSGAIQVLS 130
>gi|432106709|gb|ELK32361.1| Methylcytosine dioxygenase TET1 [Myotis davidii]
Length = 2018
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 95/129 (73%), Gaps = 3/129 (2%)
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
E+ +E+ + LAT ++P+Y+ AP A+ NQ QFE A ECRLG K GRPFSGVTAC DFC
Sbjct: 1515 EKNLEDNLQSLATQLAPIYRQYAPVAYQNQIQFEHIARECRLGNKEGRPFSGVTACVDFC 1574
Query: 463 AHSHRDLHNMNNGCTVVVSLTKHRSLS---KPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
H+HRD+HNMNNG TVV +LT+ + S P DEQLHVLPLY + D+DEFG++E E K
Sbjct: 1575 THAHRDIHNMNNGSTVVCTLTREDNRSFGIVPQDEQLHVLPLYKLSDTDEFGSREGMEAK 1634
Query: 520 VNTGAIENL 528
+ +GA++ L
Sbjct: 1635 IRSGAVDVL 1643
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 50/72 (69%), Gaps = 1/72 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + E G YYTHLGA ++ +R+ +E R G KG A+R+EK++YTGKE
Sbjct: 1444 SELPSCNCL-DRVIQKEKGPYYTHLGAGPNVAAVREIMETRYGQKGSAVRIEKVIYTGKE 1502
Query: 273 GKTTQGCPLAKW 284
K++ GCP+AKW
Sbjct: 1503 AKSSHGCPVAKW 1514
>gi|355723851|gb|AES08026.1| tet oncoprotein family member 3 [Mustela putorius furo]
Length = 104
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 67/99 (67%), Positives = 79/99 (79%)
Query: 300 KHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCAC 359
+HR GH C A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCAC
Sbjct: 1 RHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCAC 60
Query: 360 QGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS 398
QG DP TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+
Sbjct: 61 QGKDPSTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLA 99
>gi|241849264|ref|XP_002415674.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
gi|215509888|gb|EEC19341.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
Length = 750
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/161 (45%), Positives = 104/161 (64%), Gaps = 12/161 (7%)
Query: 195 PNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYT--HLGAAASLPDLRKDIEE 252
P + + +ERL++N + E P C C+ +D E Y T A+SLP + E
Sbjct: 599 PGADPWWERLERLRSNAKAEPPACDCYGAD----ETREYRTPRSPSLASSLPAAM--LNE 652
Query: 253 RSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWI 312
R G ALR+EK+LY+GKEGKT+QGCP+AKW+IRR+ EK+L +++HR GH C +A+I
Sbjct: 653 R----GPALRIEKVLYSGKEGKTSQGCPVAKWIIRRSGPSEKVLAVLRHRPGHRCLSAYI 708
Query: 313 VVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNE 353
V+ IVAWEGV + +D +Y +T+K +G PT RRC TNE
Sbjct: 709 VMAIVAWEGVQADMADDLYRTVTHKTVNFGFPTQRRCGTNE 749
>gi|351712963|gb|EHB15882.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
Length = 561
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 67/129 (51%), Positives = 90/129 (69%), Gaps = 3/129 (2%)
Query: 403 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 462
E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G PFSGVTAC DF
Sbjct: 195 EKNLEDNLQNLATELAPIYKQYAPAAYQNQVEYEHVAQECRLGAKEGHPFSGVTACLDFS 254
Query: 463 AHSHRDLHNMNNGCTVVVSLTK--HRSLS-KPDDEQLHVLPLYIMDDSDEFGNKEAQEEK 519
AH H D+HNMN+ TVV +L + +RSL P+D+ LHVL LY + D DEFG+KE E K
Sbjct: 255 AHLHWDIHNMNHRNTVVSTLAREDNRSLGVVPEDKHLHVLLLYRLSDKDEFGSKEGMEAK 314
Query: 520 VNTGAIENL 528
+ +GA++ L
Sbjct: 315 IQSGAVQVL 323
>gi|344237690|gb|EGV93793.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 337
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/105 (56%), Positives = 77/105 (73%), Gaps = 2/105 (1%)
Query: 424 LAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLT 483
+AP A+ NQ ++E A++CRLG K GRPFSGVT C DFCAHSH+D HNM NG TVV++L
Sbjct: 1 MAPVAYQNQVKYEDVAADCRLGTKKGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLL 60
Query: 484 KH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
+ R + DEQ HVLPL+ + D+DEFG++E E K+ +GAIE
Sbjct: 61 RKDARDRNNLQDEQFHVLPLHRLADTDEFGSREGMEAKIRSGAIE 105
>gi|355723854|gb|AES08027.1| tet oncoprotein family member 3 [Mustela putorius furo]
Length = 91
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/89 (65%), Positives = 71/89 (79%), Gaps = 3/89 (3%)
Query: 443 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVL 499
RLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +R + K P+DEQLHVL
Sbjct: 1 RLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVL 60
Query: 500 PLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
PLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 61 PLYKMANTDEFGSEENQNAKVGSGAIQVL 89
>gi|344237691|gb|EGV93794.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 1466
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 88/141 (62%), Gaps = 5/141 (3%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
E P C C K + G YYTHLGA S+ +R+ +E R KGKA+R+EKI Y GKE
Sbjct: 1329 EGPPCDC----KDQTDKGPYYTHLGAGPSVAAIRELMETRYCEKGKAIRIEKIEYMGKES 1384
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ + +D +Y
Sbjct: 1385 KSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPPLADHLYDE 1444
Query: 334 LTNKLNKY-GLPTTRRCATNE 353
+T+ L Y G PT RRC NE
Sbjct: 1445 ITDNLRSYSGHPTDRRCTFNE 1465
>gi|332025525|gb|EGI65688.1| hypothetical protein G5I_05788 [Acromyrmex echinatior]
Length = 1048
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/131 (43%), Positives = 80/131 (61%), Gaps = 26/131 (19%)
Query: 126 KSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPKEPNNNE 185
+ Y F G+GGP + G WCCR GGTE PT EHL+DG CQG++T+DE+L+ ++
Sbjct: 927 RDYKFRGDGGPAKVSPGTGSWCCRRGGTEQPTPEHLRDGCCQGLQTKDEILD-----DSM 981
Query: 186 EPATVKAEDPNS----------KEMLDHIERLKNNMRTEVPDCKCFASDK------LPPE 229
E A +K E P+S ++ DH+++LKNN+RTEVPDC CF +DK +P
Sbjct: 982 EKAELKNEGPHSPHTPTTTTVTTKLQDHLDKLKNNVRTEVPDCNCFPADKCELQLRIP-- 1039
Query: 230 PGSYYTHLGAA 240
YYT++ A
Sbjct: 1040 ---YYTNIEKA 1047
>gi|307213412|gb|EFN88848.1| hypothetical protein EAI_08435 [Harpegnathos saltator]
Length = 685
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 69/115 (60%), Gaps = 7/115 (6%)
Query: 118 EEHSDSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEML- 176
EE S Y F G+GGP + + G WCCR GGTE PT EHL+DG CQG++T+DEML
Sbjct: 554 EEESQKIVPDYKFRGDGGPAKVSPATGSWCCRRGGTEQPTPEHLRDGCCQGLQTKDEMLA 613
Query: 177 ------EPKEPNNNEEPATVKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDK 225
E K P T + ++ DH+++LKNN+RTEVP+C CF +DK
Sbjct: 614 DSPQRDELKSEGGPHSPRTPSTATTTTTKLQDHLDKLKNNVRTEVPNCNCFPADK 668
>gi|355723824|gb|AES08017.1| tet oncoprotein 1 [Mustela putorius furo]
Length = 70
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/70 (64%), Positives = 53/70 (75%), Gaps = 1/70 (1%)
Query: 320 EGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWS 378
+G+PL +D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWS
Sbjct: 1 DGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWS 60
Query: 379 MYYNGCKYAR 388
MY+NGCK+ R
Sbjct: 61 MYFNGCKFGR 70
>gi|68161848|emb|CAD28467.3| hypothetical protein [Homo sapiens]
Length = 414
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/73 (64%), Positives = 56/73 (76%), Gaps = 3/73 (4%)
Query: 459 FDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDDSDEFGNKEA 515
DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D+DEFG+KE
Sbjct: 1 LDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEG 60
Query: 516 QEEKVNTGAIENL 528
E K+ +GAIE L
Sbjct: 61 MEAKIKSGAIEVL 73
>gi|355723821|gb|AES08016.1| tet oncoprotein 1 [Mustela putorius furo]
Length = 218
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 55/77 (71%), Gaps = 1/77 (1%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 143 SELPTCNCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAVRIEIVVYTGKE 201
Query: 273 GKTTQGCPLAKWVIRRA 289
GK++QGCP+AKWV+RR
Sbjct: 202 GKSSQGCPIAKWVLRRG 218
>gi|21410433|gb|AAH31159.1| Tet2 protein [Mus musculus]
gi|26251882|gb|AAH40785.1| Tet2 protein [Mus musculus]
gi|148680233|gb|EDL12180.1| mCG123956 [Mus musculus]
Length = 612
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 44/60 (73%), Gaps = 3/60 (5%)
Query: 472 MNNGCTVVVSLTK--HRSL-SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
M NG TVVV+L + +R + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE L
Sbjct: 1 MPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEVL 60
>gi|351715495|gb|EHB18414.1| Transmembrane protease, serine 11A [Heterocephalus glaber]
Length = 588
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 41/62 (66%), Gaps = 1/62 (1%)
Query: 468 DLHNMNN-GCTVVVSLTKHRSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
DLH ++N G + + + +S+ DEQLH+LPLY + D DEFG+KE E K+ +GA++
Sbjct: 128 DLHIISNSGQKITCQIKDLQEMSENLDEQLHILPLYRLSDKDEFGSKEGMEAKIQSGAVQ 187
Query: 527 NL 528
L
Sbjct: 188 VL 189
>gi|440895445|gb|ELR47628.1| hypothetical protein M91_16421, partial [Bos grunniens mutus]
Length = 614
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 490 KPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
KP+DEQLHVLPLY + D DEFG+ EAQEEK GAI+ L
Sbjct: 15 KPEDEQLHVLPLYKVSDVDEFGSVEAQEEKKRNGAIQVL 53
>gi|149025993|gb|EDL82236.1| similar to KIAA1546 protein (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 644
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 40/57 (70%), Gaps = 3/57 (5%)
Query: 476 CTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
T+VV+LT+ + +P+DEQLHVLPLY + DEFG+ E QEEK+ G+I+ L+
Sbjct: 43 VTLVVTLTREDNREVGGQPEDEQLHVLPLYTIATEDEFGSTEGQEEKILQGSIQVLH 99
>gi|37360442|dbj|BAC98199.1| mKIAA1546 protein [Mus musculus]
Length = 614
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 31/40 (77%)
Query: 489 SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
+KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE L
Sbjct: 23 AKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEVL 62
>gi|335309464|ref|XP_003361647.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Sus
scrofa]
Length = 453
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 491 PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENL 528
P DEQLHVLPLY + D+DEFG++E E K+ +GAI+ L
Sbjct: 16 PQDEQLHVLPLYKLSDTDEFGSREGIEAKIKSGAIKVL 53
>gi|193227751|emb|CAQ60121.1| hypothetical protein [Homo sapiens]
Length = 96
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 16/103 (15%)
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM---------HLLATTISPL 420
SFSFGCSWSMY+NGCK+ RS + R+FR+ S E++ ++ ISP
Sbjct: 1 SFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPG 60
Query: 421 YKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCA 463
++A+ C+ E LG P + CF CA
Sbjct: 61 HEAM------EDCEAENVWEMGGLGILTSVPITPRVVCF-LCA 96
>gi|195192277|ref|XP_002029595.1| GL24721 [Drosophila persimilis]
gi|194104035|gb|EDW26078.1| GL24721 [Drosophila persimilis]
Length = 209
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 6/55 (10%)
Query: 128 YVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQ--DEMLEPKE 180
Y + G+G P + G CCR GGT PPT+EHLKDG C G+ Q +E+L+ E
Sbjct: 107 YTYLGDGKPLN----NGFSCCRQGGTRPPTAEHLKDGTCLGLGIQPKEELLDEDE 157
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.134 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,425,253,563
Number of Sequences: 23463169
Number of extensions: 427106682
Number of successful extensions: 858874
Number of sequences better than 100.0: 236
Number of HSP's better than 100.0 without gapping: 229
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 858134
Number of HSP's gapped (non-prelim): 307
length of query: 529
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 382
effective length of database: 8,910,109,524
effective search space: 3403661838168
effective search space used: 3403661838168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)