BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6131
(225 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|345487243|ref|XP_001599461.2| PREDICTED: hypothetical protein LOC100114438 [Nasonia vitripennis]
Length = 2706
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 178/218 (81%), Positives = 203/218 (93%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKWVIRR+ +EEK+L IVKHRQGH C+TAWIVV +VAWEGVP +
Sbjct: 1570 VVYTGKEGKTTQGCPMAKWVIRRSGIEEKILTIVKHRQGHKCATAWIVVAMVAWEGVPNH 1629
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1630 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1689
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 1690 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 1749
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAH+HRDLHNMNNGCTV V L+
Sbjct: 1750 FKPGRPFSGVTACIDFCAHAHRDLHNMNNGCTVVVSLT 1787
>gi|383857295|ref|XP_003704140.1| PREDICTED: uncharacterized protein LOC100883443 [Megachile
rotundata]
Length = 1646
Score = 405 bits (1041), Expect = e-111, Method: Compositional matrix adjust.
Identities = 178/218 (81%), Positives = 202/218 (92%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 488 VIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 547
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 548 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 607
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 608 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 667
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V L+
Sbjct: 668 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTLT 705
>gi|340722271|ref|XP_003399531.1| PREDICTED: hypothetical protein LOC100642293 [Bombus terrestris]
Length = 1697
Score = 405 bits (1040), Expect = e-111, Method: Compositional matrix adjust.
Identities = 179/223 (80%), Positives = 204/223 (91%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 534 VIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 593
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 594 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 653
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 654 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 713
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSL 223
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V ++ SL
Sbjct: 714 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTMTKHRSL 756
>gi|350416717|ref|XP_003491069.1| PREDICTED: hypothetical protein LOC100741227 [Bombus impatiens]
Length = 1697
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 179/223 (80%), Positives = 204/223 (91%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 534 VIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 593
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 594 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 653
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 654 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 713
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSL 223
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V ++ SL
Sbjct: 714 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTMTKHRSL 756
>gi|380029496|ref|XP_003698406.1| PREDICTED: uncharacterized protein LOC100866593 [Apis florea]
Length = 1865
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 176/218 (80%), Positives = 202/218 (92%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 699 VIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 758
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 759 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPETCGASFSFGCSWSMYYNGCK 818
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 819 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 878
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V ++
Sbjct: 879 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTMT 916
>gi|307188349|gb|EFN73124.1| Protein TET2 [Camponotus floridanus]
Length = 1632
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 177/218 (81%), Positives = 202/218 (92%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW+IRR+ ++EK+L IVKHRQGH C+TAWIVV +VAWEGVP +
Sbjct: 484 VIYTGKEGKTTQGCPMAKWIIRRSGMDEKILTIVKHRQGHKCATAWIVVAMVAWEGVPTH 543
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++LT+KLN++GLPTTRRC TNEPRTCACQGLDPD CGASFSFGCSWSMYYNGCK
Sbjct: 544 EADRIYSLLTHKLNRFGLPTTRRCGTNEPRTCACQGLDPDNCGASFSFGCSWSMYYNGCK 603
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 604 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 663
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V L+
Sbjct: 664 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVSLT 701
>gi|328780619|ref|XP_396330.4| PREDICTED: hypothetical protein LOC412878 [Apis mellifera]
Length = 1695
Score = 402 bits (1033), Expect = e-110, Method: Compositional matrix adjust.
Identities = 176/218 (80%), Positives = 202/218 (92%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW++RR+ LEEK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 529 VIYTGKEGKTTQGCPMAKWILRRSGLEEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 588
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L++KLN++GLPTTRRC TNEPRTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 589 EADRIYSLLSHKLNRFGLPTTRRCGTNEPRTCACQGLDPETCGASFSFGCSWSMYYNGCK 648
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 649 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 708
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V ++
Sbjct: 709 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVTMT 746
>gi|307213413|gb|EFN88849.1| Protein TET2 [Harpegnathos saltator]
Length = 1214
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 177/218 (81%), Positives = 201/218 (92%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGKTTQGCP+AKW+IRR+ ++EK+L IVKHRQGH C TAWIVV +VAWEGVP +
Sbjct: 47 VIYTGKEGKTTQGCPMAKWIIRRSGIDEKILTIVKHRQGHKCPTAWIVVAMVAWEGVPTH 106
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y++L +KLN++GLPTTRRC TNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 107 EADRIYSLLCHKLNRFGLPTTRRCGTNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 166
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQE+EE+MH+LAT +SPLY +LAP AF NQ QFEREASECRLG
Sbjct: 167 YARSKTVRKFRLSVRSEEQEVEERMHVLATLLSPLYLSLAPEAFNNQTQFEREASECRLG 226
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAHSHRDLHNMNNGCTV V L+
Sbjct: 227 FKPGRPFSGVTACIDFCAHSHRDLHNMNNGCTVVVSLT 264
>gi|242005152|ref|XP_002423436.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
gi|212506514|gb|EEB10698.1| hypothetical protein Phum_PHUM059340 [Pediculus humanus corporis]
Length = 1861
Score = 391 bits (1005), Expect = e-107, Method: Compositional matrix adjust.
Identities = 175/218 (80%), Positives = 199/218 (91%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I YTGKEGKTT+GCPLAKWVIRR+ L+EK+L+IVKHR GHTCSTAWIVV +VAW+GVP
Sbjct: 994 ICYTGKEGKTTRGCPLAKWVIRRSGLDEKVLIIVKHRPGHTCSTAWIVVCLVAWDGVPTP 1053
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +YA+LT+KLNK+GLPT RRCATNE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1054 EADRIYALLTHKLNKFGLPTIRRCATNETRTCACQGLDPNTCGASFSFGCSWSMYYNGCK 1113
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSVRSEEQ++EE+MH+LAT +SPLY LAP A++NQ FEREA+ECRLG
Sbjct: 1114 YARSKTVRKFRLSVRSEEQDVEERMHVLATLLSPLYNTLAPEAYSNQTSFEREAAECRLG 1173
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
FKPGRPFSGVTAC DFCAH+HRDLHNMNNGCTV L+
Sbjct: 1174 FKPGRPFSGVTACIDFCAHAHRDLHNMNNGCTVVFTLT 1211
>gi|357624916|gb|EHJ75511.1| hypothetical protein KGM_05166 [Danaus plexippus]
Length = 2066
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 164/214 (76%), Positives = 191/214 (89%)
Query: 5 GKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDG 64
GKEGKT QGCP+AKW+IRR+S EK+L +VK R GH CST+WIVV +VAWEG+P +++D
Sbjct: 1062 GKEGKTAQGCPMAKWIIRRSSYTEKVLAVVKFRNGHKCSTSWIVVCLVAWEGIPQSEADL 1121
Query: 65 VYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARS 124
Y +L++KLN+YGLPTTRRCATNE RTCACQGLDP+TCGAS+SFGCSWSMYYNGCKYARS
Sbjct: 1122 DYTLLSHKLNRYGLPTTRRCATNENRTCACQGLDPETCGASYSFGCSWSMYYNGCKYARS 1181
Query: 125 KTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 184
KTVRKFRLSV++EE EIEE+MH+LAT +SPLY LAP +F NQCQFE+EAS+CRLGFKPG
Sbjct: 1182 KTVRKFRLSVKTEESEIEERMHVLATLLSPLYMNLAPKSFENQCQFEKEASDCRLGFKPG 1241
Query: 185 RPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
RPFSGVTAC DFCAH+HRDLHNMNNGCT V L+
Sbjct: 1242 RPFSGVTACIDFCAHAHRDLHNMNNGCTAVVTLA 1275
>gi|328712256|ref|XP_001947546.2| PREDICTED: hypothetical protein LOC100159694 [Acyrthosiphon pisum]
Length = 2023
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 161/218 (73%), Positives = 195/218 (89%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+LYTGKEGKTTQGCPLAKWVIRR+S +EKLL++VK+R+GH C +WIV+ IV+WEG+ +
Sbjct: 1306 VLYTGKEGKTTQGCPLAKWVIRRSSTDEKLLVVVKNRRGHKCQHSWIVICIVSWEGILSD 1365
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y +L++KLNKYG+PTTRRC TN+PRTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1366 EADFLYTMLSHKLNKYGVPTTRRCGTNDPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1425
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSK VRKFRLSVR+EEQE+EE++H+LAT +SPLYK+LAP ++ NQ Q ERE S+CRLG
Sbjct: 1426 YARSKDVRKFRLSVRTEEQELEERLHVLATNLSPLYKSLAPRSYNNQIQCEREGSDCRLG 1485
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
KPGRPF+ VTAC DFCAH+HRD HNM+NGCTV V L+
Sbjct: 1486 LKPGRPFASVTACIDFCAHAHRDFHNMHNGCTVVVTLN 1523
>gi|157125426|ref|XP_001654335.1| hypothetical protein AaeL_AAEL001921 [Aedes aegypti]
gi|108882699|gb|EAT46924.1| AAEL001921-PA [Aedes aegypti]
Length = 1953
Score = 363 bits (933), Expect = 2e-98, Method: Composition-based stats.
Identities = 159/220 (72%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR EEKLL IVK RQGH C A+IV+ IV W+G+P
Sbjct: 1132 VVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFIVKRRQGHRCKAAFIVICIVVWDGIPTQ 1191
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D VY +L+ KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYNGCK
Sbjct: 1192 EADSVYRMLSVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYNGCK 1251
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +CRLG
Sbjct: 1252 YARSKTVRKFRLSVKNEEAEIEERMNILATMLSPLYVTVAPQAFQNQVQYEREAPDCRLG 1311
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V L P
Sbjct: 1312 LKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKP 1351
>gi|170047947|ref|XP_001851464.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870207|gb|EDS33590.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1872
Score = 361 bits (927), Expect = 1e-97, Method: Composition-based stats.
Identities = 157/220 (71%), Positives = 185/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR EEKLL +VK RQGH C ++IV+ IV W+G+P
Sbjct: 1040 VVYTGKEGKSSQGCPIAKWVIRRVDQEEKLLFVVKRRQGHRCKASFIVICIVVWDGIPTQ 1099
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D VY +L KLNKYGLPT RRCATNE RTCACQGLDP+TCG S+SFGCSWSMYYNGCK
Sbjct: 1100 EADSVYRMLAVKLNKYGLPTVRRCATNENRTCACQGLDPETCGVSYSFGCSWSMYYNGCK 1159
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +CRLG
Sbjct: 1160 YARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDCRLG 1219
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V L P
Sbjct: 1220 LKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKP 1259
>gi|158286121|ref|XP_001688023.1| AGAP007180-PA [Anopheles gambiae str. PEST]
gi|157020316|gb|EDO64672.1| AGAP007180-PA [Anopheles gambiae str. PEST]
Length = 2328
Score = 359 bits (921), Expect = 6e-97, Method: Composition-based stats.
Identities = 155/220 (70%), Positives = 185/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR EEKLL +VK RQGH C ++IV+ IV W+G+P +
Sbjct: 1389 VVYTGKEGKSSQGCPIAKWVIRRVDPEEKLLFVVKRRQGHRCKASFIVICIVVWDGIPTH 1448
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D VY +L KLNK+GLPT RRCATNE RTCACQGLDP+ CG S+SFGCSWSMYYNGCK
Sbjct: 1449 EADSVYRMLAVKLNKFGLPTVRRCATNENRTCACQGLDPELCGVSYSFGCSWSMYYNGCK 1508
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV++EE EIEE+M++LAT +SPLY +AP AF NQ Q+EREA +CRLG
Sbjct: 1509 YARSKTVRKFRLSVKNEEAEIEERMNVLATMLSPLYVTVAPQAFQNQVQYEREAPDCRLG 1568
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
KPG+PFSGVT C DFCAH+HRDLHNM +GCTV V L P
Sbjct: 1569 LKPGKPFSGVTCCLDFCAHTHRDLHNMQDGCTVQVTLLKP 1608
>gi|194749274|ref|XP_001957064.1| GF10236 [Drosophila ananassae]
gi|190624346|gb|EDV39870.1| GF10236 [Drosophila ananassae]
Length = 2255
Score = 350 bits (899), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 155/220 (70%), Positives = 187/220 (85%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKTTQGCP+AKWVIRRA +EEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1077 IVYTGKEGKTTQGCPVAKWVIRRADMEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1136
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCK
Sbjct: 1137 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCK 1196
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E+EAS+CRLG
Sbjct: 1197 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEQEASDCRLG 1256
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1257 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1296
>gi|195020981|ref|XP_001985305.1| GH14579 [Drosophila grimshawi]
gi|193898787|gb|EDV97653.1| GH14579 [Drosophila grimshawi]
Length = 2971
Score = 349 bits (896), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 156/220 (70%), Positives = 185/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKTTQGCP+AKWVIRRA EEK+L++VK R GH C A+IVV +VAW+GVP
Sbjct: 1762 IVYTGKEGKTTQGCPVAKWVIRRADPEEKILVVVKKRPGHRCIAAYIVVCMVAWDGVPRL 1821
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCK
Sbjct: 1822 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCK 1881
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1882 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1941
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1942 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1981
>gi|198463152|ref|XP_002135446.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
gi|198151134|gb|EDY74073.1| GA28319 [Drosophila pseudoobscura pseudoobscura]
Length = 2141
Score = 348 bits (893), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 155/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 919 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 978
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCK
Sbjct: 979 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCK 1038
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1039 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEGEASDCRLG 1098
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1099 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1138
>gi|386770417|ref|NP_001246581.1| CG43444, isoform A [Drosophila melanogaster]
gi|383291702|gb|AFH04252.1| CG43444, isoform A [Drosophila melanogaster]
Length = 2860
Score = 348 bits (893), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1686 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1745
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1746 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1805
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1806 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1865
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1866 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1905
>gi|442629819|ref|NP_001261343.1| CG43444, isoform E [Drosophila melanogaster]
gi|440215220|gb|AGB94038.1| CG43444, isoform E [Drosophila melanogaster]
Length = 2866
Score = 348 bits (893), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1692 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1751
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1752 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1811
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1812 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1871
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1872 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1911
>gi|442629821|ref|NP_001261344.1| CG43444, isoform F [Drosophila melanogaster]
gi|440215221|gb|AGB94039.1| CG43444, isoform F [Drosophila melanogaster]
Length = 2921
Score = 348 bits (892), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1747 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1806
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1807 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1866
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1867 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1926
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1927 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1966
>gi|386770419|ref|NP_001246582.1| CG43444, isoform B [Drosophila melanogaster]
gi|383291703|gb|AFH04253.1| CG43444, isoform B [Drosophila melanogaster]
Length = 2915
Score = 348 bits (892), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1741 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1800
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1801 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1860
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1861 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1920
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1921 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1960
>gi|195429100|ref|XP_002062602.1| GK17629 [Drosophila willistoni]
gi|194158687|gb|EDW73588.1| GK17629 [Drosophila willistoni]
Length = 2132
Score = 347 bits (890), Expect = 2e-93, Method: Composition-based stats.
Identities = 155/222 (69%), Positives = 188/222 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1021 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1080
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCK
Sbjct: 1081 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCK 1140
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1141 YARSKTVRKFRLSVKSEETAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1200
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P++
Sbjct: 1201 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPNN 1242
>gi|194865212|ref|XP_001971317.1| GG14498 [Drosophila erecta]
gi|190653100|gb|EDV50343.1| GG14498 [Drosophila erecta]
Length = 2186
Score = 347 bits (889), Expect = 3e-93, Method: Composition-based stats.
Identities = 155/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1009 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1068
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1069 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1128
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IEE M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1129 YARSKTVRKFRLSVKSEEAAIEEHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1188
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1189 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1228
>gi|195336964|ref|XP_002035103.1| GM14104 [Drosophila sechellia]
gi|194128196|gb|EDW50239.1| GM14104 [Drosophila sechellia]
Length = 1253
Score = 346 bits (888), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 250 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 309
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 310 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 369
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 370 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 429
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 430 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 469
>gi|195377914|ref|XP_002047732.1| GJ11762 [Drosophila virilis]
gi|194154890|gb|EDW70074.1| GJ11762 [Drosophila virilis]
Length = 2228
Score = 346 bits (887), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 153/220 (69%), Positives = 185/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA EEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1025 IVYTGKEGKTSQGCPVAKWVIRRADPEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 1084
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1085 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1144
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1145 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1204
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1205 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1244
>gi|386770421|ref|NP_647750.4| CG43444, isoform C [Drosophila melanogaster]
gi|386770423|ref|NP_001246583.1| CG43444, isoform D [Drosophila melanogaster]
gi|383291704|gb|AAF47691.4| CG43444, isoform C [Drosophila melanogaster]
gi|383291705|gb|AFH04254.1| CG43444, isoform D [Drosophila melanogaster]
Length = 2056
Score = 345 bits (886), Expect = 6e-93, Method: Composition-based stats.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 882 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 941
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 942 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1001
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1002 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1061
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1062 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1101
>gi|195129473|ref|XP_002009180.1| GI13905 [Drosophila mojavensis]
gi|193920789|gb|EDW19656.1| GI13905 [Drosophila mojavensis]
Length = 2290
Score = 345 bits (886), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 152/224 (67%), Positives = 187/224 (83%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA EEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 1090 IVYTGKEGKTSQGCPVAKWVIRRADPEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRK 1149
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNK+GLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 1150 EADDAYVNLIPKLNKFGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1209
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1210 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1269
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P + +
Sbjct: 1270 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKPSNRD 1313
>gi|195492879|ref|XP_002094180.1| GE21689 [Drosophila yakuba]
gi|194180281|gb|EDW93892.1| GE21689 [Drosophila yakuba]
Length = 2053
Score = 345 bits (886), Expect = 6e-93, Method: Composition-based stats.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 884 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 943
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 944 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 1003
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 1004 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 1063
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 1064 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 1103
>gi|195587298|ref|XP_002083402.1| GD13372 [Drosophila simulans]
gi|194195411|gb|EDX08987.1| GD13372 [Drosophila simulans]
Length = 907
Score = 345 bits (886), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 23 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 82
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP++ GAS+SFGCSWSMYYNGCK
Sbjct: 83 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPESSGASYSFGCSWSMYYNGCK 142
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 143 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEHEASDCRLG 202
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 203 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 242
>gi|195167891|ref|XP_002024766.1| GL22434 [Drosophila persimilis]
gi|194108171|gb|EDW30214.1| GL22434 [Drosophila persimilis]
Length = 567
Score = 345 bits (885), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 155/220 (70%), Positives = 186/220 (84%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT+QGCP+AKWVIRRA LEEK+L++VK R GH C A+IVV +VAW+G+P
Sbjct: 44 IVYTGKEGKTSQGCPVAKWVIRRADLEEKILVVVKKRPGHRCIAAYIVVCMVAWDGMPRL 103
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D Y L KLNKYGLPTTRRCATNE RTCACQGLDP+T GAS+SFGCSWSMYYNGCK
Sbjct: 104 EADNAYKNLIPKLNKYGLPTTRRCATNENRTCACQGLDPETSGASYSFGCSWSMYYNGCK 163
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSKTVRKFRLSV+SEE IE+ M+L+AT ++P++K + P ++ NQ ++E EAS+CRLG
Sbjct: 164 YARSKTVRKFRLSVKSEEAAIEDHMNLIATLLAPVFKQVCPRSYDNQTKYEGEASDCRLG 223
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNP 220
+PG+PFSGVTAC DFCAHSHRDLHNM +GCTV V L P
Sbjct: 224 LEPGKPFSGVTACLDFCAHSHRDLHNMQDGCTVHVALLKP 263
>gi|270007246|gb|EFA03694.1| hypothetical protein TcasGA2_TC013798 [Tribolium castaneum]
Length = 856
Score = 338 bits (868), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 152/218 (69%), Positives = 182/218 (83%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I YTGKEGKT QGCP+AKWVIRR+ +EK L+IVKHR GH+C +A+IVV IV W+G+P
Sbjct: 394 IRYTGKEGKTAQGCPIAKWVIRRSGSDEKYLIIVKHRPGHSCPSAFIVVCIVMWDGLPQP 453
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
SD +Y +LT+KLNK+GL RRCATNE +TCACQGL+PDTCGASFSFGCSWSMYYNGCK
Sbjct: 454 TSDELYTLLTSKLNKFGLANRRRCATNESKTCACQGLNPDTCGASFSFGCSWSMYYNGCK 513
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
++RSK VRKFRL+V+ EE+ +EEK+ +LAT +SP+Y++LAP AF NQC FE ECRLG
Sbjct: 514 FSRSKFVRKFRLNVQPEEKIVEEKLQILATYLSPIYRSLAPVAFRNQCFFEEGGRECRLG 573
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
+PGRPFSGVTAC DFCAHSH+D HNM NGCTV V L+
Sbjct: 574 LRPGRPFSGVTACLDFCAHSHKDSHNMVNGCTVVVTLT 611
>gi|321462649|gb|EFX73671.1| hypothetical protein DAPPUDRAFT_200491 [Daphnia pulex]
Length = 401
Score = 325 bits (833), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 143/218 (65%), Positives = 181/218 (83%), Gaps = 2/218 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGKT QGCP+AKW+IRR+SLEEK+L ++K R+GH C T W++V+ VAWEG+ L
Sbjct: 17 IIYTGKEGKTAQGCPIAKWIIRRSSLEEKVLCLIKERRGHRCQTTWLIVISVAWEGLALR 76
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
SD +Y L +LN +G+ T RRCATNE RTCACQGLDPDTCGASFSFGCSWSM++NGCK
Sbjct: 77 DSDYLYGELVYRLNAHGVATNRRCATNEDRTCACQGLDPDTCGASFSFGCSWSMFFNGCK 136
Query: 121 YARSK--TVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK TVRKFRL+ S+E ++ +++ AT I+PLYK +AP A+ NQ QFE +A +CR
Sbjct: 137 FARSKQQTVRKFRLTDESQEADMGDRLQRFATAIAPLYKRIAPDAYANQVQFEGKAVDCR 196
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVV 216
LG PGRPF+GVTACFDFCAHSH+D+H+MNNGCTV+++
Sbjct: 197 LGLAPGRPFAGVTACFDFCAHSHKDIHDMNNGCTVNLL 234
>gi|395501402|ref|XP_003755084.1| PREDICTED: methylcytosine dioxygenase TET1 [Sarcophilus harrisii]
Length = 1578
Score = 317 bits (811), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 144/224 (64%), Positives = 174/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 878 VVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 937
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT LNKYG PTTRRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK
Sbjct: 938 LADTLYQELTQSLNKYGCPTTRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCK 997
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK R+FRL EE+ +E + LAT ++P+YK LAP AF NQ + E S+CR
Sbjct: 998 FARSKNPRRFRLIADDPKEEENLESNLQTLATDVAPVYKKLAPDAFQNQVENEHLGSDCR 1057
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HNMNNG TV L+ D+
Sbjct: 1058 LGRKDGRPFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDN 1101
>gi|326918544|ref|XP_003205548.1| PREDICTED: methylcytosine dioxygenase TET2-like [Meleagris gallopavo]
Length = 1955
Score = 316 bits (809), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P +
Sbjct: 1153 VVYTGKEGKSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTS 1212
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT+ L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1213 LADKLYSELTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1272
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1273 FARSKIPRKFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1332
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1333 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1378
>gi|297674086|ref|XP_002815070.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pongo abelii]
Length = 2023
Score = 315 bits (808), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+
Sbjct: 1201 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLS 1260
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1261 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1320
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1321 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1380
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1381 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1426
>gi|296195853|ref|XP_002745572.1| PREDICTED: methylcytosine dioxygenase TET2 [Callithrix jacchus]
Length = 1998
Score = 315 bits (808), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|332819906|ref|XP_526645.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan
troglodytes]
Length = 2023
Score = 315 bits (807), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1201 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1260
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1261 LADKLYSELTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1320
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1321 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1380
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1381 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1426
>gi|397519749|ref|XP_003830016.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pan paniscus]
Length = 2023
Score = 315 bits (807), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1201 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1260
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1261 LADKLYSELTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1320
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1321 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1380
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1381 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1426
>gi|403275632|ref|XP_003929543.1| PREDICTED: methylcytosine dioxygenase TET2 [Saimiri boliviensis
boliviensis]
Length = 1999
Score = 315 bits (807), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1181 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1240
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1241 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1300
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1301 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1360
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1361 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1406
>gi|397519747|ref|XP_003830015.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan paniscus]
Length = 2002
Score = 315 bits (807), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|410213066|gb|JAA03752.1| tet oncogene family member 2 [Pan troglodytes]
gi|410301428|gb|JAA29314.1| tet oncogene family member 2 [Pan troglodytes]
Length = 2002
Score = 315 bits (807), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|332819904|ref|XP_003310448.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Pan
troglodytes]
gi|410352429|gb|JAA42818.1| tet oncogene family member 2 [Pan troglodytes]
Length = 2002
Score = 315 bits (807), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTSRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|297674088|ref|XP_002815071.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Pongo abelii]
Length = 2002
Score = 315 bits (807), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 176/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC +A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCESAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|355749478|gb|EHH53877.1| hypothetical protein EGM_14586 [Macaca fascicularis]
Length = 1999
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1178 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1237
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1238 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1297
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1298 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1357
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1358 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1403
>gi|426345124|ref|XP_004040272.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Gorilla gorilla
gorilla]
Length = 2002
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|426345126|ref|XP_004040273.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Gorilla gorilla
gorilla]
Length = 2023
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1201 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1260
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1261 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1320
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1321 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1380
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1381 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1426
>gi|332216742|ref|XP_003257511.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
[Nomascus leucogenys]
Length = 1996
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1173 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1232
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1233 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1292
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1293 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1352
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1353 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1398
>gi|187761317|ref|NP_001120680.1| methylcytosine dioxygenase TET2 isoform a [Homo sapiens]
gi|239938839|sp|Q6N021.3|TET2_HUMAN RecName: Full=Methylcytosine dioxygenase TET2
gi|227806663|emb|CAX30492.1| tet oncogene family member 2 [Homo sapiens]
Length = 2002
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|402870138|ref|XP_003899096.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2
[Papio anubis]
Length = 2027
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1205 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1264
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1265 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1324
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1325 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1384
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1385 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1430
>gi|449265874|gb|EMC77004.1| putative methylcytosine dioxygenase TET2, partial [Columba livia]
Length = 1470
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P +
Sbjct: 666 VVYTGKEGKSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTS 725
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT+ L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 726 LADKLYTELTDTLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 785
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 786 FARSKIPRKFKLMGDDPKEEEKLESNLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 845
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 846 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 891
>gi|338722529|ref|XP_001503267.3| PREDICTED: methylcytosine dioxygenase TET2 [Equus caballus]
Length = 1933
Score = 314 bits (805), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 142/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VVYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADRLYSELTETLRKYGALTNRRCAHNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L V EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLVDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|355687510|gb|EHH26094.1| hypothetical protein EGK_15982 [Macaca mulatta]
Length = 2003
Score = 314 bits (805), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1182 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1241
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1242 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1301
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1302 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1361
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1362 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1407
>gi|224049493|ref|XP_002193886.1| PREDICTED: methylcytosine dioxygenase TET2 [Taeniopygia guttata]
Length = 1960
Score = 314 bits (805), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA IV++I+ WEG+P +
Sbjct: 1155 VVYTGKEGKSSQGCPIAKWVVRRSSQEEKLLCLVRERAGHTCETAVIVILILVWEGIPTS 1214
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT+ L KYG T RRCA NE R CACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1215 LADRLYSELTDTLRKYGTLTNRRCALNEERNCACQGLDPETCGASFSFGCSWSMYYNGCK 1274
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1275 FARSKIPRKFKLMGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1334
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1335 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1380
>gi|345322870|ref|XP_003430647.1| PREDICTED: methylcytosine dioxygenase TET2 [Ornithorhynchus anatinus]
Length = 1462
Score = 314 bits (804), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 139/215 (64%), Positives = 172/215 (80%), Gaps = 2/215 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC TA +V++I+ WEG+PL+
Sbjct: 1173 VIYTGKEGKSSQGCPIAKWVVRRSSDEEKLLCLVRERAGHTCETAVVVILILVWEGIPLS 1232
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1233 LADRLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1292
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P+YK LAP A+ NQ ++E A ECR
Sbjct: 1293 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPIYKKLAPDAYNNQIEYEHRAPECR 1352
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 213
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+
Sbjct: 1353 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTL 1387
>gi|431897123|gb|ELK06385.1| Putative methylcytosine dioxygenase TET2 [Pteropus alecto]
Length = 2040
Score = 313 bits (803), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ +EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1176 VIYTGKEGKSSQGCPIAKWVVRRSCIEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1235
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1236 LADKLYSELTETLKKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1295
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1296 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1355
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1356 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1401
>gi|380805593|gb|AFE74672.1| methylcytosine dioxygenase TET2 isoform a, partial [Macaca mulatta]
Length = 430
Score = 313 bits (801), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 91 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 150
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 151 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 210
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 211 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 270
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 271 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 316
>gi|395542103|ref|XP_003772974.1| PREDICTED: methylcytosine dioxygenase TET2 [Sarcophilus harrisii]
Length = 2011
Score = 312 bits (800), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1196 VIYTGKEGKSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1255
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1256 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1315
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1316 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYENRAPECR 1375
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1376 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1421
>gi|334313839|ref|XP_001368961.2| PREDICTED: methylcytosine dioxygenase TET1 [Monodelphis domestica]
Length = 2124
Score = 312 bits (800), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 142/224 (63%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 1425 VVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 1484
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT LNKYG PTTRRCA NE RTCACQG+DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1485 LADTLYQELTQSLNKYGCPTTRRCALNEDRTCACQGMDPETCGASFSFGCSWSMYFNGCK 1544
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK R+FRL EE+ +E + LAT ++P+YK LAP AF NQ + E +CR
Sbjct: 1545 FARSKNPRRFRLIADDPKEEEILESNLQSLATDVAPVYKKLAPDAFRNQVENEPLGPDCR 1604
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HNMNNG TV L+ D+
Sbjct: 1605 LGRKDGRPFSGVTACIDFCAHAHKDTHNMNNGSTVVCTLTKEDN 1648
>gi|119626584|gb|EAX06179.1| hCG21336 [Homo sapiens]
Length = 839
Score = 312 bits (800), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 141/226 (62%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 17 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 76
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 77 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 136
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 137 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 196
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 197 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 242
>gi|345795800|ref|XP_535678.3| PREDICTED: methylcytosine dioxygenase TET2 [Canis lupus familiaris]
Length = 2018
Score = 312 bits (799), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1190 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1249
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1250 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1309
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1310 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1369
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1370 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1415
>gi|350587911|ref|XP_003129326.3| PREDICTED: methylcytosine dioxygenase TET2 [Sus scrofa]
Length = 2019
Score = 312 bits (799), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 173/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSGSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLP 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|301782603|ref|XP_002926716.1| PREDICTED: probable methylcytosine dioxygenase TET2-like [Ailuropoda
melanoleuca]
Length = 2006
Score = 312 bits (799), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1182 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1241
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1242 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1301
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1302 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1361
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1362 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1407
>gi|410957091|ref|XP_003985168.1| PREDICTED: methylcytosine dioxygenase TET2 [Felis catus]
Length = 2017
Score = 311 bits (798), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|334330961|ref|XP_003341431.1| PREDICTED: methylcytosine dioxygenase TET2 [Monodelphis domestica]
Length = 2016
Score = 311 bits (798), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1199 VIYTGKEGKSSQGCPIAKWVVRRSCNEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1258
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1259 LADRLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1318
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1319 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1378
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1379 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1424
>gi|417406864|gb|JAA50073.1| Putative vesicle coat complex copii subunit sec31 [Desmodus rotundus]
Length = 2036
Score = 311 bits (797), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1180 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1239
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1240 LADQLYSELTETLRKYGALTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1299
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1300 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1359
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1360 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1405
>gi|344277247|ref|XP_003410414.1| PREDICTED: methylcytosine dioxygenase TET2 [Loxodonta africana]
Length = 2013
Score = 311 bits (797), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1181 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 1240
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1241 LADRLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1300
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1301 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEDRAPECR 1360
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1361 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1406
>gi|395847439|ref|XP_003796382.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Otolemur
garnettii]
gi|395847441|ref|XP_003796383.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 2 [Otolemur
garnettii]
Length = 2014
Score = 311 bits (797), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 140/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 1178 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCFVRERAGHTCEAAVIVILILVWEGIPLS 1237
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1238 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1297
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1298 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1357
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+
Sbjct: 1358 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDN 1401
>gi|426231353|ref|XP_004009704.1| PREDICTED: methylcytosine dioxygenase TET2 [Ovis aries]
Length = 2001
Score = 311 bits (796), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 139/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++
Sbjct: 1179 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVS 1238
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1239 LADKLYSELTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1298
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1299 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1358
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDL NM NG T+ L+ D+ E
Sbjct: 1359 LGLKEGRPFSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNRE 1404
>gi|444723451|gb|ELW64107.1| Methylcytosine dioxygenase TET2 [Tupaia chinensis]
Length = 2020
Score = 311 bits (796), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 173/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL
Sbjct: 1217 VIYTGKEGKSSQGCPIAKWVVRRSCDEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLT 1276
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1277 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1336
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1337 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1396
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1397 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1442
>gi|297466579|ref|XP_001790198.2| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
gi|297475658|ref|XP_002688138.1| PREDICTED: methylcytosine dioxygenase TET2 isoform 1 [Bos taurus]
gi|296486794|tpg|DAA28907.1| TPA: tet oncogene family member 2 [Bos taurus]
Length = 2007
Score = 310 bits (795), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 139/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++
Sbjct: 1179 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVS 1238
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1239 LADKLYSELTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1298
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1299 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1358
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDL NM NG T+ L+ D+ E
Sbjct: 1359 LGLKEGRPFSGVTACLDFCAHAHRDLQNMQNGSTLVCTLTREDNRE 1404
>gi|363735173|ref|XP_421571.3| PREDICTED: methylcytosine dioxygenase TET1 [Gallus gallus]
Length = 1541
Score = 309 bits (792), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 141/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 834 VVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 893
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK
Sbjct: 894 LADTLYKELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCK 953
Query: 121 YARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CR
Sbjct: 954 FARSKNPRKFRLLTDDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCR 1013
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HNM+NG TV L+ D+
Sbjct: 1014 LGSKDGRPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDN 1057
>gi|281346571|gb|EFB22155.1| hypothetical protein PANDA_016408 [Ailuropoda melanoleuca]
Length = 830
Score = 309 bits (792), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 174/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+PL+
Sbjct: 14 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLS 73
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 74 LADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 133
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 134 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 193
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 194 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 239
>gi|449504705|ref|XP_002190919.2| PREDICTED: methylcytosine dioxygenase TET1 [Taeniopygia guttata]
Length = 2187
Score = 309 bits (791), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 141/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 1480 VVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 1539
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1540 LADTLYKELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCK 1599
Query: 121 YARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CR
Sbjct: 1600 FARSKNPRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCR 1659
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HNM+NG TV L+ D+
Sbjct: 1660 LGCKDGRPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDN 1703
>gi|327283432|ref|XP_003226445.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET2-like
[Anolis carolinensis]
Length = 1631
Score = 308 bits (789), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 172/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGK+ QGCP+AKWVIRR S EEKLL +V+ R GH+C TA IVV+I+ WEG+P +
Sbjct: 837 IVYTGKEGKSAQGCPIAKWVIRRGSTEEKLLCLVRERAGHSCETAVIVVLILVWEGIPQS 896
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ L+ L KYG T RRCA NE RTCACQGLD ++CGASFSFGCSWSMYYNGCK
Sbjct: 897 LADKLYSDLSETLRKYGTLTNRRCALNEERTCACQGLDTESCGASFSFGCSWSMYYNGCK 956
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 957 FARSKIPRKFKLLGDDPKEEEKLETSLQTLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1016
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1017 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1062
>gi|449268998|gb|EMC79810.1| Methylcytosine dioxygenase TET1, partial [Columba livia]
Length = 1186
Score = 308 bits (789), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 141/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 919 VVYTGKEGKSSQGCPIAKWVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 978
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK
Sbjct: 979 LADTLYKELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCK 1038
Query: 121 YARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL +QE +E + LAT ++P+YK LAP AF NQ + E +CR
Sbjct: 1039 FARSKNPRKFRLLTDDPKQEELLENNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCR 1098
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HNM+NG TV L+ D+
Sbjct: 1099 LGCKDGRPFSGVTACIDFCAHAHKDTHNMHNGSTVVCTLTKEDN 1142
>gi|440910400|gb|ELR60199.1| Putative methylcytosine dioxygenase TET2, partial [Bos grunniens
mutus]
Length = 1394
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 136/215 (63%), Positives = 169/215 (78%), Gaps = 2/215 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+ WEG+P++
Sbjct: 1179 VIYTGKEGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPVS 1238
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y+ LT L KYG+ T RRCA NE RTCACQGLDPDTCGASFSFGCSWSMYYNGCK
Sbjct: 1239 LADKLYSELTETLRKYGMLTNRRCALNEERTCACQGLDPDTCGASFSFGCSWSMYYNGCK 1298
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1299 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1358
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 213
LG K GRPFSGVTAC DFCAH+HRDL NM NG T+
Sbjct: 1359 LGLKEGRPFSGVTACLDFCAHAHRDLQNMQNGSTL 1393
>gi|427788369|gb|JAA59636.1| Putative thyroid hormone receptor-associated protein complex
subunit [Rhipicephalus pulchellus]
Length = 1666
Score = 306 bits (785), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 147/218 (67%), Positives = 178/218 (81%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+LYTGKEGKT+QGCP+AKWVIRR+S EK+L +++HRQGH C +A+IV+ IVAWEGV +
Sbjct: 640 VLYTGKEGKTSQGCPVAKWVIRRSSPNEKVLAVLRHRQGHRCLSAYIVMAIVAWEGVHAD 699
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y + +K +G PT RRC TNE RTCACQG D + CGASFSFGCSWSMYYNGCK
Sbjct: 700 MADDLYRTVVHKTVNFGFPTQRRCGTNEQRTCACQGADSENCGASFSFGCSWSMYYNGCK 759
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
YARSK+VRKF+LS +SEEQE+EEK+ LAT ++PLY +AP ++ NQ +FE E CRLG
Sbjct: 760 YARSKSVRKFKLSEQSEEQELEEKLQQLATDMAPLYARVAPESYKNQTEFESEGISCRLG 819
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
KPGRPFSGVTAC DFCAHSH+DLHNMNNGCTV V L+
Sbjct: 820 LKPGRPFSGVTACVDFCAHSHKDLHNMNNGCTVVVTLT 857
>gi|291401335|ref|XP_002717242.1| PREDICTED: tet oncogene family member 2 [Oryctolagus cuniculus]
Length = 2011
Score = 306 bits (784), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 138/226 (61%), Positives = 173/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+ EEKLL +V+ R GHTC A IV++I+ WEG+P +
Sbjct: 1181 VIYTGKEGKSSQGCPIAKWVIRRSCSEEKLLCLVRERAGHTCEAAVIVILILLWEGIPQS 1240
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT L +G T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1241 LATELYSELTETLKNHGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1300
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P+YK LAP A+ NQ ++E A ECR
Sbjct: 1301 FARSKVPRKFKLLGDDPKEEEKLESHLQNLSTLLAPIYKKLAPDAYNNQIEYEHRAPECR 1360
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM+NG TV L+ D+ E
Sbjct: 1361 LGLKEGRPFSGVTACLDFCAHAHRDLHNMHNGSTVVCTLTKEDNRE 1406
>gi|410901250|ref|XP_003964109.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
Length = 1134
Score = 306 bits (783), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 142/224 (63%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V++I+AWEG+
Sbjct: 644 VVYTGKEGKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVILILAWEGISRP 703
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+DG+Y LT L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK
Sbjct: 704 VADGLYQELTTTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCK 763
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL EE+++E + LAT ++PLYK LAP AF NQ + E +CR
Sbjct: 764 FARSKVPRKFRLQGDYPEEEEKLETHLQGLATDLAPLYKRLAPEAFQNQVENEDGGGDCR 823
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG + GRPFSGVTAC DFCAH+H+D HNMNNG TV L+ D+
Sbjct: 824 LGQREGRPFSGVTACVDFCAHAHKDTHNMNNGSTVVCTLTKEDN 867
>gi|351694675|gb|EHA97593.1| Putative methylcytosine dioxygenase TET2 [Heterocephalus glaber]
Length = 1947
Score = 306 bits (783), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 175/226 (77%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R+GHTC A IVV+I+ WEG+PL
Sbjct: 1142 VVYTGKEGKSSQGCPIAKWVIRRSSREEKLLCLVRERRGHTCEVAVIVVLILLWEGIPLP 1201
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++ +Y LTN L + G T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1202 LANRLYTELTNTLCRNGSLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCK 1261
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T +SP+Y+ LAP A+ NQ + E A +CR
Sbjct: 1262 FARSKVPRKFKLVGDDPKEEEKLESNLQNLSTFLSPMYQKLAPDAYNNQVELEHRAPDCR 1321
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM+NG T+ L+ D+ E
Sbjct: 1322 LGLKEGRPFSGVTACLDFCAHAHRDLHNMHNGSTLVCTLTREDNRE 1367
>gi|432875799|ref|XP_004072913.1| PREDICTED: methylcytosine dioxygenase TET3-like [Oryzias latipes]
Length = 2014
Score = 306 bits (783), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 136/224 (60%), Positives = 174/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWVIRR S +EK+L +V+HR GH C A I+++I+AWEGVP
Sbjct: 1090 VVYTGKEGKSSHGCPIAKWVIRRGSEKEKVLCLVRHRAGHHCENAVIIILILAWEGVPKA 1149
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y +T+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1150 LADKLYREVTDTLTKYGNPTSRRCGLNDDRTCACQGKDPETCGASFSFGCSWSMYFNGCK 1209
Query: 121 YARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL EE+++ + LAT ++PLYK LAP A++NQC E +AS+CR
Sbjct: 1210 YARSKMPRKFRLQGDHPEEEEKLRDNFQNLATEVAPLYKRLAPQAYSNQCLSEDKASDCR 1269
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSG+TAC DFCAH+H+D HN++NGCTV L+ D+
Sbjct: 1270 LGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDN 1313
>gi|432951908|ref|XP_004084919.1| PREDICTED: methylcytosine dioxygenase TET1-like [Oryzias latipes]
Length = 1530
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 174/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEG+++QGCP+AKWVIRR S EEKLL +V+ R GH+C +A +V++I+AWEG+P
Sbjct: 880 VVYTGKEGRSSQGCPIAKWVIRRGSEEEKLLCLVRQRPGHSCDSAVLVILILAWEGIPRP 939
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT+ L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK
Sbjct: 940 VADHLYRELTDTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCK 999
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL +E++IE + LA+ ++PLYK LAP AF NQ + E S+CR
Sbjct: 1000 FARSKVPRKFRLHGDFPEQEEKIENNLQNLASDLAPLYKKLAPQAFQNQVEHEVAGSDCR 1059
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG + GRPFSGVTAC DFCAH+H+D NMNNG TV L+ D+
Sbjct: 1060 LGREEGRPFSGVTACVDFCAHAHKDTSNMNNGSTVVCTLTKEDN 1103
>gi|410922577|ref|XP_003974759.1| PREDICTED: methylcytosine dioxygenase TET3-like [Takifugu rubripes]
Length = 2020
Score = 304 bits (779), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 137/226 (60%), Positives = 172/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTG+EGK++QGCP+AKWVIRR + EKLL +V+ R GH C A I++VI+AWEGVP
Sbjct: 1104 VVYTGREGKSSQGCPIAKWVIRRGNETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRA 1163
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L++ L KYG PT+RRC N+ RTCACQG DP+ CGASFSFGCSWSMY+NGCK
Sbjct: 1164 MADMLYRDLSDSLTKYGNPTSRRCGFNDDRTCACQGKDPEKCGASFSFGCSWSMYFNGCK 1223
Query: 121 YARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL EE ++ ++ LAT ++PLYK LAP A++NQCQ E +A +CR
Sbjct: 1224 YARSKMPRKFRLQGERPEEEDKVGDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCR 1283
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+ E
Sbjct: 1284 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRE 1329
>gi|351698810|gb|EHB01729.1| Putative methylcytosine dioxygenase TET3 [Heterocephalus glaber]
Length = 1721
Score = 304 bits (778), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 140/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 802 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 861
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 862 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 921
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRLS + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 922 YARSKTPRKFRLSGDNPKEEEVLRKSFQDLATEVAPLYKQLAPQAYQNQVTNEEIAIDCR 981
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 982 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1025
>gi|354495922|ref|XP_003510077.1| PREDICTED: methylcytosine dioxygenase TET3 [Cricetulus griseus]
gi|344253854|gb|EGW09958.1| putative methylcytosine dioxygenase TET3 [Cricetulus griseus]
Length = 1668
Score = 303 bits (777), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A I+++I+AWEG+P +
Sbjct: 748 VIYTGKEGKSSRGCPIAKWVIRRGTLEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRS 807
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 808 LGDALYRELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 867
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 868 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEEVAIDCR 927
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 928 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971
>gi|345321271|ref|XP_001520561.2| PREDICTED: methylcytosine dioxygenase TET1-like [Ornithorhynchus
anatinus]
Length = 2358
Score = 303 bits (776), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 138/215 (64%), Positives = 168/215 (78%), Gaps = 2/215 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P
Sbjct: 2129 VVYTGKEGKSSQGCPIAKWVIRRSSNEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHL 2188
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT L KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK
Sbjct: 2189 LADTLYQELTQSLRKYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCK 2248
Query: 121 YARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK R+FRL +QE +E + LAT ++P+YK LAP AF NQ + E +CR
Sbjct: 2249 FARSKNPRRFRLLTDDPKQEESLENNLQNLATDVAPVYKKLAPDAFQNQVENEHLGPDCR 2308
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 213
LG K GRPFSGVTAC DFCAH+H+D HNM+NG TV
Sbjct: 2309 LGCKDGRPFSGVTACIDFCAHAHKDTHNMHNGSTV 2343
>gi|395841208|ref|XP_003793438.1| PREDICTED: methylcytosine dioxygenase TET3 [Otolemur garnettii]
Length = 1655
Score = 303 bits (776), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 734 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 793
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PTTRRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 794 LGDTLYQELTDTLRKYGNPTTRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 853
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 854 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 913
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 914 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 957
>gi|426226468|ref|XP_004007365.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
[Ovis aries]
Length = 1498
Score = 303 bits (776), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 714 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 773
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 774 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 833
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 834 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 893
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 894 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 937
>gi|335285293|ref|XP_003125075.2| PREDICTED: methylcytosine dioxygenase TET3 [Sus scrofa]
Length = 1660
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 738 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 797
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 798 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 857
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 858 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 917
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 918 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 961
>gi|326667684|ref|XP_003198655.1| PREDICTED: methylcytosine dioxygenase TET3 [Danio rerio]
Length = 1799
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 137/224 (61%), Positives = 170/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTG+EGK++QGCP+AKWV+RR+S +EK+L +VK R GH C+ IVVVI+AWEGVP
Sbjct: 901 VVYTGREGKSSQGCPIAKWVLRRSSEKEKVLCVVKQRPGHHCANTVIVVVILAWEGVPRA 960
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y +T + KYG PT+RRC NE RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 961 LGDKLYREVTETITKYGNPTSRRCGLNEDRTCACQGKDPETCGASFSFGCSWSMYFNGCK 1020
Query: 121 YARSKTVRKFRLSVR--SEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL EE + + LAT ++PLYK LAP A++NQC E AS+CR
Sbjct: 1021 YARSKVPRKFRLQGEHPKEEDNLRDNFQALATHVAPLYKKLAPQAYSNQCLHEDVASDCR 1080
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSG+TAC DFCAH+H+D HN++NGCTV L+ D+
Sbjct: 1081 LGLKEGRPFSGITACMDFCAHAHKDQHNLHNGCTVVCTLTKEDN 1124
>gi|348566495|ref|XP_003469037.1| PREDICTED: methylcytosine dioxygenase TET3-like [Cavia porcellus]
Length = 1670
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 751 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 810
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 811 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 870
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 871 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 930
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 931 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 974
>gi|358414357|ref|XP_582145.4| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
Length = 1657
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 736 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 795
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 796 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 855
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 856 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 915
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 916 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 959
>gi|338714173|ref|XP_001917149.2| PREDICTED: methylcytosine dioxygenase TET3 [Equus caballus]
Length = 1664
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|355723845|gb|AES08024.1| tet oncoprotein family member 2 [Mustela putorius furo]
Length = 870
Score = 303 bits (775), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 140/233 (60%), Positives = 174/233 (74%), Gaps = 9/233 (3%)
Query: 1 ILYTGKEGKTTQGC-------PLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVA 53
++YTGKEGK++QGC P+AKWV+RR+ EEKLL +V+ R GHTC A IV++I+
Sbjct: 43 VIYTGKEGKSSQGCGKSSQGCPIAKWVVRRSCSEEKLLCLVRERAGHTCEAAVIVILILV 102
Query: 54 WEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWS 113
WEG+PL+ +D +Y+ LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWS
Sbjct: 103 WEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWS 162
Query: 114 MYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 171
MYYNGCK+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E
Sbjct: 163 MYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYE 222
Query: 172 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
A ECRLG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 223 HRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 275
>gi|440904536|gb|ELR55033.1| Putative methylcytosine dioxygenase TET3, partial [Bos grunniens
mutus]
Length = 1675
Score = 303 bits (775), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 754 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 813
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 814 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 873
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 874 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 933
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 934 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 977
>gi|410918022|ref|XP_003972485.1| PREDICTED: methylcytosine dioxygenase TET2-like [Takifugu rubripes]
Length = 939
Score = 303 bits (775), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 139/226 (61%), Positives = 172/226 (76%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+TQGCP+AKWVIRR S EEK+L++V+ R GHTC+TA I+VVI+ WEG+ N
Sbjct: 420 VVYTGKEGKSTQGCPIAKWVIRRGSEEEKILVLVRERTGHTCNTACIIVVILVWEGILPN 479
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L++ L K+G T RRCA NE RTCACQGL+P+ CGASFSFGCSWSMYYNGCK
Sbjct: 480 LADRLYHELSDTLRKHGALTQRRCAHNEERTCACQGLNPEACGASFSFGCSWSMYYNGCK 539
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+ +E+ LAT + PLYK+LAP A+ NQ + E+ +CR
Sbjct: 540 FARSKNPRKFKLLGDDMKEEERLEQNFQSLATLLGPLYKSLAPEAYGNQVEHEQRGLDCR 599
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM G TV L+ D+ E
Sbjct: 600 LGHKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTKEDNRE 645
>gi|431920360|gb|ELK18392.1| Putative methylcytosine dioxygenase TET3 [Pteropus alecto]
Length = 1631
Score = 302 bits (774), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 741 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 800
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 801 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 860
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 861 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 920
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 921 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 964
>gi|417406721|gb|JAA50005.1| Putative snf2 family dna-dependent atpase [Desmodus rotundus]
Length = 1759
Score = 302 bits (774), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 842 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 901
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 902 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 961
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 962 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 1021
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1022 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1065
>gi|291386514|ref|XP_002709671.1| PREDICTED: tet oncogene family member 3 [Oryctolagus cuniculus]
Length = 1822
Score = 302 bits (774), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 921 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 980
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 981 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1040
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 1041 YARSKTPRKFRLAGDNPKEEEVLPKSFQGLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 1100
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1101 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1144
>gi|256773243|ref|NP_898961.2| methylcytosine dioxygenase TET3 [Mus musculus]
gi|239938841|sp|Q8BG87.3|TET3_MOUSE RecName: Full=Methylcytosine dioxygenase TET3
Length = 1668
Score = 302 bits (774), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 748 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 807
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 808 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 867
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 868 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAIDCR 927
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 928 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971
>gi|432108066|gb|ELK33047.1| Methylcytosine dioxygenase TET3 [Myotis davidii]
Length = 1772
Score = 302 bits (774), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 818 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 877
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 878 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 937
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 938 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 997
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 998 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1041
>gi|297266306|ref|XP_001107194.2| PREDICTED: probable methylcytosine dioxygenase TET3 [Macaca mulatta]
Length = 1714
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 794 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 853
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 854 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 913
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 914 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 973
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 974 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017
>gi|345782422|ref|XP_540225.3| PREDICTED: methylcytosine dioxygenase TET3 [Canis lupus familiaris]
Length = 1660
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 742 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 801
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGASFSFGCSWSMY+NGCK
Sbjct: 802 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGASFSFGCSWSMYFNGCK 861
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 862 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 921
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 922 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
>gi|47227721|emb|CAG09718.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2294
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 137/226 (60%), Positives = 170/226 (75%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTG+EGK++QGCP+AKWVIRR S EKLL +V+ R GH C A I++VI+AWEGVP
Sbjct: 1533 VVYTGREGKSSQGCPIAKWVIRRGSETEKLLCLVRERAGHHCPNAVIIIVILAWEGVPRA 1592
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L++ L KYG PT RRC N+ RTCACQG DP+ GASFSFGCSWSMY+NGCK
Sbjct: 1593 MADMLYRDLSDSLTKYGNPTNRRCGFNDDRTCACQGKDPEKSGASFSFGCSWSMYFNGCK 1652
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL EE ++ ++ LAT ++PLYK LAP A++NQCQ E +A +CR
Sbjct: 1653 YARSKMPRKFRLQGDRPEEEDKVRDRFQALATHVAPLYKQLAPQAYSNQCQTESKAPDCR 1712
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+ E
Sbjct: 1713 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRE 1758
>gi|301772240|ref|XP_002921545.1| PREDICTED: probable methylcytosine dioxygenase TET3-like [Ailuropoda
melanoleuca]
Length = 1695
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 778 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 837
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGASFSFGCSWSMY+NGCK
Sbjct: 838 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGASFSFGCSWSMYFNGCK 897
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 898 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 957
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 958 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1001
>gi|293346889|ref|XP_002726470.1| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
gi|293358777|ref|XP_001057850.2| PREDICTED: methylcytosine dioxygenase TET3 [Rattus norvegicus]
gi|149036522|gb|EDL91140.1| rCG56357 [Rattus norvegicus]
Length = 1667
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 747 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 806
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 807 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 866
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 867 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAIDCR 926
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 927 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 970
>gi|390474307|ref|XP_002757648.2| PREDICTED: methylcytosine dioxygenase TET3 [Callithrix jacchus]
Length = 1660
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEVAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|281343070|gb|EFB18654.1| hypothetical protein PANDA_010427 [Ailuropoda melanoleuca]
Length = 1674
Score = 302 bits (773), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 757 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 816
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP TCGASFSFGCSWSMY+NGCK
Sbjct: 817 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPSTCGASFSFGCSWSMYFNGCK 876
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 877 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 936
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 937 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 980
>gi|119620108|gb|EAW99702.1| hCG40738 [Homo sapiens]
Length = 1714
Score = 302 bits (773), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 794 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 853
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 854 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 913
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 914 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 973
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 974 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1017
>gi|410955071|ref|XP_003984182.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3
[Felis catus]
Length = 1658
Score = 302 bits (773), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 171/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|402891273|ref|XP_003908876.1| PREDICTED: methylcytosine dioxygenase TET3 [Papio anubis]
Length = 1660
Score = 302 bits (773), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|148666664|gb|EDK99080.1| mCG133587 [Mus musculus]
Length = 1707
Score = 302 bits (773), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 787 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 846
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 847 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 906
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 907 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAIDCR 966
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 967 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1010
>gi|313493537|gb|ADR57138.1| TET3 isoform 2 [Mus musculus]
Length = 1784
Score = 301 bits (772), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 864 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 923
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 924 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 983
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 984 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAIDCR 1043
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1044 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1087
>gi|426336006|ref|XP_004029495.1| PREDICTED: methylcytosine dioxygenase TET3 [Gorilla gorilla
gorilla]
Length = 1662
Score = 301 bits (772), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 742 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 801
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 802 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 861
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 862 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 921
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 922 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
>gi|355565799|gb|EHH22228.1| hypothetical protein EGK_05455, partial [Macaca mulatta]
Length = 1693
Score = 301 bits (772), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 775 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 834
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 835 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 894
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 895 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 954
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 955 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998
>gi|395731669|ref|XP_002811944.2| PREDICTED: methylcytosine dioxygenase TET3 [Pongo abelii]
Length = 1659
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|332813444|ref|XP_515553.3| PREDICTED: methylcytosine dioxygenase TET3 [Pan troglodytes]
Length = 1662
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 742 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 801
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 802 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 861
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 862 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 921
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 922 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 965
>gi|397478119|ref|XP_003810404.1| PREDICTED: methylcytosine dioxygenase TET3 [Pan paniscus]
Length = 1660
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|149944516|ref|NP_659430.1| methylcytosine dioxygenase TET3 [Homo sapiens]
gi|190358928|sp|O43151.3|TET3_HUMAN RecName: Full=Methylcytosine dioxygenase TET3
Length = 1660
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|316990466|gb|ADU77107.1| putative methylcytosine dioxygenase [Homo sapiens]
Length = 1795
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 875 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 934
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 935 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 994
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 995 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 1054
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1055 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1098
>gi|313493535|gb|ADR57137.1| TET3 isoform 1 [Mus musculus]
gi|432138979|gb|AGB05430.1| Tet methylcytosine deoxygenase 3 isoform [Mus musculus]
Length = 1803
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 172/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 883 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 942
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 943 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1002
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 1003 YARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAIDCR 1062
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1063 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1106
>gi|403260367|ref|XP_003922646.1| PREDICTED: methylcytosine dioxygenase TET3 [Saimiri boliviensis
boliviensis]
Length = 1659
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 740 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 799
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 800 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 859
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 860 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEVAIDCR 919
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 920 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
>gi|355751424|gb|EHH55679.1| hypothetical protein EGM_04930, partial [Macaca fascicularis]
Length = 1621
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 775 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 834
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 835 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 894
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 895 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 954
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 955 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 998
>gi|444723357|gb|ELW64014.1| Methylcytosine dioxygenase TET3 [Tupaia chinensis]
Length = 2326
Score = 301 bits (771), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 171/224 (76%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 1358 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 1417
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1418 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1477
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 1478 YARSKTPRKFRLVGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 1537
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1538 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1581
>gi|441643103|ref|XP_003268728.2| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET3,
partial [Nomascus leucogenys]
Length = 1787
Score = 301 bits (771), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 868 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRTGHHCQNAVIVILILAWEGIPRS 927
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 928 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 987
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 988 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 1047
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1048 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1091
>gi|296482779|tpg|DAA24894.1| TPA: hypothetical protein BOS_11388 [Bos taurus]
Length = 964
Score = 301 bits (771), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 43 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 102
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 103 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 162
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 163 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 222
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 223 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 266
>gi|344283728|ref|XP_003413623.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase
TET3-like [Loxodonta africana]
Length = 1582
Score = 301 bits (771), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 664 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 723
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 724 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 783
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 784 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 843
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 844 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 887
>gi|359069958|ref|XP_002691249.2| PREDICTED: methylcytosine dioxygenase TET3 [Bos taurus]
Length = 938
Score = 301 bits (770), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 17 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 76
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 77 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 136
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 137 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 196
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 197 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240
>gi|432847164|ref|XP_004065962.1| PREDICTED: methylcytosine dioxygenase TET2-like [Oryzias latipes]
Length = 1755
Score = 300 bits (768), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 140/227 (61%), Positives = 172/227 (75%), Gaps = 4/227 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+TQGCP+AKWVIRR+S+EEKLL++V+ R GH C TA I+VVI+ WEG+ +
Sbjct: 975 VIYTGKEGKSTQGCPIAKWVIRRSSVEEKLLVLVRERTGHRCETACIIVVILVWEGIQAS 1034
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L+ L K G T RRCA NE RTCACQGL+P+ GASFSFGCSWSMYYNGCK
Sbjct: 1035 LADRLYLELSETLKKNGAHTQRRCAFNEERTCACQGLNPEESGASFSFGCSWSMYYNGCK 1094
Query: 121 YARSKTVRKFRL---SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
+ARSK RKF+L VR EE+++E LAT ++PLYKA+AP A+ NQ + E A +C
Sbjct: 1095 FARSKIPRKFKLLGDDVR-EEEKVERNFQNLATLLAPLYKAMAPEAYGNQVEHEHRAPDC 1153
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
RLG K GRPFSGVTAC DFCAH+HRDLHNM G TV L+ D+ E
Sbjct: 1154 RLGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTVVCTLTREDNRE 1200
>gi|18490118|gb|AAH22243.1| TET3 protein [Homo sapiens]
gi|62702130|gb|AAX93057.1| unknown [Homo sapiens]
gi|168272980|dbj|BAG10329.1| KIAA0401 protein [synthetic construct]
gi|313882564|gb|ADR82768.1| Unknown protein [synthetic construct]
Length = 937
Score = 300 bits (768), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A IV++I+AWEG+P +
Sbjct: 17 VIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRS 76
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 77 LGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 136
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 137 YARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCR 196
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 197 LGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 240
>gi|444725167|gb|ELW65745.1| Methylcytosine dioxygenase TET1 [Tupaia chinensis]
Length = 1472
Score = 300 bits (767), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 135/225 (60%), Positives = 169/225 (75%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWVIRR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL
Sbjct: 822 VVYTGKEGKSSHGCPVAKWVIRRSSEEEKVLCLVRKRAGHHCSTAVIVVLIMVWEGIPLP 881
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 882 MADQLYKELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 941
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 942 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQNLATQLAPIYKQFAPDAYKNQVEYEHVAREC 1001
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1002 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1046
>gi|291404265|ref|XP_002718498.1| PREDICTED: CXXC finger 5-like [Oryctolagus cuniculus]
Length = 2112
Score = 299 bits (765), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 135/225 (60%), Positives = 172/225 (76%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWV+RR+S EEK+L +V+ R GH CSTA IVV+I+ WEG+PL
Sbjct: 1457 VVYTGKEGKSSRGCPVAKWVLRRSSEEEKVLCLVRKRPGHHCSTAVIVVLIMIWEGIPLP 1516
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y+ LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1517 MADRLYSELTENLRSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1576
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1577 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPVAYQNQVEYEHVAREC 1636
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAHSHRD+HNMNNG TV L+ D+
Sbjct: 1637 RLGRKEGRPFSGVTACLDFCAHSHRDIHNMNNGSTVVCTLTREDN 1681
>gi|363742165|ref|XP_003642602.1| PREDICTED: methylcytosine dioxygenase TET3-like [Gallus gallus]
Length = 1308
Score = 298 bits (762), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 137/224 (61%), Positives = 169/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR + EEKLL +V+HR GH C A I+++I+AWEG+P
Sbjct: 403 VIYTGKEGKSSRGCPIAKWVIRRHNQEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRT 462
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 463 LGDTLYQELTDTLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 522
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE+ + + LAT ++PLYK LAP A+ NQ E A +CR
Sbjct: 523 YARSKTPRKFRLVGDNPKEEELLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEDIAIDCR 582
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 583 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 626
>gi|405950810|gb|EKC18772.1| Putative methylcytosine dioxygenase TET2 [Crassostrea gigas]
Length = 1231
Score = 296 bits (757), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 136/218 (62%), Positives = 164/218 (75%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I YTGKEGK++QGCP+AKW+IRR+ EEK L +V+ R GH C TA I+ V+VAWEGVP N
Sbjct: 431 IRYTGKEGKSSQGCPIAKWIIRRSGQEEKYLCVVRQRPGHFCETACIIAVLVAWEGVPQN 490
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L L G T RRC TNE +TCACQG+D GASFSFGCSWSMYYNGCK
Sbjct: 491 MADDLYQYLRTTLPTNGFETERRCGTNERKTCACQGIDLVRRGASFSFGCSWSMYYNGCK 550
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ARS+ RKF+L ++E E+E K+ LAT ++PLY+ +AP A++NQ QFE A CRLG
Sbjct: 551 FARSREARKFKLKDTTKEVELEGKLQDLATKMAPLYQQMAPDAYSNQTQFEDTARMCRLG 610
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
+ GRPFSGVTAC DFCAHSHRDLHNMNNG TV V L+
Sbjct: 611 NEEGRPFSGVTACVDFCAHSHRDLHNMNNGSTVVVTLT 648
>gi|47219959|emb|CAG11492.1| unnamed protein product [Tetraodon nigroviridis]
Length = 400
Score = 295 bits (755), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 138/215 (64%), Positives = 167/215 (77%), Gaps = 2/215 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+TQGCP+AKWVIRR S +EKLL++V+ R GHTC+TA I+VVI+ WEG+ +
Sbjct: 14 VVYTGKEGKSTQGCPIAKWVIRRGSEKEKLLVLVRERTGHTCNTACIIVVILVWEGILPS 73
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L+ L K+G T RRCA NE RTCACQGLDP+ CGASFSFGCSWSMYYNGCK
Sbjct: 74 LADRLYNELSETLRKHGALTQRRCAHNEERTCACQGLDPEACGASFSFGCSWSMYYNGCK 133
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+ IE+ LAT ++PLYK LAP A+ NQ + E+ A +CR
Sbjct: 134 FARSKNPRKFKLLGDDMREEERIEQNFQGLATLLAPLYKTLAPEAYGNQVEHEQRALDCR 193
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 213
LG K GRPFSGVTAC DFCAH+HRDLHNM G TV
Sbjct: 194 LGLKEGRPFSGVTACMDFCAHAHRDLHNMQGGSTV 228
>gi|148237918|ref|NP_001090656.1| tet methylcytosine dioxygenase 3 [Xenopus (Silurana) tropicalis]
gi|117558065|gb|AAI27290.1| LOC100036628 protein [Xenopus (Silurana) tropicalis]
Length = 1901
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 132/224 (58%), Positives = 168/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I+AWEG+P +
Sbjct: 1004 VIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILIMAWEGIPRS 1063
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y +T + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1064 LGDSLYNDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1123
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE +++ LAT ++P+YK LAP A+ NQ E A +CR
Sbjct: 1124 YARSKTPRKFRLIGENPKEEDGLKDNFQNLATKVAPVYKMLAPQAYQNQVNNEDIAIDCR 1183
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1184 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1227
>gi|327287144|ref|XP_003228289.1| PREDICTED: methylcytosine dioxygenase TET3-like [Anolis carolinensis]
Length = 1795
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 135/224 (60%), Positives = 169/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C A I+++I+AWEG+P
Sbjct: 878 VIYTGKEGKSSRGCPIAKWVIRRHNLEEKLLCLVRHRAGHHCQNAVIIILILAWEGIPRT 937
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y L++ L KYG PTTRRC N+ RTCACQG DP++CGASFSFGCSWSMY+NGCK
Sbjct: 938 LGDTLYQELSDILTKYGNPTTRRCGLNDDRTCACQGKDPNSCGASFSFGCSWSMYFNGCK 997
Query: 121 YARSKTVRKFRLS--VRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL +EE + + LAT ++PLY+ LAP A+ NQ E A +CR
Sbjct: 998 YARSKMPRKFRLQGYNPNEEDVLRKNFQDLATEVAPLYQRLAPQAYQNQVTNEDVAIDCR 1057
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1058 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1101
>gi|351702491|gb|EHB05410.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
Length = 2011
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 133/225 (59%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR+S EEK+L +V+ R GH C TA IVV+I+ W+G+PL
Sbjct: 1383 VVYTGKEGKSSQGCPVAKWVIRRSSEEEKVLCLVRQRPGHQCETAVIVVLIMLWDGIPLP 1442
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+ CGASFSFGCSWSMY+NGC
Sbjct: 1443 MADRLYTELTENLKSYSGHPTDRRCTLNENRTCTCQGIDPERCGASFSFGCSWSMYFNGC 1502
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1503 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVAREC 1562
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1563 RLGRKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1607
>gi|260781795|ref|XP_002585985.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
gi|229271061|gb|EEN41996.1| hypothetical protein BRAFLDRAFT_185107 [Branchiostoma floridae]
Length = 326
Score = 294 bits (753), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 131/224 (58%), Positives = 173/224 (77%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGK++QGCP+AKW++RR+S EEK+L +V+HR GH C++++I++ IVAWEG+
Sbjct: 52 IIYTGKEGKSSQGCPIAKWIVRRSSEEEKVLTLVRHRPGHRCNSSYIIICIVAWEGIQRA 111
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++D +Y L+ L+K GLPTTRRC N+ +TCACQG+D + CGASFSFGCSWSMYYNGCK
Sbjct: 112 RADELYDYLSGTLSKAGLPTTRRCGVNDTKTCACQGVDDNNCGASFSFGCSWSMYYNGCK 171
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ARS+ +KF+L SEE IE+ LA + P+Y+ LAP AF NQ ++ AS+CRLG
Sbjct: 172 FARSRVPKKFKLEDPSEEAIIEDHFQRLAGEVGPVYEQLAPDAFRNQTEYSEVASDCRLG 231
Query: 181 FKPG--RPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
F P RPFSGVTAC DFCAH+HRD HNMNNG T+ L+ P++
Sbjct: 232 FGPDNTRPFSGVTACVDFCAHAHRDQHNMNNGSTIVCTLTCPEN 275
>gi|355782886|gb|EHH64807.1| hypothetical protein EGM_18120, partial [Macaca fascicularis]
Length = 1479
Score = 294 bits (752), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 812 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 871
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 872 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 931
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 932 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 991
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 992 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1036
>gi|297686810|ref|XP_002820931.1| PREDICTED: methylcytosine dioxygenase TET1 [Pongo abelii]
Length = 2136
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1529 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1588
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1589 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1648
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1649 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1693
>gi|22001093|gb|AAM88301.1|AF430147_1 leukemia-associated protein with a CXXC domain [Homo sapiens]
Length = 2136
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1529 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1588
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1589 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1648
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1649 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1693
>gi|156139122|ref|NP_085128.2| methylcytosine dioxygenase TET1 [Homo sapiens]
gi|115502139|sp|Q8NFU7.2|TET1_HUMAN RecName: Full=Methylcytosine dioxygenase TET1; AltName:
Full=CXXC-type zinc finger protein 6; AltName:
Full=Leukemia-associated protein with a CXXC domain;
AltName: Full=Ten-eleven translocation 1 gene protein
gi|225000490|gb|AAI72365.1| Tet oncogene 1 [synthetic construct]
Length = 2136
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1529 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1588
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1589 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1648
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1649 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1693
>gi|410975237|ref|XP_003994040.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1,
partial [Felis catus]
Length = 2153
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1484 VVYTGKEGKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1543
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1544 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1603
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1604 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVAREC 1663
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1664 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1708
>gi|344275087|ref|XP_003409345.1| PREDICTED: methylcytosine dioxygenase TET1 [Loxodonta africana]
Length = 2139
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1471 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1530
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1531 MADRLYTELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1590
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1591 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPIAYQNQVEYEHVAREC 1650
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1651 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1695
>gi|73953303|ref|XP_536371.2| PREDICTED: methylcytosine dioxygenase TET1 [Canis lupus familiaris]
Length = 2137
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1470 VVYTGKEGKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1529
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1530 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1589
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1590 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVAREC 1649
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1650 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1694
>gi|119574684|gb|EAW54299.1| CXXC finger 6 [Homo sapiens]
Length = 2150
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1483 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1542
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1543 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1602
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1603 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1662
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1663 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1707
>gi|402880644|ref|XP_003903908.1| PREDICTED: methylcytosine dioxygenase TET1 [Papio anubis]
Length = 2132
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1465 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1524
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1525 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1584
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1585 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1644
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1645 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1689
>gi|301755888|ref|XP_002913781.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Ailuropoda
melanoleuca]
Length = 2143
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1475 VVYTGKEGKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1534
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1535 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1594
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1595 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVAREC 1654
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1655 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1699
>gi|426364946|ref|XP_004049552.1| PREDICTED: methylcytosine dioxygenase TET1 [Gorilla gorilla gorilla]
Length = 2136
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1529 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1588
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1589 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1648
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1649 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1693
>gi|397489915|ref|XP_003846089.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1 [Pan
paniscus]
Length = 2136
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1529 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1588
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1589 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1648
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1649 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1693
>gi|296220536|ref|XP_002756350.1| PREDICTED: methylcytosine dioxygenase TET1 [Callithrix jacchus]
Length = 2134
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1467 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1526
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1527 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1586
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1587 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENIAREC 1646
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1647 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1691
>gi|281346965|gb|EFB22549.1| hypothetical protein PANDA_001619 [Ailuropoda melanoleuca]
Length = 2136
Score = 293 bits (750), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1468 VVYTGKEGKSSHGCPIAKWVLRRGSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1527
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1528 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1587
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1588 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVAREC 1647
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1648 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1692
>gi|316990462|gb|ADU77105.1| putative methylcytosine dioxygenase isoform 1 [Xenopus laevis]
Length = 1924
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 130/224 (58%), Positives = 168/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I+AWEG+P
Sbjct: 1028 VIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILIMAWEGIPRA 1087
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y+ +T + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1088 LGDSLYSDITETITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1147
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE+ + + LAT ++P+Y+ LAP ++ NQ E A +CR
Sbjct: 1148 YARSKTPRKFRLIGDNPKEEEFLNDNFQDLATKVAPVYQMLAPQSYENQVNNEEVAIDCR 1207
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1208 LGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1251
>gi|334313524|ref|XP_003339916.1| PREDICTED: methylcytosine dioxygenase TET3-like [Monodelphis
domestica]
Length = 1614
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 136/224 (60%), Positives = 167/224 (74%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C A I+++I+ WEG+
Sbjct: 689 VVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQAVIIILILVWEGISSE 748
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT L YG PTTRRC N+ RTCACQG DP TCGASFSFGCSWSMY+NGCK
Sbjct: 749 LGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGASFSFGCSWSMYFNGCK 808
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHL--LATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRL+ + E+E + H LAT ++PLYK LAP A+ NQ + E EA +CR
Sbjct: 809 YARSKFPRKFRLTGDNPEEEENLRKHFQNLATQVAPLYKKLAPQAYQNQVKDEEEAIDCR 868
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 869 LGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 912
>gi|426256086|ref|XP_004021676.1| PREDICTED: methylcytosine dioxygenase TET1, partial [Ovis aries]
Length = 2146
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1479 VVYTGKEGKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1538
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y+ LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1539 MADKLYSQLTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1598
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ E A EC
Sbjct: 1599 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIAREC 1658
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1659 RLGKKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1703
>gi|338716538|ref|XP_003363468.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
[Equus caballus]
Length = 1811
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1143 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1202
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1203 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1262
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1263 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYEHVAREC 1322
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1323 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1367
>gi|410043931|ref|XP_507822.3| PREDICTED: methylcytosine dioxygenase TET1 [Pan troglodytes]
Length = 2220
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1553 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1612
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1613 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1672
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1673 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 1732
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1733 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1777
>gi|350597142|ref|XP_003484366.1| PREDICTED: methylcytosine dioxygenase TET1-like, partial [Sus scrofa]
Length = 1048
Score = 293 bits (749), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 132/216 (61%), Positives = 164/216 (75%), Gaps = 3/216 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 815 VVYTGKEGKSSNGCPVAKWVLRRSSDEEKVLCLVRQRAGHHCPTAVMVVLIMVWDGIPLP 874
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y+ LT L Y G PT RRC NE RTC CQGLDP+TCGASFSFGCSWSMY+NGC
Sbjct: 875 LADRLYSELTESLKSYNGHPTDRRCTLNENRTCTCQGLDPETCGASFSFGCSWSMYFNGC 934
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ IE+ + LAT ++P+YK AP A+ NQ FE A EC
Sbjct: 935 KFGRSPSPRRFRIDPSSPLHEKNIEDNLQTLATELAPIYKQYAPVAYENQVAFEHVAREC 994
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 213
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV
Sbjct: 995 RLGKKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTV 1030
>gi|395508956|ref|XP_003758773.1| PREDICTED: methylcytosine dioxygenase TET3 [Sarcophilus harrisii]
Length = 1685
Score = 293 bits (749), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 136/224 (60%), Positives = 166/224 (74%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWV RR + EEKLL +V+HR GH C A I+++I+ WEG+
Sbjct: 739 VVYTGKEGKSSRGCPIAKWVYRRYTEEEKLLCLVRHRSGHRCEQAVIIILIMVWEGIGPE 798
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y LT L YG PTTRRC N+ RTCACQG DP TCGASFSFGCSWSMY+NGCK
Sbjct: 799 LGDTLYRELTETLRCYGNPTTRRCGLNDDRTCACQGKDPSTCGASFSFGCSWSMYFNGCK 858
Query: 121 YARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSK RKFRLS + EE+ + + LAT ++PLYK LAP A+ NQ E EA +CR
Sbjct: 859 YARSKYPRKFRLSGDNPVEEENLRKHFQNLATQVAPLYKKLAPQAYQNQVNNEEEAIDCR 918
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG KPGRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 919 LGLKPGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 962
>gi|316990464|gb|ADU77106.1| putative methylcytosine dioxygenase isoform 2 [Xenopus laevis]
Length = 1915
Score = 292 bits (748), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 130/224 (58%), Positives = 168/224 (75%), Gaps = 2/224 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+++GCP+AKWVIRR S +EKL+ +V+ R GH C A I+++I+AWEG+P
Sbjct: 1021 VIYTGKEGKSSRGCPIAKWVIRRQSEDEKLMCLVRQRAGHHCENAVIIILIMAWEGIPRA 1080
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
D +Y ++ + KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NGCK
Sbjct: 1081 LGDSLYDDISGTITKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCK 1140
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YARSKT RKFRL EE+ +++ LAT ++P+YK LAP A+ NQ E A +CR
Sbjct: 1141 YARSKTPRKFRLIGDNPKEEEFLKDSFQDLATKVAPVYKMLAPQAYQNQANNEDVAIDCR 1200
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
LG + GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1201 LGLEEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 1244
>gi|380803039|gb|AFE73395.1| methylcytosine dioxygenase TET1, partial [Macaca mulatta]
Length = 680
Score = 292 bits (747), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 20 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 79
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 80 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 139
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 140 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 199
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 200 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 244
>gi|12697897|dbj|BAB21767.1| KIAA1676 protein [Homo sapiens]
Length = 735
Score = 291 bits (746), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 168/225 (74%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 68 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 127
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 128 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 187
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 188 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVAREC 247
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 248 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 292
>gi|390347525|ref|XP_785530.3| PREDICTED: uncharacterized protein LOC580376 [Strongylocentrotus
purpuratus]
Length = 1458
Score = 291 bits (745), Expect = 1e-76, Method: Composition-based stats.
Identities = 128/223 (57%), Positives = 166/223 (74%), Gaps = 2/223 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++Y+GKEGK++ GCP+AKW+IRR+S +EK+L++V+HR GH C T++I++ IVAWEGV
Sbjct: 553 VVYSGKEGKSSTGCPIAKWIIRRSSTDEKILVLVRHRPGHRCDTSYIIIAIVAWEGVNNY 612
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D Y +L L +PT RRC TNE +TCACQG PD+CGASF+FGCSWSMYYN CK
Sbjct: 613 VADDTYEMLRTTLPNGAIPTVRRCGTNEDKTCACQGFSPDSCGASFTFGCSWSMYYNTCK 672
Query: 121 YARSKTVRKFRLSVRSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARS+T RKF+L + E E + ++ +AT + PLYK LAP +F N FE E ECR
Sbjct: 673 FARSRTPRKFKLLEANPEVEDVLSDRFQNMATDLGPLYKRLAPESFNNMVVFEEEGKECR 732
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPD 221
LG + GRPF+GVTAC DFCAH+H+D HNMNNGCTV V L+ D
Sbjct: 733 LGKETGRPFAGVTACMDFCAHAHKDQHNMNNGCTVVVTLTKDD 775
>gi|431904167|gb|ELK09589.1| Methylcytosine dioxygenase TET1 [Pteropus alecto]
Length = 2135
Score = 287 bits (734), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 166/225 (73%), Gaps = 5/225 (2%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W G+PL
Sbjct: 1469 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRERTGHHCPTAVMVVLIMVWAGLPL- 1527
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1528 -PDKLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1586
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1587 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATQLAPVYKQYAPVAYQNQVEYEHVAREC 1646
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1647 RLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1691
>gi|293345707|ref|XP_001077411.2| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
gi|293357583|ref|XP_227694.5| PREDICTED: methylcytosine dioxygenase TET2 [Rattus norvegicus]
Length = 1920
Score = 286 bits (732), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 138/228 (60%), Positives = 169/228 (74%), Gaps = 6/228 (2%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA IV+VI+ W+G+P
Sbjct: 1107 VIYTGKEGKSSQGCPIAKWVYRRSSTEEKLLCLVRVRAKHTCDTAVIVIVILLWDGIPKP 1166
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT L+ G+ T RRCA NE R C CQG +P+TCGASFS+GCSWSMYYNGCK
Sbjct: 1167 LASELYSELTEILSNRGICTNRRCAQNENRNCCCQGENPETCGASFSYGCSWSMYYNGCK 1226
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKM--HL--LATTISPLYKALAPGAFTNQCQFEREASE 176
+ARSK RKFRL +E + EEK+ HL LAT I+P+YK LAP A+ NQ +FE A E
Sbjct: 1227 FARSKNPRKFRL--HGDEPKEEEKLGSHLQNLATVIAPIYKKLAPDAYRNQVEFEHRAIE 1284
Query: 177 CRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
CRLG K GRPFSGVTAC DF AH+HRD NM NG TV V L+ D+ E
Sbjct: 1285 CRLGLKEGRPFSGVTACLDFSAHAHRDQQNMANGSTVVVTLTREDNRE 1332
>gi|348575712|ref|XP_003473632.1| PREDICTED: methylcytosine dioxygenase TET1-like [Cavia porcellus]
Length = 2168
Score = 286 bits (731), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 130/225 (57%), Positives = 166/225 (73%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AK VIRR+S EE++L +V+ R GH C TA +V++IV W+G+P
Sbjct: 1450 LVYTGKEGKSSQGCPVAKKVIRRSSEEEEVLCLVRERPGHQCQTAVMVMLIVVWDGIPRP 1509
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1510 MADRLYTELTESLKSYNGHPTDRRCTLNENRTCTCQGTDPETCGASFSFGCSWSMYFNGC 1569
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A EC
Sbjct: 1570 KFGRSPSPRRFRIDPSSPLNEKNLEDNLQNLATELAPIYKQYAPVAYQNQVEYEHVAREC 1629
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1630 RLGRKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1674
>gi|449488387|ref|XP_002188340.2| PREDICTED: methylcytosine dioxygenase TET3-like [Taeniopygia
guttata]
Length = 1419
Score = 286 bits (731), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 135/226 (59%), Positives = 165/226 (73%), Gaps = 4/226 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKW--VIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVP 58
++Y GKEGK+ +GC +AKW VIRR + EEKLL +V+HR GH C A I+++I+AWEG+P
Sbjct: 514 VMYAGKEGKSFRGCTIAKWMSVIRRHNQEEKLLCLVRHRAGHHCQNAVIIILILAWEGIP 573
Query: 59 LNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNG 118
D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGASFSFGCSWSMY+NG
Sbjct: 574 RTLGDTLYQELTDTLTKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNG 633
Query: 119 CKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASE 176
CKYARSKT RKFRL EE+ + LAT ++PLYK LAP A+ NQ E A +
Sbjct: 634 CKYARSKTPRKFRLVGDNPKEEELLRRSFQDLATEVAPLYKRLAPQAYQNQVTNEDVAID 693
Query: 177 CRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 694 CRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 739
>gi|241896976|ref|NP_081660.1| methylcytosine dioxygenase TET1 isoform 2 [Mus musculus]
gi|239977645|sp|Q3URK3.2|TET1_MOUSE RecName: Full=Methylcytosine dioxygenase TET1; AltName:
Full=CXXC-type zinc finger protein 6; AltName:
Full=Ten-eleven translocation 1 gene protein homolog
Length = 2007
Score = 283 bits (723), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 129/220 (58%), Positives = 165/220 (75%), Gaps = 3/220 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1417 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRL 1476
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGC
Sbjct: 1477 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGC 1536
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS+ RKFRL+ E+++E+ + LAT ++PLYK +AP A+ NQ ++E A +C
Sbjct: 1537 KFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDC 1596
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVL 217
RLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV L
Sbjct: 1597 RLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTL 1636
>gi|157057152|ref|NP_001035490.2| methylcytosine dioxygenase TET2 [Mus musculus]
gi|239938840|sp|Q4JK59.3|TET2_MOUSE RecName: Full=Methylcytosine dioxygenase TET2; AltName: Full=Protein
Ayu17-449
Length = 1912
Score = 281 bits (719), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 132/226 (58%), Positives = 166/226 (73%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ I+ W+G+P
Sbjct: 1093 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKL 1152
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT+ L K G+ T RRC+ NE R C CQG +P+TCGASFSFGCSWSMYYNGCK
Sbjct: 1153 LASELYSELTDILGKCGICTNRRCSQNETRNCCCQGENPETCGASFSFGCSWSMYYNGCK 1212
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL + EE+ + + LAT I+P+YK LAP A+ NQ +FE +A +C
Sbjct: 1213 FARSKKPRKFRLHGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCC 1272
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 1273 LGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 1318
>gi|148700127|gb|EDL32074.1| mCG11334 [Mus musculus]
Length = 630
Score = 280 bits (716), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 130/224 (58%), Positives = 166/224 (74%), Gaps = 3/224 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 40 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRL 99
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGC
Sbjct: 100 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGC 159
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS+ RKFRL+ E+++E+ + LAT ++PLYK +AP A+ NQ ++E A +C
Sbjct: 160 KFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLAPLYKQMAPVAYQNQVEYEEVAGDC 219
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPD 221
RLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV L D
Sbjct: 220 RLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTVVCTLIRAD 263
>gi|117167823|gb|AAI10511.2| TET2 protein [Homo sapiens]
gi|117167991|gb|AAI10510.1| TET2 protein [Homo sapiens]
Length = 805
Score = 280 bits (716), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 128/208 (61%), Positives = 157/208 (75%), Gaps = 2/208 (0%)
Query: 19 WVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGL 78
WV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+ LT L KYG
Sbjct: 1 WVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGT 60
Query: 79 PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRS 136
T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK RKF+L
Sbjct: 61 LTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPK 120
Query: 137 EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDF 196
EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DF
Sbjct: 121 EEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDF 180
Query: 197 CAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
CAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 181 CAHAHRDLHNMQNGSTLVCTLTREDNRE 208
>gi|392338377|ref|XP_003753514.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 2 [Rattus
norvegicus]
Length = 2008
Score = 280 bits (715), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 128/220 (58%), Positives = 163/220 (74%), Gaps = 3/220 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1417 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRL 1476
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGC
Sbjct: 1477 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGC 1536
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS RKFRL+ E+++EE + LAT ++P+YK +AP A+ NQ ++E A +C
Sbjct: 1537 KFGRSANPRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAPVAYQNQVEYEDIAGDC 1596
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVL 217
RLG + GRPFSGVT C DFCAHSH+D+HNMNNG TV L
Sbjct: 1597 RLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTL 1636
>gi|74140016|dbj|BAE31842.1| unnamed protein product [Mus musculus]
gi|74151946|dbj|BAE32012.1| unnamed protein product [Mus musculus]
Length = 991
Score = 279 bits (714), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 132/226 (58%), Positives = 166/226 (73%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ I+ W+G+P
Sbjct: 172 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKL 231
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT+ L K G+ T RRC+ NE R C CQG +P+TCGASFSFGCSWSMYYNGCK
Sbjct: 232 LASELYSELTDILGKCGICTNRRCSQNETRNCCCQGENPETCGASFSFGCSWSMYYNGCK 291
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL + EE+ + + LAT I+P+YK LAP A+ NQ +FE +A +C
Sbjct: 292 FARSKKPRKFRLHGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCC 351
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 352 LGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 397
>gi|74142256|dbj|BAE31892.1| unnamed protein product [Mus musculus]
gi|74214512|dbj|BAE31106.1| unnamed protein product [Mus musculus]
Length = 991
Score = 279 bits (714), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 132/226 (58%), Positives = 166/226 (73%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ I+ W+G+P
Sbjct: 172 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKL 231
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT+ L K G+ T RRC+ NE R C CQG +P+TCGASFSFGCSWSMYYNGCK
Sbjct: 232 LASELYSELTDILGKCGICTNRRCSQNETRNCCCQGENPETCGASFSFGCSWSMYYNGCK 291
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL + EE+ + + LAT I+P+YK LAP A+ NQ +FE +A +C
Sbjct: 292 FARSKKPRKFRLHGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCC 351
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 352 LGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 397
>gi|74191515|dbj|BAE30334.1| unnamed protein product [Mus musculus]
Length = 992
Score = 279 bits (714), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 132/226 (58%), Positives = 166/226 (73%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ I+ W+G+P
Sbjct: 172 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKL 231
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y+ LT+ L K G+ T RRC+ NE R C CQG +P+TCGASFSFGCSWSMYYNGCK
Sbjct: 232 LASELYSELTDILGKCGICTNRRCSQNETRNCCCQGENPETCGASFSFGCSWSMYYNGCK 291
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKFRL + EE+ + + LAT I+P+YK LAP A+ NQ +FE +A +C
Sbjct: 292 FARSKKPRKFRLHGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCC 351
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 352 LGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 397
>gi|291230173|ref|XP_002735044.1| PREDICTED: CXXC finger 5-like [Saccoglossus kowalevskii]
Length = 1354
Score = 279 bits (713), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 122/214 (57%), Positives = 164/214 (76%), Gaps = 3/214 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK + GCP+AKWVIRR+S EEK+L++V+HR H C+TA IV+ IVAWE + +
Sbjct: 493 VIYTGKEGKGSMGCPIAKWVIRRSSSEEKVLVVVRHRVNHHCATAVIVIAIVAWEALSSD 552
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+++ Y L L+K+G PT RRC TNE ++CACQG D + GASFSFGCSWSMYYNGCK
Sbjct: 553 KTNDAYDWLRTTLSKHGNPTVRRCGTNEEKSCACQGYDSEKSGASFSFGCSWSMYYNGCK 612
Query: 121 YARSKTVRKFRLSVRSEEQ---EIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
+ARSKT +KF+L ++ + ++E ++ LAT I+P+YK +AP ++ NQ E+E+ C
Sbjct: 613 FARSKTPKKFKLGNNADSRKDVKLEHRLQTLATLIAPIYKKMAPESYANQSAHEQESLPC 672
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGC 211
RLG++ GRPFSGVTAC DFCAH+H+D HNMN GC
Sbjct: 673 RLGYEEGRPFSGVTACVDFCAHAHKDQHNMNTGC 706
>gi|262225298|gb|ACY38292.1| tet oncogene 2 [Mus musculus]
Length = 1921
Score = 278 bits (710), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 133/233 (57%), Positives = 167/233 (71%), Gaps = 9/233 (3%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ I+ W+G+P
Sbjct: 1095 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKL 1154
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNE-------PRTCACQGLDPDTCGASFSFGCSWS 113
+ +Y+ LT+ L K G+ T RRC+ NE PR C CQG +P+TCGASFSFGCSWS
Sbjct: 1155 LASELYSELTDILGKCGICTNRRCSQNETKKKQSPPRNCCCQGENPETCGASFSFGCSWS 1214
Query: 114 MYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 171
MYYNGCK+ARSK RKFRL + EE+ + + LAT I+P+YK LAP A+ NQ +FE
Sbjct: 1215 MYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVEFE 1274
Query: 172 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
+A +C LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 1275 HQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 1327
>gi|149043923|gb|EDL97374.1| CXXC finger 6 (predicted) [Rattus norvegicus]
Length = 608
Score = 276 bits (706), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 129/224 (57%), Positives = 164/224 (73%), Gaps = 3/224 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 17 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRL 76
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGC
Sbjct: 77 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGC 136
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS RKFRL+ E+++EE + LAT ++P+YK +AP A+ NQ ++E A +C
Sbjct: 137 KFGRSANPRKFRLAPNYPLHEKQLEENLQDLATVLAPVYKQMAPVAYQNQVEYEDIAGDC 196
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPD 221
RLG + GRPFSGVT C DFCAHSH+D+HNMNNG TV L D
Sbjct: 197 RLGNEEGRPFSGVTCCMDFCAHSHKDIHNMNNGSTVVCTLIRED 240
>gi|359718960|ref|NP_001240786.1| methylcytosine dioxygenase TET1 isoform 1 [Mus musculus]
Length = 2039
Score = 275 bits (702), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 167/252 (66%), Gaps = 35/252 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1417 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRL 1476
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGC
Sbjct: 1477 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGC 1536
Query: 120 KYARSKTVRKFRLS----------------------------------VRSEEQEIEEKM 145
K+ RS+ RKFRL+ + EE+++E+ +
Sbjct: 1537 KFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDVKTGWIIPDRKTLISREEKQLEKNL 1596
Query: 146 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 205
LAT ++PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+H
Sbjct: 1597 QELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIH 1656
Query: 206 NMNNGCTVSVVL 217
NM+NG TV L
Sbjct: 1657 NMHNGSTVVCTL 1668
>gi|262225296|gb|ACY38291.1| tet oncogene 1 [Mus musculus]
Length = 2039
Score = 275 bits (702), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/256 (51%), Positives = 168/256 (65%), Gaps = 35/256 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1417 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRL 1476
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGC
Sbjct: 1477 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGC 1536
Query: 120 KYARSKTVRKFRLS----------------------------------VRSEEQEIEEKM 145
K+ RS+ RKFRL+ + EE+++E+ +
Sbjct: 1537 KFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDVKTGWIIPDRKTLISREEKQLEKNL 1596
Query: 146 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 205
LAT ++PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+H
Sbjct: 1597 QELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIH 1656
Query: 206 NMNNGCTVSVVLSNPD 221
NM+NG TV L D
Sbjct: 1657 NMHNGSTVVCTLIRAD 1672
>gi|326923418|ref|XP_003207933.1| PREDICTED: methylcytosine dioxygenase TET1-like [Meleagris gallopavo]
Length = 1500
Score = 274 bits (701), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 128/210 (60%), Positives = 154/210 (73%), Gaps = 2/210 (0%)
Query: 15 PLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLN 74
P VIRR+S EEKLL +V+ R GH C TA IV++I+AWEG+P +D +Y LT L
Sbjct: 807 PSVAAVIRRSSDEEKLLCLVRQRAGHHCQTAVIVILILAWEGIPHLLADTLYKELTQSLR 866
Query: 75 KYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSV 134
KYG PT+RRCA NE RTCACQGLDP+TCGASFSFGCSWSMY+NGCK+ARSK RKFRL
Sbjct: 867 KYGCPTSRRCALNEDRTCACQGLDPETCGASFSFGCSWSMYFNGCKFARSKNPRKFRLLT 926
Query: 135 RSEEQE--IEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTA 192
+QE +E + LAT ++P+YK LAP AF NQ + E +CRLG K GRPFSGVTA
Sbjct: 927 DDPKQEELLEHNLQTLATDVAPVYKKLAPEAFQNQVENEHMGPDCRLGSKDGRPFSGVTA 986
Query: 193 CFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
C DFCAH+H+D HNM+NG TV L+ D+
Sbjct: 987 CIDFCAHAHKDTHNMHNGSTVVCTLTKEDN 1016
>gi|37360506|dbj|BAC98231.1| mKIAA1676 protein [Mus musculus]
Length = 625
Score = 271 bits (694), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 131/256 (51%), Positives = 168/256 (65%), Gaps = 35/256 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 67 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRERVDHHCSTAVIVVLILLWEGIPRL 126
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG+DP TCGASFSFGCSWSMY+NGC
Sbjct: 127 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGIDPKTCGASFSFGCSWSMYFNGC 186
Query: 120 KYARSKTVRKFRLS----------------------------------VRSEEQEIEEKM 145
K+ RS+ RKFRL+ + EE+++E+ +
Sbjct: 187 KFGRSENPRKFRLAPNYPLHNYYKRITGMSSEGSDVKTGWIIPDRKTLISREEKQLEKNL 246
Query: 146 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 205
LAT ++PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+H
Sbjct: 247 QELATVLAPLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIH 306
Query: 206 NMNNGCTVSVVLSNPD 221
NM+NG TV L D
Sbjct: 307 NMHNGSTVVCTLIRAD 322
>gi|392338375|ref|XP_003753513.1| PREDICTED: methylcytosine dioxygenase TET1 isoform 1 [Rattus
norvegicus]
Length = 2040
Score = 271 bits (694), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 129/252 (51%), Positives = 165/252 (65%), Gaps = 35/252 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1417 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRL 1476
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGC
Sbjct: 1477 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGC 1536
Query: 120 KYARSKTVRKFRLS----------------------------------VRSEEQEIEEKM 145
K+ RS RKFRL+ + EE+++EE +
Sbjct: 1537 KFGRSANPRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENL 1596
Query: 146 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 205
LAT ++P+YK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+H
Sbjct: 1597 QDLATVLAPVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIH 1656
Query: 206 NMNNGCTVSVVL 217
NMNNG TV L
Sbjct: 1657 NMNNGSTVVCTL 1668
>gi|392355330|ref|XP_003752007.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1
[Rattus norvegicus]
Length = 2038
Score = 271 bits (693), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 129/252 (51%), Positives = 165/252 (65%), Gaps = 35/252 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I++TGKEGK++QGCP+AKWVIRR+ EEK++ +V+ R H CSTA IVV+I+ WEG+P
Sbjct: 1415 IVFTGKEGKSSQGCPVAKWVIRRSGPEEKVICLVRERVDHYCSTAVIVVLILLWEGIPRL 1474
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC N+ RTC CQG +P TCGASFSFGCSWSMY+NGC
Sbjct: 1475 MADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQGTNPKTCGASFSFGCSWSMYFNGC 1534
Query: 120 KYARSKTVRKFRLS----------------------------------VRSEEQEIEEKM 145
K+ RS RKFRL+ + EE+++EE +
Sbjct: 1535 KFGRSANPRKFRLAPNYPLHDYYKRITGRCSEGSDVKTGWIIPERKTLISREEKQLEENL 1594
Query: 146 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 205
LAT ++P+YK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+H
Sbjct: 1595 QDLATVLAPVYKQMAPVAYQNQVEYEDIAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIH 1654
Query: 206 NMNNGCTVSVVL 217
NMNNG TV L
Sbjct: 1655 NMNNGSTVVCTL 1666
>gi|443702254|gb|ELU00383.1| hypothetical protein CAPTEDRAFT_102094, partial [Capitella teleta]
Length = 316
Score = 270 bits (690), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 124/226 (54%), Positives = 159/226 (70%), Gaps = 2/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTG+EGK+ QGCP+AKW++RR+S +EK ++IV+ R GHTC TA +V+ IV W+G+P
Sbjct: 47 VIYTGREGKSPQGCPVAKWILRRSSKDEKCMVIVRQRPGHTCPTAIMVIAIVVWDGIPET 106
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
Q+ G+Y L + L G T RRC TNE RTCACQG GASF+FGCSWSMY+NGCK
Sbjct: 107 QATGLYDYLRHTLPDNGHETERRCGTNEKRTCACQGWSDAVGGASFTFGCSWSMYFNGCK 166
Query: 121 YARSKT--VRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
YA+S V +FRL EE +E + LAT I PLYK +AP ++ N E EA++CR
Sbjct: 167 YAKSSDSKVHRFRLRDPMEEPILERHLQTLATDIGPLYKMVAPDSYANMTALEDEATDCR 226
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG++ GRPF GVTA DFCAH+H+D HNMNNGCTV L+ LE
Sbjct: 227 LGYRRGRPFGGVTAVVDFCAHAHKDQHNMNNGCTVVATLTKHRGLE 272
>gi|47223312|emb|CAF98696.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1615
Score = 266 bits (680), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 126/222 (56%), Positives = 151/222 (68%), Gaps = 28/222 (12%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWVIRR S EEKLL +V+ R GH C TA +V++I+AWEG+
Sbjct: 981 VVYTGKEGKSSQGCPIAKWVIRRDSEEEKLLCLVRRRPGHCCDTAVLVILILAWEGISRP 1040
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+DG+Y LT L KYG PT+RRCA NE RTCACQGLDPDTCGASFSFGCSWSMY+NGCK
Sbjct: 1041 VADGLYQELTRTLFKYGSPTSRRCALNEDRTCACQGLDPDTCGASFSFGCSWSMYFNGCK 1100
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ARSK RKFRL G + + + E +CRLG
Sbjct: 1101 FARSKVPRKFRLQ----------------------------GDYPEEVENEEAGRDCRLG 1132
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
+ GRPFSGVTAC DFCAH+HRD NMNNG TV L+ D+
Sbjct: 1133 QREGRPFSGVTACVDFCAHAHRDTQNMNNGSTVVCTLTKEDN 1174
>gi|449664940|ref|XP_002161163.2| PREDICTED: uncharacterized protein LOC100213294 [Hydra
magnipapillata]
Length = 1336
Score = 260 bits (665), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 153/221 (69%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+ +T EGK GCPLAKW+IRR S EEK L++V+H +GHTCS+ + V+VIVAWEG+
Sbjct: 372 VKHTSVEGKNGDGCPLAKWIIRRTSDEEKYLVVVRHHEGHTCSSTFTVIVIVAWEGISKQ 431
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y LT LN+ G T RRC+ NE +TC CQG ++ GASFSFGCSWSM+++GCK
Sbjct: 432 YADDMYRYLTKTLNESGFRTRRRCSANESKTCLCQGEVEESQGASFSFGCSWSMFFDGCK 491
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ +S RKF++ +E+EIE+ + + T +SPL K AP + N FE A +CR+G
Sbjct: 492 FTKSTNARKFKMQDPVKEEEIEKVLQEMTTQVSPLLKIWAPKCYENMTHFEEIADKCRIG 551
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPD 221
GRPFSGVT C DFCAHSHRD+H++NNG T+ L P+
Sbjct: 552 LNKGRPFSGVTCCLDFCAHSHRDIHDLNNGTTMVCTLLKPN 592
>gi|297293151|ref|XP_001082840.2| PREDICTED: probable methylcytosine dioxygenase TET2-like [Macaca
mulatta]
Length = 1973
Score = 255 bits (652), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 123/226 (54%), Positives = 153/226 (67%), Gaps = 28/226 (12%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A I + IV
Sbjct: 1178 VIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERGGHTCEAAVISIGIV-------- 1229
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++ +LN RTCACQGLDP+TCGASFSFGCSWSMYYNGCK
Sbjct: 1230 -----LCVVMPRLNTE-------------RTCACQGLDPETCGASFSFGCSWSMYYNGCK 1271
Query: 121 YARSKTVRKFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECR 178
+ARSK RKF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECR
Sbjct: 1272 FARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECR 1331
Query: 179 LGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
LG K GRPFSGVTAC DFCAH+HRDLHNM NG T+ L+ D+ E
Sbjct: 1332 LGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNRE 1377
>gi|156389231|ref|XP_001634895.1| predicted protein [Nematostella vectensis]
gi|156221983|gb|EDO42832.1| predicted protein [Nematostella vectensis]
Length = 256
Score = 253 bits (646), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 116/226 (51%), Positives = 155/226 (68%), Gaps = 1/226 (0%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YT KEG+ QGCP+A+WVIRR+ +EK+L++V+ R GH CS A +V +V WEG+
Sbjct: 14 IIYTNKEGRNAQGCPIARWVIRRSGNDEKVLVLVRKRPGHHCSMALVVTSVVIWEGISEE 73
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+ +Y L+ + + PT RRC N+ ++CACQG+ DTCGASFSFGCSW+MY+NGCK
Sbjct: 74 RGHSLYKELSGLIPENAAPTIRRCGLNDSKSCACQGVGEDTCGASFSFGCSWNMYFNGCK 133
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ARSK+ RK++L S+E+ +E + +AT I+P+Y AP AF NQ + ER ECR+G
Sbjct: 134 FARSKSPRKYKLLDSSKEETLERILEGIATEIAPVYSKAAPVAFANQTREERNGHECRIG 193
Query: 181 FKP-GRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLEP 225
GRPFSGVT C DFCAHSHRD NM+ G TV L P +P
Sbjct: 194 HSAVGRPFSGVTCCMDFCAHSHRDKQNMDGGATVVCTLLKPGCAQP 239
>gi|198433354|ref|XP_002125458.1| PREDICTED: similar to Protein TET2 [Ciona intestinalis]
Length = 1706
Score = 249 bits (635), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 121/227 (53%), Positives = 155/227 (68%), Gaps = 6/227 (2%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+ YTGKEGKT++GCP+AKWV+RR+S +EK++++ + R GH C TA +VVVI+ WEGV
Sbjct: 818 VCYTGKEGKTSRGCPIAKWVLRRSSEQEKIMVVCRQRPGHRCITAVMVVVIMLWEGVSRP 877
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D Y T + G T RRC TNE RTCACQG DP+ GAS+SFGCSWSMYYNGCK
Sbjct: 878 LADFSYNKCTQLIPTNGTATERRCGTNEERTCACQGFDPEKGGASYSFGCSWSMYYNGCK 937
Query: 121 YARSKTVRKFRLSVRSE---EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
+ARS KF+L+ + E + + LA+ +S LYK AP A NQ + E E EC
Sbjct: 938 FARSTKPNKFKLNGTKDSNAESCVADFCQRLASAMSVLYKTAAPDAHMNQIERECEGQEC 997
Query: 178 RLGFKP---GRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPD 221
RLG+ P GRPFSGVT C DFCAH+H+D HNM NG T+ + L+ P+
Sbjct: 998 RLGYNPPNEGRPFSGVTCCMDFCAHAHKDQHNMENGTTLVLTLTKPE 1044
>gi|354475486|ref|XP_003499959.1| PREDICTED: methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 1956
Score = 246 bits (629), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 118/225 (52%), Positives = 157/225 (69%), Gaps = 3/225 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I Y GKE K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ +
Sbjct: 1379 IEYMGKESKSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPP 1438
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y +T+ L Y G PT RRC NE RTC CQGL+P TCGASFSFGCSWSMY NGC
Sbjct: 1439 LADHLYDEITDNLRSYSGHPTDRRCTFNEKRTCTCQGLNPRTCGASFSFGCSWSMYLNGC 1498
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ RS RKF+L+ E++IE ++ +A T++P+YK +AP A+ NQ ++E A++C
Sbjct: 1499 KFGRSPNPRKFKLAPNYPLNEKKIEGILNKVADTLAPIYKQMAPVAYQNQVKYEDVAADC 1558
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVT C DFCAHSH+D HNM NG TV + L D+
Sbjct: 1559 RLGTKKGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLLRKDA 1603
>gi|380805809|gb|AFE74780.1| methylcytosine dioxygenase TET3, partial [Macaca mulatta]
Length = 348
Score = 242 bits (617), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/186 (60%), Positives = 138/186 (74%), Gaps = 2/186 (1%)
Query: 39 GHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLD 98
GH C A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG D
Sbjct: 1 GHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKD 60
Query: 99 PDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLY 156
P+TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLY
Sbjct: 61 PNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLY 120
Query: 157 KALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVV 216
K LAP A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTV
Sbjct: 121 KRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCT 180
Query: 217 LSNPDS 222
L+ D+
Sbjct: 181 LTKEDN 186
>gi|68342456|gb|AAY90126.1| Ayu17-449 [Mus musculus]
Length = 1919
Score = 237 bits (604), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 125/235 (53%), Positives = 157/235 (66%), Gaps = 13/235 (5%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++QGCP+AKWV RR+S EEKLL +V+ R HTC TA +V+ V
Sbjct: 1093 VIYTGKEGKSSQGCPIAKWVYRRSSEEEKLLCLVRVRPNHTCETAVMVIASVVGRNPKAT 1152
Query: 61 QSDGV---YAILTN--KLNKYGL--PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWS 113
+ + Y L +++ L T++ + R C CQG +P+TCGASFSFGCSWS
Sbjct: 1153 RIRTLLRTYRYLGQVWHMHQPSLFSDETKKKQSPPSRNCCCQGENPETCGASFSFGCSWS 1212
Query: 114 MYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM--HL--LATTISPLYKALAPGAFTNQCQ 169
MYYNGCK+ARSK RKFRL R E + EE++ HL LAT I+P+YK LAP A+ NQ +
Sbjct: 1213 MYYNGCKFARSKKPRKFRL--RGAEPKEEERLGSHLQNLATVIAPIYKKLAPDAYNNQVE 1270
Query: 170 FEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
FE +A +C LG K GRPFSGVTAC DF AHSHRD NM NG TV V L+ D+ E
Sbjct: 1271 FEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQNMPNGSTVVVTLNREDNRE 1325
>gi|340371755|ref|XP_003384410.1| PREDICTED: methylcytosine dioxygenase TET1-like [Amphimedon
queenslandica]
Length = 1077
Score = 233 bits (595), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 106/216 (49%), Positives = 143/216 (66%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I YTG E KT++GCP A+WV+RR S EEK L++ +H GH+C + VV IV W+ +
Sbjct: 216 ITYTGIEAKTSEGCPTAEWVVRRKSKEEKFLVLYRHHIGHSCDEQYTVVSIVYWDALTPE 275
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
++ Y L L + G PT R+C N+ +TC+CQG D GAS+SFGCSWS+YY+GCK
Sbjct: 276 RAGYTYNKLVEILPQNGFPTPRKCEFNDSKTCSCQGDDKTVHGASYSFGCSWSVYYDGCK 335
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ +SK RKF+L V +E E+E + LAT ++PLYK LAP A++NQ + ECR+G
Sbjct: 336 FGKSKIPRKFKLQVPEKEPELEGNVDELATYLAPLYKRLAPKAYSNQVATQASGEECRIG 395
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVV 216
P +PFSG+T C D+CAHSH D HNM +G VV
Sbjct: 396 LGPEKPFSGMTCCMDYCAHSHYDKHNMPDGGATVVV 431
>gi|441614500|ref|XP_004088220.1| PREDICTED: LOW QUALITY PROTEIN: methylcytosine dioxygenase TET1-like
[Nomascus leucogenys]
Length = 1989
Score = 223 bits (569), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 142/226 (62%), Gaps = 4/226 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+++TG EGK++ CP+ KWV+ R+S EK L V R GH C TA IV++I+ W+G
Sbjct: 1319 VVHTGNEGKSSNRCPIIKWVLTRSSDTEKAXL-VXQRTGHYCPTAVIVMLIMVWDGNHFP 1377
Query: 61 QSDGVYAILTNKLNKYGL-PTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L + PT RRC + RTC C G+DP+TCGASFSFGCSWSMY+N C
Sbjct: 1378 VADWLYTELTENLRSXNMHPTNRRCTLHXNRTCTCXGIDPETCGASFSFGCSWSMYFNDC 1437
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
K+ R + R+FR+ S E+ +E+ + LAT + P+YK AP A+ NQ + E A EC
Sbjct: 1438 KFGRGPSCRRFRIDSSSLLHEKNLEDNLQSLATQLVPIYKQHAPLAYQNQVEHENVAXEC 1497
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSL 223
LG K FSG+ AC DF A HRD+HNMNNG TV L D+
Sbjct: 1498 XLGSKDSFSFSGIIACLDFSAQPHRDIHNMNNGSTVVCTLIQEDNF 1543
>gi|395820931|ref|XP_003783809.1| PREDICTED: methylcytosine dioxygenase TET1 [Otolemur garnettii]
Length = 2169
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 99/174 (56%), Positives = 127/174 (72%), Gaps = 3/174 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCPLAKWVIRR+S EEK+L +V+ R GH C A +VV+I+ WEG+PL
Sbjct: 1541 VVYTGKEGKSSHGCPLAKWVIRRSSKEEKVLCLVRKRIGHRCPAAVMVVLIMVWEGIPLP 1600
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1601 MADRLYTELTESLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1660
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 171
K+ RS + R+FR+ S E+ +E+ + LAT + PLY+ AP A+ NQ FE
Sbjct: 1661 KFGRSPSPRRFRIDPSSPVHEKNLEDNLQGLATVLGPLYQQYAPVAYQNQVHFE 1714
>gi|403274103|ref|XP_003928828.1| PREDICTED: methylcytosine dioxygenase TET1 [Saimiri boliviensis
boliviensis]
Length = 2088
Score = 217 bits (552), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 101/187 (54%), Positives = 134/187 (71%), Gaps = 5/187 (2%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1467 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1526
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1527 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1586
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQ-CQFEREASE 176
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ C RE +
Sbjct: 1587 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVCTLTREDNR 1646
Query: 177 CRLGFKP 183
LG P
Sbjct: 1647 S-LGVVP 1652
>gi|297301263|ref|XP_002805756.1| PREDICTED: methylcytosine dioxygenase TET1-like [Macaca mulatta]
Length = 1972
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 95/171 (55%), Positives = 127/171 (74%), Gaps = 3/171 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1465 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1524
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 1525 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 1584
Query: 120 KYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQC 168
K+ RS + R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ
Sbjct: 1585 KFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQV 1635
>gi|301610531|ref|XP_002934823.1| PREDICTED: probable methylcytosine dioxygenase TET2 [Xenopus
(Silurana) tropicalis]
Length = 1737
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/227 (46%), Positives = 144/227 (63%), Gaps = 27/227 (11%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK+ QGCP+AKWVIRR+ +EK+L +V+ R GH+C TA IV++I+ WEG+ +
Sbjct: 980 VVYTGKEGKSAQGCPIAKWVIRRSGTDEKMLCLVRERAGHSCETAVIVILILVWEGISFS 1039
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNE---PRTCACQGLDPDTCGASFSFGCSWSMYYN 117
+D +Y+ LT LNKYG T RRCA NE + +G+ T G +++F
Sbjct: 1040 LADRLYSELTETLNKYGTLTNRRCARNEEVWEESGVLRGISGIT-GRTYTF--------- 1089
Query: 118 GCKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASEC 177
L+ S E+++E + L+T ++P+YK LAP A+ NQ + E A +C
Sbjct: 1090 --------------LADSSLEEKLEANLQHLSTLMAPIYKKLAPDAYHNQIEHEHRAPDC 1135
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLE 224
RLG K GRPFSGVTAC DFCAHSHRDLHNM NG T+ L+ D+ E
Sbjct: 1136 RLGLKEGRPFSGVTACLDFCAHSHRDLHNMQNGSTLVCTLTREDNRE 1182
>gi|349604556|gb|AEQ00074.1| Methylcytosine dioxygenase TET1-like protein, partial [Equus
caballus]
Length = 375
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 133/211 (63%), Gaps = 32/211 (15%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I +G+PL
Sbjct: 165 VVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIWYGDGIPLP 224
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGC
Sbjct: 225 MADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGC 284
Query: 120 KYARSKTVRKFRLSVRSE-------------------------------EQEIEEKMHLL 148
K+ RS + R+FR+ S E+ +E+ + L
Sbjct: 285 KFGRSPSPRRFRIDPSSPLHNYYERITKGRNPERRYMKPEPICPGHEAMEKNLEDNLQSL 344
Query: 149 ATTISPLYKALAPGAFTNQCQFEREASECRL 179
AT ++P+YK AP A+ NQ ++E A ECRL
Sbjct: 345 ATRLAPIYKQYAPVAYQNQVEYEHVARECRL 375
>gi|348564585|ref|XP_003468085.1| PREDICTED: methylcytosine dioxygenase TET2-like [Cavia porcellus]
Length = 1937
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 102/225 (45%), Positives = 134/225 (59%), Gaps = 45/225 (20%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I+YTGKEGK++QGCP+AKWV RR+S +EKLL +V+ R GHTCS A I+V+I+ W+ +P +
Sbjct: 1175 IIYTGKEGKSSQGCPIAKWVFRRSSSKEKLLCLVRERTGHTCSAAVILVMIMVWDAIPRS 1234
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
+D +Y L L+K+G T RRCA NE SW
Sbjct: 1235 LADQLYTELRETLHKHGTLTNRRCALNE--------------------ETSW-------- 1266
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
E+++E + LAT I+P+YK LAP A+ NQ ++E A +CRLG
Sbjct: 1267 -----------------EEKLESHLQNLATLIAPIYKKLAPDAYNNQVEYEHRAPDCRLG 1309
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDSLEP 225
K GRPFSGVTAC DFCAH+HRDLHNM NG TV L+ D+ +P
Sbjct: 1310 LKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTVVCTLTREDNRDP 1354
>gi|359080787|ref|XP_003588047.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
Length = 2105
Score = 203 bits (516), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 132/223 (59%), Gaps = 31/223 (13%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGC 119
+D +Y+ LT L Y G PT RRC NE W +
Sbjct: 1529 MADKLYSQLTESLKSYNGHPTDRRCTLNE----------------------KWVVV---- 1562
Query: 120 KYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRL 179
V +R E+ +E+ + LAT ++P+YK AP A+ NQ E A ECRL
Sbjct: 1563 ----GTDVEMMTREIRYREKNLEDNLQSLATELAPIYKQYAPAAYQNQVALEHIARECRL 1618
Query: 180 GFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
G K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1619 GKKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1661
>gi|358419451|ref|XP_003584239.1| PREDICTED: methylcytosine dioxygenase TET1-like [Bos taurus]
Length = 2131
Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 100/226 (44%), Positives = 135/226 (59%), Gaps = 11/226 (4%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
++YTGKEGK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL
Sbjct: 1469 VVYTGKEGKSSNGCPVAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLP 1528
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNEP-RTCACQGLDPDTCGASFSFGCSWSMYYNG 118
+D +Y+ LT L Y G PT RRC NE + + ++ + +NG
Sbjct: 1529 MADKLYSQLTESLKSYNGHPTDRRCTLNENCKLLVLNNTSENEVQYNYQYNYQNQYVFNG 1588
Query: 119 CKYARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKAL--APGAFTNQCQFEREASE 176
CK+ RS + R+FR+ K H ++ L K A A ++ E A E
Sbjct: 1589 CKFXRSPSPRRFRIDPSLPYM----KKH---SSFPELRKDQCEAQQARESEVALEHIARE 1641
Query: 177 CRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
CRLG K GRPFSGVTAC DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1642 CRLGKKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDN 1687
>gi|355723851|gb|AES08026.1| tet oncoprotein family member 3 [Mustela putorius furo]
Length = 104
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 67/99 (67%), Positives = 79/99 (79%)
Query: 35 KHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCAC 94
+HR GH C A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCAC
Sbjct: 1 RHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCAC 60
Query: 95 QGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLS 133
QG DP TCGASFSFGCSWSMY+NGCKYARSKT RKFRL+
Sbjct: 61 QGKDPSTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLA 99
>gi|195998193|ref|XP_002108965.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
gi|190589741|gb|EDV29763.1| hypothetical protein TRIADDRAFT_52488 [Trichoplax adhaerens]
Length = 687
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 117/223 (52%), Gaps = 27/223 (12%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+ YTG E KT+ GCP A+W IRR S EKLL +V R+GHTC + +++ IVAW
Sbjct: 223 VEYTGVESKTSDGCPRAEWAIRRISKSEKLLALVHRRRGHTCKASVVLMAIVAW------ 276
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCK 120
DG++ N L + C CQG D GA+F+ G + +G K
Sbjct: 277 --DGIHPDRANVL----------------QDCHCQGTDNQREGAAFTLGNMYQTEDDGLK 318
Query: 121 YARSKTVRKFRLSVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLG 180
+ + ++L+ ++EQE+ E + LA ++P+YK AP A+ NQ +++ +
Sbjct: 319 IILNASA--YQLADSAKEQELAEALESLAADLAPVYKKFAPWAYNNQIKYQENCVGKSIN 376
Query: 181 FKPGRPFSGVTACFDFCAHSHRDLHNMNNGCT-VSVVLSNPDS 222
+ PFSGV DFCAH+H + +++G + V +L +P+
Sbjct: 377 EEKNGPFSGVICSLDFCAHNHVNTEGLDDGASMVCTLLKDPED 419
>gi|26350989|dbj|BAC39131.1| unnamed protein product [Mus musculus]
Length = 267
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/111 (58%), Positives = 79/111 (71%), Gaps = 2/111 (1%)
Query: 114 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 171
MY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E
Sbjct: 1 MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60
Query: 172 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 61 DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 111
>gi|66396578|gb|AAH96437.1| Tet3 protein [Mus musculus]
Length = 695
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 64/111 (57%), Positives = 78/111 (70%), Gaps = 2/111 (1%)
Query: 114 MYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFE 171
MY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP A+ NQ E
Sbjct: 1 MYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQAYQNQVTNE 60
Query: 172 REASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCT L+ D+
Sbjct: 61 DVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTAVCTLTKEDN 111
>gi|441657124|ref|XP_003258249.2| PREDICTED: methylcytosine dioxygenase TET1-like [Nomascus
leucogenys]
Length = 583
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 79/140 (56%), Gaps = 31/140 (22%)
Query: 114 MYYNGCKYARSKTVRKFRLSVRSE-------------------------------EQEIE 142
MY+NGCK+ RS + R+FR+ S E+ +E
Sbjct: 1 MYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPGHEAMEKNLE 60
Query: 143 EKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHR 202
+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K GRPFSGVTAC DFCAH HR
Sbjct: 61 DNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHR 120
Query: 203 DLHNMNNGCTVSVVLSNPDS 222
D+HNMNNG TV L+ D+
Sbjct: 121 DIHNMNNGSTVVCTLTREDN 140
>gi|241849264|ref|XP_002415674.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
gi|215509888|gb|EEC19341.1| hypothetical protein IscW_ISCW023647 [Ixodes scapularis]
Length = 750
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 49/88 (55%), Positives = 68/88 (77%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
+LY+GKEGKT+QGCP+AKW+IRR+ EK+L +++HR GH C +A+IV+ IVAWEGV +
Sbjct: 662 VLYSGKEGKTSQGCPVAKWIIRRSGPSEKVLAVLRHRPGHRCLSAYIVMAIVAWEGVQAD 721
Query: 61 QSDGVYAILTNKLNKYGLPTTRRCATNE 88
+D +Y +T+K +G PT RRC TNE
Sbjct: 722 MADDLYRTVTHKTVNFGFPTQRRCGTNE 749
>gi|10047157|dbj|BAB13372.1| KIAA1546 protein [Homo sapiens]
Length = 684
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 49/87 (56%), Positives = 63/87 (72%)
Query: 138 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 197
E+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRPFSGVTAC DFC
Sbjct: 1 EEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFC 60
Query: 198 AHSHRDLHNMNNGCTVSVVLSNPDSLE 224
AH+HRDLHNM NG T+ L+ D+ E
Sbjct: 61 AHAHRDLHNMQNGSTLVCTLTREDNRE 87
>gi|432106709|gb|ELK32361.1| Methylcytosine dioxygenase TET1 [Myotis davidii]
Length = 2018
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/85 (57%), Positives = 62/85 (72%)
Query: 138 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 197
E+ +E+ + LAT ++P+Y+ AP A+ NQ QFE A ECRLG K GRPFSGVTAC DFC
Sbjct: 1515 EKNLEDNLQSLATQLAPIYRQYAPVAYQNQIQFEHIARECRLGNKEGRPFSGVTACVDFC 1574
Query: 198 AHSHRDLHNMNNGCTVSVVLSNPDS 222
H+HRD+HNMNNG TV L+ D+
Sbjct: 1575 THAHRDIHNMNNGSTVVCTLTREDN 1599
Score = 37.7 bits (86), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 22/30 (73%), Gaps = 3/30 (10%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKL 30
++YTGKE K++ GCP+AKW +LE+ L
Sbjct: 1496 VIYTGKEAKSSHGCPVAKW---EKNLEDNL 1522
>gi|355723824|gb|AES08017.1| tet oncoprotein 1 [Mustela putorius furo]
Length = 70
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/70 (64%), Positives = 53/70 (75%), Gaps = 1/70 (1%)
Query: 55 EGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWS 113
+G+PL +D +Y LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWS
Sbjct: 1 DGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWS 60
Query: 114 MYYNGCKYAR 123
MY+NGCK+ R
Sbjct: 61 MYFNGCKFGR 70
>gi|351712963|gb|EHB15882.1| Methylcytosine dioxygenase TET1 [Heterocephalus glaber]
Length = 561
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 58/85 (68%)
Query: 138 EQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFC 197
E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G PFSGVTAC DF
Sbjct: 195 EKNLEDNLQNLATELAPIYKQYAPAAYQNQVEYEHVAQECRLGAKEGHPFSGVTACLDFS 254
Query: 198 AHSHRDLHNMNNGCTVSVVLSNPDS 222
AH H D+HNMN+ TV L+ D+
Sbjct: 255 AHLHWDIHNMNHRNTVVSTLAREDN 279
>gi|344237690|gb|EGV93793.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 337
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/64 (59%), Positives = 47/64 (73%)
Query: 159 LAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLS 218
+AP A+ NQ ++E A++CRLG K GRPFSGVT C DFCAHSH+D HNM NG TV + L
Sbjct: 1 MAPVAYQNQVKYEDVAADCRLGTKKGRPFSGVTCCMDFCAHSHKDNHNMINGSTVVLTLL 60
Query: 219 NPDS 222
D+
Sbjct: 61 RKDA 64
>gi|344237691|gb|EGV93794.1| Methylcytosine dioxygenase TET1 [Cricetulus griseus]
Length = 1466
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 57/89 (64%), Gaps = 1/89 (1%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLN 60
I Y GKE K+++GCP+ K V+R+ + +EK+L + + R GH C TA +VV IV W+ +
Sbjct: 1377 IEYMGKESKSSRGCPVVKTVLRQNNDDEKVLCLARERVGHHCQTAVMVVGIVLWQPISPP 1436
Query: 61 QSDGVYAILTNKLNKY-GLPTTRRCATNE 88
+D +Y +T+ L Y G PT RRC NE
Sbjct: 1437 LADHLYDEITDNLRSYSGHPTDRRCTFNE 1465
>gi|355723854|gb|AES08027.1| tet oncoprotein family member 3 [Mustela putorius furo]
Length = 91
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 35/45 (77%)
Query: 178 RLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
RLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTV L+ D+
Sbjct: 1 RLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 45
>gi|193227751|emb|CAQ60121.1| hypothetical protein [Homo sapiens]
Length = 96
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 15/99 (15%)
Query: 105 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRSEEQEIEEKM---------HLLATTISPL 155
SFSFGCSWSMY+NGCK+ RS + R+FR+ S E++ ++ ISP
Sbjct: 1 SFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHTYYERITKGRNPERRYMKPERISPG 60
Query: 156 YKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACF 194
++A+ C+ E LG P + CF
Sbjct: 61 HEAM------EDCEAENVWEMGGLGILTSVPITPRVVCF 93
>gi|355723821|gb|AES08016.1| tet oncoprotein 1 [Mustela putorius furo]
Length = 218
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 23/24 (95%)
Query: 1 ILYTGKEGKTTQGCPLAKWVIRRA 24
++YTGKEGK++QGCP+AKWV+RR
Sbjct: 195 VVYTGKEGKSSQGCPIAKWVLRRG 218
>gi|68161848|emb|CAD28467.3| hypothetical protein [Homo sapiens]
Length = 414
Score = 46.2 bits (108), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 194 FDFCAHSHRDLHNMNNGCTVSVVLSNPDS 222
DFCAH HRD+HNMNNG TV L+ D+
Sbjct: 1 LDFCAHPHRDIHNMNNGSTVVCTLTREDN 29
>gi|212716093|ref|ZP_03324221.1| hypothetical protein BIFCAT_01006 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
gi|212661460|gb|EEB22035.1| hypothetical protein BIFCAT_01006 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
Length = 399
Score = 39.7 bits (91), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 155 LYKALAPGAFTNQCQFEREASECRLGFK------PGRPFSG-VTACFDFCAHSHRDLHNM 207
++++LA A T C E E F+ G+ +G CFDFC RDLH+
Sbjct: 77 IFESLA--ARTRPCILESEPVYLEKVFRSIDMLLDGKQLTGQAKQCFDFCRKKFRDLHDK 134
Query: 208 NNGCTVSVVLSNPDSL 223
NNG + S+ + + D++
Sbjct: 135 NNGESYSIQMYDKDNV 150
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.134 0.432
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,662,219,213
Number of Sequences: 23463169
Number of extensions: 138140679
Number of successful extensions: 234984
Number of sequences better than 100.0: 221
Number of HSP's better than 100.0 without gapping: 221
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 234529
Number of HSP's gapped (non-prelim): 242
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 74 (33.1 bits)