BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6128
(529 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6N021|TET2_HUMAN Methylcytosine dioxygenase TET2 OS=Homo sapiens GN=TET2 PE=1 SV=3
Length = 2002
Score = 407 bits (1047), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)
Query: 214 EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
+ P C+C + + G +YTHLGA ++ +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187
Query: 274 KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC A IV++I+ WEG+PL+ +D +Y+
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247
Query: 334 LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
LT L KYG T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307
Query: 394 KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
KF+L EE+++E + L+T ++P YK LAP A+ NQ ++E A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367
Query: 452 FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
FSGVTAC DFCAH+HRDLHNM NG T+V +LT+ + KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427
Query: 509 EFGNKEAQEEKVNTGAIENLN 529
EFG+ EAQEEK +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448
>sp|O43151|TET3_HUMAN Methylcytosine dioxygenase TET3 OS=Homo sapiens GN=TET3 PE=2 SV=3
Length = 1660
Score = 398 bits (1022), Expect = e-110, Method: Compositional matrix adjust.
Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + + E P C C + + G YYTHLG+ ++ +R+
Sbjct: 668 LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 724 MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 784 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + + LAT ++PLYK LAP
Sbjct: 844 SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 904 AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + K P+DEQLHVLPLY M ++DEFG++E Q KV +GAI+ L
Sbjct: 964 RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008
>sp|Q8BG87|TET3_MOUSE Methylcytosine dioxygenase TET3 OS=Mus musculus GN=Tet3 PE=1 SV=3
Length = 1668
Score = 395 bits (1015), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)
Query: 190 VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
+K D +K +LD + ++E P C C + + G YYTHLG+ ++ +R+
Sbjct: 676 LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731
Query: 250 IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
+E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C
Sbjct: 732 MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 791
Query: 310 AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
A IV++I+AWEG+P + D +Y LT+ L KYG PT+RRC N+ RTCACQG DP+TCGA
Sbjct: 792 AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851
Query: 370 SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
SFSFGCSWSMY+NGCKYARSKT RKFRL+ + EE+ + LAT ++PLYK LAP
Sbjct: 852 SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911
Query: 428 AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
A+ NQ E A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK +
Sbjct: 912 AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971
Query: 486 RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
R + + P+DEQLHVLPLY M +DEFG++E Q KV++GAI+ L
Sbjct: 972 RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1016
>sp|Q8NFU7|TET1_HUMAN Methylcytosine dioxygenase TET1 OS=Homo sapiens GN=TET1 PE=1 SV=2
Length = 2136
Score = 386 bits (991), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)
Query: 213 TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
+E+P C C + + G YYTHLGA S+ +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475
Query: 273 GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL +D +Y
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535
Query: 333 ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
LT L Y G PT RRC NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595
Query: 392 VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
R+FR+ S E+ +E+ + LAT ++P+YK AP A+ NQ ++E A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655
Query: 450 RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+ +RSL P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715
Query: 507 SDEFGNKEAQEEKVNTGAIENL 528
+DEFG+KE E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737
>sp|Q3URK3|TET1_MOUSE Methylcytosine dioxygenase TET1 OS=Mus musculus GN=Tet1 PE=1 SV=2
Length = 2007
Score = 372 bits (955), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 242/350 (69%), Gaps = 12/350 (3%)
Query: 187 PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
P T A+ + ++D + + N+ E C C + E G YYTHLGA
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392
Query: 242 SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
S+ +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+ EEKL+ +V+
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452
Query: 302 RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
R H CSTA IVV+I+ WEG+P +D +Y LT L Y G PT RRC N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512
Query: 361 GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTIS 418
G+DP TCGASFSFGCSWSMY+NGCK+ RS+ RKFRL+ E+++E+ + LAT ++
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLA 1572
Query: 419 PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
PLYK +AP A+ NQ ++E A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 1573 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 1632
Query: 479 VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
V +L + R + P+DEQLHVLPLY + D+DEFG+ E + K+ +GAI+
Sbjct: 1633 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 1682
>sp|Q4JK59|TET2_MOUSE Methylcytosine dioxygenase TET2 OS=Mus musculus GN=Tet2 PE=1 SV=3
Length = 1912
Score = 367 bits (942), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)
Query: 233 YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
YYTHLGA + +R +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119
Query: 293 EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
EKLL +V+ R HTC TA +V+ I+ W+G+P + +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1179
Query: 353 EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK RKFRL + EE+ + +
Sbjct: 1180 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 1239
Query: 411 HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD
Sbjct: 1240 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 1299
Query: 471 NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
NM NG TVVV+L + + +KP+DEQ HVLP+YI+ DEFG+ E QE+K+ G+IE
Sbjct: 1300 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 1359
Query: 528 L 528
L
Sbjct: 1360 L 1360
>sp|Q8NZ80|C5AP_STRP8 C5a peptidase OS=Streptococcus pyogenes serotype M18 (strain
MGAS8232) GN=scpA PE=3 SV=1
Length = 1150
Score = 32.7 bits (73), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)
Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
D KK++ +A G S+V SAG GG T P ++H G+ D L
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331
Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
P+ E ATVK D KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353
>sp|P0DD35|C5AP_STRPQ C5a peptidase OS=Streptococcus pyogenes serotype M3 (strain SSI-1)
GN=scpA PE=3 SV=1
Length = 1169
Score = 32.3 bits (72), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)
Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
D KK++ +A G S+V SAG GG T P ++H G+ D L
Sbjct: 275 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 333
Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
P+ E ATVK D KEM
Sbjct: 334 SPDKQLTETATVKTADQQDKEM 355
>sp|P0DD34|C5AP_STRP3 C5a peptidase OS=Streptococcus pyogenes serotype M3 (strain ATCC
BAA-595 / MGAS315) GN=scpA PE=3 SV=1
Length = 1169
Score = 32.3 bits (72), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)
Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
D KK++ +A G S+V SAG GG T P ++H G+ D L
Sbjct: 275 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 333
Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
P+ E ATVK D KEM
Sbjct: 334 SPDKQLTETATVKTADQQDKEM 355
>sp|Q5X9R0|C5AP_STRP6 C5a peptidase OS=Streptococcus pyogenes serotype M6 (strain ATCC
BAA-946 / MGAS10394) GN=scpA PE=3 SV=1
Length = 1184
Score = 32.3 bits (72), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)
Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
D KK++ +A G S+V SAG GG T P ++H G+ D L
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331
Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
P+ E ATVK D KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353
>sp|P58099|C5AP_STRP1 C5a peptidase OS=Streptococcus pyogenes serotype M1 GN=scpA PE=3
SV=1
Length = 1181
Score = 32.3 bits (72), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)
Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
D KK++ +A G S+V SAG GG T P ++H G+ D L
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331
Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
P+ E ATVK D KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.134 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 223,419,117
Number of Sequences: 539616
Number of extensions: 10187648
Number of successful extensions: 21340
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 21308
Number of HSP's gapped (non-prelim): 22
length of query: 529
length of database: 191,569,459
effective HSP length: 122
effective length of query: 407
effective length of database: 125,736,307
effective search space: 51174676949
effective search space used: 51174676949
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (29.3 bits)