BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy6128
         (529 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q6N021|TET2_HUMAN Methylcytosine dioxygenase TET2 OS=Homo sapiens GN=TET2 PE=1 SV=3
          Length = 2002

 Score =  407 bits (1047), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 243/321 (75%), Gaps = 6/321 (1%)

Query: 214  EVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEG 273
            + P C+C     +  + G +YTHLGA  ++  +R+ +EER G KGKA+R+E+++YTGKEG
Sbjct: 1129 DFPSCRC-VEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEG 1187

Query: 274  KTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAI 333
            K++QGCP+AKWV+RR+S EEKLL +V+ R GHTC  A IV++I+ WEG+PL+ +D +Y+ 
Sbjct: 1188 KSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSE 1247

Query: 334  LTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVR 393
            LT  L KYG  T RRCA NE RTCACQGLDP+TCGASFSFGCSWSMYYNGCK+ARSK  R
Sbjct: 1248 LTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPR 1307

Query: 394  KFRL--SVRSEEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRP 451
            KF+L      EE+++E  +  L+T ++P YK LAP A+ NQ ++E  A ECRLG K GRP
Sbjct: 1308 KFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRP 1367

Query: 452  FSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSD 508
            FSGVTAC DFCAH+HRDLHNM NG T+V +LT+  +     KP+DEQLHVLPLY + D D
Sbjct: 1368 FSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVD 1427

Query: 509  EFGNKEAQEEKVNTGAIENLN 529
            EFG+ EAQEEK  +GAI+ L+
Sbjct: 1428 EFGSVEAQEEKKRSGAIQVLS 1448


>sp|O43151|TET3_HUMAN Methylcytosine dioxygenase TET3 OS=Homo sapiens GN=TET3 PE=2 SV=3
          Length = 1660

 Score =  398 bits (1022), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 194/345 (56%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     + E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 668  LKYLDTPTKSLLDTPAK---RAQAEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 723

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +EER G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 724  MEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 783

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 784  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 843

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ + +    LAT ++PLYK LAP 
Sbjct: 844  SFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQ 903

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPF+GVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 904  AYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 963

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + K P+DEQLHVLPLY M ++DEFG++E Q  KV +GAI+ L 
Sbjct: 964  RCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLT 1008


>sp|Q8BG87|TET3_MOUSE Methylcytosine dioxygenase TET3 OS=Mus musculus GN=Tet3 PE=1 SV=3
          Length = 1668

 Score =  395 bits (1015), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 193/345 (55%), Positives = 252/345 (73%), Gaps = 9/345 (2%)

Query: 190  VKAEDPNSKEMLDHIERLKNNMRTEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKD 249
            +K  D  +K +LD   +     ++E P C C     +  + G YYTHLG+  ++  +R+ 
Sbjct: 676  LKYLDTPTKSLLDTPAK---KAQSEFPTCDCV-EQIVEKDEGPYYTHLGSGPTVASIREL 731

Query: 250  IEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCST 309
            +E+R G KGKA+R+EK++YTGKEGK+++GCP+AKWVIRR +LEEKLL +V+HR GH C  
Sbjct: 732  MEDRYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQN 791

Query: 310  AWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATNEPRTCACQGLDPDTCGA 369
            A IV++I+AWEG+P +  D +Y  LT+ L KYG PT+RRC  N+ RTCACQG DP+TCGA
Sbjct: 792  AVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGA 851

Query: 370  SFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPG 427
            SFSFGCSWSMY+NGCKYARSKT RKFRL+  +  EE+ +      LAT ++PLYK LAP 
Sbjct: 852  SFSFGCSWSMYFNGCKYARSKTPRKFRLTGDNPKEEEVLRNSFQDLATEVAPLYKRLAPQ 911

Query: 428  AFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--H 485
            A+ NQ   E  A +CRLG K GRPFSGVTAC DFCAH+H+D HN+ NGCTVV +LTK  +
Sbjct: 912  AYQNQVTNEDVAIDCRLGLKEGRPFSGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDN 971

Query: 486  RSLSK-PDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIENLN 529
            R + + P+DEQLHVLPLY M  +DEFG++E Q  KV++GAI+ L 
Sbjct: 972  RCVGQIPEDEQLHVLPLYKMASTDEFGSEENQNAKVSSGAIQVLT 1016


>sp|Q8NFU7|TET1_HUMAN Methylcytosine dioxygenase TET1 OS=Homo sapiens GN=TET1 PE=1 SV=2
          Length = 2136

 Score =  386 bits (991), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 235/322 (72%), Gaps = 7/322 (2%)

Query: 213  TEVPDCKCFASDKLPPEPGSYYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKE 272
            +E+P C C     +  + G YYTHLGA  S+  +R+ +E R G KG A+R+E ++YTGKE
Sbjct: 1417 SELPTCSCL-DRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKE 1475

Query: 273  GKTTQGCPLAKWVIRRASLEEKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYA 332
            GK++ GCP+AKWV+RR+S EEK+L +V+ R GH C TA +VV+I+ W+G+PL  +D +Y 
Sbjct: 1476 GKSSHGCPIAKWVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYT 1535

Query: 333  ILTNKLNKY-GLPTTRRCATNEPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKT 391
             LT  L  Y G PT RRC  NE RTC CQG+DP+TCGASFSFGCSWSMY+NGCK+ RS +
Sbjct: 1536 ELTENLKSYNGHPTDRRCTLNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPS 1595

Query: 392  VRKFRLSVRS--EEQEIEEKMHLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPG 449
             R+FR+   S   E+ +E+ +  LAT ++P+YK  AP A+ NQ ++E  A ECRLG K G
Sbjct: 1596 PRRFRIDPSSPLHEKNLEDNLQSLATRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEG 1655

Query: 450  RPFSGVTACFDFCAHSHRDLHNMNNGCTVVVSLTK--HRSLSK-PDDEQLHVLPLYIMDD 506
            RPFSGVTAC DFCAH HRD+HNMNNG TVV +LT+  +RSL   P DEQLHVLPLY + D
Sbjct: 1656 RPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSD 1715

Query: 507  SDEFGNKEAQEEKVNTGAIENL 528
            +DEFG+KE  E K+ +GAIE L
Sbjct: 1716 TDEFGSKEGMEAKIKSGAIEVL 1737


>sp|Q3URK3|TET1_MOUSE Methylcytosine dioxygenase TET1 OS=Mus musculus GN=Tet1 PE=1 SV=2
          Length = 2007

 Score =  372 bits (955), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/350 (51%), Positives = 242/350 (69%), Gaps = 12/350 (3%)

Query: 187  PATVKAEDPNSKEMLDHIERLKNNM-----RTEVPDCKCFASDKLPPEPGSYYTHLGAAA 241
            P T  A+    + ++D + +   N+       E   C C    +   E G YYTHLGA  
Sbjct: 1335 PTTDSAQSEFKESIMDLLSKPAKNLIAGLKEQEAAPCDCDGGTQ--KEKGPYYTHLGAGP 1392

Query: 242  SLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLEEKLLLIVKH 301
            S+  +R+ +E R G KGKA+R+EKI++TGKEGK++QGCP+AKWVIRR+  EEKL+ +V+ 
Sbjct: 1393 SVAAVRELMETRFGQKGKAIRIEKIVFTGKEGKSSQGCPVAKWVIRRSGPEEKLICLVRE 1452

Query: 302  RQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKY-GLPTTRRCATNEPRTCACQ 360
            R  H CSTA IVV+I+ WEG+P   +D +Y  LT  L  Y G PT RRC  N+ RTC CQ
Sbjct: 1453 RVDHHCSTAVIVVLILLWEGIPRLMADRLYKELTENLRSYSGHPTDRRCTLNKKRTCTCQ 1512

Query: 361  GLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRLSVRS--EEQEIEEKMHLLATTIS 418
            G+DP TCGASFSFGCSWSMY+NGCK+ RS+  RKFRL+      E+++E+ +  LAT ++
Sbjct: 1513 GIDPKTCGASFSFGCSWSMYFNGCKFGRSENPRKFRLAPNYPLHEKQLEKNLQELATVLA 1572

Query: 419  PLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLHNMNNGCTV 478
            PLYK +AP A+ NQ ++E  A +CRLG + GRPFSGVT C DFCAHSH+D+HNM+NG TV
Sbjct: 1573 PLYKQMAPVAYQNQVEYEEVAGDCRLGNEEGRPFSGVTCCMDFCAHSHKDIHNMHNGSTV 1632

Query: 479  VVSLTKH--RSLSKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIE 526
            V +L +   R  + P+DEQLHVLPLY + D+DEFG+ E  + K+ +GAI+
Sbjct: 1633 VCTLIRADGRDTNCPEDEQLHVLPLYRLADTDEFGSVEGMKAKIKSGAIQ 1682


>sp|Q4JK59|TET2_MOUSE Methylcytosine dioxygenase TET2 OS=Mus musculus GN=Tet2 PE=1 SV=3
          Length = 1912

 Score =  367 bits (942), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 221/301 (73%), Gaps = 5/301 (1%)

Query: 233  YYTHLGAAASLPDLRKDIEERSGYKGKALRMEKILYTGKEGKTTQGCPLAKWVIRRASLE 292
            YYTHLGA   +  +R  +EER G KGKA+R+EK++YTGKEGK++QGCP+AKWV RR+S E
Sbjct: 1060 YYTHLGAGPDVAAIRTLMEERYGEKGKAIRIEKVIYTGKEGKSSQGCPIAKWVYRRSSEE 1119

Query: 293  EKLLLIVKHRQGHTCSTAWIVVVIVAWEGVPLNQSDGVYAILTNKLNKYGLPTTRRCATN 352
            EKLL +V+ R  HTC TA +V+ I+ W+G+P   +  +Y+ LT+ L K G+ T RRC+ N
Sbjct: 1120 EKLLCLVRVRPNHTCETAVMVIAIMLWDGIPKLLASELYSELTDILGKCGICTNRRCSQN 1179

Query: 353  EPRTCACQGLDPDTCGASFSFGCSWSMYYNGCKYARSKTVRKFRL--SVRSEEQEIEEKM 410
            E R C CQG +P+TCGASFSFGCSWSMYYNGCK+ARSK  RKFRL  +   EE+ +   +
Sbjct: 1180 ETRNCCCQGENPETCGASFSFGCSWSMYYNGCKFARSKKPRKFRLHGAEPKEEERLGSHL 1239

Query: 411  HLLATTISPLYKALAPGAFTNQCQFEREASECRLGFKPGRPFSGVTACFDFCAHSHRDLH 470
              LAT I+P+YK LAP A+ NQ +FE +A +C LG K GRPFSGVTAC DF AHSHRD  
Sbjct: 1240 QNLATVIAPIYKKLAPDAYNNQVEFEHQAPDCCLGLKEGRPFSGVTACLDFSAHSHRDQQ 1299

Query: 471  NMNNGCTVVVSLTKHRSL---SKPDDEQLHVLPLYIMDDSDEFGNKEAQEEKVNTGAIEN 527
            NM NG TVVV+L +  +    +KP+DEQ HVLP+YI+   DEFG+ E QE+K+  G+IE 
Sbjct: 1300 NMPNGSTVVVTLNREDNREVGAKPEDEQFHVLPMYIIAPEDEFGSTEGQEKKIRMGSIEV 1359

Query: 528  L 528
            L
Sbjct: 1360 L 1360


>sp|Q8NZ80|C5AP_STRP8 C5a peptidase OS=Streptococcus pyogenes serotype M18 (strain
           MGAS8232) GN=scpA PE=3 SV=1
          Length = 1150

 Score = 32.7 bits (73), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)

Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
           D  KK++ +A   G  S+V SAG     GG T  P ++H   G+       D  L     
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331

Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
            P+    E ATVK  D   KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353


>sp|P0DD35|C5AP_STRPQ C5a peptidase OS=Streptococcus pyogenes serotype M3 (strain SSI-1)
           GN=scpA PE=3 SV=1
          Length = 1169

 Score = 32.3 bits (72), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)

Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
           D  KK++ +A   G  S+V SAG     GG T  P ++H   G+       D  L     
Sbjct: 275 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 333

Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
            P+    E ATVK  D   KEM
Sbjct: 334 SPDKQLTETATVKTADQQDKEM 355


>sp|P0DD34|C5AP_STRP3 C5a peptidase OS=Streptococcus pyogenes serotype M3 (strain ATCC
           BAA-595 / MGAS315) GN=scpA PE=3 SV=1
          Length = 1169

 Score = 32.3 bits (72), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)

Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
           D  KK++ +A   G  S+V SAG     GG T  P ++H   G+       D  L     
Sbjct: 275 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 333

Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
            P+    E ATVK  D   KEM
Sbjct: 334 SPDKQLTETATVKTADQQDKEM 355


>sp|Q5X9R0|C5AP_STRP6 C5a peptidase OS=Streptococcus pyogenes serotype M6 (strain ATCC
           BAA-946 / MGAS10394) GN=scpA PE=3 SV=1
          Length = 1184

 Score = 32.3 bits (72), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)

Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
           D  KK++ +A   G  S+V SAG     GG T  P ++H   G+       D  L     
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331

Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
            P+    E ATVK  D   KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353


>sp|P58099|C5AP_STRP1 C5a peptidase OS=Streptococcus pyogenes serotype M1 GN=scpA PE=3
           SV=1
          Length = 1181

 Score = 32.3 bits (72), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 36/82 (43%), Gaps = 4/82 (4%)

Query: 122 DSGKKSYVFAGEGGPCSLVDSAGPWCCRGGGTEPPTSEHLKDGLCQGMRTQDEMLEPK-- 179
           D  KK++ +A   G  S+V SAG     GG T  P ++H   G+       D  L     
Sbjct: 273 DETKKAFDYAKSKG-VSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASY 331

Query: 180 EPNNN-EEPATVKAEDPNSKEM 200
            P+    E ATVK  D   KEM
Sbjct: 332 SPDKQLTETATVKTADQQDKEM 353


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.316    0.134    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 223,419,117
Number of Sequences: 539616
Number of extensions: 10187648
Number of successful extensions: 21340
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 21308
Number of HSP's gapped (non-prelim): 22
length of query: 529
length of database: 191,569,459
effective HSP length: 122
effective length of query: 407
effective length of database: 125,736,307
effective search space: 51174676949
effective search space used: 51174676949
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (29.3 bits)