BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021252
(315 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O43670|ZN207_HUMAN Zinc finger protein 207 OS=Homo sapiens GN=ZNF207 PE=1 SV=1
Length = 478
Score = 142 bits (358), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 59/87 (67%), Positives = 72/87 (82%), Gaps = 1/87 (1%)
Query: 1 MGKKKKRVSSKVWCYYCDREFDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHK 60
MG+KKK+ K WC+YC+R+FDDEKIL+QHQKAKHFKCH+CHKKL T G+AIH +QVHK
Sbjct: 1 MGRKKKK-QLKPWCWYCNRDFDDEKILIQHQKAKHFKCHICHKKLYTGPGLAIHCMQVHK 59
Query: 61 ENVTKVPNAKPGRESTDIEIYGMQGIP 87
E + VPNA PGR ++EIYGM+GIP
Sbjct: 60 ETIDAVPNAIPGRTDIELEIYGMEGIP 86
>sp|Q5R8K4|ZN207_PONAB Zinc finger protein 207 OS=Pongo abelii GN=ZNF207 PE=2 SV=1
Length = 494
Score = 140 bits (353), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 59/87 (67%), Positives = 71/87 (81%), Gaps = 1/87 (1%)
Query: 1 MGKKKKRVSSKVWCYYCDREFDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHK 60
MG+KKK+ K WC+YC+R+FDDEKIL QHQKAKHFKCH+CHKKL T G+AIH +QVHK
Sbjct: 1 MGRKKKK-QLKPWCWYCNRDFDDEKILTQHQKAKHFKCHICHKKLYTGPGLAIHCMQVHK 59
Query: 61 ENVTKVPNAKPGRESTDIEIYGMQGIP 87
E + VPNA PGR ++EIYGM+GIP
Sbjct: 60 ETIDAVPNAIPGRTDIELEIYGMEGIP 86
>sp|P16952|SSP5_STRGN Agglutinin receptor OS=Streptococcus gordonii GN=ssp5 PE=1 SV=2
Length = 1500
Score = 45.1 bits (105), Expect = 7e-04, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 49/125 (39%), Gaps = 25/125 (20%)
Query: 125 LGVSYPPQSTLGMMQPIYSSAVPVPPA-------GWPVPP-----RPQPWYPQHAAVSI- 171
+GV P + QP+Y + P+ PA P PP +P+P P+
Sbjct: 754 IGVPTKPVAPTAPTQPMYETEKPLEPAPVAPSYENEPTPPVKTPDQPEPSKPEEPTYETE 813
Query: 172 ----PPPAAVGYAQQPLFPVQNVRPPLAS-------STSPALQPSPVVPPGMPSSTPPVT 220
P P A Y +P PV+ P S T L+P+PV P TPPV
Sbjct: 814 KPLEPAPVAPSYENEPTPPVKTPDQPEPSKPEEPNYETEKPLEPAPVAPSYENEPTPPVK 873
Query: 221 V-SQP 224
+ QP
Sbjct: 874 IPDQP 878
>sp|Q8NI51|CTCFL_HUMAN Transcriptional repressor CTCFL OS=Homo sapiens GN=CTCFL PE=1 SV=2
Length = 663
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 14 CYYCDRE-FDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHKENV 63
C Y R+ + ++ + H K ++CH+CH + + +G M IH+LQ H ENV
Sbjct: 375 CSYASRDTYKLKRHMRTHSGEKPYECHICHTRFTQSGTMKIHILQKHGENV 425
>sp|Q8IRH5|Y2199_DROME Zinc finger protein CG2199 OS=Drosophila melanogaster GN=CG2199
PE=1 SV=1
Length = 733
Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 9 SSKVWCYYCDREFDDEKILVQHQKAKH-------FKCHVCHKKLSTAGGMAIHVLQVHKE 61
S+ C CD E K + +H K H FKCHVC K L+T + H + +H +
Sbjct: 413 SNNFQCEICDCELMTAKQMQEHMKTVHSIDKPKVFKCHVCEKSLATKQSLKTH-MTLHAD 471
Query: 62 NVTKVPNAKPGR----ESTDIEIYGMQGIPPDVLAAHYGEEEEEVPSKMAK 108
+ PN+ + E D++I G I ++ ++ P+K AK
Sbjct: 472 GA-EAPNSSKRKILQDEDEDVDILGTTQIENTAEKVEGPKKSQQSPTKAAK 521
>sp|Q08705|CTCF_CHICK Transcriptional repressor CTCF OS=Gallus gallus GN=CTCF PE=1 SV=1
Length = 728
Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 13 WCYYCDRE-FDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHKENVTK 65
C Y R+ + ++ + H K ++C++CH + + +G M +H+LQ H ENV K
Sbjct: 383 LCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAK 436
Score = 34.3 bits (77), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 14 CYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIH 54
C YCD F + L+QHQK+ K FKC C M +H
Sbjct: 469 CRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMVMH 513
>sp|P49711|CTCF_HUMAN Transcriptional repressor CTCF OS=Homo sapiens GN=CTCF PE=1 SV=1
Length = 727
Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 13 WCYYCDRE-FDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHKENVTK 65
C Y R+ + ++ + H K ++C++CH + + +G M +H+LQ H ENV K
Sbjct: 383 LCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAK 436
Score = 33.9 bits (76), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 14 CYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIH 54
C YCD F + L+QHQK+ K FKC C M +H
Sbjct: 469 CRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMIMH 513
>sp|Q61164|CTCF_MOUSE Transcriptional repressor CTCF OS=Mus musculus GN=Ctcf PE=1 SV=2
Length = 736
Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 13 WCYYCDRE-FDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHKENVTK 65
C Y R+ + ++ + H K ++C++CH + + +G M +H+LQ H ENV K
Sbjct: 383 LCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAK 436
Score = 33.9 bits (76), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 14 CYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIH 54
C YCD F + L+QHQK+ K FKC C M +H
Sbjct: 469 CRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMIMH 513
>sp|Q9R1D1|CTCF_RAT Transcriptional repressor CTCF OS=Rattus norvegicus GN=Ctcf PE=2
SV=1
Length = 737
Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 13 WCYYCDRE-FDDEKILVQHQKAKHFKCHVCHKKLSTAGGMAIHVLQVHKENVTK 65
C Y R+ + ++ + H K ++C++CH + + +G M +H+LQ H ENV K
Sbjct: 383 LCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAK 436
Score = 33.5 bits (75), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 14 CYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIH 54
C YCD F + L+QHQK+ K FKC C M +H
Sbjct: 469 CRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMIMH 513
>sp|P39806|SALM_DROVI Homeotic protein spalt-major OS=Drosophila virilis GN=salm PE=2
SV=1
Length = 1402
Score = 41.2 bits (95), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 2 GKKKKRVSSKVWCYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIHVLQ 57
GK + K C YC + F + L H ++ + FKC+VC + +T G + +H Q
Sbjct: 422 GKGRNEPFFKHRCRYCGKVFGSDSALQIHIRSHTGERPFKCNVCGSRFTTKGNLKVH-FQ 480
Query: 58 VHKENVTKVP-NAKPGRESTD 77
H + VP NA P E D
Sbjct: 481 RHAQKFPHVPMNATPIPEHMD 501
Score = 39.7 bits (91), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 4 KKKRVSSKVWCYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIHVLQVH 59
K K++S C CDR + L H + + FKC +C + +T G + H + VH
Sbjct: 837 KNKKISDPNQCVVCDRVLSCKSALQMHYRTHTGERPFKCRICGRAFTTKGNLKTH-MAVH 895
Query: 60 K 60
K
Sbjct: 896 K 896
>sp|Q9NSC2|SALL1_HUMAN Sal-like protein 1 OS=Homo sapiens GN=SALL1 PE=1 SV=2
Length = 1324
Score = 40.8 bits (94), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 5/55 (9%)
Query: 11 KVWCYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIHVLQVHKE 61
K C +C + F + L H ++ + FKC++C + ST G + +H Q HKE
Sbjct: 448 KHKCRFCAKVFGSDSALQIHLRSHTGERPFKCNICGNRFSTKGNLKVH-FQRHKE 501
Score = 37.0 bits (84), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 11/81 (13%)
Query: 6 KRVSSKVWCYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIHVLQVHKE 61
+R + +C C + F L H++ K F C +C + +T G + +H+
Sbjct: 1128 RRTPKQHYCNTCGKTFSSSSALQIHERTHTGEKPFACTICGRAFTTKGNLKVHM------ 1181
Query: 62 NVTKVPNAKPGRESTDIEIYG 82
T + N+ P R + + G
Sbjct: 1182 -GTHMWNSTPARRGRRLSVDG 1201
>sp|Q9BXA9|SALL3_HUMAN Sal-like protein 3 OS=Homo sapiens GN=SALL3 PE=1 SV=2
Length = 1300
Score = 40.4 bits (93), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 5/55 (9%)
Query: 11 KVWCYYCDREFDDEKILVQHQKA----KHFKCHVCHKKLSTAGGMAIHVLQVHKE 61
K C +C + F + L H ++ + FKC++C + ST G + +H Q HKE
Sbjct: 419 KHKCRFCAKVFGSDSALQIHLRSHTGERPFKCNICGNRFSTKGNLKVH-FQRHKE 472
>sp|Q13029|PRDM2_HUMAN PR domain zinc finger protein 2 OS=Homo sapiens GN=PRDM2 PE=1 SV=3
Length = 1718
Score = 34.7 bits (78), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 56/138 (40%), Gaps = 20/138 (14%)
Query: 14 CYYCDREFDDEKILVQHQKAKH-----FKCHVCHKKLSTAGGMAIHVLQVHKENVTKVPN 68
C +C + F D+ L +H+ H F C VC K+ + + H +H + V
Sbjct: 1164 CEFCVQLFKDKTDLSEHRFLLHGVGNIFVCSVCKKEFAFLCNLQQHQRDLHPDKVCTHHE 1223
Query: 69 AKPG----RESTD---IEIYGMQGIPPDVLAAHYGEEEEEVPSKMAKVDTSFPQLVGGVV 121
+ G + TD + MQ +P D L +EEEE+ ++ T+ + G+
Sbjct: 1224 FESGTLRPQNFTDPSKAHVEHMQSLPEDPLET--SKEEEELNDSSEELYTTIKIMASGI- 1280
Query: 122 PGQLGVSYPPQSTLGMMQ 139
+ P LG+ Q
Sbjct: 1281 -----KTKDPDVRLGLNQ 1293
>sp|Q80YR4|ZN598_MOUSE Zinc finger protein 598 OS=Mus musculus GN=Znf598 PE=2 SV=1
Length = 908
Score = 33.5 bits (75), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 14 CYYCDREFDDEKILVQHQKAKHFKCHVC 41
C +CD + D L++H + H+ CH C
Sbjct: 187 CKFCDERYLDNDELLKHLRRDHYFCHFC 214
>sp|O88778|BSN_RAT Protein bassoon OS=Rattus norvegicus GN=Bsn PE=1 SV=3
Length = 3938
Score = 33.1 bits (74), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 58/157 (36%), Gaps = 30/157 (19%)
Query: 93 AHYGEEEEEVPSKMAKVDTSFPQLVGGVVPGQLGVSYPPQSTLGMMQPIYSSAV------ 146
A EEE+E+ +K+ ++ Q + ++G YPP LG + S +
Sbjct: 2957 AELDEEEKEIDAKLKYLELGITQRKESLAKDRVGRDYPPLRGLGEHRDYLSDSELNQLRL 3016
Query: 147 --PVPPAGWPVPPRPQPWYPQHAAVSIPPPAAVGYAQQPLFPVQNVRPPLASS------- 197
PAG V YP AAV P + QQP FP + SS
Sbjct: 3017 QGCTTPAGQYV------DYPASAAVPATPSGPTAF-QQPRFPPAATQYTAGSSGPTQNGF 3069
Query: 198 --------TSPALQPSPVVPPGMPSSTPPVTVSQPLF 226
T P+ P+P PPG P SQP F
Sbjct: 3070 LAHQAPTYTGPSTYPAPTYPPGTSYPAEPGLPSQPAF 3106
>sp|O88737|BSN_MOUSE Protein bassoon OS=Mus musculus GN=Bsn PE=1 SV=4
Length = 3942
Score = 31.6 bits (70), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 58/157 (36%), Gaps = 30/157 (19%)
Query: 93 AHYGEEEEEVPSKMAKVDTSFPQLVGGVVPGQLGVSYPPQSTLGMMQPIYSSAV------ 146
A EEE+E+ +K+ ++ Q + + G YPP LG + S +
Sbjct: 2972 AELDEEEKEIDAKLKYLELGITQRKESLAKDRGGRDYPPLRGLGEHRDYLSDSELNQLRL 3031
Query: 147 --PVPPAGWPVPPRPQPWYPQHAAVSIPPPAAVGYAQQPLFP-------------VQNVR 191
PAG V YP AAV P + QQP FP QN
Sbjct: 3032 QGCTTPAGQYV------DYPASAAVPATPSGPTAF-QQPRFPPAAPQYTAGSSGPTQNGF 3084
Query: 192 PP--LASSTSPALQPSPVVPPGMPSSTPPVTVSQPLF 226
P + T P+ P+P PPG P SQP F
Sbjct: 3085 PAHQAPTYTGPSTYPAPTYPPGTGYPAEPGLPSQPAF 3121
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.132 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 123,138,865
Number of Sequences: 539616
Number of extensions: 5776617
Number of successful extensions: 47631
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 251
Number of HSP's successfully gapped in prelim test: 1878
Number of HSP's that attempted gapping in prelim test: 33265
Number of HSP's gapped (non-prelim): 9529
length of query: 315
length of database: 191,569,459
effective HSP length: 117
effective length of query: 198
effective length of database: 128,434,387
effective search space: 25430008626
effective search space used: 25430008626
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)