BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046396
(314 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SEZ1|HB29_ARATH Transcription factor HB29 OS=Arabidopsis thaliana GN=HB29 PE=1 SV=1
Length = 242
Score = 217 bits (553), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 127/260 (48%), Positives = 155/260 (59%), Gaps = 43/260 (16%)
Query: 26 PKNKKSQSFYTSSSFVPPLQMVVSYKECLKNHAASLGGHALDGCGEFMPSPTATSNDPAS 85
P+ + S + +M V YKECLKNHAA+LGGHALDGCGEFMPSPTATS DP+S
Sbjct: 7 PQQQLLNSLPIAGELTVTGEMGVCYKECLKNHAANLGGHALDGCGEFMPSPTATSTDPSS 66
Query: 86 LNCAACGCHRNFHRRDPYEPNAAATHRLPPAATASHSTDPSTVPSPDTNTNSPQHHQPVT 145
L CAACGCHRNFHRRDP E T PP + SP + P H V+
Sbjct: 67 LRCAACGCHRNFHRRDPSENLNFLT--APP------------ISSPSGTESPPSRH--VS 110
Query: 146 SPTPCSYYSSAP--HMLLALSTGFSAPPDDGDDNHPRHHHYPQHQPPFNQVLAANSSDNG 203
SP PCSYY+SAP H++L+LS+GF P D Q P V + NSS
Sbjct: 111 SPVPCSYYTSAPPHHVILSLSSGFPGPSD---------------QDP-TVVRSENSSRGA 154
Query: 204 -KKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNKNA 262
+KR+RTKFT EQK KM +FAE+ GWK+N +EK ++FC+EVG+ R V KVWMHNNK
Sbjct: 155 MRKRTRTKFTPEQKIKMRAFAEKAGWKINGCDEKSVREFCNEVGIERGVLKVWMHNNK-- 212
Query: 263 SGRKDQRSINNKNINDIVNG 282
S+ N I +I +G
Sbjct: 213 ------YSLLNGKIREIEHG 226
>sp|Q9SB61|Y4466_ARATH ZF-HD homeobox protein At4g24660 OS=Arabidopsis thaliana
GN=At4g24660 PE=1 SV=1
Length = 220
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/219 (41%), Positives = 121/219 (55%), Gaps = 46/219 (21%)
Query: 48 VSYKECLKNHAASLGGHALDGCGEFMPSPTATSNDPASLNCAACGCHRNFHRRDPYEPNA 107
+ Y+ECLKNHA ++GGHA+DGC EFMPS + D +L CAACGCHRNFHR++ E
Sbjct: 47 IRYRECLKNHAVNIGGHAVDGCCEFMPSGEDGTLD--ALKCAACGCHRNFHRKET-ESIG 103
Query: 108 AATHRLPPAATASHSTDPSTVPSPDTNTNSPQHHQPVTSPTPCSYYSSAPHMLLALSTGF 167
HR+P ++ P PQ HQP P S A ++G
Sbjct: 104 GRAHRVP-----TYYNRP------------PQPHQP---PGYLHLTSPAAPYRPPAASG- 142
Query: 168 SAPPDDGDDNHPRHHHYPQHQPPFNQVLAANSSDNGKKRSRTKFTQEQKEKMLSFAERLG 227
D+ D ++P +SS KR RTKFT EQKEKML+FAERLG
Sbjct: 143 ----DEEDTSNP------------------SSSGGTTKRFRTKFTAEQKEKMLAFAERLG 180
Query: 228 WKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNKNASGRK 266
W++ + ++ + FC+E GV R+V K+WMHNNKN+ G+K
Sbjct: 181 WRIQKHDDVAVEQFCAETGVRRQVLKIWMHNNKNSLGKK 219
>sp|Q9FKP8|Y5541_ARATH ZF-HD homeobox protein At5g65410 OS=Arabidopsis thaliana
GN=At5g65410 PE=1 SV=1
Length = 279
Score = 89.4 bits (220), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 52/70 (74%)
Query: 196 AANSSDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVW 255
+A + +KR RTKFT EQKE+ML+ AER+GW++ R ++++ Q FC E GV R+V KVW
Sbjct: 183 SAEAGGGIRKRHRTKFTAEQKERMLALAERIGWRIQRQDDEVIQRFCQETGVPRQVLKVW 242
Query: 256 MHNNKNASGR 265
+HNNK+ G+
Sbjct: 243 LHNNKHTLGK 252
>sp|Q9C0A1|ZFHX2_HUMAN Zinc finger homeobox protein 2 OS=Homo sapiens GN=ZFHX2 PE=2 SV=3
Length = 2572
Score = 40.4 bits (93), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 203 GKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNKN 261
G++ SRTKFT+ Q + + SF E + + E+L S +G++ RV VW N +
Sbjct: 1594 GRRFSRTKFTEFQTQALQSFFETSAYPKDGEVERLA----SLLGLASRVVVVWFQNARQ 1648
>sp|Q9UPM6|LHX6_HUMAN LIM/homeobox protein Lhx6 OS=Homo sapiens GN=LHX6 PE=2 SV=2
Length = 363
Score = 37.4 bits (85), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 205 KRSRTKFTQEQKEKMLS-FAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
KR+RT FT EQ + M + FA+ N + + Q G+SRRV +VW N +
Sbjct: 220 KRARTSFTAEQLQVMQAQFAQD-----NNPDAQTLQKLADMTGLSRRVIQVWFQNCR 271
>sp|Q9R1R0|LHX6_MOUSE LIM/homeobox protein Lhx6 OS=Mus musculus GN=Lhx6 PE=1 SV=2
Length = 363
Score = 37.4 bits (85), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 205 KRSRTKFTQEQKEKMLS-FAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
KR+RT FT EQ + M + FA+ N + + Q G+SRRV +VW N +
Sbjct: 220 KRARTSFTAEQLQVMQAQFAQD-----NNPDAQTLQKLADMTGLSRRVIQVWFQNCR 271
>sp|Q68G74|LHX8_HUMAN LIM/homeobox protein Lhx8 OS=Homo sapiens GN=LHX8 PE=2 SV=2
Length = 356
Score = 35.8 bits (81), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 205 KRSRTKFTQEQKEKMLS-FAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
KR+RT FT +Q + M + FA+ N + + Q G+SRRV +VW N +
Sbjct: 226 KRARTSFTADQLQVMQAQFAQD-----NNPDAQTLQKLAERTGLSRRVIQVWFQNCR 277
>sp|Q05640|HMEN_ARTSF Homeobox protein engrailed OS=Artemia franciscana PE=2 SV=1
Length = 349
Score = 35.4 bits (80), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 10/69 (14%)
Query: 204 KKRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLTQDFCSEVGVSRRVFKVWMHNN--- 259
+KR RT FT EQ ++ + + NR E+ QD E+G+ K+W NN
Sbjct: 249 EKRPRTAFTAEQLSRL-----KHEFNENRYLTERRRQDLARELGLHENQIKIWFQNNRAK 303
Query: 260 -KNASGRKD 267
K +SG+K+
Sbjct: 304 LKKSSGQKN 312
>sp|P09145|HMEN_DROVI Segmentation polarity homeobox protein engrailed OS=Drosophila
virilis GN=en PE=3 SV=1
Length = 584
Score = 35.4 bits (80), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 44/105 (41%), Gaps = 7/105 (6%)
Query: 180 RHHHYPQHQPPFNQVLAANSSDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLT 238
R+ P P + + N +KR RT F+ EQ ++ + + NR E+
Sbjct: 462 RYSDRPSSGPRYRRPKQPKDKTNDEKRPRTAFSSEQLARL-----KREFNENRYLTERRR 516
Query: 239 QDFCSEVGVSRRVFKVWMHNNKNASGRKDQRSINNKNINDIVNGC 283
Q SE+G++ K+W NK A +K S N + + G
Sbjct: 517 QQLSSELGLNEAQIKIWFQ-NKRAKIKKSTGSKNPLALQLMAQGL 560
>sp|O35652|LHX8_MOUSE LIM/homeobox protein Lhx8 OS=Mus musculus GN=Lhx8 PE=2 SV=4
Length = 367
Score = 35.0 bits (79), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 205 KRSRTKFTQEQKEKMLS-FAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
KR+RT FT +Q + M + FA+ N + + Q G+SRRV +VW N +
Sbjct: 247 KRARTSFTADQLQVMQAQFAQD-----NNPDAQTLQKLAERTGLSRRVIQVWFQNCR 298
>sp|P53413|LMX1B_CHICK LIM/homeobox protein LMX-1.2 OS=Gallus gallus GN=LMX1B PE=2 SV=1
Length = 377
Score = 34.7 bits (78), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 9/78 (11%)
Query: 188 QPPFNQVLAANSSDNGK-----KRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFC 242
+P QV SD+GK KR RT T +Q+ + E + E L
Sbjct: 174 KPTKGQVTQGKGSDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLA---- 229
Query: 243 SEVGVSRRVFKVWMHNNK 260
+E G+S RV +VW N +
Sbjct: 230 AETGLSVRVVQVWFQNQR 247
>sp|P09070|HXB4_XENLA Homeobox protein Hox-B4 OS=Xenopus laevis GN=hoxb4 PE=2 SV=1
Length = 232
Score = 34.3 bits (77), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 10/102 (9%)
Query: 200 SDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNN 259
SD KRSRT +T++Q +L + + + + + +S R K+W N
Sbjct: 137 SDGEAKRSRTAYTRQQ---VLELEKEFHYNRYLTRRRRV-EIAHTLRLSERQIKIWF-QN 191
Query: 260 KNASGRKDQRSINNKNINDIVNGCSRVSFDVNGNCKNHNDGN 301
+ +KD + N K + V+ + G N N GN
Sbjct: 192 RRMKWKKDHKLPNTK-----IKSNPSVNLQIAGGSPNQNKGN 228
>sp|Q25132|LHX3_HALRO LIM/homeobox protein Lhx3 OS=Halocynthia roretzi GN=LHX3 PE=2 SV=2
Length = 692
Score = 34.3 bits (77), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 201 DNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
+NG KR RT T +Q E + S + E+L+ SE G+ RV +VW N +
Sbjct: 407 ENGVKRPRTTITAKQLETLKSAYNQSPKPARHVREQLS----SETGLDMRVVQVWFQNRR 462
Query: 261 NASGR 265
R
Sbjct: 463 AKEKR 467
>sp|P02836|HMEN_DROME Segmentation polarity homeobox protein engrailed OS=Drosophila
melanogaster GN=en PE=1 SV=2
Length = 552
Score = 33.9 bits (76), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 44/105 (41%), Gaps = 7/105 (6%)
Query: 180 RHHHYPQHQPPFNQVLAANSSDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLT 238
R+ P P + + N +KR RT F+ EQ ++ + + NR E+
Sbjct: 430 RYSDRPSSGPRYRRPKQPKDKTNDEKRPRTAFSSEQLARL-----KREFNENRYLTERRR 484
Query: 239 QDFCSEVGVSRRVFKVWMHNNKNASGRKDQRSINNKNINDIVNGC 283
Q SE+G++ K+W NK A +K S N + + G
Sbjct: 485 QQLSSELGLNEAQIKIWFQ-NKRAKIKKSTGSKNPLALQLMAQGL 528
>sp|O13074|HXB4A_TAKRU Homeobox protein Hox-B4a OS=Takifugu rubripes GN=hoxb4a PE=3 SV=2
Length = 288
Score = 33.9 bits (76), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/103 (23%), Positives = 45/103 (43%), Gaps = 5/103 (4%)
Query: 184 YPQHQPPFNQVLAANSSDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCS 243
YP + ++++N + KRSRT +T++Q ++ + R ++ C
Sbjct: 140 YPWMKKVHVNIVSSNYTGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHTLC- 198
Query: 244 EVGVSRRVFKVWMHNNKNASGRKDQRSINNKNINDIVNGCSRV 286
+S R K+W N+ +KD + N K + N S+V
Sbjct: 199 ---LSERQIKIWF-QNRRMKWKKDHKLPNTKVRSGSTNTNSQV 237
>sp|P22574|HXB4A_DANRE Homeobox protein Hox-B4a OS=Danio rerio GN=hoxb4a PE=2 SV=3
Length = 246
Score = 33.5 bits (75), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 5/81 (6%)
Query: 194 VLAANSSDNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFK 253
+++ N S KRSRT +T++Q ++ + R ++ C +S R K
Sbjct: 141 IVSPNYSGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHTLC----LSERQIK 196
Query: 254 VWMHNNKNASGRKDQRSINNK 274
+W N+ +KD + N K
Sbjct: 197 IWF-QNRRMKWKKDHKLPNTK 216
>sp|P09015|HME2A_DANRE Homeobox protein engrailed-2a OS=Danio rerio GN=eng2a PE=2 SV=2
Length = 265
Score = 33.5 bits (75), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 10/72 (13%)
Query: 205 KRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLTQDFCSEVGVSRRVFKVWMHNN---- 259
KR RT FT EQ +++ + ++ NR E+ Q E+G++ K+W N
Sbjct: 177 KRPRTAFTAEQLQRL-----KAEFQTNRYLTEQRRQSLAQELGLNESQIKIWFQNKRAKI 231
Query: 260 KNASGRKDQRSI 271
K ASG K+ +I
Sbjct: 232 KKASGVKNGLAI 243
>sp|O02491|HMEN_ANOGA Segmentation polarity homeobox protein engrailed OS=Anopheles
gambiae GN=en PE=2 SV=3
Length = 594
Score = 32.3 bits (72), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 7/84 (8%)
Query: 201 DNGKKRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLTQDFCSEVGVSRRVFKVWMHNN 259
D+ +KR RT F+ Q +++ + + NR EK Q +E+G++ K+W N
Sbjct: 493 DSEEKRPRTAFSNAQLQRL-----KNEFNENRYLTEKRRQTLSAELGLNEAQIKIWFQ-N 546
Query: 260 KNASGRKDQRSINNKNINDIVNGC 283
K A +K N + + G
Sbjct: 547 KRAKIKKSSSEKNPLALQLMAQGL 570
>sp|Q9CAR4|WRK36_ARATH Probable WRKY transcription factor 36 OS=Arabidopsis thaliana
GN=WRKY36 PE=2 SV=1
Length = 387
Score = 32.3 bits (72), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 47/107 (43%), Gaps = 16/107 (14%)
Query: 201 DNGKKRSRTKFTQEQKEKMLSFAERLGWKMNRAEEKLTQDFCSEVGVSRRVFKVWMHNNK 260
+N + TK +E K+K + G+++ E D C +V K+ NK
Sbjct: 108 ENKVDKISTKNVEESKDKRSALG--FGFQIQSYEASKLDDLCRQV-------KLANAENK 158
Query: 261 NASGRKDQRSINNKNINDIVN-----GCSRVSFDVNGNCKN--HNDG 300
S RKD +S+ N+N D++ G + V +C++ NDG
Sbjct: 159 CVSSRKDVKSVRNENHQDVLEEHEQTGLKKTRVCVKASCEDPSINDG 205
>sp|P52730|HME2B_XENLA Homeobox protein engrailed-2-B OS=Xenopus laevis GN=en2-b PE=2 SV=1
Length = 265
Score = 32.0 bits (71), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 10/72 (13%)
Query: 205 KRSRTKFTQEQKEKMLSFAERLGWKMNR-AEEKLTQDFCSEVGVSRRVFKVWMHNNKNAS 263
KR RT FT EQ +++ + ++ NR E+ Q E+G++ K+W N +
Sbjct: 177 KRPRTAFTAEQLQRL-----KAEFQTNRYLTEQRRQSLAQELGLNESQIKIWFQNKRA-- 229
Query: 264 GRKDQRSINNKN 275
K ++S NKN
Sbjct: 230 --KIKKSTGNKN 239
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.127 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 130,314,504
Number of Sequences: 539616
Number of extensions: 5826736
Number of successful extensions: 37529
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 177
Number of HSP's successfully gapped in prelim test: 289
Number of HSP's that attempted gapping in prelim test: 25875
Number of HSP's gapped (non-prelim): 10019
length of query: 314
length of database: 191,569,459
effective HSP length: 117
effective length of query: 197
effective length of database: 128,434,387
effective search space: 25301574239
effective search space used: 25301574239
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)