BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3393
(433 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O08755|HNF6_MOUSE Hepatocyte nuclear factor 6 OS=Mus musculus GN=Onecut1 PE=1 SV=1
Length = 465
Score = 213 bits (542), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>sp|P70512|HNF6_RAT Hepatocyte nuclear factor 6 OS=Rattus norvegicus GN=Onecut1 PE=1
SV=1
Length = 465
Score = 213 bits (541), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNDFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>sp|Q9UBC0|HNF6_HUMAN Hepatocyte nuclear factor 6 OS=Homo sapiens GN=ONECUT1 PE=2 SV=1
Length = 465
Score = 202 bits (513), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 190/349 (54%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GL +I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 369
>sp|Q9NJB5|ONEC_DROME Homeobox protein onecut OS=Drosophila melanogaster GN=onecut PE=2
SV=2
Length = 1081
Score = 185 bits (470), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 100/155 (64%), Positives = 112/155 (72%), Gaps = 17/155 (10%)
Query: 280 PTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNA 339
P +NG TP +V V Q SP S + N Q S SQQ + + T +
Sbjct: 698 PGING----TPQSVCVIHQQSP--SALGNGFQSS---------SQQAKVSSCASPKGTVS 742
Query: 340 VVGAVS--LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
GAVS + NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP
Sbjct: 743 SGGAVSNRIANSSDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 802
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 803 KPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 837
Score = 79.0 bits (193), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 64/102 (62%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQTLT+VN
Sbjct: 266 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSRSREQLLSHSSYQTLTSVND 324
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 325 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 363
>sp|Q6XBJ3|ONEC2_MOUSE One cut domain family member 2 OS=Mus musculus GN=Onecut2 PE=2 SV=2
Length = 505
Score = 176 bits (447), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 325 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 384
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 385 GRETFRRMWKWLQEPEFQRMSALRLAA 411
>sp|O95948|ONEC2_HUMAN One cut domain family member 2 OS=Homo sapiens GN=ONECUT2 PE=2 SV=2
Length = 504
Score = 176 bits (446), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 324 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 383
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 384 GRETFRRMWKWLQEPEFQRMSALRLAA 410
>sp|O60422|ONEC3_HUMAN One cut domain family member 3 OS=Homo sapiens GN=ONECUT3 PE=2 SV=2
Length = 494
Score = 169 bits (427), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 319 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 378
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 379 MWKWLQEPEFQRMSALRLA 397
Score = 34.3 bits (77), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 13/21 (61%), Positives = 18/21 (85%)
Query: 130 SYATLTPLQPLPPISTMSDKF 150
+Y TLTPLQ LPP++ ++DKF
Sbjct: 104 TYTTLTPLQHLPPLAAVADKF 124
>sp|Q8K557|ONEC3_MOUSE One cut domain family member 3 OS=Mus musculus GN=Onecut3 PE=1 SV=2
Length = 490
Score = 168 bits (426), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 316 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 375
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 376 MWKWLQEPEFQRMSALRLAA 395
Score = 34.7 bits (78), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 13/21 (61%), Positives = 18/21 (85%)
Query: 130 SYATLTPLQPLPPISTMSDKF 150
+Y TLTPLQ LPP++ ++DKF
Sbjct: 103 TYTTLTPLQHLPPLAAVADKF 123
>sp|Q19720|HM38_CAEEL Homeobox protein ceh-38 OS=Caenorhabditis elegans GN=ceh-38 PE=2
SV=2
Length = 641
Score = 123 bits (308), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 66/76 (86%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK +SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 317 IDTKDLCKRIAFELKNHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMY 376
Query: 415 KWLQEPEFQRMSALRL 430
W+ +P R++ L +
Sbjct: 377 NWVAQPLATRLAILDM 392
>sp|Q22811|HM21_CAEEL Homeobox protein ceh-21 OS=Caenorhabditis elegans GN=ceh-21 PE=2
SV=3
Length = 495
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/76 (55%), Positives = 58/76 (76%), Gaps = 1/76 (1%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G+ EE++T ++A+RI +ELK IPQ A+++L RSQGTLSDLLR PKPWS +K+GR T
Sbjct: 288 GDDEELDTVDIARRILSELKERCIPQTALAEKILARSQGTLSDLLRMPKPWSVMKNGRAT 347
Query: 410 FRRMWKWLQ-EPEFQR 424
F+RM WL +P+ +R
Sbjct: 348 FQRMSNWLGLDPDVRR 363
>sp|Q22812|HM39_CAEEL Homeobox protein ceh-39 OS=Caenorhabditis elegans GN=ceh-39 PE=3
SV=1
Length = 399
Score = 87.0 bits (214), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 50/68 (73%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G+ EE++T+ +A RI ELK IPQA A +VL RSQGTLSDLLR PKPW +K+GR T
Sbjct: 209 GDDEELDTQLIAHRILDELKEQCIPQASLAVKVLGRSQGTLSDLLRKPKPWGIMKNGRGT 268
Query: 410 FRRMWKWL 417
F+RM WL
Sbjct: 269 FQRMANWL 276
>sp|P10180|CUT_DROME Homeobox protein cut OS=Drosophila melanogaster GN=ct PE=1 SV=1
Length = 2175
Score = 68.2 bits (165), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1320 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1379
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1380 RPKPWHMLTQKGREPFIRMKMFLED 1404
Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 886 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 945
Query: 414 WKW 416
W
Sbjct: 946 HAW 948
Score = 55.8 bits (133), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1615 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1674
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1675 RMQLWLSDANNVERLQLLK 1693
>sp|P53565|CUX1_RAT Homeobox protein cut-like 1 (Fragment) OS=Rattus norvegicus GN=Cux1
PE=2 SV=1
Length = 862
Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 283 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 342
Query: 412 RMWKWL 417
RM WL
Sbjct: 343 RMQLWL 348
Score = 57.0 bits (136), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 467 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 526
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 527 MQLWLNDP 534
>sp|P70298|CUX2_MOUSE Homeobox protein cut-like 2 OS=Mus musculus GN=Cux2 PE=2 SV=2
Length = 1426
Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 836 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 895
Query: 413 MWKWL 417
M WL
Sbjct: 896 MQLWL 900
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 991 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1050
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1051 MQLWLSDP 1058
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 483 GAGPEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 542
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 543 GKEPFIKMKQFLSDE--QNVLALR 564
>sp|P53564|CUX1_MOUSE Homeobox protein cut-like 1 OS=Mus musculus GN=Cux1 PE=1 SV=3
Length = 1515
Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 936 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 995
Query: 412 RMWKWL 417
RM WL
Sbjct: 996 RMQLWL 1001
Score = 63.5 bits (153), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 539 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 598
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 599 VRGKEPFHKMKQFLSDE--QNILALR 622
Score = 56.6 bits (135), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1120 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1179
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1180 MQLWLNDP 1187
>sp|P39880|CUX1_HUMAN Homeobox protein cut-like 1 OS=Homo sapiens GN=CUX1 PE=1 SV=3
Length = 1505
Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 941 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1000
Query: 412 RMWKWL 417
RM WL
Sbjct: 1001 RMQLWL 1006
Score = 62.4 bits (150), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.6 bits (135), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1125 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1184
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1185 MQLWLNDP 1192
>sp|O14529|CUX2_HUMAN Homeobox protein cut-like 2 OS=Homo sapiens GN=CUX2 PE=1 SV=4
Length = 1486
Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 895 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 954
Query: 413 MWKWL 417
M WL
Sbjct: 955 MQLWL 959
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1046 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1105
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1106 MQLWLNDP 1113
Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 552 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 611
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 612 MKQFLSDE--QNVLALR 626
>sp|P39881|CUX1_CANFA Homeobox protein cut-like 1 (Fragment) OS=Canis familiaris GN=CUX1
PE=2 SV=1
Length = 975
Score = 67.0 bits (162), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 381 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 440
Query: 412 RMWKWL 417
RM WL
Sbjct: 441 RMQLWL 446
Score = 56.6 bits (135), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 565 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 624
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 625 MQLWLNDP 632
Score = 48.5 bits (114), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 46/70 (65%), Gaps = 3/70 (4%)
Query: 361 AQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQE 419
++++ +L +++I Q IF VL SQG++S++L PKPW+KL G+E F +M ++L
Sbjct: 1 SRQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQFLS- 59
Query: 420 PEFQRMSALR 429
+ Q + ALR
Sbjct: 60 -DEQNILALR 68
>sp|Q9BL02|CUT_CAEEL Homeobox protein cut-like ceh-44 OS=Caenorhabditis elegans
GN=ceh-44 PE=2 SV=1
Length = 1273
Score = 58.9 bits (141), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A + Y+I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 600 LNTTEIASHCKRLMIAYNIGQRLFAKHVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 659
Query: 411 RRMWKWLQEPE 421
RR++ W+ + E
Sbjct: 660 RRIYGWISDDE 670
Score = 46.6 bits (109), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T L ++I L SI Q F + +L SQG++SDLL PK W++L + GRE F RM
Sbjct: 841 LDTDFLVKQIKEFLTMNSISQRQFGEYILGLSQGSVSDLLARPKTWAQLTQKGREPFIRM 900
Query: 414 WKWLQEPE 421
++ + E
Sbjct: 901 QLFMDDVE 908
>sp|Q01826|SATB1_HUMAN DNA-binding protein SATB1 OS=Homo sapiens GN=SATB1 PE=1 SV=1
Length = 763
Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
Score = 35.0 bits (79), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
I E+KR + QA+FA+ +SQG L +LLR
Sbjct: 502 IQQEMKRAKVSQALFAKVAATKSQGWLCELLR 533
>sp|Q60611|SATB1_MOUSE DNA-binding protein SATB1 OS=Mus musculus GN=Satb1 PE=1 SV=2
Length = 764
Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
Score = 35.0 bits (79), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
I E+KR + QA+FA+ +SQG L +LLR
Sbjct: 502 IQQEMKRAKVSQALFAKVAATKSQGWLCELLR 533
>sp|Q8VI24|SATB2_MOUSE DNA-binding protein SATB2 OS=Mus musculus GN=Satb2 PE=1 SV=1
Length = 733
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
Score = 37.0 bits (84), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 14/98 (14%)
Query: 332 KTVTPTNAVVGAVSLGNSGELEEIN-TKELAQRISAELKRYSIPQAIFAQRVLCRSQGTL 390
KT TPT + V N +N T + I E+KR + QA+FA+ +SQG L
Sbjct: 464 KTSTPTTDLPIKVDGAN------VNITAAIYDEIQQEMKRAKVSQALFAKVAANKSQGWL 517
Query: 391 SDLLR---NPKPWSK-LKSGRETFRRMWKWLQEPEFQR 424
+LLR NP P ++ L T RR +L P+ +R
Sbjct: 518 CELLRWKENPSPENRTLWENLCTIRR---FLNLPQHER 552
>sp|Q9UPW6|SATB2_HUMAN DNA-binding protein SATB2 OS=Homo sapiens GN=SATB2 PE=1 SV=2
Length = 733
Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
Score = 37.4 bits (85), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 48/98 (48%), Gaps = 14/98 (14%)
Query: 332 KTVTPTNAVVGAVSLGNSGELEEIN-TKELAQRISAELKRYSIPQAIFAQRVLCRSQGTL 390
KT TPT + V N IN T + I E+KR + QA+FA+ +SQG L
Sbjct: 464 KTSTPTTDLPIKVDGAN------INITAAIYDEIQQEMKRAKVSQALFAKVAANKSQGWL 517
Query: 391 SDLLR---NPKPWSK-LKSGRETFRRMWKWLQEPEFQR 424
+LLR NP P ++ L T RR +L P+ +R
Sbjct: 518 CELLRWKENPSPENRTLWENLCTIRR---FLNLPQHER 552
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.309 0.126 0.355
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,214,186
Number of Sequences: 539616
Number of extensions: 7290169
Number of successful extensions: 18335
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 262
Number of HSP's that attempted gapping in prelim test: 17719
Number of HSP's gapped (non-prelim): 733
length of query: 433
length of database: 191,569,459
effective HSP length: 120
effective length of query: 313
effective length of database: 126,815,539
effective search space: 39693263707
effective search space used: 39693263707
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 63 (28.9 bits)