BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3393
(433 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|270014519|gb|EFA10967.1| hypothetical protein TcasGA2_TC004129 [Tribolium castaneum]
Length = 565
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 249/417 (59%), Positives = 287/417 (68%), Gaps = 68/417 (16%)
Query: 58 KTEG-LTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNV 116
+ EG L+VIV PQDDSRDSE+LSPGK+SP+ SVSVA+++D+++FR + EPTYQTLT+V
Sbjct: 56 EAEGPLSVIVQPQDDSRDSELLSPGKLSPT--SVSVANMIDAADFR-MQPEPTYQTLTSV 112
Query: 117 NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN---IGLG- 172
NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH GNVTGSFTVMQNN IGLG
Sbjct: 113 NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHAGNVTGSFTVMQNNSLGIGLGL 172
Query: 173 ---INSHYTYDKIQMSPPQHYS-PNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSP 228
+N+ Y+ + M+PP +YS P GL Q+ SP+SPQS YSQNGL SPQKS+SP
Sbjct: 173 GVNVNAAYSMPTMGMTPPHNYSSPGMGLQ-----QQPSPISPQSAYSQNGLPSPQKSMSP 227
Query: 229 NSYDSPY--NQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGIT 286
YDSPY QRDLV RS+ QSP LSP SL+S TLV F G+ T+NG+
Sbjct: 228 PGYDSPYGAQQRDLVARSL-----QSPQLSPPSASLNSPDGTLVAGFGGS----TLNGLG 278
Query: 287 TITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKI------------------------- 321
+ H + +PQ QR SP PP +
Sbjct: 279 -LQHHPPTLVQIAAPQ--------QRDCSPSPPVLSLQPQQQTQQQQQQPQPQQQQQQQQ 329
Query: 322 -----ISQQTATLTLKTVTPTNAVVGAVSL-GNSGELEEINTKELAQRISAELKRYSIPQ 375
QQ ++ NA GA+S+ + ++EEINTKELAQRISAELKRYSIPQ
Sbjct: 330 QPQQQQQQQPQPAVAMQISSKNASSGALSVPAIAADVEEINTKELAQRISAELKRYSIPQ 389
Query: 376 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 390 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 446
>gi|189233657|ref|XP_001807372.1| PREDICTED: similar to GA15133-PA [Tribolium castaneum]
Length = 1151
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 249/417 (59%), Positives = 287/417 (68%), Gaps = 68/417 (16%)
Query: 58 KTEG-LTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNV 116
+ EG L+VIV PQDDSRDSE+LSPGK+SP+ SVSVA+++D+++FR + EPTYQTLT+V
Sbjct: 566 EAEGPLSVIVQPQDDSRDSELLSPGKLSPT--SVSVANMIDAADFR-MQPEPTYQTLTSV 622
Query: 117 NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN---IGLG- 172
NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH GNVTGSFTVMQNN IGLG
Sbjct: 623 NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHAGNVTGSFTVMQNNSLGIGLGL 682
Query: 173 ---INSHYTYDKIQMSPPQHYS-PNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSP 228
+N+ Y+ + M+PP +YS P GL Q+ SP+SPQS YSQNGL SPQKS+SP
Sbjct: 683 GVNVNAAYSMPTMGMTPPHNYSSPGMGLQ-----QQPSPISPQSAYSQNGLPSPQKSMSP 737
Query: 229 NSYDSPY--NQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGIT 286
YDSPY QRDLV RS+ QSP LSP SL+S TLV F G+ T+NG+
Sbjct: 738 PGYDSPYGAQQRDLVARSL-----QSPQLSPPSASLNSPDGTLVAGFGGS----TLNGLG 788
Query: 287 TITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKI------------------------- 321
+ H + +PQ QR SP PP +
Sbjct: 789 -LQHHPPTLVQIAAPQ--------QRDCSPSPPVLSLQPQQQTQQQQQQPQPQQQQQQQQ 839
Query: 322 -----ISQQTATLTLKTVTPTNAVVGAVSL-GNSGELEEINTKELAQRISAELKRYSIPQ 375
QQ ++ NA GA+S+ + ++EEINTKELAQRISAELKRYSIPQ
Sbjct: 840 QPQQQQQQQPQPAVAMQISSKNASSGALSVPAIAADVEEINTKELAQRISAELKRYSIPQ 899
Query: 376 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 900 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 956
>gi|340722035|ref|XP_003399417.1| PREDICTED: one cut domain family member 2-like [Bombus terrestris]
Length = 668
Score = 364 bits (934), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 224/391 (57%), Positives = 254/391 (64%), Gaps = 86/391 (21%)
Query: 60 EGLTVIVAPQD---DSRD--SEILSPGKISP-SGISVSVASIMDSSEFRNLHAEPTYQTL 113
E L+VIV PQD DSRD SE+LSPGK++P SGISVSVAS++D+++FR + EPTYQTL
Sbjct: 241 ESLSVIVPPQDVDVDSRDADSELLSPGKLTPTSGISVSVASMIDATDFRAMQPEPTYQTL 300
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GHT-GNVTGSFTVMQNN--- 168
T+V R+SPPG+SP SSYATLTPLQPLPPISTMSDKF Y H G VTGSF VMQNN
Sbjct: 301 TSVAERMSPPGFSPGSSYATLTPLQPLPPISTMSDKFVYSAHAAGGVTGSFAVMQNNGLG 360
Query: 169 -IGLGINSH-YTYDKI---QMSPPQHY-SPNNGLSTIVITQENSPLSPQSTYSQNGLNSP 222
IGLG+ S Y YDK+ MSPP +Y SP GL + SPLSPQS YSQ+GLNSP
Sbjct: 361 NIGLGMGSSPYAYDKLPSMSMSPPHNYPSPGAGL-------QPSPLSPQSAYSQSGLNSP 413
Query: 223 QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTM 282
KS SP+ YD + R L SP L A+ PT+
Sbjct: 414 HKSASPH-YDPAFLPR-----------------------LQQSPAAL-----SPASPPTV 444
Query: 283 NGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVG 342
+ T P S + P HS + Q ++ +KTV P
Sbjct: 445 SATATFAP-----SHHSPPTHSHV------------------QHTSVVMKTVPPP----- 476
Query: 343 AVSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
GN +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 477 ----GNGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 532
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 533 KLKSGRETFRRMWKWLQEPEFQRMSALRLAA 563
>gi|328787290|ref|XP_624996.3| PREDICTED: one cut domain family member 2 [Apis mellifera]
Length = 672
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 221/391 (56%), Positives = 254/391 (64%), Gaps = 86/391 (21%)
Query: 60 EGLTVIVAPQD---DSRD--SEILSPGKISP-SGISVSVASIMDSSEFRNLHAEPTYQTL 113
E L+VIV PQD DSRD SE+LSPGK++P SGISVSVAS++D+++FR + EPTYQTL
Sbjct: 243 ESLSVIVPPQDVDVDSRDADSELLSPGKLTPTSGISVSVASMIDATDFRAMQPEPTYQTL 302
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GHT-GNVTGSFTVMQNN--- 168
T+V R+SPPG+SP SSYATLTPLQPLPPISTMSDKF Y H G VTGSF VMQNN
Sbjct: 303 TSVAERMSPPGFSPGSSYATLTPLQPLPPISTMSDKFVYSAHAAGGVTGSFAVMQNNGLG 362
Query: 169 -IGLGIN-SHYTYDKI---QMSPPQHY-SPNNGLSTIVITQENSPLSPQSTYSQNGLNSP 222
IGLG+ S Y YDK+ MSPP +Y SP GL + SPLSPQS YSQ+GLNSP
Sbjct: 363 NIGLGMGGSPYAYDKLPTMSMSPPHNYPSPGAGL-------QPSPLSPQSAYSQSGLNSP 415
Query: 223 QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTM 282
KS SP+ YD + R L SP L ++ P +
Sbjct: 416 HKSASPH-YDPAFLPR-----------------------LQQSPAAL-----SPSSPPAV 446
Query: 283 NGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVG 342
+ + P S + P HS + Q ++ +KTV+
Sbjct: 447 SATASFAP-----SHHSPPTHSHV------------------QHTSVVMKTVS------- 476
Query: 343 AVSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
S GN +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 477 --SAGNGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 534
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 535 KLKSGRETFRRMWKWLQEPEFQRMSALRLAA 565
>gi|380016009|ref|XP_003691986.1| PREDICTED: one cut domain family member 2-like [Apis florea]
Length = 510
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 221/391 (56%), Positives = 254/391 (64%), Gaps = 86/391 (21%)
Query: 60 EGLTVIVAPQD---DSRD--SEILSPGKISP-SGISVSVASIMDSSEFRNLHAEPTYQTL 113
E L+VIV PQD DSRD SE+LSPGK++P SGISVSVAS++D+++FR + EPTYQTL
Sbjct: 81 ESLSVIVPPQDVDVDSRDADSELLSPGKLTPTSGISVSVASMIDATDFRAMQPEPTYQTL 140
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GHT-GNVTGSFTVMQNN--- 168
T+V R+SPPG+SP SSYATLTPLQPLPPISTMSDKF Y H G VTGSF VMQNN
Sbjct: 141 TSVAERMSPPGFSPGSSYATLTPLQPLPPISTMSDKFVYSAHAAGGVTGSFAVMQNNGLG 200
Query: 169 -IGLGIN-SHYTYDKI---QMSPPQHY-SPNNGLSTIVITQENSPLSPQSTYSQNGLNSP 222
IGLG+ S Y YDK+ MSPP +Y SP GL + SPLSPQS YSQ+GLNSP
Sbjct: 201 NIGLGMGGSPYAYDKLPTMSMSPPHNYPSPGAGL-------QPSPLSPQSAYSQSGLNSP 253
Query: 223 QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTM 282
KS SP+ YD + R L SP L ++ P +
Sbjct: 254 HKSASPH-YDPAFLPR-----------------------LQQSPAAL-----SPSSPPAV 284
Query: 283 NGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVG 342
+ + P S + P HS + Q ++ +KTV+
Sbjct: 285 SATASFAP-----SHHSPPTHSHV------------------QHTSVVMKTVS------- 314
Query: 343 AVSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
S GN +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 315 --SAGNGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 372
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 373 KLKSGRETFRRMWKWLQEPEFQRMSALRLAA 403
>gi|328710625|ref|XP_001946230.2| PREDICTED: hypothetical protein LOC100158731 isoform 1
[Acyrthosiphon pisum]
Length = 811
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 219/390 (56%), Positives = 257/390 (65%), Gaps = 73/390 (18%)
Query: 107 EPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQ 166
EPTYQTLT+VNGR+SPPGYSP+SSYATLTPLQPLPPISTMSDKF YGH+ NV+GSFTVMQ
Sbjct: 283 EPTYQTLTSVNGRMSPPGYSPNSSYATLTPLQPLPPISTMSDKFVYGHSNNVSGSFTVMQ 342
Query: 167 NNIG-LG--INSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQS-TYSQNG---- 218
NNIG LG +NS YTYDK+ M P HYSP + + Q+ SP+SPQS +YSQNG
Sbjct: 343 NNIGGLGMVVNSPYTYDKMGMHSPNHYSPTHMHH--MPPQQGSPISPQSASYSQNGGGGG 400
Query: 219 --LNSPQKSLS-PNS-YDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQ 274
LNSPQKS+S PNS YDSPY Q RS SP LS N GSLHS P ++++
Sbjct: 401 GGLNSPQKSMSSPNSCYDSPYTQIGPGGRSGASNAMHSPDLSQNSGSLHSPP---MSSYG 457
Query: 275 GAATIPTMNGITTITPH----------------TVVVSPQTSPQHSLISNQLQRSDSPGP 318
G T+ +NG+TTITPH ++V+ P PQ +++ + SP P
Sbjct: 458 GTTTLSNVNGLTTITPHPGRGGGGVVVSPRHSPSLVIHPIIQPQEVVVT-----TSSPSP 512
Query: 319 PK-----------------------------IISQQTATLTLKTVTPTNAVV------GA 343
P + QQT + +++T T V+ +
Sbjct: 513 PDHSQRMQMMHQQQQQQQQQQQQQQQQQQQQLQQQQTTLIKTQSITTTTTVLVQNASSAS 572
Query: 344 VSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
S G+ ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL
Sbjct: 573 GSNGSGSDMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 632
Query: 404 KSGRETFRRMWKWLQEPEFQRMSALRLAGN 433
KSGRETFRRMWKWLQEPEFQRMSALRLA +
Sbjct: 633 KSGRETFRRMWKWLQEPEFQRMSALRLADD 662
>gi|328710627|ref|XP_003244317.1| PREDICTED: hypothetical protein LOC100158731 isoform 2
[Acyrthosiphon pisum]
Length = 821
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 219/390 (56%), Positives = 257/390 (65%), Gaps = 73/390 (18%)
Query: 107 EPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQ 166
EPTYQTLT+VNGR+SPPGYSP+SSYATLTPLQPLPPISTMSDKF YGH+ NV+GSFTVMQ
Sbjct: 283 EPTYQTLTSVNGRMSPPGYSPNSSYATLTPLQPLPPISTMSDKFVYGHSNNVSGSFTVMQ 342
Query: 167 NNIG-LG--INSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQS-TYSQNG---- 218
NNIG LG +NS YTYDK+ M P HYSP + + Q+ SP+SPQS +YSQNG
Sbjct: 343 NNIGGLGMVVNSPYTYDKMGMHSPNHYSPTHMHH--MPPQQGSPISPQSASYSQNGGGGG 400
Query: 219 --LNSPQKSLS-PNS-YDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQ 274
LNSPQKS+S PNS YDSPY Q RS SP LS N GSLHS P ++++
Sbjct: 401 GGLNSPQKSMSSPNSCYDSPYTQIGPGGRSGASNAMHSPDLSQNSGSLHSPP---MSSYG 457
Query: 275 GAATIPTMNGITTITPH----------------TVVVSPQTSPQHSLISNQLQRSDSPGP 318
G T+ +NG+TTITPH ++V+ P PQ +++ + SP P
Sbjct: 458 GTTTLSNVNGLTTITPHPGRGGGGVVVSPRHSPSLVIHPIIQPQEVVVT-----TSSPSP 512
Query: 319 PK-----------------------------IISQQTATLTLKTVTPTNAVV------GA 343
P + QQT + +++T T V+ +
Sbjct: 513 PDHSQRMQMMHQQQQQQQQQQQQQQQQQQQQLQQQQTTLIKTQSITTTTTVLVQNASSAS 572
Query: 344 VSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
S G+ ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL
Sbjct: 573 GSNGSGSDMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 632
Query: 404 KSGRETFRRMWKWLQEPEFQRMSALRLAGN 433
KSGRETFRRMWKWLQEPEFQRMSALRLA +
Sbjct: 633 KSGRETFRRMWKWLQEPEFQRMSALRLADD 662
>gi|242017908|ref|XP_002429426.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212514352|gb|EEB16688.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 483
Score = 343 bits (881), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 215/390 (55%), Positives = 250/390 (64%), Gaps = 85/390 (21%)
Query: 44 SNHSSGKDDDNESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRN 103
S S KD+D E L+VIV+P+ DSRDSE+LSP + AS++D+SEFR
Sbjct: 50 SERPSCKDEDQEV-----LSVIVSPRADSRDSELLSP----------TSASMIDASEFR- 93
Query: 104 LHAEPTYQTLTNVNGR-LSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSF 162
+ EPTYQTLT++ R +SP G SP+SSYATLTPLQPLPPIST+SDKFAYGH G+V+GSF
Sbjct: 94 MQPEPTYQTLTSMVDRGISPAGLSPNSSYATLTPLQPLPPISTVSDKFAYGHAGSVSGSF 153
Query: 163 TVMQNNIGLGINSHYTYDKIQMSPPQHYS-PNNGLSTIVITQENSPLSPQSTYSQNGLNS 221
TVMQNN+GLG Y YDK+ +S HYS P GL +SPLSPQS YSQ G+ S
Sbjct: 154 TVMQNNLGLG----YGYDKMGVS--SHYSTPALGLGI-----PHSPLSPQSPYSQ-GIQS 201
Query: 222 PQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPT 281
PQKS+SP Y+SP R + G QSP LSP SLHS P +F G
Sbjct: 202 PQKSMSPTGYESPNYHRSM-------AGPQSPALSPPSASLHS-PVNSTIHFGGEL---- 249
Query: 282 MNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVV 341
+ + +P TV+ P + Q I N++
Sbjct: 250 ---VESPSPPTVIHRPPS--QGKPIQNEM------------------------------- 273
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
AV++ EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 274 -AVNV------EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 326
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 327 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 356
>gi|405959224|gb|EKC25281.1| One cut domain family member 2 [Crassostrea gigas]
Length = 486
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 168/364 (46%), Positives = 205/364 (56%), Gaps = 80/364 (21%)
Query: 87 GISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTM 146
I+V+VAS++DSS+FR + TYQTL NGR+SP YSP+S YATLTPLQPLPPIST+
Sbjct: 68 AITVTVASMIDSSDFRAQMGDVTYQTL---NGRMSP-SYSPNS-YATLTPLQPLPPISTV 122
Query: 147 SDKFAYGHTGNVTGSFTVMQNNIGLG----INSHYTYDKI-------------QMSPPQH 189
SDKF ++G+ ++ N GLG +NS Y YDK+
Sbjct: 123 SDKFGQIQASQISGNGFMINNVNGLGNVVDMNS-YRYDKMVGVNMGQALASTPMTMSMSM 181
Query: 190 YSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGG 249
+NG + V N +S QNGL SP++
Sbjct: 182 PMMSNGYTQSVPYGYNVNVS----MGQNGLPSPKEE------------------------ 213
Query: 250 SQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQ 309
Q P LSPN + + +L + + P M+ + ++ +T V +P TSP + S Q
Sbjct: 214 -QKPVLSPNSLTYVDTYRSLAPPARLHSPNPMMSSLNSLHVNTTV-TPDTSPHSHMGSPQ 271
Query: 310 LQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN-SGELEEINTKELAQRISAEL 368
+RS P A S N S E+EEINTKELAQRIS+EL
Sbjct: 272 RERSLGP--------------------------ADSQSNKSQEVEEINTKELAQRISSEL 305
Query: 369 KRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSAL 428
KRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSAL
Sbjct: 306 KRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSAL 365
Query: 429 RLAG 432
RLAG
Sbjct: 366 RLAG 369
>gi|301618950|ref|XP_002938867.1| PREDICTED: hepatocyte nuclear factor 6-like [Xenopus (Silurana)
tropicalis]
Length = 439
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 161/359 (44%), Positives = 200/359 (55%), Gaps = 79/359 (22%)
Query: 90 VSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDK 149
S+ SI+D E+R+ H N++ SP G S SS+Y TLTPLQ LPPIST+SDK
Sbjct: 46 TSMTSILDGGEYRSEHLGGPLHPAMNMSCE-SPSGMSLSSTYTTLTPLQHLPPISTVSDK 104
Query: 150 FAYGH------------TGNVTGSFTVMQNNIGLG----INSHYTYDKIQMSPPQHYSP- 192
F + H GNV+GSFT+M++ GL + HY D M P SP
Sbjct: 105 FHHPHHHHHHHHHHQRLAGNVSGSFTLMRDERGLASMGNLYGHYPKDMTTMGQP--LSPL 162
Query: 193 NNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQS 252
+NGL T+ +Q++ L P Y G S +K LSPNS+D S +
Sbjct: 163 SNGLGTLHNSQQS--LGP---YGPGGHLSNEKMLSPNSFD-----------------SHA 200
Query: 253 PTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQR 312
LS +E L T GA +P +NG+ H V S SL+ ++ ++
Sbjct: 201 AMLSRSEDHL-----TRGLGGPGAGMMPPLNGMHPHGHHHV------SSNGSLLGDRERQ 249
Query: 313 SDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
+ + G G +G++EEINTKE+AQRI+AELKRYS
Sbjct: 250 ASTSGS--------------------------QAGGNGQVEEINTKEVAQRITAELKRYS 283
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
IPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 284 IPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 342
>gi|241022759|ref|XP_002406008.1| ONECUT DNA binding protein, putative [Ixodes scapularis]
gi|215491835|gb|EEC01476.1| ONECUT DNA binding protein, putative [Ixodes scapularis]
Length = 285
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 192/342 (56%), Gaps = 62/342 (18%)
Query: 95 IMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFA-YG 153
++D+SEFR L GR+SP G+S SSSYATLTPLQPLPPISTMSDKF+ YG
Sbjct: 1 MIDASEFRGLDP-------GAYPGRMSPGGFS-SSSYATLTPLQPLPPISTMSDKFSHYG 52
Query: 154 HTGNVTGSFTVMQNNIGLGINSHYTYDKI-QMSPPQHYSPNNGLSTIVITQENSPLSPQS 212
H GS G+ + YDK+ MS + SP +G + ++ + P P
Sbjct: 53 HHHQHPGSNGGFAPLAMHGMTGYPQYDKLGSMSGAMNMSPPHGTAPMLGAGSSLPSPP-- 110
Query: 213 TYSQNGLNSPQKSLS---PNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTL 269
Y+QNGL P+K L+ SY QR+L G QSPT SLHS
Sbjct: 111 PYAQNGL--PEKGLNGYETASYGGAAPQRELTRL----GSPQSPT------SLHS----- 153
Query: 270 VTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATL 329
+P +NG+ SP T PQ L +
Sbjct: 154 -------PMLPPLNGL----------SPHTPPQAHLAAPATPPQPP--------PPQEAK 188
Query: 330 TLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 389
+ + P + V + G+ EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT
Sbjct: 189 PARVLLPVVSSVPTL-----GQAEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 243
Query: 390 LSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
LSDLLRNPKPWSKLKSGRETFRRM+KWL+EPEFQRMSALRLA
Sbjct: 244 LSDLLRNPKPWSKLKSGRETFRRMYKWLEEPEFQRMSALRLA 285
>gi|224062442|ref|XP_002197866.1| PREDICTED: hepatocyte nuclear factor 6 [Taeniopygia guttata]
Length = 457
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 160/370 (43%), Positives = 207/370 (55%), Gaps = 85/370 (22%)
Query: 89 SVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLS-----PPGYSPSSSYATLTPLQPLPPI 143
S+ +ASI+D ++ + H P + ++ ++ PPG S SS+Y TLTPLQPLPPI
Sbjct: 51 SMGMASILDGGDYHHHHRPPEHALTGPLHPTMTMACETPPGMSMSSTYTTLTPLQPLPPI 110
Query: 144 STMSDKFAYGHT-------------GNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSPPQ 188
ST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y K Q
Sbjct: 111 STVSDKFPHHHHHHHHHHHPHQRIPGNVSGSFTLMRDERGLASMNNLYTPYHKDVTGMGQ 170
Query: 189 HYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSV 245
SP +GL +I +Q+ P Y+ P +K L+PN +++ +
Sbjct: 171 SLSPLSGSGLGSIHNSQQGLP-----HYAHPSATMPAEKMLTPNGFEAHH---------- 215
Query: 246 LGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSL 305
P + G H +PT+ A +P +NGI P H
Sbjct: 216 -------PAMLARHGDQHLTPTS-------AGMVP-INGI---------------PHH-- 243
Query: 306 ISNQLQRSDSPGPPKIISQQTATLTLKTVTPTN-AVVGA-VSLG-NSGELEEINTKELAQ 362
P ++ Q+ L + N +V G+ V+ G NSG++EEINTKE+AQ
Sbjct: 244 ------------PHAHLNAQSHGQILGSAREQNPSVTGSQVNSGSNSGQMEEINTKEVAQ 291
Query: 363 RISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF 422
RI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF
Sbjct: 292 RITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF 351
Query: 423 QRMSALRLAG 432
QRMSALRLA
Sbjct: 352 QRMSALRLAA 361
>gi|383860558|ref|XP_003705756.1| PREDICTED: homeobox protein onecut-like [Megachile rotundata]
Length = 477
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 152/285 (53%), Positives = 170/285 (59%), Gaps = 80/285 (28%)
Query: 159 TGSFTVMQNN----IGLGINSH-YTYDKI---QMSPPQHY-SPNNGLSTIVITQENSPLS 209
TGSF VMQNN IGLG+ S Y YDK+ MSPP +Y SP GL + SPLS
Sbjct: 155 TGSFAVMQNNGLGNIGLGMGSSPYAYDKLPSMSMSPPHNYPSPGAGL-------QPSPLS 207
Query: 210 PQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPT-LSPNEGSLHSSPTT 268
PQS YSQ+GLNSP KS SP+ YD + R QSP LSP SSP
Sbjct: 208 PQSAYSQSGLNSPHKSASPH-YDPAFLPRL----------QQSPAALSP------SSP-- 248
Query: 269 LVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTAT 328
P ++ + P S + P HS + Q +
Sbjct: 249 -----------PAVSATASFAP-----SHHSPPTHSHV------------------QHTS 274
Query: 329 LTLKTVTPTNAVVGAVSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ 387
+ +KTV+ S GN +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ
Sbjct: 275 VVMKTVS---------SAGNGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ 325
Query: 388 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 326 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 370
>gi|41055714|ref|NP_956867.1| hepatocyte nuclear factor 6 [Danio rerio]
gi|33989484|gb|AAH56601.1| One cut domain, family member 1 [Danio rerio]
Length = 461
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 162/378 (42%), Positives = 213/378 (56%), Gaps = 87/378 (23%)
Query: 83 ISPSGISVSVASIMDSSEFRNLHAEPTYQTL-TNVNGRLS-----PPGYSPSSSYATLTP 136
+S S+ +ASI+DS ++ + H P + L T+++ +S PPG S SS+Y TLTP
Sbjct: 42 LSAHARSMGMASILDSGDYHH-HRPPEHPGLATHLHPAMSMACEAPPGMSMSSTYTTLTP 100
Query: 137 LQPLPPISTMSDKFAYGHT--------------GNVTGSFTVMQNNIGLG-INSHYT-YD 180
LQPLPPIST+SDKF + H GNV+GSFT+M+++ GL +N+ Y+ Y
Sbjct: 101 LQPLPPISTVSDKFPHHHHHHHHHHHHPHQRIPGNVSGSFTLMRDDRGLAPMNNLYSPYH 160
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K S Q SP +GLS I +Q+ P Y+ G P +K L+PN +++ +
Sbjct: 161 KDVASMGQSLSPLSGSGLSGIHNSQQGLP-----PYAHPGATMPAEKMLTPNGFEAHH-- 213
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H S ++ A+ +P I I H
Sbjct: 214 ---------------PAMLARHGEQHMSASS-------ASMVP----INVIHHHP----- 242
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN---SGELEE 354
H+ ++ Q G +++ T ++ V L N SG++EE
Sbjct: 243 -----HAHLNAQ-------GHGQVLG--------STREQNHSSVPGSQLNNGSSSGQMEE 282
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW
Sbjct: 283 VNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 342
Query: 415 KWLQEPEFQRMSALRLAG 432
KWLQEPEFQRMSALRLAG
Sbjct: 343 KWLQEPEFQRMSALRLAG 360
>gi|56606956|gb|AAW02846.1| onecut-1 isoform b [Danio rerio]
Length = 461
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 161/378 (42%), Positives = 210/378 (55%), Gaps = 87/378 (23%)
Query: 83 ISPSGISVSVASIMDSSEFRNLHAEPTYQTL-TNVNGRLS-----PPGYSPSSSYATLTP 136
+S S+ +ASI+DS ++ + H P + L T+++ +S PPG S SS+Y TLTP
Sbjct: 42 LSAHARSMGMASILDSGDYHH-HRPPEHPGLATHLHPAMSMACEAPPGMSMSSTYTTLTP 100
Query: 137 LQPLPPISTMSDKFAYGHT--------------GNVTGSFTVMQNNIGLG-INSHYT-YD 180
LQPLPPIST+SDKF + H GNV+GSFT+M+++ GL +N+ Y+ Y
Sbjct: 101 LQPLPPISTVSDKFPHHHHHHHHHHHHPHQRIPGNVSGSFTLMRDDRGLAPMNNLYSPYH 160
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K S Q SP +GLS I +Q+ P Y+ G P +K L+PN +++ +
Sbjct: 161 KDVASMGQSLSPLSGSGLSGIHNSQQGLP-----PYAHPGATMPAEKMLTPNGFEAHH-- 213
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H S ++ A+ +P +NGI PH
Sbjct: 214 ---------------PAMLARHGEQHMSASS-------ASMVP-INGIHH-HPHA----- 244
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN---SGELEE 354
+ +Q + T ++ V L N SG++EE
Sbjct: 245 ----------------------HLNAQGHGQVLGSTREQNHSSVPGSQLNNGSSSGQMEE 282
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+AQRI+ ELKRYSIPQAI AQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW
Sbjct: 283 VNTKEVAQRITTELKRYSIPQAILAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 342
Query: 415 KWLQEPEFQRMSALRLAG 432
KWLQEPEFQRMSALRLAG
Sbjct: 343 KWLQEPEFQRMSALRLAG 360
>gi|42542744|gb|AAH66466.1| Onecut1 protein [Danio rerio]
Length = 456
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 161/378 (42%), Positives = 212/378 (56%), Gaps = 87/378 (23%)
Query: 83 ISPSGISVSVASIMDSSEFRNLHAEPTYQTL-TNVNGRLS-----PPGYSPSSSYATLTP 136
+S S+ +ASI+DS ++ + H P + L T+++ +S PPG S SS+Y TLTP
Sbjct: 42 LSAHARSMGMASILDSGDYHH-HRPPEHPGLATHLHPAMSMACEAPPGMSMSSTYTTLTP 100
Query: 137 LQPLPPISTMSDKFAYGHT--------------GNVTGSFTVMQNNIGLG-INSHYT-YD 180
LQPLPPIST+SDKF + H GNV+GSFT+M+++ GL +N+ Y+ Y
Sbjct: 101 LQPLPPISTVSDKFPHHHHHHHHHHHHPHQRIPGNVSGSFTLMRDDRGLAPMNNLYSPYH 160
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K S Q SP +GLS I +Q+ P Y+ G P +K L+PN +++ +
Sbjct: 161 KDVASMGQSLSPLSGSGLSGIHNSQQGLP-----PYAHPGATMPAEKMLTPNGFEAHH-- 213
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H S ++ A+ +P I I H
Sbjct: 214 ---------------PAMLARHGEQHMSASS-------ASMVP----INVIHHHP----- 242
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN---SGELEE 354
H+ ++ Q G +++ T ++ V L N SG++EE
Sbjct: 243 -----HAHLNAQ-------GHGQVLG--------STREQNHSSVPGSQLNNGSSSGQMEE 282
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW
Sbjct: 283 VNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 342
Query: 415 KWLQEPEFQRMSALRLAG 432
KWLQEPEFQRMSALRLA
Sbjct: 343 KWLQEPEFQRMSALRLAA 360
>gi|126277592|ref|XP_001370228.1| PREDICTED: hepatocyte nuclear factor 6 [Monodelphis domestica]
Length = 460
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 163/373 (43%), Positives = 202/373 (54%), Gaps = 88/373 (23%)
Query: 89 SVSVASIMDSSEFRNLHAEPTYQTL------TNVNGRLSPPGYSPSSSYATLTPLQPLPP 142
S+ +ASI+DS + H P L T +PPG S +S+Y TLTPLQPLPP
Sbjct: 51 SMGMASILDSGGDYHHHHRPPEHALAGPLHPTMTMACETPPGMSMTSTYTTLTPLQPLPP 110
Query: 143 ISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMS 185
IST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y K
Sbjct: 111 ISTVSDKFPHHHHHHHHHHHPHHHQRIAGNVSGSFTLMRDERGLASMNNLYTPYHKDVPG 170
Query: 186 PPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVD 242
Q SP +GL I +Q+ P Y+ P +K L+PN +++ +
Sbjct: 171 MGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPSATMPAEKMLTPNGFEAHH------- 218
Query: 243 RSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQ 302
P + G H +PT+ A +P +NG+ P
Sbjct: 219 ----------PAMLARHGEQHLTPTS-------AGMVP-INGL---------------PH 245
Query: 303 HSLISNQLQRSDSPGPPKIISQQTATLTLKTV-TPTNAVVGA-VSLG-NSGELEEINTKE 359
H P ++ Q L T P +V G+ VS G NSG++EEINTKE
Sbjct: 246 H--------------PHAHLNAQGHGQILGTAREPNPSVTGSQVSNGSNSGQMEEINTKE 291
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 419
+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE
Sbjct: 292 VAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 351
Query: 420 PEFQRMSALRLAG 432
PEFQRMSALRLA
Sbjct: 352 PEFQRMSALRLAA 364
>gi|345320030|ref|XP_001515819.2| PREDICTED: one cut domain family member 2-like, partial
[Ornithorhynchus anatinus]
Length = 341
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 157/358 (43%), Positives = 194/358 (54%), Gaps = 80/358 (22%)
Query: 91 SVASIMDSSEFRNLH--AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSD 148
S+ASI++ E+R H + P + ++ SP G S S +Y TLTPLQ LPPIST+SD
Sbjct: 47 SMASILEGGEYRPEHPLSAPLHPAMSMAC--ESPSGMSLSGTYTTLTPLQHLPPISTVSD 104
Query: 149 KFAYGHT----------GNVTGSFTVMQNNIGL----GINSHYTYDKIQMSPPQHYSP-N 193
KF + H GNV+GSFT+M++ L + SHY D M P SP +
Sbjct: 105 KFHHHHHHPHHHHQRLPGNVSGSFTLMRDERSLTSVGNLYSHYPKDMTGMGQP--LSPLS 162
Query: 194 NGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSP 253
NGL T+ +Q+ PL+P Y G + K LSPN +++ +
Sbjct: 163 NGLGTLHNSQQ--PLAP---YGPGGHLANDKMLSPNGFET-----------------HAT 200
Query: 254 TLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRS 313
LS E P GA+ +P +NG+ PH
Sbjct: 201 MLSRGE-----DPLARGLGAPGASMMPPLNGMH---PH---------------------G 231
Query: 314 DSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSI 373
PG + +L A G SG++EEINTKE+AQRI+AELKRYSI
Sbjct: 232 HHPG--------QGSGSLLGERERQAAASGSQPGGSGQVEEINTKEVAQRITAELKRYSI 283
Query: 374 PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 284 PQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 341
>gi|56606954|gb|AAW02845.1| onecut-1 isoform a [Danio rerio]
Length = 456
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 160/377 (42%), Positives = 209/377 (55%), Gaps = 87/377 (23%)
Query: 83 ISPSGISVSVASIMDSSEFRNLHAEPTYQTL-TNVNGRLS-----PPGYSPSSSYATLTP 136
+S S+ +ASI+DS ++ + H P + L T+++ +S PPG S SS+Y TLTP
Sbjct: 42 LSAHARSMGMASILDSGDYHH-HRPPEHPGLATHLHPAMSMACEAPPGMSMSSTYTTLTP 100
Query: 137 LQPLPPISTMSDKFAYGHT--------------GNVTGSFTVMQNNIGLG-INSHYT-YD 180
LQPLPPIST+SDKF + H GNV+GSFT+M+++ GL +N+ Y+ Y
Sbjct: 101 LQPLPPISTVSDKFPHHHHHHHHHHHHPHQRIPGNVSGSFTLMRDDRGLAPMNNLYSPYH 160
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K S Q SP +GLS I +Q+ P Y+ G P +K L+PN +++ +
Sbjct: 161 KDVASMGQSLSPLSGSGLSGIHNSQQGLP-----PYAHPGATMPAEKMLTPNGFEAHH-- 213
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H S ++ A+ +P +NGI PH
Sbjct: 214 ---------------PAMLARHGEQHMSASS-------ASMVP-INGIHH-HPHA----- 244
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN---SGELEE 354
+ +Q + T ++ V L N SG++EE
Sbjct: 245 ----------------------HLNAQGHGQVLGSTREQNHSSVPGSQLNNGSSSGQMEE 282
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+AQRI+ ELKRYSIPQAI AQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW
Sbjct: 283 VNTKEVAQRITTELKRYSIPQAILAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 342
Query: 415 KWLQEPEFQRMSALRLA 431
KWLQEPEFQRMSALRLA
Sbjct: 343 KWLQEPEFQRMSALRLA 359
>gi|154147602|ref|NP_001093730.1| one cut homeobox 1 [Xenopus (Silurana) tropicalis]
gi|134025773|gb|AAI35776.1| onecut1 protein [Xenopus (Silurana) tropicalis]
Length = 493
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 159/369 (43%), Positives = 203/369 (55%), Gaps = 82/369 (22%)
Query: 89 SVSVASIMDSSEFRNL---------HA--EPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +ASI+D ++ + HA P + T+T +PPG S SS+Y TLTPL
Sbjct: 51 SMGMASILDGGDYHHHHHHHHRPPDHALTGPLHPTMTMACD--TPPGMSMSSTYTTLTPL 108
Query: 138 QPLPPISTMSDKFAYGHT---------GNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSP 186
QPLPPIST+SDKF + H GNV+GSFT+M+++ GL +N+ Y+ Y K
Sbjct: 109 QPLPPISTVSDKFPHHHHHHHPHQRIPGNVSGSFTLMRDDRGLASMNNLYSPYHKEVTGM 168
Query: 187 PQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDR 243
Q SP +GL +I Q+ P Y+ P +K L+PN +++ +
Sbjct: 169 GQSLSPLSGSGLGSIHGAQQGPP-----HYAHPSAAMPTEKMLTPNGFEAHH-------- 215
Query: 244 SVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQH 303
P + G H +P + A +P +NGI PH + + H
Sbjct: 216 ---------PAMLTRHGEQHLTPPS-------AGMVP-INGIPH-HPHAHL----NAQSH 253
Query: 304 SLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQR 363
I L + PP + Q + NSG++EEINTKE+AQR
Sbjct: 254 GQI---LASTRDQNPPSVTGSQINNGS-----------------NSGQMEEINTKEVAQR 293
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ 423
I+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
Sbjct: 294 ITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ 353
Query: 424 RMSALRLAG 432
RMSALRLA
Sbjct: 354 RMSALRLAA 362
>gi|449269309|gb|EMC80099.1| Hepatocyte nuclear factor 6, partial [Columba livia]
Length = 395
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 190/333 (57%), Gaps = 80/333 (24%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHT-------------GNVTGSFTVMQN 167
+PPG S SS+Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M++
Sbjct: 26 TPPGMSMSSTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHQRIPGNVSGSFTLMRD 85
Query: 168 NIGL-GINSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP- 222
GL +N+ YT Y K Q SP +GL +I +Q+ P Y+ P
Sbjct: 86 ERGLTSMNNLYTPYHKDVTGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPSATMPA 140
Query: 223 QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTM 282
+K L+PN +++ + P + G H +PT+ A +P +
Sbjct: 141 EKMLTPNGFEAHH-----------------PAMLARHGDQHLTPTS-------AGMVP-I 175
Query: 283 NGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTN-AVV 341
NGI P H P ++ Q+ L + N +V
Sbjct: 176 NGI---------------PHH--------------PHAHLNAQSHGQILGSAREQNPSVT 206
Query: 342 GA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKP 399
G+ V+ G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKP
Sbjct: 207 GSQVNSGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKP 266
Query: 400 WSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
WSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 267 WSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 299
>gi|335279519|ref|XP_001926642.3| PREDICTED: one cut domain family member 2 [Sus scrofa]
Length = 490
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 159/365 (43%), Positives = 204/365 (55%), Gaps = 74/365 (20%)
Query: 84 SPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPI 143
S S + S+ASI+D ++R + P + ++ ++ SPPG S++Y TLTPLQPLPPI
Sbjct: 104 SRSAMVTSMASILDGGDYRPELSIPLHHAMS-MSCDSSPPGMGMSNTYTTLTPLQPLPPI 162
Query: 144 STMSDKFAYGH----------------TGNVTGSFTVMQNNIGL-GINSHYTYDKIQMSP 186
ST+SDKF + H +GNV+GSFT+M++ GL +N+ Y SP
Sbjct: 163 STVSDKFHHPHPHHHPHHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNLY-------SP 215
Query: 187 PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVL 246
Y G+S + +PL GL++ Q+SL PN Y P + +
Sbjct: 216 ---YKEMPGMSQSLSPLTATPLG----NGLGGLHNAQQSL-PN-YGPPGHDK-------- 258
Query: 247 GGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLI 306
LSPN + H++ T +G + G TP ++S H
Sbjct: 259 -------MLSPNFEAHHTAMLT-----RGEQHLSRGLG----TPPAAMMSHLNGLHHP-- 300
Query: 307 SNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISA 366
+ S GP L P ++ G+ + SG+LEEINTKE+AQRI+A
Sbjct: 301 ----GHAQSHGP---------VLAPSRERPPSSSSGS-QVATSGQLEEINTKEVAQRITA 346
Query: 367 ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMS 426
ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMS
Sbjct: 347 ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMS 406
Query: 427 ALRLA 431
ALRLA
Sbjct: 407 ALRLA 411
>gi|391326747|ref|XP_003737873.1| PREDICTED: one cut domain family member 2-like [Metaseiulus
occidentalis]
Length = 487
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 196/351 (55%), Gaps = 75/351 (21%)
Query: 95 IMDSSEFR-----NLHAEPTYQTLTNVNGRLSPPG--YSPSSSYATLTPLQPLPPISTMS 147
++D+SEFR H P+ GR+SP G S +SSYATLTPLQPLPPISTMS
Sbjct: 1 MIDASEFRASLGDGYHHHPS--------GRMSPGGSFTSNASSYATLTPLQPLPPISTMS 52
Query: 148 DKFA-YGHTGNVTGSFTVMQNNIGLGINSHYTYDKI------QMSPPQHYSPNNGLSTIV 200
DKF+ YGH G ++M G+G+ + +YDK+ MSPP +P L+
Sbjct: 53 DKFSHYGHPHPNAGFSSLMP---GMGMGGYPSYDKLGSMGGMNMSPPGGQAPM--LANAA 107
Query: 201 ITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEG 260
I P YSQNG + S +SY + R+L S LG SP +
Sbjct: 108 I--------PSPPYSQNGSGAGGLVSSYDSYT--HATREL---SRLG----SPQSPGSSV 150
Query: 261 SLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPK 320
+LHS PTT++ G ++N + P
Sbjct: 151 ALHS-PTTMIGGLNG------------------------------LANTPPAAHLAAPAT 179
Query: 321 IISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQ 380
Q T +++ ++A S ++ E+EEINTKELAQRISAELKRYSIPQAIFAQ
Sbjct: 180 PPQQDVKPFTAVSISTSSATPTTSSSSSADEVEEINTKELAQRISAELKRYSIPQAIFAQ 239
Query: 381 RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM+KWL+EPEFQRMSALRLA
Sbjct: 240 RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMYKWLEEPEFQRMSALRLA 290
>gi|449491615|ref|XP_002190661.2| PREDICTED: one cut domain family member 2-like [Taeniopygia
guttata]
Length = 413
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 164/392 (41%), Positives = 207/392 (52%), Gaps = 95/392 (24%)
Query: 72 SRDSEILSPGK--------ISPSGISVSV---ASIMDSSEFRNLHA--EPTYQTLTNVNG 118
S+ +E+LSP + P G V ASI++ ++R H+ P + ++
Sbjct: 17 SQPAELLSPAHGRQSSHRNLVPHGRPAMVSGMASILEGGDYRGEHSLGAPLHPAMSMSCE 76
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH------------TGNVTGSFTVMQ 166
SP G S SS+Y TLTPLQ LPPIST+S+KF + H GNV+GSFT+M+
Sbjct: 77 --SPSGMSLSSTYTTLTPLQHLPPISTVSEKFHHPHHHHHHHHPHQRLAGNVSGSFTLMR 134
Query: 167 NNIGLG----INSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP 222
+ L + SHY D M P PN GL ++ Q+ PL+P Y G
Sbjct: 135 DERSLASMGNLYSHYPKDMPAMGQPLSPLPN-GLGSLHNAQQ--PLAP---YGPAGHLPN 188
Query: 223 QKSLSPNSYDSPYNQRDLVDRSV--LGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
+K LSPN +DS ++ R L G +P+ +A +P
Sbjct: 189 EKMLSPNGFDS---HAAMLSRGEEHLARGLAAPS---------------------SAMMP 224
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH P P SL+ + + P A
Sbjct: 225 PLNGMH---PHG---HPHGQPGASLLGER----ERP----------------------AP 252
Query: 341 VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G SG++EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPW
Sbjct: 253 GPGSQPGGSGQVEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPW 312
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 313 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 344
>gi|62910168|gb|AAY21054.1| one cut domain, family member 1 [Mus musculus]
Length = 441
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|6680241|ref|NP_032288.1| hepatocyte nuclear factor 6 [Mus musculus]
gi|13627024|sp|O08755.1|HNF6_MOUSE RecName: Full=Hepatocyte nuclear factor 6; Short=HNF-6; AltName:
Full=One cut domain family member 1; AltName: Full=One
cut homeobox 1
gi|2072998|gb|AAB53863.1| hepatocyte nuclear factor 6 [Mus musculus]
gi|23271452|gb|AAH24053.1| One cut domain, family member 1 [Mus musculus]
Length = 465
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|149019183|gb|EDL77824.1| one cut domain, family member 1, isoform CRA_b [Rattus norvegicus]
Length = 465
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|3250876|emb|CAA75150.1| hepatocyte nuclear factor 6 beta [Rattus norvegicus]
Length = 491
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNDFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|149019182|gb|EDL77823.1| one cut domain, family member 1, isoform CRA_a [Rattus norvegicus]
Length = 491
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|12083605|ref|NP_073162.1| hepatocyte nuclear factor 6 [Rattus norvegicus]
gi|13626998|sp|P70512.1|HNF6_RAT RecName: Full=Hepatocyte nuclear factor 6; Short=HNF-6; AltName:
Full=One cut domain family member 1; AltName: Full=One
cut homeobox 1
gi|1550795|emb|CAA65389.1| hepatocyte nuclear factor 6 alpha [Rattus norvegicus]
Length = 465
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 162/376 (43%), Positives = 205/376 (54%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YD 180
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNDFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|338717463|ref|XP_003363643.1| PREDICTED: LOW QUALITY PROTEIN: hepatocyte nuclear factor 6-like
[Equus caballus]
Length = 496
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 162/381 (42%), Positives = 203/381 (53%), Gaps = 92/381 (24%)
Query: 84 SPSGISVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYA 132
P + +AS++D S + + H A P + T+T +PPG S ++Y
Sbjct: 79 CPRTALMGMASLLDGGAAXSRYHHHHRAPEHSLAGPXHPTMTMACE--TPPGMSMPTTYT 136
Query: 133 TLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSH 176
TLTPLQPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+
Sbjct: 137 TLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNL 196
Query: 177 YT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYD 232
YT Y K Q SP +GL I +Q+ P Y+ G P K L+PN ++
Sbjct: 197 YTPYHKDVAGMGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFE 251
Query: 233 SPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHT 292
+ + P + G H +PT+ A +P +NG+ PH
Sbjct: 252 AHH-----------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA 286
Query: 293 VVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSG 350
+ +Q L P +V GA VS G NSG
Sbjct: 287 ---------------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSG 319
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 320 QMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 379
Query: 411 RRMWKWLQEPEFQRMSALRLA 431
RRMWKWLQEPEFQRMSALRLA
Sbjct: 380 RRMWKWLQEPEFQRMSALRLA 400
>gi|431895985|gb|ELK05403.1| Hepatocyte nuclear factor 6 [Pteropus alecto]
Length = 464
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 156/349 (44%), Positives = 192/349 (55%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 79 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 136
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT Y K Q SP +GL +I +Q+
Sbjct: 137 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGSIHNSQQGL 196
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 197 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLSRHGEQHLT 234
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 235 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 259
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 260 HGQLLSTAREPNPSVTGAQVSSGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 319
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 320 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 368
>gi|291402980|ref|XP_002717789.1| PREDICTED: one cut homeobox 1 [Oryctolagus cuniculus]
Length = 466
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 157/349 (44%), Positives = 192/349 (55%), Gaps = 82/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHHPH 137
Query: 155 -----TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQEN 205
GNV+GSFT+M++ GL +N+ YT Y K S Q SP +GL I +Q+
Sbjct: 138 HHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVASMGQSLSPLSGSGLGGIHNSQQG 197
Query: 206 SPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHS 264
P Y+ G P K L+PN +++ + P + G H
Sbjct: 198 LP-----HYAHPGAAMPADKMLTPNGFEAHH-----------------PAMLGRHGEQHL 235
Query: 265 SPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQ 324
+PT+ A +P +NG+ PH + +Q
Sbjct: 236 TPTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQ 260
Query: 325 QTATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRV 382
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRV
Sbjct: 261 GHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRV 320
Query: 383 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 369
>gi|426233286|ref|XP_004010648.1| PREDICTED: hepatocyte nuclear factor 6 [Ovis aries]
Length = 464
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 152/333 (45%), Positives = 185/333 (55%), Gaps = 79/333 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 93 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 152
Query: 166 QNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT Y K Q SP +GL I +Q+ P Y+ G
Sbjct: 153 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGAAM 207
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H +PT+ A +P
Sbjct: 208 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP 243
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 244 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 275
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 276 TGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 335
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 336 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 368
>gi|194034700|ref|XP_001929026.1| PREDICTED: hepatocyte nuclear factor 6 [Sus scrofa]
Length = 465
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 152/333 (45%), Positives = 185/333 (55%), Gaps = 79/333 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 93 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 152
Query: 166 QNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT Y K Q SP +GL I +Q+ P Y+ G
Sbjct: 153 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGAAM 207
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H +PT+ A +P
Sbjct: 208 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP 243
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 244 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 275
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 276 TGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 335
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 336 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 368
>gi|300793892|ref|NP_001178090.1| hepatocyte nuclear factor 6 [Bos taurus]
gi|296483131|tpg|DAA25246.1| TPA: one cut homeobox 1 [Bos taurus]
gi|440893322|gb|ELR46134.1| Hepatocyte nuclear factor 6 [Bos grunniens mutus]
Length = 465
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 152/333 (45%), Positives = 185/333 (55%), Gaps = 79/333 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 93 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 152
Query: 166 QNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT Y K Q SP +GL I +Q+ P Y+ G
Sbjct: 153 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGAAM 207
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H +PT+ A +P
Sbjct: 208 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP 243
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 244 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 275
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 276 TGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 335
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 336 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 368
>gi|348572296|ref|XP_003471929.1| PREDICTED: hepatocyte nuclear factor 6 [Cavia porcellus]
Length = 465
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 185/335 (55%), Gaps = 81/335 (24%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH-----------------TGNVTGSFT 163
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT
Sbjct: 92 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHHPPHHHQRLAGNVSGSFT 151
Query: 164 VMQNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGL 219
+M++ GL +N+ YT Y K Q SP +GL I +Q+ P Y+ G
Sbjct: 152 LMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGA 206
Query: 220 NSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAAT 278
P K L+PN +++ + P + G H +PT+ A
Sbjct: 207 AMPADKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGM 242
Query: 279 IPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTN 338
+P +NG+ PH + +Q L P
Sbjct: 243 VP-INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNP 274
Query: 339 AVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRN 396
+V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRN
Sbjct: 275 SVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRN 334
Query: 397 PKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 335 PKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 369
>gi|432113976|gb|ELK36033.1| Hepatocyte nuclear factor 6 [Myotis davidii]
Length = 399
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 173/318 (54%), Gaps = 81/318 (25%)
Query: 139 PLPPISTMSDKFAYGH-----------------TGNVTGSFTVMQNNIGLG-INSHYT-Y 179
PLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y
Sbjct: 16 PLPPISTVSDKFPHHHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPY 75
Query: 180 DKIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYN 236
K Q SP +GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 76 HKDVAGMGQSLSPLSGSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH- 129
Query: 237 QRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVS 296
P + G H +PT+ A +P +NG+ PH
Sbjct: 130 ----------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA---- 161
Query: 297 PQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEE 354
+ +Q L P +V GA VS G NSG++EE
Sbjct: 162 -----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEE 198
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
INTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW
Sbjct: 199 INTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 258
Query: 415 KWLQEPEFQRMSALRLAG 432
KWLQEPEFQRMSALRLAG
Sbjct: 259 KWLQEPEFQRMSALRLAG 276
>gi|49183028|gb|AAT57874.1| Hnf6 beta-a [Patiria miniata]
Length = 477
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 196/356 (55%), Gaps = 71/356 (19%)
Query: 87 GISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTM 146
G S+AS+++ +FR++ A+ L N+NG + S+++ TLTPLQPLPPIST+
Sbjct: 64 GFGASLASMLEP-DFRSV-ADYGMGPL-NMNGYHQQSAATSSNNFTTLTPLQPLPPISTV 120
Query: 147 SDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKIQMSPPQHYSPNNGLSTIVITQENS 206
SDKF + NV+GSFT+M + G++S+ YDK+ N G S +V
Sbjct: 121 SDKFRH-EASNVSGSFTLMNGSNLHGMSSN-PYDKLG-------GMNMGHSFMV------ 165
Query: 207 PLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSP 266
+S G NS P+ +S+ G GS P+ G++ +P
Sbjct: 166 ------AHSGMGHNSVLNGYPPH-------------QSMGGYGS------PHHGTMSCTP 200
Query: 267 TTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLI----SNQLQRSDSPGPPKII 322
Q MNG T H +SP H S+ QRS P ++
Sbjct: 201 -------QDPRYASPMNGYEQFTHHA------SSPHHMRALQPPSHHHQRSGLTSP--MV 245
Query: 323 SQQTATLTLKTVTPTNAV-------VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQ 375
A +T T N+V V GN+GE E+NTKE+A R+++ELKRYSIPQ
Sbjct: 246 LNGYAGMTGHTHPSPNSVGRSGDRGSSRVDSGNTGE--EVNTKEVAGRVTSELKRYSIPQ 303
Query: 376 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
A+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 304 AVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 359
>gi|114657130|ref|XP_001170549.1| PREDICTED: hepatocyte nuclear factor 6 [Pan troglodytes]
Length = 465
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 159/376 (42%), Positives = 203/376 (53%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT--- 178
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 179 YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
D M ++GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSSSGLGSIHNSQQGLP-----HYAHPGTTMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|402874366|ref|XP_003901010.1| PREDICTED: hepatocyte nuclear factor 6 [Papio anubis]
Length = 465
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 159/376 (42%), Positives = 202/376 (53%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 52 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 109
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT--- 178
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT
Sbjct: 110 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 169
Query: 179 YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
D M ++GL I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 170 KDVAGMGQSLSPLSSSGLGGIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 222
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 223 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 254
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 255 ----------------------HLNTQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 292
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 293 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 352
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 353 WLQEPEFQRMSALRLA 368
>gi|410931443|ref|XP_003979105.1| PREDICTED: one cut domain family member 2-like, partial [Takifugu
rubripes]
Length = 346
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 155/386 (40%), Positives = 199/386 (51%), Gaps = 89/386 (23%)
Query: 67 APQDDSRDSEILSPGKISPSGISVSVASIMDSS-EFRNLHA--EPTYQTLTNVNGRLSPP 123
A Q + ++S G+ S + S+ASI++ + ++R HA P + +T
Sbjct: 29 ARQSSASHRNLVSHGR---SAMVSSMASILEGAGDYRTDHALSGPLHPAMTMS----CDS 81
Query: 124 GYSPSSSYATLTPLQPLPPISTMSDKFAYGH-------------TGNVTGSFTVMQNNIG 170
G S SS+Y TLTPLQ LPPIST+S+KF + H GNV+GSFT+M+++
Sbjct: 82 GMSLSSTYTTLTPLQHLPPISTVSEKFHHPHPHAHHHPAHQRLAAGNVSGSFTLMRDDRS 141
Query: 171 LG----INSHYTYDKIQMSPPQHYSP-NNGLSTIVITQENSPLSPQSTYSQNGLNSPQKS 225
L I HY D M P SP +NGL ++ S Q T S G P
Sbjct: 142 LASMGNIYGHYPKDMSGMGQP--LSPLSNGLGSLH--------SSQQTLSAYG---PGAH 188
Query: 226 LSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGI 285
LS D+ + GG +S + G H + GA + ++NGI
Sbjct: 189 LSN-------------DKIISSGGFESHAAMLSRGEEHLARGL---GGHGAGLMASLNGI 232
Query: 286 TTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVS 345
+ P + S++S++ +QT +
Sbjct: 233 HHHS------HPHSQANGSMLSDR-------------DRQTVVGGGQGAGSGQV------ 267
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 268 -------EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKS 320
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 GRETFRRMWKWLQEPEFQRMSALRLA 346
>gi|355692730|gb|EHH27333.1| Hepatocyte nuclear factor 6, partial [Macaca mulatta]
Length = 435
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 159/376 (42%), Positives = 203/376 (53%), Gaps = 92/376 (24%)
Query: 89 SVSVASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPL 137
S+ +AS++D S++ + H A P + T+T +PPG S ++Y TLTPL
Sbjct: 22 SMGMASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPL 79
Query: 138 QPLPPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT--- 178
QPLPPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT
Sbjct: 80 QPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYH 139
Query: 179 YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQ 237
D M ++GL +I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 140 KDVAGMGQSLSPLSSSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH-- 192
Query: 238 RDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSP 297
P + G H +PT+ A +P +NG+ PH
Sbjct: 193 ---------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA----- 224
Query: 298 QTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEI 355
+ +Q L P +V GA VS G NSG++EEI
Sbjct: 225 ----------------------HLNTQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEI 262
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK
Sbjct: 263 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 322
Query: 416 WLQEPEFQRMSALRLA 431
WLQEPEFQRMSALRLA
Sbjct: 323 WLQEPEFQRMSALRLA 338
>gi|281345940|gb|EFB21524.1| hypothetical protein PANDA_013121 [Ailuropoda melanoleuca]
Length = 435
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/329 (45%), Positives = 183/329 (55%), Gaps = 74/329 (22%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------TGNVTGSFTVMQNNIG 170
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M++ G
Sbjct: 68 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHPHHHQRLAGNVSGSFTLMRDERG 127
Query: 171 LG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKS 225
L +N+ YT D M ++GL I +Q+ P Y+ G P K
Sbjct: 128 LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGGIHNSQQGLP-----HYAHPGAAMPTDKM 182
Query: 226 LSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGI 285
L+PN +++ + P + G H +PT+ A +P +NG+
Sbjct: 183 LTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP-INGL 217
Query: 286 TTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-V 344
PH + +Q L P +V GA V
Sbjct: 218 PPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSVTGAQV 250
Query: 345 SLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
S G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL
Sbjct: 251 SNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 310
Query: 404 KSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 311 KSGRETFRRMWKWLQEPEFQRMSALRLAA 339
>gi|301776949|ref|XP_002923894.1| PREDICTED: hepatocyte nuclear factor 6-like [Ailuropoda
melanoleuca]
Length = 434
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/329 (45%), Positives = 183/329 (55%), Gaps = 74/329 (22%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------TGNVTGSFTVMQNNIG 170
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M++ G
Sbjct: 67 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHPHHHQRLAGNVSGSFTLMRDERG 126
Query: 171 LG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKS 225
L +N+ YT D M ++GL I +Q+ P Y+ G P K
Sbjct: 127 LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGGIHNSQQGLP-----HYAHPGAAMPTDKM 181
Query: 226 LSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGI 285
L+PN +++ + P + G H +PT+ A +P +NG+
Sbjct: 182 LTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP-INGL 216
Query: 286 TTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-V 344
PH + +Q L P +V GA V
Sbjct: 217 PPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSVTGAQV 249
Query: 345 SLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
S G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL
Sbjct: 250 SNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 309
Query: 404 KSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 310 KSGRETFRRMWKWLQEPEFQRMSALRLAA 338
>gi|395822210|ref|XP_003784416.1| PREDICTED: hepatocyte nuclear factor 6 [Otolemur garnettii]
Length = 465
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 190/349 (54%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GL +I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLSRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 369
>gi|24307887|ref|NP_004489.1| hepatocyte nuclear factor 6 [Homo sapiens]
gi|297696684|ref|XP_002825515.1| PREDICTED: hepatocyte nuclear factor 6 [Pongo abelii]
gi|332235594|ref|XP_003266991.1| PREDICTED: hepatocyte nuclear factor 6 [Nomascus leucogenys]
gi|13627184|sp|Q9UBC0.1|HNF6_HUMAN RecName: Full=Hepatocyte nuclear factor 6; Short=HNF-6; AltName:
Full=One cut domain family member 1; AltName: Full=One
cut homeobox 1
gi|4100316|gb|AAD00826.1| hepatocyte nuclear factor-6 alpha [Homo sapiens]
gi|4104419|gb|AAD02033.1| hepatocyte nuclear factor 6 [Homo sapiens]
gi|119597867|gb|EAW77461.1| one cut domain, family member 1 [Homo sapiens]
gi|187954629|gb|AAI40831.1| One cut homeobox 1 [Homo sapiens]
gi|208966968|dbj|BAG73498.1| one cut homeobox 1 [synthetic construct]
Length = 465
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 190/349 (54%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GL +I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 369
>gi|397515704|ref|XP_003828087.1| PREDICTED: hepatocyte nuclear factor 6 [Pan paniscus]
Length = 503
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 184/334 (55%), Gaps = 79/334 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 131 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 190
Query: 166 QNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT D M ++GL +I +Q+ P Y+ G
Sbjct: 191 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLP-----HYAHPGAAM 245
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H +PT+ A +P
Sbjct: 246 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP 281
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 282 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 313
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 314 TGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 373
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 374 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 407
>gi|410961217|ref|XP_003987180.1| PREDICTED: hepatocyte nuclear factor 6 [Felis catus]
Length = 465
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 189/349 (54%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GL I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGGIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 369
>gi|345794965|ref|XP_851227.2| PREDICTED: hepatocyte nuclear factor 6 isoform 2 [Canis lupus
familiaris]
Length = 465
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 190/349 (54%), Gaps = 81/349 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GLS I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLSGIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ + P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 369
>gi|5514674|emb|CAB50769.1| hepatocyte nuclear factor 6 [Homo sapiens]
Length = 368
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 189/348 (54%), Gaps = 81/348 (23%)
Query: 106 AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH----------- 154
A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 80 AGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHH 137
Query: 155 ----TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENS 206
GNV+GSFT+M++ GL +N+ YT D M ++GL +I +Q+
Sbjct: 138 HQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGL 197
Query: 207 PLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSS 265
P Y+ G P K L+PN +++ P + G H +
Sbjct: 198 P-----HYAHPGAAMPTDKMLTPNGFET-----------------HHPAMLGRHGEQHLT 235
Query: 266 PTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQ 325
PT+ A +P +NG+ PH + +Q
Sbjct: 236 PTS-------AGMVP-INGLPPHHPHA---------------------------HLNAQG 260
Query: 326 TATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVL
Sbjct: 261 HGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVL 320
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 368
>gi|307178290|gb|EFN67062.1| Hepatocyte nuclear factor 6 [Camponotus floridanus]
Length = 238
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 136/277 (49%), Positives = 149/277 (53%), Gaps = 83/277 (29%)
Query: 165 MQNN----IGLGIN-SHYTYDKI---QMSPPQHY-SPNNGLSTIVITQENSPLSPQSTYS 215
MQNN IGLG+ S Y YDK+ MSPP +Y SP GL + SPLSPQS YS
Sbjct: 1 MQNNGLGNIGLGMGGSPYAYDKLPSMGMSPPHNYPSPGAGL-------QPSPLSPQSAYS 53
Query: 216 QNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQG 275
Q+GLNSP KS + YD + R L SP L
Sbjct: 54 QSGLNSPHKSSASPHYDPAFLPR-----------------------LQQSPAAL------ 84
Query: 276 AATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVT 335
+ +P V + +P H PG +S T
Sbjct: 85 ----------SPSSPPAVTATASFAPSH---------HSPPGTHSTVS-----------T 114
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
+N G V EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR
Sbjct: 115 ASNGGAGEV--------EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 166
Query: 396 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 167 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 203
>gi|395503264|ref|XP_003755990.1| PREDICTED: hepatocyte nuclear factor 6 [Sarcophilus harrisii]
Length = 460
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 161/372 (43%), Positives = 201/372 (54%), Gaps = 88/372 (23%)
Query: 89 SVSVASIMDSSEFRNLHAEPTYQTL------TNVNGRLSPPGYSPSSSYATLTPLQPLPP 142
S+ +ASI+DS + H P L T +PPG S +S+Y+ L PLQPLPP
Sbjct: 51 SMGMASILDSGGDYHHHHRPPEHALAGPLHPTMTMACETPPGMSMTSTYSPLPPLQPLPP 110
Query: 143 ISTMSDKFA------------YGH---TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMS 185
IST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y K
Sbjct: 111 ISTVSDKFPHHHHHHHXXXHPHHHQRIAGNVSGSFTLMRDERGLASMNNLYTPYHKDVPG 170
Query: 186 PPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVD 242
Q SP +GL I +Q+ P Y+ P +K L+PN +++ +
Sbjct: 171 MGQSLSPLSGSGLGGIHNSQQGLP-----HYAHPSATMPAEKMLTPNGFEAHH------- 218
Query: 243 RSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQ 302
P + G H +PT+ A +P +NG+ P
Sbjct: 219 ----------PAMLARHGEQHLTPTS-------AGMVP-INGL---------------PH 245
Query: 303 HSLISNQLQRSDSPGPPKIISQQTATLTLKTVT-PTNAVVGA-VSLG-NSGELEEINTKE 359
H P ++ Q L T P +V G+ VS G NSG++EEINTKE
Sbjct: 246 H--------------PHAHLNAQGHGQILGTAREPNPSVTGSQVSNGSNSGQMEEINTKE 291
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 419
+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE
Sbjct: 292 VAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 351
Query: 420 PEFQRMSALRLA 431
PEFQRMSALRLA
Sbjct: 352 PEFQRMSALRLA 363
>gi|403302060|ref|XP_003941684.1| PREDICTED: uncharacterized protein LOC101039550 [Saimiri
boliviensis boliviensis]
Length = 1007
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 154/359 (42%), Positives = 195/359 (54%), Gaps = 81/359 (22%)
Query: 95 IMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH 154
+++S+ + A P + T+T +PPG S ++Y TLTPLQPLPPIST+SDKF + H
Sbjct: 611 LLESNRLWHSLAGPLHPTMTMACE--TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHH 668
Query: 155 ---------------TGNVTGSFTVMQNNIGLG-INSHYT---YDKIQMSPPQHYSPNNG 195
GNV+GSFT+M++ GL +N+ YT D M ++G
Sbjct: 669 HHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSG 728
Query: 196 LSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPT 254
L +I +Q+ P Y+ G P K L+PN +++ P
Sbjct: 729 LGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEA-----------------HHPA 766
Query: 255 LSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSD 314
+ G H +PT+ A +P +NG+ PH
Sbjct: 767 MLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA---------------------- 796
Query: 315 SPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYS 372
+ +Q L P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYS
Sbjct: 797 -----HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYS 851
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 852 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 910
>gi|170676473|gb|ACB30402.1| one cut homeobox 1 [Homo sapiens]
Length = 381
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 184/334 (55%), Gaps = 79/334 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 9 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 68
Query: 166 QNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT D M ++GL +I +Q+ P Y+ G
Sbjct: 69 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLP-----HYAHPGAAM 123
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H +PT+ A +P
Sbjct: 124 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS-------AGMVP 159
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 160 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 191
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 192 TGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 251
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 252 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 285
>gi|426379127|ref|XP_004056256.1| PREDICTED: hepatocyte nuclear factor 6 [Gorilla gorilla gorilla]
Length = 465
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 181/333 (54%), Gaps = 79/333 (23%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH---------------TGNVTGSFTVM 165
+PPG S ++Y TLTPLQPLPPIST+SDKF + H GNV+GSFT+M
Sbjct: 93 TPPGMSMPTTYTTLTPLQPLPPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLM 152
Query: 166 QNNIGLG-INSHYT---YDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ GL +N+ YT D M ++ L +I +Q+ P Y+ G
Sbjct: 153 RDERGLASMNNLYTPYHKDVAGMGQSLSPLSSSSLGSIHNSQQGLP-----HYAHPGAAM 207
Query: 222 P-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP 280
P K L+PN +++ + P + G H + T+ A +P
Sbjct: 208 PTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTSTS-------AGMVP 243
Query: 281 TMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV 340
+NG+ PH + +Q L P +V
Sbjct: 244 -INGLPPHHPHA---------------------------HLNAQGHGQLLGTAREPNPSV 275
Query: 341 VGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
G VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 276 TGGQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 335
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 336 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 368
>gi|68399729|ref|XP_694209.1| PREDICTED: one cut domain family member 2 [Danio rerio]
Length = 451
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 148/368 (40%), Positives = 182/368 (49%), Gaps = 84/368 (22%)
Query: 86 SGISVSVASIMDSS-EFRNLHA--EPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQP-LP 141
S + S+ASI+D + E+R H+ P + +T S SS+Y TLTPLQ LP
Sbjct: 45 SAMVSSMASILDGAGEYRTDHSLSGPLHPAMTMS----CDSSMSLSSTYTTLTPLQHHLP 100
Query: 142 PISTMSDKFAYGH--------------TGNVTGSFTVMQNNIGLG----INSHYTYDKIQ 183
PIS +SDKF + GNV+GSFT+M+++ GL + HY+ D
Sbjct: 101 PISNVSDKFHHHPHPHAHHHPAHQRLAAGNVSGSFTLMRDDRGLASMSNLYGHYSKDLSG 160
Query: 184 MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDR 243
M P PLSP S NGL S S S P D+
Sbjct: 161 MGP--------------------PLSPLS----NGLGSLHNSQQTLSAYGPSAHLG-NDK 195
Query: 244 SVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQH 303
+ GG +S G H + GA + ++NGI H+ S
Sbjct: 196 MISSGGFESHAAMLGRGEEHLARGL---GGHGAGIMSSLNGINHHHHHSHSHSQANGS-- 250
Query: 304 SLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQR 363
++S++ + V EEINTKE+AQR
Sbjct: 251 -----------------VLSERDRQAGAGGQGGGSGQV-----------EEINTKEVAQR 282
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ 423
I+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
Sbjct: 283 ITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ 342
Query: 424 RMSALRLA 431
RMSALRLA
Sbjct: 343 RMSALRLA 350
>gi|344243763|gb|EGV99866.1| One cut domain family member 2 [Cricetulus griseus]
Length = 431
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 158/284 (55%), Gaps = 64/284 (22%)
Query: 155 TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSP 210
GNV+GSFT+M++ GL +N+ YT Y K Q SP +GL +I +Q+ P
Sbjct: 82 AGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGSIHNSQQGLP--- 138
Query: 211 QSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTL 269
Y+ G P K L+PN +++ + P + G H +PT+
Sbjct: 139 --HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS- 178
Query: 270 VTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATL 329
A +P +NG+ PH + +Q L
Sbjct: 179 ------AGMVP-INGLPPHHPHA---------------------------HLNAQGHGQL 204
Query: 330 TLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ 387
P +V G+ VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQ
Sbjct: 205 LGTAREPNPSVTGSQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQ 264
Query: 388 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 265 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 308
>gi|354465194|ref|XP_003495065.1| PREDICTED: hypothetical protein LOC100765766 [Cricetulus griseus]
Length = 405
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 158/284 (55%), Gaps = 64/284 (22%)
Query: 155 TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQMSPPQHYSP--NNGLSTIVITQENSPLSP 210
GNV+GSFT+M++ GL +N+ YT Y K Q SP +GL +I +Q+ P
Sbjct: 82 AGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLSGSGLGSIHNSQQGLP--- 138
Query: 211 QSTYSQNGLNSP-QKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTL 269
Y+ G P K L+PN +++ + P + G H +PT+
Sbjct: 139 --HYAHPGAAMPTDKMLTPNGFEAHH-----------------PAMLGRHGEQHLTPTS- 178
Query: 270 VTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATL 329
A +P +NG+ PH + +Q L
Sbjct: 179 ------AGMVP-INGLPPHHPHA---------------------------HLNAQGHGQL 204
Query: 330 TLKTVTPTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ 387
P +V G+ VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQ
Sbjct: 205 LGTAREPNPSVTGSQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQ 264
Query: 388 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 265 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 308
>gi|345305696|ref|XP_001510495.2| PREDICTED: one cut domain family member 2-like [Ornithorhynchus
anatinus]
Length = 583
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 164/325 (50%), Gaps = 92/325 (28%)
Query: 136 PLQPLPPISTMSDKFAYGH------------------TGNVTGSFTVMQNNIGL-GINSH 176
PLQPLPPIST+SDKF +GNV+GSFT+M++ GL +N+
Sbjct: 229 PLQPLPPISTVSDKFHPHPHPHPHHHHHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNL 288
Query: 177 YTYDKIQMSPPQHYSP------NNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
Y K Q SP NGL ++ +PQ G K L PN
Sbjct: 289 YGPYKEMPGMGQSLSPLAAAPLGNGLGSLH--------TPQQGLHNYGPPGHDKMLGPN- 339
Query: 231 YDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGIT---T 287
+D+ + ++L G Q P L T AA + +NGI
Sbjct: 340 FDAHHT-------AMLARGDQ------------HLPRGLGTP-PAAAMMSHLNGIHHPGH 379
Query: 288 ITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLG 347
PH V+ P D P SQ V GA
Sbjct: 380 PQPHGPVLGPG--------------RDRPPSSSSASQ---------------VAGA---- 406
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 407
G+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR
Sbjct: 407 --GQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 464
Query: 408 ETFRRMWKWLQEPEFQRMSALRLAG 432
ETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 465 ETFRRMWKWLQEPEFQRMSALRLAA 489
>gi|410903215|ref|XP_003965089.1| PREDICTED: one cut domain family member 2-like [Takifugu rubripes]
Length = 452
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 191/356 (53%), Gaps = 73/356 (20%)
Query: 92 VASIMDSS-EFRNLHAEPTYQTLTNVNGRLSPP-GYSPSSSYATLTPLQPLPPISTMSDK 149
+A+I+D S E+R + P + ++ SP G S +Y TLTPLQPLPPIST+SDK
Sbjct: 58 MATILDGSGEYRPELSLPLHHAMSVPCDTSSPGMGMGMSGTYTTLTPLQPLPPISTVSDK 117
Query: 150 FAYGH-------TGNVTGSFTVMQNNIGL-GINSHYT-YDKIQMSP-PQHYSP--NNGLS 197
F + H +GNV+GSFT+M++ GL G+N+ Y+ Y K MS Q+ SP NGLS
Sbjct: 118 FHHHHHHHHQRLSGNVSGSFTLMRDERGLPGMNNLYSPYHKDHMSGMGQNLSPVLGNGLS 177
Query: 198 TIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD--SPYNQRDLVDRSVLGGGSQSPTL 255
+I TQ++ L ST + G + K L+ +++ S + D LGG +
Sbjct: 178 SIHNTQQS--LHGYSTTTHGGHD---KMLNFDTHHTASMLARADHHQHRGLGGTA----- 227
Query: 256 SPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDS 315
A +P +NG+ H SN +S
Sbjct: 228 --------------------AGMMPHLNGM-----------------HHHHSNPHLQSPP 250
Query: 316 PGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQ 375
GP +++ + T G + N+ E+ + T EL KRYSIPQ
Sbjct: 251 HGP--VLASSRERPPSSSGTQGVNSSGQLEEINTKEVAQRITAEL--------KRYSIPQ 300
Query: 376 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 301 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 356
>gi|432853533|ref|XP_004067754.1| PREDICTED: one cut domain family member 2-like [Oryzias latipes]
Length = 443
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 149/394 (37%), Positives = 195/394 (49%), Gaps = 105/394 (26%)
Query: 67 APQDDSRDSEILSPGKISPSGISVSVASIMDSS-EFRNLHAEPTYQTLTNVNGRLSPP-- 123
A Q + ++S G+ S + S+ASI++ + ++R HA ++G L P
Sbjct: 29 ARQSSASHRNLVSHGR---SAMVSSMASILEGAGDYRTDHA---------LSGPLHPAMT 76
Query: 124 -----GYSPSSSYATLTPLQPLPPISTMSDKFAYGH-------------TGNVTGSFTVM 165
G S SS+Y TLTPLQ LPPIST+S+KF + H GNV+GSFT+M
Sbjct: 77 MSCDSGMSLSSTYTTLTPLQHLPPISTVSEKFHHPHPHPHHHQVHQRLSAGNVSGSFTLM 136
Query: 166 QNNIGLG----INSHYTYDKIQMSPPQHYSP-NNGLSTIVITQENSPLSPQSTYSQNGLN 220
++ L + HY D M Q SP +NGL ++ +Q+ T+ N
Sbjct: 137 RDERSLASMSNLYGHYPKDMSGMG--QSLSPLSNGLGSLHSSQQTLGAYGPGTHLSN--- 191
Query: 221 SPQKSLSPNSYDSPYNQRDLVDRSV---LGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAA 277
K LS S++S D + LGG GA
Sbjct: 192 --DKMLSSGSFESHAAMLSRGDEHLARGLGG-------------------------HGAG 224
Query: 278 TIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPT 337
+ ++NGI + P + S++S++ +++ G S Q + K V
Sbjct: 225 LMTSLNGIHHHS------HPHSQANGSMLSDRDRQTVVGGGQGAGSGQVEEINTKEV--- 275
Query: 338 NAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNP
Sbjct: 276 -----------------------AQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNP 312
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 313 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 346
>gi|444514630|gb|ELV10615.1| Hepatocyte nuclear factor 6 [Tupaia chinensis]
Length = 376
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 166/318 (52%), Gaps = 86/318 (27%)
Query: 143 ISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YDK---- 181
IST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y K
Sbjct: 19 ISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDGAG 78
Query: 182 -----IQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPY 235
+S +GL I +Q+ P Y+ G P K L+PN +++ +
Sbjct: 79 MGQSLSPLSQSLSPLSGSGLGGIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEAHH 133
Query: 236 NQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVV 295
P + G H +PT+ A +P +NG+ PH
Sbjct: 134 -----------------PAMLGRHGEQHLTPTS-------AGMVP-INGLPPHHPHA--- 165
Query: 296 SPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA-VSLG-NSGELE 353
+ +Q L P +V GA VS G NSG++E
Sbjct: 166 ------------------------HLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQME 201
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
EINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM
Sbjct: 202 EINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 261
Query: 414 WKWLQEPEFQRMSALRLA 431
WKWLQEPEFQRMSALRLA
Sbjct: 262 WKWLQEPEFQRMSALRLA 279
>gi|6708088|gb|AAF25796.1|AF169227_1 cut-homeodomain protein [Drosophila melanogaster]
Length = 1081
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/155 (64%), Positives = 112/155 (72%), Gaps = 17/155 (10%)
Query: 280 PTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNA 339
P +NG TP +V V Q SP S + N Q S SQQ + + T +
Sbjct: 698 PGING----TPQSVCVIHQQSP--SALGNGFQSS---------SQQAKVSSCASPKGTVS 742
Query: 340 VVGAVS--LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
GAVS + NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP
Sbjct: 743 SGGAVSNRIANSSDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 802
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 803 KPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 837
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 65/102 (63%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS+ L + +YQTLT+VN
Sbjct: 266 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSASREQLLSHSSYQTLTSVND 324
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 325 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 363
>gi|16197895|gb|AAL13705.1| GH28062p [Drosophila melanogaster]
Length = 976
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/155 (64%), Positives = 112/155 (72%), Gaps = 17/155 (10%)
Query: 280 PTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNA 339
P +NG TP +V V Q SP S + N Q S SQQ + + T +
Sbjct: 593 PGING----TPQSVCVIHQQSP--SALGNGFQSS---------SQQAKVSSCASPKGTVS 637
Query: 340 VVGAVS--LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
GAVS + NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP
Sbjct: 638 SGGAVSNRIANSSDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 697
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 698 KPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 732
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 64/102 (62%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQTLT+VN
Sbjct: 161 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSRSREQLLSHSSYQTLTSVND 219
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 220 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 258
>gi|24638658|ref|NP_524842.2| onecut, isoform A [Drosophila melanogaster]
gi|442614427|ref|NP_001259065.1| onecut, isoform B [Drosophila melanogaster]
gi|13632097|sp|Q9NJB5.2|ONEC_DROME RecName: Full=Homeobox protein onecut
gi|10726317|gb|AAF59330.2| onecut, isoform A [Drosophila melanogaster]
gi|440218152|gb|AGB96555.1| onecut, isoform B [Drosophila melanogaster]
Length = 1081
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/155 (64%), Positives = 112/155 (72%), Gaps = 17/155 (10%)
Query: 280 PTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNA 339
P +NG TP +V V Q SP S + N Q S SQQ + + T +
Sbjct: 698 PGING----TPQSVCVIHQQSP--SALGNGFQSS---------SQQAKVSSCASPKGTVS 742
Query: 340 VVGAVS--LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
GAVS + NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP
Sbjct: 743 SGGAVSNRIANSSDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 802
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 803 KPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 837
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 64/102 (62%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQTLT+VN
Sbjct: 266 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSRSREQLLSHSSYQTLTSVND 324
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 325 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 363
>gi|322797315|gb|EFZ19433.1| hypothetical protein SINV_05413 [Solenopsis invicta]
Length = 654
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 87/107 (81%), Positives = 93/107 (86%), Gaps = 7/107 (6%)
Query: 325 QTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLC 384
Q ++ +KTV+ A G +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLC
Sbjct: 555 QHTSVVMKTVS-------AAGNGGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLC 607
Query: 385 RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 608 RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 654
>gi|328751785|gb|AEB39648.1| IP22193p [Drosophila melanogaster]
Length = 1018
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/155 (64%), Positives = 112/155 (72%), Gaps = 17/155 (10%)
Query: 280 PTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNA 339
P +NG TP +V V Q SP S + N Q S SQQ + + T +
Sbjct: 698 PGING----TPQSVCVIHQQSP--SALGNGFQSS---------SQQAKVSSCASPKGTVS 742
Query: 340 VVGAVS--LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
GAVS + NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP
Sbjct: 743 SGGAVSNRIANSSDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 802
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 803 KPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 837
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 64/102 (62%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQTLT+VN
Sbjct: 266 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSRSREQLLSHSSYQTLTSVND 324
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 325 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 363
>gi|47228342|emb|CAG07737.1| unnamed protein product [Tetraodon nigroviridis]
Length = 490
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 174/343 (50%), Gaps = 95/343 (27%)
Query: 120 LSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH------------TGNVTGSFTVMQN 167
+ G S S++Y LTPLQ LPPIS +SDKF + H +GNV+GSFT+M++
Sbjct: 82 MCETGMSLSNTYTALTPLQHLPPISNVSDKFHHHHSHHHAAAHQRLSSGNVSGSFTLMRD 141
Query: 168 NIGLGINSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLS 227
+ + GL+++ + P S +G+ SLS
Sbjct: 142 D------------------------HRGLASMSNLYSHYP----KEMSVSGMG--HGSLS 171
Query: 228 PNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIP----TMN 283
P LS GSLH+S L + + A +P ++
Sbjct: 172 P--------------------------LSSGLGSLHNSQQPL-SAYGPGAHLPGDSKMLS 204
Query: 284 GITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA 343
+T PH ++S ++ +H L RS +IS + GA
Sbjct: 205 PVTGFEPHATMLS-RSDQEH------LARSLGGHSHGMISNLNGMHHHPHSHLHSQANGA 257
Query: 344 VSL------GNSG---------ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQG 388
V L G+ G + EEINTKE+AQRI+AELKRYSIPQAIFAQR+L RSQG
Sbjct: 258 VMLAERERHGHGGAQGVAGSGIQAEEINTKEVAQRITAELKRYSIPQAIFAQRILSRSQG 317
Query: 389 TLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
TLSDLLRNPKPWSKLKSGRETFRRMW+WLQEPEFQRMSALRLA
Sbjct: 318 TLSDLLRNPKPWSKLKSGRETFRRMWRWLQEPEFQRMSALRLA 360
>gi|395830749|ref|XP_003788481.1| PREDICTED: one cut domain family member 2 [Otolemur garnettii]
Length = 399
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 143/362 (39%), Positives = 190/362 (52%), Gaps = 74/362 (20%)
Query: 88 ISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMS 147
+ S+ASI+D ++R + P + ++ ++ SPPG S++Y TLTPLQPLPPIST+S
Sbjct: 1 MVTSMASILDGGDYRPELSIPLHHAMS-MSCDSSPPGMGMSNTYTTLTPLQPLPPISTVS 59
Query: 148 DKFAYGH----------------TGNVTGSFTVMQNNIGL-GINSHYTYDKIQMSPPQHY 190
DKF + H +GNV+GSFT+M++ GL +N+ Y SP Y
Sbjct: 60 DKFHHPHPHHHPHHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNLY-------SP---Y 109
Query: 191 SPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGS 250
G+S + +PL GL++ Q+SL PN Y P + +
Sbjct: 110 KEMPGMSQSLSPLAATPLG----NGLGGLHNAQQSL-PN-YGPPGHDK------------ 151
Query: 251 QSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQL 310
LSPN + H++ T +G + G TP ++S H
Sbjct: 152 ---MLSPNFEAHHTAMLT-----RGEQHLSRGLG----TPPAAMMSHLNGLHHP------ 193
Query: 311 QRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKR 370
+ + GP +++ + A G + N+ E+ + T E LKR
Sbjct: 194 GHTQTHGP--VLAPSRERPPSSSSGSQVATSGQLEEINTKEVAQRITAE--------LKR 243
Query: 371 YSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 430
YSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL
Sbjct: 244 YSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 303
Query: 431 AG 432
A
Sbjct: 304 AA 305
>gi|332030395|gb|EGI70102.1| Homeobox protein onecut [Acromyrmex echinatior]
Length = 229
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 87/107 (81%), Positives = 93/107 (86%), Gaps = 7/107 (6%)
Query: 325 QTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLC 384
Q ++ +KTV+ A G +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVLC
Sbjct: 111 QHTSVVMKTVS-------AAGNGGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVLC 163
Query: 385 RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 164 RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 210
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 35/58 (60%), Gaps = 8/58 (13%)
Query: 182 IQMSPPQHY-SPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQR 238
+ MSPP +Y SP GL + SPLSPQS YSQ+GLNSP KS + YD + R
Sbjct: 1 MSMSPPHNYPSPGAGL-------QPSPLSPQSAYSQSGLNSPHKSSASPHYDPAFLPR 51
>gi|49183030|gb|AAT57875.1| Hnf6 beta-b [Patiria miniata]
Length = 489
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 144/356 (40%), Positives = 191/356 (53%), Gaps = 72/356 (20%)
Query: 87 GISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTM 146
G S+AS+++ +FR++ A+ L N+NG + S+++ TLTPLQPLPPIST+
Sbjct: 64 GFGASLASMLEP-DFRSV-ADYGMGPL-NMNGYHQQSAATSSNNFTTLTPLQPLPPISTV 120
Query: 147 SDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKIQMSPPQHYSPNNGLSTIVITQENS 206
SDKF + NV+GSFT+M + G++S+ YDK+ N G S +V
Sbjct: 121 SDKFRH-EASNVSGSFTLMNGSNLHGMSSN-PYDKLG-------GMNMGHSFMV------ 165
Query: 207 PLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSP 266
+S G NS +P+ +S+ G G SP+ G++ +P
Sbjct: 166 ------AHSGMGHNSVLNGYAPH-------------QSMGGYG------SPHHGTMSCTP 200
Query: 267 TTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLI----SNQLQRSDSPGPPKII 322
Q MNG T H +SP H S+ QRS P ++
Sbjct: 201 -------QDPRYASPMNGYEQFTHHA------SSPHHMRALQPPSHHHQRSGLTSP--MV 245
Query: 323 SQQTATLTLKTVTPTNAV-------VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQ 375
A +T T N+V V GN+G EE+NTKE+A R+++ELKRYSIP+
Sbjct: 246 LNGYAGMTGHTHPSPNSVGRSGDRGSSRVDSGNTG--EEVNTKEVAGRVTSELKRYSIPK 303
Query: 376 AIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
+F + RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 304 -LFLPEGVVRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 358
>gi|301611997|ref|XP_002935504.1| PREDICTED: one cut domain family member 2-like [Xenopus (Silurana)
tropicalis]
Length = 417
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 167/320 (52%), Gaps = 88/320 (27%)
Query: 121 SPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH------TGNVTGSFTVMQNNIGLG-- 172
SPP + ++SY TLTPLQPL DKF + GNV GSFT+M+ + GLG
Sbjct: 71 SPP--ASANSYTTLTPLQPL------HDKFHHHQHHQCVPVGNVIGSFTLMREDRGLGPP 122
Query: 173 --INSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
SHY D I M P+ L+T+ S +G + +K +S N
Sbjct: 123 SNFYSHYPKD-ISMGQSLSSLPSPNLTTM-----------HSYGHPSGHLANEKMVSGNG 170
Query: 231 YDSPYNQRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITP 290
+++ ++ +D+ S LSP P
Sbjct: 171 FEAHHSMFSRIDQHF------SRELSP-------------------------------PP 193
Query: 291 HTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSG 350
+ V SP + QH + ++ + + PP + +Q TA S
Sbjct: 194 SSGVGSPNSMHQHHYVHHRPEINCRQNPP-MQTQNTAI--------------------SN 232
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
+LEEINTK++AQRI AELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 233 QLEEINTKDVAQRIIAELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWSKLKSGRETF 292
Query: 411 RRMWKWLQEPEFQRMSALRL 430
+RMW+WLQEPEFQRM+ALRL
Sbjct: 293 KRMWRWLQEPEFQRMAALRL 312
>gi|195564300|ref|XP_002105760.1| GD24376 [Drosophila simulans]
gi|194201634|gb|EDX15210.1| GD24376 [Drosophila simulans]
Length = 851
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/145 (65%), Positives = 108/145 (74%), Gaps = 13/145 (8%)
Query: 289 TPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKT--VTPTNAVVGAVSL 346
TPH+V V Q S S + N Q S SQQ + ++ VT ++ G+
Sbjct: 636 TPHSVCVIHQQS--QSALGNGFQSS---------SQQAKVSSCESPKVTASSGGGGSSRN 684
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 685 ANSTDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 744
Query: 407 RETFRRMWKWLQEPEFQRMSALRLA 431
RETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 745 RETFRRMYKWLQEPEFQRMSALRMA 769
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 63/102 (61%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQ LT+VN
Sbjct: 274 EVLSVIVHSQDSDKEDCEENDDGD-AEGDLENEDDDERDSRSREQLLSHSSYQALTSVND 332
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 333 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 371
>gi|194913534|ref|XP_001982719.1| GG16442 [Drosophila erecta]
gi|190647935|gb|EDV45238.1| GG16442 [Drosophila erecta]
Length = 1085
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 93/150 (62%), Positives = 110/150 (73%), Gaps = 9/150 (6%)
Query: 290 PHTVVVSPQTSP-----QHSLISNQLQRSDSPGPPKIISQQTATLTLKT---VTPTNAVV 341
P +V+V Q SP HS+ + Q Q + G +S Q A ++ T ++
Sbjct: 692 PSSVIVE-QCSPGIHGTSHSVGAIQQQSQSALGNEFQVSSQQAKVSACASPKATVSSGGA 750
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
G+ N+ ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 751 GSSRNANASDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 810
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
KLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 811 KLKSGRETFRRMYKWLQEPEFQRMSALRMA 840
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/122 (46%), Positives = 71/122 (58%), Gaps = 5/122 (4%)
Query: 55 ESIKTEGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
ES E L+VIV QD D D E G + + DS L + +YQTL
Sbjct: 265 ESGHGEVLSVIVHSQDSDKEDGEENGDGDAEGDAENED-DNERDSRSREQLLSHSSYQTL 323
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GH-TGNVTGSFTVMQNNIGL 171
T VN RLS PG+S +SYATLTP+QPLPPISTMS+KFAY GH +G +G V ++ G
Sbjct: 324 TAVNDRLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAYSGHISGGDSGDTDVNGDDGGR 382
Query: 172 GI 173
G+
Sbjct: 383 GV 384
>gi|195354397|ref|XP_002043684.1| GM26776 [Drosophila sechellia]
gi|194128872|gb|EDW50915.1| GM26776 [Drosophila sechellia]
Length = 1085
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/146 (65%), Positives = 108/146 (73%), Gaps = 13/146 (8%)
Query: 289 TPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKT--VTPTNAVVGAVSL 346
TPH+V V Q S S + N Q S SQQ + ++ VT ++ G+
Sbjct: 707 TPHSVCVIHQQS--QSALGNGFQSS---------SQQAKVSSCESPKVTVSSGGGGSSRN 755
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
NS ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 756 ANSTDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 815
Query: 407 RETFRRMWKWLQEPEFQRMSALRLAG 432
RETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 816 RETFRRMYKWLQEPEFQRMSALRMAA 841
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 63/102 (61%), Gaps = 5/102 (4%)
Query: 60 EGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNG 118
E L+VIV QD D D E G + + DS L + +YQ LT+VN
Sbjct: 272 EVLSVIVHSQDSDKEDCEENDNGD-AEGDLENEDDDERDSRSREQLLSHSSYQALTSVND 330
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTG 160
RLS PG+S +SYATLTP+QPLPPISTMS+KFAY +G+++G
Sbjct: 331 RLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAY--SGHISG 369
>gi|348505378|ref|XP_003440238.1| PREDICTED: one cut domain family member 2-like [Oreochromis
niloticus]
Length = 504
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 80/83 (96%), Positives = 83/83 (100%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE
Sbjct: 326 SGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 385
Query: 409 TFRRMWKWLQEPEFQRMSALRLA 431
TFRRMWKWLQEPEFQRMSALRLA
Sbjct: 386 TFRRMWKWLQEPEFQRMSALRLA 408
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/128 (44%), Positives = 79/128 (61%), Gaps = 16/128 (12%)
Query: 92 VASIMDSS-EFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKF 150
+A+I+D S E+R + P + ++ V S PG S +Y TLTPLQPLPPIST+SDKF
Sbjct: 99 MATILDGSGEYRPELSLPLHHAMS-VPCDTSSPGMGMSGTYTTLTPLQPLPPISTVSDKF 157
Query: 151 AYGH---------TGNVTGSFTVMQNNIGL-GINSHYT-YDKIQMSP-PQHYSP--NNGL 196
+ H +GNV+GSFT+M++ GL G+N+ Y+ Y K MS Q+ SP NGL
Sbjct: 158 HHHHHHHHHHQRLSGNVSGSFTLMRDERGLPGMNNIYSPYHKDHMSGMGQNLSPVLGNGL 217
Query: 197 STIVITQE 204
+I TQ+
Sbjct: 218 GSIHNTQQ 225
>gi|126321625|ref|XP_001366432.1| PREDICTED: one cut domain family member 2 [Monodelphis domestica]
Length = 489
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 81/85 (95%), Positives = 84/85 (98%)
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 407
NSG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR
Sbjct: 311 NSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 370
Query: 408 ETFRRMWKWLQEPEFQRMSALRLAG 432
ETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 371 ETFRRMWKWLQEPEFQRMSALRLAA 395
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 80/147 (54%), Gaps = 27/147 (18%)
Query: 84 SPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPI 143
S S + S+ASI+D ++R + P + ++ ++ SPPG S++Y TLTPLQPLPPI
Sbjct: 85 SRSAMVTSMASILDGGDYRPELSIPLHHAMS-MSCDSSPPGMGMSNTYTTLTPLQPLPPI 143
Query: 144 STMSDKFAYGH-------------------TGNVTGSFTVMQNNIGL-GINSHYTYDKIQ 183
ST+SDKF + H +GNV+GSFT+M++ GL +N+ Y+ K
Sbjct: 144 STVSDKFHHPHPHHHPHHHPHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPYKEM 203
Query: 184 MSPPQHYSP------NNGLSTIVITQE 204
Q SP NGL TI +Q+
Sbjct: 204 SGMGQSLSPLAATPLGNGLGTIHNSQQ 230
>gi|345491020|ref|XP_003426511.1| PREDICTED: one cut domain family member 2-like [Nasonia
vitripennis]
Length = 504
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 80/82 (97%), Positives = 81/82 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E+EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 371 EVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 430
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWLQEPEFQRMSALRLA
Sbjct: 431 RRMWKWLQEPEFQRMSALRLAA 452
>gi|387915478|gb|AFK11348.1| one cut domain, family member 1 [Callorhinchus milii]
Length = 481
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 81/90 (90%), Positives = 86/90 (95%)
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
G+ + G SG+LEEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 273 GSQASGGSGQLEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 332
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 333 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 362
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 92/169 (54%), Gaps = 34/169 (20%)
Query: 91 SVASIMDS--SEFR----NLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPIS 144
S+ASI+DS S++ + A P + T+T +PPG S SS+Y TLTPLQPLPPIS
Sbjct: 50 SMASILDSPASDYHRPPEHALAAPLHPTMTMACE--TPPGMSMSSTYTTLTPLQPLPPIS 107
Query: 145 TMSDKFAYGHT-----------GNVTGSFTVMQNNIGL----GINSHYTYDKIQ-----M 184
T+SDKF + H GNV+GSFT+M+++ GL + SHY + ++ +
Sbjct: 108 TVSDKFPHHHHHHHHHHHQRLPGNVSGSFTLMRDDRGLSSMNNLYSHYHHKEVTGMGQGL 167
Query: 185 SPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYD 232
SP S NGL I +Q+ ++Y G P K L+PN +D
Sbjct: 168 SPLSGSSLANGLGAIHTSQQG-----LASYGHPGGPMPGDKMLTPNGFD 211
>gi|350426167|ref|XP_003494354.1| PREDICTED: hepatocyte nuclear factor 6-like [Bombus impatiens]
Length = 225
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 88/109 (80%), Positives = 93/109 (85%), Gaps = 10/109 (9%)
Query: 325 QTATLTLKTVTPTNAVVGAVSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVL 383
Q ++ +KTV P GN +GE+EEINTKELAQRISAELKRYSIPQAIFAQRVL
Sbjct: 13 QHTSVVMKTVPPP---------GNGAGEVEEINTKELAQRISAELKRYSIPQAIFAQRVL 63
Query: 384 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 64 CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 112
>gi|195064440|ref|XP_001996570.1| GH24019 [Drosophila grimshawi]
gi|193892116|gb|EDV90982.1| GH24019 [Drosophila grimshawi]
Length = 1178
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 86/111 (77%), Positives = 95/111 (85%), Gaps = 2/111 (1%)
Query: 322 ISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQR 381
+Q+ L+ K+ P N+ G+ S S +LEEINTKELAQRISAELKRYSIPQAIFAQR
Sbjct: 707 FNQEQPKLSPKSAAPLNST-GSAS-NRSSDLEEINTKELAQRISAELKRYSIPQAIFAQR 764
Query: 382 VLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
VLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 765 VLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMFKWLQEPEFQRMSALRMAA 815
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/40 (77%), Positives = 34/40 (85%), Gaps = 2/40 (5%)
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GHTGN 157
R+S P YSP+S YATLTP+QPLPPISTMSDKFAY GH N
Sbjct: 367 RISAPIYSPTS-YATLTPIQPLPPISTMSDKFAYTGHISN 405
>gi|109081188|ref|XP_001084951.1| PREDICTED: hepatocyte nuclear factor 6-like [Macaca mulatta]
Length = 363
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 86/98 (87%), Positives = 91/98 (92%), Gaps = 2/98 (2%)
Query: 336 PTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDL
Sbjct: 169 PNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDL 228
Query: 394 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 229 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 266
>gi|444732528|gb|ELW72818.1| One cut domain family member 2 [Tupaia chinensis]
Length = 231
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 81/85 (95%), Positives = 84/85 (98%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE
Sbjct: 144 SGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 203
Query: 409 TFRRMWKWLQEPEFQRMSALRLAGN 433
TFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 204 TFRRMWKWLQEPEFQRMSALRLAGK 228
>gi|296222711|ref|XP_002807552.1| PREDICTED: LOW QUALITY PROTEIN: one cut domain family member 2
[Callithrix jacchus]
Length = 505
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 325 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 384
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 385 GRETFRRMWKWLQEPEFQRMSALRLAA 411
>gi|395511560|ref|XP_003760026.1| PREDICTED: one cut domain family member 2 [Sarcophilus harrisii]
Length = 302
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 81/85 (95%), Positives = 84/85 (98%)
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 407
NSG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR
Sbjct: 124 NSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 183
Query: 408 ETFRRMWKWLQEPEFQRMSALRLAG 432
ETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 184 ETFRRMWKWLQEPEFQRMSALRLAA 208
>gi|326677507|ref|XP_001920308.3| PREDICTED: one cut domain family member 2-like [Danio rerio]
Length = 461
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 80/84 (95%), Positives = 83/84 (98%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE
Sbjct: 286 SGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 345
Query: 409 TFRRMWKWLQEPEFQRMSALRLAG 432
TFRRMWKWLQEPEFQRMSALRLA
Sbjct: 346 TFRRMWKWLQEPEFQRMSALRLAA 369
>gi|355778058|gb|EHH63094.1| Hepatocyte nuclear factor 6, partial [Macaca fascicularis]
Length = 311
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/99 (86%), Positives = 91/99 (91%), Gaps = 2/99 (2%)
Query: 336 PTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDL
Sbjct: 117 PNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDL 176
Query: 394 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 177 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 215
>gi|353229064|emb|CCD75235.1| putative one cut domain family member [Schistosoma mansoni]
Length = 1112
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 75/82 (91%), Positives = 79/82 (96%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E+EEINTK LA RIS+ELKRYSIPQA+FAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 496 EMEEINTKVLAHRISSELKRYSIPQAVFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETF 555
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWL EPEFQRMSALRLAG
Sbjct: 556 RRMWKWLHEPEFQRMSALRLAG 577
>gi|297702661|ref|XP_002828291.1| PREDICTED: one cut domain family member 2 [Pongo abelii]
Length = 472
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 292 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 351
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 352 GRETFRRMWKWLQEPEFQRMSALRLAA 378
>gi|119220568|ref|NP_919244.2| one cut domain family member 2 [Mus musculus]
gi|294862455|sp|Q6XBJ3.2|ONEC2_MOUSE RecName: Full=One cut domain family member 2; AltName: Full=One cut
homeobox 2; AltName: Full=Transcription factor ONECUT-2;
Short=OC-2
Length = 505
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 325 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 384
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 385 GRETFRRMWKWLQEPEFQRMSALRLAA 411
>gi|432889300|ref|XP_004075208.1| PREDICTED: one cut domain family member 2-like [Oryzias latipes]
Length = 263
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 82/87 (94%), Positives = 85/87 (97%), Gaps = 1/87 (1%)
Query: 347 GN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
GN SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 82 GNGSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 141
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 142 GRETFRRMWKWLQEPEFQRMSALRLAA 168
>gi|73909017|gb|AAI03669.1| Onecut2 protein [Mus musculus]
Length = 438
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 258 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 317
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 318 GRETFRRMWKWLQEPEFQRMSALRLAA 344
>gi|34558615|gb|AAP69907.1| transcription factor ONECUT2 [Mus musculus]
Length = 486
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 306 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 365
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 366 GRETFRRMWKWLQEPEFQRMSALRLAA 392
>gi|339247081|ref|XP_003375174.1| putative CUT domain protein [Trichinella spiralis]
gi|316971553|gb|EFV55311.1| putative CUT domain protein [Trichinella spiralis]
Length = 380
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 81/87 (93%), Positives = 85/87 (97%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
G++ +LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 279 GSANDLEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 338
Query: 407 RETFRRMWKWLQEPEFQRMSALRLAGN 433
RETFRRM KWLQEPEFQRMSALRLAG+
Sbjct: 339 RETFRRMLKWLQEPEFQRMSALRLAGS 365
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 46/71 (64%), Gaps = 12/71 (16%)
Query: 91 SVASIMDSSEFRNLHAEPTYQTL--------TNVNGRLSPPGYSPSSSYATLTPLQPLPP 142
++ S+ D+ +FR+ HA Y TL NV G ++P G+ S +YATLTPLQPLPP
Sbjct: 40 AMGSLHDAGDFRS-HAPAGYHTLNGRMSPSSVNVLGSVAPLGFG-SVTYATLTPLQPLPP 97
Query: 143 ISTMS--DKFA 151
IST++ DKF+
Sbjct: 98 ISTVTQIDKFS 108
>gi|351715220|gb|EHB18139.1| Hepatocyte nuclear factor 6 [Heterocephalus glaber]
Length = 258
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/99 (86%), Positives = 91/99 (91%), Gaps = 2/99 (2%)
Query: 336 PTNAVVGA-VSLG-NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
P +V GA VS G NSG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDL
Sbjct: 65 PNPSVTGAQVSNGSNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDL 124
Query: 394 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 125 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 163
>gi|119220564|ref|NP_004843.2| one cut domain family member 2 [Homo sapiens]
gi|294862454|sp|O95948.2|ONEC2_HUMAN RecName: Full=One cut domain family member 2; AltName:
Full=Hepatocyte nuclear factor 6-beta; Short=HNF-6-beta;
AltName: Full=One cut homeobox 2; AltName:
Full=Transcription factor ONECUT-2; Short=OC-2
gi|119583449|gb|EAW63045.1| one cut domain, family member 2 [Homo sapiens]
Length = 504
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 324 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 383
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 384 GRETFRRMWKWLQEPEFQRMSALRLAA 410
>gi|281337493|gb|EFB13077.1| hypothetical protein PANDA_009021 [Ailuropoda melanoleuca]
Length = 398
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 218 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 277
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 278 GRETFRRMWKWLQEPEFQRMSALRLAA 304
>gi|410977830|ref|XP_003995303.1| PREDICTED: one cut domain family member 2, partial [Felis catus]
Length = 412
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 232 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 291
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 292 GRETFRRMWKWLQEPEFQRMSALRLAA 318
>gi|344297814|ref|XP_003420591.1| PREDICTED: LOW QUALITY PROTEIN: hepatocyte nuclear factor 6-like
[Loxodonta africana]
Length = 352
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 84/105 (80%), Positives = 90/105 (85%), Gaps = 3/105 (2%)
Query: 331 LKTVTPTNAVVGAVSLGN---SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQ 387
L T +N V V + N SG++EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQ
Sbjct: 152 LGTTRESNPSVTGVQVSNGTNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQ 211
Query: 388 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 212 GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 256
>gi|194678259|ref|XP_001251176.2| PREDICTED: one cut domain family member 2 [Bos taurus]
Length = 428
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 248 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 307
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 308 GRETFRRMWKWLQEPEFQRMSALRLA 333
>gi|392354744|ref|XP_002728603.2| PREDICTED: one cut domain family member 2-like [Rattus norvegicus]
Length = 460
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 280 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 339
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 340 GRETFRRMWKWLQEPEFQRMSALRLAA 366
>gi|293343116|ref|XP_002725406.1| PREDICTED: one cut domain family member 2-like, partial [Rattus
norvegicus]
Length = 452
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 272 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 331
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 332 GRETFRRMWKWLQEPEFQRMSALRLAA 358
>gi|301769775|ref|XP_002920305.1| PREDICTED: one cut domain family member 2-like [Ailuropoda
melanoleuca]
Length = 396
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 216 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 275
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 276 GRETFRRMWKWLQEPEFQRMSALRLAA 302
>gi|348576826|ref|XP_003474187.1| PREDICTED: one cut domain family member 2-like [Cavia porcellus]
Length = 469
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 289 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 348
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 349 GRETFRRMWKWLQEPEFQRMSALRLA 374
>gi|402903185|ref|XP_003914458.1| PREDICTED: one cut domain family member 2-like [Papio anubis]
Length = 502
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 324 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 383
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 384 GRETFRRMWKWLQEPEFQRMSALRLA 409
>gi|4468941|emb|CAB38253.1| ONECUT-2 transcription factor (OC-2) [Homo sapiens]
Length = 485
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 305 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 364
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 365 GRETFRRMWKWLQEPEFQRMSALRLAA 391
>gi|256077272|ref|XP_002574931.1| one cut domain family member [Schistosoma mansoni]
Length = 1064
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 75/82 (91%), Positives = 79/82 (96%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E+EEINTK LA RIS+ELKRYSIPQA+FAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 496 EMEEINTKVLAHRISSELKRYSIPQAVFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETF 555
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWL EPEFQRMSALRLAG
Sbjct: 556 RRMWKWLHEPEFQRMSALRLAG 577
>gi|195172534|ref|XP_002027052.1| GL18146 [Drosophila persimilis]
gi|194112830|gb|EDW34873.1| GL18146 [Drosophila persimilis]
Length = 1018
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 81/92 (88%), Positives = 86/92 (93%), Gaps = 1/92 (1%)
Query: 342 GAVSLGNSG-ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G+ S SG ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 708 GSRSANASGSDMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 767
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 768 SKLKSGRETFRRMFKWLQEPEFQRMSALRMAA 799
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/44 (81%), Positives = 42/44 (95%), Gaps = 1/44 (2%)
Query: 109 TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY 152
+YQTLT+VN R+SPPG+SP+S YATLTP+QPLPPISTMSDKFAY
Sbjct: 315 SYQTLTSVNDRISPPGFSPTS-YATLTPIQPLPPISTMSDKFAY 357
>gi|198462296|ref|XP_001382225.2| GA15133 [Drosophila pseudoobscura pseudoobscura]
gi|198140047|gb|EAL29279.2| GA15133 [Drosophila pseudoobscura pseudoobscura]
Length = 1018
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 81/92 (88%), Positives = 86/92 (93%), Gaps = 1/92 (1%)
Query: 342 GAVSLGNSG-ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G+ S SG ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 708 GSRSANASGSDMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 767
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 768 SKLKSGRETFRRMFKWLQEPEFQRMSALRMAA 799
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/44 (81%), Positives = 42/44 (95%), Gaps = 1/44 (2%)
Query: 109 TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY 152
+YQTLT+VN R+SPPG+SP+S YATLTP+QPLPPISTMSDKFAY
Sbjct: 315 SYQTLTSVNDRISPPGFSPTS-YATLTPIQPLPPISTMSDKFAY 357
>gi|440905160|gb|ELR55581.1| One cut domain family member 2, partial [Bos grunniens mutus]
Length = 377
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 198 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 257
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 258 GRETFRRMWKWLQEPEFQRMSALRLA 283
>gi|301609860|ref|XP_002934489.1| PREDICTED: one cut domain family member 2-like [Xenopus (Silurana)
tropicalis]
Length = 407
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 79/83 (95%), Positives = 82/83 (98%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET
Sbjct: 287 GQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 346
Query: 410 FRRMWKWLQEPEFQRMSALRLAG 432
FRRMWKWLQEPEFQRMSALRLA
Sbjct: 347 FRRMWKWLQEPEFQRMSALRLAA 369
>gi|195450723|ref|XP_002072605.1| GK13692 [Drosophila willistoni]
gi|194168690|gb|EDW83591.1| GK13692 [Drosophila willistoni]
Length = 1032
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 78/84 (92%), Positives = 82/84 (97%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
S ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE
Sbjct: 733 SSDMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 792
Query: 409 TFRRMWKWLQEPEFQRMSALRLAG 432
TFRRM+KWLQEPEFQRMSALR+A
Sbjct: 793 TFRRMFKWLQEPEFQRMSALRMAA 816
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/53 (69%), Positives = 42/53 (79%), Gaps = 8/53 (15%)
Query: 109 TYQTLTNV------NGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GH 154
TYQTLT+ N R+SP G+SP+S YATLTP+QPLPPISTMSDKFAY GH
Sbjct: 402 TYQTLTSASVNDHDNDRISPTGFSPTS-YATLTPIQPLPPISTMSDKFAYAGH 453
>gi|431906952|gb|ELK11071.1| One cut domain family member 2 [Pteropus alecto]
Length = 395
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 215 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 274
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 275 GRETFRRMWKWLQEPEFQRMSALRLAA 301
>gi|297275354|ref|XP_001087035.2| PREDICTED: hypothetical protein LOC698537 [Macaca mulatta]
Length = 1140
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 960 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 1019
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 1020 GRETFRRMWKWLQEPEFQRMSALRLA 1045
>gi|194214699|ref|XP_001488942.2| PREDICTED: one cut domain family member 2 [Equus caballus]
Length = 372
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 192 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 251
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 252 GRETFRRMWKWLQEPEFQRMSALRLAA 278
>gi|426254093|ref|XP_004020720.1| PREDICTED: one cut domain family member 2, partial [Ovis aries]
Length = 368
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 188 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 247
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 248 GRETFRRMWKWLQEPEFQRMSALRLA 273
>gi|297489834|ref|XP_002697878.1| PREDICTED: one cut domain family member 2, partial [Bos taurus]
gi|296473801|tpg|DAA15916.1| TPA: one cut domain, family member 2-like [Bos taurus]
Length = 351
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 171 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 230
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 231 GRETFRRMWKWLQEPEFQRMSALRLA 256
>gi|72679554|gb|AAI00756.1| Onecut2 protein, partial [Mus musculus]
gi|72679721|gb|AAI00751.1| Onecut2 protein, partial [Mus musculus]
Length = 342
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 162 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 221
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 222 GRETFRRMWKWLQEPEFQRMSALRLAA 248
>gi|397514161|ref|XP_003827365.1| PREDICTED: one cut domain family member 2 [Pan paniscus]
Length = 487
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 307 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 366
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 367 GRETFRRMWKWLQEPEFQRMSALRLA 392
>gi|355755052|gb|EHH58919.1| Transcription factor ONECUT-2, partial [Macaca fascicularis]
Length = 281
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 101 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 160
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 161 GRETFRRMWKWLQEPEFQRMSALRLAA 187
>gi|148677762|gb|EDL09709.1| one cut domain, family member 2 [Mus musculus]
Length = 307
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|149064465|gb|EDM14668.1| rCG46748 [Rattus norvegicus]
Length = 307
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|355701964|gb|EHH29317.1| Transcription factor ONECUT-2, partial [Macaca mulatta]
Length = 309
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/88 (90%), Positives = 84/88 (95%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK
Sbjct: 128 KVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 187
Query: 405 SGRETFRRMWKWLQEPEFQRMSALRLAG 432
SGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 188 SGRETFRRMWKWLQEPEFQRMSALRLAA 215
>gi|426386060|ref|XP_004059512.1| PREDICTED: one cut domain family member 2 [Gorilla gorilla gorilla]
Length = 307
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|351704026|gb|EHB06945.1| One cut domain family member 2 [Heterocephalus glaber]
Length = 307
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|2224905|gb|AAB61705.1| hepatocyte nuclear factor 6 [Homo sapiens]
Length = 292
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 85/99 (85%), Positives = 90/99 (90%), Gaps = 2/99 (2%)
Query: 336 PTNAVVGA-VSLGN-SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
P +V GA VS GN SG +EEINTKE+AQRI+ ELKRYSIPQAIFAQ+VLCRSQGTLSDL
Sbjct: 98 PNPSVTGAQVSNGNNSGHMEEINTKEVAQRITTELKRYSIPQAIFAQKVLCRSQGTLSDL 157
Query: 394 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 158 LRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 196
>gi|432101405|gb|ELK29587.1| One cut domain family member 2 [Myotis davidii]
Length = 307
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|73945471|ref|XP_541087.2| PREDICTED: one cut domain family member 2 isoform 1 [Canis lupus
familiaris]
Length = 307
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/87 (91%), Positives = 84/87 (96%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 127 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 186
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLAG 432
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 187 GRETFRRMWKWLQEPEFQRMSALRLAA 213
>gi|358333598|dbj|GAA37091.2| hepatocyte nuclear factor 6 [Clonorchis sinensis]
Length = 1091
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 77/82 (93%), Positives = 80/82 (97%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E+EEINTK LAQRISAELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 491 EMEEINTKALAQRISAELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 550
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWLQEPEFQRMS+LRLA
Sbjct: 551 RRMWKWLQEPEFQRMSSLRLAA 572
>gi|195469367|ref|XP_002099609.1| GE14509 [Drosophila yakuba]
gi|194185710|gb|EDW99321.1| GE14509 [Drosophila yakuba]
Length = 1084
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 77/86 (89%), Positives = 83/86 (96%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
N+ ++EEINTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 755 ANASDMEEINTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 814
Query: 407 RETFRRMWKWLQEPEFQRMSALRLAG 432
RETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 815 RETFRRMFKWLQEPEFQRMSALRMAA 840
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/122 (46%), Positives = 71/122 (58%), Gaps = 5/122 (4%)
Query: 55 ESIKTEGLTVIVAPQD-DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
ES E L+VIV QD D D E G + DS L + +YQTL
Sbjct: 266 ESAHGEVLSVIVHSQDSDKEDGEENGDGDAEGDADNEDDDE-RDSRSREQLLSHSSYQTL 324
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GH-TGNVTGSFTVMQNNIGL 171
T+VN RLS PG+S +SYATLTP+QPLPPISTMS+KFAY GH +G +G V ++ G
Sbjct: 325 TSVNDRLSSPGFS-QTSYATLTPIQPLPPISTMSEKFAYSGHISGGDSGDTDVNGDDGGR 383
Query: 172 GI 173
G+
Sbjct: 384 GV 385
>gi|344269846|ref|XP_003406758.1| PREDICTED: one cut domain family member 2-like [Loxodonta africana]
Length = 502
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 322 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 381
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 382 GRETFRRMWKWLQEPEFQRMSALRLA 407
>gi|332850176|ref|XP_523935.3| PREDICTED: one cut domain family member 2 [Pan troglodytes]
Length = 507
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 327 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 386
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 387 GRETFRRMWKWLQEPEFQRMSALRLA 412
>gi|195133982|ref|XP_002011417.1| GI14052 [Drosophila mojavensis]
gi|193912040|gb|EDW10907.1| GI14052 [Drosophila mojavensis]
Length = 1059
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 77/82 (93%), Positives = 81/82 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 707 DMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 766
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRM+KWLQEPEFQRMSALR+A
Sbjct: 767 RRMFKWLQEPEFQRMSALRMAA 788
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/49 (75%), Positives = 42/49 (85%), Gaps = 4/49 (8%)
Query: 109 TYQTLTNVNG-RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY--GH 154
TYQTLT+VN R+S P +SP+S YATLTP+QPLPPISTMSDKFAY GH
Sbjct: 326 TYQTLTSVNNQRISEPSFSPTS-YATLTPIQPLPPISTMSDKFAYAGGH 373
>gi|449514858|ref|XP_002186674.2| PREDICTED: one cut domain family member 2 [Taeniopygia guttata]
Length = 539
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 79/84 (94%), Positives = 83/84 (98%)
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 407
+G+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR
Sbjct: 361 GAGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 420
Query: 408 ETFRRMWKWLQEPEFQRMSALRLA 431
ETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 421 ETFRRMWKWLQEPEFQRMSALRLA 444
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 70/144 (48%), Gaps = 24/144 (16%)
Query: 79 SPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQ 138
SPG++ P A+ E+R + P + ++ V SPPG S +Y TLTPLQ
Sbjct: 144 SPGQLPPP------ANGGGGGEYRPERSIPLHHAMS-VPCEASPPGMGMSGTYTTLTPLQ 196
Query: 139 PLPPISTMSDKFAYGHT---------------GNVTGSFTVMQNNIGL-GINSHYTYDKI 182
PLPPIS +SDKF + H G+ G FT+M++ GL +NS Y K
Sbjct: 197 PLPPISAVSDKFHHPHAHPHAHHHHHHHQRLPGSAGGGFTLMRDERGLPAVNSLYGPYKE 256
Query: 183 QMSPPQHYSP-NNGLSTIVITQEN 205
+ Q SP NGL + +Q+
Sbjct: 257 VPAVGQSLSPLGNGLGPLPGSQQG 280
>gi|403268195|ref|XP_003945193.1| PREDICTED: LOW QUALITY PROTEIN: one cut domain family member 2
[Saimiri boliviensis boliviensis]
Length = 498
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 80/86 (93%), Positives = 84/86 (97%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
+ SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS
Sbjct: 318 VATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 377
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
GRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 378 GRETFRRMWKWLQEPEFQRMSALRLA 403
>gi|347963081|ref|XP_001237363.3| AGAP000061-PA [Anopheles gambiae str. PEST]
gi|333467368|gb|EAU77345.3| AGAP000061-PA [Anopheles gambiae str. PEST]
Length = 673
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 81/95 (85%), Positives = 88/95 (92%), Gaps = 4/95 (4%)
Query: 343 AVSLGN----SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
VSLG+ +G+LEEINTKELA RIS+ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 452 CVSLGDVGCLNGDLEEINTKELAHRISSELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 511
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAGN 433
PWSKLKSGRETFRRM+KW+QEPE+QRMSALRLA N
Sbjct: 512 PWSKLKSGRETFRRMYKWIQEPEYQRMSALRLAVN 546
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 80/148 (54%), Gaps = 25/148 (16%)
Query: 58 KTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVN 117
K + L+VI+ PQ+ S + G +P SV S+M S +QTL+ ++
Sbjct: 72 KQDALSVIIHPQE-SMQQINMCLGSQNPG----SVDSMMSSD----------FQTLSPMH 116
Query: 118 GRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQN---NIGLGIN 174
R SP + ++SYATLTPLQPLPPISTMSDKFAY GN+ +F + + + + IN
Sbjct: 117 DRDSPVSLNTNTSYATLTPLQPLPPISTMSDKFAYSIGGNICEAFPGINSGAVGVNIEIN 176
Query: 175 SHYTYDK-------IQMSPPQHYSPNNG 195
S Y +K + +S Q SP+ G
Sbjct: 177 SCYNLEKLGVIHSPVHLSAVQELSPDGG 204
>gi|195402213|ref|XP_002059701.1| GJ19930 [Drosophila virilis]
gi|194155915|gb|EDW71099.1| GJ19930 [Drosophila virilis]
Length = 1074
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 77/82 (93%), Positives = 81/82 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 765 DMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 824
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRM+KWLQEPEFQRMSALR+A
Sbjct: 825 RRMFKWLQEPEFQRMSALRMAA 846
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/47 (80%), Positives = 43/47 (91%), Gaps = 2/47 (4%)
Query: 109 TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GH 154
TYQTLT+VN R+S PG+SP+S YATLTP+QPLPPISTMSDKFAY GH
Sbjct: 329 TYQTLTSVNNRISEPGFSPTS-YATLTPIQPLPPISTMSDKFAYVGH 374
>gi|263359692|gb|ACY70528.1| hypothetical protein DVIR88_6g0065 [Drosophila virilis]
Length = 1076
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 77/82 (93%), Positives = 81/82 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 767 DMEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 826
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRM+KWLQEPEFQRMSALR+A
Sbjct: 827 RRMFKWLQEPEFQRMSALRMAA 848
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/47 (80%), Positives = 43/47 (91%), Gaps = 2/47 (4%)
Query: 109 TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAY-GH 154
TYQTLT+VN R+S PG+SP+S YATLTP+QPLPPISTMSDKFAY GH
Sbjct: 329 TYQTLTSVNNRISEPGFSPTS-YATLTPIQPLPPISTMSDKFAYVGH 374
>gi|443731149|gb|ELU16386.1| hypothetical protein CAPTEDRAFT_52697, partial [Capitella teleta]
Length = 246
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 79/91 (86%), Positives = 85/91 (93%)
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
G ++ ELEEINTK+LAQ+IS+ELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 76 GQPTVSGKQELEEINTKDLAQKISSELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPWS 135
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
KLKSGRETFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 136 KLKSGRETFRRMWKWLQEPEFQRMSALRLAG 166
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/56 (62%), Positives = 42/56 (75%), Gaps = 3/56 (5%)
Query: 130 SYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN-IGLGIN--SHYTYDKI 182
SYATLTPLQPLPPIST+SDKF + NV+GSFT MQNN + + +N Y YDK+
Sbjct: 1 SYATLTPLQPLPPISTVSDKFGASASPNVSGSFTFMQNNSLTMDMNPAGAYRYDKL 56
>gi|347963083|ref|XP_003436903.1| AGAP000061-PB [Anopheles gambiae str. PEST]
gi|333467369|gb|EGK96543.1| AGAP000061-PB [Anopheles gambiae str. PEST]
Length = 583
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 80/94 (85%), Positives = 87/94 (92%), Gaps = 4/94 (4%)
Query: 343 AVSLGN----SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
VSLG+ +G+LEEINTKELA RIS+ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK
Sbjct: 452 CVSLGDVGCLNGDLEEINTKELAHRISSELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 511
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PWSKLKSGRETFRRM+KW+QEPE+QRMSALRLA
Sbjct: 512 PWSKLKSGRETFRRMYKWIQEPEYQRMSALRLAA 545
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 80/148 (54%), Gaps = 25/148 (16%)
Query: 58 KTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVN 117
K + L+VI+ PQ+ + + G +P SV S+M S +QTL+ ++
Sbjct: 72 KQDALSVIIHPQESMQQIN-MCLGSQNPG----SVDSMMSSD----------FQTLSPMH 116
Query: 118 GRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQN---NIGLGIN 174
R SP + ++SYATLTPLQPLPPISTMSDKFAY GN+ +F + + + + IN
Sbjct: 117 DRDSPVSLNTNTSYATLTPLQPLPPISTMSDKFAYSIGGNICEAFPGINSGAVGVNIEIN 176
Query: 175 SHYTYDK-------IQMSPPQHYSPNNG 195
S Y +K + +S Q SP+ G
Sbjct: 177 SCYNLEKLGVIHSPVHLSAVQELSPDGG 204
>gi|449279578|gb|EMC87150.1| One cut domain family member 2, partial [Columba livia]
Length = 271
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 78/86 (90%), Positives = 84/86 (97%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
G SG++EE+NTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 117 GGSGQVEELNTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSG 176
Query: 407 RETFRRMWKWLQEPEFQRMSALRLAG 432
RETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 177 RETFRRMWKWLQEPEFQRMSALRLAA 202
>gi|312378316|gb|EFR24929.1| hypothetical protein AND_10175 [Anopheles darlingi]
Length = 705
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 78/92 (84%), Positives = 86/92 (93%), Gaps = 4/92 (4%)
Query: 341 VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
VG ++ G++EEINTKELA RIS+ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 519 VGCIN----GDIEEINTKELAHRISSELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 574
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRM+KW+QEPE+QRMSALRLAG
Sbjct: 575 SKLKSGRETFRRMYKWIQEPEYQRMSALRLAG 606
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 79/144 (54%), Gaps = 19/144 (13%)
Query: 58 KTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVN 117
K + L+VI+ PQ+ S + G +P SV S+M S +QTL+ ++
Sbjct: 72 KQDALSVIIHPQE-SMQQLNMCLGSQNPG----SVDSMMSSD----------FQTLSPMH 116
Query: 118 GRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQN---NIGLGIN 174
R SP S S+SYATLTPLQPLPPISTMSDKFAY GN+ +F + + + + IN
Sbjct: 117 DRDSPVSLSTSTSYATLTPLQPLPPISTMSDKFAYSLGGNICEAFPGINSGSVGVNIEIN 176
Query: 175 SHYTYDKIQ-MSPPQHYSPNNGLS 197
S Y +K+ + P H S LS
Sbjct: 177 SCYNLEKLGVIHSPVHLSAVQELS 200
>gi|410931521|ref|XP_003979144.1| PREDICTED: hepatocyte nuclear factor 6-like, partial [Takifugu
rubripes]
Length = 358
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 76/82 (92%), Positives = 80/82 (97%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EE+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 274 MEEVNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 333
Query: 412 RMWKWLQEPEFQRMSALRLAGN 433
RMWKWLQEPEFQRMSALRLAG
Sbjct: 334 RMWKWLQEPEFQRMSALRLAGE 355
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 90/176 (51%), Gaps = 34/176 (19%)
Query: 84 SPSGI---SVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPL 140
+P G+ ++ +A+++DS ++ H + ++ +PPG S SS+Y TLTPLQPL
Sbjct: 38 TPRGLGHRTMGMATLLDSGDYHPGHPGHLHPAISMCE---APPGMSASSTYTTLTPLQPL 94
Query: 141 PPISTMSDKFAYGHT------------------GNVTGSFTVMQNNIGLG-INSHY-TYD 180
PPIST+SDKF H GNV+GSFT+M+ + L +NS Y Y
Sbjct: 95 PPISTVSDKFPPHHHHHHHPHHPHHPHPHQRIPGNVSGSFTLMREDRSLAPMNSLYPAYH 154
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDS 233
Q SP +GL++I +Q P Y+ G P +K L+P+ +++
Sbjct: 155 HKDPCMGQSLSPLSGSGLASIHTSQAGIP-----PYAHPGAAIPGEKMLTPSGFEA 205
>gi|47551283|ref|NP_999824.1| ONECUT transcription factor HNF6 beta [Strongylocentrotus
purpuratus]
gi|39546182|gb|AAR28054.1| onecut [Strongylocentrotus purpuratus]
Length = 483
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 78/92 (84%), Positives = 86/92 (93%), Gaps = 1/92 (1%)
Query: 342 GAVSLGNSGEL-EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G+VS + G+ EEINTKE+A R+++ELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 267 GSVSSRDGGQAGEEINTKEIAARVTSELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPW 326
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 327 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 358
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 37/55 (67%), Gaps = 6/55 (10%)
Query: 128 SSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKI 182
S SY TLTPLQ LPPI+T++DK H NV+ F++M N+ L +N+ YDK+
Sbjct: 103 SGSYTTLTPLQSLPPINTVADKLR--HDTNVSTGFSLM-NSSNLAMNN---YDKL 151
>gi|157128226|ref|XP_001661353.1| homeobox protein onecut [Aedes aegypti]
gi|108882240|gb|EAT46465.1| AAEL002359-PA [Aedes aegypti]
Length = 648
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 79/93 (84%), Positives = 86/93 (92%), Gaps = 1/93 (1%)
Query: 341 VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
+G V N G++EEINTKELA RIS+ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 397 MGDVGCMN-GDVEEINTKELAHRISSELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 455
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAGN 433
SKLKSGRETFRRM+KW+QEPE+QRMSALRLA N
Sbjct: 456 SKLKSGRETFRRMYKWIQEPEYQRMSALRLAVN 488
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 74/128 (57%), Gaps = 17/128 (13%)
Query: 58 KTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVN 117
K + L+VI+ PQ+ + + +S SG SV SIM S +QTL+ ++
Sbjct: 90 KQDALSVIIHPQEGMQQINMC----LSGSGNPGSVDSIMSSD----------FQTLSPMH 135
Query: 118 GRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN---IGLGIN 174
R SP S ++SYATLTPLQPLPPISTMSDKFAY GN+ F M N+ + + IN
Sbjct: 136 DRDSPVSISANTSYATLTPLQPLPPISTMSDKFAYSIGGNICEPFPGMNNSSVGVNIEIN 195
Query: 175 SHYTYDKI 182
S Y +K+
Sbjct: 196 SCYNLEKL 203
>gi|348527698|ref|XP_003451356.1| PREDICTED: one cut domain family member 2-like [Oreochromis
niloticus]
Length = 443
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 268 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 327
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 328 MWKWLQEPEFQRMSALRLAA 347
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 84/164 (51%), Gaps = 40/164 (24%)
Query: 67 APQDDSRDSEILSPGKISPSGISVSVASIMDSS-EFRNLHAEPTYQTLTNVNGRLSPP-- 123
A Q + ++S G+ S + S+ASI++ + ++R HA ++G L P
Sbjct: 29 ARQSSASHRNLVSHGR---SAMVSSMASILEGAGDYRPDHA---------LSGPLHPAMT 76
Query: 124 -----GYSPSSSYATLTPLQPLPPISTMSDKFAYGH-------------TGNVTGSFTVM 165
G S SS+Y TLTPLQ LPPIST+S+KF + H GNV+GSFT+M
Sbjct: 77 MSCDSGMSLSSTYTTLTPLQHLPPISTVSEKFHHPHPHAHHHPAHQRLAAGNVSGSFTLM 136
Query: 166 QNNIGLG----INSHYTYDKIQMSPPQHYSP-NNGLSTIVITQE 204
+++ L + HY D M P SP +NGL ++ +Q+
Sbjct: 137 RDDRSLASMSNLYGHYPKDMSGMGQP--LSPLSNGLGSLHSSQQ 178
>gi|34733341|gb|AAQ81630.1| HNF6 beta [Strongylocentrotus purpuratus]
Length = 483
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 78/92 (84%), Positives = 86/92 (93%), Gaps = 1/92 (1%)
Query: 342 GAVSLGNSGEL-EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G+VS + G+ EEINTKE+A R+++ELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 267 GSVSSRDGGQAGEEINTKEIAARVTSELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPW 326
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG
Sbjct: 327 SKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 358
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 37/55 (67%), Gaps = 6/55 (10%)
Query: 128 SSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKI 182
S SY TLTPLQ LPPI+T++DK H NV+ F++M N+ L +N+ YDK+
Sbjct: 103 SGSYTTLTPLQSLPPINTVADKLR--HDTNVSTGFSLM-NSSNLAMNN---YDKL 151
>gi|194770207|ref|XP_001967188.1| GF15916 [Drosophila ananassae]
gi|190619334|gb|EDV34858.1| GF15916 [Drosophila ananassae]
Length = 448
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 79/96 (82%), Positives = 85/96 (88%)
Query: 337 TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRN 396
+ A G+ S ++EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLL N
Sbjct: 112 STACGGSSQSAVSSDVEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLHN 171
Query: 397 PKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PKPWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 172 PKPWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 207
>gi|292611171|ref|XP_002661005.1| PREDICTED: one cut domain family member 2-like [Danio rerio]
Length = 441
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 80/80 (100%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 265 VEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFR 324
Query: 412 RMWKWLQEPEFQRMSALRLA 431
RMWKWLQEPEFQRMSALRLA
Sbjct: 325 RMWKWLQEPEFQRMSALRLA 344
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 80/164 (48%), Gaps = 31/164 (18%)
Query: 94 SIMDSSEFRNLHAEPTYQTLTNVNGRLSPP------GYSPSSSYATLTPLQPLPPISTMS 147
S M SS L Y++ ++G L P G S S++Y TLTPLQ LPPIST+S
Sbjct: 45 SAMVSSMASILEGAGDYRSDPALSGHLHPAMTMCESGMSLSNTYTTLTPLQHLPPISTVS 104
Query: 148 DKFAYGH-------------TGNVTGSFTVMQNNIGLG----INSHYTYDKIQMSPPQHY 190
DKF + H TGNV+GSFT+M+++ GL + HY + M P
Sbjct: 105 DKFHHAHPHPHHHAAHQRLATGNVSGSFTLMRDDRGLASMGNLYGHYPKEMSGMGQP--L 162
Query: 191 SP-NNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDS 233
SP +NGL ++ +Q+ S Y K LSP ++S
Sbjct: 163 SPLSNGLGSLHNSQQ-----ALSAYGPGAHIPNDKMLSPGGFES 201
>gi|339263738|ref|XP_003367004.1| putative One cut domain family member 2 [Trichinella spiralis]
gi|316954864|gb|EFV46387.1| putative One cut domain family member 2 [Trichinella spiralis]
Length = 166
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/89 (91%), Positives = 83/89 (93%)
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
GA S G + LEEINTKELAQRISAEL+RYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS
Sbjct: 24 GAASNGIAYYLEEINTKELAQRISAELRRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 83
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRL 430
KLKSGRETFRRM KWLQEPEFQRMSALRL
Sbjct: 84 KLKSGRETFRRMLKWLQEPEFQRMSALRL 112
>gi|357607758|gb|EHJ65677.1| putative One cut domain family member 2 [Danaus plexippus]
Length = 365
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/80 (97%), Positives = 78/80 (97%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKELAQRIS ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 162 EEINTKELAQRISGELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 221
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 222 MWKWLQEPEFQRMSALRLAA 241
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/181 (38%), Positives = 91/181 (50%), Gaps = 58/181 (32%)
Query: 52 DDNESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQ 111
DD + LTVIVAP + +SP ++SP+ + + D + +
Sbjct: 2 DDTRLRERAPLTVIVAPSN-------VSPPRLSPADL------LPD--------GDAAFH 40
Query: 112 TLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGL 171
L+ VNGRL+PPG P+S YATLTPL PLPPIST+SDKFAY H G G+FTV+Q
Sbjct: 41 PLSAVNGRLTPPGLEPAS-YATLTPLLPLPPISTVSDKFAY-HAG---GTFTVIQQQ--- 92
Query: 172 GINSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSY 231
Q Y+ ++ T N PLSPQS YS ++S SP SY
Sbjct: 93 ----------------QSYA------SLSPTAYNEPLSPQSAYS-------RRSASPGSY 123
Query: 232 D 232
+
Sbjct: 124 E 124
>gi|432852700|ref|XP_004067341.1| PREDICTED: hepatocyte nuclear factor 6-like [Oryzias latipes]
Length = 449
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 75/81 (92%), Positives = 80/81 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EE+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 274 QMEEVNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 333
Query: 411 RRMWKWLQEPEFQRMSALRLA 431
RRMWKWLQEPEFQRMSALRLA
Sbjct: 334 RRMWKWLQEPEFQRMSALRLA 354
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 91/178 (51%), Gaps = 34/178 (19%)
Query: 84 SPSGIS---VSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPL 140
SP G+S + +A+++DS ++ H + ++ +PPG S S++Y TLTPLQPL
Sbjct: 38 SPRGLSHRAMGMATLLDSGDYHPSHHGHLHPAISMCE---APPGMSASTTYTTLTPLQPL 94
Query: 141 PPISTMSDKFAYGHT------------------GNVTGSFTVMQNNIGLG-INSHY-TYD 180
PPIST+SDKF H GNV+GSFT+M+ + L +NS Y Y
Sbjct: 95 PPISTVSDKFPPHHHHHHHHPHHPHPHSHQRIPGNVSGSFTLMREDRSLAPMNSLYPAYH 154
Query: 181 KIQMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPY 235
Q SP +GL++I +Q P Y+ G P +K L+P+ +++ +
Sbjct: 155 HKDPCMGQSLSPLSGSGLASIHTSQAGIP-----PYAHPGAAMPGEKMLTPSGFEAHH 207
>gi|363743976|ref|XP_003642951.1| PREDICTED: one cut domain family member 2-like [Gallus gallus]
Length = 453
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/82 (95%), Positives = 81/82 (98%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
+LEEINTKE+AQRI+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 278 QLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 337
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWLQEPEFQRMSALRLA
Sbjct: 338 RRMWKWLQEPEFQRMSALRLAA 359
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 57/101 (56%), Gaps = 16/101 (15%)
Query: 92 VASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFA 151
+A ++D +EFR + P + ++ V SPPG SS+Y TLTPLQPLPPIS +SDKF
Sbjct: 82 MAPLLDGAEFRPELSVPLHHAMS-VPCEPSPPGMGMSSTYTTLTPLQPLPPISAVSDKFH 140
Query: 152 YGH--------------TGNVTGSFTVMQNNIGL-GINSHY 177
+ H GNV G F +M++ GL +N+ Y
Sbjct: 141 HPHPHPHHHHHHHHQRLAGNVGGGFALMRDERGLPAVNNLY 181
>gi|148699593|gb|EDL31540.1| one cut domain, family member 3 [Mus musculus]
Length = 489
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/106 (79%), Positives = 87/106 (82%), Gaps = 2/106 (1%)
Query: 329 LTLKTVTPTNAVVGA--VSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRS 386
LT V P AV A G EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRS
Sbjct: 289 LTQAAVAPEGAVALARHGGPGAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRS 348
Query: 387 QGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
QGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 349 QGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAA 394
>gi|348500418|ref|XP_003437770.1| PREDICTED: hepatocyte nuclear factor 6-like [Oreochromis niloticus]
Length = 448
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 75/80 (93%), Positives = 79/80 (98%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EE+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 273 MEEVNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 332
Query: 412 RMWKWLQEPEFQRMSALRLA 431
RMWKWLQEPEFQRMSALRLA
Sbjct: 333 RMWKWLQEPEFQRMSALRLA 352
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 92/176 (52%), Gaps = 32/176 (18%)
Query: 84 SPSGIS---VSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPL 140
SP G+S + +A+++D+ ++ H+ + ++ +PPG S SS+Y TLTPLQPL
Sbjct: 38 SPRGLSHRAMGMATLLDTGDYHPSHSGHLHPAISMCE---APPGMSASSTYTTLTPLQPL 94
Query: 141 PPISTMSDKFAYGHT----------------GNVTGSFTVMQNNIGLG-INSHY-TYDKI 182
PPIST+SDKF H GNV+GSFT+M+ + L +NS Y Y
Sbjct: 95 PPISTVSDKFPPHHHHHHPHHPHPHPHQRIPGNVSGSFTLMREDRSLAPMNSLYPAYHHK 154
Query: 183 QMSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDSPY 235
Q SP +GL++I +Q P Y+ G P +K L+P+ +++ +
Sbjct: 155 DPCMGQSLSPLSGSGLASIHTSQAGIP-----PYAHPGAAMPGEKMLTPSGFEAHH 205
>gi|410912294|ref|XP_003969625.1| PREDICTED: uncharacterized protein LOC101077949 [Takifugu rubripes]
Length = 378
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 76/83 (91%), Positives = 81/83 (97%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G++EE+NTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET
Sbjct: 201 GQMEEVNTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 260
Query: 410 FRRMWKWLQEPEFQRMSALRLAG 432
FRRMWKWLQEPEFQRMSALRLA
Sbjct: 261 FRRMWKWLQEPEFQRMSALRLAA 283
>gi|322366551|gb|ADW95349.1| Onecut/Hnf6 [Paracentrotus lividus]
Length = 465
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 77/91 (84%), Positives = 86/91 (94%), Gaps = 1/91 (1%)
Query: 342 GAVSLGNSGEL-EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G+VS ++G+ EEINTKE+A R+++ELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPW
Sbjct: 261 GSVSSRDAGQAGEEINTKEIAARVTSELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPW 320
Query: 401 SKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
SKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 321 SKLKSGRETFRRMWKWLQEPEFQRMSALRLA 351
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 38/55 (69%), Gaps = 6/55 (10%)
Query: 128 SSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKI 182
S SY TLTPLQPLPPI+T++DK H NV+ F++M N+ L +N+ YDK+
Sbjct: 103 SGSYTTLTPLQPLPPINTVADKLR--HDTNVSTGFSLM-NSSNLAMNN---YDKL 151
>gi|363743730|ref|XP_003642905.1| PREDICTED: one cut domain family member 2-like [Gallus gallus]
Length = 441
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 76/81 (93%), Positives = 80/81 (98%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 265 VEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFR 324
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RMWKWLQEPEFQRMSALRLA
Sbjct: 325 RMWKWLQEPEFQRMSALRLAA 345
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 95/192 (49%), Gaps = 38/192 (19%)
Query: 72 SRDSEILSPGK--------ISPSGISVSV---ASIMDSSEFRNLH--AEPTYQTLTNVNG 118
S+ +E+LSP + P G V ASI++ ++R H A P + ++
Sbjct: 17 SQPAELLSPAHGRQSSHRNLVPHGRPAMVSGMASILEGGDYRAEHSLAAPLHPAMSMSCE 76
Query: 119 RLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH-------------TGNVTGSFTVM 165
SP G S SS+Y TLTPLQ LPPIST+S+KF + H GNV+GSFT+M
Sbjct: 77 --SPSGMSLSSTYTTLTPLQHLPPISTVSEKFHHPHHHHHHHHHPHQRLAGNVSGSFTLM 134
Query: 166 QNNIGLG----INSHYTYDKIQMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNS 221
++ L + SHY D M P P NGL ++ Q+ PL+P Y G
Sbjct: 135 RDERSLASMGNLYSHYPKDMPAMGQPLSPLP-NGLGSLHNAQQ--PLTP---YGPGGHLP 188
Query: 222 PQKSLSPNSYDS 233
K LSPN +DS
Sbjct: 189 NDKMLSPNGFDS 200
>gi|126323585|ref|XP_001371302.1| PREDICTED: hypothetical protein LOC100017888 [Monodelphis
domestica]
Length = 476
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 80/80 (100%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 298 VEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFR 357
Query: 412 RMWKWLQEPEFQRMSALRLA 431
RMWKWLQEPEFQRMSALRLA
Sbjct: 358 RMWKWLQEPEFQRMSALRLA 377
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 64/141 (45%), Gaps = 37/141 (26%)
Query: 81 GKISPSGISVSVASIMDS--SEFRNLHAEPTYQTLTNVNGRLSP-PGYSPSSSYATLTPL 137
G P+ +S VAS+++S +++R A P + ++ SP P S S SY TLTPL
Sbjct: 37 GPARPAMVS-GVASLLESGAADYRAELAAPLHPAMSLACD--SPGPALSLSGSYTTLTPL 93
Query: 138 QPLPPISTMSDKFAYGHT---------------------------GNVTGSFTVMQNNIG 170
Q LPPIS +SDKF NV+GSFT+M+++ G
Sbjct: 94 QHLPPISAVSDKFHPHPPPHPHHHPHPHHPAGPPPHGAHPHQRLPANVSGSFTLMRDDRG 153
Query: 171 LG----INSHYTYDKIQMSPP 187
L + HY D M P
Sbjct: 154 LASVGNLYGHYAKDVPAMGQP 174
>gi|122937424|ref|NP_001073957.1| one cut domain family member 3 [Homo sapiens]
gi|122064635|sp|O60422.2|ONEC3_HUMAN RecName: Full=One cut domain family member 3; AltName: Full=One cut
homeobox 3; AltName: Full=Transcription factor ONECUT-3;
Short=OC-3
Length = 494
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 319 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 378
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 379 MWKWLQEPEFQRMSALRLA 397
>gi|392349224|ref|XP_003750326.1| PREDICTED: one cut domain family member 3-like [Rattus norvegicus]
Length = 491
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 317 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 376
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 377 MWKWLQEPEFQRMSALRLAA 396
>gi|395831317|ref|XP_003788749.1| PREDICTED: one cut domain family member 3 [Otolemur garnettii]
Length = 493
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 318 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 377
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 378 MWKWLQEPEFQRMSALRLA 396
>gi|40538836|ref|NP_631972.2| one cut domain family member 3 [Mus musculus]
gi|122064636|sp|Q8K557.2|ONEC3_MOUSE RecName: Full=One cut domain family member 3; AltName: Full=One cut
homeobox 3; AltName: Full=Transcription factor ONECUT-3;
Short=OC-3
gi|38614222|gb|AAH58700.1| One cut domain, family member 3 [Mus musculus]
Length = 490
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 316 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 375
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 376 MWKWLQEPEFQRMSALRLAA 395
>gi|20373113|gb|AAL86921.1| onecut 3 [Mus musculus]
Length = 490
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 316 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 375
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 376 MWKWLQEPEFQRMSALRLAA 395
>gi|395513243|ref|XP_003760837.1| PREDICTED: homeobox protein cut-like 2-like [Sarcophilus harrisii]
Length = 207
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 77/81 (95%), Positives = 81/81 (100%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 115 VEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFR 174
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RMWKWLQEPEFQRMSALRLAG
Sbjct: 175 RMWKWLQEPEFQRMSALRLAG 195
>gi|392341355|ref|XP_003754319.1| PREDICTED: one cut domain family member 3-like [Rattus norvegicus]
Length = 320
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 146 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 205
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 206 MWKWLQEPEFQRMSALRLAA 225
>gi|119589870|gb|EAW69464.1| hCG2040174 [Homo sapiens]
Length = 493
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 318 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 377
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 378 MWKWLQEPEFQRMSALRLA 396
>gi|426386521|ref|XP_004059732.1| PREDICTED: uncharacterized protein LOC101132668 [Gorilla gorilla
gorilla]
Length = 844
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 669 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 728
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 729 MWKWLQEPEFQRMSALRLA 747
>gi|312089948|ref|XP_003146433.1| hypothetical protein LOAG_10861 [Loa loa]
Length = 331
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 84/118 (71%), Positives = 95/118 (80%), Gaps = 3/118 (2%)
Query: 317 GPPKIISQQT-ATLTLKTVTPTNA--VVGAVSLGNSGELEEINTKELAQRISAELKRYSI 373
GP I+ +T T+ +P A + A S + +++E+NTK+LAQRISAELKRYSI
Sbjct: 214 GPSDILDTETMETVKRNGSSPLQADTISHAGSPDSGSDMDELNTKDLAQRISAELKRYSI 273
Query: 374 PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM KWLQEPEFQRMSALRLA
Sbjct: 274 PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMAKWLQEPEFQRMSALRLA 331
>gi|256072911|ref|XP_002572777.1| one cut domain family member [Schistosoma mansoni]
gi|360043057|emb|CCD78469.1| putative one cut domain family member [Schistosoma mansoni]
Length = 761
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 78/102 (76%), Positives = 86/102 (84%), Gaps = 4/102 (3%)
Query: 334 VTPTNAVVGAVSLGNS----GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 389
V+P V G+ GN ++ E+NTK+LAQRISAELKRYSIPQA+FAQRVLCRSQGT
Sbjct: 300 VSPIPQVDGSDQEGNDQSLPDDMSELNTKDLAQRISAELKRYSIPQAVFAQRVLCRSQGT 359
Query: 390 LSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
LSDLLRNPKPWSKLKSGRETFRRMW WL EPE+QRMSALRLA
Sbjct: 360 LSDLLRNPKPWSKLKSGRETFRRMWNWLNEPEYQRMSALRLA 401
>gi|170571398|ref|XP_001891713.1| CUT domain containing protein [Brugia malayi]
gi|158603631|gb|EDP39485.1| CUT domain containing protein [Brugia malayi]
Length = 197
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/167 (56%), Positives = 109/167 (65%), Gaps = 32/167 (19%)
Query: 274 QGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPP-KIISQQTATLTLK 332
Q AAT N TT+TP LQ P PP ++ +
Sbjct: 54 QTAATTTAFNQYTTLTP-------------------LQ----PLPPISTVTNSSVKFNRS 90
Query: 333 TVTPTNAV--VGAVSLGNSG------ELEEINTKELAQRISAELKRYSIPQAIFAQRVLC 384
T +P N + A ++ ++G +++E+NTK+LAQRISAELKRYSIPQAIFAQRVLC
Sbjct: 91 TASPANGSSPLQADTISHAGSPDSGSDMDELNTKDLAQRISAELKRYSIPQAIFAQRVLC 150
Query: 385 RSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
RSQGTLSDLLRNPKPWSKLKSGRETFRRM KWLQEPEFQRMSALRLA
Sbjct: 151 RSQGTLSDLLRNPKPWSKLKSGRETFRRMAKWLQEPEFQRMSALRLA 197
>gi|350580762|ref|XP_003123062.3| PREDICTED: one cut domain family member 3-like [Sus scrofa]
Length = 415
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 240 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 299
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 300 MWKWLQEPEFQRMSALRLA 318
>gi|348500918|ref|XP_003438018.1| PREDICTED: one cut domain family member 2-like [Oreochromis
niloticus]
Length = 467
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 129/207 (62%), Gaps = 28/207 (13%)
Query: 244 SVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPT----MNGITTITPHTVVVSPQT 299
S +G GS SP LS GSLH+S L + + +A +P ++ ++ H ++S ++
Sbjct: 163 SGMGHGSLSP-LSSGLGSLHNSQQPL-SAYGPSAHLPADAKMLSPVSGFETHASMLS-RS 219
Query: 300 SPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGN----------- 348
+H L RS +IS + + GAV LG+
Sbjct: 220 DQEH------LARSLGGHGHGMISNISGMHHHPHGHLHSQANGAVMLGDRERHGHGPSQG 273
Query: 349 ---SG-ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
SG + EEINTKE+AQRI+AELKRYSIPQAIFAQR+L RSQGTLSDLLRNPKPWSKLK
Sbjct: 274 VSGSGIQAEEINTKEVAQRITAELKRYSIPQAIFAQRILSRSQGTLSDLLRNPKPWSKLK 333
Query: 405 SGRETFRRMWKWLQEPEFQRMSALRLA 431
SGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 334 SGRETFRRMWKWLQEPEFQRMSALRLA 360
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 17/78 (21%)
Query: 120 LSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH------------TGNVTGSFTVMQN 167
+ G S S++Y TLTPLQ LPPIST++DKF + H GNV+GSFT+M++
Sbjct: 82 MCETGMSLSNTYTTLTPLQHLPPISTVADKFHHPHSHHHAAAHQRLSAGNVSGSFTLMRD 141
Query: 168 N-IGL----GINSHYTYD 180
+ GL + SHY+ +
Sbjct: 142 DHRGLTSMGNLYSHYSKE 159
>gi|297477208|ref|XP_002689228.1| PREDICTED: one cut domain family member 3 [Bos taurus]
gi|296485427|tpg|DAA27542.1| TPA: hCG2040174-like [Bos taurus]
Length = 313
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 76/80 (95%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 138 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 197
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 198 MWKWLQEPEFQRMSALRLAA 217
>gi|393905285|gb|EJD73920.1| HrHNF-6 [Loa loa]
Length = 486
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 84/119 (70%), Positives = 95/119 (79%), Gaps = 3/119 (2%)
Query: 317 GPPKIISQQT-ATLTLKTVTPTNA--VVGAVSLGNSGELEEINTKELAQRISAELKRYSI 373
GP I+ +T T+ +P A + A S + +++E+NTK+LAQRISAELKRYSI
Sbjct: 214 GPSDILDTETMETVKRNGSSPLQADTISHAGSPDSGSDMDELNTKDLAQRISAELKRYSI 273
Query: 374 PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM KWLQEPEFQRMSALRLA
Sbjct: 274 PQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMAKWLQEPEFQRMSALRLAA 332
>gi|324507255|gb|ADY43079.1| Hepatocyte nuclear factor 6 [Ascaris suum]
Length = 485
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 75/82 (91%), Positives = 80/82 (97%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
+++E+NTK+LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 296 DMDELNTKDLAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 355
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRM KWLQEPEFQRMSALRLA
Sbjct: 356 RRMAKWLQEPEFQRMSALRLAA 377
>gi|345787390|ref|XP_855080.2| PREDICTED: one cut domain family member 3 [Canis lupus familiaris]
Length = 488
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 316 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 375
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 376 MWKWLQEPEFQRMSALRLA 394
>gi|432916709|ref|XP_004079355.1| PREDICTED: one cut domain family member 2-like [Oryzias latipes]
Length = 467
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 76/85 (89%), Positives = 82/85 (96%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
G++ + EEINTKE+AQRI+AELKRYSIPQAIFAQR+L RSQGTLSDLLRNPKPWSKLKSG
Sbjct: 273 GSNIQAEEINTKEVAQRITAELKRYSIPQAIFAQRILSRSQGTLSDLLRNPKPWSKLKSG 332
Query: 407 RETFRRMWKWLQEPEFQRMSALRLA 431
RETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 333 RETFRRMWKWLQEPEFQRMSALRLA 357
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 38/57 (66%), Gaps = 12/57 (21%)
Query: 124 GYSPSSSYATLTPLQPLPPISTMSDKFAYGH------------TGNVTGSFTVMQNN 168
G S S++Y TLTPLQ LPPIST++DKF + H GNV+GSFT+M+++
Sbjct: 86 GMSLSNTYTTLTPLQHLPPISTVADKFHHPHSHHHAAAHQRLSAGNVSGSFTLMRDD 142
>gi|431922230|gb|ELK19321.1| One cut domain family member 3, partial [Pteropus alecto]
Length = 401
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 77/85 (90%), Positives = 80/85 (94%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSG 406
G EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSG
Sbjct: 4 GAGAASEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSG 63
Query: 407 RETFRRMWKWLQEPEFQRMSALRLA 431
RETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 64 RETFRRMWKWLQEPEFQRMSALRLA 88
>gi|119389124|pdb|2D5V|A Chain A, Crystal Structure Of Hnf-6alpha Dna-Binding Domain In
Complex With The Ttr Promoter
gi|119389125|pdb|2D5V|B Chain B, Crystal Structure Of Hnf-6alpha Dna-Binding Domain In
Complex With The Ttr Promoter
Length = 164
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 76/81 (93%), Positives = 79/81 (97%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 1 MEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 60
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RMWKWLQEPEFQRMSALRLA
Sbjct: 61 RMWKWLQEPEFQRMSALRLAA 81
>gi|349980937|dbj|GAA32120.1| hepatocyte nuclear factor 6 [Clonorchis sinensis]
Length = 707
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 72/81 (88%), Positives = 78/81 (96%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++ E+NTK+LAQRISAELKRY+IPQA+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 264 DMTELNTKDLAQRISAELKRYTIPQAVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 323
Query: 411 RRMWKWLQEPEFQRMSALRLA 431
RRMW WL EPEFQRMSALRLA
Sbjct: 324 RRMWNWLNEPEFQRMSALRLA 344
>gi|403274062|ref|XP_003928808.1| PREDICTED: uncharacterized protein LOC101036540 [Saimiri
boliviensis boliviensis]
Length = 803
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/129 (65%), Positives = 93/129 (72%), Gaps = 20/129 (15%)
Query: 303 HSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQ 362
H+ N+ +R + P P T T + P AV EINTKE+AQ
Sbjct: 598 HTETRNRRRRIEDPSP---------TWTESVLRPPPAVP-----------REINTKEVAQ 637
Query: 363 RISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF 422
RI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF
Sbjct: 638 RITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEF 697
Query: 423 QRMSALRLA 431
QRMSALRLA
Sbjct: 698 QRMSALRLA 706
>gi|226481475|emb|CAX73635.1| one cut domain, family member 1 [Schistosoma japonicum]
gi|226481477|emb|CAX73636.1| one cut domain, family member 1 [Schistosoma japonicum]
Length = 759
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 72/81 (88%), Positives = 78/81 (96%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++ E+NTK+LAQRISAELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 320 DMSELNTKDLAQRISAELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 379
Query: 411 RRMWKWLQEPEFQRMSALRLA 431
RRMW WL EPE+QRMSALRLA
Sbjct: 380 RRMWNWLNEPEYQRMSALRLA 400
>gi|10566821|dbj|BAB15952.1| HrHNF-6 [Halocynthia roretzi]
Length = 695
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 73/80 (91%), Positives = 77/80 (96%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE+NTKE+A +I+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 475 EEVNTKEVAAKITQELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 534
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 535 MWKWLQEPEFQRMSALRLAA 554
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 26/44 (59%), Gaps = 8/44 (18%)
Query: 108 PTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFA 151
PT T N +G P SYATLTPLQPLP IST S+K+
Sbjct: 273 PTGTTCGNTDG--------PGLSYATLTPLQPLPSISTASEKYG 308
>gi|170040715|ref|XP_001848135.1| homeobox protein onecut [Culex quinquefasciatus]
gi|167864318|gb|EDS27701.1| homeobox protein onecut [Culex quinquefasciatus]
Length = 680
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 74/80 (92%), Positives = 78/80 (97%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKELA RIS+ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 442 EEINTKELAHRISSELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 501
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M+KW+QEPE+QRMSALRLA
Sbjct: 502 MYKWIQEPEYQRMSALRLAA 521
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 16/129 (12%)
Query: 58 KTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVN 117
K + L+VI+ PQ+ + ++ +S SG SV SIM +++F QTL+ ++
Sbjct: 85 KQDALSVIIHPQEGMQQ---ITSMCLSGSGNPGSVDSIMSAADF---------QTLSPMH 132
Query: 118 GRLSPPGYSP-SSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN---IGLGI 173
R SP S ++SYATLTPLQPLPPISTMSDKFAY GN+ +F M N+ + + I
Sbjct: 133 DRDSPVSISGGNTSYATLTPLQPLPPISTMSDKFAYSIGGNICEAFPGMTNSSVGVNIEI 192
Query: 174 NSHYTYDKI 182
NS Y +K+
Sbjct: 193 NSCYNLEKL 201
>gi|344243413|gb|EGV99516.1| One cut domain family member 3 [Cricetulus griseus]
Length = 170
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 75/80 (93%), Positives = 79/80 (98%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
+EINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 1 KEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 60
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MWKWLQEPEFQRMSALRLA
Sbjct: 61 MWKWLQEPEFQRMSALRLAA 80
>gi|118344340|ref|NP_001071994.1| transcription factor protein [Ciona intestinalis]
gi|70569877|dbj|BAE06494.1| transcription factor protein [Ciona intestinalis]
Length = 764
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 72/82 (87%), Positives = 79/82 (96%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
++EE+NTKE+A +I+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF
Sbjct: 475 QIEEVNTKEVASKITQELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 534
Query: 411 RRMWKWLQEPEFQRMSALRLAG 432
RRMWKWLQEPEFQRMS+LRLA
Sbjct: 535 RRMWKWLQEPEFQRMSSLRLAA 556
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 28/41 (68%), Gaps = 4/41 (9%)
Query: 114 TNVNGRLSPPGYSPSS----SYATLTPLQPLPPISTMSDKF 150
TN N +P PSS SYATLTPLQPLP IS++SDK+
Sbjct: 247 TNANPNSNPIDSVPSSVDGPSYATLTPLQPLPSISSVSDKY 287
>gi|321477224|gb|EFX88183.1| hypothetical protein DAPPUDRAFT_32500 [Daphnia pulex]
Length = 148
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/78 (98%), Positives = 77/78 (98%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
EINTKELA RISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM
Sbjct: 1 EINTKELAARISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 60
Query: 414 WKWLQEPEFQRMSALRLA 431
WKWLQEPEFQRMSALRLA
Sbjct: 61 WKWLQEPEFQRMSALRLA 78
>gi|253723104|pdb|1S7E|A Chain A, Solution Structure Of Hnf-6
Length = 147
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 75/79 (94%), Positives = 77/79 (97%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
EINTKE+AQRI+ ELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM
Sbjct: 1 EINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 60
Query: 414 WKWLQEPEFQRMSALRLAG 432
WKWLQEPEFQRMSALRLA
Sbjct: 61 WKWLQEPEFQRMSALRLAA 79
>gi|410924457|ref|XP_003975698.1| PREDICTED: one cut domain family member 2-like [Takifugu rubripes]
Length = 456
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 74/80 (92%), Positives = 78/80 (97%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+L RSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 282 EEINTKEVAQRITAELKRYSIPQAIFAQRILSRSQGTLSDLLRNPKPWSKLKSGRETFRR 341
Query: 413 MWKWLQEPEFQRMSALRLAG 432
MW+WLQEPEFQRMSALRLA
Sbjct: 342 MWRWLQEPEFQRMSALRLAA 361
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 36/57 (63%), Gaps = 12/57 (21%)
Query: 124 GYSPSSSYATLTPLQPLPPISTMSDKFAYGH------------TGNVTGSFTVMQNN 168
G S S++Y LTPLQ LPPIS +SDKF + H GNV+GSFT+M+++
Sbjct: 86 GMSLSNTYTALTPLQHLPPISNVSDKFHHHHSHHHAAAHQRLSAGNVSGSFTLMRDD 142
>gi|269784935|ref|NP_001161619.1| onecut transcription factor [Saccoglossus kowalevskii]
gi|268054243|gb|ACY92608.1| onecut transcription factor [Saccoglossus kowalevskii]
Length = 405
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 72/80 (90%), Positives = 78/80 (97%)
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 407
+SG EEINTKE+AQR+++ELKRYSIPQA+FAQRVLCRSQGTLSDLLRNPKPWSKLKSGR
Sbjct: 234 DSGNGEEINTKEVAQRVTSELKRYSIPQAVFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR 293
Query: 408 ETFRRMWKWLQEPEFQRMSA 427
ETFRRMWKWLQEPEFQRMSA
Sbjct: 294 ETFRRMWKWLQEPEFQRMSA 313
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 32/54 (59%), Positives = 41/54 (75%), Gaps = 4/54 (7%)
Query: 128 SSSYATLTPLQPLPPISTM-SDKFAYGH---TGNVTGSFTVMQNNIGLGINSHY 177
SSSY TLTPLQPLPPIS++ SDKF + H TGNV+GSFT+M ++ +N+ Y
Sbjct: 70 SSSYTTLTPLQPLPPISSVTSDKFRHHHHHDTGNVSGSFTLMNSHTNHMMNNPY 123
>gi|47198560|emb|CAF87207.1| unnamed protein product [Tetraodon nigroviridis]
Length = 101
Score = 162 bits (409), Expect = 4e-37, Method: Composition-based stats.
Identities = 76/79 (96%), Positives = 79/79 (100%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTKE+AQRI+AELKRYSIPQAIFAQR+LCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 23 EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLRNPKPWSKLKSGRETFRR 82
Query: 413 MWKWLQEPEFQRMSALRLA 431
MWKWLQEPEFQRMSALRLA
Sbjct: 83 MWKWLQEPEFQRMSALRLA 101
>gi|76156509|gb|AAX27705.2| SJCHGC07952 protein [Schistosoma japonicum]
Length = 149
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 77/102 (75%), Positives = 85/102 (83%), Gaps = 4/102 (3%)
Query: 334 VTPTNAVVGAVSLGNS----GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 389
V+P G+ GN ++ E+NTK+LAQRISAELKRYSIPQA+FAQRVLCRSQGT
Sbjct: 14 VSPIPQADGSDQEGNDQSLPDDMSELNTKDLAQRISAELKRYSIPQAVFAQRVLCRSQGT 73
Query: 390 LSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
LSDLLRNPKPWSKLKSGRETFRRMW WL EPE+QRMSALRLA
Sbjct: 74 LSDLLRNPKPWSKLKSGRETFRRMWNWLNEPEYQRMSALRLA 115
>gi|47206865|emb|CAF90521.1| unnamed protein product [Tetraodon nigroviridis]
Length = 347
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/90 (84%), Positives = 80/90 (88%), Gaps = 6/90 (6%)
Query: 342 GAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
G L +SG+LEEINTKE+AQRI+AELKRYSIPQAIFAQR GTLSDLLRNPKPWS
Sbjct: 264 GTQGLSSSGQLEEINTKEVAQRITAELKRYSIPQAIFAQR------GTLSDLLRNPKPWS 317
Query: 402 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
KLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 318 KLKSGRETFRRMWKWLQEPEFQRMSALRLA 347
>gi|17538688|ref|NP_501046.1| Protein CEH-48 [Caenorhabditis elegans]
gi|352645614|emb|CCD64998.1| Protein CEH-48 [Caenorhabditis elegans]
Length = 459
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 72/80 (90%), Positives = 77/80 (96%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE+NTKELA +I+AELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 224 EELNTKELALQIAAELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 283
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M KWL+EPEFQRMSALRLA
Sbjct: 284 MAKWLEEPEFQRMSALRLAA 303
>gi|268535782|ref|XP_002633026.1| C. briggsae CBR-CEH-48 protein [Caenorhabditis briggsae]
Length = 444
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 71/80 (88%), Positives = 77/80 (96%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE+NTKEL+ +I+AELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 224 EELNTKELSIQIAAELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 283
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M KWL+EPEFQRMSALRLA
Sbjct: 284 MAKWLEEPEFQRMSALRLAA 303
>gi|260792489|ref|XP_002591247.1| hypothetical protein BRAFLDRAFT_190875 [Branchiostoma floridae]
gi|229276451|gb|EEN47258.1| hypothetical protein BRAFLDRAFT_190875 [Branchiostoma floridae]
Length = 83
Score = 158 bits (399), Expect = 6e-36, Method: Composition-based stats.
Identities = 72/83 (86%), Positives = 83/83 (100%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
+GE+EEINTKE+AQR+++ELKRYSIPQA+FAQ++LCRSQGTLSDLLRNPKPWSKLKSGRE
Sbjct: 1 TGEVEEINTKEVAQRVTSELKRYSIPQAVFAQKILCRSQGTLSDLLRNPKPWSKLKSGRE 60
Query: 409 TFRRMWKWLQEPEFQRMSALRLA 431
TFRRM+KWL+EPEFQRMSALRLA
Sbjct: 61 TFRRMYKWLEEPEFQRMSALRLA 83
>gi|313246463|emb|CBY35369.1| unnamed protein product [Oikopleura dioica]
Length = 476
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 71/81 (87%), Positives = 77/81 (95%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTK +AQ+I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 298 IEEINTKVVAQQITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 357
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RM WL+EPEFQRMS+LRLA
Sbjct: 358 RMQSWLREPEFQRMSSLRLAA 378
>gi|313228451|emb|CBY23602.1| unnamed protein product [Oikopleura dioica]
Length = 476
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 71/81 (87%), Positives = 77/81 (95%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTK +AQ+I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 298 IEEINTKVVAQQITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 357
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RM WL+EPEFQRMS+LRLA
Sbjct: 358 RMQSWLREPEFQRMSSLRLAA 378
>gi|56791840|gb|AAW30421.1| HNF6c [Oikopleura dioica]
Length = 463
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 71/81 (87%), Positives = 77/81 (95%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
+EEINTK +AQ+I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR
Sbjct: 298 IEEINTKVVAQQITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 357
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RM WL+EPEFQRMS+LRLA
Sbjct: 358 RMQSWLREPEFQRMSSLRLAA 378
>gi|341895267|gb|EGT51202.1| hypothetical protein CAEBREN_28920 [Caenorhabditis brenneri]
Length = 456
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 70/80 (87%), Positives = 77/80 (96%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE+NTKELA +I++ELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPWSKLKSGRETF+R
Sbjct: 222 EELNTKELALQIASELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWSKLKSGRETFKR 281
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M KWL+EPEFQRMSALRLA
Sbjct: 282 MAKWLEEPEFQRMSALRLAA 301
>gi|308469331|ref|XP_003096904.1| CRE-CEH-48 protein [Caenorhabditis remanei]
gi|308241319|gb|EFO85271.1| CRE-CEH-48 protein [Caenorhabditis remanei]
Length = 465
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/104 (72%), Positives = 87/104 (83%), Gaps = 4/104 (3%)
Query: 333 TVTPTNAVVGAVSLGNS----GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQG 388
T +P A+ +G+S + EE+NTKELA +I++ELKRYSIPQAIFA+RVLCRSQG
Sbjct: 205 TKSPKGTSPAALIMGHSEDDIDDGEELNTKELALQIASELKRYSIPQAIFAERVLCRSQG 264
Query: 389 TLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
TLSDLLRNPKPW+KLKSGRETFRRM KWL+EPEFQRMSALRLA
Sbjct: 265 TLSDLLRNPKPWNKLKSGRETFRRMAKWLEEPEFQRMSALRLAA 308
>gi|308452881|ref|XP_003089217.1| hypothetical protein CRE_22230 [Caenorhabditis remanei]
gi|308241577|gb|EFO85529.1| hypothetical protein CRE_22230 [Caenorhabditis remanei]
Length = 480
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/104 (72%), Positives = 87/104 (83%), Gaps = 4/104 (3%)
Query: 333 TVTPTNAVVGAVSLGNS----GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQG 388
T +P A+ +G+S + EE+NTKELA +I++ELKRYSIPQAIFA+RVLCRSQG
Sbjct: 220 TKSPKGTSPAALIMGHSEDDIDDGEELNTKELALQIASELKRYSIPQAIFAERVLCRSQG 279
Query: 389 TLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
TLSDLLRNPKPW+KLKSGRETFRRM KWL+EPEFQRMSALRLA
Sbjct: 280 TLSDLLRNPKPWNKLKSGRETFRRMAKWLEEPEFQRMSALRLAA 323
>gi|242010366|ref|XP_002425939.1| homeobox protein onecut, putative [Pediculus humanus corporis]
gi|212509922|gb|EEB13201.1| homeobox protein onecut, putative [Pediculus humanus corporis]
Length = 590
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/103 (72%), Positives = 84/103 (81%), Gaps = 9/103 (8%)
Query: 336 PTNAVVGAVSLGN------SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 389
PTN A S GN + EEINTK+LAQ+ISAELKRYSIPQ++FAQ+VLCRSQGT
Sbjct: 57 PTNE---AFSNGNVEHVSQAANEEEINTKQLAQQISAELKRYSIPQSVFAQKVLCRSQGT 113
Query: 390 LSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
LSDLLRNPKPWSKLKSGR+TFRRMWKWLQEPE R++ LRLA
Sbjct: 114 LSDLLRNPKPWSKLKSGRDTFRRMWKWLQEPEASRITTLRLAC 156
>gi|348525685|ref|XP_003450352.1| PREDICTED: one cut domain family member 2-like [Oreochromis
niloticus]
Length = 451
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 161/320 (50%), Gaps = 67/320 (20%)
Query: 130 SYATLTPLQPLPPISTMSDKFAYGH-------TGNVTGSFTVMQNNIGLGINSHYTYDKI 182
+Y TLTPLQP DKF + H NV GSFT+M+ + GL N + Y K
Sbjct: 76 TYTTLTPLQPF------DDKFHHHHHHHPCLPVSNVIGSFTLMREDRGLSTNFYNPYGK- 128
Query: 183 QMSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVD 242
+ ++Q LSP ST S GL S S + Y S N
Sbjct: 129 ---------------ELAMSQS---LSPPSTGS--GLGS-----SMHGYGSLGN------ 157
Query: 243 RSVLGGGSQSPTLSPNEGSLHSSPTTLVT-NFQGAATIPTMNGITTITPHTV--VVSPQT 299
S G GSQ + P +H T +F + P + G H + + + Q
Sbjct: 158 -SPNGNGSQ---MLPGGYDVHGGSLFCRTSDFGREMSPPGLGGGDVSVGHQLNKMETHQH 213
Query: 300 SPQHS--LISNQLQRSDSPGPPKIISQQTATL-------TLKTVTPTNAVVGAVSLGNSG 350
+P H + S Q P SQQ + + + +P+ ++
Sbjct: 214 APGHHPHIYSQHYQPHHHP------SQQATKMGDHLHSSSSASSSPSEGMLPGSQGSGGS 267
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
EEINT+++AQRI ELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPW KLKSGRETF
Sbjct: 268 TGEEINTRDVAQRIITELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWGKLKSGRETF 327
Query: 411 RRMWKWLQEPEFQRMSALRL 430
+RM +WLQEPEFQRM++LRL
Sbjct: 328 KRMSRWLQEPEFQRMASLRL 347
>gi|56791836|gb|AAW30419.1| HNF6a [Oikopleura dioica]
Length = 270
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 68/80 (85%), Positives = 76/80 (95%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE++TK +A +I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 43 EEVDTKAVAAKITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 102
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M KWLQEPE+QRM+ALR+A
Sbjct: 103 MKKWLQEPEYQRMAALRIAA 122
>gi|313239088|emb|CBY14067.1| unnamed protein product [Oikopleura dioica]
Length = 260
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 68/80 (85%), Positives = 76/80 (95%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EE++TK +A +I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR
Sbjct: 43 EEVDTKAVAAKITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 102
Query: 413 MWKWLQEPEFQRMSALRLAG 432
M KWLQEPE+QRM+ALR+A
Sbjct: 103 MKKWLQEPEYQRMAALRIAA 122
>gi|190338384|gb|AAI63394.1| One cut domain, family member, like [Danio rerio]
gi|190338386|gb|AAI63397.1| One cut domain, family member, like [Danio rerio]
gi|190339672|gb|AAI63019.1| One cut domain, family member, like [Danio rerio]
Length = 446
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 67/78 (85%), Positives = 74/78 (94%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTK++AQRI ELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPW KLKSGRETF+R
Sbjct: 265 EEINTKDVAQRIITELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWGKLKSGRETFKR 324
Query: 413 MWKWLQEPEFQRMSALRL 430
M +WLQEPEFQRM++LRL
Sbjct: 325 MSRWLQEPEFQRMASLRL 342
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 105 HAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGH------TGNV 158
H PTY + + P + S+Y TLTPLQP DKF + H NV
Sbjct: 47 HQSPTYDHMATLGYSRDAP-TTCGSTYTTLTPLQPF------DDKFHHHHHHPCLPVSNV 99
Query: 159 TGSFTVMQNNIGLGINSHYTYDK 181
GSFT+M+ + LG N + +Y K
Sbjct: 100 IGSFTLMREDRSLGGNYYTSYGK 122
>gi|56790319|ref|NP_694516.1| one cut domain, family member, like [Danio rerio]
gi|17999893|gb|AAL02365.1| onecut homeodomain protein [Danio rerio]
Length = 446
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 67/78 (85%), Positives = 74/78 (94%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINTK++AQRI ELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPW KLKSGRETF+R
Sbjct: 265 EEINTKDVAQRIITELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWGKLKSGRETFKR 324
Query: 413 MWKWLQEPEFQRMSALRL 430
M +WLQEPEFQRM++LRL
Sbjct: 325 MSRWLQEPEFQRMASLRL 342
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 105 HAEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHT------GNV 158
H PTY + + P + S+Y TLTPLQP DKF + H NV
Sbjct: 47 HQSPTYDHMATLGYSRDAP-TTCGSTYTTLTPLQPF------DDKFHHHHHHPCLPISNV 99
Query: 159 TGSFTVMQNNIGLGINSHYTYDK 181
GSFT+M+ + LG N + +Y K
Sbjct: 100 IGSFTLMREDRSLGGNYYTSYGK 122
>gi|432910516|ref|XP_004078394.1| PREDICTED: one cut domain family member 2-like [Oryzias latipes]
Length = 452
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 66/78 (84%), Positives = 74/78 (94%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINT+++AQRI ELKRYSIPQAIFA+RVLCRSQGTLSDLLRNPKPW KLKSGRETF+R
Sbjct: 271 EEINTRDVAQRIITELKRYSIPQAIFAERVLCRSQGTLSDLLRNPKPWGKLKSGRETFKR 330
Query: 413 MWKWLQEPEFQRMSALRL 430
M +WLQEPEFQRM++LRL
Sbjct: 331 MSRWLQEPEFQRMASLRL 348
>gi|313244997|emb|CBY42476.1| unnamed protein product [Oikopleura dioica]
Length = 226
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 66/78 (84%), Positives = 74/78 (94%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK +A +I+AELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM
Sbjct: 1 VDTKAVAAKITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMK 60
Query: 415 KWLQEPEFQRMSALRLAG 432
KWLQEPE+QRM+ALR+A
Sbjct: 61 KWLQEPEYQRMAALRIAA 78
>gi|410904775|ref|XP_003965867.1| PREDICTED: one cut domain family member 2-like [Takifugu rubripes]
Length = 453
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/78 (83%), Positives = 73/78 (93%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINT+++AQRI ELKRYSIPQAIFA+RVL RSQGTLSDLLRNPKPW KLKSGRETF+R
Sbjct: 272 EEINTRDVAQRIITELKRYSIPQAIFAERVLSRSQGTLSDLLRNPKPWGKLKSGRETFKR 331
Query: 413 MWKWLQEPEFQRMSALRL 430
M +WLQEPEFQRM++LRL
Sbjct: 332 MSRWLQEPEFQRMASLRL 349
>gi|47213722|emb|CAF95153.1| unnamed protein product [Tetraodon nigroviridis]
Length = 496
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 65/78 (83%), Positives = 73/78 (93%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
EEINT+++AQRI ELKRYSIPQAIFA+RVL RSQGTLSDLLRNPKPW KLKSGRETF+R
Sbjct: 270 EEINTRDVAQRIITELKRYSIPQAIFAERVLSRSQGTLSDLLRNPKPWGKLKSGRETFKR 329
Query: 413 MWKWLQEPEFQRMSALRL 430
M +WLQEPEFQRM++LRL
Sbjct: 330 MSRWLQEPEFQRMASLRL 347
>gi|194770918|ref|XP_001967529.1| GF20615 [Drosophila ananassae]
gi|190622704|gb|EDV38228.1| GF20615 [Drosophila ananassae]
Length = 1109
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/94 (71%), Positives = 72/94 (76%), Gaps = 13/94 (13%)
Query: 339 AVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK 398
A G+ S ++EEINTKELAQRISA QRVLCRSQGTLSDLLRNPK
Sbjct: 716 ACGGSSRSAVSSDVEEINTKELAQRISA-------------QRVLCRSQGTLSDLLRNPK 762
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
PWSKLKSGRETFRRM+KWLQEPEFQRMSALR+A
Sbjct: 763 PWSKLKSGRETFRRMYKWLQEPEFQRMSALRMAA 796
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/65 (61%), Positives = 51/65 (78%), Gaps = 3/65 (4%)
Query: 109 TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN 168
+YQTLT+VN RLSPP +SP+S YATLTP+QPLPPISTMS+KFAY +G+++G N
Sbjct: 324 SYQTLTSVNDRLSPPEFSPTS-YATLTPIQPLPPISTMSEKFAY--SGHISGREEDGNNR 380
Query: 169 IGLGI 173
G G+
Sbjct: 381 RGGGV 385
>gi|324504819|gb|ADY42077.1| Homeobox protein ceh-38 [Ascaris suum]
Length = 766
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 53/76 (69%), Positives = 67/76 (88%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK++SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 370 IDTKDLCKRIAWELKQHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMF 429
Query: 415 KWLQEPEFQRMSALRL 430
W+Q+P R+ L +
Sbjct: 430 NWVQQPLELRLGILDM 445
>gi|308502530|ref|XP_003113449.1| CRE-CEH-38 protein [Caenorhabditis remanei]
gi|308263408|gb|EFP07361.1| CRE-CEH-38 protein [Caenorhabditis remanei]
Length = 510
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/120 (53%), Positives = 82/120 (68%), Gaps = 10/120 (8%)
Query: 313 SDSPGPPKIISQQTATL--TLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKR 370
S+ P PP Q T + T + P NA +G E I+TK+L +RI+ ELK
Sbjct: 277 SNIPTPPSAHVQPTRRIYSTQDSNDPLNAEIG--------EDIYIDTKDLCKRIAFELKN 328
Query: 371 YSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 430
+SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+ W+ +P R++ L +
Sbjct: 329 HSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMYNWVAQPLATRLAILDM 388
>gi|324509293|gb|ADY43913.1| Homeobox protein ceh-38 [Ascaris suum]
Length = 586
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 53/76 (69%), Positives = 67/76 (88%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK++SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 190 IDTKDLCKRIAWELKQHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMF 249
Query: 415 KWLQEPEFQRMSALRL 430
W+Q+P R+ L +
Sbjct: 250 NWVQQPLELRLGILDM 265
>gi|341899848|gb|EGT55783.1| CBN-CEH-38 protein [Caenorhabditis brenneri]
Length = 639
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 66/76 (86%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK +SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 330 IDTKDLCKRIAFELKNHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMY 389
Query: 415 KWLQEPEFQRMSALRL 430
W+ +P R++ L +
Sbjct: 390 NWVSQPLATRLAILDM 405
>gi|25147950|ref|NP_741018.1| Protein CEH-38, isoform a [Caenorhabditis elegans]
gi|373218560|emb|CCD61546.1| Protein CEH-38, isoform a [Caenorhabditis elegans]
Length = 632
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 66/76 (86%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK +SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 308 IDTKDLCKRIAFELKNHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMY 367
Query: 415 KWLQEPEFQRMSALRL 430
W+ +P R++ L +
Sbjct: 368 NWVAQPLATRLAILDM 383
>gi|25147952|ref|NP_741017.1| Protein CEH-38, isoform b [Caenorhabditis elegans]
gi|21264445|sp|Q19720.2|HM38_CAEEL RecName: Full=Homeobox protein ceh-38
gi|373218561|emb|CCD61547.1| Protein CEH-38, isoform b [Caenorhabditis elegans]
Length = 641
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 66/76 (86%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK +SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 317 IDTKDLCKRIAFELKNHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMY 376
Query: 415 KWLQEPEFQRMSALRL 430
W+ +P R++ L +
Sbjct: 377 NWVAQPLATRLAILDM 392
>gi|7499644|pir||T16136 hypothetical protein F22D3.1 - Caenorhabditis elegans
Length = 644
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 66/76 (86%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK+L +RI+ ELK +SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+
Sbjct: 320 IDTKDLCKRIAFELKNHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMY 379
Query: 415 KWLQEPEFQRMSALRL 430
W+ +P R++ L +
Sbjct: 380 NWVAQPLATRLAILDM 395
>gi|268571617|ref|XP_002648769.1| Hypothetical protein CBG10702 [Caenorhabditis briggsae]
Length = 419
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/161 (45%), Positives = 92/161 (57%), Gaps = 14/161 (8%)
Query: 272 NFQGAATIPTMN-GITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLT 330
NF G + I I + P V S Q S ++ NQL P K +S+Q
Sbjct: 149 NFSGNSIIQHQQIDIDEVLP--AVNSMQISSKYDEQENQL------NPMKTLSKQKPKRV 200
Query: 331 LKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTL 390
K P A NS EEI+TK++A R+ ELK + IPQ FA+R+LCRSQGTL
Sbjct: 201 RK---PRQAAQKG--FDNSDNEEEIDTKDIAMRVVNELKTHQIPQKTFAERILCRSQGTL 255
Query: 391 SDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
SDLLRNPKPW+ LKSG+ETFRRM KWLQ+P R+ L ++
Sbjct: 256 SDLLRNPKPWNMLKSGQETFRRMHKWLQQPLTARLRILHMS 296
>gi|268530666|ref|XP_002630459.1| C. briggsae CBR-CEH-38 protein [Caenorhabditis briggsae]
Length = 626
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/118 (51%), Positives = 79/118 (66%), Gaps = 11/118 (9%)
Query: 316 PGPPKIISQ---QTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
P PP I Q + T + P NA +G I+T++L +RI+ ELK +S
Sbjct: 270 PTPPTAIVQPPPRRVYSTQDSNDPLNAEIGDDIY--------IDTRDLCKRIAFELKNHS 321
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 430
IPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+ W+ +P R++ L +
Sbjct: 322 IPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMYNWVSQPLDTRLAILDM 379
>gi|170590954|ref|XP_001900236.1| CUT domain containing protein [Brugia malayi]
gi|158592386|gb|EDP30986.1| CUT domain containing protein [Brugia malayi]
Length = 720
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 50/71 (70%), Positives = 62/71 (87%)
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 419
L +RI+ ELK++SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+ W+Q+
Sbjct: 334 LCKRIAWELKQHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMFNWVQQ 393
Query: 420 PEFQRMSALRL 430
P R+ L +
Sbjct: 394 PLELRLGILDM 404
>gi|312084474|ref|XP_003144290.1| hypothetical protein LOAG_08712 [Loa loa]
gi|307760546|gb|EFO19780.1| hypothetical protein LOAG_08712 [Loa loa]
Length = 751
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 50/71 (70%), Positives = 62/71 (87%)
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQE 419
L +RI+ ELK++SIPQAIFA+R+LCRSQGTLSDLLRNPKPW+KLKSGRETFRRM+ W+Q+
Sbjct: 365 LCKRIAWELKQHSIPQAIFAERILCRSQGTLSDLLRNPKPWNKLKSGRETFRRMFNWVQQ 424
Query: 420 PEFQRMSALRL 430
P R+ L +
Sbjct: 425 PLELRLGILDM 435
>gi|56791838|gb|AAW30420.1| HNF6b [Oikopleura dioica]
gi|313234542|emb|CBY10499.1| unnamed protein product [Oikopleura dioica]
Length = 335
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/82 (64%), Positives = 68/82 (82%), Gaps = 4/82 (4%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+ I A LKRYSIPQ++FA ++L RSQGTLSDLLRNPKPW KLK+GRETFRRM
Sbjct: 180 MDTKQVAEDILAVLKRYSIPQSVFANKILGRSQGTLSDLLRNPKPWDKLKAGRETFRRME 239
Query: 415 KWLQEPEFQRMSAL----RLAG 432
+WLQ+PE R++ L R+AG
Sbjct: 240 QWLQQPEMTRVTELQQATRIAG 261
>gi|268571659|ref|XP_002648778.1| Hypothetical protein CBG11796 [Caenorhabditis briggsae]
Length = 419
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/86 (62%), Positives = 67/86 (77%)
Query: 346 LGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS 405
L +S EEI+TK++A RI ELK + IPQ FA+R+LCRSQGTLSDLLR PKPW+ LKS
Sbjct: 210 LHDSEVEEEIDTKDVAMRIVNELKTHHIPQKTFAERILCRSQGTLSDLLRKPKPWNMLKS 269
Query: 406 GRETFRRMWKWLQEPEFQRMSALRLA 431
G+ETFRRM KWLQ+P R+S L ++
Sbjct: 270 GQETFRRMHKWLQQPLTARLSILDMS 295
>gi|402594547|gb|EJW88473.1| CUT domain-containing protein [Wuchereria bancrofti]
Length = 567
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/77 (62%), Positives = 61/77 (79%), Gaps = 6/77 (7%)
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQ------GTLSDLLRNPKPWSKLKSGRETFRRM 413
L +RI+ ELK++SIPQAIFA+R+LCR + GTLSDLLRNPKPW+KLKSGRETFRRM
Sbjct: 175 LCKRIAWELKQHSIPQAIFAERILCRYRLFSSILGTLSDLLRNPKPWNKLKSGRETFRRM 234
Query: 414 WKWLQEPEFQRMSALRL 430
+ W+Q+P R+ L +
Sbjct: 235 FNWVQQPLELRLGILDM 251
>gi|402577362|gb|EJW71319.1| hypothetical protein WUBG_17771, partial [Wuchereria bancrofti]
Length = 79
Score = 105 bits (263), Expect = 3e-20, Method: Composition-based stats.
Identities = 48/51 (94%), Positives = 48/51 (94%)
Query: 383 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAGN 433
L RSQGTLSDLLRNPKPWSKLKSGRETFRRM KWLQEPEFQRMSALRLA N
Sbjct: 22 LFRSQGTLSDLLRNPKPWSKLKSGRETFRRMAKWLQEPEFQRMSALRLAEN 72
>gi|47217875|emb|CAG02368.1| unnamed protein product [Tetraodon nigroviridis]
Length = 421
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/45 (97%), Positives = 45/45 (100%)
Query: 387 QGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 431
+GTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
Sbjct: 277 EGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA 321
>gi|156395336|ref|XP_001637067.1| predicted protein [Nematostella vectensis]
gi|156224176|gb|EDO45004.1| predicted protein [Nematostella vectensis]
Length = 177
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 42/76 (55%), Positives = 62/76 (81%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK +A +++ EL+RY IPQA+FA++VL RSQGTLSDLLR PKPW++++ G+ F+RM
Sbjct: 25 IDTKAMALKLTEELRRYHIPQAVFAKKVLNRSQGTLSDLLRKPKPWNEMRLGKAIFKRMK 84
Query: 415 KWLQEPEFQRMSALRL 430
+WL+ PE +R+ LR+
Sbjct: 85 EWLELPEVKRIPQLRM 100
>gi|341902545|gb|EGT58480.1| hypothetical protein CAEBREN_20520 [Caenorhabditis brenneri]
Length = 251
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/80 (56%), Positives = 59/80 (73%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRE 408
S E + I+TK+L +ISAEL+ IPQA+FA R++ RSQGTLSDLL PKPW LK+GR
Sbjct: 69 SVERDYIDTKDLCLKISAELEGRHIPQALFANRIISRSQGTLSDLLGKPKPWEILKAGRN 128
Query: 409 TFRRMWKWLQEPEFQRMSAL 428
TF+RM+ W+ +P RM+ L
Sbjct: 129 TFKRMYNWMVQPYETRMAIL 148
>gi|341875995|gb|EGT31930.1| hypothetical protein CAEBREN_00529 [Caenorhabditis brenneri]
Length = 233
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 62/91 (68%)
Query: 338 NAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNP 397
N AV S E + I+TK+L +I AELK ++IPQ +FA R++ RSQGTLSDLL P
Sbjct: 39 NYTAQAVEDMLSVERDYIDTKDLCLKILAELKGHNIPQELFATRIISRSQGTLSDLLGKP 98
Query: 398 KPWSKLKSGRETFRRMWKWLQEPEFQRMSAL 428
KPW L++GR TF+RM+ W+ +P RM+ L
Sbjct: 99 KPWEMLRAGRNTFKRMYNWMVQPYETRMAIL 129
>gi|341886192|gb|EGT42127.1| hypothetical protein CAEBREN_14492 [Caenorhabditis brenneri]
Length = 484
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 59/78 (75%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E + I TK+L +ISAEL+ + IPQ +F++R++ RSQGTLSDLL+ P+PWS L++ R+T+
Sbjct: 331 EGDYIYTKDLCMKISAELRTHEIPQHLFSERIINRSQGTLSDLLKKPRPWSSLRTARKTY 390
Query: 411 RRMWKWLQEPEFQRMSAL 428
RM+ WL +P RM+ +
Sbjct: 391 IRMYNWLSQPLETRMAMV 408
>gi|25148837|ref|NP_508341.2| Protein CEH-21 [Caenorhabditis elegans]
gi|21431779|sp|Q22811.3|HM21_CAEEL RecName: Full=Homeobox protein ceh-21
gi|18157195|emb|CAD20808.1| CEH-21 protein [Caenorhabditis elegans]
gi|351057922|emb|CCD64528.1| Protein CEH-21 [Caenorhabditis elegans]
Length = 495
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/76 (55%), Positives = 58/76 (76%), Gaps = 1/76 (1%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G+ EE++T ++A+RI +ELK IPQ A+++L RSQGTLSDLLR PKPWS +K+GR T
Sbjct: 288 GDDEELDTVDIARRILSELKERCIPQTALAEKILARSQGTLSDLLRMPKPWSVMKNGRAT 347
Query: 410 FRRMWKWLQ-EPEFQR 424
F+RM WL +P+ +R
Sbjct: 348 FQRMSNWLGLDPDVRR 363
>gi|17551242|ref|NP_508342.1| Protein CEH-39 [Caenorhabditis elegans]
gi|13627102|sp|Q22812.1|HM39_CAEEL RecName: Full=Homeobox protein ceh-39
gi|351057923|emb|CCD64529.1| Protein CEH-39 [Caenorhabditis elegans]
Length = 399
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 50/68 (73%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET 409
G+ EE++T+ +A RI ELK IPQA A +VL RSQGTLSDLLR PKPW +K+GR T
Sbjct: 209 GDDEELDTQLIAHRILDELKEQCIPQASLAVKVLGRSQGTLSDLLRKPKPWGIMKNGRGT 268
Query: 410 FRRMWKWL 417
F+RM WL
Sbjct: 269 FQRMANWL 276
>gi|308476767|ref|XP_003100599.1| hypothetical protein CRE_19829 [Caenorhabditis remanei]
gi|308264855|gb|EFP08808.1| hypothetical protein CRE_19829 [Caenorhabditis remanei]
Length = 1331
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
+ I+TK L + + LK++SI Q +FA V R QGT+SDLLRNP+ W K+KSGR+ +RR
Sbjct: 772 DYIDTKRLCDTVCSLLKKHSISQTLFAHVVAHRRQGTMSDLLRNPRCWFKMKSGRDVYRR 831
Query: 413 MWKWLQEPEFQRMSAL 428
M+ W++ E +R+ L
Sbjct: 832 MYNWVRMKEHERLEIL 847
>gi|268533570|ref|XP_002631913.1| Hypothetical protein CBG07897 [Caenorhabditis briggsae]
Length = 483
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 49/74 (66%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK L I EL+ IPQ IFA+RV+ R QGTLS+L+R PKPW LKSG + R++
Sbjct: 304 LDTKVLCNSILQELETKKIPQTIFAKRVINRCQGTLSELIRKPKPWDSLKSGHGIYVRLY 363
Query: 415 KWLQEPEFQRMSAL 428
W Q PE QRM +
Sbjct: 364 NWYQLPEEQRMEIV 377
>gi|268562924|ref|XP_002646810.1| Hypothetical protein CBG19488 [Caenorhabditis briggsae]
Length = 668
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/71 (57%), Positives = 48/71 (67%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK L I E+K I Q+IFA RV RSQGTLS+LLR PKPW LKSGR T+ R +
Sbjct: 493 LDTKILCDSILQEMKTRYISQSIFATRVANRSQGTLSELLRKPKPWDSLKSGRGTYVRFF 552
Query: 415 KWLQEPEFQRM 425
W Q PE QRM
Sbjct: 553 NWYQLPENQRM 563
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 47/74 (63%)
Query: 357 TKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKW 416
++EL ++++ E+ Y+I FA++VL + L +L+NPKPW++L++GR + R+ W
Sbjct: 131 SEELCEKLTREIHNYNICCQNFAKKVLNKHPRVLRKMLKNPKPWNELRTGRLNYIRIHNW 190
Query: 417 LQEPEFQRMSALRL 430
L P +R+ L +
Sbjct: 191 LNLPLEKRLGILDM 204
>gi|401712708|gb|AFP99087.1| Hnf6, partial [Ophiocoma wendtii]
Length = 206
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 78/165 (47%), Gaps = 28/165 (16%)
Query: 256 SPNEGSLHSSPTTLVTNFQGAATIPTMNGI--------TTITPHTVVVSPQTSPQHS--- 304
SPN S+ +SP + G A +NG T TPH PQ + +
Sbjct: 42 SPNLASMAASPYDKLGGMGGMAPNSMLNGYPQSMGGYGTMGTPHHASCGPQDARYATAMN 101
Query: 305 ----LISNQLQRSDSPGPPKIISQQTATLTLKTV-----TPTNAVVGAV--------SLG 347
S Q R P T + L + +PT V G + +
Sbjct: 102 GYDQFASTQHMRGLQPPAHHQRGGLTNPMVLNGLGHAHGSPTGPVGGRITHDRNTNSARD 161
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSD 392
SG EEINTKE+A R+++ELKRYSIPQA+FAQRVLCRSQGTLSD
Sbjct: 162 GSGAGEEINTKEVAARVTSELKRYSIPQAVFAQRVLCRSQGTLSD 206
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/55 (58%), Positives = 39/55 (70%), Gaps = 3/55 (5%)
Query: 128 SSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKI 182
S+SY TLTPLQPLPPIST+SDKF NV+GSFT+M N+ L + YDK+
Sbjct: 5 SNSYTTLTPLQPLPPISTVSDKFR--DPSNVSGSFTLM-NSPNLASMAASPYDKL 56
>gi|268534632|ref|XP_002632447.1| Hypothetical protein CBG13668 [Caenorhabditis briggsae]
Length = 688
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 48/63 (76%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK+L R++ +LK SIPQ+ F+ ++L RSQGTLSD+LRNPKPW ++ G ET+ RM
Sbjct: 481 VDTKDLCNRLAQDLKDNSIPQSAFSTKILGRSQGTLSDILRNPKPWDQINRGAETYERMI 540
Query: 415 KWL 417
WL
Sbjct: 541 NWL 543
>gi|308466012|ref|XP_003095262.1| hypothetical protein CRE_18178 [Caenorhabditis remanei]
gi|308245546|gb|EFO89498.1| hypothetical protein CRE_18178 [Caenorhabditis remanei]
Length = 375
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 64/110 (58%), Gaps = 10/110 (9%)
Query: 319 PKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIF 378
PK S++++T+T + P NA I+TK L + + LK Y I Q +F
Sbjct: 196 PKRRSKKSSTVTTDSSDPLNAPTPDY----------IDTKPLCGTVWSILKEYGISQTLF 245
Query: 379 AQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSAL 428
AQ V R QGT+SDLLRNP+ W K+KSGR+ +RRM+ W++ +R+ L
Sbjct: 246 AQVVAQRRQGTMSDLLRNPRDWFKMKSGRDVYRRMYNWVRMAAHERLEIL 295
>gi|268534924|ref|XP_002632595.1| Hypothetical protein CBG13865 [Caenorhabditis briggsae]
Length = 509
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 343 AVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSK 402
A L N L+ TK L + EL+ IPQ IFA RV R QGTLS+L+R PKPW
Sbjct: 311 AAPLANGAALD---TKVLCNAVFQELETRKIPQTIFATRVANRVQGTLSELIRKPKPWDC 367
Query: 403 LKSGRETFRRMWKWLQEPEFQRMSAL 428
+KSGR + R++ W Q PE QRM +
Sbjct: 368 VKSGRGVYVRLFNWYQLPEDQRMEIV 393
>gi|308466046|ref|XP_003095279.1| hypothetical protein CRE_18161 [Caenorhabditis remanei]
gi|308245563|gb|EFO89515.1| hypothetical protein CRE_18161 [Caenorhabditis remanei]
Length = 370
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 50/74 (67%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+TK L + + LK Y I Q +FAQ V R QGT+SDLLRNP+ W K+KSGR+ +RRM+
Sbjct: 217 IDTKPLCGTVWSILKEYGISQTLFAQVVAQRRQGTMSDLLRNPRDWFKMKSGRDVYRRMY 276
Query: 415 KWLQEPEFQRMSAL 428
W++ +R+ L
Sbjct: 277 NWVRMAAHERLEIL 290
>gi|268562928|ref|XP_002646811.1| Hypothetical protein CBG19489 [Caenorhabditis briggsae]
Length = 717
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 48/74 (64%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK L+ I EL IPQ IFA+RV R QGTLS LLR PKPW +K+GR + R++
Sbjct: 536 LDTKILSDSILQELAIKKIPQTIFAKRVANRCQGTLSVLLRKPKPWDCVKTGRGIYVRLY 595
Query: 415 KWLQEPEFQRMSAL 428
W Q PE QRM +
Sbjct: 596 NWYQLPENQRMEIV 609
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 50/76 (65%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+T+EL ++++ E+ +Y I + FA++VL + L +L++PKPW+KL+S R + R++
Sbjct: 177 IDTEELCEKLTQEIHKYYISRQYFAKKVLNKGPKVLRKMLKDPKPWNKLQSARVNYIRIY 236
Query: 415 KWLQEPEFQRMSALRL 430
WL P +R+ L +
Sbjct: 237 NWLNLPLEKRLGILDM 252
>gi|341901473|gb|EGT57408.1| hypothetical protein CAEBREN_24852 [Caenorhabditis brenneri]
Length = 409
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 55/76 (72%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
I+T +L +I+ E+K +SI Q + A++V+CR+QGTLS++L P+ W +L +GR+T+ R+
Sbjct: 252 IDTIDLISKIAQEIKDHSIHQEVLAEKVMCRTQGTLSEMLSKPRQWHELTTGRQTYLRLH 311
Query: 415 KWLQEPEFQRMSALRL 430
WL +P R++ L++
Sbjct: 312 NWLSQPLETRLAILKM 327
>gi|341885490|gb|EGT41425.1| hypothetical protein CAEBREN_19537 [Caenorhabditis brenneri]
Length = 427
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 48/63 (76%)
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 415
+T+E+ + L + I QA+FA+++L RSQGTL DLL+NPKPW++L+ G+ET+ R++
Sbjct: 267 DTREIMNNLKQRLSQIEISQAVFAEKILGRSQGTLCDLLKNPKPWAELRCGKETYTRIYN 326
Query: 416 WLQ 418
WL+
Sbjct: 327 WLR 329
>gi|341890940|gb|EGT46875.1| hypothetical protein CAEBREN_19327 [Caenorhabditis brenneri]
Length = 227
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 52/76 (68%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRR 412
++I+TKEL + I EL +IPQ +FA R++ RS+ TLSDLL PKPW+ +K+ R T+ R
Sbjct: 55 DDIDTKELGKTILTELSSLNIPQEVFAARLVNRSKRTLSDLLSRPKPWNVIKASRTTYVR 114
Query: 413 MWKWLQEPEFQRMSAL 428
M+ W+ P+ +++
Sbjct: 115 MYNWMMLPKETKLAIF 130
>gi|308466097|ref|XP_003095304.1| hypothetical protein CRE_19941 [Caenorhabditis remanei]
gi|308245482|gb|EFO89434.1| hypothetical protein CRE_19941 [Caenorhabditis remanei]
Length = 415
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 54/75 (72%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
++T +L+++I ELK + I Q +FA++VL RSQGT+S+LL++PKPW L +GRET+ +M
Sbjct: 248 LDTNDLSEKILNELKEHQISQTVFAKKVLGRSQGTVSELLKHPKPWELLHTAGRETYHKM 307
Query: 414 WKWLQEPEFQRMSAL 428
WL P +R+ L
Sbjct: 308 NNWLVLPLQERLDKL 322
>gi|308462763|ref|XP_003093662.1| hypothetical protein CRE_29190 [Caenorhabditis remanei]
gi|308249526|gb|EFO93478.1| hypothetical protein CRE_29190 [Caenorhabditis remanei]
Length = 280
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 55/78 (70%), Gaps = 1/78 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E I+T +L I ELK IPQA+FA+ VL R+QGTLSD L+ PKPW+++K SG++ ++
Sbjct: 121 EYIDTCQLCHNIYNELKNAGIPQALFAKCVLGRTQGTLSDTLKKPKPWNEMKPSGQKIYQ 180
Query: 412 RMWKWLQEPEFQRMSALR 429
+M+ WL +RM+ L+
Sbjct: 181 QMYDWLALSVDERMAMLK 198
>gi|260820856|ref|XP_002605750.1| hypothetical protein BRAFLDRAFT_78017 [Branchiostoma floridae]
gi|229291085|gb|EEN61760.1| hypothetical protein BRAFLDRAFT_78017 [Branchiostoma floridae]
Length = 1314
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 50/72 (69%), Gaps = 1/72 (1%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRET 409
+ +I+T +L ++I L R +I Q +F ++VL SQG++SD+L +PKPWSKL + G+E
Sbjct: 554 KFNDIDTMDLTRKIKDTLARLAIGQKVFGEKVLGLSQGSVSDMLAHPKPWSKLTQKGKEP 613
Query: 410 FRRMWKWLQEPE 421
F RM WL+E E
Sbjct: 614 FIRMQLWLEEVE 625
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 48/70 (68%), Gaps = 1/70 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFR 411
+E++T E+A+++ L +I Q +F Q VL SQG++S++L PK WS+L S GRE F
Sbjct: 268 DELDTWEIARQVKDLLLHNNIGQKVFGQHVLGMSQGSVSEILSKPKHWSRLTSKGREPFI 327
Query: 412 RMWKWLQEPE 421
+M+ WLQ+ E
Sbjct: 328 KMYSWLQDKE 337
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T + +R+ L +I Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 705 DLDTYAITKRVKEVLMENNIGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 764
Query: 413 MWKWLQEPE 421
M WL PE
Sbjct: 765 MHLWLSNPE 773
>gi|341879599|gb|EGT35534.1| hypothetical protein CAEBREN_18017 [Caenorhabditis brenneri]
Length = 272
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSK-LKSGRETFRR 412
E K+L ++S E+ I +A+F++R+L RSQGT+S+LL PKPW L+ GR + R
Sbjct: 112 EFGLKDLCHKVSTEIAERGISRALFSERILHRSQGTISELLNWPKPWENILEKGRRPYIR 171
Query: 413 MWKWLQEPEFQRMSAL 428
M+ WL++P RM+ L
Sbjct: 172 MYNWLKQPFEIRMAML 187
>gi|308462791|ref|XP_003093676.1| hypothetical protein CRE_29192 [Caenorhabditis remanei]
gi|308249540|gb|EFO93492.1| hypothetical protein CRE_29192 [Caenorhabditis remanei]
Length = 462
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 50/79 (63%), Gaps = 1/79 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
++T+ + IS ELK I Q+ FA RVL R+QGTLSD L+ PKPW LK G++ + +M
Sbjct: 294 LDTRRVCSDISFELKMCGISQSKFANRVLKRTQGTLSDTLKKPKPWEDLKEVGKQIYIQM 353
Query: 414 WKWLQEPEFQRMSALRLAG 432
+W+ P ++M+ + G
Sbjct: 354 AEWMSLPLDEKMAKVEEEG 372
>gi|402579867|gb|EJW73818.1| hypothetical protein WUBG_15276, partial [Wuchereria bancrofti]
Length = 145
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 49/71 (69%), Gaps = 3/71 (4%)
Query: 317 GPPKIISQQT-ATLTLKTVTPTNA--VVGAVSLGNSGELEEINTKELAQRISAELKRYSI 373
GP I+ +T T+ +P A + A S + +++E+NTK+LAQRISAELKRYSI
Sbjct: 75 GPNDILDTETMETVKRNGSSPLQADTISHAGSPDSGSDMDELNTKDLAQRISAELKRYSI 134
Query: 374 PQAIFAQRVLC 384
PQAIFAQRVLC
Sbjct: 135 PQAIFAQRVLC 145
>gi|195439170|ref|XP_002067504.1| GK16157 [Drosophila willistoni]
gi|194163589|gb|EDW78490.1| GK16157 [Drosophila willistoni]
Length = 2462
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 29/149 (19%)
Query: 272 NFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTL 331
N + AAT + + +IT ++++ P +P HS Q QR
Sbjct: 1596 NEEFAATASPLQRMASIT-NSLITQPPVTPHHS----QPQR------------------- 1631
Query: 332 KTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLS 391
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++S
Sbjct: 1632 ----PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVS 1687
Query: 392 DLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
DLL PKPW L + GRE F RM +L++
Sbjct: 1688 DLLARPKPWHMLTQKGREPFIRMKMFLED 1716
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1158 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1217
Query: 414 WKW 416
W
Sbjct: 1218 HAW 1220
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1908 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1967
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1968 RMQLWLSDANNVERLQLLK 1986
>gi|327287208|ref|XP_003228321.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 1-like
[Anolis carolinensis]
Length = 1662
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL ++I +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1080 QEVDTIELTRQIKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1139
Query: 412 RMWKWL 417
RM WL
Sbjct: 1140 RMQLWL 1145
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 56/83 (67%), Gaps = 3/83 (3%)
Query: 348 NSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SG 406
N+ + E+++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL G
Sbjct: 710 NASDGEDMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRG 769
Query: 407 RETFRRMWKWLQEPEFQRMSALR 429
+E F +M ++L + Q + ALR
Sbjct: 770 KEPFHKMKQFLSDE--QNILALR 790
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1266 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1325
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1326 MQLWLNDP 1333
>gi|221329759|ref|NP_001138174.1| cut, isoform C [Drosophila melanogaster]
gi|220901699|gb|ACL82906.1| cut, isoform C [Drosophila melanogaster]
Length = 2383
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1528 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1587
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1588 RPKPWHMLTQKGREPFIRMKMFLED 1612
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1094 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1153
Query: 414 WKW 416
W
Sbjct: 1154 HAW 1156
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1823 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1882
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1883 RMQLWLSDANNVERLQLLK 1901
>gi|386763990|ref|NP_001245565.1| cut, isoform D [Drosophila melanogaster]
gi|383293264|gb|AFH07279.1| cut, isoform D [Drosophila melanogaster]
Length = 2165
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1310 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1369
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1370 RPKPWHMLTQKGREPFIRMKMFLED 1394
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 876 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 935
Query: 414 WKW 416
W
Sbjct: 936 HAW 938
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1605 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1664
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1665 RMQLWLSDANNVERLQLLK 1683
>gi|195044138|ref|XP_001991761.1| GH11890 [Drosophila grimshawi]
gi|193901519|gb|EDW00386.1| GH11890 [Drosophila grimshawi]
Length = 2253
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1397 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1456
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1457 RPKPWHMLTQKGREPFIRMKMFLED 1481
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 937 QALDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 996
Query: 412 RMWKW 416
+M W
Sbjct: 997 KMHAW 1001
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1706 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1765
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1766 RMQLWLSDANNVERLQLLK 1784
>gi|307207678|gb|EFN85315.1| Homeobox protein cut [Harpegnathos saltator]
Length = 1476
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 81/177 (45%), Gaps = 41/177 (23%)
Query: 244 SVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQH 303
S LG P ++PNE F GAA + + +IT ++++ P T H
Sbjct: 741 SPLGNSILPPAMTPNE------------EFSGAAAASPLQRMASIT-NSLISQPSTPTHH 787
Query: 304 SLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQR 363
+ SNQ P AV+ ++ +NT+++ +R
Sbjct: 788 A--SNQ-------------------------RPLKAVLPPITQQQFDMYNNLNTEDIVKR 820
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
+ +L +YSI Q +F + VL SQG++SDLL PKPW L + GRE F RM +L++
Sbjct: 821 VKEQLSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLED 877
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 402 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 461
Query: 412 RMWKW 416
+M W
Sbjct: 462 KMHGW 466
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1044 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1103
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 1104 RMQLWLSDAHNVDRLQALK 1122
>gi|195480403|ref|XP_002101250.1| GE15727 [Drosophila yakuba]
gi|194188774|gb|EDX02358.1| GE15727 [Drosophila yakuba]
Length = 2261
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1385 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1444
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1445 RPKPWHMLTQKGREPFIRMKMFLED 1469
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 952 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1011
Query: 414 WKW 416
W
Sbjct: 1012 HAW 1014
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1688 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1747
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1748 RMQLWLSDANNVERLQLLK 1766
>gi|195393016|ref|XP_002055150.1| GJ19212 [Drosophila virilis]
gi|194149660|gb|EDW65351.1| GJ19212 [Drosophila virilis]
Length = 2408
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1577 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1636
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1637 RPKPWHMLTQKGREPFIRMKMFLED 1661
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1113 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1172
Query: 414 WKW 416
W
Sbjct: 1173 HAW 1175
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1862 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1921
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1922 RMQLWLSDANNVERLQLLK 1940
>gi|301626252|ref|XP_002942311.1| PREDICTED: homeobox protein cut-like 1-like, partial [Xenopus
(Silurana) tropicalis]
Length = 1437
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 857 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 916
Query: 412 RMWKWL 417
RM WL
Sbjct: 917 RMQLWL 922
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E E+I+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 463 SAGSVSEGEDIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 522
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 523 VRGKEPFHKMKQFLSDE--QNILALR 546
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1040 ELDTYGITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1099
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1100 MQLWLNDP 1107
>gi|195353497|ref|XP_002043241.1| GM17531 [Drosophila sechellia]
gi|194127339|gb|EDW49382.1| GM17531 [Drosophila sechellia]
Length = 2349
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1492 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1551
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1552 RPKPWHMLTQKGREPFIRMKMFLED 1576
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1057 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1116
Query: 414 WKW 416
W
Sbjct: 1117 HAW 1119
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1787 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1846
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1847 RMQLWLSDANNVERLQLLK 1865
>gi|442615409|ref|NP_001259310.1| cut, isoform E [Drosophila melanogaster]
gi|440216510|gb|AGB95155.1| cut, isoform E [Drosophila melanogaster]
Length = 2174
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1319 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1378
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1379 RPKPWHMLTQKGREPFIRMKMFLED 1403
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 885 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 944
Query: 414 WKW 416
W
Sbjct: 945 HAW 947
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1614 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1673
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1674 RMQLWLSDANNVERLQLLK 1692
>gi|195163756|ref|XP_002022715.1| GL14717 [Drosophila persimilis]
gi|194104738|gb|EDW26781.1| GL14717 [Drosophila persimilis]
Length = 1888
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1014 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1073
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1074 RPKPWHMLTQKGREPFIRMKMFLED 1098
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 541 QALDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 600
Query: 412 RMWKW 416
+M W
Sbjct: 601 KMHAW 605
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1318 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1377
Query: 412 RMWKWL 417
RM WL
Sbjct: 1378 RMQLWL 1383
>gi|17864374|ref|NP_524764.1| cut, isoform A [Drosophila melanogaster]
gi|123334|sp|P10180.1|CUT_DROME RecName: Full=Homeobox protein cut
gi|7768|emb|CAA30794.1| cut protein [Drosophila melanogaster]
gi|10728519|gb|AAF46264.2| cut, isoform A [Drosophila melanogaster]
gi|226049|prf||1408257A cut gene
Length = 2175
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1320 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1379
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1380 RPKPWHMLTQKGREPFIRMKMFLED 1404
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 886 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 945
Query: 414 WKW 416
W
Sbjct: 946 HAW 948
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1615 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1674
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1675 RMQLWLSDANNVERLQLLK 1693
>gi|149062987|gb|EDM13310.1| cut-like 1 (Drosophila), isoform CRA_a [Rattus norvegicus]
gi|149062989|gb|EDM13312.1| cut-like 1 (Drosophila), isoform CRA_a [Rattus norvegicus]
Length = 1029
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 450 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 509
Query: 412 RMWKWL 417
RM WL
Sbjct: 510 RMQLWL 515
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 53 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 112
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 113 VRGKEPFHKMKQFLS--DEQNILALR 136
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 634 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 693
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 694 MQLWLNDP 701
>gi|195565610|ref|XP_002106392.1| GD16852 [Drosophila simulans]
gi|194203768|gb|EDX17344.1| GD16852 [Drosophila simulans]
Length = 1734
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 878 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 937
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 938 RPKPWHMLTQKGREPFIRMKMFLED 962
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 443 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 502
Query: 414 WKW 416
W
Sbjct: 503 HAW 505
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1173 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1232
Query: 412 RMWKWL 417
RM WL
Sbjct: 1233 RMQLWL 1238
>gi|351698755|gb|EHB01674.1| Homeobox protein cut-like 1, partial [Heterocephalus glaber]
Length = 1318
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 907 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 966
Query: 412 RMWKWL 417
RM WL
Sbjct: 967 RMQLWL 972
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 534 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 593
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 594 VRGKEPFHKMKQFLSDE--QNILALR 617
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1091 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1150
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1151 MQVWLNDP 1158
>gi|194896882|ref|XP_001978555.1| GG19652 [Drosophila erecta]
gi|190650204|gb|EDV47482.1| GG19652 [Drosophila erecta]
Length = 2414
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1549 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1608
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1609 RPKPWHMLTQKGREPFIRMKMFLED 1633
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1112 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1171
Query: 414 WKW 416
W
Sbjct: 1172 HAW 1174
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1845 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1904
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1905 RMQLWLSDANNVERLQLLK 1923
>gi|293352505|ref|XP_347164.4| PREDICTED: cut-like homeobox 1 isoform 3 [Rattus norvegicus]
Length = 1613
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1034 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1093
Query: 412 RMWKWL 417
RM WL
Sbjct: 1094 RMQLWL 1099
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 637 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 696
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 697 VRGKEPFHKMKQFLSDE--QNILALR 720
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1218 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1277
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1278 MQLWLNDP 1285
>gi|402590|emb|CAA52922.1| Cux homeodomain protein [Mus musculus]
Length = 1332
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 753 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 812
Query: 412 RMWKWL 417
RM WL
Sbjct: 813 RMQLWL 818
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 356 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 415
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 416 VRGKEPFHKMKQFLSDE--QNILALR 439
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 937 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 996
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 997 MQLWLNDP 1004
>gi|344251333|gb|EGW07437.1| Homeobox protein cut-like 2 [Cricetulus griseus]
Length = 1422
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 835 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 894
Query: 413 MWKWL 417
M WL
Sbjct: 895 MQLWL 899
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 987 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1046
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1047 MQLWLSDP 1054
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 467 GAGLEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 526
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 527 GKEPFIKMKQFLSDE--QNVLALR 548
>gi|334327088|ref|XP_003340828.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2-like
[Monodelphis domestica]
Length = 1518
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 921 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 980
Query: 413 MWKWL 417
M WL
Sbjct: 981 MQLWL 985
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G + E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 558 GGPADEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 617
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 618 GKEPFIKMKQFLSDE--QNVLALR 639
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1072 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1131
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1132 MQLWLNDP 1139
>gi|159163346|pdb|1WH6|A Chain A, Solution Structure Of The Second Cut Domain Of Human
Homeobox Protein Cux-2
Length = 101
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%), Gaps = 6/77 (7%)
Query: 347 GNSGELE-----EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWS 401
G+SG+ E E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWS
Sbjct: 4 GSSGQYELYMYREVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWS 63
Query: 402 KL-KSGRETFRRMWKWL 417
KL + GRE F RM WL
Sbjct: 64 KLTQKGREPFIRMQLWL 80
>gi|406035344|ref|NP_001258309.1| homeobox protein cut-like 2 [Rattus norvegicus]
gi|392352540|ref|XP_003751237.1| PREDICTED: cut-like homeobox 2 isoform 2 [Rattus norvegicus]
Length = 1426
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 838 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 897
Query: 413 MWKWL 417
M WL
Sbjct: 898 MQLWL 902
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 991 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1050
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1051 MRLWLSDP 1058
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 483 GAGPEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 542
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 543 GKEPFIKMKQFLSDE--QNVLALR 564
>gi|432875831|ref|XP_004072929.1| PREDICTED: homeobox protein cut-like 2-like [Oryzias latipes]
Length = 1580
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 970 EVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1029
Query: 413 MWKWL 417
M WL
Sbjct: 1030 MQLWL 1034
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1135 ELDTYAITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1194
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1195 MQLWLNDP 1202
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 54/82 (65%), Gaps = 3/82 (3%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGR 407
+G+ ++++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+
Sbjct: 599 AGDDDQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGK 658
Query: 408 ETFRRMWKWLQEPEFQRMSALR 429
E F +M ++L + Q + ALR
Sbjct: 659 EPFIKMKQFLSDE--QNILALR 678
>gi|1705737|sp|P53565.1|CUX1_RAT RecName: Full=Homeobox protein cut-like 1; AltName: Full=CCAAT
displacement protein; Short=CDP; AltName: Full=CDP2;
AltName: Full=Homeobox protein cux-1
gi|532532|gb|AAA21123.1| transcription factor, partial [Rattus norvegicus]
Length = 862
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 283 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 342
Query: 412 RMWKWL 417
RM WL
Sbjct: 343 RMQLWL 348
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 467 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 526
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 527 MQLWLNDP 534
>gi|40789277|ref|NP_031830.2| homeobox protein cut-like 2 [Mus musculus]
gi|341940415|sp|P70298.2|CUX2_MOUSE RecName: Full=Homeobox protein cut-like 2; AltName: Full=Homeobox
protein Cux-2
gi|40675455|gb|AAH65113.1| Cut-like homeobox 2 [Mus musculus]
Length = 1426
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 836 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 895
Query: 413 MWKWL 417
M WL
Sbjct: 896 MQLWL 900
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 991 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1050
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1051 MQLWLSDP 1058
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 483 GAGPEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 542
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 543 GKEPFIKMKQFLSDE--QNVLALR 564
>gi|1518932|gb|AAC52762.1| Cux-2 [Mus musculus]
Length = 1426
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 836 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 895
Query: 413 MWKWL 417
M WL
Sbjct: 896 MQLWL 900
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 991 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1050
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1051 MQLWLSDP 1058
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 483 GAGPEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 542
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 543 GKEPFIKMKQFLSDE--QNVLALR 564
>gi|410922613|ref|XP_003974777.1| PREDICTED: homeobox protein cut-like 2-like [Takifugu rubripes]
Length = 1499
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 904 EVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 963
Query: 413 MWKWL 417
M WL
Sbjct: 964 MQLWL 968
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 66/116 (56%), Gaps = 14/116 (12%)
Query: 326 TATLTLKTVTPTNAVVGAVSLG-----------NSGELEEINTKELAQRISAELKRYSIP 374
T+ + L + +A VG++ G +G+ ++++T E+A ++ +L +++I
Sbjct: 504 TSKVALMSGAHVSAGVGSMDAGLPSDQSESGSSTAGDEDQVDTAEIAFQVKEQLLKHNIG 563
Query: 375 QAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
Q +F VL SQG++S++L PKPW KL G+E F +M ++L + Q + ALR
Sbjct: 564 QRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKMKQFLSDE--QNILALR 617
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1068 ELDTFTITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1127
Query: 413 MWKWLQEPE-FQRMSALR 429
M WL +P +++ A++
Sbjct: 1128 MQLWLNDPHNVEKLRAMK 1145
>gi|431898197|gb|ELK06892.1| Homeobox protein cut-like 1 [Pteropus alecto]
Length = 1035
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 910 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 969
Query: 412 RMWKWL 417
RM WL
Sbjct: 970 RMQLWL 975
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 530 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 589
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 590 VRGKEPFHKMKQFLSDE--QNILALR 613
>gi|350581477|ref|XP_003481046.1| PREDICTED: homeobox protein cut-like 1-like, partial [Sus scrofa]
Length = 1150
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 855 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 914
Query: 412 RMWKWL 417
RM WL
Sbjct: 915 RMQLWL 920
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 451 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 510
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 511 VRGKEPFHKMKQFLSDE--QNILALR 534
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1040 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1099
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1100 MQLWLNDP 1107
>gi|410976628|ref|XP_003994719.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2 [Felis
catus]
Length = 1395
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 804 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 863
Query: 413 MWKWL 417
M WL
Sbjct: 864 MQLWL 868
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 955 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1014
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1015 MQLWLNDP 1022
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 553 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 612
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 613 FIKMKQFLSDE--QNVLALR 630
>gi|395513733|ref|XP_003761077.1| PREDICTED: homeobox protein cut-like 2 [Sarcophilus harrisii]
Length = 1503
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 901 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 960
Query: 413 MWKWL 417
M WL
Sbjct: 961 MQLWL 965
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 59/98 (60%), Gaps = 3/98 (3%)
Query: 333 TVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSD 392
T T T A G G + + E+++T E+A ++ +L +++I Q +F VL SQG++S+
Sbjct: 524 TDTTTPAPPGDQVEGGTVDEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSE 583
Query: 393 LLRNPKPWSKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
+L PKPW KL G+E F +M ++L + Q + ALR
Sbjct: 584 ILARPKPWRKLTVKGKEPFIKMKQFLSDE--QNVLALR 619
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1056 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1115
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1116 MQLWLNDP 1123
>gi|392332580|ref|XP_003752628.1| PREDICTED: cut-like homeobox 1 isoform 3 [Rattus norvegicus]
gi|392352438|ref|XP_003751208.1| PREDICTED: cut-like homeobox 1 isoform 1 [Rattus norvegicus]
Length = 1528
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 949 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1008
Query: 412 RMWKWL 417
RM WL
Sbjct: 1009 RMQLWL 1014
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1133 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1192
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1193 MQLWLNDP 1200
>gi|449480037|ref|XP_002192937.2| PREDICTED: cut-like homeobox 1-like [Taeniopygia guttata]
Length = 1541
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 956 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1015
Query: 412 RMWKWL 417
RM WL
Sbjct: 1016 RMQLWL 1021
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 562 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 621
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 622 VRGKEPFHKMKQFLSDE--QNILALR 645
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1140 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1199
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1200 MQLWLNDP 1207
>gi|198469902|ref|XP_002134446.1| GA23816 [Drosophila pseudoobscura pseudoobscura]
gi|198147090|gb|EDY73073.1| GA23816 [Drosophila pseudoobscura pseudoobscura]
Length = 2117
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1243 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1302
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1303 RPKPWHMLTQKGREPFIRMKMFLED 1327
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 772 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 831
Query: 414 WKW 416
W
Sbjct: 832 HAW 834
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1547 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1606
Query: 412 RMWKWL 417
RM WL
Sbjct: 1607 RMQLWL 1612
>gi|348522189|ref|XP_003448608.1| PREDICTED: homeobox protein cut-like 2 [Oreochromis niloticus]
Length = 1551
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 947 EVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1006
Query: 413 MWKWL 417
M WL
Sbjct: 1007 MQLWL 1011
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1111 ELDTYAITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1170
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1171 MQLWLNDP 1178
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 54/82 (65%), Gaps = 3/82 (3%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGR 407
+G+ ++++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+
Sbjct: 575 AGDDDQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGK 634
Query: 408 ETFRRMWKWLQEPEFQRMSALR 429
E F +M ++L + Q + ALR
Sbjct: 635 EPFIKMKQFLSDE--QNILALR 654
>gi|109497079|ref|XP_001070410.1| PREDICTED: cut-like homeobox 1 isoform 1 [Rattus norvegicus]
gi|392352440|ref|XP_003751209.1| PREDICTED: cut-like homeobox 1 isoform 2 [Rattus norvegicus]
Length = 1426
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 847 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 906
Query: 412 RMWKWL 417
RM WL
Sbjct: 907 RMQLWL 912
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 450 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 509
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 510 VRGKEPFHKMKQFLSDE--QNILALR 533
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1031 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1090
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1091 MQLWLNDP 1098
>gi|76363508|sp|P53564.3|CUX1_MOUSE RecName: Full=Homeobox protein cut-like 1; AltName: Full=CCAAT
displacement protein; Short=CDP; AltName: Full=Homeobox
protein cux-1
Length = 1515
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 936 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 995
Query: 412 RMWKWL 417
RM WL
Sbjct: 996 RMQLWL 1001
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 539 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 598
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 599 VRGKEPFHKMKQFLSDE--QNILALR 622
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1120 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1179
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1180 MQLWLNDP 1187
>gi|397471336|ref|XP_003807252.1| PREDICTED: homeobox protein cut-like 1, partial [Pan paniscus]
Length = 1315
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 966 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1025
Query: 412 RMWKWL 417
RM WL
Sbjct: 1026 RMQLWL 1031
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 566 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 625
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 626 VRGKEPFHKMKQFLSDE--QNILALR 649
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1150 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1209
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1210 MQLWLNDP 1217
>gi|297288071|ref|XP_001114534.2| PREDICTED: cut-like 1 [Macaca mulatta]
Length = 1514
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 941 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1000
Query: 412 RMWKWL 417
RM WL
Sbjct: 1001 RMQLWL 1006
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1125 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1184
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1185 MQLWLNDP 1192
>gi|148277064|ref|NP_853530.2| protein CASP isoform a [Homo sapiens]
gi|296439379|sp|P39880.3|CUX1_HUMAN RecName: Full=Homeobox protein cut-like 1; AltName: Full=CCAAT
displacement protein; Short=CDP; AltName: Full=Homeobox
protein cux-1
Length = 1505
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 941 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1000
Query: 412 RMWKWL 417
RM WL
Sbjct: 1001 RMQLWL 1006
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1125 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1184
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1185 MQLWLNDP 1192
>gi|194763152|ref|XP_001963697.1| GF21154 [Drosophila ananassae]
gi|190618622|gb|EDV34146.1| GF21154 [Drosophila ananassae]
Length = 2273
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
PT AV+ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 1431 PTKAVLPPITQQQFDMFNNLNTEDIVRRVKEALSQYSISQRLFGESVLGLSQGSVSDLLA 1490
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1491 RPKPWHMLTQKGREPFIRMKMFLED 1515
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 1005 LDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 1064
Query: 414 WKW 416
W
Sbjct: 1065 HAW 1067
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1739 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1798
Query: 412 RMWKWL 417
RM WL
Sbjct: 1799 RMQLWL 1804
>gi|60360228|dbj|BAD90358.1| mKIAA4047 protein [Mus musculus]
Length = 1506
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 927 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 986
Query: 412 RMWKWL 417
RM WL
Sbjct: 987 RMQLWL 992
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1111 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1170
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1171 MQLWLNDP 1178
>gi|2198820|gb|AAD12485.1| Cux/CDP(1B1) [Mus musculus]
Length = 1293
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 714 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 773
Query: 412 RMWKWL 417
RM WL
Sbjct: 774 RMQLWL 779
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 317 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 376
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 377 VRGKEPFHKMKQFLSDE--QNILALR 400
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 898 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 957
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 958 MQLWLNDP 965
>gi|359319634|ref|XP_546939.4| PREDICTED: cut-like homeobox 1 [Canis lupus familiaris]
Length = 1514
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 929 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 988
Query: 412 RMWKWL 417
RM WL
Sbjct: 989 RMQLWL 994
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 533 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 592
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 593 VRGKEPFHKMKQFLSDE--QNILALR 616
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1113 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1172
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1173 MQLWLNDP 1180
>gi|334324976|ref|XP_001378627.2| PREDICTED: homeobox protein cut-like 1-like [Monodelphis domestica]
Length = 1391
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 856 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 915
Query: 412 RMWKWL 417
RM WL
Sbjct: 916 RMQLWL 921
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 461 GSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVR 520
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 521 GKEPFHKMKQFLSDE--QNILALR 542
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1040 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1099
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1100 MQLWLNDP 1107
>gi|301754519|ref|XP_002913157.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2-like
[Ailuropoda melanoleuca]
Length = 1505
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 914 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 973
Query: 413 MWKWL 417
M WL
Sbjct: 974 MQLWL 978
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1065 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1124
Query: 413 MWKWLQEPEFQRMSALR 429
M WL +P Q + LR
Sbjct: 1125 MQLWLNDP--QNVEKLR 1139
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 561 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 620
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 621 FIKMKQFLSDE--QNVLALR 638
>gi|14346042|gb|AAK59986.1| CCAAT displacement protein CDP [Mus musculus]
Length = 1517
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 938 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 997
Query: 412 RMWKWL 417
RM WL
Sbjct: 998 RMQLWL 1003
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1122 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1181
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1182 MQLWLNDP 1189
>gi|326929703|ref|XP_003210996.1| PREDICTED: homeobox protein cut-like 2-like [Meleagris gallopavo]
Length = 1529
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 986 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1045
Query: 413 MWKWL 417
M WL
Sbjct: 1046 MQLWL 1050
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 55/84 (65%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G+S + E+++T ++A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 632 GSSADEEQLDTADIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 691
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 692 GKEPFIKMKQFLSDE--QNVLALR 713
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1134 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1193
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1194 MQLWLNDP 1201
>gi|440908427|gb|ELR58441.1| Homeobox protein cut-like 1, partial [Bos grunniens mutus]
Length = 1332
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 927 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 986
Query: 412 RMWKWL 417
RM WL
Sbjct: 987 RMQLWL 992
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 532 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 591
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 592 VRGKEPFHKMKQFLSDE--QNILALR 615
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1111 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1170
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1171 MQLWLNDP 1178
>gi|354477598|ref|XP_003501006.1| PREDICTED: homeobox protein cut-like 1 [Cricetulus griseus]
Length = 1688
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 946 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1005
Query: 412 RMWKWL 417
RM WL
Sbjct: 1006 RMQLWL 1011
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 549 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 608
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 609 VRGKEPFHKMKQFLSDE--QNILALR 632
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1130 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1189
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1190 MQLWLNDP 1197
>gi|395842972|ref|XP_003794280.1| PREDICTED: homeobox protein cut-like 1 [Otolemur garnettii]
Length = 1627
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1046 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1105
Query: 412 RMWKWL 417
RM WL
Sbjct: 1106 RMQLWL 1111
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 655 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 714
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 715 VRGKEPFHKMKQFLSDE--QNILALR 738
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1230 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1289
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1290 MQLWLNDP 1297
>gi|281337674|gb|EFB13258.1| hypothetical protein PANDA_015425 [Ailuropoda melanoleuca]
Length = 1113
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 847 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 906
Query: 412 RMWKWL 417
RM WL
Sbjct: 907 RMQLWL 912
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 58/86 (67%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G++ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 452 SAGSASEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 511
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 512 VRGKEPFHKMKQFLSDE--QNILALR 535
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1031 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1090
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1091 MQLWLNDP 1098
>gi|363739947|ref|XP_415167.3| PREDICTED: cut-like homeobox 2 [Gallus gallus]
Length = 1550
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 1007 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1066
Query: 413 MWKWL 417
M WL
Sbjct: 1067 MQLWL 1071
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 68/118 (57%), Gaps = 6/118 (5%)
Query: 316 PGPPKIISQQTATLTLKTVTPTNA---VVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
P P +++++ TV PT + S G+S + E+++T ++A ++ +L +++
Sbjct: 619 PFPTAFFGAKSSSVPPNTVPPTTSPPDEPAEGSGGSSADEEQLDTADIAFQVKEQLLKHN 678
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
I Q +F VL SQG++S++L PKPW KL G+E F +M ++L + Q + ALR
Sbjct: 679 IGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKMKQFLSDE--QNVLALR 734
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1155 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1214
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1215 MQLWLNDP 1222
>gi|321400107|ref|NP_001189472.1| protein CASP isoform d [Homo sapiens]
gi|42793995|gb|AAH66592.1| CUX1 protein [Homo sapiens]
gi|119570613|gb|EAW50228.1| cut-like 1, CCAAT displacement protein (Drosophila), isoform CRA_b
[Homo sapiens]
Length = 1516
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 952 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1011
Query: 412 RMWKWL 417
RM WL
Sbjct: 1012 RMQLWL 1017
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1136 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1195
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1196 MQLWLNDP 1203
>gi|300300|gb|AAB26579.1| CCAAT displacement protein, CDP [human, Peptide, 1505 aa]
Length = 1505
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 941 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1000
Query: 412 RMWKWL 417
RM WL
Sbjct: 1001 RMQLWL 1006
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1125 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1184
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1185 MQLWLNDP 1192
>gi|344297433|ref|XP_003420403.1| PREDICTED: homeobox protein cut-like 2 [Loxodonta africana]
Length = 1521
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 930 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 989
Query: 413 MWKWL 417
M WL
Sbjct: 990 MQLWL 994
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 57/95 (60%), Gaps = 10/95 (10%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV GA E E+++T E+A ++ +L +++I Q +F VL SQG++S++L
Sbjct: 563 PVVAVAGA-------EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILA 615
Query: 396 NPKPWSKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
PKPW KL G+E F +M ++L + Q + ALR
Sbjct: 616 RPKPWRKLTVKGKEPFIKMKQFLSDE--QNVLALR 648
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1081 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1140
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1141 MQLWLNDP 1148
>gi|359079821|ref|XP_003587888.1| PREDICTED: homeobox protein cut-like 1-like [Bos taurus]
gi|296473075|tpg|DAA15190.1| TPA: cut-like homeobox 1 [Bos taurus]
Length = 1532
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 946 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1005
Query: 412 RMWKWL 417
RM WL
Sbjct: 1006 RMQLWL 1011
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 550 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 609
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 610 VRGKEPFHKMKQFLSDE--QNILALR 633
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1130 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1189
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1190 MQLWLNDP 1197
>gi|345790865|ref|XP_543393.3| PREDICTED: cut-like homeobox 2 [Canis lupus familiaris]
Length = 1504
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 913 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 972
Query: 413 MWKWL 417
M WL
Sbjct: 973 MQLWL 977
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1064 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1123
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1124 MQLWLNDP 1131
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 559 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 618
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 619 FIKMKQFLSDE--QNVLALR 636
>gi|440904726|gb|ELR55197.1| Homeobox protein cut-like 2, partial [Bos grunniens mutus]
Length = 1482
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 893 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 952
Query: 413 MWKWL 417
M WL
Sbjct: 953 MQLWL 957
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1045 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1104
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1105 MQLWLNDP 1112
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 540 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 599
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 600 FIKMKQFLSDE--QNVLALR 617
>gi|355786534|gb|EHH66717.1| hypothetical protein EGM_03761, partial [Macaca fascicularis]
Length = 1342
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 751 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 810
Query: 413 MWKWL 417
M WL
Sbjct: 811 MQLWL 815
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 902 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 961
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 962 MQLWLNDP 969
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 531 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 590
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 591 MKQFLSDE--QNVLALR 605
>gi|410059445|ref|XP_003949274.1| PREDICTED: cut-like homeobox 1 [Pan troglodytes]
Length = 1515
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 952 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1011
Query: 412 RMWKWL 417
RM WL
Sbjct: 1012 RMQLWL 1017
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1136 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1195
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1196 MQLWLNDP 1203
>gi|148687362|gb|EDL19309.1| cut-like 1 (Drosophila), isoform CRA_b [Mus musculus]
Length = 1519
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 940 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 999
Query: 412 RMWKWL 417
RM WL
Sbjct: 1000 RMQLWL 1005
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 543 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 602
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 603 VRGKEPFHKMKQFLSDE--QNILALR 626
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1124 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1183
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1184 MQLWLNDP 1191
>gi|402863115|ref|XP_003895877.1| PREDICTED: homeobox protein cut-like 1-like [Papio anubis]
Length = 1523
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 952 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1011
Query: 412 RMWKWL 417
RM WL
Sbjct: 1012 RMQLWL 1017
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1136 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1195
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1196 MQLWLNDP 1203
>gi|351694717|gb|EHA97635.1| Homeobox protein cut-like 2 [Heterocephalus glaber]
Length = 1379
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 788 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 847
Query: 413 MWKWL 417
M WL
Sbjct: 848 MQLWL 852
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 939 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 998
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 999 MQLWLSDP 1006
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 574 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 633
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 634 FIKMKQFLSDE--QNVLALR 651
>gi|296192315|ref|XP_002744007.1| PREDICTED: homeobox protein cut-like 1 [Callithrix jacchus]
Length = 1531
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 955 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1014
Query: 412 RMWKWL 417
RM WL
Sbjct: 1015 RMQLWL 1020
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E E+++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 552 SAGSVSEGEDMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 611
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 612 VRGKEPFHKMKQFLSDE--QNILALR 635
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1139 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1198
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1199 MQLWLNDP 1206
>gi|355747563|gb|EHH52060.1| hypothetical protein EGM_12428, partial [Macaca fascicularis]
Length = 1306
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 932 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 991
Query: 412 RMWKWL 417
RM WL
Sbjct: 992 RMQLWL 997
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 532 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 591
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 592 VRGKEPFHKMKQFLSDE--QNILALR 615
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1116 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1175
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1176 MQLWLNDP 1183
>gi|348554341|ref|XP_003462984.1| PREDICTED: homeobox protein cut-like 2-like [Cavia porcellus]
Length = 1493
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 903 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 962
Query: 413 MWKWL 417
M WL
Sbjct: 963 MQLWL 967
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 55/90 (61%), Gaps = 3/90 (3%)
Query: 341 VGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
VG + E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW
Sbjct: 543 VGGSTAAAGAEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPW 602
Query: 401 SKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
KL G+E F +M ++L + Q + ALR
Sbjct: 603 RKLTVKGKEPFIKMKQFLSDE--QNVLALR 630
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1054 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1113
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1114 MQLWLNDP 1121
>gi|60688551|gb|AAH90847.1| Cut-like homeobox 1 [Mus musculus]
Length = 1426
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 847 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 906
Query: 412 RMWKWL 417
RM WL
Sbjct: 907 RMQLWL 912
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 450 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 509
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 510 VRGKEPFHKMKQFLSDE--QNILALR 533
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1031 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1090
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1091 MQLWLNDP 1098
>gi|395846701|ref|XP_003796037.1| PREDICTED: homeobox protein cut-like 2 [Otolemur garnettii]
Length = 1494
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 897 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 956
Query: 413 MWKWL 417
M WL
Sbjct: 957 MQLWL 961
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 57/90 (63%), Gaps = 4/90 (4%)
Query: 342 GAVSLGNSG-ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPW 400
G S+ +G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW
Sbjct: 539 GGGSMAGTGTEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPW 598
Query: 401 SKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
KL G+E F +M ++L + Q + ALR
Sbjct: 599 RKLTVKGKEPFIKMKQFLSDE--QNVLALR 626
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1052 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1111
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1112 MQLWLNDP 1119
>gi|119917099|ref|XP_601985.3| PREDICTED: cut-like homeobox 1 [Bos taurus]
Length = 1532
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 946 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1005
Query: 412 RMWKWL 417
RM WL
Sbjct: 1006 RMQLWL 1011
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 550 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 609
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 610 VRGKEPFHKMKQFLSDE--QNILALR 633
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1130 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1189
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1190 MQLWLNDP 1197
>gi|118100079|ref|XP_425393.2| PREDICTED: cut-like homeobox 1 [Gallus gallus]
Length = 1673
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1080 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1139
Query: 412 RMWKWL 417
RM WL
Sbjct: 1140 RMQLWL 1145
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 686 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 745
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 746 VRGKEPFHKMKQFLSDE--QNILALR 769
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1264 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1323
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1324 MQLWLNDP 1331
>gi|110835729|ref|NP_034116.3| protein CASP isoform a [Mus musculus]
Length = 1426
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 847 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 906
Query: 412 RMWKWL 417
RM WL
Sbjct: 907 RMQLWL 912
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 450 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 509
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 510 VRGKEPFHKMKQFLSDE--QNILALR 533
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1031 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1090
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1091 MQLWLNDP 1098
>gi|387539868|gb|AFJ70561.1| homeobox protein cut-like 2 [Macaca mulatta]
Length = 1486
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 895 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 954
Query: 413 MWKWL 417
M WL
Sbjct: 955 MQLWL 959
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1046 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1105
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1106 MQLWLNDP 1113
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 552 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 611
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 612 MKQFLSDE--QNVLALR 626
>gi|296212937|ref|XP_002807189.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2
[Callithrix jacchus]
Length = 1496
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 906 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 965
Query: 413 MWKWL 417
M WL
Sbjct: 966 MQLWL 970
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1057 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1116
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1117 MQLWLNDP 1124
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 564 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 623
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 624 MKQFLSDE--QNVLALR 638
>gi|355560534|gb|EHH17220.1| hypothetical protein EGK_13565, partial [Macaca mulatta]
Length = 1303
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 932 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 991
Query: 412 RMWKWL 417
RM WL
Sbjct: 992 RMQLWL 997
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 532 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 591
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 592 VRGKEPFHKMKQFLSDE--QNILALR 615
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1116 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1175
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1176 MQLWLNDP 1183
>gi|402887673|ref|XP_003907212.1| PREDICTED: homeobox protein cut-like 2 [Papio anubis]
Length = 1486
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 895 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 954
Query: 413 MWKWL 417
M WL
Sbjct: 955 MQLWL 959
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1046 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1105
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1106 MQLWLNDP 1113
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 549 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 608
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 609 FIKMKQFLSDE--QNVLALR 626
>gi|354472562|ref|XP_003498507.1| PREDICTED: homeobox protein cut-like 2-like [Cricetulus griseus]
Length = 1592
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 1005 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1064
Query: 413 MWKWL 417
M WL
Sbjct: 1065 MQLWL 1069
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1157 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1216
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1217 MQLWLSDP 1224
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 648 GAGLEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 707
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 708 GKEPFIKMKQFLSDE--QNVLALR 729
>gi|431912199|gb|ELK14337.1| Homeobox protein cut-like 2 [Pteropus alecto]
Length = 1305
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 714 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 773
Query: 413 MWKWL 417
M WL
Sbjct: 774 MQLWL 778
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T L +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 865 ELDTYSLTKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 924
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 925 MQLWLSDP 932
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 58/91 (63%), Gaps = 4/91 (4%)
Query: 341 VGAVSLGNSG-ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKP 399
VG+ + +G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKP
Sbjct: 352 VGSTAGAGTGTEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKP 411
Query: 400 WSKLK-SGRETFRRMWKWLQEPEFQRMSALR 429
W KL G+E F +M ++L + Q + ALR
Sbjct: 412 WRKLTVKGKEPFIKMKQFLSDE--QNVLALR 440
>gi|350592484|ref|XP_001927249.3| PREDICTED: homeobox protein cut-like 2-like [Sus scrofa]
Length = 1671
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 1082 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1141
Query: 413 MWKWL 417
M WL
Sbjct: 1142 MQLWL 1146
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1234 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1293
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1294 MQLWLNDP 1301
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 726 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 785
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 786 MKQFLSDE--QNVLALR 800
>gi|119618354|gb|EAW97948.1| hCG1812168 [Homo sapiens]
Length = 783
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 192 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 251
Query: 413 MWKWL 417
M WL
Sbjct: 252 MQLWL 256
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 343 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 402
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 403 MQLWLNDP 410
>gi|426247735|ref|XP_004017632.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2 [Ovis
aries]
Length = 1282
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 781 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 840
Query: 413 MWKWL 417
M WL
Sbjct: 841 MQLWL 845
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 933 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 992
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 993 MQLWLNDP 1000
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F
Sbjct: 546 EQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFI 605
Query: 412 RMWKWLQEPEFQRMSALR 429
+M +L + Q + ALR
Sbjct: 606 KMKHFLSDE--QNVLALR 621
>gi|403286072|ref|XP_003934331.1| PREDICTED: homeobox protein cut-like 1 [Saimiri boliviensis
boliviensis]
Length = 1328
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 947 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1006
Query: 412 RMWKWL 417
RM WL
Sbjct: 1007 RMQLWL 1012
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E E+++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 544 SAGSVSEGEDMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 603
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 604 VRGKEPFHKMKQFLSDE--QNILALR 627
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1131 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1190
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1191 MQLWLNDP 1198
>gi|14133201|dbj|BAA22962.2| KIAA0293 [Homo sapiens]
Length = 1505
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 914 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 973
Query: 413 MWKWL 417
M WL
Sbjct: 974 MQLWL 978
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1065 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1124
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1125 MQLWLNDP 1132
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 571 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 630
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 631 MKQFLSDE--QNVLALR 645
>gi|153217467|gb|AAI51246.1| Cut-like homeobox 2 [Homo sapiens]
gi|168267284|dbj|BAG09698.1| homeobox protein cut-like 2 [synthetic construct]
Length = 1486
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 895 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 954
Query: 413 MWKWL 417
M WL
Sbjct: 955 MQLWL 959
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1046 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1105
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1106 MQLWLNDP 1113
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 552 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 611
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 612 MKQFLSDE--QNVLALR 626
>gi|326669835|ref|XP_693902.5| PREDICTED: cut-like [Danio rerio]
Length = 1452
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 901 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 960
Query: 413 MWKWL 417
M WL
Sbjct: 961 MQLWL 965
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1060 ELDTYAITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1119
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1120 MQLWLNDP 1127
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 3/76 (3%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +M
Sbjct: 550 LDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKM 609
Query: 414 WKWLQEPEFQRMSALR 429
++L + Q + ALR
Sbjct: 610 KQFLSDE--QNILALR 623
>gi|116517292|ref|NP_056082.2| homeobox protein cut-like 2 [Homo sapiens]
gi|313104302|sp|O14529.4|CUX2_HUMAN RecName: Full=Homeobox protein cut-like 2; AltName: Full=Homeobox
protein cux-2
gi|147898013|gb|AAI40441.1| Cut-like homeobox 2 [synthetic construct]
Length = 1486
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 895 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 954
Query: 413 MWKWL 417
M WL
Sbjct: 955 MQLWL 959
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1046 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1105
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1106 MQLWLNDP 1113
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 552 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 611
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 612 MKQFLSDE--QNVLALR 626
>gi|296478572|tpg|DAA20687.1| TPA: cut-like homeobox 2 [Bos taurus]
Length = 1502
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 913 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 972
Query: 413 MWKWL 417
M WL
Sbjct: 973 MQLWL 977
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1065 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1124
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1125 MQLWLNDP 1132
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 560 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 619
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 620 FIKMKQFLSDE--QNVLALR 637
>gi|300797293|ref|NP_001179526.1| homeobox protein cut-like 2 [Bos taurus]
Length = 1502
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 913 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 972
Query: 413 MWKWL 417
M WL
Sbjct: 973 MQLWL 977
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1065 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1124
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1125 MQLWLNDP 1132
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 560 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 619
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 620 FIKMKQFLSDE--QNVLALR 637
>gi|301780872|ref|XP_002925852.1| PREDICTED: homeobox protein cut-like 1-like [Ailuropoda melanoleuca]
Length = 1292
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 936 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 995
Query: 412 RMWKWL 417
RM WL
Sbjct: 996 RMQLWL 1001
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 58/86 (67%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G++ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 541 SAGSASEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 600
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 601 VRGKEPFHKMKQFLSDE--QNILALR 624
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1120 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1179
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1180 MQLWLNDP 1187
>gi|444726036|gb|ELW66585.1| Homeobox protein cut-like 2 [Tupaia chinensis]
Length = 1629
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 1039 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1098
Query: 413 MWKWL 417
M WL
Sbjct: 1099 MQLWL 1103
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1190 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1249
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1250 MQLWLNDP 1257
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 614 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 673
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 674 FIKMKQFLSDE--QNVLALR 691
>gi|1524390|gb|AAC52775.1| Cux-1, partial [Mus musculus]
Length = 754
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 175 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 234
Query: 412 RMWKWL 417
RM WL
Sbjct: 235 RMQLWL 240
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 359 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 418
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 419 MQLWLNDP 426
>gi|426374158|ref|XP_004053947.1| PREDICTED: homeobox protein cut-like 2 [Gorilla gorilla gorilla]
Length = 1367
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 514 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 573
Query: 413 MWKWL 417
M WL
Sbjct: 574 MQLWL 578
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 927 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 986
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 987 MQLWLNDP 994
>gi|380795679|gb|AFE69715.1| homeobox protein cut-like 2, partial [Macaca mulatta]
Length = 715
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 124 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 183
Query: 413 MWKWL 417
M WL
Sbjct: 184 MQLWL 188
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 275 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 334
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 335 MQLWLNDP 342
>gi|344289761|ref|XP_003416609.1| PREDICTED: homeobox protein cut-like 1 [Loxodonta africana]
Length = 1330
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 968 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1027
Query: 412 RMWKWL 417
RM WL
Sbjct: 1028 RMQLWL 1033
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 58/86 (67%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G++ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 573 SAGSTSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 632
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 633 VRGKEPFHKMKQFLSDE--QNILALR 656
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1152 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1211
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1212 MQLWLNDP 1219
>gi|449476826|ref|XP_004176478.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2
[Taeniopygia guttata]
Length = 1184
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 835 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 894
Query: 413 MWKWL 417
M WL
Sbjct: 895 MQLWL 899
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 56/86 (65%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+S E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 474 SAGSSSEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWHKLT 533
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 534 VKGKEPFIKMKQFLSDE--QNVLALR 557
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 369 KRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEP----EFQ 423
K +++ Q +F + +L +QG++S LL PKPW KL GRE F RM WL +P + +
Sbjct: 986 KSWALGQRLFGENILGLTQGSVSHLLSRPKPWHKLSLKGREPFVRMXLWLNDPHNVEKLR 1045
Query: 424 RMSALR 429
M LR
Sbjct: 1046 DMKKLR 1051
>gi|326931234|ref|XP_003211738.1| PREDICTED: homeobox protein cut-like 1-like, partial [Meleagris
gallopavo]
Length = 1403
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 888 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 947
Query: 412 RMWKWL 417
RM WL
Sbjct: 948 RMQLWL 953
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 494 SAGSISEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 553
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 554 VRGKEPFHKMKQFLSDE--QNILALR 577
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1072 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1131
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1132 MQLWLNDP 1139
>gi|194218881|ref|XP_001916140.1| PREDICTED: LOW QUALITY PROTEIN: cut-like homeobox 1 [Equus caballus]
Length = 1327
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1022 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1081
Query: 412 RMWKWL 417
RM WL
Sbjct: 1082 RMQLWL 1087
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EEI+T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 629 SAGSVSEGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 688
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 689 VRGKEPFHKMKQFLSDE--QNILALR 712
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1206 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1265
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1266 MQLWLNDP 1273
>gi|395536492|ref|XP_003770249.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 1
[Sarcophilus harrisii]
Length = 1452
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 888 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 947
Query: 412 RMWKWL 417
RM WL
Sbjct: 948 RMQLWL 953
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 345 SLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK 404
S G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 581 SAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLT 640
Query: 405 -SGRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 641 VRGKEPFHKMKQFLSDE--QNILALR 664
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1072 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1131
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1132 MQLWLNDP 1139
>gi|729093|sp|P39881.1|CUX1_CANFA RecName: Full=Homeobox protein cut-like 1; AltName: Full=CCAAT
displacement protein; Short=CDP; AltName: Full=Clox-1;
AltName: Full=Homeobox protein Clox; AltName:
Full=Homeobox protein cux-1
gi|2202|emb|CAA48782.1| Clox [Canis sp.]
Length = 975
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 381 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 440
Query: 412 RMWKWL 417
RM WL
Sbjct: 441 RMQLWL 446
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 565 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 624
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 625 MQLWLNDP 632
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 46/70 (65%), Gaps = 3/70 (4%)
Query: 361 AQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQE 419
++++ +L +++I Q IF VL SQG++S++L PKPW+KL G+E F +M ++L
Sbjct: 1 SRQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQFLS- 59
Query: 420 PEFQRMSALR 429
+ Q + ALR
Sbjct: 60 -DEQNILALR 68
>gi|332264634|ref|XP_003281340.1| PREDICTED: homeobox protein cut-like 2 [Nomascus leucogenys]
Length = 1540
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 950 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1009
Query: 413 MWKWL 417
M WL
Sbjct: 1010 MQLWL 1014
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1101 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1160
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1161 MQLWLNDP 1168
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 607 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 666
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 667 MKQFLSDE--QNVLALR 681
>gi|397525512|ref|XP_003832709.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like 2 [Pan
paniscus]
Length = 1382
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 791 EVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 850
Query: 413 MWKWL 417
M WL
Sbjct: 851 MQLWL 855
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 942 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1001
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1002 MQLWLNDP 1009
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 609 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 668
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 669 FIKMKQFLSDE--QNVLALR 686
>gi|328711839|ref|XP_003244655.1| PREDICTED: hypothetical protein LOC100161988 isoform 2
[Acyrthosiphon pisum]
Length = 1375
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ + +NT+E+ +++ +L ++SI Q +F + VL SQG++SDLL
Sbjct: 722 PLKAVLPPITQQQFDQFNNLNTEEIVRKVKEQLSQFSISQRLFGESVLGLSQGSVSDLLA 781
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 782 RPKPWHMLTQKGREPFIRMKMFLED 806
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L +++ Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 350 QNLDTLHIARRVRELLSIHNVGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 409
Query: 412 RMWKW 416
+M W
Sbjct: 410 KMHAW 414
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 986 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1045
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + +R+ A++
Sbjct: 1046 RMQLWLSDQHNVERLQAIK 1064
>gi|339243803|ref|XP_003377827.1| putative CUT domain protein [Trichinella spiralis]
gi|316973322|gb|EFV56928.1| putative CUT domain protein [Trichinella spiralis]
Length = 1276
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 82/171 (47%), Gaps = 23/171 (13%)
Query: 255 LSPNEGSLHSSPTTLVTNFQGAATIPTM--NGITTITPHTVVVSPQTSPQHSLISNQLQR 312
L ++GSL L+ + Q T+PT+ + + + ++ +V T S S R
Sbjct: 735 LRNHDGSL------LIPSDQQTPTVPTLQPSDLNVVVSNSTLVPAFTGFSSSTSSAGSNR 788
Query: 313 SDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
S PP T T K V P ++ INT++L QR+ L +YS
Sbjct: 789 CASASPP--------TRTFKAVLPP------ITQEQFERFGRINTEKLVQRVKEHLLQYS 834
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQEPEF 422
I Q +F ++VL SQG++SDLL PK W L + GRE F RM +L++ E
Sbjct: 835 ISQRVFGEQVLGLSQGSVSDLLARPKAWHMLTQKGREPFIRMHAFLEDNEL 885
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR-ETFRRM 413
++T + + A L +SI Q +F + VL SQG +SDLL PKPW L + R E RM
Sbjct: 952 LDTVAIVEETRAVLAAHSIGQKLFGEAVLHLSQGFVSDLLSKPKPWEALSAKRKEALLRM 1011
Query: 414 WKWLQEPE 421
WL++ +
Sbjct: 1012 QAWLKDAD 1019
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
E ++T E+A++ L ++I Q + A+ VL +SQG +S+LL PK WS L + + +FR
Sbjct: 535 EPLDTAEIARQCRRVLTEHNIGQRLIAKYVLNQSQGAVSELLSKPKKWSSLTERSKNSFR 594
Query: 412 RMWKWL 417
R+ WL
Sbjct: 595 RLKAWL 600
>gi|256078012|ref|XP_002575292.1| cut-like 1 (ctl1) [Schistosoma mansoni]
Length = 1223
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 49/71 (69%), Gaps = 1/71 (1%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRET 409
++E +NT E+A + L +++I Q F++ +L SQGT+S+LL P+PW +L + GR++
Sbjct: 482 QIEPLNTFEIAATVRDLLTKHNISQRQFSRHILKLSQGTMSELLSKPRPWYRLTARGRDS 541
Query: 410 FRRMWKWLQEP 420
+RR+ W+ +P
Sbjct: 542 YRRLQAWISDP 552
>gi|193631961|ref|XP_001948782.1| PREDICTED: hypothetical protein LOC100161988 isoform 1
[Acyrthosiphon pisum]
Length = 1431
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ + +NT+E+ +++ +L ++SI Q +F + VL SQG++SDLL
Sbjct: 778 PLKAVLPPITQQQFDQFNNLNTEEIVRKVKEQLSQFSISQRLFGESVLGLSQGSVSDLLA 837
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 838 RPKPWHMLTQKGREPFIRMKMFLED 862
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L +++ Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 406 QNLDTLHIARRVRELLSIHNVGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 465
Query: 412 RMWKW 416
+M W
Sbjct: 466 KMHAW 470
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1042 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1101
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + +R+ A++
Sbjct: 1102 RMQLWLSDQHNVERLQAIK 1120
>gi|353231498|emb|CCD77916.1| cut-like 1 (ctl1) [Schistosoma mansoni]
Length = 1223
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 49/71 (69%), Gaps = 1/71 (1%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRET 409
++E +NT E+A + L +++I Q F++ +L SQGT+S+LL P+PW +L + GR++
Sbjct: 482 QIEPLNTFEIAATVRDLLTKHNISQRQFSRHILKLSQGTMSELLSKPRPWYRLTARGRDS 541
Query: 410 FRRMWKWLQEP 420
+RR+ W+ +P
Sbjct: 542 YRRLQAWISDP 552
>gi|270009064|gb|EFA05512.1| hypothetical protein TcasGA2_TC015699 [Tribolium castaneum]
Length = 1283
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ + +NT+++ +++ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 669 PLKAVLPPITQQQFDQYNNLNTEDIVKKVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 728
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 729 RPKPWHMLTQKGREPFIRMKMFLED 753
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 312 QNLDTLNISRRVRELLSIHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 371
Query: 412 RMWKW 416
+M W
Sbjct: 372 KMHAW 376
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 893 QDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 952
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 953 RMQLWLNDAHNIDRLQALK 971
>gi|189238507|ref|XP_970668.2| PREDICTED: similar to Homeobox protein cut [Tribolium castaneum]
Length = 1297
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ + +NT+++ +++ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 662 PLKAVLPPITQQQFDQYNNLNTEDIVKKVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 721
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 722 RPKPWHMLTQKGREPFIRMKMFLED 746
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 312 QNLDTLNISRRVRELLSIHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 371
Query: 412 RMWKW 416
+M W
Sbjct: 372 KMHAW 376
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 886 QDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 945
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 946 RMQLWLNDAHNIDRLQALK 964
>gi|339243779|ref|XP_003377815.1| putative CUT domain protein [Trichinella spiralis]
gi|316973338|gb|EFV56940.1| putative CUT domain protein [Trichinella spiralis]
Length = 771
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 82/171 (47%), Gaps = 23/171 (13%)
Query: 255 LSPNEGSLHSSPTTLVTNFQGAATIPTM--NGITTITPHTVVVSPQTSPQHSLISNQLQR 312
L ++GSL L+ + Q T+PT+ + + + ++ +V T S S R
Sbjct: 230 LRNHDGSL------LIPSDQQTPTVPTLQPSDLNVVVSNSTLVPAFTGFSSSTSSAGSNR 283
Query: 313 SDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
S PP T T K V P ++ INT++L QR+ L +YS
Sbjct: 284 CASASPP--------TRTFKAVLP------PITQEQFERFGRINTEKLVQRVKEHLLQYS 329
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQEPEF 422
I Q +F ++VL SQG++SDLL PK W L + GRE F RM +L++ E
Sbjct: 330 ISQRVFGEQVLGLSQGSVSDLLARPKAWHMLTQKGREPFIRMHAFLEDNEL 380
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGR-ETFRRM 413
++T + + A L +SI Q +F + VL SQG +SDLL PKPW L + R E RM
Sbjct: 447 LDTVAIVEETRAVLAAHSIGQKLFGEAVLHLSQGFVSDLLSKPKPWEALSAKRKEALLRM 506
Query: 414 WKWLQEPE 421
WL++ +
Sbjct: 507 QAWLKDAD 514
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
E ++T E+A++ L ++I Q + A+ VL +SQG +S+LL PK WS L + + +FR
Sbjct: 30 EPLDTAEIARQCRRVLTEHNIGQRLIAKYVLNQSQGAVSELLSKPKKWSSLTERSKNSFR 89
Query: 412 RMWKWL 417
R+ WL
Sbjct: 90 RLKAWL 95
>gi|383861555|ref|XP_003706251.1| PREDICTED: homeobox protein cut-like, partial [Megachile rotundata]
Length = 1662
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+++ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 986 PLKAVLPPITQQQFDMYNNLNTEDIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 1045
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1046 RPKPWHMLTQKGREPFIRMKMFLED 1070
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 615 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 674
Query: 412 RMWKW 416
+M W
Sbjct: 675 KMHGW 679
>gi|426357361|ref|XP_004046012.1| PREDICTED: homeobox protein cut-like 1-like [Gorilla gorilla gorilla]
Length = 1506
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 979 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1038
Query: 412 RMWKWL 417
RM WL
Sbjct: 1039 RMQLWL 1044
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1163 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1222
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1223 MQLWLNDP 1230
>gi|350276180|ref|NP_001231882.1| homeobox protein cut-like 2 isoform 2 [Xenopus (Silurana)
tropicalis]
Length = 1255
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T +L +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 659 EVDTLDLTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 718
Query: 413 MWKWL 417
M WL
Sbjct: 719 MQLWL 723
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 809 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 868
Query: 413 MWKWLQEPE-FQRMSALR 429
M WL +P +R+ ++
Sbjct: 869 MQLWLSDPHNIERLRDMK 886
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 49/76 (64%), Gaps = 3/76 (3%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
I+T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +M
Sbjct: 314 IDTSEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKM 373
Query: 414 WKWLQEPEFQRMSALR 429
++L + Q + ALR
Sbjct: 374 KQFLSDE--QNVLALR 387
>gi|345305012|ref|XP_001506212.2| PREDICTED: cut-like homeobox 2 [Ornithorhynchus anatinus]
Length = 1585
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T +L +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 990 EVDTLDLTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1049
Query: 413 MWKWL 417
M WL
Sbjct: 1050 MQLWL 1054
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1141 ELDTYAITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1200
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1201 MQLWLNDP 1208
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 51/78 (65%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F
Sbjct: 635 EQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFI 694
Query: 412 RMWKWLQEPEFQRMSALR 429
+M ++L + Q + ALR
Sbjct: 695 KMKQFLSDE--QNVLALR 710
>gi|307170364|gb|EFN62686.1| Homeobox protein cut [Camponotus floridanus]
Length = 1517
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+++ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 842 PLKAVLPPITQQQFDMYNNLNTEDIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 901
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 902 RPKPWHMLTQKGREPFIRMKMFLED 926
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 460 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 519
Query: 412 RMWKW 416
+M W
Sbjct: 520 KMHGW 524
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1085 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1144
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 1145 RMQLWLSDAHNVDRLQALK 1163
>gi|345313284|ref|XP_003429368.1| PREDICTED: homeobox protein cut-like 1-like, partial [Ornithorhynchus
anatinus]
Length = 1635
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F
Sbjct: 1532 QEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFI 1591
Query: 412 RMWKWL 417
RM WL
Sbjct: 1592 RMQLWL 1597
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G+ E EE++T E+A+++ +L +++I Q IF VL SQG++S++L PKPW+KL
Sbjct: 1132 GSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVR 1191
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 1192 GKEPFHKMKQFLADE--QNILALR 1213
>gi|332028054|gb|EGI68105.1| Homeobox protein cut [Acromyrmex echinatior]
Length = 1436
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+++ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 761 PLKAVLPPITQQQFDLYNNLNTEDIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 820
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 821 RPKPWHMLTQKGREPFIRMKMFLED 845
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 382 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 441
Query: 412 RMWKW 416
+M W
Sbjct: 442 KMHGW 446
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1004 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1063
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 1064 RMQLWLSDAHNVDRLQALK 1082
>gi|327284411|ref|XP_003226931.1| PREDICTED: homeobox protein cut-like 2-like [Anolis carolinensis]
Length = 1653
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T +L +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 1088 EVDTLDLTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 1147
Query: 413 MWKWL 417
M WL
Sbjct: 1148 MQLWL 1152
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1239 ELDTYAITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1298
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1299 MQLWLNDP 1306
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E ++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F
Sbjct: 738 EHLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFI 797
Query: 412 RMWKWLQEPEFQRMSALR 429
+M ++L + Q + ALR
Sbjct: 798 KMKQFLSDE--QNVLALR 813
>gi|197245578|gb|AAI68490.1| cux2 protein [Xenopus (Silurana) tropicalis]
Length = 1436
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T +L +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 840 EVDTLDLTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 899
Query: 413 MWKWL 417
M WL
Sbjct: 900 MQLWL 904
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 990 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1049
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1050 MQLWLSDP 1057
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 49/76 (64%), Gaps = 3/76 (3%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
I+T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +M
Sbjct: 495 IDTSEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKM 554
Query: 414 WKWLQEPEFQRMSALR 429
++L + Q + ALR
Sbjct: 555 KQFLSDE--QNVLALR 568
>gi|345480393|ref|XP_001607026.2| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like [Nasonia
vitripennis]
Length = 1702
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+++ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 1022 PLKAVLPPITQQQFDIYNNLNTEDIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 1081
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1082 RPKPWHMLTQKGREPFIRMKMFLED 1106
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 623 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 682
Query: 412 RMWKW 416
+M W
Sbjct: 683 KMHGW 687
>gi|350276182|ref|NP_001128286.2| homeobox protein cut-like 2 isoform 1 [Xenopus (Silurana)
tropicalis]
Length = 1498
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRR 412
E++T +L +++ +L + I Q IF ++VL SQG++SD+L PKPWSKL + GRE F R
Sbjct: 902 EVDTLDLTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIR 961
Query: 413 MWKWL 417
M WL
Sbjct: 962 MQLWL 966
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1052 ELDTYSITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1111
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1112 MQLWLSDP 1119
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 49/76 (64%), Gaps = 3/76 (3%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
I+T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +M
Sbjct: 557 IDTSEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKM 616
Query: 414 WKWLQEPEFQRMSALR 429
++L + Q + ALR
Sbjct: 617 KQFLSDE--QNVLALR 630
>gi|350401842|ref|XP_003486278.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein cut-like [Bombus
impatiens]
Length = 1748
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 58/101 (57%), Gaps = 2/101 (1%)
Query: 321 IISQ-QTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFA 379
+ISQ T T P AV+ ++ +NT+++ +R+ +L +YSI Q +F
Sbjct: 1056 LISQPSTPTHHAANQRPLKAVLPPITQQQFDLYNNLNTEDIVKRVKEQLSQYSISQRLFG 1115
Query: 380 QRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
+ VL SQG++SDLL PKPW L + GRE F RM +L++
Sbjct: 1116 ESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLED 1156
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 691 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 750
Query: 412 RMWKW 416
+M W
Sbjct: 751 KMHGW 755
>gi|340718568|ref|XP_003397737.1| PREDICTED: homeobox protein cut-like [Bombus terrestris]
Length = 1740
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 58/101 (57%), Gaps = 2/101 (1%)
Query: 321 IISQ-QTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFA 379
+ISQ T T P AV+ ++ +NT+++ +R+ +L +YSI Q +F
Sbjct: 1048 LISQPSTPTHHAANQRPLKAVLPPITQQQFDLYNNLNTEDIVKRVKEQLSQYSISQRLFG 1107
Query: 380 QRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
+ VL SQG++SDLL PKPW L + GRE F RM +L++
Sbjct: 1108 ESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLED 1148
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 683 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 742
Query: 412 RMWKW 416
+M W
Sbjct: 743 KMHGW 747
>gi|347965459|ref|XP_551659.3| AGAP001187-PA [Anopheles gambiae str. PEST]
gi|333470501|gb|EAL38642.3| AGAP001187-PA [Anopheles gambiae str. PEST]
Length = 1942
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 85/168 (50%), Gaps = 19/168 (11%)
Query: 257 PNEGSLHSSPTTLVTNFQGAATIPTMNGIT--TITPHTVVVSPQTSPQHSL--ISNQLQR 312
P+E S +PTT ++ AA P N I TITP T S SP + I+N L
Sbjct: 1017 PSEPSGMCTPTTTASS---AAPSPIGNSILPPTITP-TDDFSVAASPLQRMASITNSLI- 1071
Query: 313 SDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
+ P P +QQ P A++ ++ +NT+++ +R+ L +YS
Sbjct: 1072 TQPPVTPHHTTQQR---------PLKAILPPITQQQFDLYNNLNTEDIVRRVKESLSQYS 1122
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
I Q +F + VL SQG++SDLL PKPW L + GRE F RM +L++
Sbjct: 1123 ISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLED 1170
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 576 QNLDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 635
Query: 412 RMWKW 416
+M W
Sbjct: 636 KMHAW 640
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1403 QDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1462
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 1463 RMQLWLSDANNVERLQVLK 1481
>gi|328792123|ref|XP_623857.3| PREDICTED: homeobox protein cut [Apis mellifera]
Length = 1704
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+++ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 1016 PLKAVLPPITQQQFDLYNNLNTEDIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 1075
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 1076 RPKPWHMLTQKGREPFIRMKMFLED 1100
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+++ L ++I Q +FA+ +L SQGT+S+LL PKPW KL + GR+++R
Sbjct: 644 QNLDTLHIARKVRELLSIHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKLTEKGRDSYR 703
Query: 412 RMWKW 416
+M W
Sbjct: 704 KMHGW 708
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1269 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1328
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 1329 RMQLWLSDAHNVDRLQALK 1347
>gi|312376309|gb|EFR23435.1| hypothetical protein AND_12875 [Anopheles darlingi]
Length = 1322
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P A++ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 396 PLKAILPPITQQQFDLYNNLNTEDIVRRVKESLSQYSISQRLFGESVLGLSQGSVSDLLA 455
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 456 RPKPWHMLTQKGREPFIRMKMFLED 480
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 726 QDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 785
Query: 412 RMWKWLQEP-EFQRMSALR 429
RM WL + +R+ L+
Sbjct: 786 RMQLWLSDANNVERLQTLK 804
>gi|157116428|ref|XP_001658488.1| hypothetical protein AaeL_AAEL007605 [Aedes aegypti]
gi|108876456|gb|EAT40681.1| AAEL007605-PA [Aedes aegypti]
Length = 1502
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P A++ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 715 PLKAILPPITQQQFDLYNNLNTEDIVRRVKESLSQYSISQRLFGESVLGLSQGSVSDLLA 774
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 775 RPKPWHMLTQKGREPFIRMKMFLED 799
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 306 QNLDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 365
Query: 412 RMWKW 416
+M W
Sbjct: 366 KMHAW 370
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 63/119 (52%), Gaps = 10/119 (8%)
Query: 313 SDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
S PGPP ++ L + ++PT + A++ ++++T+ + +I L +
Sbjct: 988 SHQPGPPPGVNSMRGALH-QHISPTVYEMAALT-------QDLDTQTITTKIKEALLANN 1039
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEP-EFQRMSALR 429
I Q IF + VL SQG++S+LL PKPW L GRE F RM WL + +R+ L+
Sbjct: 1040 IGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDANNVERLQMLK 1098
>gi|405977106|gb|EKC41572.1| Protein CASP [Crassostrea gigas]
Length = 1528
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 333 TVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSD 392
+ P AV+ ++ + INT +L +++ L +YSI Q +F + VL SQG++SD
Sbjct: 911 SAKPLKAVLPPITQEQFDKYSNINTDDLVKQVKETLSQYSISQRLFGESVLGLSQGSVSD 970
Query: 393 LLRNPKPWSKL-KSGRETFRRMWKWLQEPE 421
LL PKPW L + GRE F RM +L++ E
Sbjct: 971 LLARPKPWHMLTQKGREPFIRMQIFLEDTE 1000
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
EI+T EL R+ L+ +++ Q +F + VL SQG++S+LL PKPW L GRE F +
Sbjct: 1107 EIDTLELTSRVKDVLQFHNLGQKLFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIK 1166
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1167 MHLWLSDP 1174
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 43/63 (68%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
I+T +A+ + L ++I Q +FA+ VL SQGT+S+LL PK W KL + GRE++R+M
Sbjct: 544 IDTSLVAKTVRELLSIHNIGQRLFAKHVLGLSQGTVSELLSKPKSWDKLTEKGRESYRKM 603
Query: 414 WKW 416
+ W
Sbjct: 604 YAW 606
>gi|357618141|gb|EHJ71235.1| putative Homeobox protein cut [Danaus plexippus]
Length = 976
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
+NT+E+ +R+ L +YSI Q +F + VL SQG++SDLL PKPW L + GRE F RM
Sbjct: 367 LNTEEIVKRVKEALSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRM 426
Query: 414 WKWLQE 419
+L++
Sbjct: 427 KMFLED 432
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 569 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 628
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 629 RMQLWLSDAHNIDRLQALK 647
>gi|2605704|gb|AAB86814.1| CEH-21 [Caenorhabditis elegans]
Length = 183
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 39/51 (76%), Gaps = 1/51 (1%)
Query: 375 QAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWL-QEPEFQR 424
Q A+++L RSQGTLSDLLR PKPWS +K+GR TF+RM WL +P+ +R
Sbjct: 1 QTALAEKILARSQGTLSDLLRMPKPWSVMKNGRATFQRMSNWLGLDPDVRR 51
>gi|242013583|ref|XP_002427482.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212511877|gb|EEB14744.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1028
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P AV+ ++ +NT+E+ +R+ +L +YSI Q +F + VL SQG++SDLL
Sbjct: 371 PLKAVLPPITQQQFDMFNNLNTEEIVKRVKEQLSQYSISQRLFGESVLGLSQGSVSDLLA 430
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PK W L + GRE F RM +L++
Sbjct: 431 RPKSWHMLTQKGREPFIRMKMFLED 455
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 67/129 (51%), Gaps = 11/129 (8%)
Query: 305 LISNQLQRSDSPG--PPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQ 362
+++N L++ +P PP+ + ++P+ + A++ ++++T+ +
Sbjct: 574 MVANDLKKGLAPASMPPQPHQLTNMRALHQHISPSVYEMAALT-------QDLDTQIITT 626
Query: 363 RISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEPE 421
+I L +I Q IF + VL SQG++S+LL PKPW L GRE F RM WL +
Sbjct: 627 KIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDAH 686
Query: 422 -FQRMSALR 429
R+ AL+
Sbjct: 687 NIDRLQALK 695
>gi|326671334|ref|XP_003199414.1| PREDICTED: LOW QUALITY PROTEIN: si:ch211-251d10.5 [Danio rerio]
Length = 1455
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T +L Q++ +L + I Q IF ++VL SQG++SD+L PK WSKL + GRE F
Sbjct: 967 QEVDTVDLTQQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKHWSKLTQKGREPFI 1026
Query: 412 RMWKWL 417
RM WL
Sbjct: 1027 RMQLWL 1032
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G++ E EE++T E+A+++ +L +++I Q +F VL SQG++S++L PKPWSKL
Sbjct: 576 GSTSEGEEMDTAEIARQVKEQLIKHNIGQRVFGHYVLGLSQGSVSEILARPKPWSKLTIR 635
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M +L + Q + ALR
Sbjct: 636 GKEPFHKMKHFLSDE--QNILALR 657
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1122 ELDTYVITKKVKEVLTDNNLGQRLFGESILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1181
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1182 MQLWLSDP 1189
>gi|170058228|ref|XP_001864829.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877409|gb|EDS40792.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1650
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 336 PTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 395
P A++ ++ +NT+++ +R+ L +YSI Q +F + VL SQG++SDLL
Sbjct: 853 PLKAILPPITQQQFDLYNSLNTEDIVRRVKESLSQYSISQRLFGESVLGLSQGSVSDLLA 912
Query: 396 NPKPWSKL-KSGRETFRRMWKWLQE 419
PKPW L + GRE F RM +L++
Sbjct: 913 RPKPWHMLTQKGREPFIRMKMFLED 937
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 424 QNLDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 483
Query: 412 RMWKW 416
+M W
Sbjct: 484 KMHAW 488
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 1165 QDLDTQVITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 1224
Query: 412 RMWKWLQE 419
RM WL +
Sbjct: 1225 RMQLWLSD 1232
>gi|391334039|ref|XP_003741416.1| PREDICTED: uncharacterized protein LOC100899643 [Metaseiulus
occidentalis]
Length = 1296
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 325 QTATLTLKTVTP--TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRV 382
Q+A +L T +P AV+ ++ + +NT+++ +++ +L +YSI Q +F + V
Sbjct: 662 QSAMPSLATPSPRQAKAVLPPITQQQFDKYNNLNTEDIVKKVKEQLSQYSISQRLFGESV 721
Query: 383 LCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQEPE 421
L SQG++SDLL PKPW L + GRE F RM +L++ +
Sbjct: 722 LGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKIFLEDEQ 761
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
E +NT ++A+ + L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 385 EALNTGQIARCVRELLSAHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 444
Query: 412 RMWKW 416
+M W
Sbjct: 445 KMHAW 449
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T+ + RI L ++I Q IF + VL SQG++S+LL PKPW L GRE F R
Sbjct: 915 DLDTQVITARIKETLMAHNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIR 974
Query: 413 MWKWLQEPE-FQRMSALR 429
M WL +P ++ AL+
Sbjct: 975 MQLWLSDPHNVDKLQALK 992
>gi|241713077|ref|XP_002413467.1| hypothetical protein IscW_ISCW022880 [Ixodes scapularis]
gi|215507281|gb|EEC16775.1| hypothetical protein IscW_ISCW022880 [Ixodes scapularis]
Length = 1388
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
+NT+E+ +++ +L +YSI Q +F + VL SQG++SDLL PKPW L + GRE F RM
Sbjct: 699 LNTEEIVKKVKEQLSQYSISQRLFGENVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRM 758
Query: 414 WKWLQE 419
+L++
Sbjct: 759 KIFLED 764
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETF 410
E +NT+ +A+ + L +++ Q +FA+ VL SQGT+S+LL PKPW KL + GR+++
Sbjct: 277 CEALNTQNIARVVRELLSVHNVGQRLFARFVLGLSQGTVSELLSKPKPWDKLTEKGRDSY 336
Query: 411 RRMWKW 416
R+M W
Sbjct: 337 RKMHAW 342
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T+ + +I L ++I Q IF + VL SQG++S+LL PKPW L GRE F R
Sbjct: 908 DLDTQNITAKIKETLMAHNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFIR 967
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 968 MQLWLNDP 975
>gi|312066880|ref|XP_003136480.1| CUT domain-containing protein [Loa loa]
Length = 1271
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
+NT E+A++ + ++I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE FRRM
Sbjct: 579 LNTTEIAKQCKHLMVTHNIGQRLFAKYVMNQSQGSLSELLSKPRHWNKLTDKGREAFRRM 638
Query: 414 WKWLQEPE 421
+ W+ +P+
Sbjct: 639 YGWVSDPK 646
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 46/75 (61%), Gaps = 4/75 (5%)
Query: 351 ELEE---INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSG 406
ELE ++T+++ ++I L SI Q F + VL SQG++SDLL PKPW+ L + G
Sbjct: 839 ELERYPVLDTEDIVRQIKDFLCLNSISQRQFGEYVLGLSQGSVSDLLARPKPWTMLTQKG 898
Query: 407 RETFRRMWKWLQEPE 421
RE F RM +L E +
Sbjct: 899 REPFVRMQLFLNEAQ 913
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMW 414
+T E+A+++ L I Q F VL S G +S+LL PK W +L R+ + RM
Sbjct: 1004 DTSEIARQVKDRLHSSGISQKTFGDVVLGVSPGGVSELLAKPKTWEQLSPRARDLYLRMS 1063
Query: 415 KWLQEPEFQR 424
+WL E + R
Sbjct: 1064 EWLGELDGNR 1073
>gi|308454600|ref|XP_003089911.1| hypothetical protein CRE_09901 [Caenorhabditis remanei]
gi|308267792|gb|EFP11745.1| hypothetical protein CRE_09901 [Caenorhabditis remanei]
Length = 338
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
INTKE+ I++ LK+ I Q FA++VL ++QG S + RNP PW +L + G+ F RM
Sbjct: 189 INTKEVVGDINSWLKKAGINQTTFAEKVLEKTQGHFSVISRNPAPWEELLAPGKAVFVRM 248
Query: 414 WKWLQEPEFQRMSALRL 430
+ W++ E ++ L +
Sbjct: 249 YNWMKLSEEEKNKILNV 265
>gi|312376305|gb|EFR23432.1| hypothetical protein AND_12876 [Anopheles darlingi]
Length = 729
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 611 QNLDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 670
Query: 412 RMWKW 416
+M W
Sbjct: 671 KMHAW 675
>gi|118344294|ref|NP_001071971.1| zinc finger protein [Ciona intestinalis]
gi|70571632|dbj|BAE06789.1| zinc finger protein [Ciona intestinalis]
Length = 831
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 46/77 (59%), Gaps = 1/77 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFR 411
+EINT LA + A L+ +I FA+ L RSQGTLSDLL +PKPW L S GR+ +
Sbjct: 162 KEINTSTLAADMRAYLRSVNISLDNFAKNYLLRSQGTLSDLLNHPKPWHGLSSRGRDIYM 221
Query: 412 RMWKWLQEPEFQRMSAL 428
+M ++L + A+
Sbjct: 222 KMIEFLSAENLNELVAV 238
>gi|268534922|ref|XP_002632594.1| Hypothetical protein CBG13862 [Caenorhabditis briggsae]
Length = 602
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
E +NTK+LA+R+ E+ + I +A FA+ +L SQ LS LL PK W K+K E +
Sbjct: 447 EYVNLNTKQLAERLIQEMNTHKISRAKFAKNILNTSQMALSYLLNKPKSWEKIKDRSENY 506
Query: 411 RRMWKWLQEPEFQRMSALR 429
R++ WL P RM +
Sbjct: 507 VRIYNWLNLPVKARMEYFK 525
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 65/121 (53%), Gaps = 8/121 (6%)
Query: 310 LQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELK 369
+ +SD P P I T L+ + + + SG ++ +KEL++++S E++
Sbjct: 306 IHQSDKPIPENI-----DTEVLRKNNSIDHITNELDAPLSGSMD---SKELSEKLSQEIQ 357
Query: 370 RYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALR 429
Y I + FA++VL + L +L++PKPW+KL+S R R++ WL P +R+ L
Sbjct: 358 NYDICRQNFAKKVLNKGPQVLRKMLKDPKPWNKLQSARVNKIRIYNWLNLPLKKRLGILD 417
Query: 430 L 430
+
Sbjct: 418 M 418
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 39/58 (67%)
Query: 371 YSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSAL 428
+ I Q+I +RV+ RSQGTLS +L P+PWS + GR+ F R++ W+ PE +R+ +
Sbjct: 4 HEISQSILGRRVVNRSQGTLSVILNKPEPWSFIGPGRQAFVRLYNWINLPEKKRLEIV 61
>gi|118344440|ref|NP_001072041.1| zinc finger protein [Ciona intestinalis]
gi|92081484|dbj|BAE93289.1| zinc finger protein [Ciona intestinalis]
Length = 831
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 46/77 (59%), Gaps = 1/77 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFR 411
+EINT LA + A L+ +I FA+ L RSQGTLSDLL +PKPW L S GR+ +
Sbjct: 162 KEINTSTLAADMRAYLRSVNISLDNFAKNYLLRSQGTLSDLLNHPKPWHGLSSRGRDIYM 221
Query: 412 RMWKWLQEPEFQRMSAL 428
+M ++L + A+
Sbjct: 222 KMIEFLSAENLNELVAV 238
>gi|322791714|gb|EFZ15990.1| hypothetical protein SINV_06724 [Solenopsis invicta]
Length = 335
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 254 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 313
Query: 412 RMWKW 416
+M W
Sbjct: 314 KMHGW 318
>gi|242013581|ref|XP_002427481.1| hypothetical protein Phum_PHUM322020 [Pediculus humanus corporis]
gi|212511876|gb|EEB14743.1| hypothetical protein Phum_PHUM322020 [Pediculus humanus corporis]
Length = 350
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 45/65 (69%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +A+R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 266 QNLDTLHIARRVRELLSIHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 325
Query: 412 RMWKW 416
+M W
Sbjct: 326 KMHAW 330
>gi|393907009|gb|EFO27593.2| CUT domain-containing protein [Loa loa]
Length = 1288
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A++ + ++I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 593 LNTTEIAKQCKHLMVTHNIGQRLFAKYVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 652
Query: 411 RRMWKWLQEPE 421
RRM+ W+ +P+
Sbjct: 653 RRMYGWVSDPK 663
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 46/75 (61%), Gaps = 4/75 (5%)
Query: 351 ELEE---INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSG 406
ELE ++T+++ ++I L SI Q F + VL SQG++SDLL PKPW+ L + G
Sbjct: 856 ELERYPVLDTEDIVRQIKDFLCLNSISQRQFGEYVLGLSQGSVSDLLARPKPWTMLTQKG 915
Query: 407 RETFRRMWKWLQEPE 421
RE F RM +L E +
Sbjct: 916 REPFVRMQLFLNEAQ 930
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMW 414
+T E+A+++ L I Q F VL S G +S+LL PK W +L R+ + RM
Sbjct: 1021 DTSEIARQVKDRLHSSGISQKTFGDVVLGVSPGGVSELLAKPKTWEQLSPRARDLYLRMS 1080
Query: 415 KWLQEPEFQR 424
+WL E + R
Sbjct: 1081 EWLGELDGNR 1090
>gi|170591642|ref|XP_001900579.1| CUT domain containing protein [Brugia malayi]
gi|158592191|gb|EDP30793.1| CUT domain containing protein [Brugia malayi]
Length = 1247
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A++ + ++I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 552 LNTTEIAKQCKHLMVTHNIGQRLFAKYVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 611
Query: 411 RRMWKWLQEPE 421
RRM+ W+ +P+
Sbjct: 612 RRMYGWVSDPK 622
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T+++ ++I L SI Q F + VL SQG++SDLL PKPW+ L + GRE F RM
Sbjct: 822 LDTEDIVRQIKDFLCLNSISQRQFGEHVLGLSQGSVSDLLARPKPWAMLTQKGREPFVRM 881
Query: 414 WKWLQEPE 421
+L E +
Sbjct: 882 QLFLNEAQ 889
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
EL ++T E+A+++ L I Q F +L S G +S+LL PK W +L R+
Sbjct: 976 ELSMVDTSEIARQVKDRLHSSGISQKTFGDVILGVSPGGVSELLTKPKTWEQLSPRARDL 1035
Query: 410 FRRMWKWLQE 419
+ +M +WL E
Sbjct: 1036 YLKMSEWLSE 1045
>gi|432094939|gb|ELK26347.1| Homeobox protein cut-like 2 [Myotis davidii]
Length = 1360
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 45/73 (61%), Gaps = 9/73 (12%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRV--------LCRSQGTLSDLLRNPKPWSKL-K 404
E++T EL +++ +L + I Q IF ++V L SQG++SD+L PKPWSKL +
Sbjct: 761 EVDTLELTRQVKEKLAKNGICQRIFGEKVKTGAGEQVLGLSQGSVSDMLSRPKPWSKLTQ 820
Query: 405 SGRETFRRMWKWL 417
GRE F RM WL
Sbjct: 821 KGREPFIRMQLWL 833
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 920 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 979
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 980 MQLWLNDP 987
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 44/65 (67%), Gaps = 1/65 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 410 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 469
Query: 413 MWKWL 417
M ++L
Sbjct: 470 MKQFL 474
>gi|402592099|gb|EJW86028.1| hypothetical protein WUBG_03061, partial [Wuchereria bancrofti]
Length = 1221
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A++ + ++I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 526 LNTTEIAKQCKHLMVTHNIGQRLFAKYVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 585
Query: 411 RRMWKWLQEPE 421
RRM+ W+ +P+
Sbjct: 586 RRMYGWVSDPK 596
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T+++ ++I L SI Q F + VL SQG++SDLL PKPW+ L + GRE F RM
Sbjct: 809 LDTEDIVRQIKDFLCLNSISQRQFGEHVLGLSQGSVSDLLARPKPWAMLTQKGREPFVRM 868
Query: 414 WKWLQEPE 421
+L E +
Sbjct: 869 QLFLNEAQ 876
>gi|308469531|ref|XP_003097003.1| hypothetical protein CRE_21438 [Caenorhabditis remanei]
gi|308241203|gb|EFO85155.1| hypothetical protein CRE_21438 [Caenorhabditis remanei]
Length = 525
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 7/101 (6%)
Query: 330 TLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGT 389
TL T NA++ + +NTK +A + LKR Q FA ++L R + T
Sbjct: 423 TLITAESFNAIINR-------SVNYVNTKNIANMMKNWLKRTHATQEWFATKILGRCKRT 475
Query: 390 LSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 430
LS L NPK W L RE + +M+ W+ E QR +R+
Sbjct: 476 LSRCLNNPKDWKDLSENREIYVKMYNWMCLTEEQRHEMMRV 516
>gi|341877743|gb|EGT33678.1| hypothetical protein CAEBREN_06506 [Caenorhabditis brenneri]
Length = 1278
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A + Y+I Q +FA+ V+ +SQG+LS+LL P+ WSKL GRE F
Sbjct: 597 LNTTEIASNCKRLMIAYNIGQRLFAKHVMNQVVKSQGSLSELLSKPRHWSKLTDKGREAF 656
Query: 411 RRMWKWLQEPE 421
RR++ W+ + E
Sbjct: 657 RRIYGWISDDE 667
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T+ L ++I L SI Q F + +L SQG++SDLL PK W +L + GRE F RM
Sbjct: 843 LDTERLVKQIKEFLSNNSISQRQFGEHILGLSQGSVSDLLARPKSWVQLTQKGREPFIRM 902
Query: 414 WKWLQE 419
+L E
Sbjct: 903 KLFLDE 908
>gi|432892167|ref|XP_004075686.1| PREDICTED: homeobox protein cut-like 1-like [Oryzias latipes]
Length = 1377
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1039 ELDTYAITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1098
Query: 413 MWKWLQEP 420
M WLQ+P
Sbjct: 1099 MQLWLQDP 1106
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 53/78 (67%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E+++T E+A+++ +L +++I Q +F VL SQG++S++L PKPW+KL G+E F
Sbjct: 601 EDMDTAEIARQVKEQLIKHNIGQRVFGHYVLGLSQGSVSEILARPKPWNKLTIRGKEPFH 660
Query: 412 RMWKWLQEPEFQRMSALR 429
+M ++L + Q + ALR
Sbjct: 661 KMRQFLADE--QNILALR 676
>gi|268573146|ref|XP_002641550.1| C. briggsae CBR-CEH-44 protein [Caenorhabditis briggsae]
Length = 1309
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A + Y+I Q +FA+ V+ +SQG+LS+LL P+ WSKL GRE F
Sbjct: 620 LNTTEIASHCKRLMIAYNIGQRLFAKVVMNQVVKSQGSLSELLSKPRHWSKLTDKGREAF 679
Query: 411 RRMWKWLQEPE 421
RR++ W+ + E
Sbjct: 680 RRIYGWISDDE 690
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T++L + I L SI Q F + VL SQG++SDLL PK W++L + GRE F RM
Sbjct: 872 LDTEQLVRTIKEFLTVNSISQRQFGEHVLGLSQGSVSDLLARPKTWTQLTQKGREPFIRM 931
Query: 414 WKWLQEPE 421
++ + E
Sbjct: 932 QLFMDDVE 939
>gi|308501371|ref|XP_003112870.1| CRE-CEH-44 protein [Caenorhabditis remanei]
gi|308265171|gb|EFP09124.1| CRE-CEH-44 protein [Caenorhabditis remanei]
Length = 1461
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A + Y+I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 594 LNTTEIASHCKRLMIAYNIGQRLFAKHVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 653
Query: 411 RRMWKWLQEPE 421
RR++ W+ + E
Sbjct: 654 RRIYGWISDDE 664
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T+ L ++I L S+ Q F + +L SQG++SDLL PK W +L + GRE F RM
Sbjct: 830 LDTELLVKQIKEFLTVNSMSQRQFGEFILGLSQGSVSDLLARPKTWIQLTQKGREPFIRM 889
Query: 414 WKWLQE 419
++ E
Sbjct: 890 QLFMDE 895
>gi|17556164|ref|NP_497576.1| Protein CEH-44, isoform a [Caenorhabditis elegans]
gi|75021616|sp|Q9BL02.1|CUT_CAEEL RecName: Full=Homeobox protein cut-like ceh-44; Short=Homeobox
protein 44
gi|373254568|emb|CCD73822.1| Protein CEH-44, isoform a [Caenorhabditis elegans]
Length = 1273
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVL---CRSQGTLSDLLRNPKPWSKL-KSGRETF 410
+NT E+A + Y+I Q +FA+ V+ +SQG+LS+LL P+ W+KL GRE F
Sbjct: 600 LNTTEIASHCKRLMIAYNIGQRLFAKHVMNQVVKSQGSLSELLSKPRHWNKLTDKGREAF 659
Query: 411 RRMWKWLQEPE 421
RR++ W+ + E
Sbjct: 660 RRIYGWISDDE 670
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T L ++I L SI Q F + +L SQG++SDLL PK W++L + GRE F RM
Sbjct: 841 LDTDFLVKQIKEFLTMNSISQRQFGEYILGLSQGSVSDLLARPKTWAQLTQKGREPFIRM 900
Query: 414 WKWLQEPE 421
++ + E
Sbjct: 901 QLFMDDVE 908
>gi|297692965|ref|XP_002823799.1| PREDICTED: homeobox protein cut-like 2-like [Pongo abelii]
Length = 460
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 20 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 79
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 80 MQLWLNDP 87
>gi|348534809|ref|XP_003454894.1| PREDICTED: homeobox protein cut-like 1-like [Oreochromis niloticus]
Length = 1383
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1044 ELDTYAITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 1103
Query: 413 MWKWLQEP 420
M WLQ+P
Sbjct: 1104 MQLWLQDP 1111
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 53/78 (67%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E+++T E+A+++ +L +++I Q +F VL SQG++S++L PKPW+KL G+E F
Sbjct: 604 EDMDTAEIARQVKEQLIKHNIGQRVFGHYVLGLSQGSVSEILARPKPWNKLTIRGKEPFH 663
Query: 412 RMWKWLQEPEFQRMSALR 429
+M ++L + Q + ALR
Sbjct: 664 KMRQFLADE--QNILALR 679
>gi|308470425|ref|XP_003097446.1| hypothetical protein CRE_16984 [Caenorhabditis remanei]
gi|308240154|gb|EFO84106.1| hypothetical protein CRE_16984 [Caenorhabditis remanei]
Length = 338
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
INTKE+ I +K+ I Q FA++VL ++QG S + RNP PW +L + G+ F RM
Sbjct: 189 INTKEVMGDIKTWMKKAGINQTKFAEKVLEKTQGHFSVISRNPTPWEELLAPGKSVFVRM 248
Query: 414 WKWLQEPEFQRMSAL 428
+ W++ E ++ L
Sbjct: 249 YNWMKLSEEEKNKIL 263
>gi|443689262|gb|ELT91709.1| hypothetical protein CAPTEDRAFT_49550, partial [Capitella teleta]
Length = 76
Score = 58.5 bits (140), Expect = 6e-06, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
I+T++L +++ L ++SI Q +F + VL SQG++SDLL PKPW L + GRE F RM
Sbjct: 10 IDTEDLVRQVKDMLSQFSISQRLFGENVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRM 69
Query: 414 WKWLQE 419
+L++
Sbjct: 70 KLFLEQ 75
>gi|326667506|ref|XP_690156.5| PREDICTED: sb:cb1008 [Danio rerio]
Length = 642
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + QR+ L ++I Q +F + VL +Q ++S+LL +PKPW+KL G+E F R
Sbjct: 410 ELDTFSITQRVKKTLIVHNIGQRVFGEEVLGLTQSSVSELLSHPKPWTKLSLKGKENFIR 469
Query: 413 MWKWLQEPE-FQRMSA 427
M WL++P Q+++A
Sbjct: 470 MHLWLRDPHSVQKLNA 485
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+E++T ++ Q++ L+ +I Q F VL SQG++SD+L PKPWSKL + G+E F
Sbjct: 182 QELDTLQVTQQVKQALQSRNIGQRAFGHYVLGLSQGSVSDILARPKPWSKLTRRGKEPFL 241
Query: 412 RMWKWL 417
RM +L
Sbjct: 242 RMKHFL 247
>gi|256078010|ref|XP_002575291.1| cut-like 1 (ctl1) [Schistosoma mansoni]
gi|353231499|emb|CCD77917.1| cut-like 1 (ctl1) [Schistosoma mansoni]
Length = 510
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 1/75 (1%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETF 410
L +++T + +I L+R+SIPQ +F VL Q ++SD+L +PWS L R +
Sbjct: 190 LTDLDTASMCGKIKELLQRHSIPQRLFGDVVLGMCQASVSDILSKTRPWSHLSNKARIPY 249
Query: 411 RRMWKWLQEPEFQRM 425
R+ WLQEP+ M
Sbjct: 250 VRLHLWLQEPDHLEM 264
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 386 SQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQEPE 421
SQG++SDLL PKPW L + GRE F RM +L P+
Sbjct: 31 SQGSVSDLLARPKPWKLLTQKGREPFIRMQLFLNNPD 67
>gi|47224998|emb|CAF97413.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1183
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 856 ELDTYAITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 915
Query: 413 MWKWLQEP 420
M WLQ+P
Sbjct: 916 MQLWLQDP 923
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 53/78 (67%), Gaps = 3/78 (3%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
E+++T E+A+++ +L +++I Q +F VL SQG++S++L PKPW+KL G+E F
Sbjct: 466 EDMDTAEIARQVKEQLIKHNIGQRVFGHYVLGLSQGSVSEILARPKPWNKLTIRGKEPFH 525
Query: 412 RMWKWLQEPEFQRMSALR 429
+M ++L + Q + ALR
Sbjct: 526 KMRQFLADE--QNILALR 541
>gi|392332673|ref|XP_003752656.1| PREDICTED: cut-like homeobox 2 [Rattus norvegicus]
Length = 1341
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 906 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 965
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 966 MRLWLSDP 973
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 53/84 (63%), Gaps = 3/84 (3%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-S 405
G E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL
Sbjct: 632 GAGPEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVK 691
Query: 406 GRETFRRMWKWLQEPEFQRMSALR 429
G+E F +M ++L + Q + ALR
Sbjct: 692 GKEPFIKMKQFLSDE--QNVLALR 713
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 56/107 (52%), Gaps = 14/107 (13%)
Query: 321 IISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSI--PQAI- 377
I +Q ++T + TP A+ LE+ + +Q+ RY++ P+++
Sbjct: 715 IQVRQRGSITPRIRTPETGSDDAIK----SILEQAKKEIESQKGECSKARYNLQDPRSLP 770
Query: 378 FAQR------VLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWL 417
AQR VL SQG++SD+L PKPWSKL + GRE F RM WL
Sbjct: 771 GAQRYATVSLVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWL 817
>gi|308469479|ref|XP_003096977.1| hypothetical protein CRE_21435 [Caenorhabditis remanei]
gi|308241177|gb|EFO85129.1| hypothetical protein CRE_21435 [Caenorhabditis remanei]
Length = 540
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 41/76 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK +A + LKR Q FA ++L R + TLS L NPK W L RE + +M+
Sbjct: 456 VNTKNIANMMKNWLKRTHATQEWFATKILGRCRRTLSRCLNNPKDWKDLSENREIYVKMY 515
Query: 415 KWLQEPEFQRMSALRL 430
W+ + QR +R+
Sbjct: 516 NWMCLTKEQRHEMMRV 531
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 42/78 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+++ QA FA ++L + L ++L P + K+K+G+E F +M
Sbjct: 309 LDTKDIARQMKEWFTLGICTQAFFAVKILGIPRNRLYEILTTPPRFEKMKTGKELFIKMN 368
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L + G
Sbjct: 369 NWLKMSEDVKKEILSIFG 386
>gi|297263540|ref|XP_002798825.1| PREDICTED: homeobox protein cut-like 2-like [Macaca mulatta]
Length = 1197
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 757 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 816
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 817 MQLWLNDP 824
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 605 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 664
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 665 MKQFLSDE--QNVLALR 679
>gi|159163348|pdb|1WH8|A Chain A, Solution Structure Of The Third Cut Domain Of Human
Homeobox Protein Cux-2
Length = 111
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 26 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 85
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 86 MQLWLNDP 93
>gi|345313484|ref|XP_001516287.2| PREDICTED: homeobox protein cut-like 1-like, partial
[Ornithorhynchus anatinus]
Length = 247
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 77 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 136
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 137 MQLWLNDP 144
>gi|357618142|gb|EHJ71236.1| hypothetical protein KGM_08612 [Danaus plexippus]
Length = 311
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T +++R+ L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R+M
Sbjct: 229 LDTLHISRRVRELLSVHNIGQRLFAKYVLGLSQGTVSELLSKPKPWDKLTEKGRDSYRKM 288
Query: 414 WKW 416
W
Sbjct: 289 HAW 291
>gi|332840615|ref|XP_003314025.1| PREDICTED: cut-like homeobox 2 [Pan troglodytes]
Length = 1545
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1105 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1164
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1165 MQLWLNDP 1172
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 645 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 704
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 705 MKQFLSDE--QNVLALR 719
>gi|47217555|emb|CAG02482.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1581
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 54/82 (65%), Gaps = 3/82 (3%)
Query: 349 SGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGR 407
+G+ ++++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+
Sbjct: 656 AGDEDQVDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGK 715
Query: 408 ETFRRMWKWLQEPEFQRMSALR 429
E F +M ++L + Q + ALR
Sbjct: 716 EPFIKMKQFLSDE--QNILALR 735
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +++ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1215 ELDTFTITKKVKEVLTDNNLGQRLFGETILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1274
Query: 413 MWKWLQEPE-FQRMSALR 429
M WL +P +++ A++
Sbjct: 1275 MQLWLNDPHNVEKLRAMK 1292
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 32/46 (69%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKP 399
E++T EL +++ +L + I Q IF ++VL SQG++SD+L PKP
Sbjct: 1026 EVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKP 1071
>gi|390332858|ref|XP_780858.3| PREDICTED: homeobox protein cut-like 1-like [Strongylocentrotus
purpuratus]
Length = 1568
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 47/75 (62%), Gaps = 1/75 (1%)
Query: 344 VSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
+S G +E++T ++A+++ L + +IPQ +F + ++ SQG++SD+L PK W KL
Sbjct: 817 LSCGQYENSKELDTVDIARQVRELLVKGNIPQRVFGEHIIGMSQGSVSDILSKPKTWDKL 876
Query: 404 K-SGRETFRRMWKWL 417
G+E F RM WL
Sbjct: 877 TIKGKEPFIRMKIWL 891
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETF 410
+E ++T + Q++ L+++ + Q F + VL +QG++SDLL PK W KL GRE +
Sbjct: 1018 IEYLDTFAVTQQVKNLLQQHGLGQKAFGESVLGLTQGSVSDLLSKPKMWLKLSMKGREPY 1077
Query: 411 RRMWKWLQE 419
RM+ WLQ+
Sbjct: 1078 VRMYLWLQD 1086
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 43/64 (67%), Gaps = 1/64 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
++T ++A+RI L +I Q +F + VL SQG++SDLL PKPW K+ + GRE F +M
Sbjct: 573 VDTVQVARRIKDILADNNIGQRLFGEVVLNMSQGSVSDLLSRPKPWDKMTAKGREPFIKM 632
Query: 414 WKWL 417
++L
Sbjct: 633 LQFL 636
>gi|403281879|ref|XP_003932400.1| PREDICTED: homeobox protein cut-like 2 [Saimiri boliviensis
boliviensis]
Length = 1510
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 1070 ELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVR 1129
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 1130 MQLWLNDP 1137
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 680 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 739
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 740 MKQFLSDE--QNVLALR 754
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 382 VLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWL 417
VL SQG++SD+L PKPWSKL + GRE F RM WL
Sbjct: 951 VLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWL 987
>gi|380804883|gb|AFE74317.1| protein CASP isoform d, partial [Macaca mulatta]
Length = 344
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 81 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 140
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 141 MQLWLNDP 148
>gi|1790882|gb|AAB41146.1| cut-related homeobox Cux-1 [Mus musculus]
Length = 460
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 65 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 124
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 125 MQLWLNDP 132
>gi|74199902|dbj|BAE20769.1| unnamed protein product [Mus musculus]
Length = 460
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 65 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 124
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 125 MQLWLNDP 132
>gi|380803047|gb|AFE73399.1| homeobox protein cut-like 2, partial [Macaca mulatta]
Length = 150
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 52/81 (64%), Gaps = 3/81 (3%)
Query: 350 GELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRE 408
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 1 AEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKE 60
Query: 409 TFRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 61 PFIKMKQFLSDE--QNVLALR 79
>gi|17559806|ref|NP_504581.1| Protein CEH-49 [Caenorhabditis elegans]
gi|373219674|emb|CCD69359.1| Protein CEH-49 [Caenorhabditis elegans]
Length = 313
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 36/56 (64%)
Query: 375 QAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRL 430
Q FA+ +L R+QG+LS+LL +PKPW + GR ++RM+ WL+ E R +L
Sbjct: 180 QTNFAEHILNRTQGSLSELLNHPKPWDAVSMGRTVYQRMFNWLEMSEDDRAEIWKL 235
>gi|431898196|gb|ELK06891.1| Homeobox protein cut-like 1 [Pteropus alecto]
Length = 642
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + +R+ L ++ Q +F + +L +QG++SDLL PKPW KL GRE F R
Sbjct: 99 ELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVR 158
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 159 MQLWLNDP 166
>gi|321476535|gb|EFX87495.1| hypothetical protein DAPPUDRAFT_43067 [Daphnia pulex]
Length = 95
Score = 56.2 bits (134), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 44/65 (67%), Gaps = 1/65 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFR 411
+ ++T +++ + L ++I Q +FA+ VL SQGT+S+LL PKPW KL + GR+++R
Sbjct: 7 QALDTLAISRGVRELLSMHNIGQRLFAKFVLGLSQGTVSELLSKPKPWDKLTEKGRDSYR 66
Query: 412 RMWKW 416
+M W
Sbjct: 67 KMHAW 71
>gi|308470399|ref|XP_003097433.1| hypothetical protein CRE_16955 [Caenorhabditis remanei]
gi|308240141|gb|EFO84093.1| hypothetical protein CRE_16955 [Caenorhabditis remanei]
Length = 378
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 46/81 (56%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
++ ++T ++A I LK + + FA ++L R++ TLSD++++P+ W L E F
Sbjct: 283 IDYVDTYKIAAEILDWLKTAPVSRDWFAGKILNRTKRTLSDIIKHPRDWKDLNHRTEYFI 342
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
+M+ WL E QR+ + G
Sbjct: 343 KMYNWLNMSEEQRLQIMHCYG 363
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
E NTK+++ + Q+ FA ++L +G L L+R+ + ++ LKSG++ + RM
Sbjct: 103 EANTKKISLEMKIWFSLEICSQSYFAVKILNVDRGRLHSLMRDARDFNTLKSGKDLYTRM 162
Query: 414 WKWLQEPEFQRMSALR--LAGN 433
+ WL+ +R L+ L GN
Sbjct: 163 YNWLKLSSGERDELLKMDLFGN 184
>gi|159163504|pdb|1X2L|A Chain A, Solution Structure Of The Cut Domain Of Human Homeobox
Protein Cux-2 (Cut-Like 2)
Length = 101
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 52/80 (65%), Gaps = 3/80 (3%)
Query: 351 ELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRET 409
E E+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E
Sbjct: 13 EEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEP 72
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F +M ++L + Q + ALR
Sbjct: 73 FIKMKQFLSDE--QNVLALR 90
>gi|308470455|ref|XP_003097461.1| hypothetical protein CRE_16957 [Caenorhabditis remanei]
gi|308240169|gb|EFO84121.1| hypothetical protein CRE_16957 [Caenorhabditis remanei]
Length = 318
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 46/81 (56%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
++ ++T ++A I LK + + FA ++L R++ TLSD++++P+ W L E F
Sbjct: 223 IDYVDTYKIAAEILDWLKTAPVSRDWFAGKILNRTKRTLSDIIKHPRDWKDLNHRTEYFI 282
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
+M+ WL E QR+ + G
Sbjct: 283 KMYNWLNMSEEQRLQIMHCYG 303
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
E NTK+++ + Q+ FA ++L +G L L+R+ + ++ LKSG++ + RM
Sbjct: 48 EANTKKISLEMKIWFSLEICSQSYFAVKILNVDRGRLHSLMRDARDFNTLKSGKDLYTRM 107
Query: 414 WKWLQEPEFQRMSALR--LAGN 433
+ WL+ +R L+ L GN
Sbjct: 108 YNWLKLSSGERDELLKMDLFGN 129
>gi|322791709|gb|EFZ15985.1| hypothetical protein SINV_04152 [Solenopsis invicta]
Length = 993
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 38/57 (66%), Gaps = 1/57 (1%)
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQE 419
+ +L +YSI Q +F + VL SQG++SDLL PKPW L + GRE F RM +L++
Sbjct: 340 VKEQLSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGREPFIRMKMFLED 396
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 561 QDLDTQTITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 620
Query: 412 RMWKWLQEPE-FQRMSALR 429
RM WL + R+ AL+
Sbjct: 621 RMQLWLSDAHNVDRLQALK 639
>gi|308451742|ref|XP_003088784.1| hypothetical protein CRE_20140 [Caenorhabditis remanei]
gi|308245330|gb|EFO89282.1| hypothetical protein CRE_20140 [Caenorhabditis remanei]
Length = 458
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+++ I L+ Q FA ++L R + TL+ L NPK W +L RE + +M
Sbjct: 374 VNTKKISMLIKDWLEETQATQEWFATKILKRCRRTLNQCLNNPKDWKELSQKREIYVKMH 433
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR+ +R+
Sbjct: 434 NWMCLTEEQRLEIMRV 449
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 39/79 (49%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+A ++ QA FA VL + +L P + LK+G+E F +M+
Sbjct: 175 LNTKEIASQMREWFNLAICTQAFFAVHVLGVVRNRFHRVLTVPPLFDSLKTGKELFIKMY 234
Query: 415 KWLQEPEFQRMSALRLAGN 433
WL+ E + L + GN
Sbjct: 235 NWLKLSEDVKKEILSVFGN 253
>gi|308470381|ref|XP_003097424.1| hypothetical protein CRE_16960 [Caenorhabditis remanei]
gi|308240132|gb|EFO84084.1| hypothetical protein CRE_16960 [Caenorhabditis remanei]
Length = 415
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 46/81 (56%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
++ ++T ++A I LK + + FA ++L R++ TLSD++++P+ W L E F
Sbjct: 320 IDYVDTYKIAAEILDWLKTAPVSRDWFAGKILNRTKRTLSDIIKHPRDWKDLNHRTEYFI 379
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
+M+ WL E QR+ + G
Sbjct: 380 KMYNWLNMSEEQRLQIMHCYG 400
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM 413
E NTK+++ + Q+ FA ++L +G L L+R+ + ++ LKSG++ + RM
Sbjct: 145 EANTKKISLEMKIWFSLEICSQSYFAVKILNVDRGRLHSLMRDARDFNTLKSGKDLYTRM 204
Query: 414 WKWLQEPEFQRMSALR--LAGN 433
+ WL+ +R L+ L GN
Sbjct: 205 YNWLKLSSGERDELLKMDLFGN 226
>gi|195131165|ref|XP_002010021.1| GI15688 [Drosophila mojavensis]
gi|193908471|gb|EDW07338.1| GI15688 [Drosophila mojavensis]
Length = 737
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T ++ +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 185 QDLDTHDITTKIKEALLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 244
Query: 412 RMWKWL 417
RM WL
Sbjct: 245 RMQLWL 250
>gi|324509819|gb|ADY44116.1| Homeobox protein ceh-44 [Ascaris suum]
Length = 580
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRM 413
++T+EL ++I L SI Q F + VL SQG++SDLL PKPWS L + GRE F RM
Sbjct: 186 LDTEELVRQIKEYLCANSISQRQFGEHVLGLSQGSVSDLLARPKPWSMLTQKGREPFIRM 245
Query: 414 WKWLQE 419
+L E
Sbjct: 246 RLFLNE 251
>gi|268562972|ref|XP_002646821.1| Hypothetical protein CBG19500 [Caenorhabditis briggsae]
Length = 234
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 43/75 (57%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
+NTK L IS E+K I Q +FA++V S L +LR P PW + S GR T+ R+
Sbjct: 76 LNTKSLCDSISREMKARKITQKVFAKKVANLSHYYLPRILRAPMPWENVSSLGRMTYIRL 135
Query: 414 WKWLQEPEFQRMSAL 428
W+Q PE +RM +
Sbjct: 136 HNWIQLPEKERMEIV 150
>gi|308455341|ref|XP_003090217.1| hypothetical protein CRE_03526 [Caenorhabditis remanei]
gi|308265840|gb|EFP09793.1| hypothetical protein CRE_03526 [Caenorhabditis remanei]
Length = 153
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRMWKWLQEPEFQRMSALRLA 431
I Q+ FA RVL R+QGTLSD L+ PKPW LK G++ + +M +W+ P ++M+ +
Sbjct: 4 ISQSKFANRVLKRTQGTLSDTLKKPKPWEDLKEVGKQIYIQMAEWMSLPLDEKMAKVEEE 63
Query: 432 G 432
G
Sbjct: 64 G 64
>gi|308454598|ref|XP_003089910.1| hypothetical protein CRE_09896 [Caenorhabditis remanei]
gi|308267791|gb|EFP11744.1| hypothetical protein CRE_09896 [Caenorhabditis remanei]
Length = 306
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 44/75 (58%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+TK ++ LK + Q FA++VL ++QG S + RNP PW +L + GR F RM
Sbjct: 157 IDTKRTVGDVNNWLKLNGVNQTKFAEKVLEKTQGHFSVISRNPAPWEELLAPGRAVFVRM 216
Query: 414 WKWLQEPEFQRMSAL 428
+ W++ + ++M L
Sbjct: 217 YNWMKLSDEEKMKIL 231
>gi|308454878|ref|XP_003090027.1| hypothetical protein CRE_06207 [Caenorhabditis remanei]
gi|308266858|gb|EFP10811.1| hypothetical protein CRE_06207 [Caenorhabditis remanei]
Length = 419
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+++ I L++ Q FA ++L R + TLS L NPK W +L RE + +M
Sbjct: 335 VNTKKISMLIKDWLEKTQATQEWFATKILGRCRRTLSQCLNNPKDWKELSQKREIYVKMH 394
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +++
Sbjct: 395 NWMCLTEEQRHQMMKV 410
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TKE+A +I QA FA VL + +L P P+S +K+G+E F +M+
Sbjct: 188 LDTKEIASQIREWFTLAICSQAFFAVHVLGVVRNRFHRVLTVPPPFSSMKTGKELFIKMY 247
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L + G
Sbjct: 248 NWLKMSEDVKKEILSVVG 265
>gi|308454604|ref|XP_003089913.1| hypothetical protein CRE_09900 [Caenorhabditis remanei]
gi|308267794|gb|EFP11747.1| hypothetical protein CRE_09900 [Caenorhabditis remanei]
Length = 282
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFR 411
E I+TK + L + I Q FA++VL ++QG S + RNP PW +L + GR F
Sbjct: 158 EAIDTKRTVGDVKNWLNKAGINQTTFAEKVLEKTQGHFSVISRNPAPWEELLTPGRAVFV 217
Query: 412 RMWKWLQEPEFQRMSAL 428
RM W++ + ++M L
Sbjct: 218 RMHNWMKLSDEEKMKIL 234
>gi|308451744|ref|XP_003088785.1| hypothetical protein CRE_20139 [Caenorhabditis remanei]
gi|308245331|gb|EFO89283.1| hypothetical protein CRE_20139 [Caenorhabditis remanei]
Length = 404
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK ++ + L+R Q FA ++L R + TL+ L NPK W L RE + +M
Sbjct: 320 VNTKRISVVVKNWLERTQATQEWFATKILKRCRRTLNQCLNNPKDWKDLSQKREIYVKMH 379
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 380 NWMCLTEEQRHEMMRV 395
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A++I QA FA VL + L +L P+P++ LK+G+E F +M+
Sbjct: 176 LDTKDIARQIKEWFTLGICTQAFFAVHVLGTVRNRLHRVLTIPRPFNSLKTGKELFIKMY 235
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L + G
Sbjct: 236 NWLKLSEDVKKEILSIFG 253
>gi|119618353|gb|EAW97947.1| hCG2015287 [Homo sapiens]
Length = 572
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 490 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 549
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 550 MKQFLSDE--QNVLALR 564
>gi|308455020|ref|XP_003090088.1| hypothetical protein CRE_21619 [Caenorhabditis remanei]
gi|308266605|gb|EFP10558.1| hypothetical protein CRE_21619 [Caenorhabditis remanei]
Length = 309
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFR 411
E I+TK + LK ++ Q FA++VL ++QG S + RNP PW +L + GR F
Sbjct: 158 EAIDTKRTVGDVKNWLKSNNVNQTTFAEKVLEKTQGHFSVISRNPAPWEELLAPGRAVFV 217
Query: 412 RMWKWLQEPEFQRMSAL 428
RM W++ + ++M L
Sbjct: 218 RMHNWMKLSDEEKMKIL 234
>gi|308451703|ref|XP_003088768.1| hypothetical protein CRE_24335 [Caenorhabditis remanei]
gi|308245407|gb|EFO89359.1| hypothetical protein CRE_24335 [Caenorhabditis remanei]
Length = 122
Score = 53.5 bits (127), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 46/81 (56%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR 411
++ ++T ++A I LK + + FA ++L R++ TLSD++++P+ W L E F
Sbjct: 37 IDYVDTYKIAAEILDWLKTAPVSRDWFAGKILNRTKRTLSDIIKHPRDWKDLNHRTEYFI 96
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
+M+ WL E QR+ + G
Sbjct: 97 KMYNWLNMSEEQRLQIMHCYG 117
>gi|321476536|gb|EFX87496.1| hypothetical protein DAPPUDRAFT_17828 [Daphnia pulex]
Length = 87
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFR 411
++++T+ + +I L +I Q IF + VL SQG++S+LL PKPW L GRE F
Sbjct: 9 QDLDTQGMTTKIKEVLLANNIGQKIFGEAVLGLSQGSVSELLSKPKPWHMLSIKGREPFI 68
Query: 412 RMWKWLQEP 420
RM WL +P
Sbjct: 69 RMQLWLSDP 77
>gi|308470449|ref|XP_003097458.1| hypothetical protein CRE_16959 [Caenorhabditis remanei]
gi|308240166|gb|EFO84118.1| hypothetical protein CRE_16959 [Caenorhabditis remanei]
Length = 593
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 56/120 (46%), Gaps = 12/120 (10%)
Query: 314 DSPGPPKIISQQTATLTLKTVTP-TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
++ P KI Q AT+T + + N V +NTK++ Q I L R
Sbjct: 451 ENDTPKKIFRQNPATMTAERIRELMNQPVAY-----------MNTKKVTQDIKEWLARTQ 499
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
+ FA ++ R++ TL L PK W +L G+E + R++ W++ E +R +R G
Sbjct: 500 TTRKWFATNIVGRAKRTLVINLNYPKEWKELTRGKEAYVRLYNWMRMSEEERQDIMRFYG 559
Score = 38.9 bits (89), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L L+++ P+ ++ L++ +ETF +M+
Sbjct: 364 VNTTEIVKEIKEWLASSSYTNKYFASNILNIKGNHLTNIFAQPRDFNSLRNTKETFIKMY 423
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 424 NWLEMSEDVRTEMLKM 439
>gi|308454987|ref|XP_003090074.1| hypothetical protein CRE_23218 [Caenorhabditis remanei]
gi|308266627|gb|EFP10580.1| hypothetical protein CRE_23218 [Caenorhabditis remanei]
Length = 515
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 42/76 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK ++ + L+R Q FA+++L RS+ TL L PK W L R+ + +M+
Sbjct: 431 VNTKNISNMMKNWLQRTQATQRWFAKKILGRSRKTLGQCLNKPKDWKDLSQKRKIYVKMY 490
Query: 415 KWLQEPEFQRMSALRL 430
W+ + QR+ +R+
Sbjct: 491 NWMCLTKEQRLEIMRV 506
>gi|297692963|ref|XP_002823798.1| PREDICTED: cut-like homeobox 2 [Pongo abelii]
Length = 809
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 3/77 (3%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
+++T E+A ++ +L +++I Q +F VL SQG++S++L PKPW KL G+E F +
Sbjct: 481 QLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIK 540
Query: 413 MWKWLQEPEFQRMSALR 429
M ++L + Q + ALR
Sbjct: 541 MKQFLSDE--QNVLALR 555
>gi|308454975|ref|XP_003090068.1| hypothetical protein CRE_23220 [Caenorhabditis remanei]
gi|308266621|gb|EFP10574.1| hypothetical protein CRE_23220 [Caenorhabditis remanei]
Length = 312
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+++ I L+ Q FA ++L R + TL+ L NPK W L RE + +M
Sbjct: 228 VNTKKISMLIKDWLEETQATQEWFATKILKRCRRTLNQCLNNPKDWKDLSQKREIYVKMH 287
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 288 NWMCLTEEQRHEMMRI 303
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 41/79 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTKE+A+++ QA FA VL + +L P P+ LK+G+E F +M+
Sbjct: 85 LNTKEIARQMREWFNLAICTQAFFAVHVLGVVRNRFHRVLTVPPPFDSLKTGKELFIKMY 144
Query: 415 KWLQEPEFQRMSALRLAGN 433
WL+ E + L + GN
Sbjct: 145 NWLKLSEDVKKEILSVFGN 163
>gi|380468394|gb|AFD61670.1| onecut1 protein, partial [Macaca mulatta]
Length = 170
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 63/173 (36%), Positives = 91/173 (52%), Gaps = 38/173 (21%)
Query: 92 VASIMDS----SEFRNLH-------AEPTYQTLTNVNGRLSPPGYSPSSSYATLTPLQPL 140
+AS++D S++ + H A P + T+T +PPG S ++Y TLTPLQPL
Sbjct: 2 MASLLDGGSGGSDYHHHHRAPEHSLAGPLHPTMTMAC--ETPPGMSMPTTYTTLTPLQPL 59
Query: 141 PPISTMSDKFAYGH---------------TGNVTGSFTVMQNNIGLG-INSHYT-YDKIQ 183
PPIST+SDKF + H GNV+GSFT+M++ GL +N+ YT Y K
Sbjct: 60 PPISTVSDKFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDV 119
Query: 184 MSPPQHYSP--NNGLSTIVITQENSPLSPQSTYSQNGLNSP-QKSLSPNSYDS 233
Q SP N+GL +I +Q+ P Y+ G P K L+PN +++
Sbjct: 120 AGMGQSLSPLSNSGLGSIHNSQQGLP-----HYAHPGAAMPTDKMLTPNGFEA 167
>gi|443689261|gb|ELT91708.1| hypothetical protein CAPTEDRAFT_49316, partial [Capitella teleta]
Length = 82
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 354 EINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRR 412
E++T + ++ L+ ++ Q +F + VL SQG++S+LL PKPW L GRE F +
Sbjct: 8 ELDTLLITCKVKEVLQFNNLGQKLFGEAVLGLSQGSVSELLSKPKPWHMLSLKGREPFIK 67
Query: 413 MWKWLQEP 420
M WL +P
Sbjct: 68 MHMWLSDP 75
>gi|308454985|ref|XP_003090073.1| hypothetical protein CRE_23214 [Caenorhabditis remanei]
gi|308266626|gb|EFP10579.1| hypothetical protein CRE_23214 [Caenorhabditis remanei]
Length = 319
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 41/76 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK ++ + L+ + Q FA ++L R + TL+ L NPK W +L RE + +M
Sbjct: 235 VDTKRISVVVKNWLEETQVTQEWFATKILKRCRRTLNQCLNNPKDWKELNQKREIYVKMH 294
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 295 NWMCLTEEQRHEMMRV 310
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 14/114 (12%)
Query: 319 PKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIF 378
P+ +S + A L T+ P N ++TK++A+++ QA F
Sbjct: 23 PRSLSAEQAHELLSTLIPDNVY--------------LDTKDIARQMKEWFTFGICTQAFF 68
Query: 379 AQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
A VL + L +L P+P++ LK+G++ + +M+ WL+ E + L + G
Sbjct: 69 AVHVLGTVRNCLHRVLTIPRPFNSLKTGKKLYIKMYNWLKLSEDVKKEILSVFG 122
>gi|308454882|ref|XP_003090029.1| hypothetical protein CRE_06206 [Caenorhabditis remanei]
gi|308266860|gb|EFP10813.1| hypothetical protein CRE_06206 [Caenorhabditis remanei]
Length = 513
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK ++ + L+R Q FA+++L RS+ TL L PK W L R+ + +M
Sbjct: 429 VNTKNISILMKNWLQRTQATQRWFAKKILGRSRKTLGQCLNKPKDWKDLSQKRKIYVKMH 488
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR+ +R+
Sbjct: 489 NWMCLAEEQRLEIMRI 504
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 37/67 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+++ Q+ FA ++L L +L P + KLK+G+E F +M+
Sbjct: 294 LDTKDIARQMREWFTLGICTQSFFAAKILGTPSNRLHTILTTPPRFKKLKAGKELFIKMY 353
Query: 415 KWLQEPE 421
WL+ E
Sbjct: 354 NWLKMSE 360
>gi|308469503|ref|XP_003096989.1| hypothetical protein CRE_21456 [Caenorhabditis remanei]
gi|308241189|gb|EFO85141.1| hypothetical protein CRE_21456 [Caenorhabditis remanei]
Length = 467
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK ++ + L+R Q FA ++L R + TL+ L NPK W L RE + +M
Sbjct: 383 VDTKRISVVVKNWLERTQATQEWFATKILKRCRRTLNQCLNNPKDWKDLTQKREIYVKMH 442
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 443 NWMCLTEEQRHEMMRV 458
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 37/64 (57%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+++ QA FA VL + L +L P+P+ LK+G+E + +M+
Sbjct: 177 LDTKDIARQMKKWFSLTICTQAFFALNVLGTVRNRLHRVLTIPRPFDSLKAGKELYIKMY 236
Query: 415 KWLQ 418
WL+
Sbjct: 237 NWLK 240
>gi|308469517|ref|XP_003096996.1| hypothetical protein CRE_21437 [Caenorhabditis remanei]
gi|308241196|gb|EFO85148.1| hypothetical protein CRE_21437 [Caenorhabditis remanei]
Length = 405
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+++ I L+ Q FA +L R + TLS L NPK W L RE + +M
Sbjct: 321 VNTKKISNLIKDWLEETQATQEWFATMILGRCRRTLSQCLNNPKDWKDLSQKREIYVKMH 380
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 381 NWMCLREEQRHKIMRV 396
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 42/78 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TKE+A ++ QA FA VL ++ +L P P+S +K+G+E F +M+
Sbjct: 175 LDTKEIAGQMREWFTLAICTQAFFAVHVLGVARNRFHRVLTIPPPFSSMKNGKELFIKMY 234
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L ++G
Sbjct: 235 NWLKMSEDVKKEILSVSG 252
>gi|341901411|gb|EGT57346.1| hypothetical protein CAEBREN_20884 [Caenorhabditis brenneri]
Length = 389
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 68/123 (55%), Gaps = 8/123 (6%)
Query: 301 PQHSLISNQLQRSDSPGPPKIISQQTATLTLKT-VTPTNAVVGAVSLGNSGELEEINTKE 359
PQ+S+ + QRS P K++ ++ + L+ V ++V+ L E++ I+TKE
Sbjct: 196 PQYSVGVDYEQRS----PQKLVQEEQPEVMLEAEVIALDSVLANQLLDE--EVDFIDTKE 249
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQ 418
+ R+ + + Q FA+R+L SQ +LS ++ +PK + LK G++ + RM+ +L+
Sbjct: 250 MMGRLKVWMTKTGKLQKSFAERILNLSQSSLSTMISSPKTFGNLKRGGKKQYYRMFNFLK 309
Query: 419 EPE 421
E
Sbjct: 310 YDE 312
>gi|308468817|ref|XP_003096649.1| hypothetical protein CRE_29106 [Caenorhabditis remanei]
gi|308241596|gb|EFO85548.1| hypothetical protein CRE_29106 [Caenorhabditis remanei]
Length = 393
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 42/76 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK +++ + L+ + FA ++ R++ TL D L NPK W+ + G+E F RM
Sbjct: 296 VDTKRVSKDLKEWLRSTKTSREWFATTIMERAKRTLRDHLNNPKDWNDMVKGQEIFLRMH 355
Query: 415 KWLQEPEFQRMSALRL 430
W+ E +R + LR
Sbjct: 356 NWMAISEEERQAILRF 371
>gi|156371439|ref|XP_001628771.1| predicted protein [Nematostella vectensis]
gi|156215756|gb|EDO36708.1| predicted protein [Nematostella vectensis]
gi|400621374|gb|AFP87452.1| onecut-like protein [Nematostella vectensis]
Length = 984
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 356 NTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLR--NPKPWSKLKSGRETFRRM 413
N KE+ +R+S LKR + Q FA+ +L R Q TL+ LL + W K + R + +M
Sbjct: 51 NPKEIVERVSLWLKRNRVSQIQFAKTILGRGQPTLNALLAQGQTEEWPKEGTKRRHYIKM 110
Query: 414 WKWLQEPEFQRMSALRLAG 432
+ ++++ + Q S LRLAG
Sbjct: 111 YDFMEDEDLQ--STLRLAG 127
>gi|308447215|ref|XP_003087366.1| hypothetical protein CRE_13242 [Caenorhabditis remanei]
gi|308257114|gb|EFP01067.1| hypothetical protein CRE_13242 [Caenorhabditis remanei]
Length = 379
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK ++ + L+R Q FA+++L RS+ TL L PK W L R+ + +M
Sbjct: 295 VNTKNISNMMKNWLQRTQATQRWFAKKILGRSRKTLGQCLNKPKDWKDLSQKRKIYVKMH 354
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR+ +R+
Sbjct: 355 NWMCLAEEQRLEIMRV 370
>gi|358339103|dbj|GAA47225.1| homeobox protein cut-like 1 [Clonorchis sinensis]
Length = 1462
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 352 LEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETF 410
L E++T +L R L+R+++ Q +FA+ V S+ ++DLL + W+ L R +
Sbjct: 1023 LPELDTWDLCIRTKNYLRRHNVSQRLFAEAVFGMSEAWVNDLLNRTERWASLTPRSRLAY 1082
Query: 411 RRMWKWLQEP 420
R++ WLQ+P
Sbjct: 1083 ARLYVWLQQP 1092
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
++T +L + L ++ + + + + + TLS LLR PKPWS+L S RE + ++
Sbjct: 436 LDTAKLVDEVRRRLLFLNVSRRMLSSELKGTNYRTLSYLLRKPKPWSQLSSHAREIYLQL 495
Query: 414 WKWLQ 418
WL+
Sbjct: 496 DTWLK 500
>gi|308450292|ref|XP_003088246.1| hypothetical protein CRE_31633 [Caenorhabditis remanei]
gi|308248655|gb|EFO92607.1| hypothetical protein CRE_31633 [Caenorhabditis remanei]
Length = 238
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+TK + LK I Q FA++VL ++QG S + RNP PW +L + GR F RM
Sbjct: 89 IDTKRTVGDVKNWLKMNGINQTKFAEKVLEKTQGHFSVISRNPAPWEELLAPGRAVFVRM 148
Query: 414 WKWLQEPEFQRMSAL 428
W++ ++M L
Sbjct: 149 HNWMKLSHEEKMKIL 163
>gi|323650188|gb|ADX97180.1| hepatocyte nuclear factor 6 [Perca flavescens]
Length = 121
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 21/21 (100%), Positives = 21/21 (100%)
Query: 412 RMWKWLQEPEFQRMSALRLAG 432
RMWKWLQEPEFQRMSALRLAG
Sbjct: 1 RMWKWLQEPEFQRMSALRLAG 21
>gi|308470469|ref|XP_003097468.1| hypothetical protein CRE_16954 [Caenorhabditis remanei]
gi|308240176|gb|EFO84128.1| hypothetical protein CRE_16954 [Caenorhabditis remanei]
Length = 598
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 12/120 (10%)
Query: 314 DSPGPPKIISQQTATLTLKTVTP-TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
++ P KII Q AT+T + ++ N V +S TK++ I + +
Sbjct: 456 ENETPKKIIRQNPATMTAERISELMNQPVAYMS-----------TKKVTSDIKMWMAKTR 504
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
+ FA ++ R++ TL L PK W++L G+E + R++ W++ E +R +R G
Sbjct: 505 TTRKWFATNIMGRAKRTLVINLNYPKEWNELTRGKEIYVRLYNWMRMSEEERQDIMRFYG 564
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L L+++ P+ ++ L++ +ETF +M+
Sbjct: 369 VNTSEIVKEIKEWLGSSSYTNKYFASNILNIKGNHLTNIFAQPRDFNSLRNTKETFVKMY 428
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 429 NWLEMSEDVRTEMLKM 444
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L + L+++ + ++ L++ +ETF +M+
Sbjct: 154 VNTTEIVKEIKEWLGSSSYTNKFFASNILNIKEKNLTNIFAQKRDFNSLRNTKETFIKMY 213
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 214 NWLEMSEDIRTEMLKM 229
>gi|308470403|ref|XP_003097435.1| hypothetical protein CRE_16983 [Caenorhabditis remanei]
gi|308240143|gb|EFO84095.1| hypothetical protein CRE_16983 [Caenorhabditis remanei]
Length = 368
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+TK+ + LK I Q FA+ VL ++QG S + RNP PW +L + GR F RM
Sbjct: 219 IDTKQTVGNVKNWLKLNRINQTKFAEMVLEKTQGHFSVISRNPAPWEELLAPGRAVFARM 278
Query: 414 WKWLQ 418
W++
Sbjct: 279 HNWMK 283
>gi|308451707|ref|XP_003088770.1| hypothetical protein CRE_24333 [Caenorhabditis remanei]
gi|308245409|gb|EFO89361.1| hypothetical protein CRE_24333 [Caenorhabditis remanei]
Length = 439
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 12/120 (10%)
Query: 314 DSPGPPKIISQQTATLTLKTVTP-TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
++ P KI Q AT+T + + N V +S TK++ I L R
Sbjct: 297 ENDTPKKIFRQNPATMTAERIRELMNNPVTYMS-----------TKKVTSDIKMWLARTQ 345
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
+ FA ++ R++ TL L PK W +L G+E + R++ W++ E +R +R G
Sbjct: 346 TTRKWFATNIMGRAKRTLVINLNYPKEWEELTRGKEIYVRLYNWMRMSEEERQDIMRFYG 405
Score = 39.3 bits (90), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L + L+++ + ++ L++ +ETF +M+
Sbjct: 210 VNTTEIIKEIKEWLGSSSNTNKFFASNILNIKEKNLTNIFAQKRDFNSLRNTKETFIKMY 269
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 270 NWLEMSEDMRTEMLKM 285
>gi|308454886|ref|XP_003090031.1| hypothetical protein CRE_06203 [Caenorhabditis remanei]
gi|308266862|gb|EFP10815.1| hypothetical protein CRE_06203 [Caenorhabditis remanei]
Length = 474
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK ++ + L+ Q FA ++L R + TL+ L NPK W +L RE + +M
Sbjct: 390 VDTKRISVVVKNWLEITQATQEWFATKILKRCRRTLNQCLNNPKDWKELNQKREIYVKMH 449
Query: 415 KWLQEPEFQRMSALRL 430
W+ E QR +R+
Sbjct: 450 NWMCLTEEQRHEMMRV 465
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 42/78 (53%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+++ QA FA VL + L +L P+P+ LK+G+E + +M+
Sbjct: 200 LDTKDIARQMKEWFTFGICTQAYFAGNVLGTVRNRLHRVLTIPRPFDSLKAGKELYIKMY 259
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L + G
Sbjct: 260 NWLKLSEDVKNEILSIFG 277
>gi|308455024|ref|XP_003090090.1| hypothetical protein CRE_21618 [Caenorhabditis remanei]
gi|308266607|gb|EFP10560.1| hypothetical protein CRE_21618 [Caenorhabditis remanei]
Length = 270
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+TK + LK I Q FA++VL ++QG S + RNP PW +L + GR F RM
Sbjct: 121 IDTKRTVGDVKNWLKMNGINQTKFAEKVLEKTQGHFSVISRNPAPWEELLAPGRAVFVRM 180
Query: 414 WKWLQ 418
WL+
Sbjct: 181 HNWLK 185
>gi|341901471|gb|EGT57406.1| hypothetical protein CAEBREN_05998 [Caenorhabditis brenneri]
Length = 415
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 8/123 (6%)
Query: 301 PQHSLISNQLQRSDSPGPPKIISQQTATLTLKT-VTPTNAVVGAVSLGNSGELEEINTKE 359
PQ+S+ + QRS P K++ ++ + L+ V ++V+ L ++E I+TKE
Sbjct: 222 PQYSVGVDYEQRS----PQKLVEEKQPEVMLEAEVIALDSVLANQWLDE--KVEYIDTKE 275
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQ 418
+ R+ + + Q FA+++L SQ +LS ++ +PK + LK G++ + RM+ +L+
Sbjct: 276 MMGRLKEWMTKTGNLQKNFAEKILNLSQSSLSTMISSPKTFGNLKRGGKKQYYRMFNFLK 335
Query: 419 EPE 421
E
Sbjct: 336 FDE 338
>gi|308449459|ref|XP_003087970.1| hypothetical protein CRE_23843 [Caenorhabditis remanei]
gi|308250722|gb|EFO94674.1| hypothetical protein CRE_23843 [Caenorhabditis remanei]
Length = 394
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 12/116 (10%)
Query: 318 PPKIISQQTATLTLKTVTP-TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQA 376
P KI Q AT+T + + N V +NT+++ I + R +
Sbjct: 256 PKKIFRQNPATMTAERIRELMNQPVAY-----------MNTRKVTSDIKMWMARTQTTRK 304
Query: 377 IFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
FA ++ R++ TL L PK W +L G+E + R++ W++ E +R ++ G
Sbjct: 305 WFATNIMGRAKRTLVINLNYPKEWEELTRGKEVYVRLYNWMRMSEEERQDIMKFYG 360
>gi|195131161|ref|XP_002010019.1| GI15686 [Drosophila mojavensis]
gi|193908469|gb|EDW07336.1| GI15686 [Drosophila mojavensis]
Length = 516
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 35/51 (68%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
+ ++T +++R+ L ++I Q +FA+ +L SQGT+S+LL PKPW KL
Sbjct: 449 QALDTLHISRRVRELLSVHNIGQRLFAKYILGLSQGTVSELLSKPKPWDKL 499
>gi|308470377|ref|XP_003097422.1| hypothetical protein CRE_16958 [Caenorhabditis remanei]
gi|308240130|gb|EFO84082.1| hypothetical protein CRE_16958 [Caenorhabditis remanei]
Length = 387
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 43/76 (56%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+ ++ K I + F++ +L +++ + +L + + WSKLK G+E + R++
Sbjct: 281 MNTKKTIAKLKEWFKTSGITKVWFSENILGKNRKAIHHVLHDTREWSKLKQGKENYERVF 340
Query: 415 KWLQEPEFQRMSALRL 430
W++ E +R + L
Sbjct: 341 NWMRISEHERQEIIWL 356
>gi|341901414|gb|EGT57349.1| hypothetical protein CAEBREN_00796 [Caenorhabditis brenneri]
Length = 403
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 67/123 (54%), Gaps = 8/123 (6%)
Query: 301 PQHSLISNQLQRSDSPGPPKIISQQTATLTLKT-VTPTNAVVGAVSLGNSGELEEINTKE 359
PQ+S+ + QRS P K++ ++ + L+ V ++V+ L +++ I+TKE
Sbjct: 210 PQYSVGVDYEQRS----PIKLVQEERPEVMLEAEVIALDSVLANELLDE--KVDYIDTKE 263
Query: 360 LAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQ 418
+ R+ + + Q FA+++L SQ +LS ++ PK + LK G++ + RM+ +L+
Sbjct: 264 MMGRLKEWMTKTGNLQKSFAEKILNLSQSSLSTMISRPKSFGNLKRGGKKQYYRMFNFLK 323
Query: 419 EPE 421
E
Sbjct: 324 YDE 326
>gi|308469497|ref|XP_003096986.1| hypothetical protein CRE_21459 [Caenorhabditis remanei]
gi|308241186|gb|EFO85138.1| hypothetical protein CRE_21459 [Caenorhabditis remanei]
Length = 450
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 38/75 (50%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK ++ + L+ Q FA ++L R + TL+ L NPK W L RE + +M
Sbjct: 366 VDTKRISVVVKNWLEETQATQEWFATKILKRCRRTLNQCLNNPKGWKDLSQKREIYVKMH 425
Query: 415 KWLQEPEFQRMSALR 429
W+ E QR +R
Sbjct: 426 NWMCLTEEQRHEMMR 440
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 43/78 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TK++A+++ QA FA VL + L +L P+P++ LK+G+E + +M+
Sbjct: 176 LDTKDIARQMREWFTLGICTQAFFAVHVLGTVRNRLHRVLTIPRPFNSLKAGKELYIKMY 235
Query: 415 KWLQEPEFQRMSALRLAG 432
WL+ E + L + G
Sbjct: 236 NWLKLSEDVKKEILSIFG 253
>gi|308454977|ref|XP_003090069.1| hypothetical protein CRE_23219 [Caenorhabditis remanei]
gi|308266622|gb|EFP10575.1| hypothetical protein CRE_23219 [Caenorhabditis remanei]
Length = 78
Score = 47.8 bits (112), Expect = 0.012, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 34/63 (53%)
Query: 368 LKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSA 427
L+ Q FA ++L R + TL+ L NPK W +L RE + +M W+ E QR+
Sbjct: 7 LEETQATQEWFATKILKRCRRTLNQCLNNPKDWKELSQKREIYVKMHNWMCLTEEQRLEI 66
Query: 428 LRL 430
+R+
Sbjct: 67 MRV 69
>gi|308454880|ref|XP_003090028.1| hypothetical protein CRE_06205 [Caenorhabditis remanei]
gi|308266859|gb|EFP10812.1| hypothetical protein CRE_06205 [Caenorhabditis remanei]
Length = 188
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 5/72 (6%)
Query: 364 ISAELKRY----SIPQAIFAQRVLCRSQGTLSDLLRNPKPWS-KLKSGRETFRRMWKWLQ 418
+S E+KR+ + + FA ++L R + TL+D + NPK W+ +++ + +M+ W+
Sbjct: 92 VSIEVKRWLEISQVCEEWFATKILKRWRTTLTDAINNPKDWNVGIRNNNNMYAKMYNWMS 151
Query: 419 EPEFQRMSALRL 430
E QR LRL
Sbjct: 152 MTEKQRQEILRL 163
>gi|268535068|ref|XP_002632667.1| Hypothetical protein CBG21591 [Caenorhabditis briggsae]
Length = 338
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 355 INTKELAQRISAELKRYS-----IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRE 408
INT ++ + +R+ + Q FA+ VL R+Q S + +NPKPW L + G+
Sbjct: 189 INTADIINACESSRRRFGFNGKMLSQKTFAELVLHRTQAYYSSICKNPKPWESLTEMGKA 248
Query: 409 TFRRMWKWLQEPEFQRMSALR 429
+ R++ W+Q + ++ + L+
Sbjct: 249 LYVRIYNWIQLNDDEKKTYLK 269
>gi|260825474|ref|XP_002607691.1| hypothetical protein BRAFLDRAFT_82863 [Branchiostoma floridae]
gi|229293040|gb|EEN63701.1| hypothetical protein BRAFLDRAFT_82863 [Branchiostoma floridae]
Length = 407
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query: 353 EEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRET-FR 411
E ++ ++A + L++ + QA+F++ VL R +GT S+L+ NP PW K++ R+ ++
Sbjct: 298 EVLDPAQIANTLRDTLEKNKLTQAMFSRLVLQRGEGTFSELMNNPGPWEKMQPRRQNLYK 357
Query: 412 RMWKWLQEPEFQR-MSALRLA 431
M +W++ R + LR A
Sbjct: 358 TMKQWMEPDVLDRNVQELRTA 378
Score = 45.1 bits (105), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLK-SGRETFRRM 413
++T+++A+ I L + Q FA+ VL R G + DLL +PKPW KL+ R+ + M
Sbjct: 157 LDTEDVAKNIRKILHMNDLQQKCFAKHVLRRLPGIVGDLLNHPKPWKKLRPITRQLYTAM 216
Query: 414 WKWL 417
W+
Sbjct: 217 RDWM 220
>gi|308470357|ref|XP_003097412.1| hypothetical protein CRE_16961 [Caenorhabditis remanei]
gi|308240120|gb|EFO84072.1| hypothetical protein CRE_16961 [Caenorhabditis remanei]
Length = 414
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 43/76 (56%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NTK+ ++ + I + F++ +L +++ + +L + + WSKLK G+E + R++
Sbjct: 305 MNTKKTIAKLKEWFETSGITKVWFSENILGKNRKAIHHVLHDTREWSKLKQGKENYERVF 364
Query: 415 KWLQEPEFQRMSALRL 430
W++ E +R + L
Sbjct: 365 NWMRISEHERQEIIWL 380
>gi|159163394|pdb|1WIZ|A Chain A, Solution Structure Of The First Cut Domain Of Kiaa1034
Protein
Length = 101
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 49/86 (56%), Gaps = 6/86 (6%)
Query: 345 SLGNSGELEEINTK-----ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK- 398
S G+SG+ E N+ ++ Q++ ELKR S+ QA+FA+ R+QG LS++LR +
Sbjct: 2 SSGSSGKPEPTNSSVEVSPDIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEED 61
Query: 399 PWSKLKSGRETFRRMWKWLQEPEFQR 424
P + +S R M +L PE +R
Sbjct: 62 PRTASQSLLVNLRAMQNFLNLPEVER 87
>gi|327274899|ref|XP_003222213.1| PREDICTED: DNA-binding protein SATB1-like isoform 2 [Anolis
carolinensis]
Length = 788
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|410904579|ref|XP_003965769.1| PREDICTED: DNA-binding protein SATB1-like [Takifugu rubripes]
Length = 537
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 385 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 444
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 445 QLPEAER 451
>gi|327274897|ref|XP_003222212.1| PREDICTED: DNA-binding protein SATB1-like isoform 1 [Anolis
carolinensis]
Length = 756
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|308470385|ref|XP_003097426.1| hypothetical protein CRE_16946 [Caenorhabditis remanei]
gi|308240134|gb|EFO84086.1| hypothetical protein CRE_16946 [Caenorhabditis remanei]
Length = 305
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+T+ + LK I Q FA+ VL ++QG S + RNP W +L + GR F RM
Sbjct: 156 IDTRRTVGDVKNWLKLNGINQTKFAEMVLEKTQGHFSVISRNPASWEELLAPGRAVFVRM 215
Query: 414 WKWLQEPEFQRMSAL 428
W++ + ++M L
Sbjct: 216 HNWMKLSDEEKMKIL 230
>gi|432882487|ref|XP_004074055.1| PREDICTED: DNA-binding protein SATB1-like [Oryzias latipes]
Length = 872
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 11/108 (10%)
Query: 318 PPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAI 377
PP+ ++ L++ TPT + VG+ + ++ Q + ELKR I QAI
Sbjct: 351 PPQCGLGGGSSTGLQSQTPTGSSVGSADV----------PADIYQCVREELKRAGISQAI 400
Query: 378 FAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEFQR 424
FA+ R+QG LS++LR + P +S R M +LQ PE +R
Sbjct: 401 FARVAFNRTQGLLSEILRKEEDPKHASQSLLVNLRAMHSFLQLPEAER 448
>gi|335298720|ref|XP_003358374.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Sus scrofa]
Length = 790
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395816579|ref|XP_003781778.1| PREDICTED: DNA-binding protein SATB1-like isoform 1 [Otolemur
garnettii]
Length = 764
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395540237|ref|XP_003772064.1| PREDICTED: DNA-binding protein SATB1 isoform 3 [Sarcophilus
harrisii]
Length = 754
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395540235|ref|XP_003772063.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Sarcophilus
harrisii]
Length = 762
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|354476593|ref|XP_003500509.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Cricetulus griseus]
Length = 763
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|335298718|ref|XP_003358373.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Sus scrofa]
Length = 760
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|4506791|ref|NP_002962.1| DNA-binding protein SATB1 isoform 1 [Homo sapiens]
gi|196114999|ref|NP_001124482.1| DNA-binding protein SATB1 isoform 1 [Homo sapiens]
gi|114585655|ref|XP_001162368.1| PREDICTED: DNA-binding protein SATB1 isoform 9 [Pan troglodytes]
gi|114585657|ref|XP_001162452.1| PREDICTED: DNA-binding protein SATB1 isoform 11 [Pan troglodytes]
gi|397511759|ref|XP_003826234.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Pan paniscus]
gi|397511761|ref|XP_003826235.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Pan paniscus]
gi|426339649|ref|XP_004033756.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Gorilla gorilla
gorilla]
gi|426339651|ref|XP_004033757.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Gorilla gorilla
gorilla]
gi|417747|sp|Q01826.1|SATB1_HUMAN RecName: Full=DNA-binding protein SATB1; AltName: Full=Special
AT-rich sequence-binding protein 1
gi|337811|gb|AAA60304.1| putative [Homo sapiens]
gi|12804639|gb|AAH01744.1| SATB1 protein [Homo sapiens]
gi|119584695|gb|EAW64291.1| special AT-rich sequence binding protein 1 (binds to nuclear
matrix/scaffold-associating DNA's), isoform CRA_a [Homo
sapiens]
gi|119584696|gb|EAW64292.1| special AT-rich sequence binding protein 1 (binds to nuclear
matrix/scaffold-associating DNA's), isoform CRA_a [Homo
sapiens]
gi|119584697|gb|EAW64293.1| special AT-rich sequence binding protein 1 (binds to nuclear
matrix/scaffold-associating DNA's), isoform CRA_a [Homo
sapiens]
gi|123981934|gb|ABM82796.1| special AT-rich sequence binding protein 1 (binds to nuclear
matrix/scaffold-associating DNA's) [synthetic construct]
gi|123996765|gb|ABM85984.1| special AT-rich sequence binding protein 1 (binds to nuclear
matrix/scaffold-associating DNA's) [synthetic construct]
gi|193785310|dbj|BAG54463.1| unnamed protein product [Homo sapiens]
gi|306921249|dbj|BAJ17704.1| SATB homeobox 1 [synthetic construct]
gi|410211720|gb|JAA03079.1| SATB homeobox 1 [Pan troglodytes]
gi|410295472|gb|JAA26336.1| SATB homeobox 1 [Pan troglodytes]
gi|410342547|gb|JAA40220.1| SATB homeobox 1 [Pan troglodytes]
Length = 763
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|344288087|ref|XP_003415782.1| PREDICTED: DNA-binding protein SATB1-like isoform 1 [Loxodonta
africana]
Length = 773
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|453371|gb|AAA17372.1| SATB1 [Mus musculus]
Length = 764
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395816581|ref|XP_003781779.1| PREDICTED: DNA-binding protein SATB1-like isoform 2 [Otolemur
garnettii]
Length = 796
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|149634139|ref|XP_001508630.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Ornithorhynchus
anatinus]
Length = 755
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|126341419|ref|XP_001369656.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Monodelphis
domestica]
Length = 763
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|254826759|ref|NP_001157102.1| DNA-binding protein SATB1 [Mus musculus]
gi|254826761|ref|NP_033148.2| DNA-binding protein SATB1 [Mus musculus]
gi|254826763|ref|NP_001157103.1| DNA-binding protein SATB1 [Mus musculus]
gi|254826765|ref|NP_001157104.1| DNA-binding protein SATB1 [Mus musculus]
gi|341942011|sp|Q60611.2|SATB1_MOUSE RecName: Full=DNA-binding protein SATB1; AltName: Full=Special
AT-rich sequence-binding protein 1
gi|15029818|gb|AAH11132.1| Satb1 protein [Mus musculus]
gi|26353594|dbj|BAC40427.1| unnamed protein product [Mus musculus]
gi|71060069|emb|CAJ18578.1| Satb1 [Mus musculus]
gi|74153144|dbj|BAE34542.1| unnamed protein product [Mus musculus]
gi|148691705|gb|EDL23652.1| special AT-rich sequence binding protein 1, isoform CRA_b [Mus
musculus]
Length = 764
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|314122339|ref|NP_001186573.1| DNA-binding protein SATB1 [Gallus gallus]
Length = 754
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|156120727|ref|NP_001095510.1| DNA-binding protein SATB1 [Bos taurus]
gi|154425686|gb|AAI51324.1| SATB1 protein [Bos taurus]
gi|296490800|tpg|DAA32913.1| TPA: special AT-rich sequence binding protein 1 [Bos taurus]
Length = 760
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|62089108|dbj|BAD92998.1| special AT-rich sequence binding protein 1 variant [Homo sapiens]
Length = 797
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 376 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 435
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 436 QLPEAER 442
>gi|345788938|ref|XP_542770.3| PREDICTED: DNA-binding protein SATB1 [Canis lupus familiaris]
Length = 730
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|332232301|ref|XP_003265342.1| PREDICTED: DNA-binding protein SATB1 isoform 3 [Nomascus
leucogenys]
Length = 795
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|402861666|ref|XP_003895206.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Papio anubis]
Length = 795
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|306518684|ref|NP_001182399.1| DNA-binding protein SATB1 isoform 2 [Homo sapiens]
gi|114585639|ref|XP_001162331.1| PREDICTED: DNA-binding protein SATB1 isoform 8 [Pan troglodytes]
gi|397511763|ref|XP_003826236.1| PREDICTED: DNA-binding protein SATB1 isoform 3 [Pan paniscus]
gi|168277860|dbj|BAG10908.1| DNA-binding protein SATB1 [synthetic construct]
Length = 795
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|348588939|ref|XP_003480222.1| PREDICTED: DNA-binding protein SATB1 [Cavia porcellus]
Length = 764
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|334349091|ref|XP_003342146.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Monodelphis
domestica]
Length = 794
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|149634137|ref|XP_001508604.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Ornithorhynchus
anatinus]
Length = 787
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|388454244|ref|NP_001252577.1| DNA-binding protein SATB1 [Macaca mulatta]
gi|402861664|ref|XP_003895205.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Papio anubis]
gi|387542598|gb|AFJ71926.1| DNA-binding protein SATB1 isoform 1 [Macaca mulatta]
Length = 763
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|58865828|ref|NP_001012129.1| DNA-binding protein SATB1 [Rattus norvegicus]
gi|55250418|gb|AAH85814.1| SATB homeobox 1 [Rattus norvegicus]
gi|149027423|gb|EDL83030.1| rCG23620, isoform CRA_b [Rattus norvegicus]
Length = 764
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|344288089|ref|XP_003415783.1| PREDICTED: DNA-binding protein SATB1-like isoform 2 [Loxodonta
africana]
Length = 805
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|403265511|ref|XP_003924976.1| PREDICTED: DNA-binding protein SATB1 [Saimiri boliviensis
boliviensis]
Length = 784
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|383408595|gb|AFH27511.1| DNA-binding protein SATB1 isoform 1 [Macaca mulatta]
Length = 763
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|390476402|ref|XP_002759687.2| PREDICTED: DNA-binding protein SATB1 [Callithrix jacchus]
Length = 796
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|297671910|ref|XP_002814066.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Pongo abelii]
gi|297671912|ref|XP_002814067.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Pongo abelii]
gi|332232297|ref|XP_003265340.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Nomascus
leucogenys]
gi|332232299|ref|XP_003265341.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Nomascus
leucogenys]
Length = 763
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395540233|ref|XP_003772062.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Sarcophilus
harrisii]
Length = 794
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|354476595|ref|XP_003500510.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Cricetulus griseus]
Length = 795
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|291399687|ref|XP_002716245.1| PREDICTED: special AT-rich sequence binding protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 761
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|224045357|ref|XP_002196763.1| PREDICTED: DNA-binding protein SATB1 isoform 1 [Taeniopygia
guttata]
Length = 753
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|395734113|ref|XP_003776359.1| PREDICTED: DNA-binding protein SATB1 [Pongo abelii]
Length = 795
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|449492727|ref|XP_004175419.1| PREDICTED: DNA-binding protein SATB1 isoform 2 [Taeniopygia
guttata]
Length = 785
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|410971495|ref|XP_003992204.1| PREDICTED: LOW QUALITY PROTEIN: DNA-binding protein SATB1 [Felis
catus]
Length = 801
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|291399685|ref|XP_002716244.1| PREDICTED: special AT-rich sequence binding protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 764
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|149027422|gb|EDL83029.1| rCG23620, isoform CRA_a [Rattus norvegicus]
Length = 796
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|148691704|gb|EDL23651.1| special AT-rich sequence binding protein 1, isoform CRA_a [Mus
musculus]
Length = 796
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|431917012|gb|ELK16768.1| DNA-binding protein SATB1 [Pteropus alecto]
Length = 759
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 394 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 453
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 454 QLPEAER 460
>gi|281345888|gb|EFB21472.1| hypothetical protein PANDA_003378 [Ailuropoda melanoleuca]
Length = 802
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 376 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 435
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 436 QLPEAER 442
>gi|380793401|gb|AFE68576.1| DNA-binding protein SATB1 isoform 1, partial [Macaca mulatta]
Length = 607
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|308448819|ref|XP_003087763.1| hypothetical protein CRE_01509 [Caenorhabditis remanei]
gi|308253023|gb|EFO96975.1| hypothetical protein CRE_01509 [Caenorhabditis remanei]
Length = 216
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 362 QRISAELKRY----SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSK-LKSGRETFRRMWKW 416
+ +S E+KR+ + + FA ++L R + TL+D + NPK W+ +++ + R+ W
Sbjct: 118 KNVSIEVKRWLEISQVCEEWFATKLLKRWRSTLTDAINNPKDWNDGIRNNNNMYARIHNW 177
Query: 417 LQEPEFQRMSALRL 430
+ E QR LRL
Sbjct: 178 MSMTEEQRQEILRL 191
>gi|194221566|ref|XP_001495643.2| PREDICTED: LOW QUALITY PROTEIN: DNA-binding protein SATB1 [Equus
caballus]
Length = 805
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|308469535|ref|XP_003097005.1| hypothetical protein CRE_21436 [Caenorhabditis remanei]
gi|308241205|gb|EFO85157.1| hypothetical protein CRE_21436 [Caenorhabditis remanei]
Length = 344
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 362 QRISAELKRY----SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSK-LKSGRETFRRMWKW 416
+ +S E+KR+ + + FA ++L R + TL+D + NPK W+ +++ + R+ W
Sbjct: 246 KNVSIEVKRWLEISQVCEEWFATKLLKRWRSTLTDAINNPKDWNDGIRNNNNMYARIHNW 305
Query: 417 LQEPEFQRMSALRL 430
+ E QR LRL
Sbjct: 306 MSMTEEQRQEILRL 319
>gi|348511143|ref|XP_003443104.1| PREDICTED: DNA-binding protein SATB1-like [Oreochromis niloticus]
Length = 789
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 389 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 448
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 449 QLPEAER 455
>gi|432092947|gb|ELK25305.1| DNA-binding protein SATB1, partial [Myotis davidii]
Length = 851
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 376 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 435
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 436 QLPEAER 442
>gi|268535442|ref|XP_002632854.1| Hypothetical protein CBG15050 [Caenorhabditis briggsae]
Length = 351
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 37/70 (52%), Gaps = 6/70 (8%)
Query: 355 INTKELAQRISAELKRYS-----IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRE 408
INT ++ + +R+ + Q FA+ VL R+Q S + +NPKPW L G+
Sbjct: 202 INTADIINACESSRRRFGFNGKMLSQKTFAELVLQRTQAYYSSICKNPKPWESLTDMGKT 261
Query: 409 TFRRMWKWLQ 418
+ R++ W+Q
Sbjct: 262 LYVRIYNWIQ 271
>gi|157835474|pdb|2O49|A Chain A, Crystal Structure Of The N-Terminal Cut Domain Of Satb1
Bound To Matrix Attachment Region Dna
gi|157835477|pdb|2O4A|A Chain A, Crystal Structure Of The N-Terminal Cut Domain Of Satb1
Bound To Matrix Attachment Region Dna
Length = 93
Score = 45.8 bits (107), Expect = 0.046, Method: Composition-based stats.
Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 357 TKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWK 415
+ E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M
Sbjct: 9 SSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQN 68
Query: 416 WLQEPEFQR 424
+LQ PE +R
Sbjct: 69 FLQLPEAER 77
>gi|444511151|gb|ELV09811.1| DNA-binding protein SATB1 [Tupaia chinensis]
Length = 702
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 359 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 418
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 419 QLPEAER 425
>gi|355560057|gb|EHH16785.1| hypothetical protein EGK_12132 [Macaca mulatta]
Length = 815
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 359 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 418
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 419 QLPEAER 425
>gi|355747083|gb|EHH51697.1| hypothetical protein EGM_11125 [Macaca fascicularis]
Length = 815
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 359 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 418
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 419 QLPEAER 425
>gi|301759115|ref|XP_002915404.1| PREDICTED: LOW QUALITY PROTEIN: DNA-binding protein SATB1-like
[Ailuropoda melanoleuca]
Length = 771
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|426219590|ref|XP_004004002.1| PREDICTED: DNA-binding protein SATB1 [Ovis aries]
Length = 833
Score = 45.4 bits (106), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 374 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 433
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 434 QLPEAER 440
>gi|47218777|emb|CAG02763.1| unnamed protein product [Tetraodon nigroviridis]
Length = 799
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 440 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 499
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 500 QLPEAER 506
>gi|395520010|ref|XP_003764131.1| PREDICTED: DNA-binding protein SATB2 [Sarcophilus harrisii]
Length = 787
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 92/205 (44%), Gaps = 31/205 (15%)
Query: 237 QRDLVDRSVLGGGSQSPTLSPNEGSLHSSPTTLVTNFQG-----AATIPTMNGITTITPH 291
+ +L D VLG Q P PN L T TN Q + P N + T+ P
Sbjct: 293 RENLTDYCVLG---QRPMHLPNMNQL---ATLGKTNEQSPHSQIHHSTPIRNQVPTLQPI 346
Query: 292 TV--VVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAV----VGAVS 345
++SPQ SPQ L+ Q+ + +I+QQ A L A+ +
Sbjct: 347 MSPGLLSPQLSPQ--LVRQQIAMA------HLINQQIAVSRLLAHQHPQAINQQFLNHPP 398
Query: 346 LGNSGELEEINTK-----ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-P 399
+ + + E N+ ++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P
Sbjct: 399 IPRAVKPEPTNSSVEVSPDIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDP 458
Query: 400 WSKLKSGRETFRRMWKWLQEPEFQR 424
+ +S R M +L PE +R
Sbjct: 459 RTASQSLLVNLRAMQNFLNLPEVER 483
>gi|308451129|ref|XP_003088556.1| hypothetical protein CRE_15211 [Caenorhabditis remanei]
gi|308246899|gb|EFO90851.1| hypothetical protein CRE_15211 [Caenorhabditis remanei]
Length = 409
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 9/80 (11%)
Query: 355 INTKELAQRISAELKRY----SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRET 409
+NTK R+SAE+K + + Q FA ++ R++ T+ + P+ W+ S G+E
Sbjct: 304 VNTK----RVSAEIKEWLVESQVTQEWFASTIVGRNRRTMGPAINYPRDWNDCASKGQEM 359
Query: 410 FRRMWKWLQEPEFQRMSALR 429
F RM W++ E QR +R
Sbjct: 360 FMRMHNWMKLSEMQRQEIMR 379
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 53/114 (46%), Gaps = 8/114 (7%)
Query: 306 ISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGAVSLGNSGELEEINTKELAQRIS 365
S +Q + P P I S T + +P AV A+ + L EI+ ++ + +
Sbjct: 93 FSATIQENRKPAQPPIKSSTTPSQN----SPPMAVERAIEMLTKPILSEISPIKITEDMK 148
Query: 366 AELKRYSIPQAIFAQRVL--CRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWL 417
+ S Q FA +L CRS+ + LL P + LKSG+E F +M+ WL
Sbjct: 149 EWMTSNSCSQEFFASNILNVCRSR--FNYLLNYPGLYGTLKSGKEYFVKMYNWL 200
>gi|440900787|gb|ELR51847.1| DNA-binding protein SATB1, partial [Bos grunniens mutus]
Length = 631
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 368 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 427
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 428 QLPEAER 434
>gi|355717879|gb|AES06083.1| SATB homeobox 1 [Mustela putorius furo]
Length = 483
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 265 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 324
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 325 QLPEAER 331
>gi|268535064|ref|XP_002632665.1| Hypothetical protein CBG21589 [Caenorhabditis briggsae]
Length = 292
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
Query: 355 INTKELAQRISAELKRYS-----IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRE 408
INT ++ + +R+ + Q FA+ VL R+Q S + +NPKPW L + G+
Sbjct: 143 INTADIINACESSRRRFGFNGKMLSQKTFAELVLHRTQAYYSSICKNPKPWESLTEMGKA 202
Query: 409 TFRRMWKWLQ 418
+ R++ W+Q
Sbjct: 203 LYVRIYNWIQ 212
>gi|85544008|pdb|1YSE|A Chain A, Solution Structure Of The Mar-Binding Domain Of Satb1
Length = 141
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 25 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 84
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 85 QLPEAER 91
>gi|308449490|ref|XP_003087978.1| hypothetical protein CRE_10776 [Caenorhabditis remanei]
gi|308250661|gb|EFO94613.1| hypothetical protein CRE_10776 [Caenorhabditis remanei]
Length = 306
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRM 413
I+TK ++ LK + Q FA++VL ++QG S + RNP PW +L + + F RM
Sbjct: 157 IDTKRTVGDVNNWLKLNGVNQTKFAEKVLEKTQGHFSVISRNPAPWEELLAPEKAVFVRM 216
Query: 414 WKWLQ 418
W++
Sbjct: 217 HNWMK 221
>gi|326675284|ref|XP_001920758.3| PREDICTED: DNA-binding protein SATB1-like [Danio rerio]
Length = 782
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 8/151 (5%)
Query: 275 GAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTV 334
G A +P ++ TP +SPQ Q +++ L + + ++++QQ+ +
Sbjct: 296 GRAQLPGLHPGLVSTP----ISPQLVNQQIVMAQILNQQYAVN--RLLAQQSLSQQYLNH 349
Query: 335 TPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLL 394
P N + E++ ++ Q + ELKR I QA+FA+ R+QG LS++L
Sbjct: 350 PPVNRALNKPLEAQVSSNAEVSA-DIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEIL 408
Query: 395 RNPK-PWSKLKSGRETFRRMWKWLQEPEFQR 424
R + P + +S R M +LQ PE +R
Sbjct: 409 RKEEDPKTASQSLLVNLRAMQNFLQLPEAER 439
>gi|73695230|gb|AAI03493.1| SATB2 protein [Homo sapiens]
Length = 615
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 245 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 304
Query: 418 QEPEFQR 424
PE +R
Sbjct: 305 NLPEVER 311
>gi|449281645|gb|EMC88681.1| DNA-binding protein SATB1, partial [Columba livia]
Length = 694
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 335 EIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 394
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 395 QLPEAER 401
>gi|308451127|ref|XP_003088555.1| hypothetical protein CRE_15210 [Caenorhabditis remanei]
gi|308246898|gb|EFO90850.1| hypothetical protein CRE_15210 [Caenorhabditis remanei]
Length = 266
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 12/120 (10%)
Query: 314 DSPGPPKIISQQTATLTLKTVTP-TNAVVGAVSLGNSGELEEINTKELAQRISAELKRYS 372
++ P KI Q AT+T + V N V +NT+++ I + +
Sbjct: 124 ENDTPKKIFRQNPATMTAERVRELMNQPVAY-----------MNTRKVTSDIKMWMAKTQ 172
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAG 432
+ FA ++ R++ TL L PK W +L G+E + +++ W++ + +R +R G
Sbjct: 173 TTRKWFATNIMGRAKRTLVINLNYPKEWEELTRGKEVYVKLYNWMRMSKEERQDIMRFYG 232
>gi|154147656|ref|NP_001093671.1| SATB homeobox 1 [Xenopus (Silurana) tropicalis]
gi|134025652|gb|AAI36085.1| satb1 protein [Xenopus (Silurana) tropicalis]
Length = 753
Score = 44.7 bits (104), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q + ELKR I QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 375 DIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFL 434
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 435 QLPEAER 441
>gi|432090914|gb|ELK24147.1| DNA-binding protein SATB2 [Myotis davidii]
Length = 481
Score = 44.7 bits (104), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 111 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 170
Query: 418 QEPEFQR 424
PE +R
Sbjct: 171 NLPEVER 177
>gi|268535436|ref|XP_002632851.1| Hypothetical protein CBG15047 [Caenorhabditis briggsae]
Length = 351
Score = 44.7 bits (104), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
Query: 355 INTKELAQRISAELKRYS-----IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRE 408
INT ++ + +R+ + Q F++ VL R+Q S + +NPKPW L + G+
Sbjct: 202 INTADIINACESSRRRFGFNGKMLSQKTFSELVLQRTQAYYSSICKNPKPWESLTEMGKA 261
Query: 409 TFRRMWKWLQ 418
+ R++ W+Q
Sbjct: 262 LYVRIYNWIQ 271
>gi|308451705|ref|XP_003088769.1| hypothetical protein CRE_24334 [Caenorhabditis remanei]
gi|308245408|gb|EFO89360.1| hypothetical protein CRE_24334 [Caenorhabditis remanei]
Length = 403
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 335 TPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL--CRSQGTLSD 392
+P+ V A+ + L EIN ++ + I + S Q FA +L CRS+ +
Sbjct: 142 SPSMIVEQAIKMLTKPILSEINPMKITEDIRDWMTSNSCSQGFFASTILNVCRSR--FNY 199
Query: 393 LLRNPKPWSKLKSGRETFRRMWKWLQEPEFQR 424
LL P+ + LK+G+E F +M+ WL+ + +R
Sbjct: 200 LLNYPELYGTLKTGKEYFVKMYNWLEMSKDER 231
>gi|62088332|dbj|BAD92613.1| SATB family member 2 variant [Homo sapiens]
Length = 763
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 393 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 452
Query: 418 QEPEFQR 424
PE +R
Sbjct: 453 NLPEVER 459
>gi|281353909|gb|EFB29493.1| hypothetical protein PANDA_003907 [Ailuropoda melanoleuca]
Length = 715
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 345 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 404
Query: 418 QEPEFQR 424
PE +R
Sbjct: 405 NLPEVER 411
>gi|431895025|gb|ELK04818.1| DNA-binding protein SATB2 [Pteropus alecto]
Length = 481
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 111 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 170
Query: 418 QEPEFQR 424
PE +R
Sbjct: 171 NLPEVER 177
>gi|297264638|ref|XP_001087625.2| PREDICTED: DNA-binding protein SATB2 [Macaca mulatta]
Length = 808
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 465 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 524
Query: 418 QEPEFQR 424
PE +R
Sbjct: 525 NLPEVER 531
>gi|344268706|ref|XP_003406197.1| PREDICTED: DNA-binding protein SATB2 [Loxodonta africana]
Length = 733
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|355750729|gb|EHH55056.1| hypothetical protein EGM_04187, partial [Macaca fascicularis]
Length = 704
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 334 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 393
Query: 418 QEPEFQR 424
PE +R
Sbjct: 394 NLPEVER 400
>gi|355565073|gb|EHH21562.1| hypothetical protein EGK_04662, partial [Macaca mulatta]
Length = 689
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 319 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 378
Query: 418 QEPEFQR 424
PE +R
Sbjct: 379 NLPEVER 385
>gi|395823650|ref|XP_003785097.1| PREDICTED: DNA-binding protein SATB2 [Otolemur garnettii]
Length = 733
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|5689405|dbj|BAA82986.1| KIAA1034 protein [Homo sapiens]
Length = 761
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 391 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 450
Query: 418 QEPEFQR 424
PE +R
Sbjct: 451 NLPEVER 457
>gi|301760145|ref|XP_002915881.1| PREDICTED: DNA-binding protein SATB2-like [Ailuropoda melanoleuca]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|37360204|dbj|BAC98080.1| mKIAA1034 protein [Mus musculus]
Length = 727
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 357 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 416
Query: 418 QEPEFQR 424
PE +R
Sbjct: 417 NLPEVER 423
>gi|157818121|ref|NP_001102776.1| DNA-binding protein SATB2 [Rattus norvegicus]
gi|149046136|gb|EDL99029.1| similar to KIAA1034-like DNA binding protein (predicted) [Rattus
norvegicus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|354479748|ref|XP_003502071.1| PREDICTED: DNA-binding protein SATB2 [Cricetulus griseus]
gi|344236064|gb|EGV92167.1| DNA-binding protein SATB2 [Cricetulus griseus]
Length = 734
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|440906669|gb|ELR56901.1| DNA-binding protein SATB2 [Bos grunniens mutus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|329663267|ref|NP_001192999.1| DNA-binding protein SATB2 [Bos taurus]
gi|296490402|tpg|DAA32515.1| TPA: special AT-rich sequence binding protein 2-like [Bos taurus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|149731017|ref|XP_001502794.1| PREDICTED: DNA-binding protein SATB2 [Equus caballus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|426221306|ref|XP_004004851.1| PREDICTED: DNA-binding protein SATB2 [Ovis aries]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|20982839|ref|NP_631885.1| DNA-binding protein SATB2 [Mus musculus]
gi|38372503|sp|Q8VI24.1|SATB2_MOUSE RecName: Full=DNA-binding protein SATB2; AltName: Full=Special
AT-rich sequence-binding protein 2
gi|17224928|gb|AAL37172.1|AF319623_1 KIAA1034-like DNA binding protein [Mus musculus]
gi|39918752|emb|CAE54289.1| special AT-rich sequence-binding protein-2 [Mus musculus]
gi|148667626|gb|EDL00043.1| special AT-rich sequence binding protein 2 [Mus musculus]
gi|187952957|gb|AAI38627.1| Special AT-rich sequence binding protein 2 [Mus musculus]
gi|223459946|gb|AAI38626.1| Special AT-rich sequence binding protein 2 [Mus musculus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|402889005|ref|XP_003907823.1| PREDICTED: DNA-binding protein SATB2 [Papio anubis]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|334330332|ref|XP_001379450.2| PREDICTED: DNA-binding protein SATB2 [Monodelphis domestica]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|332209700|ref|XP_003253951.1| PREDICTED: DNA-binding protein SATB2 isoform 1 [Nomascus
leucogenys]
gi|441668528|ref|XP_004092049.1| PREDICTED: DNA-binding protein SATB2 isoform 2 [Nomascus
leucogenys]
gi|441668531|ref|XP_004092050.1| PREDICTED: DNA-binding protein SATB2 isoform 3 [Nomascus
leucogenys]
gi|380785479|gb|AFE64615.1| DNA-binding protein SATB2 [Macaca mulatta]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|296205186|ref|XP_002749651.1| PREDICTED: DNA-binding protein SATB2 isoform 2 [Callithrix jacchus]
gi|296205188|ref|XP_002749652.1| PREDICTED: DNA-binding protein SATB2 isoform 3 [Callithrix jacchus]
gi|403267196|ref|XP_003925735.1| PREDICTED: DNA-binding protein SATB2 [Saimiri boliviensis
boliviensis]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|38016202|ref|NP_056080.1| DNA-binding protein SATB2 [Homo sapiens]
gi|289547596|ref|NP_001165980.1| DNA-binding protein SATB2 [Homo sapiens]
gi|289547625|ref|NP_001165988.1| DNA-binding protein SATB2 [Homo sapiens]
gi|297669124|ref|XP_002812755.1| PREDICTED: DNA-binding protein SATB2 isoform 2 [Pongo abelii]
gi|332815079|ref|XP_516012.3| PREDICTED: DNA-binding protein SATB2 [Pan troglodytes]
gi|397509942|ref|XP_003825368.1| PREDICTED: DNA-binding protein SATB2 isoform 1 [Pan paniscus]
gi|397509944|ref|XP_003825369.1| PREDICTED: DNA-binding protein SATB2 isoform 2 [Pan paniscus]
gi|426338161|ref|XP_004033056.1| PREDICTED: DNA-binding protein SATB2 [Gorilla gorilla gorilla]
gi|13634020|sp|Q9UPW6.2|SATB2_HUMAN RecName: Full=DNA-binding protein SATB2; AltName: Full=Special
AT-rich sequence-binding protein 2
gi|66990047|gb|AAH98136.1| SATB homeobox 2 [Homo sapiens]
gi|71043454|gb|AAH99723.1| SATB homeobox 2 [Homo sapiens]
gi|73695396|gb|AAI03501.1| SATB homeobox 2 [Homo sapiens]
gi|119590586|gb|EAW70180.1| SATB family member 2, isoform CRA_a [Homo sapiens]
gi|119590587|gb|EAW70181.1| SATB family member 2, isoform CRA_a [Homo sapiens]
gi|158254772|dbj|BAF84152.1| unnamed protein product [Homo sapiens]
gi|167773579|gb|ABZ92224.1| SATB homeobox 2 [synthetic construct]
gi|168269634|dbj|BAG09944.1| DNA-binding protein SATB2 [synthetic construct]
gi|410209152|gb|JAA01795.1| SATB homeobox 2 [Pan troglodytes]
gi|410266950|gb|JAA21441.1| SATB homeobox 2 [Pan troglodytes]
gi|410348684|gb|JAA40946.1| SATB homeobox 2 [Pan troglodytes]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|410969132|ref|XP_003991050.1| PREDICTED: DNA-binding protein SATB2 [Felis catus]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|193786488|dbj|BAG51771.1| unnamed protein product [Homo sapiens]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|345797303|ref|XP_003434297.1| PREDICTED: DNA-binding protein SATB2 [Canis lupus familiaris]
Length = 733
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|47222219|emb|CAG11098.1| unnamed protein product [Tetraodon nigroviridis]
Length = 869
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEF 422
+ ELKR I QAIFA+ R+QG LS++LR + P +S R M+ +LQ PE
Sbjct: 434 VREELKRAGISQAIFARVAFNRTQGLLSEILRKEEDPLHASQSLLVNLRAMYSFLQLPEA 493
Query: 423 QR 424
+R
Sbjct: 494 ER 495
>gi|291391988|ref|XP_002712419.1| PREDICTED: special AT-rich sequence binding protein 2-like
[Oryctolagus cuniculus]
Length = 733
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
PE +R
Sbjct: 423 NLPEVER 429
>gi|348522869|ref|XP_003448946.1| PREDICTED: DNA-binding protein SATB1 [Oreochromis niloticus]
Length = 738
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q + ELKR I QAIFA+ R+QG LS++LR + P +S R M +L
Sbjct: 381 DIYQNVREELKRAGISQAIFARVAFNRTQGLLSEILRKEEDPKHASQSLLVNLRAMHSFL 440
Query: 418 QEPEFQR 424
Q PE +R
Sbjct: 441 QLPEAER 447
>gi|308470369|ref|XP_003097418.1| hypothetical protein CRE_16952 [Caenorhabditis remanei]
gi|308240126|gb|EFO84078.1| hypothetical protein CRE_16952 [Caenorhabditis remanei]
Length = 444
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 362 QRISAELKRYSIP----QAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRMWKW 416
+R+SAE+K + + Q FA ++ R++ T+ + P+ W+ S G+E F RM W
Sbjct: 335 KRVSAEIKEWLVESQATQEWFASTIVGRNRRTMGPAINYPRDWNDCASKGQEMFMRMHNW 394
Query: 417 LQEPEFQRMSALR 429
++ E QR +R
Sbjct: 395 MKLSEMQRQEIMR 407
>gi|339635574|gb|AEJ84495.1| Satb2 [Xenopus laevis]
Length = 765
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 392 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 451
Query: 418 QEPEFQR 424
PE +R
Sbjct: 452 NLPEAER 458
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 12/95 (12%)
Query: 335 TPTNAVVGAVSLGNSGELEEIN-TKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
TPT+ + V GE +N T + I E+KR + QA+FA+ +SQG L +L
Sbjct: 497 TPTSDIQVKVE----GESSNVNITAAVYDEIQQEMKRAKVSQALFAKVAANKSQGWLCEL 552
Query: 394 LR---NPKPWSK-LKSGRETFRRMWKWLQEPEFQR 424
LR NP P ++ L T RR +L P+ +R
Sbjct: 553 LRWKENPSPENRTLWENLCTIRR---FLALPQHER 584
>gi|348555191|ref|XP_003463407.1| PREDICTED: LOW QUALITY PROTEIN: DNA-binding protein SATB2-like
[Cavia porcellus]
Length = 696
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 326 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 385
Query: 418 QEPEFQR 424
PE +R
Sbjct: 386 NLPEVER 392
>gi|308469493|ref|XP_003096984.1| hypothetical protein CRE_21434 [Caenorhabditis remanei]
gi|308241184|gb|EFO85136.1| hypothetical protein CRE_21434 [Caenorhabditis remanei]
Length = 384
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETF 410
+NTK+++ I L++ Q FA ++L R + TLS L NPK W L RE +
Sbjct: 324 VNTKKISMLIKDWLEKTQATQEWFATKILGRCRRTLSRCLNNPKDWKDLSQKREIY 379
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
++TKE+A+R+ QA FA V ++ +L P P+ LK+G+E + +M+
Sbjct: 175 LDTKEIARRMREWFTLAICSQAFFAVHVRGVARNRFHRVLTIPPPFDSLKTGKELYIKMY 234
Query: 415 KWLQEPEFQRMSALRLAGN 433
WL+ E + L + GN
Sbjct: 235 NWLKISEDVKKEILSVFGN 253
>gi|410911672|ref|XP_003969314.1| PREDICTED: DNA-binding protein SATB1-like [Takifugu rubripes]
Length = 870
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEF 422
+ ELKR I QAIFA+ R+QG LS++LR + P +S R M+ +LQ PE
Sbjct: 408 VREELKRAGISQAIFARVAFNRTQGLLSEILRKEEDPVHASQSLLVNLRAMYSFLQLPEA 467
Query: 423 QR 424
+R
Sbjct: 468 ER 469
>gi|348537720|ref|XP_003456341.1| PREDICTED: DNA-binding protein SATB2-like [Oreochromis niloticus]
Length = 1116
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 26/182 (14%)
Query: 260 GSLHSSPTTLVTNFQGAATIPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPP 319
G LH SP P + + ++SPQ SPQ L+ QL +
Sbjct: 317 GPLHHSPPLRTGQVPPPPPPPPPGALQPLLGPGGLLSPQLSPQ--LVRQQLAMA------ 368
Query: 320 KIISQQTATLTLKTVTPTNAV----------------VGAVSLGNSGELEEINTKELAQR 363
+I+QQ A L A+ G G++ E+++ E+ Q+
Sbjct: 369 HLINQQLAVSRLLAHQHPQALNQQFLNHPPIPRGSSKAGGDHPGSNPSASEVSS-EIYQQ 427
Query: 364 ISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEF 422
+ ELKR S+ QA+FA+ R+QG LS++LR + P + +S + M +L PE
Sbjct: 428 VRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLKAMQNFLNLPES 487
Query: 423 QR 424
+R
Sbjct: 488 ER 489
>gi|327282796|ref|XP_003226128.1| PREDICTED: DNA-binding protein SATB2-like [Anolis carolinensis]
Length = 736
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 368 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 427
Query: 418 QEPEFQR 424
PE +R
Sbjct: 428 NLPETER 434
>gi|444721959|gb|ELW62665.1| DNA-binding protein SATB2 [Tupaia chinensis]
Length = 981
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 611 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 670
Query: 418 QEPEFQR 424
PE +R
Sbjct: 671 NLPEVER 677
>gi|301616653|ref|XP_002937759.1| PREDICTED: DNA-binding protein SATB2-like [Xenopus (Silurana)
tropicalis]
Length = 765
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 392 DVYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 451
Query: 418 QEPEFQR 424
PE +R
Sbjct: 452 NLPEAER 458
>gi|410905889|ref|XP_003966424.1| PREDICTED: DNA-binding protein SATB2-like [Takifugu rubripes]
Length = 796
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P S +S + M +L
Sbjct: 389 DIYQKVRNELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRSASQSLLVNLKAMQSFL 448
Query: 418 QEPEFQR 424
PE +R
Sbjct: 449 NLPEGER 455
>gi|348536948|ref|XP_003455957.1| PREDICTED: DNA-binding protein SATB2-like [Oreochromis niloticus]
Length = 869
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 52/92 (56%), Gaps = 2/92 (2%)
Query: 334 VTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDL 393
+ + + GAVS + E+++ ++ Q++ ELKR S+ QA+FA+ R+QG LS++
Sbjct: 421 IARSCKISGAVSEPSLNSGAEVSS-DIYQQVRNELKRASVSQAVFARVAFNRTQGLLSEI 479
Query: 394 LRNPK-PWSKLKSGRETFRRMWKWLQEPEFQR 424
LR + P S +S + M +L PE +R
Sbjct: 480 LRKEEDPRSASQSLLVNLKAMQNFLNLPESER 511
>gi|341901436|gb|EGT57371.1| hypothetical protein CAEBREN_04680 [Caenorhabditis brenneri]
Length = 384
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 74/156 (47%), Gaps = 9/156 (5%)
Query: 269 LVTNFQGAATIPTMNGITTITPHTVVVSPQTS-PQHSLISNQLQRSDSPGPPKIISQQTA 327
+ F G + I +T + Q + PQ+S+ + QRS P K++ ++
Sbjct: 158 MTNQFYGTHQVENQENIEPVTSGVPAQNLQFAMPQYSVGVDYEQRS----PHKLVQEEQP 213
Query: 328 TLTLKT-VTPTNAVVGAVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRS 386
+ L+ V +A L +++ I+TKE+ R+ + + Q FA+++L S
Sbjct: 214 EVMLEAEVIALDAATANQWLDE--KIDYIDTKEMMGRLKEWMTKTGNLQKNFAEKILNLS 271
Query: 387 QGTLSDLLRNPKPWSKLK-SGRETFRRMWKWLQEPE 421
+LS ++ PK + LK G++ + RM+ +L+ E
Sbjct: 272 PTSLSTMISRPKSFGNLKRGGKKQYYRMFNFLKFDE 307
>gi|122114618|ref|NP_001073631.1| uncharacterized protein LOC378993 [Danio rerio]
gi|190338565|gb|AAI63787.1| Si:ch211-195k18.2 [Danio rerio]
gi|190338567|gb|AAI63793.1| Si:ch211-195k18.2 [Danio rerio]
Length = 728
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
E+ + ELKR + QA+FA+ + R+QG LS++LR + P +S R M +L
Sbjct: 334 EIYHWVREELKRAGVSQAVFARVAINRTQGLLSEILRKEEDPRCASQSLLVNLRAMQSFL 393
Query: 418 QEPEFQR 424
Q P+ QR
Sbjct: 394 QLPQSQR 400
>gi|410896512|ref|XP_003961743.1| PREDICTED: DNA-binding protein SATB2-like [Takifugu rubripes]
Length = 884
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S + M +L
Sbjct: 421 DIYQKVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLKAMQNFL 480
Query: 418 QEPEFQR 424
PE +R
Sbjct: 481 NLPESER 487
>gi|405952333|gb|EKC20157.1| Hepatocyte nuclear factor 6 [Crassostrea gigas]
Length = 647
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 4/89 (4%)
Query: 347 GNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVL-CRSQGTLSDLLRNPKP--WSKL 403
G+S ++ ++ Q A L+ I Q A+ ++ SQG LS LL K WS+L
Sbjct: 100 GSSSTMDFVDIPATVQEFKAALESSGISQRFAAKFIMDSTSQGNLSFLLEKGKSKHWSEL 159
Query: 404 KS-GRETFRRMWKWLQEPEFQRMSALRLA 431
GR + RM +WL PE QR++ +L+
Sbjct: 160 SCRGRLPYIRMRRWLDSPEEQRITLSQLS 188
>gi|308449134|ref|XP_003087867.1| hypothetical protein CRE_16582 [Caenorhabditis remanei]
gi|308252159|gb|EFO96111.1| hypothetical protein CRE_16582 [Caenorhabditis remanei]
Length = 340
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 362 QRISAELKRYSIP----QAIFAQRVLCRSQGTLSDLLRNPKPWSKLKS-GRETFRRMWKW 416
+R+SAE+K + + Q FA ++ R++ T+ + P+ W+ S G+E F RM W
Sbjct: 231 KRVSAEIKEWLVESQATQEWFASTIVGRNRRTMGPAINYPRNWNDCPSKGQEMFMRMHNW 290
Query: 417 LQEPEFQRMSALR 429
++ E QR +R
Sbjct: 291 MKLSEMQRQEIMR 303
>gi|224055470|ref|XP_002191597.1| PREDICTED: DNA-binding protein SATB2 [Taeniopygia guttata]
Length = 731
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 363 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 422
Query: 418 QEPEFQR 424
P+ +R
Sbjct: 423 NLPDSER 429
>gi|312283698|ref|NP_001186039.1| DNA-binding protein SATB2 [Gallus gallus]
Length = 731
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 75/157 (47%), Gaps = 24/157 (15%)
Query: 279 IPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTN 338
+PTM T I P ++SPQ SPQ L+ Q+ + +I+QQ A L
Sbjct: 286 VPTMQ--TMINPG--LLSPQLSPQ--LVRQQIAMA------HLINQQIAVSRLLAHQHPQ 333
Query: 339 AVVG----------AVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQG 388
A+ AV S E+ + ++ Q++ ELKR S+ QA+FA+ R+QG
Sbjct: 334 AINQQFLNHPPIPRAVKPEPSNSSVEV-SPDIYQQVRDELKRASVSQAVFARVAFNRTQG 392
Query: 389 TLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEFQR 424
LS++LR + P + +S R M +L P+ +R
Sbjct: 393 LLSEILRKEEDPRTASQSLLVNLRAMQNFLNLPDSER 429
>gi|305381002|gb|ADM49196.1| special AT-rich sequence binding protein 2 [Gallus gallus]
Length = 731
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 75/157 (47%), Gaps = 24/157 (15%)
Query: 279 IPTMNGITTITPHTVVVSPQTSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTN 338
+PTM T I P ++SPQ SPQ L+ Q+ + +I+QQ A L
Sbjct: 286 VPTMQ--TMINPG--LLSPQLSPQ--LVRQQIAMA------HLINQQIAVSRLLAHQHPQ 333
Query: 339 AVVG----------AVSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQG 388
A+ AV S E+ + ++ Q++ ELKR S+ QA+FA+ R+QG
Sbjct: 334 AINQQFLNHPPIPRAVKPEPSNSSVEV-SPDIYQQVRDELKRASVSQAVFARVAFNRTQG 392
Query: 389 TLSDLLRNPK-PWSKLKSGRETFRRMWKWLQEPEFQR 424
LS++LR + P + +S R M +L P+ +R
Sbjct: 393 LLSEILRKEEDPRTASQSLLVNLRAMQNFLNLPDSER 429
>gi|190570322|ref|NP_001122004.1| DNA-binding protein SATB2 [Danio rerio]
Length = 834
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q + ELKR S+ QA+FA+ R+QG LS++LR + P S +S + M +L
Sbjct: 415 DIYQLVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRSASQSLLVNLKAMQNFL 474
Query: 418 QEPEFQR 424
PE +R
Sbjct: 475 NLPESER 481
>gi|187476433|gb|ACD12677.1| AT-rich sequence binding protein 2 [Danio rerio]
Length = 737
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q + ELKR S+ QA+FA+ R+QG LS++LR + P S +S + M +L
Sbjct: 401 DIYQLVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRSASQSLLVNLKAMQNFL 460
Query: 418 QEPEFQR 424
PE +R
Sbjct: 461 NLPESER 467
>gi|268535060|ref|XP_002632663.1| Hypothetical protein CBG21587 [Caenorhabditis briggsae]
Length = 339
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 373 IPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQ 418
+ Q FA+ VL R+Q S + +NPKPW L + G+ + R++ W++
Sbjct: 213 LSQKTFAELVLQRTQAYYSSICKNPKPWESLTEMGKALYVRIYNWIE 259
>gi|305381000|gb|ADM49195.1| special AT-rich sequence binding protein 2 [Danio rerio]
Length = 863
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q + ELKR S+ QA+FA+ R+QG LS++LR + P S +S + M +L
Sbjct: 415 DIYQLVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRSASQSLLVNLKAMQNFL 474
Query: 418 QEPEFQR 424
PE +R
Sbjct: 475 NLPESER 481
>gi|326922529|ref|XP_003207501.1| PREDICTED: DNA-binding protein SATB2-like, partial [Meleagris
gallopavo]
Length = 615
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 359 ELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPK-PWSKLKSGRETFRRMWKWL 417
++ Q++ ELKR S+ QA+FA+ R+QG LS++LR + P + +S R M +L
Sbjct: 247 DIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFL 306
Query: 418 QEPEFQR 424
P+ +R
Sbjct: 307 NLPDSER 313
>gi|403413711|emb|CCM00411.1| predicted protein [Fibroporia radiculosa]
Length = 673
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 6/114 (5%)
Query: 89 SVSVASIMDSSE-FRNLHAEP--TYQTLTNVNGRLSPPGYSPSSSYATLTPLQPLPPIST 145
S S A ++S++ LH P + Q L+ + RL P + ++ + PLP ++
Sbjct: 321 SSSTAFTLESTKGVETLHFAPYASEQLLSILTDRLDPL-TNDAACAGRMKKFIPLPSMTL 379
Query: 146 MSDKFAYGHTGNVTGSFTVMQNNIGLGINSHYTYDKIQMSPPQHYSPNNGLSTI 199
++ K A HTG+V +F V++N I L + S T D + SP +P++ LS
Sbjct: 380 LAKKIA-AHTGDVRAAFEVLRNAIDLAVTSSKTPDMLD-SPDPTITPSHILSAF 431
>gi|308449410|ref|XP_003087954.1| hypothetical protein CRE_12448 [Caenorhabditis remanei]
gi|308250776|gb|EFO94728.1| hypothetical protein CRE_12448 [Caenorhabditis remanei]
Length = 521
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 39/76 (51%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L + L+++ + ++ L++ +ETF +M+
Sbjct: 216 VNTTEIVKEIKEWLGSSSYTNKYFASNILNIKEKNLTNIFAQKRDFNSLRNTKETFIKMY 275
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 276 NWLEMSEDVRAEMLKM 291
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 37/76 (48%)
Query: 355 INTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMW 414
+NT E+ + I L S FA +L L+++ P+ + L++ +E F +M+
Sbjct: 434 VNTTEIVKEIKEWLGSSSYTNKYFASNILNIKGNHLTNIFAQPRDFISLRNTKEAFIKMF 493
Query: 415 KWLQEPEFQRMSALRL 430
WL+ E R L++
Sbjct: 494 NWLEMSEDMRTEMLKM 509
>gi|422829786|ref|ZP_16877950.1| hypothetical protein ESNG_02455 [Escherichia coli B093]
gi|371607738|gb|EHN96303.1| hypothetical protein ESNG_02455 [Escherichia coli B093]
Length = 1471
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 78/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1006 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1058
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1059 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1106
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + S N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1107 GLNTLVLADAINGIAIGSNASANHANSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1166
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1167 VGSEDGQRQITN 1178
>gi|268537334|ref|XP_002633803.1| Hypothetical protein CBG19824 [Caenorhabditis briggsae]
Length = 517
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 5/60 (8%)
Query: 372 SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL-KSGRETFRRMWKWLQEPEFQRMSALRL 430
S+P+ + VL +SQG S L ++ KPW L SG++ + RM+ WL + Q+ L +
Sbjct: 204 SLPEEM----VLGKSQGYYSSLFKHSKPWESLIDSGKQLYIRMFNWLNLDDNQKQEYLEI 259
>gi|422836126|ref|ZP_16884175.1| hypothetical protein ESOG_03776 [Escherichia coli E101]
gi|371609687|gb|EHN98221.1| hypothetical protein ESOG_03776 [Escherichia coli E101]
Length = 391
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 69/173 (39%), Gaps = 18/173 (10%)
Query: 71 DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSS 130
D+ D E+L I G + ++ D SE H T + L N G ++
Sbjct: 111 DTTDGELLGALSIGDDGKYRQIINVADCSE---AHDAATVRQLQNAIGAVAT-------- 159
Query: 131 YATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGINSHYTYDKIQ-MSPP 187
TP + ST D A G G+ TV+ N IG+G+N+ D I ++
Sbjct: 160 ----TPTKYFHANSTAEDSLAVGEDSLAMGAKTVVNGNAGIGIGLNTLVLADAINGIAIG 215
Query: 188 QHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDL 240
+ N+ S + + Q+ Y+ +++PQ S+ S S QR +
Sbjct: 216 SNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQI 268
>gi|170684255|ref|YP_001745904.1| hemagluttinin family protein [Escherichia coli SMS-3-5]
gi|170521973|gb|ACB20151.1| haemagluttinin family protein [Escherichia coli SMS-3-5]
Length = 1549
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 77/190 (40%), Gaps = 22/190 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 1086 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 1138
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGI 173
G ++ TP + ST D A G G+ T++ N IG+G+
Sbjct: 1139 AIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGIGL 1186
Query: 174 NSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
N+ D I ++ + S N+ S + + Q+ YS +++PQ S+ S
Sbjct: 1187 NTLVLADAINGIAIGSNASANHANSIAMGNGSQTTRGAQTNYSAYNMDAPQNSVGEFSVG 1246
Query: 233 SPYNQRDLVD 242
S QR + +
Sbjct: 1247 SEDGQRQITN 1256
>gi|194431046|ref|ZP_03063339.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|417675198|ref|ZP_12324623.1| hemagglutinin family protein [Shigella dysenteriae 155-74]
gi|420345786|ref|ZP_14847215.1| hemagglutinin family protein [Shigella boydii 965-58]
gi|194420501|gb|EDX36577.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|332084528|gb|EGI89723.1| hemagglutinin family protein [Shigella dysenteriae 155-74]
gi|391275838|gb|EIQ34621.1| hemagglutinin family protein [Shigella boydii 965-58]
Length = 337
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 70/175 (40%), Gaps = 18/175 (10%)
Query: 71 DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSS 130
D+ D E+L I G + ++ D SE H T + L N G ++
Sbjct: 6 DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQNAIGAVAT-------- 54
Query: 131 YATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGINSHYTYDKIQ-MSPP 187
TP + ST D A G G+ TV+ N IG+G+N+ D I ++
Sbjct: 55 ----TPTKYFHANSTAEDSLAVGEDSLAMGAKTVVNGNAGIGIGLNTLVLADAINGIAIG 110
Query: 188 QHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVD 242
+ N+ S + + Q+ Y+ +++PQ S+ S S QR + +
Sbjct: 111 SNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQITN 165
>gi|432519823|ref|ZP_19757002.1| hypothetical protein A17U_02805 [Escherichia coli KTE228]
gi|432915203|ref|ZP_20120530.1| hypothetical protein A13Q_04169 [Escherichia coli KTE190]
gi|431048075|gb|ELD58060.1| hypothetical protein A17U_02805 [Escherichia coli KTE228]
gi|431436271|gb|ELH17878.1| hypothetical protein A13Q_04169 [Escherichia coli KTE190]
Length = 1812
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 1349 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 1401
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 1402 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 1449
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 1450 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 1509
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 1510 SEDGQRQI 1517
>gi|386626424|ref|YP_006146152.1| hemagluttinin family protein [Escherichia coli O7:K1 str. CE10]
gi|349740160|gb|AEQ14866.1| hemagluttinin family protein [Escherichia coli O7:K1 str. CE10]
Length = 1567
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 77/190 (40%), Gaps = 22/190 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 1104 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 1156
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGI 173
G ++ TP + ST D A G G+ T++ N IG+G+
Sbjct: 1157 AIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGIGL 1204
Query: 174 NSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
N+ D I ++ + S N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1205 NTLVLADAINGIAIGSNASANHANSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFSVG 1264
Query: 233 SPYNQRDLVD 242
S QR + +
Sbjct: 1265 SEDGQRQITN 1274
>gi|341901415|gb|EGT57350.1| hypothetical protein CAEBREN_19718 [Caenorhabditis brenneri]
Length = 396
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 72/136 (52%), Gaps = 10/136 (7%)
Query: 288 ITPHTVVVSPQ----TSPQHSLISNQLQRSDSPGPPKIISQQTATLTLKTVTPTNAVVGA 343
+ P T VV PQ PQ+S+ + QRS P +++ ++ + L+ V A
Sbjct: 186 MDPVTSVVPPQNLQFAMPQYSVGIDYEQRS----PRELVQEEQPEVMLEAEVIALDVTTA 241
Query: 344 VSLGNSGELEEINTKELAQRISAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKL 403
L + +++ I+TKE+ R+ + + Q FA+++L SQ +LS ++ PK + L
Sbjct: 242 NQLLDE-KVDYIDTKEMMGRLKEWMTKTGNLQKHFAEKILNLSQSSLSSMISRPKSFGNL 300
Query: 404 KS-GRETFRRMWKWLQ 418
KS G++ + RM+ +L+
Sbjct: 301 KSGGKKQYYRMFNFLK 316
>gi|419943858|ref|ZP_14460371.1| putative exported adhesin [Escherichia coli HM605]
gi|388420055|gb|EIL79760.1| putative exported adhesin [Escherichia coli HM605]
Length = 1583
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1118 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1170
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1171 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1218
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1219 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1278
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1279 VGSEDGQRQITN 1290
>gi|422360826|ref|ZP_16441455.1| Hep_Hag protein [Escherichia coli MS 110-3]
gi|315285365|gb|EFU44810.1| Hep_Hag protein [Escherichia coli MS 110-3]
Length = 1583
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1118 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1170
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1171 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1218
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1219 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1278
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1279 VGSEDGQRQITN 1290
>gi|432803807|ref|ZP_20037758.1| autotransport adhesin [Escherichia coli KTE84]
gi|431345555|gb|ELG32471.1| autotransport adhesin [Escherichia coli KTE84]
Length = 1646
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1181 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1233
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1234 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1281
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1282 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1341
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1342 VGSEDGQRQITN 1353
>gi|312968056|ref|ZP_07782267.1| putative hemagglutinin [Escherichia coli 2362-75]
gi|417757961|ref|ZP_12406025.1| putative inner membrane protein [Escherichia coli DEC2B]
gi|418998948|ref|ZP_13546530.1| putative inner membrane protein [Escherichia coli DEC1A]
gi|419004333|ref|ZP_13551843.1| putative inner membrane protein [Escherichia coli DEC1B]
gi|419010012|ref|ZP_13557427.1| putative inner membrane protein [Escherichia coli DEC1C]
gi|419020646|ref|ZP_13567943.1| putative inner membrane protein [Escherichia coli DEC1E]
gi|419026103|ref|ZP_13573320.1| putative inner membrane protein [Escherichia coli DEC2A]
gi|419031239|ref|ZP_13578383.1| putative inner membrane protein [Escherichia coli DEC2C]
gi|419036818|ref|ZP_13583892.1| putative inner membrane protein [Escherichia coli DEC2D]
gi|419041941|ref|ZP_13588958.1| putative inner membrane protein [Escherichia coli DEC2E]
gi|312287315|gb|EFR15224.1| putative hemagglutinin [Escherichia coli 2362-75]
gi|377839678|gb|EHU04758.1| putative inner membrane protein [Escherichia coli DEC1C]
gi|377839892|gb|EHU04971.1| putative inner membrane protein [Escherichia coli DEC1A]
gi|377842822|gb|EHU07871.1| putative inner membrane protein [Escherichia coli DEC1B]
gi|377856607|gb|EHU21466.1| putative inner membrane protein [Escherichia coli DEC1E]
gi|377859652|gb|EHU24482.1| putative inner membrane protein [Escherichia coli DEC2A]
gi|377870889|gb|EHU35562.1| putative inner membrane protein [Escherichia coli DEC2B]
gi|377873219|gb|EHU37857.1| putative inner membrane protein [Escherichia coli DEC2C]
gi|377875272|gb|EHU39886.1| putative inner membrane protein [Escherichia coli DEC2D]
gi|377886653|gb|EHU51134.1| putative inner membrane protein [Escherichia coli DEC2E]
Length = 938
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 473 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 525
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 526 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 573
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 574 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 633
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 634 VGSEDGQRQI 643
>gi|417588739|ref|ZP_12239501.1| hemagglutinin family protein [Escherichia coli STEC_C165-02]
gi|345331738|gb|EGW64197.1| hemagglutinin family protein [Escherichia coli STEC_C165-02]
Length = 1665
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 1202 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 1254
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 1255 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 1302
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 1303 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 1362
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 1363 SEDGQRQI 1370
>gi|117625880|ref|YP_859203.1| autotransporter/adhesin [Escherichia coli APEC O1]
gi|218560677|ref|YP_002393590.1| exported adhesin [Escherichia coli S88]
gi|237703372|ref|ZP_04533853.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gi|386601638|ref|YP_006103144.1| hemagluttinin family protein [Escherichia coli IHE3034]
gi|422756724|ref|ZP_16810546.1| hep Hag protein [Escherichia coli H263]
gi|422841624|ref|ZP_16889593.1| hypothetical protein ESPG_04279 [Escherichia coli H397]
gi|432360068|ref|ZP_19603280.1| autotransport adhesin [Escherichia coli KTE4]
gi|432364867|ref|ZP_19608021.1| autotransport adhesin [Escherichia coli KTE5]
gi|432590043|ref|ZP_19826394.1| autotransport adhesin [Escherichia coli KTE58]
gi|432756546|ref|ZP_19991089.1| autotransport adhesin [Escherichia coli KTE22]
gi|432780622|ref|ZP_20014841.1| autotransport adhesin [Escherichia coli KTE59]
gi|432789615|ref|ZP_20023741.1| autotransport adhesin [Escherichia coli KTE65]
gi|432823051|ref|ZP_20056738.1| autotransport adhesin [Escherichia coli KTE118]
gi|432824510|ref|ZP_20058173.1| autotransport adhesin [Escherichia coli KTE123]
gi|433007200|ref|ZP_20195622.1| autotransport adhesin [Escherichia coli KTE227]
gi|433009816|ref|ZP_20198227.1| autotransport adhesin [Escherichia coli KTE229]
gi|433155793|ref|ZP_20340720.1| autotransport adhesin [Escherichia coli KTE176]
gi|433165632|ref|ZP_20350357.1| autotransport adhesin [Escherichia coli KTE179]
gi|433170627|ref|ZP_20355243.1| autotransport adhesin [Escherichia coli KTE180]
gi|115515004|gb|ABJ03079.1| putative autotransporter/adhesin [Escherichia coli APEC O1]
gi|218367446|emb|CAR05228.1| putative exported adhesin [Escherichia coli S88]
gi|226902636|gb|EEH88895.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gi|294490602|gb|ADE89358.1| hemagluttinin family protein [Escherichia coli IHE3034]
gi|323954855|gb|EGB50635.1| hep Hag protein [Escherichia coli H263]
gi|371603942|gb|EHN92576.1| hypothetical protein ESPG_04279 [Escherichia coli H397]
gi|430873202|gb|ELB96777.1| autotransport adhesin [Escherichia coli KTE4]
gi|430883157|gb|ELC06161.1| autotransport adhesin [Escherichia coli KTE5]
gi|431117555|gb|ELE20783.1| autotransport adhesin [Escherichia coli KTE58]
gi|431299434|gb|ELF89005.1| autotransport adhesin [Escherichia coli KTE22]
gi|431324463|gb|ELG11915.1| autotransport adhesin [Escherichia coli KTE59]
gi|431334784|gb|ELG21928.1| autotransport adhesin [Escherichia coli KTE65]
gi|431365260|gb|ELG51774.1| autotransport adhesin [Escherichia coli KTE118]
gi|431377452|gb|ELG62578.1| autotransport adhesin [Escherichia coli KTE123]
gi|431509807|gb|ELH88055.1| autotransport adhesin [Escherichia coli KTE227]
gi|431521198|gb|ELH98446.1| autotransport adhesin [Escherichia coli KTE229]
gi|431670224|gb|ELJ36578.1| autotransport adhesin [Escherichia coli KTE176]
gi|431683528|gb|ELJ49157.1| autotransport adhesin [Escherichia coli KTE179]
gi|431683952|gb|ELJ49573.1| autotransport adhesin [Escherichia coli KTE180]
Length = 1583
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1118 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1170
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1171 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1218
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1219 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1278
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1279 VGSEDGQRQITN 1290
>gi|91213116|ref|YP_543102.1| autotransport adhesin [Escherichia coli UTI89]
gi|386606195|ref|YP_006112495.1| putative exported adhesin [Escherichia coli UM146]
gi|432575856|ref|ZP_19812325.1| autotransport adhesin [Escherichia coli KTE55]
gi|91074690|gb|ABE09571.1| putative autotransport adhesin [Escherichia coli UTI89]
gi|307628679|gb|ADN72983.1| putative exported adhesin [Escherichia coli UM146]
gi|431104629|gb|ELE09002.1| autotransport adhesin [Escherichia coli KTE55]
Length = 1583
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1118 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1170
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1171 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1218
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1219 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1278
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1279 VGSEDGQRQITN 1290
>gi|422334719|ref|ZP_16415724.1| hypothetical protein HMPREF0986_04218 [Escherichia coli 4_1_47FAA]
gi|373244328|gb|EHP63815.1| hypothetical protein HMPREF0986_04218 [Escherichia coli 4_1_47FAA]
Length = 1518
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 1055 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 1107
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 1108 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 1155
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 1156 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 1215
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 1216 SEDGQRQI 1223
>gi|422751459|ref|ZP_16805368.1| hep Hag protein [Escherichia coli H252]
gi|323949844|gb|EGB45728.1| hep Hag protein [Escherichia coli H252]
Length = 1583
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1118 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1170
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1171 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1218
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1219 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1278
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1279 VGSEDGQRQITN 1290
>gi|419015652|ref|ZP_13562988.1| putative inner membrane protein [Escherichia coli DEC1D]
gi|377853291|gb|EHU18191.1| putative inner membrane protein [Escherichia coli DEC1D]
Length = 938
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 473 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 525
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 526 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 573
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 574 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 633
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 634 VGSEDGQRQITN 645
>gi|416338218|ref|ZP_11674452.1| hemagglutinin [Escherichia coli WV_060327]
gi|320193888|gb|EFW68521.1| hemagglutinin [Escherichia coli WV_060327]
Length = 924
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 459 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 511
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 512 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 559
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 560 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 619
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 620 VGSEDGQRQI 629
>gi|331659925|ref|ZP_08360863.1| putative autotransport adhesin [Escherichia coli TA206]
gi|331053140|gb|EGI25173.1| putative autotransport adhesin [Escherichia coli TA206]
Length = 1612
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1147 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1199
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1200 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1247
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1248 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1307
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 1308 VGSEDGQRQI 1317
>gi|417730870|ref|ZP_12379551.1| hemagglutinin family protein [Shigella flexneri K-671]
gi|332749946|gb|EGJ80358.1| hemagglutinin family protein [Shigella flexneri K-671]
Length = 562
Score = 38.1 bits (87), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 70/175 (40%), Gaps = 18/175 (10%)
Query: 71 DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSS 130
D+ D E+L I G + ++ D SE H T + L N G ++
Sbjct: 110 DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQNAIGAVAT-------- 158
Query: 131 YATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGINSHYTYDKIQ-MSPP 187
TP + ST D A G G+ TV+ N IG+G+N+ D I ++
Sbjct: 159 ----TPTKYFHANSTAEDSLAVGEDSLAMGAKTVVNGNAGIGIGLNTLVLADAINGIAIG 214
Query: 188 QHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVD 242
+ N+ S + + Q+ Y+ +++PQ S+ S S QR + +
Sbjct: 215 SNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQITN 269
>gi|432599868|ref|ZP_19836137.1| autotransport adhesin [Escherichia coli KTE62]
gi|431127743|gb|ELE30037.1| autotransport adhesin [Escherichia coli KTE62]
Length = 1505
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1040 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1092
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1093 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1140
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1141 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1200
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1201 VGSEDGQRQITN 1212
>gi|415847879|ref|ZP_11525993.1| hemagglutinin family protein [Shigella sonnei 53G]
gi|323166914|gb|EFZ52653.1| hemagglutinin family protein [Shigella sonnei 53G]
Length = 563
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 70/175 (40%), Gaps = 18/175 (10%)
Query: 71 DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSS 130
D+ D E+L I G + ++ D SE H T + L N G ++
Sbjct: 111 DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQNAIGAVAT-------- 159
Query: 131 YATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGINSHYTYDKIQ-MSPP 187
TP + ST D A G G+ TV+ N IG+G+N+ D I ++
Sbjct: 160 ----TPTKYFHANSTAEDSLAVGEDSLAMGAKTVVNGNAGIGIGLNTLVLADAINGIAIG 215
Query: 188 QHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDLVD 242
+ N+ S + + Q+ Y+ +++PQ S+ S S QR + +
Sbjct: 216 SNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQITN 270
>gi|432734369|ref|ZP_19969191.1| hypothetical protein WGK_04232, partial [Escherichia coli KTE45]
gi|432761454|ref|ZP_19995942.1| hypothetical protein A1S1_03604, partial [Escherichia coli KTE46]
gi|431270982|gb|ELF62124.1| hypothetical protein WGK_04232, partial [Escherichia coli KTE45]
gi|431305390|gb|ELF93713.1| hypothetical protein A1S1_03604, partial [Escherichia coli KTE46]
Length = 911
Score = 38.1 bits (87), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 446 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 498
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 499 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 546
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 547 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 606
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 607 VGSEDGQRQI 616
>gi|432605036|ref|ZP_19841245.1| hypothetical protein A1U7_00001, partial [Escherichia coli KTE67]
gi|431144058|gb|ELE45765.1| hypothetical protein A1U7_00001, partial [Escherichia coli KTE67]
Length = 906
Score = 38.1 bits (87), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 441 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 493
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 494 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 541
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 542 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 601
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 602 VGSEDGQRQI 611
>gi|26250246|ref|NP_756286.1| adhesin [Escherichia coli CFT073]
gi|386631518|ref|YP_006151238.1| putative adhesin [Escherichia coli str. 'clone D i2']
gi|386636438|ref|YP_006156157.1| putative adhesin [Escherichia coli str. 'clone D i14']
gi|386641212|ref|YP_006108010.1| hemagluttinin family protein [Escherichia coli ABU 83972]
gi|442605627|ref|ZP_21020443.1| FIG00638466: hypothetical protein [Escherichia coli Nissle 1917]
gi|26110675|gb|AAN82860.1|AE016768_278 Putative adhesin [Escherichia coli CFT073]
gi|307555704|gb|ADN48479.1| hemagluttinin family protein [Escherichia coli ABU 83972]
gi|355422417|gb|AER86614.1| putative adhesin [Escherichia coli str. 'clone D i2']
gi|355427337|gb|AER91533.1| putative adhesin [Escherichia coli str. 'clone D i14']
gi|441713309|emb|CCQ06420.1| FIG00638466: hypothetical protein [Escherichia coli Nissle 1917]
Length = 1778
Score = 38.1 bits (87), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1313 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1365
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1366 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1413
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1414 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1473
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1474 VGSEDGQRQITN 1485
>gi|432438426|ref|ZP_19680808.1| hypothetical protein A13M_04159, partial [Escherichia coli KTE188]
gi|432458739|ref|ZP_19700913.1| hypothetical protein A15C_04549, partial [Escherichia coli KTE201]
gi|432525945|ref|ZP_19763060.1| hypothetical protein A17Y_04072, partial [Escherichia coli KTE230]
gi|432594811|ref|ZP_19831122.1| hypothetical protein A1SS_04253, partial [Escherichia coli KTE60]
gi|432653210|ref|ZP_19888954.1| hypothetical protein A1W7_04240, partial [Escherichia coli KTE87]
gi|432785583|ref|ZP_20019759.1| hypothetical protein A1SY_04449, partial [Escherichia coli KTE63]
gi|432846732|ref|ZP_20079335.1| hypothetical protein A1YS_04107, partial [Escherichia coli KTE141]
gi|432975821|ref|ZP_20164653.1| hypothetical protein A15S_01691, partial [Escherichia coli KTE209]
gi|432997380|ref|ZP_20185960.1| hypothetical protein A17A_04460, partial [Escherichia coli KTE218]
gi|433001975|ref|ZP_20190492.1| hypothetical protein A17K_04325, partial [Escherichia coli KTE223]
gi|433117528|ref|ZP_20303307.1| hypothetical protein WKA_03721, partial [Escherichia coli KTE153]
gi|430960448|gb|ELC78603.1| hypothetical protein A13M_04159, partial [Escherichia coli KTE188]
gi|430979264|gb|ELC96049.1| hypothetical protein A15C_04549, partial [Escherichia coli KTE201]
gi|431048239|gb|ELD58222.1| hypothetical protein A17Y_04072, partial [Escherichia coli KTE230]
gi|431125712|gb|ELE28109.1| hypothetical protein A1SS_04253, partial [Escherichia coli KTE60]
gi|431187092|gb|ELE86606.1| hypothetical protein A1W7_04240, partial [Escherichia coli KTE87]
gi|431325905|gb|ELG13268.1| hypothetical protein A1SY_04449, partial [Escherichia coli KTE63]
gi|431392265|gb|ELG75864.1| hypothetical protein A1YS_04107, partial [Escherichia coli KTE141]
gi|431485487|gb|ELH65146.1| hypothetical protein A15S_01691, partial [Escherichia coli KTE209]
gi|431502072|gb|ELH80964.1| hypothetical protein A17A_04460, partial [Escherichia coli KTE218]
gi|431504247|gb|ELH82873.1| hypothetical protein A17K_04325, partial [Escherichia coli KTE223]
gi|431630488|gb|ELI98817.1| hypothetical protein WKA_03721, partial [Escherichia coli KTE153]
Length = 911
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 446 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 498
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 499 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 546
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 547 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 606
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 607 VGSEDGQRQI 616
>gi|432433827|ref|ZP_19676250.1| hypothetical protein A13K_04132 [Escherichia coli KTE187]
gi|433060121|ref|ZP_20247152.1| hypothetical protein WIM_03894 [Escherichia coli KTE124]
gi|433141304|ref|ZP_20326541.1| hypothetical protein WKM_03581 [Escherichia coli KTE167]
gi|430950097|gb|ELC69485.1| hypothetical protein A13K_04132 [Escherichia coli KTE187]
gi|431565726|gb|ELI38801.1| hypothetical protein WIM_03894 [Escherichia coli KTE124]
gi|431655727|gb|ELJ22757.1| hypothetical protein WKM_03581 [Escherichia coli KTE167]
Length = 987
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 522 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 574
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 575 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 622
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 623 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 682
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 683 VGSEDGQRQI 692
>gi|432570845|ref|ZP_19807350.1| autotransport adhesin, partial [Escherichia coli KTE53]
gi|431097011|gb|ELE02460.1| autotransport adhesin, partial [Escherichia coli KTE53]
Length = 1051
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 586 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 638
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 639 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 686
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 687 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 746
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 747 VGSEDGQRQI 756
>gi|218691889|ref|YP_002400101.1| putative hemagglutinin adhesin [Escherichia coli ED1a]
gi|218429453|emb|CAR10419.2| putative haemagglutinin adhesin [Escherichia coli ED1a]
Length = 1600
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 22/192 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1135 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1187
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1188 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1235
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1236 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1295
Query: 231 YDSPYNQRDLVD 242
S QR + +
Sbjct: 1296 VGSEDGQRQITN 1307
>gi|432491387|ref|ZP_19733248.1| hypothetical protein A171_03319, partial [Escherichia coli KTE213]
gi|432841416|ref|ZP_20074873.1| hypothetical protein A1YQ_04380, partial [Escherichia coli KTE140]
gi|431017873|gb|ELD31326.1| hypothetical protein A171_03319, partial [Escherichia coli KTE213]
gi|431385879|gb|ELG69844.1| hypothetical protein A1YQ_04380, partial [Escherichia coli KTE140]
Length = 947
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 484 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 536
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 537 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 584
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 585 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTGYAAYNMDAPQNSVGEFSVG 644
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 645 SEDGQRQI 652
>gi|433025493|ref|ZP_20213462.1| autotransport adhesin [Escherichia coli KTE106]
gi|431531221|gb|ELI07889.1| autotransport adhesin [Escherichia coli KTE106]
Length = 994
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 529 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 581
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 582 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 629
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 630 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 689
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 690 VGSEDGQRQI 699
>gi|293407226|ref|ZP_06651150.1| peptide transport system permease sapB [Escherichia coli FVEC1412]
gi|291426037|gb|EFE99071.1| peptide transport system permease sapB [Escherichia coli FVEC1412]
Length = 1017
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 554 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 606
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 607 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 654
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 655 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 714
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 715 SEDGQRQI 722
>gi|227883772|ref|ZP_04001577.1| possible autotransporter/adhesin, partial [Escherichia coli 83972]
gi|432413841|ref|ZP_19656494.1| hypothetical protein WG9_04338, partial [Escherichia coli KTE39]
gi|432497734|ref|ZP_19739525.1| hypothetical protein A173_04918, partial [Escherichia coli KTE214]
gi|227839050|gb|EEJ49516.1| possible autotransporter/adhesin [Escherichia coli 83972]
gi|430932867|gb|ELC53285.1| hypothetical protein WG9_04338, partial [Escherichia coli KTE39]
gi|431020742|gb|ELD34077.1| hypothetical protein A173_04918, partial [Escherichia coli KTE214]
Length = 959
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 494 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 546
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 547 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 594
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 595 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 654
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 655 VGSEDGQRQI 664
>gi|331675087|ref|ZP_08375844.1| putative autotransport adhesin, partial [Escherichia coli TA280]
gi|331067996|gb|EGI39394.1| putative autotransport adhesin [Escherichia coli TA280]
Length = 972
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 509 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 561
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 562 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 609
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 610 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 669
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 670 SEDGQRQI 677
>gi|301047393|ref|ZP_07194473.1| hemagglutinin, partial [Escherichia coli MS 185-1]
gi|300300667|gb|EFJ57052.1| hemagglutinin [Escherichia coli MS 185-1]
Length = 924
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 459 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 511
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 512 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 559
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 560 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 619
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 620 VGSEDGQRQI 629
>gi|433030541|ref|ZP_20218387.1| autotransport adhesin [Escherichia coli KTE109]
gi|431540173|gb|ELI15800.1| autotransport adhesin [Escherichia coli KTE109]
Length = 1015
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 550 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 602
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 603 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 650
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 651 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 710
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 711 VGSEDGQRQI 720
>gi|422364289|ref|ZP_16444807.1| hemagglutinin, partial [Escherichia coli MS 153-1]
gi|315292980|gb|EFU52332.1| hemagglutinin [Escherichia coli MS 153-1]
Length = 925
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 460 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 512
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 513 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 560
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 561 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 620
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 621 VGSEDGQRQI 630
>gi|300984996|ref|ZP_07177248.1| hemagglutinin, partial [Escherichia coli MS 45-1]
gi|300408276|gb|EFJ91814.1| hemagglutinin [Escherichia coli MS 45-1]
Length = 925
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 460 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 512
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 513 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 560
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 561 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 620
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 621 VGSEDGQRQI 630
>gi|300898748|ref|ZP_07117056.1| hemagglutinin, partial [Escherichia coli MS 198-1]
gi|300357601|gb|EFJ73471.1| hemagglutinin [Escherichia coli MS 198-1]
Length = 1074
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 611 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 663
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 664 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 711
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 712 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 771
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 772 SEDGQRQI 779
>gi|433020854|ref|ZP_20208940.1| hypothetical protein WI7_03774, partial [Escherichia coli KTE105]
gi|433055277|ref|ZP_20242434.1| hypothetical protein WIK_04078, partial [Escherichia coli KTE122]
gi|433069965|ref|ZP_20256732.1| hypothetical protein WIQ_03844, partial [Escherichia coli KTE128]
gi|433160760|ref|ZP_20345577.1| hypothetical protein WKU_03835, partial [Escherichia coli KTE177]
gi|433180479|ref|ZP_20364855.1| hypothetical protein WGM_04117, partial [Escherichia coli KTE82]
gi|431526847|gb|ELI03582.1| hypothetical protein WI7_03774, partial [Escherichia coli KTE105]
gi|431565978|gb|ELI39023.1| hypothetical protein WIK_04078, partial [Escherichia coli KTE122]
gi|431578867|gb|ELI51454.1| hypothetical protein WIQ_03844, partial [Escherichia coli KTE128]
gi|431673669|gb|ELJ39861.1| hypothetical protein WKU_03835, partial [Escherichia coli KTE177]
gi|431697907|gb|ELJ62990.1| hypothetical protein WGM_04117, partial [Escherichia coli KTE82]
Length = 953
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 490 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 542
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 543 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 590
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 591 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 650
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 651 SEDGQRQI 658
>gi|432477965|ref|ZP_19719951.1| hypothetical protein A15Q_04167, partial [Escherichia coli KTE208]
gi|432772380|ref|ZP_20006692.1| hypothetical protein A1SG_00448, partial [Escherichia coli KTE54]
gi|431001994|gb|ELD17560.1| hypothetical protein A15Q_04167, partial [Escherichia coli KTE208]
gi|431323469|gb|ELG10962.1| hypothetical protein A1SG_00448, partial [Escherichia coli KTE54]
Length = 918
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 455 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 507
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 508 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 555
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 556 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 615
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 616 SEDGQRQI 623
>gi|419912152|ref|ZP_14430610.1| putative hemagglutinin/invasin, partial [Escherichia coli KD1]
gi|388392136|gb|EIL53568.1| putative hemagglutinin/invasin, partial [Escherichia coli KD1]
Length = 915
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 450 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 502
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 503 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 550
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 551 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 610
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 611 VGSEDGQRQI 620
>gi|432901156|ref|ZP_20111268.1| hypothetical protein A13U_04060, partial [Escherichia coli KTE192]
gi|431422197|gb|ELH04390.1| hypothetical protein A13U_04060, partial [Escherichia coli KTE192]
Length = 952
Score = 38.1 bits (87), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 487 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 539
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 540 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 587
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 588 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 647
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 648 VGSEDGQRQI 657
>gi|417691982|ref|ZP_12341187.1| hemagglutinin family protein [Shigella boydii 5216-82]
gi|332084784|gb|EGI89967.1| hemagglutinin family protein [Shigella boydii 5216-82]
Length = 1616
Score = 38.1 bits (87), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 69/173 (39%), Gaps = 18/173 (10%)
Query: 71 DSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTNVNGRLSPPGYSPSSS 130
D+ D E+L I G + ++ D SE H T + L N G ++
Sbjct: 1164 DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQNAIGAVAT-------- 1212
Query: 131 YATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGLGINSHYTYDKIQ-MSPP 187
TP + ST D A G G+ TV+ N IG+G+N+ D I ++
Sbjct: 1213 ----TPTKYFHANSTAEDSLAVGEDSLAMGAKTVVNGNAGIGIGLNTLVLADAINGIAIG 1268
Query: 188 QHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYDSPYNQRDL 240
+ N+ S + + Q+ Y+ +++PQ S+ S S QR +
Sbjct: 1269 SNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQI 1321
>gi|432506489|ref|ZP_19748207.1| hypothetical protein A17E_03568, partial [Escherichia coli KTE220]
gi|433209752|ref|ZP_20393415.1| hypothetical protein WI1_03532, partial [Escherichia coli KTE97]
gi|431035309|gb|ELD46699.1| hypothetical protein A17E_03568, partial [Escherichia coli KTE220]
gi|431728032|gb|ELJ91753.1| hypothetical protein WI1_03532, partial [Escherichia coli KTE97]
Length = 946
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 481 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 533
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 534 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 581
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 582 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 641
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 642 VGSEDGQRQI 651
>gi|432462970|ref|ZP_19705102.1| hypothetical protein A15I_03849, partial [Escherichia coli KTE204]
gi|432633504|ref|ZP_19869422.1| hypothetical protein A1UW_03896, partial [Escherichia coli KTE80]
gi|432643201|ref|ZP_19879023.1| hypothetical protein A1W1_04079, partial [Escherichia coli KTE83]
gi|432668194|ref|ZP_19903765.1| hypothetical protein A1Y3_04816, partial [Escherichia coli KTE116]
gi|430985550|gb|ELD02147.1| hypothetical protein A15I_03849, partial [Escherichia coli KTE204]
gi|431167191|gb|ELE67472.1| hypothetical protein A1UW_03896, partial [Escherichia coli KTE80]
gi|431177372|gb|ELE77303.1| hypothetical protein A1W1_04079, partial [Escherichia coli KTE83]
gi|431197536|gb|ELE96383.1| hypothetical protein A1Y3_04816, partial [Escherichia coli KTE116]
Length = 947
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 74/188 (39%), Gaps = 22/188 (11%)
Query: 56 SIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTLTN 115
S+ EG+ + D+ D E+L I G + ++ D SE H T + L N
Sbjct: 484 SVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQLQN 536
Query: 116 VNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNNIGLGINS 175
G ++ TP + ST D A G G+ T++ N+G+GI
Sbjct: 537 AIGAVAT------------TPTKYFHANSTAEDSLAVGEDSLAMGAKTIVNGNVGIGIGY 584
Query: 176 HYTYDKIQMSP---PQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNSYD 232
D ++ + S N+ S + + + Q+ Y+ +++PQ S+ S
Sbjct: 585 GAYVDANALNGIAIGSNASANHANSIAMGSGSQTTRGAQTDYTAYNMDAPQNSVGEFSVG 644
Query: 233 SPYNQRDL 240
S QR +
Sbjct: 645 SEDGQRQI 652
>gi|222158308|ref|YP_002558447.1| adhesin [Escherichia coli LF82]
gi|387618904|ref|YP_006121926.1| putative hemagglutinin/invasin [Escherichia coli O83:H1 str. NRG
857C]
gi|222035313|emb|CAP78058.1| adhesin [Escherichia coli LF82]
gi|312948165|gb|ADR28992.1| putative hemagglutinin/invasin [Escherichia coli O83:H1 str. NRG
857C]
Length = 1648
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 54 NESIKTEGLTVIVAPQDDSRDSEILSPGKISPSGISVSVASIMDSSEFRNLHAEPTYQTL 113
N S+ EG+ + D+ D E+L I G + ++ D SE H T + L
Sbjct: 1183 NTSVSEEGVVIGY----DTTDGELLGALSIGDDGKYRQIINVADGSE---AHDAVTVRQL 1235
Query: 114 TNVNGRLSPPGYSPSSSYATLTPLQPLPPISTMSDKFAYGHTGNVTGSFTVMQNN--IGL 171
N G ++ TP + ST D A G G+ T++ N IG+
Sbjct: 1236 QNAIGAVAT------------TPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGI 1283
Query: 172 GINSHYTYDKIQ-MSPPQHYSPNNGLSTIVITQENSPLSPQSTYSQNGLNSPQKSLSPNS 230
G+N+ D I ++ + N+ S + + Q+ Y+ +++PQ S+ S
Sbjct: 1284 GLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFS 1343
Query: 231 YDSPYNQRDL 240
S QR +
Sbjct: 1344 VGSEDGQRQI 1353
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.309 0.126 0.355
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,112,532,008
Number of Sequences: 23463169
Number of extensions: 313666597
Number of successful extensions: 784880
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 495
Number of HSP's successfully gapped in prelim test: 1269
Number of HSP's that attempted gapping in prelim test: 780154
Number of HSP's gapped (non-prelim): 4988
length of query: 433
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 288
effective length of database: 8,957,035,862
effective search space: 2579626328256
effective search space used: 2579626328256
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 78 (34.7 bits)