BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5912
(122 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|91079258|ref|XP_971676.1| PREDICTED: similar to iroquois-class homeodomain protein irx
[Tribolium castaneum]
Length = 458
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/88 (76%), Positives = 76/88 (86%), Gaps = 3/88 (3%)
Query: 34 AVTSTPAAPRST-SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-L 91
AVTS+ P S +P RCCDTGRP+FTDP++GQ++CSCQY+LL YQRL AGG PGLP L
Sbjct: 31 AVTSSAPTPGSPPAPSQRCCDTGRPLFTDPLTGQSVCSCQYELLGYQRL-AGGVPGLPAL 89
Query: 92 SMYSAPYPEGMAAYFPALGADQAPFYSS 119
SMYSAPYPEGMAAYFPALGADQAPFY+S
Sbjct: 90 SMYSAPYPEGMAAYFPALGADQAPFYTS 117
>gi|270004889|gb|EFA01337.1| mirror [Tribolium castaneum]
Length = 533
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/88 (76%), Positives = 76/88 (86%), Gaps = 3/88 (3%)
Query: 34 AVTSTPAAPRST-SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-L 91
AVTS+ P S +P RCCDTGRP+FTDP++GQ++CSCQY+LL YQRL AGG PGLP L
Sbjct: 2 AVTSSAPTPGSPPAPSQRCCDTGRPLFTDPLTGQSVCSCQYELLGYQRL-AGGVPGLPAL 60
Query: 92 SMYSAPYPEGMAAYFPALGADQAPFYSS 119
SMYSAPYPEGMAAYFPALGADQAPFY+S
Sbjct: 61 SMYSAPYPEGMAAYFPALGADQAPFYTS 88
>gi|157116233|ref|XP_001658394.1| iroquois-class homeodomain protein irx [Aedes aegypti]
gi|108876563|gb|EAT40788.1| AAEL007502-PA [Aedes aegypti]
Length = 540
Score = 130 bits (328), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 72/102 (70%), Positives = 80/102 (78%), Gaps = 16/102 (15%)
Query: 34 AVTST-PAAPRSTSPGS-----------RCCDTGRPIFTDPISGQTICSCQYDLLNYQRL 81
AVT+T P RSTSP S RCCDTGRPIFTDPISGQT+CSCQYDL+NYQRL
Sbjct: 54 AVTTTVPPGARSTSPCSATSGPPTSGSGRCCDTGRPIFTDPISGQTVCSCQYDLINYQRL 113
Query: 82 AAGG--GP-GLPLSMYSAPY-PEGMAAYFPALGADQAPFYSS 119
A+ G GP G+PLSMYSAPY PE MAAYFPA+GADQAPFY++
Sbjct: 114 ASAGIAGPGGVPLSMYSAPYSPETMAAYFPAIGADQAPFYAN 155
>gi|328697559|ref|XP_001952319.2| PREDICTED: hypothetical protein LOC100169212 [Acyrthosiphon pisum]
Length = 599
Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 59/72 (81%), Positives = 66/72 (91%), Gaps = 2/72 (2%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
RCCDTGRPIFTDPI+GQT+CSCQYDLL+YQRLAA G P LSMY+APYPEGMAAYFPAL
Sbjct: 125 RCCDTGRPIFTDPITGQTVCSCQYDLLSYQRLAASGVPA--LSMYTAPYPEGMAAYFPAL 182
Query: 110 GADQAPFYSSAS 121
G+DQ PFYS+A+
Sbjct: 183 GSDQPPFYSAAA 194
>gi|170027824|ref|XP_001841797.1| iroquois-class homeodomain protein irx [Culex quinquefasciatus]
gi|167862367|gb|EDS25750.1| iroquois-class homeodomain protein irx [Culex quinquefasciatus]
Length = 512
Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/95 (70%), Positives = 79/95 (83%), Gaps = 6/95 (6%)
Query: 31 STTAVTSTPAAPRSTSPGS-RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG---G 86
ST+ ++T P ++ GS RCCDTGRPIFTDPISGQT+CSCQYDL+NYQRLA+ G G
Sbjct: 35 STSPCSATSGGPPTSGSGSGRCCDTGRPIFTDPISGQTVCSCQYDLINYQRLASAGIATG 94
Query: 87 P-GLPLSMYSAPY-PEGMAAYFPALGADQAPFYSS 119
P G+PLSMYSAPY PE MAAYFPA+GADQAPFY++
Sbjct: 95 PGGVPLSMYSAPYSPETMAAYFPAIGADQAPFYAN 129
>gi|312372585|gb|EFR20514.1| hypothetical protein AND_19963 [Anopheles darlingi]
Length = 241
Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/133 (60%), Positives = 96/133 (72%), Gaps = 17/133 (12%)
Query: 1 MLKIPESTVAPSSSPTSSSPPPSSTRASNTSTT---AVTSTPAAPRST------SPGS-R 50
M KI + PS+ SSPP +S + T+T A +S+P + S +PGS R
Sbjct: 10 MFKINTTPTVPSAP---SSPPTASVAMAVTTTVPPSARSSSPCSATSGPPTSGPTPGSGR 66
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAG--GGP-GLPLSMYSAPY-PEGMAAYF 106
CCDTGRPIFTDPISGQT+CSCQYDL+NYQRLA+ GGP G+PLSMYSAPY PE MAAYF
Sbjct: 67 CCDTGRPIFTDPISGQTVCSCQYDLINYQRLASAGIGGPGGVPLSMYSAPYSPETMAAYF 126
Query: 107 PALGADQAPFYSS 119
PA+GADQAPFY++
Sbjct: 127 PAIGADQAPFYAN 139
>gi|242023068|ref|XP_002431958.1| hypothetical protein Phum_PHUM562460 [Pediculus humanus corporis]
gi|212517309|gb|EEB19220.1| hypothetical protein Phum_PHUM562460 [Pediculus humanus corporis]
Length = 163
Score = 127 bits (319), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/93 (76%), Positives = 80/93 (86%), Gaps = 5/93 (5%)
Query: 30 TSTTAVTS-TPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLA-AGGGP 87
T+ TAVT+ T P S +PG RCCDTGRP+FTDPI+GQT+CSCQYDLL+YQRLA AG GP
Sbjct: 34 TNATAVTAGTGTRPASPTPG-RCCDTGRPLFTDPITGQTVCSCQYDLLSYQRLASAGVGP 92
Query: 88 -GLP-LSMYSAPYPEGMAAYFPALGADQAPFYS 118
GLP LSMYSAPY +GMAAYFPALGADQAPFY+
Sbjct: 93 AGLPALSMYSAPYADGMAAYFPALGADQAPFYT 125
>gi|158293797|ref|XP_315118.4| AGAP005011-PA [Anopheles gambiae str. PEST]
gi|157016618|gb|EAA10468.4| AGAP005011-PA [Anopheles gambiae str. PEST]
Length = 575
Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/74 (81%), Positives = 68/74 (91%), Gaps = 4/74 (5%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG--GP-GLPLSMYSAPY-PEGMAAY 105
RCCDTGRPIFTDPISGQT+CSCQYDL++YQRLA+ G GP G+PLSMYSAPY PE MAAY
Sbjct: 94 RCCDTGRPIFTDPISGQTVCSCQYDLISYQRLASAGIAGPGGVPLSMYSAPYSPETMAAY 153
Query: 106 FPALGADQAPFYSS 119
FPA+GADQAPFY++
Sbjct: 154 FPAIGADQAPFYAN 167
>gi|357607495|gb|EHJ65534.1| putative iroquois-class homeodomain protein irx [Danaus plexippus]
Length = 184
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/111 (56%), Positives = 78/111 (70%), Gaps = 13/111 (11%)
Query: 14 SPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQY 73
S S+SPPP+ S S + + +P + +RCCDTGRPIF DPI+GQT+CSCQY
Sbjct: 18 STASNSPPPAQAGMSGGSGASGAARSGSPAA----ARCCDTGRPIFQDPITGQTVCSCQY 73
Query: 74 DLLNYQRLAAGGGPGLPLSMYSAPYPE-----GMAAYFPALGADQAPFYSS 119
+ LNYQRLA+ G+PLSMYSAPYP+ GMAAYFPAL ADQ PFY++
Sbjct: 74 EFLNYQRLAS----GVPLSMYSAPYPDAAAAAGMAAYFPALAADQPPFYTN 120
>gi|195327201|ref|XP_002030310.1| GM25364 [Drosophila sechellia]
gi|194119253|gb|EDW41296.1| GM25364 [Drosophila sechellia]
Length = 547
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/77 (63%), Positives = 60/77 (77%), Gaps = 6/77 (7%)
Query: 43 RSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
RS +P +RCCDTGR I+ DP+SGQTICSCQYD+LNYQRLAA G G+PL + YPEGM
Sbjct: 17 RSVNPPNRCCDTGRTIYADPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGM 70
Query: 103 AAYFPALGADQAPFYSS 119
+AY + ADQ PFY++
Sbjct: 71 SAYLSGIAADQPPFYAN 87
>gi|195126379|ref|XP_002007648.1| GI12266 [Drosophila mojavensis]
gi|193919257|gb|EDW18124.1| GI12266 [Drosophila mojavensis]
Length = 735
Score = 104 bits (259), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/76 (63%), Positives = 58/76 (76%), Gaps = 9/76 (11%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
+RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG G+PL + YPEGM+
Sbjct: 151 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 206
Query: 104 AYFPALGADQAPFYSS 119
AY + ADQ PFY++
Sbjct: 207 AYLSGIAADQPPFYAN 222
>gi|195378775|ref|XP_002048157.1| GJ11496 [Drosophila virilis]
gi|194155315|gb|EDW70499.1| GJ11496 [Drosophila virilis]
Length = 575
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/76 (63%), Positives = 58/76 (76%), Gaps = 9/76 (11%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
+RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG G+PL + YPEGM+
Sbjct: 168 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 223
Query: 104 AYFPALGADQAPFYSS 119
AY + ADQ PFY++
Sbjct: 224 AYLSGIAADQPPFYAN 239
>gi|195019738|ref|XP_001985044.1| GH14720 [Drosophila grimshawi]
gi|193898526|gb|EDV97392.1| GH14720 [Drosophila grimshawi]
Length = 669
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/76 (61%), Positives = 57/76 (75%), Gaps = 9/76 (11%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
+RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG G+PL + YPEGM+
Sbjct: 126 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 181
Query: 104 AYFPALGADQAPFYSS 119
AY + DQ PFY++
Sbjct: 182 AYLSGIAGDQPPFYAN 197
>gi|442632003|ref|NP_001261778.1| mirror, isoform C [Drosophila melanogaster]
gi|440215710|gb|AGB94471.1| mirror, isoform C [Drosophila melanogaster]
Length = 682
Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/70 (67%), Positives = 56/70 (80%), Gaps = 6/70 (8%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G G+PL + YPEGM+AY +
Sbjct: 118 RCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGI 171
Query: 110 GADQAPFYSS 119
ADQ PFY++
Sbjct: 172 AADQPPFYAN 181
>gi|195493902|ref|XP_002094612.1| GE21915 [Drosophila yakuba]
gi|194180713|gb|EDW94324.1| GE21915 [Drosophila yakuba]
Length = 642
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/70 (67%), Positives = 56/70 (80%), Gaps = 6/70 (8%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G G+PL + YPEGM+AY +
Sbjct: 119 RCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGI 172
Query: 110 GADQAPFYSS 119
ADQ PFY++
Sbjct: 173 AADQPPFYAN 182
>gi|125978763|ref|XP_001353414.1| GA10430 [Drosophila pseudoobscura pseudoobscura]
gi|195160685|ref|XP_002021205.1| GL25205 [Drosophila persimilis]
gi|54642172|gb|EAL30921.1| GA10430 [Drosophila pseudoobscura pseudoobscura]
gi|194118318|gb|EDW40361.1| GL25205 [Drosophila persimilis]
Length = 639
Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/71 (64%), Positives = 56/71 (78%), Gaps = 6/71 (8%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPA 108
+RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G G+PL + YPEGM+AY
Sbjct: 114 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSG 167
Query: 109 LGADQAPFYSS 119
+ DQ PFY++
Sbjct: 168 IAGDQPPFYAN 178
>gi|194747619|ref|XP_001956249.1| GF24692 [Drosophila ananassae]
gi|190623531|gb|EDV39055.1| GF24692 [Drosophila ananassae]
Length = 661
Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/73 (64%), Positives = 58/73 (79%), Gaps = 6/73 (8%)
Query: 47 PGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYF 106
P +RCCDTGR I+TDP+SGQTICSCQYD+L+YQRLAA G G+PL + YPEGM+AY
Sbjct: 126 PPNRCCDTGRTIYTDPVSGQTICSCQYDMLSYQRLAAAG--GVPLGV----YPEGMSAYL 179
Query: 107 PALGADQAPFYSS 119
+ ADQ PFY++
Sbjct: 180 SGIAADQPPFYAN 192
>gi|195446597|ref|XP_002070839.1| GK25452 [Drosophila willistoni]
gi|194166924|gb|EDW81825.1| GK25452 [Drosophila willistoni]
Length = 620
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 46/71 (64%), Positives = 58/71 (81%), Gaps = 6/71 (8%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPA 108
SRCC+TGR I+TDP+SGQ+ICSCQYD+L+YQRLAA G G+PL + YPEGM+AY
Sbjct: 102 SRCCETGRTIYTDPVSGQSICSCQYDMLSYQRLAAAG--GVPLGV----YPEGMSAYLSG 155
Query: 109 LGADQAPFYSS 119
+GA+Q PFYS+
Sbjct: 156 IGAEQPPFYSN 166
>gi|24663392|ref|NP_524047.2| mirror, isoform A [Drosophila melanogaster]
gi|24663396|ref|NP_729818.1| mirror, isoform B [Drosophila melanogaster]
gi|7294553|gb|AAF49894.1| mirror, isoform A [Drosophila melanogaster]
gi|23093572|gb|AAN11850.1| mirror, isoform B [Drosophila melanogaster]
Length = 641
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)
Query: 35 VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
V + P P S+ +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92 VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151
Query: 84 GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
G G+PL + YPEGM+AY + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181
>gi|2071997|gb|AAB53640.1| mirror [Drosophila melanogaster]
gi|3243091|gb|AAC23943.1| homeoprotein Sail [Drosophila melanogaster]
Length = 641
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)
Query: 35 VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
V + P P S+ +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92 VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151
Query: 84 GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
G G+PL + YPEGM+AY + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181
>gi|350404409|ref|XP_003487095.1| PREDICTED: hypothetical protein LOC100743699 [Bombus impatiens]
Length = 768
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 53/94 (56%), Positives = 60/94 (63%), Gaps = 30/94 (31%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLP--LSMYSAPYP------- 99
RCCDTGRPIFTDP++GQT+CSCQY++L YQRL A LP LSMYSAPY
Sbjct: 232 RCCDTGRPIFTDPLTGQTVCSCQYEILGGYQRLGA-----LPAALSMYSAPYAAAAXXXX 286
Query: 100 ---------------EGMAAYFPALGADQAPFYS 118
EGMAAYFP LGA+QAPFY+
Sbjct: 287 XXXXXXXXXXXAAASEGMAAYFPPLGAEQAPFYT 320
>gi|194869994|ref|XP_001972564.1| GG15588 [Drosophila erecta]
gi|190654347|gb|EDV51590.1| GG15588 [Drosophila erecta]
Length = 641
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)
Query: 35 VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
V + P P S+ +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92 VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151
Query: 84 GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
G G+PL + YPEGM+AY + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181
>gi|195589890|ref|XP_002084682.1| GD14396 [Drosophila simulans]
gi|194196691|gb|EDX10267.1| GD14396 [Drosophila simulans]
Length = 641
Score = 94.7 bits (234), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/120 (48%), Positives = 74/120 (61%), Gaps = 13/120 (10%)
Query: 7 STVAPSSSPTSSSPPPSSTRA-------SNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIF 59
STV P + TS P + A ++ AP + +P +RCCDTGR I+
Sbjct: 68 STVPPGARSTSPCGPSAGLPALSLVGAPPPGPHSSGPGGAPAPGAGNPPNRCCDTGRTIY 127
Query: 60 TDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
TDP+SGQTICSCQYD+LNYQRLAA G G+PL + YPEGM+AY + ADQ PFY++
Sbjct: 128 TDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181
>gi|383860303|ref|XP_003705630.1| PREDICTED: uncharacterized protein LOC100877027 [Megachile
rotundata]
Length = 685
Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/78 (67%), Positives = 59/78 (75%), Gaps = 11/78 (14%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
RCCDTGRPIFTDP++GQT+CSCQYDLL YQRL GG P LSMYS A E
Sbjct: 165 RCCDTGRPIFTDPLTGQTVCSCQYDLLGGYQRL--GGLPTAALSMYSAPYAAAAAAAASE 222
Query: 101 GMAAYFPALGADQAPFYS 118
GMAAYFP+LGA+QAPFY+
Sbjct: 223 GMAAYFPSLGAEQAPFYT 240
>gi|332024324|gb|EGI64523.1| Homeobox protein caupolican [Acromyrmex echinatior]
Length = 625
Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/79 (65%), Positives = 60/79 (75%), Gaps = 11/79 (13%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYP 99
+RCCDTGRPIFTDP++GQT+CSCQY+LL YQRL GG P LSMYS A
Sbjct: 86 ARCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAAS 143
Query: 100 EGMAAYFPALGADQAPFYS 118
EGMAAYFP+LGA+QAPFY+
Sbjct: 144 EGMAAYFPSLGAEQAPFYT 162
>gi|307177869|gb|EFN66829.1| Homeobox protein caupolican [Camponotus floridanus]
Length = 280
Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/78 (66%), Positives = 59/78 (75%), Gaps = 11/78 (14%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
RCCDTGRPIFTDP++GQT+CSCQY+LL YQRL GG P LSMYS A E
Sbjct: 87 RCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAASE 144
Query: 101 GMAAYFPALGADQAPFYS 118
GMAAYFP+LGA+QAPFY+
Sbjct: 145 GMAAYFPSLGAEQAPFYT 162
>gi|156547431|ref|XP_001604937.1| PREDICTED: hypothetical protein LOC100121331 [Nasonia vitripennis]
Length = 650
Score = 88.2 bits (217), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 57/92 (61%), Positives = 63/92 (68%), Gaps = 14/92 (15%)
Query: 39 PAAPRSTSPG---SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMY 94
P AP+ S G RCCDTGRPIFTDPI+GQT+CSCQY+LL YQRL G P LSMY
Sbjct: 99 PTAPQPASSGPTPGRCCDTGRPIFTDPITGQTVCSCQYELLGGYQRL--GALPTAALSMY 156
Query: 95 S--------APYPEGMAAYFPALGADQAPFYS 118
S A EGMAAYFP LGA+QAPFY+
Sbjct: 157 SAPYAAAAAAAASEGMAAYFPGLGAEQAPFYT 188
>gi|322795138|gb|EFZ17978.1| hypothetical protein SINV_13653 [Solenopsis invicta]
Length = 204
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 52/79 (65%), Positives = 60/79 (75%), Gaps = 11/79 (13%)
Query: 49 SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYP 99
+RCCDTGRPIFTDP++GQT+CSCQY+LL YQRL GG P LSMYS A
Sbjct: 88 ARCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAAS 145
Query: 100 EGMAAYFPALGADQAPFYS 118
EGMAAYFP+LGA+QAPFY+
Sbjct: 146 EGMAAYFPSLGAEQAPFYT 164
>gi|380014060|ref|XP_003691061.1| PREDICTED: homeobox protein araucan-like [Apis florea]
Length = 620
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/78 (64%), Positives = 58/78 (74%), Gaps = 11/78 (14%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
RCCDTGRPIFTDP++GQT+CSCQY+L+ YQRL G P LSMYS A E
Sbjct: 101 RCCDTGRPIFTDPLTGQTVCSCQYELIGGYQRL--GALPTAALSMYSAPYAAAAAAAASE 158
Query: 101 GMAAYFPALGADQAPFYS 118
GMAAYFP+LGA+QAPFY+
Sbjct: 159 GMAAYFPSLGAEQAPFYT 176
>gi|328777967|ref|XP_396291.3| PREDICTED: homeobox protein araucan [Apis mellifera]
Length = 622
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/78 (64%), Positives = 58/78 (74%), Gaps = 11/78 (14%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
RCCDTGRPIFTDP++GQT+CSCQY+L+ YQRL G P LSMYS A E
Sbjct: 101 RCCDTGRPIFTDPLTGQTVCSCQYELIGGYQRL--GALPTAALSMYSAPYAAAAAAAASE 158
Query: 101 GMAAYFPALGADQAPFYS 118
GMAAYFP+LGA+QAPFY+
Sbjct: 159 GMAAYFPSLGAEQAPFYT 176
>gi|340716066|ref|XP_003396524.1| PREDICTED: homeobox protein araucan-like [Bombus terrestris]
Length = 612
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/80 (63%), Positives = 58/80 (72%), Gaps = 16/80 (20%)
Query: 50 RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLP--LSMYS--------APY 98
RCCDTGRPIFTDP++GQT+CSCQY++L YQRL A LP LSMYS A
Sbjct: 90 RCCDTGRPIFTDPLTGQTVCSCQYEILGGYQRLGA-----LPAALSMYSAPYAAAAAAAA 144
Query: 99 PEGMAAYFPALGADQAPFYS 118
EGMAAYFP LGA+QAPFY+
Sbjct: 145 SEGMAAYFPPLGAEQAPFYT 164
>gi|240960565|ref|XP_002400563.1| hypothetical protein IscW_ISCW015739 [Ixodes scapularis]
gi|215490705|gb|EEC00348.1| hypothetical protein IscW_ISCW015739 [Ixodes scapularis]
Length = 110
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/74 (56%), Positives = 54/74 (72%), Gaps = 6/74 (8%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD--LLNYQRLAAGGGPGLPLSMY-SAPYPEGMAA 104
G+ CC++GRP+ TDPI+GQT+CSCQYD LLNYQRLAA GLP++MY +A Y G
Sbjct: 17 GTPCCESGRPVLTDPITGQTVCSCQYDAQLLNYQRLAAS---GLPINMYGTAAYAAGEQG 73
Query: 105 YFPALGADQAPFYS 118
+ P A+Q+ FYS
Sbjct: 74 FLPLGAAEQSAFYS 87
>gi|357607497|gb|EHJ65536.1| putative iroquois-like protein [Danaus plexippus]
Length = 456
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 63/106 (59%), Gaps = 10/106 (9%)
Query: 17 SSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-- 74
SSS PP S S + A+ S A +++P + CCDT RPI TDP+SGQT+CSCQYD
Sbjct: 41 SSSAPPLSPGGSAVAGGAL-SPGAGSHASTPAASCCDTPRPIITDPVSGQTVCSCQYDAR 99
Query: 75 --LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
L +Y RL++ + +Y APYP +P++G D + FYS
Sbjct: 100 LALTSYPRLSSAA-----VGVYGAPYPSTDQNPYPSIGVDSSAFYS 140
>gi|74054137|gb|AAZ95458.1| caupolican-like protein [Calliphora vicina]
Length = 626
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L NY R+ GG + +Y YP
Sbjct: 155 CCENGRPIMTDPVSGQTVCSCQYDSARLALTNYSRMPTGG-----VGVYGTAYPSTEQNP 209
Query: 106 FPALGADQAPFYS 118
+P++G D + FY+
Sbjct: 210 YPSIGVDSSAFYT 222
>gi|195019751|ref|XP_001985047.1| GH14716 [Drosophila grimshawi]
gi|193898529|gb|EDV97395.1| GH14716 [Drosophila grimshawi]
Length = 733
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L Y R+A P + +Y PYP
Sbjct: 116 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PSAGVGVYGGPYPSNEQNP 172
Query: 106 FPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 173 YPSIGVDNSAFYS 185
>gi|242023060|ref|XP_002431954.1| hypothetical protein Phum_PHUM562220 [Pediculus humanus corporis]
gi|212517305|gb|EEB19216.1| hypothetical protein Phum_PHUM562220 [Pediculus humanus corporis]
Length = 155
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 56/94 (59%), Gaps = 10/94 (10%)
Query: 29 NTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAG 84
N ST A + P P ST+ G+ CC+ G+P TDP+SGQT+CSCQYD L+Y RL
Sbjct: 42 NGSTAAAAAAPG-PGSTASGTSCCENGKPTITDPVSGQTVCSCQYDSARLTLSYPRLPTS 100
Query: 85 GGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
G + +Y APYP +P++G D + FYS
Sbjct: 101 G-----VGVYGAPYPSTDQNPYPSIGVDSSAFYS 129
>gi|198464914|ref|XP_001353412.2| GA10434 [Drosophila pseudoobscura pseudoobscura]
gi|198149934|gb|EAL30919.2| GA10434 [Drosophila pseudoobscura pseudoobscura]
Length = 682
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 8/78 (10%)
Query: 46 SPGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPE 100
S G CC+ GRPI TDP+SGQT+CSCQYD + Y R+A P + +Y PYP
Sbjct: 106 SEGMSCCENGRPIITDPVSGQTVCSCQYDPARLAIGGYSRMAL---PSAGVGVYGGPYPS 162
Query: 101 GMAAYFPALGADQAPFYS 118
+P++G D + FY+
Sbjct: 163 NEQNPYPSIGVDNSAFYA 180
>gi|195126385|ref|XP_002007651.1| GI12262 [Drosophila mojavensis]
gi|193919260|gb|EDW18127.1| GI12262 [Drosophila mojavensis]
Length = 718
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L Y R+A P + +Y PYP
Sbjct: 135 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PSAGVGVYGGPYPSNEQNP 191
Query: 106 FPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 192 YPSIGVDNSAFYS 204
>gi|195378781|ref|XP_002048160.1| GJ11493 [Drosophila virilis]
gi|194155318|gb|EDW70502.1| GJ11493 [Drosophila virilis]
Length = 698
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L Y R+A P + +Y PYP
Sbjct: 125 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PTAGVGVYGGPYPSNEQNP 181
Query: 106 FPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 182 YPSIGVDNSAFYS 194
>gi|195454705|ref|XP_002074363.1| GK10560 [Drosophila willistoni]
gi|194170448|gb|EDW85349.1| GK10560 [Drosophila willistoni]
Length = 736
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 46/74 (62%), Gaps = 6/74 (8%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP---LSMYSAPYPEGMAA 104
GS CC+ GRPI TDP+SGQT+CSCQYD RLA G +P + +Y APYP
Sbjct: 111 GSSCCENGRPIMTDPVSGQTVCSCQYDSA---RLALSGYSRMPAASVGVYGAPYPSSDQN 167
Query: 105 YFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 168 PYQSIGVDSSAFYS 181
>gi|194747625|ref|XP_001956252.1| GF24689 [Drosophila ananassae]
gi|190623534|gb|EDV39058.1| GF24689 [Drosophila ananassae]
Length = 663
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 8/76 (10%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
G CC+ GRPI TDP+SGQT+CSCQYD + Y R+A P + +Y PYP
Sbjct: 110 GGSCCENGRPIITDPVSGQTVCSCQYDPTRLAIGGYSRMAL---PSAGVGVYGGPYPSND 166
Query: 103 AAYFPALGADQAPFYS 118
+P++G D + FY+
Sbjct: 167 QNPYPSIGVDNSAFYA 182
>gi|312376038|gb|EFR23247.1| hypothetical protein AND_13239 [Anopheles darlingi]
Length = 175
Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 47 PGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
P + CC+ GRPI TDP+SGQT+CSCQYD L Y R+ G + +Y PYP
Sbjct: 83 PATSCCENGRPIMTDPVSGQTVCSCQYDSARLALSGYSRIPTGS-----VGVYGTPYPST 137
Query: 102 MAAYFPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 138 EQNPYPSIGVDSSAFYS 154
>gi|195126387|ref|XP_002007652.1| GI12261 [Drosophila mojavensis]
gi|193919261|gb|EDW18128.1| GI12261 [Drosophila mojavensis]
Length = 745
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 57/102 (55%), Gaps = 16/102 (15%)
Query: 22 PSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-----LL 76
P + + ++AV +TP + GS CC+ GRPI TDP+SGQT+CSCQYD L
Sbjct: 97 PDAITGGSLDSSAVGTTP------TTGSSCCENGRPIMTDPVSGQTVCSCQYDSARLALS 150
Query: 77 NYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
+Y RL P + +Y +PYP + ++G D + FYS
Sbjct: 151 SYSRL-----PAASVGVYGSPYPSTDQNPYQSIGVDSSAFYS 187
>gi|195378783|ref|XP_002048161.1| GJ11492 [Drosophila virilis]
gi|194155319|gb|EDW70503.1| GJ11492 [Drosophila virilis]
Length = 756
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 46/76 (60%), Gaps = 10/76 (13%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
GS CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y +PYP
Sbjct: 130 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGSPYPSTD 184
Query: 103 AAYFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 185 QNPYQSIGVDSSAFYS 200
>gi|158293795|ref|XP_315117.4| AGAP005010-PA [Anopheles gambiae str. PEST]
gi|157016617|gb|EAA10465.4| AGAP005010-PA [Anopheles gambiae str. PEST]
Length = 782
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 6/75 (8%)
Query: 47 PGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP---LSMYSAPYPEGMA 103
P + CC+ GRPI TDP+SGQT+CSCQYD RLA G +P + +Y PYP
Sbjct: 86 PTTSCCENGRPIMTDPVSGQTVCSCQYD---SARLALSGYSRIPTGSVGVYGTPYPSTEQ 142
Query: 104 AYFPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 143 NPYPSIGVDSSAFYS 157
>gi|195160679|ref|XP_002021202.1| GL25202 [Drosophila persimilis]
gi|194118315|gb|EDW40358.1| GL25202 [Drosophila persimilis]
Length = 473
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 10/76 (13%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
GS CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 121 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 175
Query: 103 AAYFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 176 QNPYQSIGVDSSAFYS 191
>gi|195019756|ref|XP_001985048.1| GH14715 [Drosophila grimshawi]
gi|193898530|gb|EDV97396.1| GH14715 [Drosophila grimshawi]
Length = 791
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 46/76 (60%), Gaps = 10/76 (13%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
GS CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y +PYP
Sbjct: 127 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGSPYPSTD 181
Query: 103 AAYFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 182 QNPYQSIGVDSSAFYS 197
>gi|198464912|ref|XP_001353411.2| GA10403 [Drosophila pseudoobscura pseudoobscura]
gi|198149933|gb|EAL30918.2| GA10403 [Drosophila pseudoobscura pseudoobscura]
Length = 780
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 10/76 (13%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
GS CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 123 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 177
Query: 103 AAYFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 178 QNPYQSIGVDSSAFYS 193
>gi|194747627|ref|XP_001956253.1| GF24688 [Drosophila ananassae]
gi|190623535|gb|EDV39059.1| GF24688 [Drosophila ananassae]
Length = 731
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 44/76 (57%), Gaps = 10/76 (13%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
G CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 130 GGSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 184
Query: 103 AAYFPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 185 QNPYQSIGVDSSAFYS 200
>gi|195327183|ref|XP_002030301.1| GM25355 [Drosophila sechellia]
gi|194119244|gb|EDW41287.1| GM25355 [Drosophila sechellia]
Length = 384
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199
>gi|54650768|gb|AAV36963.1| LP02208p [Drosophila melanogaster]
Length = 650
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199
>gi|195493895|ref|XP_002094609.1| GE21912 [Drosophila yakuba]
gi|194180710|gb|EDW94321.1| GE21912 [Drosophila yakuba]
Length = 719
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 137 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 191
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 192 YQSIGVDSSAFYS 204
>gi|24663380|ref|NP_524045.2| araucan, isoform A [Drosophila melanogaster]
gi|19863700|sp|Q24248.2|ARA_DROME RecName: Full=Homeobox protein araucan
gi|7294555|gb|AAF49896.1| araucan, isoform A [Drosophila melanogaster]
Length = 717
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199
>gi|1212885|emb|CAA64486.1| araucan homeoprotein [Drosophila melanogaster]
Length = 716
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 131 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 185
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 186 YQSIGVDSSAFYS 198
>gi|281366150|ref|NP_001163432.1| araucan, isoform B [Drosophila melanogaster]
gi|272455181|gb|ACZ94703.1| araucan, isoform B [Drosophila melanogaster]
Length = 709
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199
>gi|194869979|ref|XP_001972561.1| GG15584 [Drosophila erecta]
gi|190654344|gb|EDV51587.1| GG15584 [Drosophila erecta]
Length = 727
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
CC+ GRPI TDP+SGQT+CSCQYD L +Y RL P + +Y PYP
Sbjct: 142 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 196
Query: 106 FPALGADQAPFYS 118
+ ++G D + FYS
Sbjct: 197 YQSIGVDSSAFYS 209
>gi|195454701|ref|XP_002074361.1| GK10562 [Drosophila willistoni]
gi|194170446|gb|EDW85347.1| GK10562 [Drosophila willistoni]
Length = 661
Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 12/83 (14%)
Query: 41 APRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYS 95
P STS CC+ GR I TDP++G T+CSCQY+ + Y R+A P + +Y
Sbjct: 109 VPESTS----CCENGRRIITDPVTGHTVCSCQYESTHLTIGGYSRMAL---PSAGIGVYG 161
Query: 96 APYPEGMAAYFPALGADQAPFYS 118
PYP +P++G D + FY+
Sbjct: 162 GPYPSNEQNPYPSIGVDNSAFYA 184
>gi|259013289|ref|NP_001158359.1| iroquois [Saccoglossus kowalevskii]
gi|32307785|gb|AAP79289.1| iroquois [Saccoglossus kowalevskii]
Length = 500
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 45/75 (60%), Gaps = 11/75 (14%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEGMA 103
G+ CC++GRPI TDP +GQT+CSCQYD + + R+A GLPL MY + Y
Sbjct: 24 GTSCCESGRPIMTDPHTGQTVCSCQYDASRAAIMHSRVA-----GLPL-MYGSSY-TAEQ 76
Query: 104 AYFPALGADQAPFYS 118
Y P G D + FYS
Sbjct: 77 GYVPTFGTDPSAFYS 91
>gi|189235456|ref|XP_971623.2| PREDICTED: iroquois-like protein [Tribolium castaneum]
Length = 632
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 53/83 (63%), Gaps = 5/83 (6%)
Query: 36 TSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYS 95
T +P ST P ++CC++GRP+ TDP++GQ++CSCQYD RLAA P + +Y
Sbjct: 52 TLSPGGTSSTGP-TQCCESGRPLVTDPVTGQSVCSCQYDPA---RLAALQYP-RSVGVYG 106
Query: 96 APYPEGMAAYFPALGADQAPFYS 118
APYP +P++G + + FYS
Sbjct: 107 APYPSTDQNPYPSIGVESSAFYS 129
>gi|270004888|gb|EFA01336.1| iroquois-like protein [Tribolium castaneum]
Length = 547
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 53/83 (63%), Gaps = 5/83 (6%)
Query: 36 TSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYS 95
T +P ST P ++CC++GRP+ TDP++GQ++CSCQYD RLAA P + +Y
Sbjct: 130 TLSPGGTSSTGP-TQCCESGRPLVTDPVTGQSVCSCQYDPA---RLAALQYP-RSVGVYG 184
Query: 96 APYPEGMAAYFPALGADQAPFYS 118
APYP +P++G + + FYS
Sbjct: 185 APYPSTDQNPYPSIGVESSAFYS 207
>gi|332024321|gb|EGI64520.1| Homeobox protein araucan [Acromyrmex echinatior]
Length = 643
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 33 TAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLS 92
T +TP +T PG CC+ GRP+ TDP++GQT+CSCQYD N R A P LP +
Sbjct: 44 TGGATTPVGGAATGPG--CCENGRPMMTDPVTGQTVCSCQYD--NAARFALSTYPRLPTA 99
Query: 93 MYSAPYPEGMAAY-----FPALGADQAPFYS 118
+ YP + +P++G D + FYS
Sbjct: 100 TSYSSYPTPTPSTTDQGPYPSIGMDSSAFYS 130
>gi|345484618|ref|XP_001604958.2| PREDICTED: homeobox protein caupolican-like [Nasonia vitripennis]
Length = 725
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 8/74 (10%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG----GPG-LPLSMYSAPYPEGMAAY 105
CC+ GRP+ TDP++GQT+CSCQYD + RLA G GPG S Y P P G
Sbjct: 121 CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLGPGATSYSSYPTPTPSGSDQG 178
Query: 106 -FPALGADQAPFYS 118
+P++G D + FYS
Sbjct: 179 PYPSIGMDSSAFYS 192
>gi|405965487|gb|EKC30856.1| Homeobox protein caupolican [Crassostrea gigas]
Length = 485
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/69 (50%), Positives = 44/69 (63%), Gaps = 11/69 (15%)
Query: 31 STTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD--LLNYQRLAAGGGPG 88
++T +T TP+ P ST CC+ GRPI DP +GQTICSCQY+ LL Y RLA G
Sbjct: 15 TSTQMTKTPS-PTSTRS---CCENGRPIMADPHTGQTICSCQYNPALLGYSRLA-----G 65
Query: 89 LPLSMYSAP 97
LP +Y +P
Sbjct: 66 LPEGVYGSP 74
>gi|195160683|ref|XP_002021204.1| GL25204 [Drosophila persimilis]
gi|194118317|gb|EDW40360.1| GL25204 [Drosophila persimilis]
Length = 688
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 29/40 (72%), Gaps = 3/40 (7%)
Query: 46 SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
S G CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 106 SEGMSCCENGRPIITDPVSGQTVCSCQYDPA---RLAIGG 142
>gi|405969201|gb|EKC34186.1| Homeobox protein araucan [Crassostrea gigas]
Length = 493
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/71 (50%), Positives = 43/71 (60%), Gaps = 10/71 (14%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYD---LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFP 107
CC+ GRPI TDP +GQT+CSCQY L Y R+ P L S+YSA P G +Y P
Sbjct: 38 CCENGRPIMTDPHTGQTVCSCQYGSGLLSAYSRV-----PSLSDSVYSAG-PYGSQSYMP 91
Query: 108 ALGADQAPFYS 118
LG D + FYS
Sbjct: 92 -LGTDPSAFYS 101
>gi|350404412|ref|XP_003487096.1| PREDICTED: homeobox protein caupolican-like [Bombus impatiens]
Length = 624
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQT+CSCQYD + RLA G P L S Y P
Sbjct: 90 TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 145
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 146 PSTTDQGPYPSIGMDSSAFY 165
>gi|340716546|ref|XP_003396758.1| PREDICTED: homeobox protein caupolican-like isoform 1 [Bombus
terrestris]
Length = 674
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQT+CSCQYD + RLA G P L S Y P
Sbjct: 91 TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166
>gi|340716548|ref|XP_003396759.1| PREDICTED: homeobox protein caupolican-like isoform 2 [Bombus
terrestris]
Length = 653
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQT+CSCQYD + RLA G P L S Y P
Sbjct: 70 TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 125
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 126 PSTTDQGPYPSIGMDSSAFY 145
>gi|383860361|ref|XP_003705659.1| PREDICTED: homeobox protein caupolican-like [Megachile rotundata]
Length = 679
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQT+CSCQYD + RLA G P L S Y P
Sbjct: 91 ATGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166
>gi|195327189|ref|XP_002030304.1| GM25358 [Drosophila sechellia]
gi|194119247|gb|EDW41290.1| GM25358 [Drosophila sechellia]
Length = 680
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137
>gi|54650584|gb|AAV36871.1| RE59515p [Drosophila melanogaster]
Length = 693
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137
>gi|24663384|ref|NP_524046.2| caupolican [Drosophila melanogaster]
gi|19860750|sp|P54269.2|CAUP_DROME RecName: Full=Homeobox protein caupolican
gi|7294554|gb|AAF49895.1| caupolican [Drosophila melanogaster]
gi|60677775|gb|AAX33394.1| RE64213p [Drosophila melanogaster]
gi|220945950|gb|ACL85518.1| caup-PA [synthetic construct]
Length = 693
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137
>gi|195493897|ref|XP_002094610.1| GE21913 [Drosophila yakuba]
gi|194180711|gb|EDW94322.1| GE21913 [Drosophila yakuba]
Length = 689
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 109 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 140
>gi|194869983|ref|XP_001972562.1| GG15585 [Drosophila erecta]
gi|190654345|gb|EDV51588.1| GG15585 [Drosophila erecta]
Length = 813
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 227 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 258
>gi|307197225|gb|EFN78544.1| Homeobox protein araucan [Harpegnathos saltator]
Length = 670
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 13/84 (15%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP--------LSMYS 95
+T PG CC+ GRP+ TDP++GQT+CSCQYD + RLA G P + S Y
Sbjct: 91 TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPSVARLGPAAASYSSYP 146
Query: 96 APYPEGMAAY-FPALGADQAPFYS 118
P P +P++G D + FYS
Sbjct: 147 TPTPSTTDQGPYPSIGMDSSAFYS 170
>gi|380014099|ref|XP_003691080.1| PREDICTED: homeobox protein araucan-like [Apis florea]
Length = 382
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQ++CSCQYD + RLA G P L S Y P
Sbjct: 91 ATGPG--CCENGRPMMTDPVTGQSVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166
>gi|110760661|ref|XP_396292.3| PREDICTED: homeobox protein araucan-like [Apis mellifera]
Length = 677
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 10/80 (12%)
Query: 44 STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
+T PG CC+ GRP+ TDP++GQ++CSCQYD + RLA G P L S Y P
Sbjct: 91 ATGPG--CCENGRPMMTDPVTGQSVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146
Query: 99 PEGMAAY-FPALGADQAPFY 117
P +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166
>gi|195589880|ref|XP_002084677.1| GD14390 [Drosophila simulans]
gi|194196686|gb|EDX10262.1| GD14390 [Drosophila simulans]
Length = 687
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
CC+ GRPI T+P+SGQT+CSCQYD RLA GG
Sbjct: 108 CCENGRPIITNPVSGQTVCSCQYDP---ARLAIGG 139
>gi|1212887|emb|CAA64485.1| caupolican homeoprotein [Drosophila melanogaster]
Length = 693
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 30/40 (75%), Gaps = 4/40 (10%)
Query: 46 SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
S GSRC + GRPI TDP+SGQT+CSCQYD RLA GG
Sbjct: 102 SRGSRC-ENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137
>gi|321467887|gb|EFX78875.1| hypothetical protein DAPPUDRAFT_104948 [Daphnia pulex]
Length = 242
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 46/94 (48%), Gaps = 28/94 (29%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRL------------------------AAGGG 86
CCD+GRP+ TDP++GQT+CSCQY+ L Q L A G
Sbjct: 81 CCDSGRPLVTDPVTGQTVCSCQYERL--QALQNGLAAGYGPAAAAAAARLAAVSSAVTGV 138
Query: 87 PGLPLSMYSAPYPEGMAAYFPALGADQ-APFYSS 119
GL +YSA YP +P+LG D FYSS
Sbjct: 139 TGLH-GVYSAAYPSSDQNPYPSLGMDSTTAFYSS 171
>gi|193083635|gb|ACF10238.1| iroquois B [Branchiostoma lanceolatum]
Length = 494
Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y A G
Sbjct: 28 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIAAFG 85
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 86 ADPSAFYS 93
>gi|157116231|ref|XP_001658393.1| iroquois-class homeodomain protein irx [Aedes aegypti]
gi|108876562|gb|EAT40787.1| AAEL007505-PA [Aedes aegypti]
Length = 363
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 10/65 (15%)
Query: 59 FTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQ 113
TDP+SGQT+CSCQYD L +Y R+ G + +Y PYP +P++G D
Sbjct: 1 MTDPVSGQTVCSCQYDSARLALSSYSRIPTGS-----VGVYGTPYPSTEQNPYPSIGVDS 55
Query: 114 APFYS 118
+ FYS
Sbjct: 56 SAFYS 60
>gi|193083641|gb|ACF10241.1| iroquois C [Branchiostoma floridae]
Length = 494
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYESRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|260834613|ref|XP_002612304.1| iroquois 3 [Branchiostoma floridae]
gi|229297681|gb|EEN68313.1| iroquois 3 [Branchiostoma floridae]
Length = 494
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|193083637|gb|ACF10239.1| iroquois B [Branchiostoma floridae]
Length = 496
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y A G
Sbjct: 28 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIAAFG 85
Query: 111 ADQAPFYS 118
D + FYS
Sbjct: 86 GDPSAFYS 93
>gi|193083639|gb|ACF10240.1| iroquois C [Branchiostoma lanceolatum]
Length = 494
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|193083631|gb|ACF10236.1| iroquois A isoform 2 [Branchiostoma lanceolatum]
Length = 486
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|193083629|gb|ACF10235.1| iroquois A isoform 1 [Branchiostoma lanceolatum]
Length = 461
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|209529858|gb|ACI49802.1| iroquois A isoform 1 [Branchiostoma floridae]
Length = 459
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|193083633|gb|ACF10237.1| iroquois A isoform 1 [Branchiostoma floridae]
Length = 461
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 51 CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
CC+ GRPI +DP +GQ +CSCQY+ + G S+Y +PY Y + G
Sbjct: 25 CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYITSFG 82
Query: 111 ADQAPFYS 118
AD + FYS
Sbjct: 83 ADPSAFYS 90
>gi|193788691|ref|NP_001123285.1| iroquois homeobox A [Strongylocentrotus purpuratus]
gi|167859058|gb|ACA04464.1| IrxA [Strongylocentrotus purpuratus]
Length = 605
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 10/78 (12%)
Query: 46 SPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
+P + CC++GRP+ +DP GQ +C D LL+ R++ GLP ++YS+PY
Sbjct: 23 APATTCCESGRPVASDPHPGQAVCCPPLDSRAALLHSARVS-----GLPTALYSSPYAAA 77
Query: 102 MAAY-FPALGADQAPFYS 118
A + AD + FYS
Sbjct: 78 AAEQGYVHFAADPSAFYS 95
>gi|322366536|gb|ADW95342.1| IrxA [Paracentrotus lividus]
Length = 602
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 10/78 (12%)
Query: 46 SPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
+P + CC++GRP+ +DP GQ +C D LL+ R++ GLP ++YS+PY
Sbjct: 25 APATTCCESGRPVASDPHPGQAVCCPPLDSRAALLHSARVS-----GLPTALYSSPYAAA 79
Query: 102 MAAY-FPALGADQAPFYS 118
A + AD + FYS
Sbjct: 80 AAEQGYVHFAADPSAFYS 97
>gi|71423067|ref|XP_812329.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877099|gb|EAN90478.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 780
Score = 39.3 bits (90), Expect = 0.31, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 14/65 (21%)
Query: 7 STVAPSSSPTSSSPPPSST--------RASNTSTTAVTSTPA------APRSTSPGSRCC 52
+TV PS+SP++S+P P + R+S+ S AVT+TP+ R TSPG
Sbjct: 136 TTVPPSTSPSTSAPLPENVLPLRQNRGRSSHASWVAVTTTPSLSPSEPKSRETSPGRATA 195
Query: 53 DTGRP 57
T P
Sbjct: 196 ATRSP 200
>gi|1098898|gb|AAC46961.1| CTRP [Plasmodium falciparum]
Length = 2098
Score = 38.9 bits (89), Expect = 0.43, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 9 VAPSSSPTSSSP--PPSSTRASNTSTTAVT---STPAAPRSTSPGSRCCDT-GRPIFTDP 62
V P SP+ S+P P ST + + +T + STP P ++PGS T G P + P
Sbjct: 761 VLPPGSPSESTPGSPSESTPGNPSESTPGSPSESTPGNPSESTPGSPSESTPGSPSESTP 820
Query: 63 ISG-QTICSCQYDL 75
SG + +C YDL
Sbjct: 821 CSGTECLCHNTYDL 834
>gi|218442531|ref|YP_002380852.1| hypothetical protein PCC7424_5461 [Cyanothece sp. PCC 7424]
gi|218175302|gb|ACK74033.1| hypothetical protein PCC7424_5461 [Cyanothece sp. PCC 7424]
Length = 394
Score = 38.1 bits (87), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%)
Query: 10 APSSSPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDT 54
A ++ S+ PSSTRA+N + T + STP RSTSP +R D
Sbjct: 339 AANAGRRSAQSVPSSTRAANRTVTVIPSTPRRMRSTSPTTRTIDV 383
>gi|410917886|ref|XP_003972417.1| PREDICTED: general transcription factor 3C polypeptide 1-like
[Takifugu rubripes]
Length = 2211
Score = 36.2 bits (82), Expect = 2.7, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 23/39 (58%)
Query: 12 SSSPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSR 50
S+ P SS PPS+ AS+TS +A TPA+ R G R
Sbjct: 458 SADPQVSSTPPSTPAASSTSESANAKTPASRRKQKAGKR 496
>gi|444301783|gb|AGD98928.1| Irx [Nematostella vectensis]
Length = 451
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 9/72 (12%)
Query: 48 GSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPE-GMAAYF 106
G+ CCD+ R + D + G ++C CQ+D + R+A+ +P+S+ P P G A +
Sbjct: 28 GNSCCDSVRQVH-DGMPGHSVCPCQFDRVLQHRVAS-----VPVSVL--PRPAFGEAGFS 79
Query: 107 PALGADQAPFYS 118
G + FYS
Sbjct: 80 LGSGTGLSAFYS 91
>gi|386836417|ref|YP_006241475.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374096718|gb|AEY85602.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451789776|gb|AGF59825.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 1024
Score = 35.4 bits (80), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 9/85 (10%)
Query: 3 KIPESTVAPSSSPTSSSPPPSSTRASNTSTTAVTSTPAA------PRSTSPGSRCCDTGR 56
+IP T P PTS+ P ++TR + ++ T T AA P S PG R D R
Sbjct: 407 RIP-WTAYPVDVPTSARPWQATTRRALVNSFGFTGTIAAAVVEQAPPSAEPGPRPGDGDR 465
Query: 57 PIFTDPISGQTICSCQYDLLNYQRL 81
+FT +S ++ S Q L Y++L
Sbjct: 466 HVFT--LSAKSAASLQLQLDRYRKL 488
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.310 0.125 0.378
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,205,079,079
Number of Sequences: 23463169
Number of extensions: 95884155
Number of successful extensions: 1309229
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2044
Number of HSP's successfully gapped in prelim test: 2810
Number of HSP's that attempted gapping in prelim test: 1219754
Number of HSP's gapped (non-prelim): 72395
length of query: 122
length of database: 8,064,228,071
effective HSP length: 89
effective length of query: 33
effective length of database: 5,976,006,030
effective search space: 197208198990
effective search space used: 197208198990
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 69 (31.2 bits)