BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy5912
         (122 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|91079258|ref|XP_971676.1| PREDICTED: similar to iroquois-class homeodomain protein irx
           [Tribolium castaneum]
          Length = 458

 Score =  132 bits (333), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 67/88 (76%), Positives = 76/88 (86%), Gaps = 3/88 (3%)

Query: 34  AVTSTPAAPRST-SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-L 91
           AVTS+   P S  +P  RCCDTGRP+FTDP++GQ++CSCQY+LL YQRL AGG PGLP L
Sbjct: 31  AVTSSAPTPGSPPAPSQRCCDTGRPLFTDPLTGQSVCSCQYELLGYQRL-AGGVPGLPAL 89

Query: 92  SMYSAPYPEGMAAYFPALGADQAPFYSS 119
           SMYSAPYPEGMAAYFPALGADQAPFY+S
Sbjct: 90  SMYSAPYPEGMAAYFPALGADQAPFYTS 117


>gi|270004889|gb|EFA01337.1| mirror [Tribolium castaneum]
          Length = 533

 Score =  132 bits (333), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 67/88 (76%), Positives = 76/88 (86%), Gaps = 3/88 (3%)

Query: 34  AVTSTPAAPRST-SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-L 91
           AVTS+   P S  +P  RCCDTGRP+FTDP++GQ++CSCQY+LL YQRL AGG PGLP L
Sbjct: 2   AVTSSAPTPGSPPAPSQRCCDTGRPLFTDPLTGQSVCSCQYELLGYQRL-AGGVPGLPAL 60

Query: 92  SMYSAPYPEGMAAYFPALGADQAPFYSS 119
           SMYSAPYPEGMAAYFPALGADQAPFY+S
Sbjct: 61  SMYSAPYPEGMAAYFPALGADQAPFYTS 88


>gi|157116233|ref|XP_001658394.1| iroquois-class homeodomain protein irx [Aedes aegypti]
 gi|108876563|gb|EAT40788.1| AAEL007502-PA [Aedes aegypti]
          Length = 540

 Score =  130 bits (328), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 72/102 (70%), Positives = 80/102 (78%), Gaps = 16/102 (15%)

Query: 34  AVTST-PAAPRSTSPGS-----------RCCDTGRPIFTDPISGQTICSCQYDLLNYQRL 81
           AVT+T P   RSTSP S           RCCDTGRPIFTDPISGQT+CSCQYDL+NYQRL
Sbjct: 54  AVTTTVPPGARSTSPCSATSGPPTSGSGRCCDTGRPIFTDPISGQTVCSCQYDLINYQRL 113

Query: 82  AAGG--GP-GLPLSMYSAPY-PEGMAAYFPALGADQAPFYSS 119
           A+ G  GP G+PLSMYSAPY PE MAAYFPA+GADQAPFY++
Sbjct: 114 ASAGIAGPGGVPLSMYSAPYSPETMAAYFPAIGADQAPFYAN 155


>gi|328697559|ref|XP_001952319.2| PREDICTED: hypothetical protein LOC100169212 [Acyrthosiphon pisum]
          Length = 599

 Score =  130 bits (326), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 59/72 (81%), Positives = 66/72 (91%), Gaps = 2/72 (2%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
           RCCDTGRPIFTDPI+GQT+CSCQYDLL+YQRLAA G P   LSMY+APYPEGMAAYFPAL
Sbjct: 125 RCCDTGRPIFTDPITGQTVCSCQYDLLSYQRLAASGVPA--LSMYTAPYPEGMAAYFPAL 182

Query: 110 GADQAPFYSSAS 121
           G+DQ PFYS+A+
Sbjct: 183 GSDQPPFYSAAA 194


>gi|170027824|ref|XP_001841797.1| iroquois-class homeodomain protein irx [Culex quinquefasciatus]
 gi|167862367|gb|EDS25750.1| iroquois-class homeodomain protein irx [Culex quinquefasciatus]
          Length = 512

 Score =  129 bits (323), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 67/95 (70%), Positives = 79/95 (83%), Gaps = 6/95 (6%)

Query: 31  STTAVTSTPAAPRSTSPGS-RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG---G 86
           ST+  ++T   P ++  GS RCCDTGRPIFTDPISGQT+CSCQYDL+NYQRLA+ G   G
Sbjct: 35  STSPCSATSGGPPTSGSGSGRCCDTGRPIFTDPISGQTVCSCQYDLINYQRLASAGIATG 94

Query: 87  P-GLPLSMYSAPY-PEGMAAYFPALGADQAPFYSS 119
           P G+PLSMYSAPY PE MAAYFPA+GADQAPFY++
Sbjct: 95  PGGVPLSMYSAPYSPETMAAYFPAIGADQAPFYAN 129


>gi|312372585|gb|EFR20514.1| hypothetical protein AND_19963 [Anopheles darlingi]
          Length = 241

 Score =  128 bits (322), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 80/133 (60%), Positives = 96/133 (72%), Gaps = 17/133 (12%)

Query: 1   MLKIPESTVAPSSSPTSSSPPPSSTRASNTSTT---AVTSTPAAPRST------SPGS-R 50
           M KI  +   PS+    SSPP +S   + T+T    A +S+P +  S       +PGS R
Sbjct: 10  MFKINTTPTVPSAP---SSPPTASVAMAVTTTVPPSARSSSPCSATSGPPTSGPTPGSGR 66

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAG--GGP-GLPLSMYSAPY-PEGMAAYF 106
           CCDTGRPIFTDPISGQT+CSCQYDL+NYQRLA+   GGP G+PLSMYSAPY PE MAAYF
Sbjct: 67  CCDTGRPIFTDPISGQTVCSCQYDLINYQRLASAGIGGPGGVPLSMYSAPYSPETMAAYF 126

Query: 107 PALGADQAPFYSS 119
           PA+GADQAPFY++
Sbjct: 127 PAIGADQAPFYAN 139


>gi|242023068|ref|XP_002431958.1| hypothetical protein Phum_PHUM562460 [Pediculus humanus corporis]
 gi|212517309|gb|EEB19220.1| hypothetical protein Phum_PHUM562460 [Pediculus humanus corporis]
          Length = 163

 Score =  127 bits (319), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 71/93 (76%), Positives = 80/93 (86%), Gaps = 5/93 (5%)

Query: 30  TSTTAVTS-TPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLA-AGGGP 87
           T+ TAVT+ T   P S +PG RCCDTGRP+FTDPI+GQT+CSCQYDLL+YQRLA AG GP
Sbjct: 34  TNATAVTAGTGTRPASPTPG-RCCDTGRPLFTDPITGQTVCSCQYDLLSYQRLASAGVGP 92

Query: 88  -GLP-LSMYSAPYPEGMAAYFPALGADQAPFYS 118
            GLP LSMYSAPY +GMAAYFPALGADQAPFY+
Sbjct: 93  AGLPALSMYSAPYADGMAAYFPALGADQAPFYT 125


>gi|158293797|ref|XP_315118.4| AGAP005011-PA [Anopheles gambiae str. PEST]
 gi|157016618|gb|EAA10468.4| AGAP005011-PA [Anopheles gambiae str. PEST]
          Length = 575

 Score =  125 bits (315), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/74 (81%), Positives = 68/74 (91%), Gaps = 4/74 (5%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG--GP-GLPLSMYSAPY-PEGMAAY 105
           RCCDTGRPIFTDPISGQT+CSCQYDL++YQRLA+ G  GP G+PLSMYSAPY PE MAAY
Sbjct: 94  RCCDTGRPIFTDPISGQTVCSCQYDLISYQRLASAGIAGPGGVPLSMYSAPYSPETMAAY 153

Query: 106 FPALGADQAPFYSS 119
           FPA+GADQAPFY++
Sbjct: 154 FPAIGADQAPFYAN 167


>gi|357607495|gb|EHJ65534.1| putative iroquois-class homeodomain protein irx [Danaus plexippus]
          Length = 184

 Score =  106 bits (265), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/111 (56%), Positives = 78/111 (70%), Gaps = 13/111 (11%)

Query: 14  SPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQY 73
           S  S+SPPP+    S  S  +  +   +P +    +RCCDTGRPIF DPI+GQT+CSCQY
Sbjct: 18  STASNSPPPAQAGMSGGSGASGAARSGSPAA----ARCCDTGRPIFQDPITGQTVCSCQY 73

Query: 74  DLLNYQRLAAGGGPGLPLSMYSAPYPE-----GMAAYFPALGADQAPFYSS 119
           + LNYQRLA+    G+PLSMYSAPYP+     GMAAYFPAL ADQ PFY++
Sbjct: 74  EFLNYQRLAS----GVPLSMYSAPYPDAAAAAGMAAYFPALAADQPPFYTN 120


>gi|195327201|ref|XP_002030310.1| GM25364 [Drosophila sechellia]
 gi|194119253|gb|EDW41296.1| GM25364 [Drosophila sechellia]
          Length = 547

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 49/77 (63%), Positives = 60/77 (77%), Gaps = 6/77 (7%)

Query: 43  RSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           RS +P +RCCDTGR I+ DP+SGQTICSCQYD+LNYQRLAA G  G+PL +    YPEGM
Sbjct: 17  RSVNPPNRCCDTGRTIYADPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGM 70

Query: 103 AAYFPALGADQAPFYSS 119
           +AY   + ADQ PFY++
Sbjct: 71  SAYLSGIAADQPPFYAN 87


>gi|195126379|ref|XP_002007648.1| GI12266 [Drosophila mojavensis]
 gi|193919257|gb|EDW18124.1| GI12266 [Drosophila mojavensis]
          Length = 735

 Score =  104 bits (259), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 48/76 (63%), Positives = 58/76 (76%), Gaps = 9/76 (11%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
           +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG      G+PL +    YPEGM+
Sbjct: 151 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 206

Query: 104 AYFPALGADQAPFYSS 119
           AY   + ADQ PFY++
Sbjct: 207 AYLSGIAADQPPFYAN 222


>gi|195378775|ref|XP_002048157.1| GJ11496 [Drosophila virilis]
 gi|194155315|gb|EDW70499.1| GJ11496 [Drosophila virilis]
          Length = 575

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/76 (63%), Positives = 58/76 (76%), Gaps = 9/76 (11%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
           +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG      G+PL +    YPEGM+
Sbjct: 168 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 223

Query: 104 AYFPALGADQAPFYSS 119
           AY   + ADQ PFY++
Sbjct: 224 AYLSGIAADQPPFYAN 239


>gi|195019738|ref|XP_001985044.1| GH14720 [Drosophila grimshawi]
 gi|193898526|gb|EDV97392.1| GH14720 [Drosophila grimshawi]
          Length = 669

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/76 (61%), Positives = 57/76 (75%), Gaps = 9/76 (11%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGG-----PGLPLSMYSAPYPEGMA 103
           +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA GG      G+PL +    YPEGM+
Sbjct: 126 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAGGLVAGPGGVPLGV----YPEGMS 181

Query: 104 AYFPALGADQAPFYSS 119
           AY   +  DQ PFY++
Sbjct: 182 AYLSGIAGDQPPFYAN 197


>gi|442632003|ref|NP_001261778.1| mirror, isoform C [Drosophila melanogaster]
 gi|440215710|gb|AGB94471.1| mirror, isoform C [Drosophila melanogaster]
          Length = 682

 Score =  103 bits (256), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/70 (67%), Positives = 56/70 (80%), Gaps = 6/70 (8%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
           RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G  G+PL +    YPEGM+AY   +
Sbjct: 118 RCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGI 171

Query: 110 GADQAPFYSS 119
            ADQ PFY++
Sbjct: 172 AADQPPFYAN 181


>gi|195493902|ref|XP_002094612.1| GE21915 [Drosophila yakuba]
 gi|194180713|gb|EDW94324.1| GE21915 [Drosophila yakuba]
          Length = 642

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 47/70 (67%), Positives = 56/70 (80%), Gaps = 6/70 (8%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPAL 109
           RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G  G+PL +    YPEGM+AY   +
Sbjct: 119 RCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGI 172

Query: 110 GADQAPFYSS 119
            ADQ PFY++
Sbjct: 173 AADQPPFYAN 182


>gi|125978763|ref|XP_001353414.1| GA10430 [Drosophila pseudoobscura pseudoobscura]
 gi|195160685|ref|XP_002021205.1| GL25205 [Drosophila persimilis]
 gi|54642172|gb|EAL30921.1| GA10430 [Drosophila pseudoobscura pseudoobscura]
 gi|194118318|gb|EDW40361.1| GL25205 [Drosophila persimilis]
          Length = 639

 Score =  102 bits (254), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 46/71 (64%), Positives = 56/71 (78%), Gaps = 6/71 (8%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPA 108
           +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA G  G+PL +    YPEGM+AY   
Sbjct: 114 NRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSG 167

Query: 109 LGADQAPFYSS 119
           +  DQ PFY++
Sbjct: 168 IAGDQPPFYAN 178


>gi|194747619|ref|XP_001956249.1| GF24692 [Drosophila ananassae]
 gi|190623531|gb|EDV39055.1| GF24692 [Drosophila ananassae]
          Length = 661

 Score =  102 bits (253), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/73 (64%), Positives = 58/73 (79%), Gaps = 6/73 (8%)

Query: 47  PGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYF 106
           P +RCCDTGR I+TDP+SGQTICSCQYD+L+YQRLAA G  G+PL +    YPEGM+AY 
Sbjct: 126 PPNRCCDTGRTIYTDPVSGQTICSCQYDMLSYQRLAAAG--GVPLGV----YPEGMSAYL 179

Query: 107 PALGADQAPFYSS 119
             + ADQ PFY++
Sbjct: 180 SGIAADQPPFYAN 192


>gi|195446597|ref|XP_002070839.1| GK25452 [Drosophila willistoni]
 gi|194166924|gb|EDW81825.1| GK25452 [Drosophila willistoni]
          Length = 620

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 46/71 (64%), Positives = 58/71 (81%), Gaps = 6/71 (8%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPA 108
           SRCC+TGR I+TDP+SGQ+ICSCQYD+L+YQRLAA G  G+PL +    YPEGM+AY   
Sbjct: 102 SRCCETGRTIYTDPVSGQSICSCQYDMLSYQRLAAAG--GVPLGV----YPEGMSAYLSG 155

Query: 109 LGADQAPFYSS 119
           +GA+Q PFYS+
Sbjct: 156 IGAEQPPFYSN 166


>gi|24663392|ref|NP_524047.2| mirror, isoform A [Drosophila melanogaster]
 gi|24663396|ref|NP_729818.1| mirror, isoform B [Drosophila melanogaster]
 gi|7294553|gb|AAF49894.1| mirror, isoform A [Drosophila melanogaster]
 gi|23093572|gb|AAN11850.1| mirror, isoform B [Drosophila melanogaster]
          Length = 641

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)

Query: 35  VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
           V + P  P S+           +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92  VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151

Query: 84  GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
            G  G+PL +    YPEGM+AY   + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181


>gi|2071997|gb|AAB53640.1| mirror [Drosophila melanogaster]
 gi|3243091|gb|AAC23943.1| homeoprotein Sail [Drosophila melanogaster]
          Length = 641

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)

Query: 35  VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
           V + P  P S+           +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92  VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151

Query: 84  GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
            G  G+PL +    YPEGM+AY   + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181


>gi|350404409|ref|XP_003487095.1| PREDICTED: hypothetical protein LOC100743699 [Bombus impatiens]
          Length = 768

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 53/94 (56%), Positives = 60/94 (63%), Gaps = 30/94 (31%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLP--LSMYSAPYP------- 99
           RCCDTGRPIFTDP++GQT+CSCQY++L  YQRL A     LP  LSMYSAPY        
Sbjct: 232 RCCDTGRPIFTDPLTGQTVCSCQYEILGGYQRLGA-----LPAALSMYSAPYAAAAXXXX 286

Query: 100 ---------------EGMAAYFPALGADQAPFYS 118
                          EGMAAYFP LGA+QAPFY+
Sbjct: 287 XXXXXXXXXXXAAASEGMAAYFPPLGAEQAPFYT 320


>gi|194869994|ref|XP_001972564.1| GG15588 [Drosophila erecta]
 gi|190654347|gb|EDV51590.1| GG15588 [Drosophila erecta]
          Length = 641

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 17/96 (17%)

Query: 35  VTSTPAAPRST-----------SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAA 83
           V + P  P S+           +P +RCCDTGR I+TDP+SGQTICSCQYD+LNYQRLAA
Sbjct: 92  VGAPPPGPHSSAPGGAPAPGAGNPPNRCCDTGRTIYTDPVSGQTICSCQYDMLNYQRLAA 151

Query: 84  GGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
            G  G+PL +    YPEGM+AY   + ADQ PFY++
Sbjct: 152 AG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181


>gi|195589890|ref|XP_002084682.1| GD14396 [Drosophila simulans]
 gi|194196691|gb|EDX10267.1| GD14396 [Drosophila simulans]
          Length = 641

 Score = 94.7 bits (234), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 58/120 (48%), Positives = 74/120 (61%), Gaps = 13/120 (10%)

Query: 7   STVAPSSSPTSSSPPPSSTRA-------SNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIF 59
           STV P +  TS   P +   A            ++      AP + +P +RCCDTGR I+
Sbjct: 68  STVPPGARSTSPCGPSAGLPALSLVGAPPPGPHSSGPGGAPAPGAGNPPNRCCDTGRTIY 127

Query: 60  TDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYSS 119
           TDP+SGQTICSCQYD+LNYQRLAA G  G+PL +    YPEGM+AY   + ADQ PFY++
Sbjct: 128 TDPVSGQTICSCQYDMLNYQRLAAAG--GVPLGV----YPEGMSAYLSGIAADQPPFYAN 181


>gi|383860303|ref|XP_003705630.1| PREDICTED: uncharacterized protein LOC100877027 [Megachile
           rotundata]
          Length = 685

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 53/78 (67%), Positives = 59/78 (75%), Gaps = 11/78 (14%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
           RCCDTGRPIFTDP++GQT+CSCQYDLL  YQRL  GG P   LSMYS        A   E
Sbjct: 165 RCCDTGRPIFTDPLTGQTVCSCQYDLLGGYQRL--GGLPTAALSMYSAPYAAAAAAAASE 222

Query: 101 GMAAYFPALGADQAPFYS 118
           GMAAYFP+LGA+QAPFY+
Sbjct: 223 GMAAYFPSLGAEQAPFYT 240


>gi|332024324|gb|EGI64523.1| Homeobox protein caupolican [Acromyrmex echinatior]
          Length = 625

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/79 (65%), Positives = 60/79 (75%), Gaps = 11/79 (13%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYP 99
           +RCCDTGRPIFTDP++GQT+CSCQY+LL  YQRL  GG P   LSMYS        A   
Sbjct: 86  ARCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAAS 143

Query: 100 EGMAAYFPALGADQAPFYS 118
           EGMAAYFP+LGA+QAPFY+
Sbjct: 144 EGMAAYFPSLGAEQAPFYT 162


>gi|307177869|gb|EFN66829.1| Homeobox protein caupolican [Camponotus floridanus]
          Length = 280

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/78 (66%), Positives = 59/78 (75%), Gaps = 11/78 (14%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
           RCCDTGRPIFTDP++GQT+CSCQY+LL  YQRL  GG P   LSMYS        A   E
Sbjct: 87  RCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAASE 144

Query: 101 GMAAYFPALGADQAPFYS 118
           GMAAYFP+LGA+QAPFY+
Sbjct: 145 GMAAYFPSLGAEQAPFYT 162


>gi|156547431|ref|XP_001604937.1| PREDICTED: hypothetical protein LOC100121331 [Nasonia vitripennis]
          Length = 650

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 57/92 (61%), Positives = 63/92 (68%), Gaps = 14/92 (15%)

Query: 39  PAAPRSTSPG---SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMY 94
           P AP+  S G    RCCDTGRPIFTDPI+GQT+CSCQY+LL  YQRL  G  P   LSMY
Sbjct: 99  PTAPQPASSGPTPGRCCDTGRPIFTDPITGQTVCSCQYELLGGYQRL--GALPTAALSMY 156

Query: 95  S--------APYPEGMAAYFPALGADQAPFYS 118
           S        A   EGMAAYFP LGA+QAPFY+
Sbjct: 157 SAPYAAAAAAAASEGMAAYFPGLGAEQAPFYT 188


>gi|322795138|gb|EFZ17978.1| hypothetical protein SINV_13653 [Solenopsis invicta]
          Length = 204

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 52/79 (65%), Positives = 60/79 (75%), Gaps = 11/79 (13%)

Query: 49  SRCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYP 99
           +RCCDTGRPIFTDP++GQT+CSCQY+LL  YQRL  GG P   LSMYS        A   
Sbjct: 88  ARCCDTGRPIFTDPLTGQTVCSCQYELLGGYQRL--GGLPTAALSMYSAPYAAAAAAAAS 145

Query: 100 EGMAAYFPALGADQAPFYS 118
           EGMAAYFP+LGA+QAPFY+
Sbjct: 146 EGMAAYFPSLGAEQAPFYT 164


>gi|380014060|ref|XP_003691061.1| PREDICTED: homeobox protein araucan-like [Apis florea]
          Length = 620

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/78 (64%), Positives = 58/78 (74%), Gaps = 11/78 (14%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
           RCCDTGRPIFTDP++GQT+CSCQY+L+  YQRL  G  P   LSMYS        A   E
Sbjct: 101 RCCDTGRPIFTDPLTGQTVCSCQYELIGGYQRL--GALPTAALSMYSAPYAAAAAAAASE 158

Query: 101 GMAAYFPALGADQAPFYS 118
           GMAAYFP+LGA+QAPFY+
Sbjct: 159 GMAAYFPSLGAEQAPFYT 176


>gi|328777967|ref|XP_396291.3| PREDICTED: homeobox protein araucan [Apis mellifera]
          Length = 622

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/78 (64%), Positives = 58/78 (74%), Gaps = 11/78 (14%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLPLSMYS--------APYPE 100
           RCCDTGRPIFTDP++GQT+CSCQY+L+  YQRL  G  P   LSMYS        A   E
Sbjct: 101 RCCDTGRPIFTDPLTGQTVCSCQYELIGGYQRL--GALPTAALSMYSAPYAAAAAAAASE 158

Query: 101 GMAAYFPALGADQAPFYS 118
           GMAAYFP+LGA+QAPFY+
Sbjct: 159 GMAAYFPSLGAEQAPFYT 176


>gi|340716066|ref|XP_003396524.1| PREDICTED: homeobox protein araucan-like [Bombus terrestris]
          Length = 612

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 51/80 (63%), Positives = 58/80 (72%), Gaps = 16/80 (20%)

Query: 50  RCCDTGRPIFTDPISGQTICSCQYDLL-NYQRLAAGGGPGLP--LSMYS--------APY 98
           RCCDTGRPIFTDP++GQT+CSCQY++L  YQRL A     LP  LSMYS        A  
Sbjct: 90  RCCDTGRPIFTDPLTGQTVCSCQYEILGGYQRLGA-----LPAALSMYSAPYAAAAAAAA 144

Query: 99  PEGMAAYFPALGADQAPFYS 118
            EGMAAYFP LGA+QAPFY+
Sbjct: 145 SEGMAAYFPPLGAEQAPFYT 164


>gi|240960565|ref|XP_002400563.1| hypothetical protein IscW_ISCW015739 [Ixodes scapularis]
 gi|215490705|gb|EEC00348.1| hypothetical protein IscW_ISCW015739 [Ixodes scapularis]
          Length = 110

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/74 (56%), Positives = 54/74 (72%), Gaps = 6/74 (8%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD--LLNYQRLAAGGGPGLPLSMY-SAPYPEGMAA 104
           G+ CC++GRP+ TDPI+GQT+CSCQYD  LLNYQRLAA    GLP++MY +A Y  G   
Sbjct: 17  GTPCCESGRPVLTDPITGQTVCSCQYDAQLLNYQRLAAS---GLPINMYGTAAYAAGEQG 73

Query: 105 YFPALGADQAPFYS 118
           + P   A+Q+ FYS
Sbjct: 74  FLPLGAAEQSAFYS 87


>gi|357607497|gb|EHJ65536.1| putative iroquois-like protein [Danaus plexippus]
          Length = 456

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 63/106 (59%), Gaps = 10/106 (9%)

Query: 17  SSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-- 74
           SSS PP S   S  +  A+ S  A   +++P + CCDT RPI TDP+SGQT+CSCQYD  
Sbjct: 41  SSSAPPLSPGGSAVAGGAL-SPGAGSHASTPAASCCDTPRPIITDPVSGQTVCSCQYDAR 99

Query: 75  --LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
             L +Y RL++       + +Y APYP      +P++G D + FYS
Sbjct: 100 LALTSYPRLSSAA-----VGVYGAPYPSTDQNPYPSIGVDSSAFYS 140


>gi|74054137|gb|AAZ95458.1| caupolican-like protein [Calliphora vicina]
          Length = 626

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L NY R+  GG     + +Y   YP      
Sbjct: 155 CCENGRPIMTDPVSGQTVCSCQYDSARLALTNYSRMPTGG-----VGVYGTAYPSTEQNP 209

Query: 106 FPALGADQAPFYS 118
           +P++G D + FY+
Sbjct: 210 YPSIGVDSSAFYT 222


>gi|195019751|ref|XP_001985047.1| GH14716 [Drosophila grimshawi]
 gi|193898529|gb|EDV97395.1| GH14716 [Drosophila grimshawi]
          Length = 733

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L  Y R+A    P   + +Y  PYP      
Sbjct: 116 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PSAGVGVYGGPYPSNEQNP 172

Query: 106 FPALGADQAPFYS 118
           +P++G D + FYS
Sbjct: 173 YPSIGVDNSAFYS 185


>gi|242023060|ref|XP_002431954.1| hypothetical protein Phum_PHUM562220 [Pediculus humanus corporis]
 gi|212517305|gb|EEB19216.1| hypothetical protein Phum_PHUM562220 [Pediculus humanus corporis]
          Length = 155

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 56/94 (59%), Gaps = 10/94 (10%)

Query: 29  NTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAG 84
           N ST A  + P  P ST+ G+ CC+ G+P  TDP+SGQT+CSCQYD     L+Y RL   
Sbjct: 42  NGSTAAAAAAPG-PGSTASGTSCCENGKPTITDPVSGQTVCSCQYDSARLTLSYPRLPTS 100

Query: 85  GGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
           G     + +Y APYP      +P++G D + FYS
Sbjct: 101 G-----VGVYGAPYPSTDQNPYPSIGVDSSAFYS 129


>gi|198464914|ref|XP_001353412.2| GA10434 [Drosophila pseudoobscura pseudoobscura]
 gi|198149934|gb|EAL30919.2| GA10434 [Drosophila pseudoobscura pseudoobscura]
          Length = 682

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 8/78 (10%)

Query: 46  SPGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPE 100
           S G  CC+ GRPI TDP+SGQT+CSCQYD     +  Y R+A    P   + +Y  PYP 
Sbjct: 106 SEGMSCCENGRPIITDPVSGQTVCSCQYDPARLAIGGYSRMAL---PSAGVGVYGGPYPS 162

Query: 101 GMAAYFPALGADQAPFYS 118
                +P++G D + FY+
Sbjct: 163 NEQNPYPSIGVDNSAFYA 180


>gi|195126385|ref|XP_002007651.1| GI12262 [Drosophila mojavensis]
 gi|193919260|gb|EDW18127.1| GI12262 [Drosophila mojavensis]
          Length = 718

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L  Y R+A    P   + +Y  PYP      
Sbjct: 135 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PSAGVGVYGGPYPSNEQNP 191

Query: 106 FPALGADQAPFYS 118
           +P++G D + FYS
Sbjct: 192 YPSIGVDNSAFYS 204


>gi|195378781|ref|XP_002048160.1| GJ11493 [Drosophila virilis]
 gi|194155318|gb|EDW70502.1| GJ11493 [Drosophila virilis]
          Length = 698

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 8/73 (10%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L  Y R+A    P   + +Y  PYP      
Sbjct: 125 CCENGRPIITDPVSGQTVCSCQYDSARLTLGGYSRMAL---PTAGVGVYGGPYPSNEQNP 181

Query: 106 FPALGADQAPFYS 118
           +P++G D + FYS
Sbjct: 182 YPSIGVDNSAFYS 194


>gi|195454705|ref|XP_002074363.1| GK10560 [Drosophila willistoni]
 gi|194170448|gb|EDW85349.1| GK10560 [Drosophila willistoni]
          Length = 736

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 36/74 (48%), Positives = 46/74 (62%), Gaps = 6/74 (8%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP---LSMYSAPYPEGMAA 104
           GS CC+ GRPI TDP+SGQT+CSCQYD     RLA  G   +P   + +Y APYP     
Sbjct: 111 GSSCCENGRPIMTDPVSGQTVCSCQYDSA---RLALSGYSRMPAASVGVYGAPYPSSDQN 167

Query: 105 YFPALGADQAPFYS 118
            + ++G D + FYS
Sbjct: 168 PYQSIGVDSSAFYS 181


>gi|194747625|ref|XP_001956252.1| GF24689 [Drosophila ananassae]
 gi|190623534|gb|EDV39058.1| GF24689 [Drosophila ananassae]
          Length = 663

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 8/76 (10%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           G  CC+ GRPI TDP+SGQT+CSCQYD     +  Y R+A    P   + +Y  PYP   
Sbjct: 110 GGSCCENGRPIITDPVSGQTVCSCQYDPTRLAIGGYSRMAL---PSAGVGVYGGPYPSND 166

Query: 103 AAYFPALGADQAPFYS 118
              +P++G D + FY+
Sbjct: 167 QNPYPSIGVDNSAFYA 182


>gi|312376038|gb|EFR23247.1| hypothetical protein AND_13239 [Anopheles darlingi]
          Length = 175

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 10/77 (12%)

Query: 47  PGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
           P + CC+ GRPI TDP+SGQT+CSCQYD     L  Y R+  G      + +Y  PYP  
Sbjct: 83  PATSCCENGRPIMTDPVSGQTVCSCQYDSARLALSGYSRIPTGS-----VGVYGTPYPST 137

Query: 102 MAAYFPALGADQAPFYS 118
               +P++G D + FYS
Sbjct: 138 EQNPYPSIGVDSSAFYS 154


>gi|195126387|ref|XP_002007652.1| GI12261 [Drosophila mojavensis]
 gi|193919261|gb|EDW18128.1| GI12261 [Drosophila mojavensis]
          Length = 745

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 57/102 (55%), Gaps = 16/102 (15%)

Query: 22  PSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-----LL 76
           P +    +  ++AV +TP      + GS CC+ GRPI TDP+SGQT+CSCQYD     L 
Sbjct: 97  PDAITGGSLDSSAVGTTP------TTGSSCCENGRPIMTDPVSGQTVCSCQYDSARLALS 150

Query: 77  NYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQAPFYS 118
           +Y RL     P   + +Y +PYP      + ++G D + FYS
Sbjct: 151 SYSRL-----PAASVGVYGSPYPSTDQNPYQSIGVDSSAFYS 187


>gi|195378783|ref|XP_002048161.1| GJ11492 [Drosophila virilis]
 gi|194155319|gb|EDW70503.1| GJ11492 [Drosophila virilis]
          Length = 756

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 46/76 (60%), Gaps = 10/76 (13%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           GS CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y +PYP   
Sbjct: 130 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGSPYPSTD 184

Query: 103 AAYFPALGADQAPFYS 118
              + ++G D + FYS
Sbjct: 185 QNPYQSIGVDSSAFYS 200


>gi|158293795|ref|XP_315117.4| AGAP005010-PA [Anopheles gambiae str. PEST]
 gi|157016617|gb|EAA10465.4| AGAP005010-PA [Anopheles gambiae str. PEST]
          Length = 782

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 6/75 (8%)

Query: 47  PGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP---LSMYSAPYPEGMA 103
           P + CC+ GRPI TDP+SGQT+CSCQYD     RLA  G   +P   + +Y  PYP    
Sbjct: 86  PTTSCCENGRPIMTDPVSGQTVCSCQYD---SARLALSGYSRIPTGSVGVYGTPYPSTEQ 142

Query: 104 AYFPALGADQAPFYS 118
             +P++G D + FYS
Sbjct: 143 NPYPSIGVDSSAFYS 157


>gi|195160679|ref|XP_002021202.1| GL25202 [Drosophila persimilis]
 gi|194118315|gb|EDW40358.1| GL25202 [Drosophila persimilis]
          Length = 473

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 10/76 (13%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           GS CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP   
Sbjct: 121 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 175

Query: 103 AAYFPALGADQAPFYS 118
              + ++G D + FYS
Sbjct: 176 QNPYQSIGVDSSAFYS 191


>gi|195019756|ref|XP_001985048.1| GH14715 [Drosophila grimshawi]
 gi|193898530|gb|EDV97396.1| GH14715 [Drosophila grimshawi]
          Length = 791

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 46/76 (60%), Gaps = 10/76 (13%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           GS CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y +PYP   
Sbjct: 127 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGSPYPSTD 181

Query: 103 AAYFPALGADQAPFYS 118
              + ++G D + FYS
Sbjct: 182 QNPYQSIGVDSSAFYS 197


>gi|198464912|ref|XP_001353411.2| GA10403 [Drosophila pseudoobscura pseudoobscura]
 gi|198149933|gb|EAL30918.2| GA10403 [Drosophila pseudoobscura pseudoobscura]
          Length = 780

 Score = 69.3 bits (168), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 10/76 (13%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           GS CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP   
Sbjct: 123 GSSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 177

Query: 103 AAYFPALGADQAPFYS 118
              + ++G D + FYS
Sbjct: 178 QNPYQSIGVDSSAFYS 193


>gi|194747627|ref|XP_001956253.1| GF24688 [Drosophila ananassae]
 gi|190623535|gb|EDV39059.1| GF24688 [Drosophila ananassae]
          Length = 731

 Score = 67.8 bits (164), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 44/76 (57%), Gaps = 10/76 (13%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGM 102
           G  CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP   
Sbjct: 130 GGSCCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTD 184

Query: 103 AAYFPALGADQAPFYS 118
              + ++G D + FYS
Sbjct: 185 QNPYQSIGVDSSAFYS 200


>gi|195327183|ref|XP_002030301.1| GM25355 [Drosophila sechellia]
 gi|194119244|gb|EDW41287.1| GM25355 [Drosophila sechellia]
          Length = 384

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199


>gi|54650768|gb|AAV36963.1| LP02208p [Drosophila melanogaster]
          Length = 650

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199


>gi|195493895|ref|XP_002094609.1| GE21912 [Drosophila yakuba]
 gi|194180710|gb|EDW94321.1| GE21912 [Drosophila yakuba]
          Length = 719

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 137 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 191

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 192 YQSIGVDSSAFYS 204


>gi|24663380|ref|NP_524045.2| araucan, isoform A [Drosophila melanogaster]
 gi|19863700|sp|Q24248.2|ARA_DROME RecName: Full=Homeobox protein araucan
 gi|7294555|gb|AAF49896.1| araucan, isoform A [Drosophila melanogaster]
          Length = 717

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199


>gi|1212885|emb|CAA64486.1| araucan homeoprotein [Drosophila melanogaster]
          Length = 716

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 131 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 185

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 186 YQSIGVDSSAFYS 198


>gi|281366150|ref|NP_001163432.1| araucan, isoform B [Drosophila melanogaster]
 gi|272455181|gb|ACZ94703.1| araucan, isoform B [Drosophila melanogaster]
          Length = 709

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 132 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 186

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 187 YQSIGVDSSAFYS 199


>gi|194869979|ref|XP_001972561.1| GG15584 [Drosophila erecta]
 gi|190654344|gb|EDV51587.1| GG15584 [Drosophila erecta]
          Length = 727

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAY 105
           CC+ GRPI TDP+SGQT+CSCQYD     L +Y RL     P   + +Y  PYP      
Sbjct: 142 CCENGRPIMTDPVSGQTVCSCQYDSARLALSSYSRL-----PAASVGVYGTPYPSTDQNP 196

Query: 106 FPALGADQAPFYS 118
           + ++G D + FYS
Sbjct: 197 YQSIGVDSSAFYS 209


>gi|195454701|ref|XP_002074361.1| GK10562 [Drosophila willistoni]
 gi|194170446|gb|EDW85347.1| GK10562 [Drosophila willistoni]
          Length = 661

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 12/83 (14%)

Query: 41  APRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYS 95
            P STS    CC+ GR I TDP++G T+CSCQY+     +  Y R+A    P   + +Y 
Sbjct: 109 VPESTS----CCENGRRIITDPVTGHTVCSCQYESTHLTIGGYSRMAL---PSAGIGVYG 161

Query: 96  APYPEGMAAYFPALGADQAPFYS 118
            PYP      +P++G D + FY+
Sbjct: 162 GPYPSNEQNPYPSIGVDNSAFYA 184


>gi|259013289|ref|NP_001158359.1| iroquois [Saccoglossus kowalevskii]
 gi|32307785|gb|AAP79289.1| iroquois [Saccoglossus kowalevskii]
          Length = 500

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 45/75 (60%), Gaps = 11/75 (14%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEGMA 103
           G+ CC++GRPI TDP +GQT+CSCQYD     + + R+A     GLPL MY + Y     
Sbjct: 24  GTSCCESGRPIMTDPHTGQTVCSCQYDASRAAIMHSRVA-----GLPL-MYGSSY-TAEQ 76

Query: 104 AYFPALGADQAPFYS 118
            Y P  G D + FYS
Sbjct: 77  GYVPTFGTDPSAFYS 91


>gi|189235456|ref|XP_971623.2| PREDICTED: iroquois-like protein [Tribolium castaneum]
          Length = 632

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 53/83 (63%), Gaps = 5/83 (6%)

Query: 36  TSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYS 95
           T +P    ST P ++CC++GRP+ TDP++GQ++CSCQYD     RLAA   P   + +Y 
Sbjct: 52  TLSPGGTSSTGP-TQCCESGRPLVTDPVTGQSVCSCQYDPA---RLAALQYP-RSVGVYG 106

Query: 96  APYPEGMAAYFPALGADQAPFYS 118
           APYP      +P++G + + FYS
Sbjct: 107 APYPSTDQNPYPSIGVESSAFYS 129


>gi|270004888|gb|EFA01336.1| iroquois-like protein [Tribolium castaneum]
          Length = 547

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 53/83 (63%), Gaps = 5/83 (6%)

Query: 36  TSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYS 95
           T +P    ST P ++CC++GRP+ TDP++GQ++CSCQYD     RLAA   P   + +Y 
Sbjct: 130 TLSPGGTSSTGP-TQCCESGRPLVTDPVTGQSVCSCQYDPA---RLAALQYP-RSVGVYG 184

Query: 96  APYPEGMAAYFPALGADQAPFYS 118
           APYP      +P++G + + FYS
Sbjct: 185 APYPSTDQNPYPSIGVESSAFYS 207


>gi|332024321|gb|EGI64520.1| Homeobox protein araucan [Acromyrmex echinatior]
          Length = 643

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 51/91 (56%), Gaps = 9/91 (9%)

Query: 33  TAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLS 92
           T   +TP    +T PG  CC+ GRP+ TDP++GQT+CSCQYD  N  R A    P LP +
Sbjct: 44  TGGATTPVGGAATGPG--CCENGRPMMTDPVTGQTVCSCQYD--NAARFALSTYPRLPTA 99

Query: 93  MYSAPYPEGMAAY-----FPALGADQAPFYS 118
              + YP    +      +P++G D + FYS
Sbjct: 100 TSYSSYPTPTPSTTDQGPYPSIGMDSSAFYS 130


>gi|345484618|ref|XP_001604958.2| PREDICTED: homeobox protein caupolican-like [Nasonia vitripennis]
          Length = 725

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 8/74 (10%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG----GPG-LPLSMYSAPYPEGMAAY 105
           CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G     GPG    S Y  P P G    
Sbjct: 121 CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLGPGATSYSSYPTPTPSGSDQG 178

Query: 106 -FPALGADQAPFYS 118
            +P++G D + FYS
Sbjct: 179 PYPSIGMDSSAFYS 192


>gi|405965487|gb|EKC30856.1| Homeobox protein caupolican [Crassostrea gigas]
          Length = 485

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/69 (50%), Positives = 44/69 (63%), Gaps = 11/69 (15%)

Query: 31 STTAVTSTPAAPRSTSPGSRCCDTGRPIFTDPISGQTICSCQYD--LLNYQRLAAGGGPG 88
          ++T +T TP+ P ST     CC+ GRPI  DP +GQTICSCQY+  LL Y RLA     G
Sbjct: 15 TSTQMTKTPS-PTSTRS---CCENGRPIMADPHTGQTICSCQYNPALLGYSRLA-----G 65

Query: 89 LPLSMYSAP 97
          LP  +Y +P
Sbjct: 66 LPEGVYGSP 74


>gi|195160683|ref|XP_002021204.1| GL25204 [Drosophila persimilis]
 gi|194118317|gb|EDW40360.1| GL25204 [Drosophila persimilis]
          Length = 688

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 29/40 (72%), Gaps = 3/40 (7%)

Query: 46  SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           S G  CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 106 SEGMSCCENGRPIITDPVSGQTVCSCQYDPA---RLAIGG 142


>gi|405969201|gb|EKC34186.1| Homeobox protein araucan [Crassostrea gigas]
          Length = 493

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/71 (50%), Positives = 43/71 (60%), Gaps = 10/71 (14%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYD---LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFP 107
           CC+ GRPI TDP +GQT+CSCQY    L  Y R+     P L  S+YSA  P G  +Y P
Sbjct: 38  CCENGRPIMTDPHTGQTVCSCQYGSGLLSAYSRV-----PSLSDSVYSAG-PYGSQSYMP 91

Query: 108 ALGADQAPFYS 118
            LG D + FYS
Sbjct: 92  -LGTDPSAFYS 101


>gi|350404412|ref|XP_003487096.1| PREDICTED: homeobox protein caupolican-like [Bombus impatiens]
          Length = 624

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 90  TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 145

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 146 PSTTDQGPYPSIGMDSSAFY 165


>gi|340716546|ref|XP_003396758.1| PREDICTED: homeobox protein caupolican-like isoform 1 [Bombus
           terrestris]
          Length = 674

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 91  TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166


>gi|340716548|ref|XP_003396759.1| PREDICTED: homeobox protein caupolican-like isoform 2 [Bombus
           terrestris]
          Length = 653

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 70  TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 125

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 126 PSTTDQGPYPSIGMDSSAFY 145


>gi|383860361|ref|XP_003705659.1| PREDICTED: homeobox protein caupolican-like [Megachile rotundata]
          Length = 679

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 91  ATGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166


>gi|195327189|ref|XP_002030304.1| GM25358 [Drosophila sechellia]
 gi|194119247|gb|EDW41290.1| GM25358 [Drosophila sechellia]
          Length = 680

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137


>gi|54650584|gb|AAV36871.1| RE59515p [Drosophila melanogaster]
          Length = 693

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137


>gi|24663384|ref|NP_524046.2| caupolican [Drosophila melanogaster]
 gi|19860750|sp|P54269.2|CAUP_DROME RecName: Full=Homeobox protein caupolican
 gi|7294554|gb|AAF49895.1| caupolican [Drosophila melanogaster]
 gi|60677775|gb|AAX33394.1| RE64213p [Drosophila melanogaster]
 gi|220945950|gb|ACL85518.1| caup-PA [synthetic construct]
          Length = 693

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 106 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137


>gi|195493897|ref|XP_002094610.1| GE21913 [Drosophila yakuba]
 gi|194180711|gb|EDW94322.1| GE21913 [Drosophila yakuba]
          Length = 689

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 109 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 140


>gi|194869983|ref|XP_001972562.1| GG15585 [Drosophila erecta]
 gi|190654345|gb|EDV51588.1| GG15585 [Drosophila erecta]
          Length = 813

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 227 CCENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 258


>gi|307197225|gb|EFN78544.1| Homeobox protein araucan [Harpegnathos saltator]
          Length = 670

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 13/84 (15%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP--------LSMYS 95
           +T PG  CC+ GRP+ TDP++GQT+CSCQYD  +  RLA G  P +          S Y 
Sbjct: 91  TTGPG--CCENGRPMMTDPVTGQTVCSCQYD--SAARLALGAYPSVARLGPAAASYSSYP 146

Query: 96  APYPEGMAAY-FPALGADQAPFYS 118
            P P       +P++G D + FYS
Sbjct: 147 TPTPSTTDQGPYPSIGMDSSAFYS 170


>gi|380014099|ref|XP_003691080.1| PREDICTED: homeobox protein araucan-like [Apis florea]
          Length = 382

 Score = 56.2 bits (134), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQ++CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 91  ATGPG--CCENGRPMMTDPVTGQSVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166


>gi|110760661|ref|XP_396292.3| PREDICTED: homeobox protein araucan-like [Apis mellifera]
          Length = 677

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 10/80 (12%)

Query: 44  STSPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLP-----LSMYSAPY 98
           +T PG  CC+ GRP+ TDP++GQ++CSCQYD  +  RLA G  P L       S Y  P 
Sbjct: 91  ATGPG--CCENGRPMMTDPVTGQSVCSCQYD--SAARLALGAYPRLAPTATSYSSYPTPT 146

Query: 99  PEGMAAY-FPALGADQAPFY 117
           P       +P++G D + FY
Sbjct: 147 PSTTDQGPYPSIGMDSSAFY 166


>gi|195589880|ref|XP_002084677.1| GD14390 [Drosophila simulans]
 gi|194196686|gb|EDX10262.1| GD14390 [Drosophila simulans]
          Length = 687

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           CC+ GRPI T+P+SGQT+CSCQYD     RLA GG
Sbjct: 108 CCENGRPIITNPVSGQTVCSCQYDP---ARLAIGG 139


>gi|1212887|emb|CAA64485.1| caupolican homeoprotein [Drosophila melanogaster]
          Length = 693

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/40 (67%), Positives = 30/40 (75%), Gaps = 4/40 (10%)

Query: 46  SPGSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGG 85
           S GSRC + GRPI TDP+SGQT+CSCQYD     RLA GG
Sbjct: 102 SRGSRC-ENGRPIITDPVSGQTVCSCQYDP---ARLAIGG 137


>gi|321467887|gb|EFX78875.1| hypothetical protein DAPPUDRAFT_104948 [Daphnia pulex]
          Length = 242

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 46/94 (48%), Gaps = 28/94 (29%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRL------------------------AAGGG 86
           CCD+GRP+ TDP++GQT+CSCQY+ L  Q L                        A  G 
Sbjct: 81  CCDSGRPLVTDPVTGQTVCSCQYERL--QALQNGLAAGYGPAAAAAAARLAAVSSAVTGV 138

Query: 87  PGLPLSMYSAPYPEGMAAYFPALGADQ-APFYSS 119
            GL   +YSA YP      +P+LG D    FYSS
Sbjct: 139 TGLH-GVYSAAYPSSDQNPYPSLGMDSTTAFYSS 171


>gi|193083635|gb|ACF10238.1| iroquois B [Branchiostoma lanceolatum]
          Length = 494

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  A G
Sbjct: 28  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIAAFG 85

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 86  ADPSAFYS 93


>gi|157116231|ref|XP_001658393.1| iroquois-class homeodomain protein irx [Aedes aegypti]
 gi|108876562|gb|EAT40787.1| AAEL007505-PA [Aedes aegypti]
          Length = 363

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 10/65 (15%)

Query: 59  FTDPISGQTICSCQYD-----LLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALGADQ 113
            TDP+SGQT+CSCQYD     L +Y R+  G      + +Y  PYP      +P++G D 
Sbjct: 1   MTDPVSGQTVCSCQYDSARLALSSYSRIPTGS-----VGVYGTPYPSTEQNPYPSIGVDS 55

Query: 114 APFYS 118
           + FYS
Sbjct: 56  SAFYS 60


>gi|193083641|gb|ACF10241.1| iroquois C [Branchiostoma floridae]
          Length = 494

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYESRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|260834613|ref|XP_002612304.1| iroquois 3 [Branchiostoma floridae]
 gi|229297681|gb|EEN68313.1| iroquois 3 [Branchiostoma floridae]
          Length = 494

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|193083637|gb|ACF10239.1| iroquois B [Branchiostoma floridae]
          Length = 496

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  A G
Sbjct: 28  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIAAFG 85

Query: 111 ADQAPFYS 118
            D + FYS
Sbjct: 86  GDPSAFYS 93


>gi|193083639|gb|ACF10240.1| iroquois C [Branchiostoma lanceolatum]
          Length = 494

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|193083631|gb|ACF10236.1| iroquois A isoform 2 [Branchiostoma lanceolatum]
          Length = 486

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|193083629|gb|ACF10235.1| iroquois A isoform 1 [Branchiostoma lanceolatum]
          Length = 461

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|209529858|gb|ACI49802.1| iroquois A isoform 1 [Branchiostoma floridae]
          Length = 459

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYIASFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|193083633|gb|ACF10237.1| iroquois A isoform 1 [Branchiostoma floridae]
          Length = 461

 Score = 50.1 bits (118), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 51  CCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPEGMAAYFPALG 110
           CC+ GRPI +DP +GQ +CSCQY+      +    G     S+Y +PY      Y  + G
Sbjct: 25  CCEAGRPIMSDPHTGQAVCSCQYEGRVAGLVPPRVGEMSLSSVYGSPY--AGQGYITSFG 82

Query: 111 ADQAPFYS 118
           AD + FYS
Sbjct: 83  ADPSAFYS 90


>gi|193788691|ref|NP_001123285.1| iroquois homeobox A [Strongylocentrotus purpuratus]
 gi|167859058|gb|ACA04464.1| IrxA [Strongylocentrotus purpuratus]
          Length = 605

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 10/78 (12%)

Query: 46  SPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
           +P + CC++GRP+ +DP  GQ +C    D    LL+  R++     GLP ++YS+PY   
Sbjct: 23  APATTCCESGRPVASDPHPGQAVCCPPLDSRAALLHSARVS-----GLPTALYSSPYAAA 77

Query: 102 MAAY-FPALGADQAPFYS 118
            A   +    AD + FYS
Sbjct: 78  AAEQGYVHFAADPSAFYS 95


>gi|322366536|gb|ADW95342.1| IrxA [Paracentrotus lividus]
          Length = 602

 Score = 43.9 bits (102), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 10/78 (12%)

Query: 46  SPGSRCCDTGRPIFTDPISGQTICSCQYD----LLNYQRLAAGGGPGLPLSMYSAPYPEG 101
           +P + CC++GRP+ +DP  GQ +C    D    LL+  R++     GLP ++YS+PY   
Sbjct: 25  APATTCCESGRPVASDPHPGQAVCCPPLDSRAALLHSARVS-----GLPTALYSSPYAAA 79

Query: 102 MAAY-FPALGADQAPFYS 118
            A   +    AD + FYS
Sbjct: 80  AAEQGYVHFAADPSAFYS 97


>gi|71423067|ref|XP_812329.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877099|gb|EAN90478.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 780

 Score = 39.3 bits (90), Expect = 0.31,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 14/65 (21%)

Query: 7   STVAPSSSPTSSSPPPSST--------RASNTSTTAVTSTPA------APRSTSPGSRCC 52
           +TV PS+SP++S+P P +         R+S+ S  AVT+TP+        R TSPG    
Sbjct: 136 TTVPPSTSPSTSAPLPENVLPLRQNRGRSSHASWVAVTTTPSLSPSEPKSRETSPGRATA 195

Query: 53  DTGRP 57
            T  P
Sbjct: 196 ATRSP 200


>gi|1098898|gb|AAC46961.1| CTRP [Plasmodium falciparum]
          Length = 2098

 Score = 38.9 bits (89), Expect = 0.43,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 7/74 (9%)

Query: 9   VAPSSSPTSSSP--PPSSTRASNTSTTAVT---STPAAPRSTSPGSRCCDT-GRPIFTDP 62
           V P  SP+ S+P  P  ST  + + +T  +   STP  P  ++PGS    T G P  + P
Sbjct: 761 VLPPGSPSESTPGSPSESTPGNPSESTPGSPSESTPGNPSESTPGSPSESTPGSPSESTP 820

Query: 63  ISG-QTICSCQYDL 75
            SG + +C   YDL
Sbjct: 821 CSGTECLCHNTYDL 834


>gi|218442531|ref|YP_002380852.1| hypothetical protein PCC7424_5461 [Cyanothece sp. PCC 7424]
 gi|218175302|gb|ACK74033.1| hypothetical protein PCC7424_5461 [Cyanothece sp. PCC 7424]
          Length = 394

 Score = 38.1 bits (87), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 27/45 (60%)

Query: 10  APSSSPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSRCCDT 54
           A ++   S+   PSSTRA+N + T + STP   RSTSP +R  D 
Sbjct: 339 AANAGRRSAQSVPSSTRAANRTVTVIPSTPRRMRSTSPTTRTIDV 383


>gi|410917886|ref|XP_003972417.1| PREDICTED: general transcription factor 3C polypeptide 1-like
           [Takifugu rubripes]
          Length = 2211

 Score = 36.2 bits (82), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 23/39 (58%)

Query: 12  SSSPTSSSPPPSSTRASNTSTTAVTSTPAAPRSTSPGSR 50
           S+ P  SS PPS+  AS+TS +A   TPA+ R    G R
Sbjct: 458 SADPQVSSTPPSTPAASSTSESANAKTPASRRKQKAGKR 496


>gi|444301783|gb|AGD98928.1| Irx [Nematostella vectensis]
          Length = 451

 Score = 35.8 bits (81), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 9/72 (12%)

Query: 48  GSRCCDTGRPIFTDPISGQTICSCQYDLLNYQRLAAGGGPGLPLSMYSAPYPE-GMAAYF 106
           G+ CCD+ R +  D + G ++C CQ+D +   R+A+     +P+S+   P P  G A + 
Sbjct: 28  GNSCCDSVRQVH-DGMPGHSVCPCQFDRVLQHRVAS-----VPVSVL--PRPAFGEAGFS 79

Query: 107 PALGADQAPFYS 118
              G   + FYS
Sbjct: 80  LGSGTGLSAFYS 91


>gi|386836417|ref|YP_006241475.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374096718|gb|AEY85602.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451789776|gb|AGF59825.1| modular polyketide synthase [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 1024

 Score = 35.4 bits (80), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 9/85 (10%)

Query: 3   KIPESTVAPSSSPTSSSPPPSSTRASNTSTTAVTSTPAA------PRSTSPGSRCCDTGR 56
           +IP  T  P   PTS+ P  ++TR +  ++   T T AA      P S  PG R  D  R
Sbjct: 407 RIP-WTAYPVDVPTSARPWQATTRRALVNSFGFTGTIAAAVVEQAPPSAEPGPRPGDGDR 465

Query: 57  PIFTDPISGQTICSCQYDLLNYQRL 81
            +FT  +S ++  S Q  L  Y++L
Sbjct: 466 HVFT--LSAKSAASLQLQLDRYRKL 488


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.310    0.125    0.378 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,205,079,079
Number of Sequences: 23463169
Number of extensions: 95884155
Number of successful extensions: 1309229
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2044
Number of HSP's successfully gapped in prelim test: 2810
Number of HSP's that attempted gapping in prelim test: 1219754
Number of HSP's gapped (non-prelim): 72395
length of query: 122
length of database: 8,064,228,071
effective HSP length: 89
effective length of query: 33
effective length of database: 5,976,006,030
effective search space: 197208198990
effective search space used: 197208198990
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 69 (31.2 bits)