BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12800
         (134 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350417175|ref|XP_003491293.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Bombus impatiens]
          Length = 1267

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 60/81 (74%), Gaps = 6/81 (7%)

Query: 52  SKHTLSA-DSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGF 110
           S HT +  D++  +G  P     +FVGFNAGNGTRSYEY P+SQ STIRDL GRGWANGF
Sbjct: 366 SSHTEAGGDTMYGEGGVP-----AFVGFNAGNGTRSYEYKPFSQMSTIRDLTGRGWANGF 420

Query: 111 PGRHIFRIDENIMLELTLGEI 131
           PGRH+FRIDENIM  +   +I
Sbjct: 421 PGRHMFRIDENIMPAVCNKDI 441


>gi|340715043|ref|XP_003396030.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Bombus terrestris]
          Length = 1267

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 60/81 (74%), Gaps = 6/81 (7%)

Query: 52  SKHTLSA-DSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGF 110
           S HT +  D++  +G  P     +FVGFNAGNGTRSYEY P+SQ STIRDL GRGWANGF
Sbjct: 366 SSHTEAGGDTMYGEGGVP-----AFVGFNAGNGTRSYEYKPFSQMSTIRDLTGRGWANGF 420

Query: 111 PGRHIFRIDENIMLELTLGEI 131
           PGRH+FRIDENIM  +   +I
Sbjct: 421 PGRHMFRIDENIMPAVCNKDI 441


>gi|383855804|ref|XP_003703400.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Megachile rotundata]
          Length = 1397

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 43/50 (86%), Positives = 46/50 (92%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIM 123
           +FVGFNAGNGT+ YEY PYSQRSTIRDL GRGWANGFPGRHIFRIDE I+
Sbjct: 383 AFVGFNAGNGTQVYEYKPYSQRSTIRDLTGRGWANGFPGRHIFRIDEKII 432


>gi|332019409|gb|EGI59893.1| Extracellular domains-containing protein [Acromyrmex echinatior]
          Length = 1406

 Score = 99.4 bits (246), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 43/51 (84%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +FVGFNAGNGT+SYEY PYSQ +TIRDL  RGWANGFPGRHIFRIDE IML
Sbjct: 388 AFVGFNAGNGTQSYEYKPYSQATTIRDLTSRGWANGFPGRHIFRIDEKIML 438


>gi|345482776|ref|XP_001600000.2| PREDICTED: extracellular domains-containing protein CG31004-like
           [Nasonia vitripennis]
          Length = 1358

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 42/51 (82%), Positives = 47/51 (92%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGT+SYEY PYSQ ST+RDL GRGWANGFPGRHIFRIDE IML
Sbjct: 386 AYIGFNAGNGTQSYEYMPYSQASTVRDLSGRGWANGFPGRHIFRIDEKIML 436


>gi|357623997|gb|EHJ74925.1| hypothetical protein KGM_05469 [Danaus plexippus]
          Length = 1477

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 41/51 (80%), Positives = 47/51 (92%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGTRSYEY PYSQ S +RDL GRGWANGFPGRHIFRIDENI++
Sbjct: 361 AYIGFNAGNGTRSYEYKPYSQASVLRDLTGRGWANGFPGRHIFRIDENILM 411


>gi|380020283|ref|XP_003694019.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Apis florea]
          Length = 1377

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 46/73 (63%), Positives = 52/73 (71%), Gaps = 5/73 (6%)

Query: 51  TSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGF 110
           TS      D+   +G  P     +FVGFNAGNGTRSYEY PYSQ S+IRDL GRGWAN F
Sbjct: 342 TSHTEAGGDTTRGEGGVP-----AFVGFNAGNGTRSYEYKPYSQMSSIRDLTGRGWANKF 396

Query: 111 PGRHIFRIDENIM 123
           PGRH+FRIDE I+
Sbjct: 397 PGRHMFRIDERIL 409


>gi|307174209|gb|EFN64854.1| Uncharacterized protein K03H1.5 [Camponotus floridanus]
          Length = 1395

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 42/51 (82%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           ++VGFNAGNGT+SYEY PYSQ +TIRDL  RGWANGFPGRHIFRIDE IML
Sbjct: 362 AYVGFNAGNGTQSYEYKPYSQATTIRDLTSRGWANGFPGRHIFRIDEKIML 412


>gi|328788914|ref|XP_392408.3| PREDICTED: extracellular domains-containing protein CG31004-like
           isoform 1 [Apis mellifera]
          Length = 1265

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 45/73 (61%), Positives = 53/73 (72%), Gaps = 5/73 (6%)

Query: 51  TSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGF 110
           TS      D+   +G  P     +FVGFNAGNGT+SYEY P+SQRS+IRDL GRGWAN F
Sbjct: 363 TSHTEAGGDTTRGEGGVP-----AFVGFNAGNGTQSYEYKPFSQRSSIRDLTGRGWANNF 417

Query: 111 PGRHIFRIDENIM 123
           PGRH+FRIDE I+
Sbjct: 418 PGRHMFRIDEIIL 430


>gi|157109935|ref|XP_001650886.1| hypothetical protein AaeL_AAEL005432 [Aedes aegypti]
 gi|108878861|gb|EAT43086.1| AAEL005432-PB [Aedes aegypti]
          Length = 1427

 Score = 95.5 bits (236), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 40/51 (78%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGT++YEY PYSQ S +RDL GRGWANGFPGRHIFRIDE IML
Sbjct: 369 AYIGFNAGNGTQAYEYKPYSQASVLRDLTGRGWANGFPGRHIFRIDERIML 419


>gi|157109933|ref|XP_001650885.1| hypothetical protein AaeL_AAEL005432 [Aedes aegypti]
 gi|108878860|gb|EAT43085.1| AAEL005432-PA [Aedes aegypti]
          Length = 1405

 Score = 95.5 bits (236), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 40/51 (78%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGT++YEY PYSQ S +RDL GRGWANGFPGRHIFRIDE IML
Sbjct: 369 AYIGFNAGNGTQAYEYKPYSQASVLRDLTGRGWANGFPGRHIFRIDERIML 419


>gi|347964032|ref|XP_310533.5| AGAP000550-PA [Anopheles gambiae str. PEST]
 gi|333466921|gb|EAA06289.5| AGAP000550-PA [Anopheles gambiae str. PEST]
          Length = 1429

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 40/51 (78%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGT++YEY PYSQ S +RDL GRGWANGFPGRHIFRIDE IML
Sbjct: 358 AYIGFNAGNGTQAYEYKPYSQASVLRDLTGRGWANGFPGRHIFRIDERIML 408


>gi|242006815|ref|XP_002424240.1| sushi domain containing protein, putative [Pediculus humanus
           corporis]
 gi|212507609|gb|EEB11502.1| sushi domain containing protein, putative [Pediculus humanus
           corporis]
          Length = 1461

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 40/51 (78%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           ++VGFNAGNGT+S+EY PYSQ S +RDL GRGWANGFPGRHIFRIDE I+L
Sbjct: 367 AYVGFNAGNGTQSFEYKPYSQDSVLRDLTGRGWANGFPGRHIFRIDEKILL 417


>gi|170064239|ref|XP_001867442.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881704|gb|EDS45087.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1271

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 39/51 (76%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +++GFNAGNGT++YEY PYSQ S +RDL GRGWANGFPGRHIFR+DE IML
Sbjct: 246 AYIGFNAGNGTQAYEYKPYSQASVLRDLTGRGWANGFPGRHIFRLDERIML 296


>gi|307210697|gb|EFN87120.1| Uncharacterized protein K03H1.5 [Harpegnathos saltator]
          Length = 1347

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 40/51 (78%), Positives = 46/51 (90%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           ++VGFNAGNGTRSYEY PYSQ +T+RDL  RGWAN FPGRHIFRIDE+I+L
Sbjct: 393 AYVGFNAGNGTRSYEYKPYSQMTTVRDLPNRGWANKFPGRHIFRIDEDIIL 443


>gi|195062183|ref|XP_001996151.1| GH14340 [Drosophila grimshawi]
 gi|193891943|gb|EDV90809.1| GH14340 [Drosophila grimshawi]
          Length = 1473

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 42/67 (62%), Positives = 50/67 (74%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDLV RGWANGFPGRHIFR
Sbjct: 396 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLVNRGWANGFPGRHIFR 450

Query: 118 IDENIML 124
           IDE I++
Sbjct: 451 IDEQILI 457


>gi|40882481|gb|AAR96152.1| RE69185p [Drosophila melanogaster]
          Length = 1240

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|390176772|ref|XP_003736199.1| GA15929, isoform C [Drosophila pseudoobscura pseudoobscura]
 gi|388858779|gb|EIM52272.1| GA15929, isoform C [Drosophila pseudoobscura pseudoobscura]
          Length = 1255

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G +P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 382 GDTTKGEGGTP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFR 436

Query: 118 IDENIML 124
           +DE I++
Sbjct: 437 VDEQILI 443


>gi|195575249|ref|XP_002105592.1| GD16568 [Drosophila simulans]
 gi|194201519|gb|EDX15095.1| GD16568 [Drosophila simulans]
          Length = 923

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYRPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|195394574|ref|XP_002055917.1| GJ10650 [Drosophila virilis]
 gi|194142626|gb|EDW59029.1| GJ10650 [Drosophila virilis]
          Length = 1074

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 41/67 (61%), Positives = 48/67 (71%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     +FVGFNAGNGT++YEY PYSQ   IRDL  RGW NGFPGRHIFR
Sbjct: 386 GDTTKGEGGVP-----AFVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWGNGFPGRHIFR 440

Query: 118 IDENIML 124
           IDE I++
Sbjct: 441 IDEQILI 447


>gi|194744497|ref|XP_001954730.1| GF16595 [Drosophila ananassae]
 gi|190627767|gb|EDV43291.1| GF16595 [Drosophila ananassae]
          Length = 1439

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 38/51 (74%), Positives = 44/51 (86%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIML 124
           +FVGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR+DE I++
Sbjct: 385 AFVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFRVDEQILI 435


>gi|195159013|ref|XP_002020377.1| GL13953 [Drosophila persimilis]
 gi|194117146|gb|EDW39189.1| GL13953 [Drosophila persimilis]
          Length = 1458

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G +P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 382 GDTTKGEGGTP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFR 436

Query: 118 IDENIML 124
           +DE I++
Sbjct: 437 VDEQILI 443


>gi|125772520|ref|XP_001357569.1| GA15929, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|54637301|gb|EAL26703.1| GA15929, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 1458

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G +P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 382 GDTTKGEGGTP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFR 436

Query: 118 IDENIML 124
           +DE I++
Sbjct: 437 VDEQILI 443


>gi|281362894|ref|NP_001163781.1| CG31004, isoform C [Drosophila melanogaster]
 gi|189458931|gb|ACD99455.1| IP18607p [Drosophila melanogaster]
 gi|272477253|gb|ACZ95074.1| CG31004, isoform C [Drosophila melanogaster]
          Length = 1243

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|390176770|ref|XP_003736198.1| GA15929, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388858778|gb|EIM52271.1| GA15929, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 1481

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G +P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 382 GDTTKGEGGTP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFR 436

Query: 118 IDENIML 124
           +DE I++
Sbjct: 437 VDEQILI 443


>gi|195505375|ref|XP_002099477.1| GE23331 [Drosophila yakuba]
 gi|194185578|gb|EDW99189.1| GE23331 [Drosophila yakuba]
          Length = 1432

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 368 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYLPYSQNMVIRDLANRGWANGFPGRHIFR 422

Query: 118 IDENIML 124
           +DE I++
Sbjct: 423 VDEQILI 429


>gi|194905046|ref|XP_001981111.1| GG11883 [Drosophila erecta]
 gi|190655749|gb|EDV52981.1| GG11883 [Drosophila erecta]
          Length = 1431

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|195341700|ref|XP_002037444.1| GM12101 [Drosophila sechellia]
 gi|194131560|gb|EDW53603.1| GM12101 [Drosophila sechellia]
          Length = 1431

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYRPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|24651583|ref|NP_733418.1| CG31004, isoform A [Drosophila melanogaster]
 gi|24651585|ref|NP_733419.1| CG31004, isoform B [Drosophila melanogaster]
 gi|122065794|sp|Q0KHY3.1|Y1004_DROME RecName: Full=Extracellular domains-containing protein CG31004;
           Flags: Precursor
 gi|23172752|gb|AAF57124.2| CG31004, isoform A [Drosophila melanogaster]
 gi|23172753|gb|AAF57123.2| CG31004, isoform B [Drosophila melanogaster]
 gi|374275919|gb|AEZ02857.1| FI18817p1 [Drosophila melanogaster]
          Length = 1431

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|195112496|ref|XP_002000808.1| GI10435 [Drosophila mojavensis]
 gi|193917402|gb|EDW16269.1| GI10435 [Drosophila mojavensis]
          Length = 1477

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 41/67 (61%), Positives = 48/67 (71%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     +FVGFNAGNGT++YEY PYSQ   IRDL  RGW NGFPGRHIFR
Sbjct: 406 GDTTKGEGGVP-----AFVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWGNGFPGRHIFR 460

Query: 118 IDENIML 124
           IDE I++
Sbjct: 461 IDEQILI 467


>gi|281362896|ref|NP_001163782.1| CG31004, isoform D [Drosophila melanogaster]
 gi|272477254|gb|ACZ95075.1| CG31004, isoform D [Drosophila melanogaster]
          Length = 1454

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428


>gi|195449266|ref|XP_002071999.1| GK22555 [Drosophila willistoni]
 gi|194168084|gb|EDW82985.1| GK22555 [Drosophila willistoni]
          Length = 1455

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G +P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGW NGFPGRHIFR
Sbjct: 383 GDTTKGEGGTP-----AYVGFNAGNGTQAYEYKPYSQNMVIRDLANRGWGNGFPGRHIFR 437

Query: 118 IDENIML 124
           IDE I++
Sbjct: 438 IDEQILI 444


>gi|91081667|ref|XP_969700.1| PREDICTED: similar to AGAP000550-PA [Tribolium castaneum]
          Length = 1260

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 42/76 (55%), Positives = 52/76 (68%), Gaps = 6/76 (7%)

Query: 50  DTSKHTLSA-DSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWAN 108
           D + HT +  D+++ +G  P     ++VGFNAGNGTRSY+Y PYSQ S IRDL G G+ N
Sbjct: 337 DWTSHTEAGGDTINGEGGIP-----AYVGFNAGNGTRSYQYVPYSQNSVIRDLTGTGFGN 391

Query: 109 GFPGRHIFRIDENIML 124
              GRH FRIDENI L
Sbjct: 392 EKKGRHYFRIDENIYL 407


>gi|321452618|gb|EFX63962.1| hypothetical protein DAPPUDRAFT_305821 [Daphnia pulex]
          Length = 1120

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 39/75 (52%), Positives = 49/75 (65%), Gaps = 5/75 (6%)

Query: 49  GDTSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWAN 108
           G T+      D+   +G  P     +FVGFNAGNGTR+YEY PYSQ   IRDL  +G+AN
Sbjct: 219 GWTTHAEAGGDTNSGQGGVP-----AFVGFNAGNGTRAYEYRPYSQDMRIRDLTYKGYAN 273

Query: 109 GFPGRHIFRIDENIM 123
             PGRH+FRIDE ++
Sbjct: 274 QIPGRHVFRIDERVI 288


>gi|270006238|gb|EFA02686.1| hypothetical protein TcasGA2_TC008407 [Tribolium castaneum]
          Length = 1428

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/76 (55%), Positives = 52/76 (68%), Gaps = 6/76 (7%)

Query: 50  DTSKHTLSA-DSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWAN 108
           D + HT +  D+++ +G  P     ++VGFNAGNGTRSY+Y PYSQ S IRDL G G+ N
Sbjct: 337 DWTSHTEAGGDTINGEGGIP-----AYVGFNAGNGTRSYQYVPYSQNSVIRDLTGTGFGN 391

Query: 109 GFPGRHIFRIDENIML 124
              GRH FRIDENI L
Sbjct: 392 EKKGRHYFRIDENIYL 407


>gi|328777117|ref|XP_001121335.2| PREDICTED: extracellular domains-containing protein CG31004-like
           [Apis mellifera]
          Length = 371

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/50 (70%), Positives = 40/50 (80%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIM 123
           +FVGFNAGNGT SYEY P+SQ   IRDL   GW NGFPGRH+F+IDE I+
Sbjct: 317 AFVGFNAGNGTGSYEYKPFSQTPHIRDLTRAGWVNGFPGRHMFKIDEKII 366


>gi|340714823|ref|XP_003395923.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Bombus terrestris]
          Length = 450

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/50 (70%), Positives = 39/50 (78%)

Query: 74  SFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIM 123
           +FVGFNAGNGT SYEY PYSQ   IRDL   G  NGFPGRH+F+ID +IM
Sbjct: 385 AFVGFNAGNGTGSYEYEPYSQTWMIRDLTRMGCVNGFPGRHMFKIDNDIM 434


>gi|241574824|ref|XP_002403140.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215502161|gb|EEC11655.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 431

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/48 (58%), Positives = 34/48 (70%)

Query: 75  FVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENI 122
           FVGFNAGN TR YE+ PYSQ   +++L   G  NG  GR+IF+IDE I
Sbjct: 260 FVGFNAGNTTRYYEFLPYSQEPRVKNLPQHGEGNGIKGRYIFQIDEEI 307


>gi|391340235|ref|XP_003744449.1| PREDICTED: extracellular domains-containing protein CG31004-like
           [Metaseiulus occidentalis]
          Length = 1147

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 25/48 (52%), Positives = 33/48 (68%), Gaps = 1/48 (2%)

Query: 75  FVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENI 122
           F+GFNAGN TR+YE++PYSQ+  +  L   G      GR+IF+IDE I
Sbjct: 270 FIGFNAGNTTRAYEFFPYSQQPRVSQLT-EGGDEKMKGRYIFQIDEEI 316


>gi|328714909|ref|XP_001943721.2| PREDICTED: hypothetical protein LOC100168045 [Acyrthosiphon pisum]
          Length = 1028

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/74 (45%), Positives = 48/74 (64%), Gaps = 3/74 (4%)

Query: 2   GRESNESLESSGKKSRRPSF-ISGRSFASASRILNQHFPNLQLTKSKSGDTSKHTLSADS 60
           GR S ESL+  G + RRPSF +S ++  +A+RI+NQH   L L  +K+G + K + S  S
Sbjct: 783 GRSSQESLDG-GARKRRPSFGLSSKTLTNATRIINQHLFGLPLISNKTGKSGKSSPSV-S 840

Query: 61  LDSKGQSPEMHRRS 74
            +S  +SPE HRRS
Sbjct: 841 AESINRSPEPHRRS 854


>gi|158295006|ref|XP_315954.4| AGAP005924-PA [Anopheles gambiae str. PEST]
 gi|157015830|gb|EAA11029.4| AGAP005924-PA [Anopheles gambiae str. PEST]
          Length = 488

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 39/64 (60%), Gaps = 5/64 (7%)

Query: 14  KKSRRPSFISGRSFASASRILNQHF---PNLQLTKSKSGDTSKHTLSADSLDSKGQSPEM 70
           KK+RR SF+  +S ASA++++NQH    PN    K    DT   ++S DSL++  Q  E 
Sbjct: 293 KKTRRSSFMPAKSLASATKLINQHLFGIPN-STPKDAGADTKLSSISIDSLETSPQL-ET 350

Query: 71  HRRS 74
           HRRS
Sbjct: 351 HRRS 354


>gi|270003317|gb|EEZ99764.1| hypothetical protein TcasGA2_TC002537 [Tribolium castaneum]
          Length = 256

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 34/50 (68%), Gaps = 4/50 (8%)

Query: 3   RESNESLESSGKKSRRPSFISGRSFASASRILNQHFPNLQ-LTKSKSGDT 51
           R+S ESLES  KK+   SFI  R  ASA+RI+NQH   LQ L+ S+SG+ 
Sbjct: 174 RDSQESLESKRKKT---SFIPTRGLASATRIINQHLFGLQNLSGSRSGNV 220


>gi|260816531|ref|XP_002603024.1| hypothetical protein BRAFLDRAFT_84760 [Branchiostoma floridae]
 gi|229288339|gb|EEN59036.1| hypothetical protein BRAFLDRAFT_84760 [Branchiostoma floridae]
          Length = 921

 Score = 36.6 bits (83), Expect = 3.1,   Method: Composition-based stats.
 Identities = 22/48 (45%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 76  VGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIM 123
           VGFNAGNG + Y   P SQ   I D+      N  PGR +FRID  ++
Sbjct: 167 VGFNAGNGMQ-YFSVPGSQTHAIVDVETTSNVN-IPGRWVFRIDSAVI 212


>gi|373859731|ref|ZP_09602456.1| hypothetical protein B1NLA3EDRAFT_4504 [Bacillus sp. 1NLA3E]
 gi|372450587|gb|EHP24073.1| hypothetical protein B1NLA3EDRAFT_4504 [Bacillus sp. 1NLA3E]
          Length = 235

 Score = 36.6 bits (83), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 43  LTKSKSGDTSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLV 102
           LT S+S D +K     ++  S+GQ PE   R + G N G+  + Y+ Y Y  RS   D +
Sbjct: 147 LTISESPDHTKMIEFRETTRSQGQKPERFLRIYYGEN-GSILKKYKEYNYQNRSYTADQL 205

Query: 103 GRGWANGF-PGRHIF 116
              W N   P   IF
Sbjct: 206 YVDWTNNIRPSIMIF 220


>gi|260815297|ref|XP_002602410.1| hypothetical protein BRAFLDRAFT_199125 [Branchiostoma floridae]
 gi|229287719|gb|EEN58422.1| hypothetical protein BRAFLDRAFT_199125 [Branchiostoma floridae]
          Length = 242

 Score = 35.4 bits (80), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 76  VGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIM 123
           VGFNAGNG + Y   P SQ   I D+      N  PGR +FRID  ++
Sbjct: 184 VGFNAGNGMQ-YFSVPGSQTHAIVDVETTSNVN-IPGRWVFRIDSAVI 229


>gi|307187720|gb|EFN72692.1| hypothetical protein EAG_06877 [Camponotus floridanus]
          Length = 471

 Score = 35.4 bits (80), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 38/58 (65%), Gaps = 4/58 (6%)

Query: 18  RPSFISGRSFASASRILNQH-FPNLQLTKSKSGDT--SKHTLSADSLDSKGQSPEMHR 72
           +PS +   + ASA+RI+N H F +L  +K  +G +  SK +LSADS+DS G +P M R
Sbjct: 218 KPSTLVPTNLASATRIINYHLFGSLGGSKHYTGASAESKFSLSADSIDSDG-TPFMDR 274


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.131    0.376 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,089,746,431
Number of Sequences: 23463169
Number of extensions: 81317815
Number of successful extensions: 137027
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 56
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 136960
Number of HSP's gapped (non-prelim): 72
length of query: 134
length of database: 8,064,228,071
effective HSP length: 99
effective length of query: 35
effective length of database: 10,036,341,636
effective search space: 351271957260
effective search space used: 351271957260
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 71 (32.0 bits)