BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy18235
         (232 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|390356505|ref|XP_003728811.1| PREDICTED: uncharacterized protein LOC100888155 [Strongylocentrotus
           purpuratus]
          Length = 1585

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/222 (38%), Positives = 132/222 (59%), Gaps = 11/222 (4%)

Query: 12  SLASEEEDEV-VVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTI-DTS 68
           +LA  E ++V VV+VG D DLLV+L+   + S +++ L       V  ++  Q  I DT 
Sbjct: 737 ALAVAESEQVPVVVVGTDTDLLVMLVARATTSTDMHMLCRSNPVTVFNIHEIQHVIGDTR 796

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
           +  H++F+ A++GCD+ SA++ QGK KA   V K  D    +  F D  ST  E+  AGE
Sbjct: 797 I--HLMFLHAVTGCDTVSAIYRQGKRKAFNMVHKKRDY-DLLDTFTDSGSTHDEVKRAGE 853

Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQV 184
            F++ LYG +    SL+  R+ +Y      S+L  +  + +LPPT +AA+QH+   +  V
Sbjct: 854 AFILKLYGASR-FESLDDYRHIAYKRAIGRSSLSSSFQLESLPPTSAAANQHSYRTYLTV 912

Query: 185 QQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVAC 226
           Q+W+G ++PPTEWGW     TL P+ T  + APD+LL++VAC
Sbjct: 913 QEWMGNSLPPTEWGWRFQDGTLTPVETDIAVAPDTLLNMVAC 954


>gi|443716458|gb|ELU07980.1| hypothetical protein CAPTEDRAFT_206230 [Capitella teleta]
          Length = 269

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 110/231 (47%), Gaps = 19/231 (8%)

Query: 7   SSGRRSLASEEEDEVVVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTID 66
           + GR+SL +        +   D  L   L    SP NV+             Y  +TTI 
Sbjct: 21  AKGRKSLLA------ATVARGDKQLHTELESIGSPCNVH-------AACYKDYTRETTIV 67

Query: 67  TSLAEHILFIRAISGCDSTS-ALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE 125
            +  + I         DS    L+ +GK  A   + +  DL P +  F  P +   EI  
Sbjct: 68  AAKRKVIAADECTGNVDSGDRTLYGKGKKTAFH-LARKTDLFPILESFEKPETPKAEIEN 126

Query: 126 AGERFLVVLYGGNHETTSLNKLRYKSYV----TSALKVTSYIAALPPTESAASQHALNKF 181
           AGE FL+  YG N   ++L+KLR+  Y      S+L  T  + +LPPT +AA+QH+L  +
Sbjct: 127 AGEEFLLKFYGSNKLVSTLDKLRHYKYKQAIRKSSLTSTIKLESLPPTSAAAAQHSLRAY 186

Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           +QVQ W GK V  T WGW      L P+ T K  AP++LL +VAC CK  C
Sbjct: 187 HQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVAPENLLKMVACGCKTQC 237


>gi|443691138|gb|ELT93079.1| hypothetical protein CAPTEDRAFT_217788 [Capitella teleta]
          Length = 1299

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 77/226 (34%), Positives = 108/226 (47%), Gaps = 44/226 (19%)

Query: 12   SLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLA 70
            +L    + + VV++G D DLLV+++   +PS N+Y     +G   P L     T      
Sbjct: 1081 ALTHSTDGQTVVVIGNDTDLLVMMVALATPSMNMYICDTTKG---PRL----CTAKVKRL 1133

Query: 71   EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
              I  ++ IS        F   KV  LR+ ++N                      AGE F
Sbjct: 1134 RFIWLVKLIS--------FPFWKV--LRSRIEN----------------------AGEEF 1161

Query: 131  LVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQ 186
             + LYG N   ++L+KLRY  Y      S+L  T  + +LPPT +AA+QH+L  ++QVQ 
Sbjct: 1162 FLKLYGSNKLVSTLDKLRYYKYKQAKRKSSLTPTIKLESLPPTSAAAAQHSLRAYHQVQT 1221

Query: 187  WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
            W GK V  T WGW      L P+ T K  AP +LL +VAC CK  C
Sbjct: 1222 WRGKMVDATAWGWQIRDGILAPVETTKGVAPQNLLKMVACGCKTQC 1267


>gi|156371522|ref|XP_001628812.1| predicted protein [Nematostella vectensis]
 gi|156215798|gb|EDO36749.1| predicted protein [Nematostella vectensis]
          Length = 373

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 72/219 (32%), Positives = 115/219 (52%), Gaps = 10/219 (4%)

Query: 23  VLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFI 76
            LVGED DLL++L      +  ++ F+   + K   + +++ + T++     + ++I+F 
Sbjct: 118 ALVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFA 177

Query: 77  RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA-GERFLVVLY 135
            AI GCD+TSAL+  GK  +L+ +  +         F    +   +I  A GE  L+ LY
Sbjct: 178 HAILGCDTTSALYGLGKGLSLKMLTSDATFRQQADIFHQADAAKNDIAAAAGETDLLCLY 237

Query: 136 GGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVP 193
            G  + T L+ LRY  +        + +   +LP   +AA  H+L  ++QVQQW G  +P
Sbjct: 238 KGLKDET-LDSLRYARFCQKISTGNTQVRPESLPQISAAAIYHSLRVYHQVQQWRGIALP 296

Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           P +WGW +V   L+P  T +S A  SLL L+ C CK GC
Sbjct: 297 PEDWGWKEVDGKLLPQRTDQSAAHPSLLELIRCKCKTGC 335


>gi|156378368|ref|XP_001631115.1| predicted protein [Nematostella vectensis]
 gi|156218149|gb|EDO39052.1| predicted protein [Nematostella vectensis]
          Length = 315

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 72/219 (32%), Positives = 116/219 (52%), Gaps = 10/219 (4%)

Query: 23  VLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFI 76
            LVGED DLL++L      +  ++ F+   + K   + +++ + T++     + ++I+F 
Sbjct: 60  ALVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFA 119

Query: 77  RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA-GERFLVVLY 135
            AI GCD+TSAL+  GK  +L+ +  +         F    +   +I  A GE  L+ LY
Sbjct: 120 HAILGCDTTSALYGLGKGLSLKMLTSDVTFRQQADIFHQADAAKNDIAAAAGETALLCLY 179

Query: 136 GGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVP 193
            G  + T L+ LRY  +        + +   +LP   +AA  H+L  ++QVQQW G  +P
Sbjct: 180 KGLKDET-LDSLRYARFCQKISTGNTQVRPESLPQISAAAIYHSLRVYHQVQQWRGIALP 238

Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           P +WGW +V   L+P  T +S A  SLL L+ C+CK GC
Sbjct: 239 PEDWGWKEVDGKLLPQRTDQSAAHPSLLELIRCNCKTGC 277


>gi|328698807|ref|XP_003240738.1| PREDICTED: hypothetical protein LOC100571634 [Acyrthosiphon pisum]
          Length = 1377

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 78/193 (40%), Positives = 109/193 (56%), Gaps = 17/193 (8%)

Query: 22   VVLVGEDVDLLVILIGTRSPSN-VYFLKPGRGKVVPVLYHPQTTIDTSLAEHI------- 73
            VV++GED+DLL++L+    P N + F KPGRGK+    Y        SL EH        
Sbjct: 1190 VVVIGEDIDLLILLLTLSPPKNTIIFEKPGRGKIETRSYAV-----GSLQEHFKNEIKYF 1244

Query: 74   LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
            +FI AI GCD+TSALF QGK+K L+TV K+ +L   +  F + +S+  EI  AGE++L+ 
Sbjct: 1245 MFIHAIGGCDTTSALFQQGKIKHLKTVKKHQELHDALLIFNNESSSPEEIERAGEKYLLA 1304

Query: 134  LYGGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGKN 191
            LY      TSLNKLR+  +  +A   K    +A LPPT  AA +H    + Q+Q W G  
Sbjct: 1305 LYKAPAHITSLNKLRHDVFQKTAASNKKQVQLARLPPTIDAAREHLHRVYLQIQLWRGNK 1364

Query: 192  VP--PTEWGWNQV 202
            +    TE G  ++
Sbjct: 1365 LEFRSTELGLERI 1377


>gi|449667181|ref|XP_002159051.2| PREDICTED: uncharacterized protein LOC100200337, partial [Hydra
           magnipapillata]
          Length = 977

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 89/169 (52%), Gaps = 13/169 (7%)

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
             +H+LF+ A SGCDSTSA+F +GK   L +V K+  +      F+D  +T +EI ++  
Sbjct: 777 FKDHLLFVHAWSGCDSTSAIFGKGKAMFLNSVKKSESMKEISETFMDYWATNKEIAKSSV 836

Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL 188
                LY G H+ TSL KLRY  Y+ +  K       LPPT+ AA  H     Y+V   L
Sbjct: 837 NAFKELYNG-HQQTSLAKLRYSKYLEALCKGIVVPEKLPPTDRAAHFHG----YRVHLQL 891

Query: 189 GK--------NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
            +        N+ PTEWGW      L PI T K  AP +LL ++ C+C+
Sbjct: 892 IELKMLDEELNLKPTEWGWKSTDGHLEPIPTDKEIAPPNLLKVIRCNCR 940


>gi|443689573|gb|ELT91946.1| hypothetical protein CAPTEDRAFT_216478 [Capitella teleta]
          Length = 449

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 75/236 (31%), Positives = 110/236 (46%), Gaps = 47/236 (19%)

Query: 2   HYRVPSSGRRSLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYH 60
           HY V S+       + + + V+++G D DLLV+++   +PS +VY     +G  V  + H
Sbjct: 159 HYPVNST----FIVDGDGQTVIVIGNDTDLLVMMVALATPSMHVYLCDTTKGPRVFSISH 214

Query: 61  PQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTT 120
                                           + K  R V  +  L  Y  +   P + T
Sbjct: 215 --------------------------------REKQRRNVELSSSL--YAEKPETPKAET 240

Query: 121 REITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQH 176
             + E    FL+ LYG N   ++L+KLR+  Y      S+L  T  +  LPPT +AA+QH
Sbjct: 241 ENVRE----FLLKLYGSNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLECLPPTSAAAAQH 296

Query: 177 ALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           +L  ++QVQ W GK V  T WGW   G  L P+ T K   P++LL +VACSCK  C
Sbjct: 297 SLRAYHQVQTWCGKRVDATAWGWQIGGGILAPVETTKGVVPENLLKMVACSCKTQC 352


>gi|443706049|gb|ELU02310.1| hypothetical protein CAPTEDRAFT_209473 [Capitella teleta]
          Length = 242

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/220 (32%), Positives = 103/220 (46%), Gaps = 43/220 (19%)

Query: 18  EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFI 76
           + + V+++  D DLLV+++   +PS NVY     +G  V  + H +              
Sbjct: 58  DRQTVIVIDNDTDLLVMMVALATPSMNVYICDTTKGPRVFSISHRE-------------- 103

Query: 77  RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
                             K  R V  +  L  Y  +   P + T  + E    FL+ LYG
Sbjct: 104 ------------------KQRRNVELSSSL--YAEKPETPKAETENVRE----FLLKLYG 139

Query: 137 GNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNV 192
            N   ++L+KLR+  Y      S+L  T  +  LPPT +AA+QH+L  ++QVQ W GK V
Sbjct: 140 SNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLECLPPTSAAAAQHSLRAYHQVQTWRGKRV 199

Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
             T WGW   G  L P+ T K   P++LL +VACSCK  C
Sbjct: 200 DATAWGWQIGGGILAPVETTKGVVPENLLKMVACSCKTQC 239


>gi|443725032|gb|ELU12774.1| hypothetical protein CAPTEDRAFT_192602 [Capitella teleta]
          Length = 773

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 75/226 (33%), Positives = 106/226 (46%), Gaps = 44/226 (19%)

Query: 12  SLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLA 70
           +L    + + VV++G D  LLV+++   +PS N+Y     +G   P L     T      
Sbjct: 555 ALTHSTDGQTVVVIGNDTYLLVMMVALATPSMNMYICDTTKG---PRL----CTAKVKRL 607

Query: 71  EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
             I  ++ IS        F   KV  LR+ ++N                      AGE F
Sbjct: 608 RFIWLVKLIS--------FPFWKV--LRSRIEN----------------------AGEEF 635

Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKV----TSYIAALPPTESAASQHALNKFYQVQQ 186
            + LYG N   ++L+KLRY  Y  +  K     T  + +LPPT +AA+QH+L  ++QVQ 
Sbjct: 636 FLKLYGSNKLVSTLDKLRYYKYKQAKRKSPLTPTIKLESLPPTSAAAAQHSLRAYHQVQT 695

Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           W GK V  T WGW      L P+ T K  AP +LL +VAC CK  C
Sbjct: 696 WRGKMVDATAWGWQIRDGILAPVETTKGVAPQNLLKMVACGCKTQC 741


>gi|405954377|gb|EKC21837.1| hypothetical protein CGI_10003232 [Crassostrea gigas]
          Length = 372

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 76/238 (31%), Positives = 111/238 (46%), Gaps = 27/238 (11%)

Query: 11  RSLASEEEDEVVVLVGEDVDLLVILIG-TRSPSNVYF-------LKPGR----GKVVPVL 58
           ++  ++ +D  VV+ GED DLLV+L    +    ++F       +K  R     K  PVL
Sbjct: 79  QTAVNKADDHTVVVYGEDTDLLVLLCHYAKEGRQIFFTTDKHTSMKNHRVWDISKAKPVL 138

Query: 59  YHPQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
                   +     +LFI A++GCD+TS L   GK  AL+ ++    L      FL   S
Sbjct: 139 -------GSDSCRQLLFIHALTGCDTTSCLHGIGKPAALKKIMTEIYLNSQGAVFLQENS 191

Query: 119 TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL----KVTSYIAALPPTESAAS 174
           +  +I +AGE  LV LY G      L+   ++ + T  +         + +LPPT  AA 
Sbjct: 192 SKEDIIKAGEEALVNLYAGV-LLEGLDIHIWRKFTTKTMSSKRNAVVQVQSLPPTSDAAI 250

Query: 175 QHALNKFYQVQQWLGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
            H+   + Q Q W GK+V    PTEW W      L+PI   K P PD L  +  C+CK
Sbjct: 251 FHSTRVYLQCQYWKGKSVADLDPTEWDWTLKTGKLLPIEMSKPPVPDFLFKIAHCNCK 308


>gi|443691218|gb|ELT93134.1| hypothetical protein CAPTEDRAFT_194406 [Capitella teleta]
          Length = 250

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 109/225 (48%), Gaps = 48/225 (21%)

Query: 18  EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFI 76
           + + V+++G D DLLV+++   +PS NVY     +G +V                    I
Sbjct: 36  DGQTVIVIGNDTDLLVMMVALATPSMNVYICDTTKGPIV------------------FSI 77

Query: 77  RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
           R I          N G V+   ++        Y  +   P +   E   A E FL+ LYG
Sbjct: 78  RVIEK--------NNGNVEQSSSL--------YAEK---PETPKAETENAREEFLLKLYG 118

Query: 137 GNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNV 192
            N   ++L+KLR+  Y      S+L  T  + +LPPT  AA+QH+L  ++QVQ W GK +
Sbjct: 119 SNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLESLPPTSVAAAQHSLRAYHQVQTWRGKMI 178

Query: 193 PPTEWGWNQVGN-TLIPITTLKSPAPDSLLSLVAC----SCKKGC 232
             T WGW Q+G+  L P+ T K   P++LL +VAC     C+K C
Sbjct: 179 DATAWGW-QIGDGILAPVETTKKVVPENLLKMVACGYKTQCRKSC 222


>gi|443682857|gb|ELT87293.1| hypothetical protein CAPTEDRAFT_189847 [Capitella teleta]
          Length = 173

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 73/129 (56%), Gaps = 5/129 (3%)

Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVT 160
           DL P +  F  P +   EI E  E F + LYG N   ++L+KLR+  Y      S+L  T
Sbjct: 16  DLFPILESFEKPETPKAEI-ENAEEFPLKLYGSNKLVSTLDKLRHYKYKQAIEKSSLTST 74

Query: 161 SYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSL 220
             + + PPT +AA+QH+L  ++QVQ W GK V  T WGW      L P+ T K  APD+L
Sbjct: 75  IKLESQPPTSAAAAQHSLRAYHQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVAPDNL 134

Query: 221 LSLVACSCK 229
           L +VAC CK
Sbjct: 135 LKMVACGCK 143


>gi|321449045|gb|EFX61703.1| hypothetical protein DAPPUDRAFT_338406 [Daphnia pulex]
          Length = 296

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 104/220 (47%), Gaps = 11/220 (5%)

Query: 23  VLVGEDVDLLVILIG--TRSPSNVYFL---KPGRGKVVPVLYHPQTTIDTSLAEHILFIR 77
           V++GED DLL++L+         +Y     K  +G  +  +   +  +  +L + +LF  
Sbjct: 60  VVIGEDTDLLILLLHYCNEKIEEIYMRIEPKAQKGGKIWNIAKLKKMLGKNLCDSLLFGH 119

Query: 78  AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
           A   CD+TS  F +GK  AL+ +  N       + F  P S+  EI  AGE  + ++YGG
Sbjct: 120 AFLRCDTTSKPFGKGKAMALKLLNTNVQFQRLSKVFYLPTSSIEEIDAAGESAMCIVYGG 179

Query: 138 NHETTSLNKLRY---KSYVTSALKVTSYI-AALPPTESAASQHALNKFYQVQQWLG-KNV 192
              T  +  LRY   K  V++A    S     LPPT+ +   H+   + Q+Q WLG   +
Sbjct: 180 -LPTDDIAFLRYTIFKKKVSNASVAKSIKPEELPPTKGSVKFHSRRVYLQIQMWLGVTTM 238

Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
             T+WGW        P TT   PAP  +L ++ C C   C
Sbjct: 239 KETDWGWFMNDRLFFPNTTDLPPAPKEILKMIKCGCNGSC 278


>gi|405968047|gb|EKC33150.1| hypothetical protein CGI_10014062 [Crassostrea gigas]
          Length = 399

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 12/215 (5%)

Query: 23  VLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKVVPV------LYHPQTTIDTSLAEHILF 75
           VL+GE+ DLLV+L+      SN    K G    V        +   +  +   L   +  
Sbjct: 136 VLLGENSDLLVLLLHHANVTSNSLIFKSGNVSKVNTHIKIWDILKTKLLLGEELCTLLPL 195

Query: 76  IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
           IRAISGCD+TS +F   +   L+   ++ D L    + L  A+   ++  AGE  +  +Y
Sbjct: 196 IRAISGCDTTSRMFGVSEASTLKKFGEH-DFLKTQAQLLCNANAKDDVISAGESIISSVY 254

Query: 136 GGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGK-NV 192
            G      LN LRY+ +    L  K    I  LPPT +AAS ++ + + Q++ W+ K N+
Sbjct: 255 KG-APYEGLNVLRYRKFAARVLTNKTCVQIHTLPPTSNAASFYSQSAYLQMKMWMNKDNL 313

Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACS 227
            P EWGW      L+P+      A   LL+++ C+
Sbjct: 314 NPCEWGWKVANGNLVPVKCTMDAALSKLLNIIRCN 348


>gi|405973593|gb|EKC38297.1| hypothetical protein CGI_10000780 [Crassostrea gigas]
          Length = 449

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 98/204 (48%), Gaps = 14/204 (6%)

Query: 23  VLVGEDVDLLVILIGTRS-PSNVYFLKPGRGKVVPV------LYHPQTTIDTSLAEHILF 75
           VL+GED DLLV+L+      SN    K G    V        +   +  +   L   +  
Sbjct: 246 VLLGEDTDLLVLLLHHADVTSNSLIFKSGNVSKVNTHIKIWDILKTKVLLGEELCTLLPL 305

Query: 76  IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
           I AISGCD+TS +F   K   L+   ++ D L   R+ L  A+   ++  AGE  +  LY
Sbjct: 306 IHAISGCDTTSRMFGVSKAATLKKFAEH-DFLK-TRQLLCNANAKDDVISAGENIISSLY 363

Query: 136 GGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGK-NV 192
            G      LN LRY+ +    L  K    I  LPPT +AAS H+   + Q++ W+ + N+
Sbjct: 364 NG-APYEELNVLRYRKFAARVLTNKTCVQIHTLPPTSNAASFHSQRAYLQMKMWMNEDNL 422

Query: 193 PPTEWGWNQVGNTLIPI-TTLKSP 215
            P EWGW      L+P+  T+K P
Sbjct: 423 NPCEWGWKVANGNLVPVKCTVKLP 446


>gi|443709383|gb|ELU04056.1| hypothetical protein CAPTEDRAFT_206657 [Capitella teleta]
          Length = 493

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 67/121 (55%), Gaps = 4/121 (3%)

Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTES 171
           P +   EI  A E FL+ LY  N   ++ +KLR+  Y      S+L  T    +LPPT +
Sbjct: 341 PETPKAEIENAREEFLLKLYSSNKLGSTHDKLRHYKYKQAIEKSSLTSTIKHESLPPTSA 400

Query: 172 AASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKG 231
           AA+QH+L  ++QVQ W GK V  T WGW      L P+ T K   P++LL +VAC CK  
Sbjct: 401 AAAQHSLRVYHQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVVPENLLKMVACGCKTQ 460

Query: 232 C 232
           C
Sbjct: 461 C 461


>gi|357612321|gb|EHJ67915.1| hypothetical protein KGM_18541 [Danaus plexippus]
          Length = 707

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 1/111 (0%)

Query: 123 ITEAGERFLVVLYGGNHETTSLNKLRYKSYVT-SALKVTSYIAALPPTESAASQHALNKF 181
           + E G R L+  Y       +++  RY  ++  + L     ++ LPPT  AA QH    +
Sbjct: 517 LLENGVRILLAFYNAPKSEDNIDHFRYTQFIKFTKLNKPVQLSTLPPTSVAAHQHIKRVY 576

Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           YQ+Q WLGK++ P EWGW      L PI TL  PAP  LL+++ C+CK GC
Sbjct: 577 YQIQTWLGKDLEPQEWGWMLENEILEPIRTLLPPAPAELLNVIFCNCKNGC 627


>gi|321455975|gb|EFX67094.1| hypothetical protein DAPPUDRAFT_331399 [Daphnia pulex]
          Length = 222

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 84/166 (50%), Gaps = 3/166 (1%)

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
           +  +ILF   + GCD+TS  +   K K+L+ +  N        +F +  +T   I +AGE
Sbjct: 15  ICNNILFAHTVLGCDTTSKPYGVCKAKSLKLLPTNTTFQMMAAKFYEQDATIDVIKKAGE 74

Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQ 186
             ++++YGG+     ++ LRYK +       TS++    +PPT +A   H+   ++Q+Q 
Sbjct: 75  VAMILIYGGS-TNEGIDTLRYKEFQRKISIATSFVNPQEIPPTSAALQFHSQRVYFQIQS 133

Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           WLG ++   +WGW    + L P  T    +P  LL ++   CK  C
Sbjct: 134 WLGFSLDACDWGWVMKHDMLWPRLTNIEASPKDLLQIIKGGCKGVC 179


>gi|405970451|gb|EKC35353.1| hypothetical protein CGI_10012495 [Crassostrea gigas]
          Length = 218

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 84/166 (50%), Gaps = 2/166 (1%)

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
           L + +  I +++G D+TS  F  GK  AL+ +     +   +  F D +++T  I ++G 
Sbjct: 7   LCQLLPAIHSVTGFDTTSKPFGIGKAAALKKIKSTNYIQETLNTFCDTSASTESILQSGS 66

Query: 129 RFLVVLYGG-NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
             +  LYGG  +E   + +LR      +A  ++  + +LPPT  AA+ H    + Q + W
Sbjct: 67  EVISSLYGGVPYEGLDILRLRKFGSKVAAGSLSVQVQSLPPTADAANFHIKRAYLQYRYW 126

Query: 188 L-GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           +  +N+ P +WGW      L P+ T K  AP  LL  V C+CK  C
Sbjct: 127 MTDENLDPCQWGWYLSSGELFPVKTSKPVAPAILLKTVKCNCKTNC 172


>gi|443692109|gb|ELT93782.1| hypothetical protein CAPTEDRAFT_190961 [Capitella teleta]
          Length = 394

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 86/167 (51%), Gaps = 16/167 (9%)

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
           +   ++F R + G  S      +   +A + + K  D+      FLDP +   ++ +AGE
Sbjct: 197 IQSEVMFQRYMLGAQSM-----ESPAEAFKRLFKEADI------FLDPTANINDVRQAGE 245

Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQ 186
             + +++G    T+S N LRY+       +   Y+    LPPT +A+  H+   + QVQQ
Sbjct: 246 SAMAIVFGAKSRTSS-NDLRYQLLCKKIARKNKYVQPCTLPPTSAASKFHSDRVYLQVQQ 304

Query: 187 WL-GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           W   +++ P EWGW ++ ++++PI T    AP  LLS++  SC+  C
Sbjct: 305 WRKNESLSPCEWGW-KIVDSVMPIMTDVPAAPAVLLSMICYSCQGDC 350


>gi|321470191|gb|EFX81168.1| hypothetical protein DAPPUDRAFT_317920 [Daphnia pulex]
          Length = 392

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 90/175 (51%), Gaps = 10/175 (5%)

Query: 22  VVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHP---QTTIDTSLAEHILFI 76
           VV+VGED DLL++LI     S  N+Y     + K    L++    Q ++   L   ILF 
Sbjct: 219 VVVVGEDTDLLILLIHYCKESNHNMYMKSESKSKKCGKLWNIKKIQDSLGKELCSSILFC 278

Query: 77  RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
            A  GCD+TS  F + K  +L+    N D     + F +  ST ++I  AGE  + ++YG
Sbjct: 279 HAFLGCDTTSKPFGKEKCASLKLQNTNSDFKIVSKIFYESESTKQDIDTAGENAMCIVYG 338

Query: 137 GNHETTSLNKLRYKSY---VTSALKVTSYI-AALPPTESAASQHALNKFYQVQQW 187
           G      +++LRY+ +   V +A    S I   LPPT++A   H+   ++QV ++
Sbjct: 339 G-LVIDGIDRLRYQIFQKKVNNAKLTKSIIPEELPPTQAALKFHSRRAYFQVFKF 392


>gi|260808991|ref|XP_002599290.1| hypothetical protein BRAFLDRAFT_64352 [Branchiostoma floridae]
 gi|229284567|gb|EEN55302.1| hypothetical protein BRAFLDRAFT_64352 [Branchiostoma floridae]
          Length = 1620

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 18/233 (7%)

Query: 14   ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTS 68
            A+E   E VV+  ED D+++I +    R P  VY  K G   R + + +       +   
Sbjct: 1338 AAESGSEAVVITAEDTDVMIISLAFAKRIPCKVY-QKCGTKNRTRFIDI-DKLADALGEE 1395

Query: 69   LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EA 126
            + + ++ + A +GCD+ SA   +GK+ A + ++KN +     ++  +  + + + T  + 
Sbjct: 1396 VCKALVGLHAFTGCDTVSAFSGRGKLGAFKLMLKNEEYQNAFQKLGESWTVSPDDTLFKR 1455

Query: 127  GERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQ 186
             ERF   +Y  +     +N++R+  ++     V S  +ALPP       H     YQ   
Sbjct: 1456 IERFTCQMYVSSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGI 1513

Query: 187  WLG--KNVP----PTEWGWNQVGNTLIPITTLKS-PAPDSLLSLVACSCKKGC 232
            W G  +N P    P + GW    +  + IT L+S PAP ++L L+ CSC + C
Sbjct: 1514 WRGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPAAVLELLTCSCSRSC 1566


>gi|443716490|gb|ELU07992.1| hypothetical protein CAPTEDRAFT_216620 [Capitella teleta]
          Length = 1395

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 77/164 (46%), Gaps = 10/164 (6%)

Query: 75   FIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVL 134
               A++GCD+ SA + +GK  A  T    P++    ++ +   + ++      ERF+++L
Sbjct: 1213 LFHAMTGCDTVSAFYGRGKRTAWETWQSYPEVTEAFQKCVSSDTVSKTCMSLLERFVILL 1272

Query: 135  YGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN--- 191
            Y  + + T +N+ R   +   A  + +    +PPT++A  QH      Q + W       
Sbjct: 1273 YDKSSKATDVNEARKHIFTQKARSLEN----IPPTQAALEQHVKRAVLQAKIWNNSTEAV 1328

Query: 192  ---VPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
               + P++WGW + G+   P+ T    A      L+ C CKKGC
Sbjct: 1329 PSAIDPSKWGWVKEGSQWKPLWTSLPEAAKCCYELIHCGCKKGC 1372


>gi|260819138|ref|XP_002604894.1| hypothetical protein BRAFLDRAFT_77273 [Branchiostoma floridae]
 gi|229290223|gb|EEN60904.1| hypothetical protein BRAFLDRAFT_77273 [Branchiostoma floridae]
          Length = 1407

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 109/232 (46%), Gaps = 16/232 (6%)

Query: 14   ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHPQ--TTIDTSL 69
            A+E   E VV+  ED D++VI +    R P  +Y  K G       +   Q   T+   +
Sbjct: 1125 AAETGSEAVVITAEDTDVMVISLAFAKRIPCKLY-QKCGTKSRTRFIAIDQLGDTLGEEV 1183

Query: 70   AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EAG 127
             + ++ + A +GCD+ SA   +GK+ A + +++N +     ++  +  + + + T  +  
Sbjct: 1184 CKALVGLHAFTGCDTVSAFSGRGKLGAFKLMLRNAEYQDAFQQLGESWTVSPDDTLFKRI 1243

Query: 128  ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
            ERF   +Y  +     +N++R+  ++     V S  +ALPP       H     YQ   W
Sbjct: 1244 ERFTCQMYVSSTSVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 1301

Query: 188  LG--KNVP----PTEWGWNQVGNTLIPITTLKS-PAPDSLLSLVACSCKKGC 232
             G  +N P    P + GW    +  + I  L+S PAP ++L L+ CSC + C
Sbjct: 1302 RGCLQNDPQVPSPVDAGWKLDEDGNLSIRWLQSPPAPAAVLELLTCSCSRSC 1353


>gi|156356217|ref|XP_001623825.1| predicted protein [Nematostella vectensis]
 gi|156210558|gb|EDO31725.1| predicted protein [Nematostella vectensis]
          Length = 605

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 64/122 (52%), Gaps = 3/122 (2%)

Query: 113 FLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIA--ALPPTE 170
           F    +   +I  AGE  L+ LY G  + T L+ LRY  +        + +   +LPPT 
Sbjct: 419 FHQADAAKNDIAAAGETDLLCLYKGLKDET-LDSLRYARFCQKISTGNTQVQPESLPPTS 477

Query: 171 SAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
           +AA  H+L  ++QVQQW G  +PP +WGW +V   L P  T +S A  SLL L+ C+   
Sbjct: 478 AAAIYHSLRVYHQVQQWRGIALPPEDWGWKEVDGKLQPQRTDQSAAHPSLLELIRCNSLC 537

Query: 231 GC 232
            C
Sbjct: 538 SC 539


>gi|260817557|ref|XP_002603652.1| hypothetical protein BRAFLDRAFT_98594 [Branchiostoma floridae]
 gi|229288974|gb|EEN59663.1| hypothetical protein BRAFLDRAFT_98594 [Branchiostoma floridae]
          Length = 1607

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 99/222 (44%), Gaps = 15/222 (6%)

Query: 22   VVLVGEDVDLLVILIGTRS--PSNVYFLKPG---RGKVVPVLYHPQTTIDTSLAEHILFI 76
            V++  ED D+ ++ +  +   P+++ F+K G   R K V +    Q      +   +L +
Sbjct: 1340 VIVSSEDTDVFLLCLAFKQSIPASI-FVKCGTHSRIKYVSITNAAQV-WGQDICSSLLGM 1397

Query: 77   RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
             A +GCDS SA   +GK+ ALR V +N D     +        + E+ +  E F   +Y 
Sbjct: 1398 HAFTGCDSVSAFAGRGKLGALRLVKENRDFQEMFKLVGMDWELSNELFKKLEEFTCHMYS 1457

Query: 137  GNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGK-----N 191
                T+ +N+LRY+ +      + S    LPP       HA    Y    W         
Sbjct: 1458 SRPGTSDVNELRYRLFCAKRGSIDS--VQLPPCADCLYNHAKRANYVAAIWKKSLESHPV 1515

Query: 192  VP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
            +P P   GW + G+ L+       PAP ++L L++CSC K C
Sbjct: 1516 IPSPIGLGWCKDGDQLVIDWMDGEPAPTAVLELLSCSCSKAC 1557


>gi|405973743|gb|EKC38436.1| hypothetical protein CGI_10011190 [Crassostrea gigas]
          Length = 183

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 4/131 (3%)

Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL--KVTSY 162
           D L    + L  A+       AGE  +  LY G      LN LRY+ +    L  K    
Sbjct: 17  DFLKTQAQLLCNANAKDYAISAGENIISSLYNG-APYEGLNVLRYRKFAARVLTNKTCVQ 75

Query: 163 IAALPPTESAASQHALNKFYQVQQWLGK-NVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
           I  LPPT +AAS H+   + Q++ W+ K N+ P EWGW      L+P+      AP  L 
Sbjct: 76  IHTLPPTSNAASFHSQRAYLQMKMWMNKDNLNPCEWGWKVANGNLVPVKCTMDAAPSKLR 135

Query: 222 SLVACSCKKGC 232
           +++ C+CK  C
Sbjct: 136 NIIRCNCKTNC 146


>gi|443728947|gb|ELU15065.1| hypothetical protein CAPTEDRAFT_192160 [Capitella teleta]
          Length = 480

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 51/81 (62%), Gaps = 4/81 (4%)

Query: 150 KSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPI 209
           KS +TS +K+ S    LPPT +AA+QH+L  ++QVQ W GK V  T WGW      L P+
Sbjct: 336 KSSLTSTIKLES----LPPTSAAAAQHSLRAYHQVQTWHGKMVDATAWGWQIRDGILAPV 391

Query: 210 TTLKSPAPDSLLSLVACSCKK 230
            + K   P++LL +VAC CK+
Sbjct: 392 ESTKGVDPENLLKIVACGCKR 412


>gi|443724546|gb|ELU12506.1| hypothetical protein CAPTEDRAFT_214834 [Capitella teleta]
          Length = 463

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 64/121 (52%), Gaps = 10/121 (8%)

Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTES 171
           P +   EI  A E FL+ LYG N   ++ +KLR+  Y      S+L  T  + +LPPT +
Sbjct: 317 PKTPKAEIENAREEFLLKLYGSNKLGSTFDKLRHYKYKQAIEKSSLTSTIRLESLPPTSA 376

Query: 172 AASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKG 231
           AA+QH+   ++QV      ++  T WGW      L P  T K   P++LL +VAC CK  
Sbjct: 377 AAAQHS---YHQVH---STDLHATAWGWQMGDGILAPEETTKGVVPENLLKMVACGCKTQ 430

Query: 232 C 232
           C
Sbjct: 431 C 431


>gi|321469250|gb|EFX80231.1| hypothetical protein DAPPUDRAFT_103743 [Daphnia pulex]
          Length = 210

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 75/150 (50%), Gaps = 3/150 (2%)

Query: 85  TSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSL 144
           +S  +  GK K+L+ +  N  +     +F +  +T   I EAGE  ++++YGG+     +
Sbjct: 19  SSKPYGVGKAKSLKLLSTNTTIQMMAAKFYEQDATIDVIQEAGEVAMILIYGGST-NKGI 77

Query: 145 NKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQV 202
           + LRYK +       TS++    +PPT +A   ++   ++QVQ WLG ++   +WGW   
Sbjct: 78  DTLRYKEFQRKISIATSFMNPQEIPPTSAALQFNSQRVYFQVQSWLGFSLDDCDWGWVMK 137

Query: 203 GNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
            + L P       AP  LL ++ C  K  C
Sbjct: 138 HDMLWPRLNNIEAAPKDLLQIIKCGGKGDC 167


>gi|291238911|ref|XP_002739369.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1049

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 59/233 (25%), Positives = 99/233 (42%), Gaps = 17/233 (7%)

Query: 14   ASEEEDEVVVLVGEDVDLLVILIGTRSPSNV-YFLKPG---RGKVVPVLYHPQTTIDTSL 69
            A+EE  + VV+  ED D+ ++ +          F K G   R K++ +       I   +
Sbjct: 772  ATEEGYQAVVICSEDTDVFIMSLAFYDKIGASVFQKYGTRTRTKIIDI-GKVSAAIGIDV 830

Query: 70   AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
               ++ + A +GCD+ SA   +GK  ALR +  N +      +     + ++E+    E 
Sbjct: 831  CRALIGMHAFTGCDTVSAFAGKGKASALRFMTSNRETQDIFLQLGQEWTLSQELMNKLEA 890

Query: 130  FLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW-- 187
            F   LYG    ++ +N LRY  +     ++ S+   LPP      +HA    YQ   W  
Sbjct: 891  FTCTLYGPKASSSKVNDLRYHLFCAKKGEIESH--QLPPCRDCLVKHAQRANYQAGIWRR 948

Query: 188  ---LGKNVP-PTEWGWN----QVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
                   +P P   GW     +    L+       PAP ++L L+AC+C + C
Sbjct: 949  CLEQDPQIPSPIGKGWKIEKFEGTEQLVVHWMDGQPAPQAILDLLACNCTRKC 1001


>gi|328698809|ref|XP_003240739.1| PREDICTED: hypothetical protein LOC100571724 [Acyrthosiphon pisum]
          Length = 442

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 4/119 (3%)

Query: 88  LFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKL 147
           LF QGK+K L+TV K+ +L   +  F + +S+  EI  AGE++L+ LY      TSLNKL
Sbjct: 324 LFQQGKIKHLKTVKKHQELHDALLIFNNESSSPEEIERAGEKYLLALYKAPAHITSLNKL 383

Query: 148 RYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVP--PTEWGWNQV 202
           R+  +  +A   K    +A LPPT  AA +H    + Q+Q W G  +    TE G  ++
Sbjct: 384 RHDVFQKTAASNKKQVQLARLPPTIDAAREHLHRVYLQIQLWRGNKLEFRSTELGLERI 442


>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
            purpuratus]
          Length = 1742

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 94/218 (43%), Gaps = 15/218 (6%)

Query: 22   VVLVGEDVDLLVILIGTRSPSNVYFL-KPGRG--KVVPVLYHPQTTIDTSLAEHILFIRA 78
            V++   D D+ VI +          L   GRG  K    L   ++ +  S+A  ++ +  
Sbjct: 1478 VIIKSPDTDVFVIGVAKADQIQAQLLFHTGRGNNKRTLNLTAIRSHLGDSVANALIGLHC 1537

Query: 79   ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
             SGCDSTS  + + K K L+ + ++ D     ++F    S    + +  E+F+  LY  +
Sbjct: 1538 FSGCDSTSCFYGRSKKKPLKLMTESVDFQAAFQKFGATFSVEESLVDTMEKFVCRLY--D 1595

Query: 139  HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQWLGKNVP--- 193
             + TS+N  RY  ++      T     +PPT  A  +H +   YQ  +     +  P   
Sbjct: 1596 QDCTSVNTARYNKFLMG----TKAEMNMPPTHDALVKHLMRANYQSAIHTRCLEQYPVIP 1651

Query: 194  -PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
             P   GW      +  +     PAP +LL L  CSCKK
Sbjct: 1652 SPHNHGWKVTDTNIEVVWGDLPPAPSTLLELTYCSCKK 1689


>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
            purpuratus]
          Length = 1617

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 15/218 (6%)

Query: 22   VVLVGEDVDLLVILIGTRSPSNVYFL-KPGRG--KVVPVLYHPQTTIDTSLAEHILFIRA 78
            V++   D D+ VI +          L   GRG  K    L   ++ +   +A  ++ +  
Sbjct: 1353 VIIKSPDTDVFVIGVAKADQIQAQLLFHTGRGNNKRTLNLTAIRSHLGDGVANALIGLHC 1412

Query: 79   ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
             SGCDSTS  + + K K L+ + ++ D     ++F    S    + +  E+F+  LY  +
Sbjct: 1413 FSGCDSTSCFYGRSKKKPLKLMTESVDFQAAFQKFGATFSVEESLVDTMEKFVCRLY--D 1470

Query: 139  HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQWLGKNVP--- 193
             + TS+N  RY  ++      T     +PPT  A  +H +   YQ  +     +  P   
Sbjct: 1471 QDCTSVNTARYNKFLMG----TKAEMNMPPTHDALVKHLMRANYQSAIHTRCLEQYPVIP 1526

Query: 194  -PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
             P   GW      +  +     PAP +LL L  CSCKK
Sbjct: 1527 SPHNHGWKVTDTNIEVVWGDLPPAPSTLLELTYCSCKK 1564


>gi|390365574|ref|XP_003730848.1| PREDICTED: uncharacterized protein LOC100891077 [Strongylocentrotus
           purpuratus]
          Length = 636

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 93/216 (43%), Gaps = 18/216 (8%)

Query: 22  VVLVGEDVDLLVILIGTR--SPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAI 79
           VV+V +D D+L++L   R  + +N+Y      G     LY   ++ID    E  LF    
Sbjct: 383 VVVVAQDTDILILLCYHRPVNCTNLYLQADFDG-----LYDI-SSIDIGDQEEFLFKYGW 436

Query: 80  SGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNH 139
           SG D+ S +   G  K      K P     I  F    ST   +  AG R + + YG   
Sbjct: 437 SGNDTVSCI--HGHTKCALYKCKFP--ASVITAFTSNTSTDSTVRTAGLRAMQITYGCG- 491

Query: 140 ETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN---VPPTE 196
             T L K RY  +   A K       LPPTE A +QH+L    QV  W   +   + P  
Sbjct: 492 -DTPLEKSRYLKFKKQAAKGKIDPDRLPPTEDATAQHSLRVHLQVVVWKHLDTAILDPKG 550

Query: 197 WGWNQVGN-TLIPITTLKSPAPDSLLSLVACSCKKG 231
            GW    N  L P       APD+LL  + C+CK+G
Sbjct: 551 RGWELDSNRKLRPKMLSVGIAPDNLLKGICCNCKEG 586


>gi|340385049|ref|XP_003391023.1| PREDICTED: hypothetical protein LOC100633611, partial [Amphimedon
           queenslandica]
          Length = 842

 Score = 70.1 bits (170), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 90/217 (41%), Gaps = 17/217 (7%)

Query: 28  DVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLA----EHILFIRAISGCD 83
           D D+ V+L+   +   +  L    G      Y P   I TS+       +    A +GCD
Sbjct: 606 DTDVFVLLLAAAAKMKIEKLWIAFGTGKNFRYIPVHDIVTSIGAVKCRALPMFHAYTGCD 665

Query: 84  STSALFNQGKVKALRTV-VKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETT 142
           + S+   +GK  A  T  V +     +I     PA  +    E  ERF ++LY    +  
Sbjct: 666 TVSSFSTRGKKSAWDTWKVYDKVTETFIALSTGPAQISDGDLEVLERFTILLYDRTSKAI 725

Query: 143 SLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGK------NVP-PT 195
            +N+ R   +     K    + A+PPT++A  QH     YQ     GK      N+P P 
Sbjct: 726 CINEARKHLFS----KKGRSMDAIPPTKAALIQHIRRAVYQGGHCWGKVYQAAMNLPSPE 781

Query: 196 EWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
            WGW    N   P+ T    A  S   L+ C+CKKGC
Sbjct: 782 NWGWTDT-NDWKPLWTTLPEAGASSRELLCCNCKKGC 817


>gi|291223052|ref|XP_002731527.1| PREDICTED: scavenger receptor class B, member 2-like [Saccoglossus
           kowalevskii]
          Length = 639

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 12/155 (7%)

Query: 85  TSALFN-QG-----KVKALRTVVK-NPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
           T A+F  QG     K  A R +VK +P +      F+    +  +I+   +  +V L+GG
Sbjct: 3   TQAVFELQGFSASEKKSAFRKLVKSDPVMKSCASAFILQNKSQEDISGLAKDMMVNLFGG 62

Query: 138 NHETTSLNKLRYKSYVTSALKVTSYIAA--LPPTESAASQHALNKFYQVQQWLG--KNVP 193
            +  T L+ LR   +        S++    LPPT  A   H+   ++Q+  W+G    + 
Sbjct: 63  KYNDT-LSSLRLIIFTKKVATAESFVTLEWLPPTLPATRFHSQCVYFQIMVWMGMANEMN 121

Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
           P EW   Q  + LIPI T K+ APD LL ++ C C
Sbjct: 122 PIEWRLKQENDQLIPIMTPKNAAPDKLLKIIHCDC 156


>gi|443686811|gb|ELT89964.1| hypothetical protein CAPTEDRAFT_191939 [Capitella teleta]
          Length = 321

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 85/175 (48%), Gaps = 18/175 (10%)

Query: 18  EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPV--------LYHPQTTIDTS 68
           ++  V  V ED DL+V+L+     S     LK    + +          ++  Q ++ + 
Sbjct: 122 KENHVTAVDEDTDLIVLLLHHAPNSCQTIVLKSDIHRSIKSHKPAKQWHIHSAQRSLGSE 181

Query: 69  LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRR----FLDPASTTREIT 124
           +  H+LFI  + GCD+TS+++  G  K +   +KN  L   + +    FLDP +   ++ 
Sbjct: 182 MCRHLLFIHGLLGCDTTSSIYGLG--KGMPLALKNASLCRRLFKEADIFLDPTANINDVQ 239

Query: 125 EAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHA 177
           +A E  + +++G  +   SLN L Y+ +     ++  Y+    LPPT +A+  H+
Sbjct: 240 QARECAMAIVFGVKNR-PSLNDLTYQLFCKKIARINKYVQPCTLPPTSAASKFHS 293


>gi|328699454|ref|XP_003240937.1| PREDICTED: KRAB-A domain-containing protein 2-like [Acyrthosiphon
           pisum]
          Length = 525

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 4/108 (3%)

Query: 74  LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
           +FI A+SGCD+TS++F QGK+K ++T  K+ +L   +  F + +S+  +I   G+ FL+ 
Sbjct: 1   MFIHALSGCDTTSSIFQQGKLKYVKTFQKHSELQDSLLIFNNESSSADDILSVGQEFLLK 60

Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKF 181
           L+      TSLN+    SY++       Y+  +   +       LNKF
Sbjct: 61  LFNAPKFITSLNQEDNSSYLSDE----KYLNIINELKGNHKSRRLNKF 104


>gi|405966034|gb|EKC31359.1| hypothetical protein CGI_10019217 [Crassostrea gigas]
          Length = 233

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 9/164 (5%)

Query: 18  EDEVVVLVGEDVDLLVILIG-TRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTSLAEHI 73
           +D  VV+ GED DLL +L    +    ++F       +   V  +   ++ + +     +
Sbjct: 50  DDHTVVVYGEDTDLLNLLCHYAKEGRQIFFTDKQTSMKNHRVWDISKAKSVLGSDSCRQL 109

Query: 74  LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
           LFI A++GCD+ S L   GK  AL+ ++ +  L      FL   S+  +I +AGE  LV 
Sbjct: 110 LFIHALTGCDTASRLHGIGKPAALKKIMTDIYLKSQGAVFLQENSSKEDIIKAGEEALVN 169

Query: 134 LYGGNHETTSLNKLRYKSYVTSAL----KVTSYIAALPPTESAA 173
           L GG      L+ LR++ + T  +         +  LPPT  AA
Sbjct: 170 LCGGVL-LEGLDILRWRKFTTKTMSSKRNAVVQVQPLPPTSDAA 212


>gi|291228946|ref|XP_002734438.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 760

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 60/128 (46%), Gaps = 10/128 (7%)

Query: 65  IDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRF-LDPASTTREI 123
           I   L   +L   A +GCDS S+  ++GKV  LR + +NP  +P         A TT E 
Sbjct: 627 IGNELCNTLLSFHAFTGCDSISSFVHKGKVGLLRVLQRNPQFIPVFTSLGTSQALTTDEF 686

Query: 124 TEAGERFLVVLYGGNHETTSLNKLRYKSYVT------SALKVTSYI--AALPPTESAASQ 175
            E  E F  ++YG N     +NKLRY  ++T       +L   S I  + LPP  S+   
Sbjct: 687 DEL-EAFTCLMYGANTGVKDINKLRYTMFMTCYTPKNKSLSNESGIDLSLLPPCRSSLRM 745

Query: 176 HALNKFYQ 183
           H +   YQ
Sbjct: 746 HIVRANYQ 753


>gi|156390576|ref|XP_001635346.1| predicted protein [Nematostella vectensis]
 gi|156222439|gb|EDO43283.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 79/173 (45%), Gaps = 17/173 (9%)

Query: 23  VLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTT------------IDTSLA 70
           V+  +D D+LVIL+G         LK G G    ++   Q T            I+ +L 
Sbjct: 356 VVNSDDTDVLVILLGHAQYLGKRHLKKGTGTKTRIIELDQITRKLARLAAQDIAIEDALC 415

Query: 71  EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
             + F  A++GCDS SA   +GK + L+ V+KN   +  ++      S T E+  A E  
Sbjct: 416 GLVGF-HALTGCDSVSAFSQKGKWRPLQIVLKNKKYMEAMKEIGRQWSVTEELFSATEEL 474

Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ 183
           +  +YG     T +N+L Y+ +    +K+     ALPP +S+   H     YQ
Sbjct: 475 VFHIYGKR--GTRVNRLIYELHCAKGVKIEP--NALPPYQSSLKLHVSRANYQ 523


>gi|405970677|gb|EKC35562.1| hypothetical protein CGI_10014486 [Crassostrea gigas]
          Length = 381

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 69/132 (52%), Gaps = 5/132 (3%)

Query: 11  RSLASEEEDEVVVLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKV----VPVLYHPQTTI 65
            ++ ++ +D  +V+ GED DLLV+L    +    ++F    +  +    V  +   ++ +
Sbjct: 161 ETVVNKADDHTMVVYGEDPDLLVLLCHYAKEGRQIFFTTDKQTSMKNHRVGDISKAKSVL 220

Query: 66  DTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE 125
            +     +LFI A++GCD+TS L   GK  AL+ ++ +  L      FL   S+  +I +
Sbjct: 221 GSDSCRQLLFIHALTGCDTTSRLHGIGKPAALKKIMTDIYLKSQGAVFLQENSSKEDIIK 280

Query: 126 AGERFLVVLYGG 137
           AGE  LV LYGG
Sbjct: 281 AGEEALVNLYGG 292


>gi|405978040|gb|EKC42457.1| hypothetical protein CGI_10004686 [Crassostrea gigas]
          Length = 133

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 6/73 (8%)

Query: 163 IAALPPTESAASQHALNKFYQVQQWLGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDS 219
           + +LPPT  AA  H++  + Q Q W GK+V    PTEWGW      L+PI  +K PAPD 
Sbjct: 11  VLSLPPTSDAAIVHSMRVYLQCQYWKGKSVADLDPTEWGWTLKTGKLLPI--VKPPAPD- 67

Query: 220 LLSLVACSCKKGC 232
            L ++ C+CK  C
Sbjct: 68  FLKIIHCNCKTNC 80


>gi|390356858|ref|XP_003728872.1| PREDICTED: uncharacterized protein LOC100891119 [Strongylocentrotus
            purpuratus]
          Length = 1654

 Score = 59.7 bits (143), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 88/212 (41%), Gaps = 15/212 (7%)

Query: 22   VVLVGEDVDLLVILIGTRSP---SNVYFLKPGRGKVVPV-LYHPQTTIDTSLAEHILFIR 77
            +++   D D+ VI IG  +    S +YF    + K   + L   Q  +   +++ I+ + 
Sbjct: 1374 IIIKSPDTDVFVIGIGIAAQLEGSKLYFHTGKQDKERTINLNAIQCHLGDQISDAIVGLH 1433

Query: 78   AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
              +GCDS SAL+ +GK K    + +    +   +      + T ++    E F+  LYG 
Sbjct: 1434 PFTGCDSVSALYGRGKTKPFTLMSQTTKFIKAFQELGKAFTLTDDLVSTLEEFVCKLYGM 1493

Query: 138  NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL-----GKNV 192
              E + +N+ RY  +  +    T     +PP + A  +H     YQ   W        +V
Sbjct: 1494 K-EISKVNEARYAFFSMA----TRQEDIMPPNKDALMKHIQRANYQTAIWRRCLESQPDV 1548

Query: 193  P-PTEWGWNQVGNTLIPITTLKSPAPDSLLSL 223
            P P   GW  V   L        PA  S+L L
Sbjct: 1549 PSPVGRGWQSVDGGLRIDWMDMQPAQQSILEL 1580


>gi|405973424|gb|EKC38141.1| hypothetical protein CGI_10019306 [Crassostrea gigas]
          Length = 117

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL--KVTSY 162
           D L    + L  A+   ++  AGE  +  LY G      LN LRY+ +    L  K    
Sbjct: 17  DFLKTQAQLLCNANAKDDVISAGENIISSLYNG-APYEGLNVLRYRKFAARVLTNKTCVQ 75

Query: 163 IAALPPTESAASQHALNKFYQVQQWLGK-NVPPTEWGW 199
           I  LPPT +AAS H+   + Q++ W+ K N+ P EWGW
Sbjct: 76  IHTLPPTLNAASFHSQRAYLQMKMWMNKDNLNPCEWGW 113


>gi|156398608|ref|XP_001638280.1| predicted protein [Nematostella vectensis]
 gi|156225399|gb|EDO46217.1| predicted protein [Nematostella vectensis]
          Length = 1216

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 92/228 (40%), Gaps = 16/228 (7%)

Query: 14   ASEEEDEVVVLVGE-DVDLLVILIGTRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTSL 69
            AS+E   VV+   + DV +L I    R      + K G   R   +P+ +     +   +
Sbjct: 943  ASQEMTRVVIQSPDTDVAILAIHAFERMGCRELWFKTGYKDRVCFIPI-HDAARKLGPKV 1001

Query: 70   AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
               I  + A++GCD+TS L N GK K     + N D   ++           +  ++ E 
Sbjct: 1002 CAAIPGLHALTGCDTTSGLANIGKTKPWNKFMANTDSQAHLETLGSVIPPHEDTIKSAEC 1061

Query: 130  FLVVLYGGNHET-TSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW- 187
            F+  LY  + +   + +K+RY  +             LPPT  +  QH +   YQ   W 
Sbjct: 1062 FVCSLYTTSTKAGMTADKIRYWMFCQKHQSSDR----LPPTSDSLKQHIIRANYQTFIWK 1117

Query: 188  --LGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
              L   VP   P   G       L P+   + PAP SLL L  C C K
Sbjct: 1118 YCLQPMVPLPSPDGNGGKITNGRLEPLLMSQDPAPLSLLKLTTCRCVK 1165


>gi|390343919|ref|XP_003725994.1| PREDICTED: uncharacterized protein LOC100891484 [Strongylocentrotus
           purpuratus]
          Length = 556

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 88/221 (39%), Gaps = 26/221 (11%)

Query: 14  ASEEEDEVVVLVGEDVDLLVILIG------TRSPSNVYFLKPGRGKVVPVLYHPQTTIDT 67
           A+E   + +V+   D D++ IL+G         P    ++  G GK     ++    I  
Sbjct: 327 AAERGSKKIVIRTVDTDIIAILVGQLPSLIVEYPDIDIWVAFGMGK--NFCHYNINNICR 384

Query: 68  SLAEH----ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLD----PAST 119
           +L E     +    A +GCD+TS    +GK  A R     P++      F+D        
Sbjct: 385 NLGEDKSLALPLFHAYTGCDTTSCFLAKGKKSAWRVWKSYPEVTQAFLHFVDHPFRAVDV 444

Query: 120 TREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALN 179
           + E     ERF V+LY       S+N+ R + +     K    +  +PPT+ A  QH   
Sbjct: 445 SCEHFRHLERFTVLLYDITSNLLSVNEARRELFC----KKKRSLENIPPTQDALLQHIKR 500

Query: 180 KFYQVQQW-----LGKNVPPTE-WGWNQVGNTLIPITTLKS 214
             YQ   W        +VPP E WGW    N       LKS
Sbjct: 501 VLYQGGIWTTCRQAQPSVPPPEGWGWTMEDNHRGVFHQLKS 541


>gi|291221333|ref|XP_002730676.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 836

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 92/205 (44%), Gaps = 29/205 (14%)

Query: 13  LASEEEDEVVVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEH 72
           +AS   D+ +V+   D D+ V+L+     SN       + +++  L+         +   
Sbjct: 589 IASSSPDKTIVVRSPDTDVFVLLLKFGQFSN-------KRRLID-LHKVIAETGKDICSA 640

Query: 73  ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAST---TREITEAGER 129
           +  + A SGCD+ SA   +GK+  L+ + ++P+   +I  FL   +T      + E+ E 
Sbjct: 641 LPALHAFSGCDTISAFVRKGKLLPLKILKQHPE---FIETFLTLGTTPDVDDRVYESLEH 697

Query: 130 FLVVLYGGNHETTSLNKLRYKSYVT-----SALKVTSY----IAALPPTESAASQHALNK 180
           F  +LY   + +  +NKLR + ++          ++SY    ++ LPP ++A   H    
Sbjct: 698 FTCILYKRTYASCDINKLRLELFMQKFNPKQGTVLSSYNGVDMSILPPCKAALQMHIRRA 757

Query: 181 FYQVQQWLGKN-----VP-PTEWGW 199
            YQ   W   N     VP P + GW
Sbjct: 758 NYQALIWCRANQAHPQVPTPDDHGW 782


>gi|443685664|gb|ELT89202.1| hypothetical protein CAPTEDRAFT_223408 [Capitella teleta]
          Length = 636

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 69/169 (40%), Gaps = 14/169 (8%)

Query: 73  ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS--TTREITEAGERF 130
           +L   A+SGCD+ S+ F +GK  A +     PDL   +     P S  + + +    E F
Sbjct: 447 LLAFHALSGCDTVSSFFGKGKRSAWQAWQACPDLTSALLELSSPVSHDSVKRVLPIIETF 506

Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ-QWLG 189
           +  LYG   E+  L     K   T  L        +PP+  A   H L   +Q    W G
Sbjct: 507 VTRLYGV--ESVDLVNAARK---TLFLNKGKQFVQIPPSSDALQLHLLRAVHQSAFVWGG 561

Query: 190 KNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
             +       P EWGW + G+  +P      P   SL  L  CSCK  C
Sbjct: 562 LLIRDPLVPSPEEWGWQRSGSAFVPHYISLPPLSSSLPELSFCSCKSVC 610


>gi|291227245|ref|XP_002733597.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
          Length = 1578

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 13/167 (7%)

Query: 73   ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPA---STTREITEAGER 129
            +L    +SGCD+ S  + +GK +A +  ++    L  + ++ +P         + EA   
Sbjct: 1375 LLCAYVLSGCDTVSYPYKKGKKRAAKVALEVASKLLPLAQYAEPGEGFDVDENVIEAARL 1434

Query: 130  FLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQW 187
            +++ LYG      + ++LR   + T+   +  +    PPTE A  QH L   +Q  V + 
Sbjct: 1435 YIMALYGRADFCGNFDELRAHLFGTTKGDLRCF----PPTEDAFLQHILRGLHQIVVNKS 1490

Query: 188  LGKNVP----PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
                VP     T +G   V   L+PI  LK   P        C CKK
Sbjct: 1491 AHDPVPEYPIATMFGRKVVDGKLVPIRMLKGAKPTEATHKNYCRCKK 1537


>gi|357631811|gb|EHJ79278.1| hypothetical protein KGM_15523 [Danaus plexippus]
          Length = 155

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 1/58 (1%)

Query: 175 QHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           QH +N +YQ+Q WLG  +  T+WGW      ++P  T K   P+ LL  + CSC+ GC
Sbjct: 5   QH-INAYYQLQTWLGNELTATDWGWKLYQRGIMPKFTQKELIPEILLKTICCSCEIGC 61


>gi|260804463|ref|XP_002597107.1| hypothetical protein BRAFLDRAFT_76362 [Branchiostoma floridae]
 gi|229282370|gb|EEN53119.1| hypothetical protein BRAFLDRAFT_76362 [Branchiostoma floridae]
          Length = 1477

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 71/168 (42%), Gaps = 19/168 (11%)

Query: 76   IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE----AGERFL 131
              A +GCD+ S   + GK  A +    + +   +   F D  S+   I++    A E F 
Sbjct: 1293 FHAFTGCDTVSQFSHIGKKTAWKVWETHDE---FTATFYDLHSSPHHISDEAETALEYFT 1349

Query: 132  VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN 191
            ++L+      +S+N+LR   +     +    +++LPPT++A  QH      Q     G+N
Sbjct: 1350 ILLFDRTSTCSSINQLRKHLFT----RKNRPMSSLPPTQAALHQHMRRAILQGGHHWGRN 1405

Query: 192  V-------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
                     P EWGW        P+ T    A +S   L+ C C+  C
Sbjct: 1406 TTPCRQLPSPAEWGWTG-EEEWRPLWTTLPEAVESCPELLKCKCRTRC 1452


>gi|156408397|ref|XP_001641843.1| predicted protein [Nematostella vectensis]
 gi|156228983|gb|EDO49780.1| predicted protein [Nematostella vectensis]
          Length = 604

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 81/195 (41%), Gaps = 54/195 (27%)

Query: 24  LVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFIR 77
           LVGED DLL++L      +  ++ F+   + K   + +++ + T++     + ++I+F  
Sbjct: 438 LVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFAH 497

Query: 78  AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
           AI GCD+TSAL+  GK  +L+ +  +  LL  IR                          
Sbjct: 498 AILGCDTTSALYGLGKGLSLKMLTSDATLLELIR-------------------------C 532

Query: 138 NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEW 197
           N   + LN +                         A  H+L  + QVQQW G  +PP + 
Sbjct: 533 NSLCSCLNGI-----------------------VGAVYHSLRVYQQVQQWRGLALPPEDC 569

Query: 198 GWNQVGNTLIPITTL 212
           G  ++   L P  TL
Sbjct: 570 GLKEMDGKLPPQRTL 584


>gi|260816964|ref|XP_002603357.1| hypothetical protein BRAFLDRAFT_80350 [Branchiostoma floridae]
 gi|229288676|gb|EEN59368.1| hypothetical protein BRAFLDRAFT_80350 [Branchiostoma floridae]
          Length = 1516

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/182 (24%), Positives = 81/182 (44%), Gaps = 9/182 (4%)

Query: 14  ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHPQ--TTIDTSL 69
           A+E   E VV+  ED D++VI +    R P  +Y  K G       +   Q   T+   +
Sbjct: 565 AAETGSEAVVITAEDTDVMVISLAFAKRIPCKLY-QKCGTKSRTRFIAIDQLGDTLGEEV 623

Query: 70  AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EAG 127
            + ++ + A +GCD+ S    +GK+   + ++KN +     ++  +  + + + T  +  
Sbjct: 624 CKALVGLHAFTGCDTVSTFSGRGKLGVFKLMLKNAEYQDAFQQLGESWTVSPDGTLFKRI 683

Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
           ERF   +Y  +     +N++R+  ++     V S  +ALPP       H     YQ   W
Sbjct: 684 ERFTCQMYVSSTSVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 741

Query: 188 LG 189
            G
Sbjct: 742 RG 743


>gi|156398132|ref|XP_001638043.1| predicted protein [Nematostella vectensis]
 gi|156225160|gb|EDO45980.1| predicted protein [Nematostella vectensis]
          Length = 1142

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 95/242 (39%), Gaps = 26/242 (10%)

Query: 10   RRSLASEEEDEVVVLVGEDVDLLVILIGT-----RSPSNVYFLKPGRGKVVPVLYHPQTT 64
            R ++ +    EV+V    D D++VIL G        P    ++  G G+     Y    T
Sbjct: 885  RHAVETSHAKEVIVRTV-DTDIVVILAGKFHQLKSQPELHLWVAYGMGR--HFRYLSINT 941

Query: 65   IDTSLAEHILF----IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRF-LDP--- 116
            I   L E          A+SGCD+TS+ + +GK+ A +T     D+ P ++   L+P   
Sbjct: 942  ICEGLGEARFIALPVFHALSGCDTTSSFYGKGKLTAWQTWDIYKDVTPTLQFLALNPFHQ 1001

Query: 117  ASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQH 176
             +   E  +  ERF V+LY        +N  R + +     K    +  LPPT+    QH
Sbjct: 1002 LTLDSEHFKRIERFTVILYDKLSPFACINDARMELFC----KNNRAMDKLPPTKDTLLQH 1057

Query: 177  ALNKFYQVQQWLGKNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
                 YQ   W    +       P ++ W +   T +P          +   L  C+ K 
Sbjct: 1058 VKRCIYQAGIWTSSEISQPVLPSPDQFAWIKEAGTWVPGWITIPEVSKACSELTKCNSKG 1117

Query: 231  GC 232
             C
Sbjct: 1118 VC 1119


>gi|443731762|gb|ELU16757.1| hypothetical protein CAPTEDRAFT_196996 [Capitella teleta]
          Length = 569

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 77/176 (43%), Gaps = 54/176 (30%)

Query: 20  EVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRA 78
           + VV++G   DLLV+++   +PS NVY                                 
Sbjct: 412 QTVVVIGNGTDLLVMMVALATPSMNVYI-------------------------------- 439

Query: 79  ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFL-------DPASTTREITEAGERFL 131
              CD+T       +V ++R + KN   +  I  +L        P +   EI  A   FL
Sbjct: 440 ---CDTTKG----PRVFSIRVIEKN---IGEISSYLLAWGYAEKPETPKAEIENARGGFL 489

Query: 132 VVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQ 183
           + LYG N   ++L+KLR+  Y      S+L  T  + +LPPT +AA+QH+L  ++Q
Sbjct: 490 LKLYGSNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLESLPPTSAAAAQHSLRAYHQ 545


>gi|443722574|gb|ELU11371.1| hypothetical protein CAPTEDRAFT_211864 [Capitella teleta]
          Length = 264

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 7/73 (9%)

Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRY---KSYVTSALKVTSYIAALPPTESA 172
           P +   EI  A E FL+ LYG N   ++L+KL+    KS +TS +K+ S    LPPT +A
Sbjct: 187 PETPKAEIENAREEFLLKLYGFNKLGSTLDKLKQAIEKSSLTSTIKLES----LPPTSAA 242

Query: 173 ASQHALNKFYQVQ 185
           A+QH+L  ++QVQ
Sbjct: 243 AAQHSLRAYHQVQ 255


>gi|291238319|ref|XP_002739077.1| PREDICTED: tryptophan hydroxylase-like protein [Saccoglossus
           kowalevskii]
          Length = 726

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 83/209 (39%), Gaps = 16/209 (7%)

Query: 27  EDVDLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAISGCDST 85
           +D D+L++    R       +L+   G +  +     +TID S  E  LF    SG  + 
Sbjct: 81  QDTDILILFCYHRPIDCTGLYLQADFGGLYDI-----STIDNSDREEFLFKYGWSG--NG 133

Query: 86  SALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLN 145
           + L   G  K      K P     I  F    S   +I  AG + L + YG     T L 
Sbjct: 134 TILCIHGHTKCALYKCKFP--ASVITTFTSNTSPVSKIRTAGLKALQITYGC--RDTPLE 189

Query: 146 KLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN---VPPTEWGWNQV 202
           K RY  +   A K       LPPTE+A +QH+L    QV  W   N   + P   GW   
Sbjct: 190 KSRYLKFTKQAAKGKIDPDRLPPTENATAQHSLRVHLQVVIWKHLNTSILEPKGRGWELD 249

Query: 203 GN-TLIPITTLKSPAPDSLLSLVACSCKK 230
            N  L P       A D LL  + C+ ++
Sbjct: 250 SNRKLKPKMLSGGIASDILLKGICCTARQ 278


>gi|390337691|ref|XP_790384.2| PREDICTED: LMBR1 domain-containing protein 2-like, partial
           [Strongylocentrotus purpuratus]
          Length = 1013

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 38/66 (57%), Gaps = 4/66 (6%)

Query: 171 SAASQHALNKFYQVQQWLG---KNVPPTEWGWNQ-VGNTLIPITTLKSPAPDSLLSLVAC 226
           +AA  H+L  +YQV+QW G     + P +WGW +  G  ++ + T   PA + LL ++ C
Sbjct: 398 AAAKHHSLRVYYQVRQWKGIAADELQPDDWGWRERSGGVIVLVRTDLPPALEELLKMIRC 457

Query: 227 SCKKGC 232
           +C+  C
Sbjct: 458 NCQIDC 463


>gi|443700686|gb|ELT99539.1| hypothetical protein CAPTEDRAFT_208935 [Capitella teleta]
          Length = 129

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/132 (23%), Positives = 58/132 (43%), Gaps = 10/132 (7%)

Query: 79  ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
           ++GCD+ SA + +GK  A        ++    +  +     ++      E F+++LY  +
Sbjct: 1   MTGCDTVSAFYGRGKRTAWEAWKSYLEVTEAYQDCVSSDRVSKTCMALSEGFVILLYDKS 60

Query: 139 HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN------V 192
            + T +NK R   +   A  + +    +PPT +A  QH      Q + W          +
Sbjct: 61  SKATDVNKARKHIFTQKARSLEN----IPPTHAALEQHVKRAVLQAKIWNNSTEAVPSAI 116

Query: 193 PPTEWGWNQVGN 204
            P++WGW + GN
Sbjct: 117 DPSKWGWVKEGN 128


>gi|260782789|ref|XP_002586464.1| hypothetical protein BRAFLDRAFT_106666 [Branchiostoma floridae]
 gi|229271576|gb|EEN42475.1| hypothetical protein BRAFLDRAFT_106666 [Branchiostoma floridae]
          Length = 276

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 9/112 (8%)

Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
           ERF   +Y  +     +N++R+  ++     V S  +ALPP       H     YQ   W
Sbjct: 113 ERFTCQMYVPSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 170

Query: 188 LG--KNVP----PTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
            G  +N P    P + GW    +  + IT L+SP AP ++L L+ CSC + C
Sbjct: 171 RGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPAAVLELLTCSCSRSC 222


>gi|260827565|ref|XP_002608735.1| hypothetical protein BRAFLDRAFT_73956 [Branchiostoma floridae]
 gi|229294087|gb|EEN64745.1| hypothetical protein BRAFLDRAFT_73956 [Branchiostoma floridae]
          Length = 255

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 9/112 (8%)

Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
           ERF   +Y  +     +N++R+  ++     V S  +ALPP       H     YQ   W
Sbjct: 113 ERFTCQMYVPSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVKQANYQAGIW 170

Query: 188 LG--KNVP----PTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
            G  +N P    P + GW    +  + IT L+SP AP ++L L+ CSC + C
Sbjct: 171 RGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPATVLELLTCSCSRSC 222


>gi|291226324|ref|XP_002733143.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1155

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 27/183 (14%)

Query: 67   TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
            T+LA HI  +         A++GCD TS+ F +GKV  L+  VKN   L  +   L    
Sbjct: 935  TALANHIGPVVCKSLPGYHALTGCDYTSSFFRKGKVNPLKKAVKNTLYLEGL-SILGENI 993

Query: 119  TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA--LPP 168
            T  +     E+++  LYG     TS+N+ R K ++         S L+    I A  LPP
Sbjct: 994  TFADDENLVEQYVCSLYGQG-ALTSVNEARLKIFLQKYRPTNQESPLQKIKGIDAIMLPP 1052

Query: 169  TESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
             +    Q      Y    W         +N+ PT+ GW  V    +P+    +  P  L 
Sbjct: 1053 CKDVLVQKLARCNYVAYLWKHAHVQNPLENIKPTDHGWKDVNGVFLPVWFTDNQMPSMLS 1112

Query: 222  SLV 224
             ++
Sbjct: 1113 QII 1115


>gi|291224113|ref|XP_002732051.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1436

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 27/183 (14%)

Query: 67   TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
            T+LA HI  +         A++GCD TS+ F +GKV  L+  VKN   L  +   L    
Sbjct: 1218 TALANHIGPVVCKSLPGYHALTGCDYTSSFFRKGKVNPLKKAVKNTLYLEGL-SILGENI 1276

Query: 119  TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA--LPP 168
            T  +     E+++  LYG     TS+N+ R K ++         S L+    I A  LPP
Sbjct: 1277 TFADDENLVEQYVCSLYGQG-ALTSVNEARLKIFLQKYRPTNQESPLQKIKGIDAGMLPP 1335

Query: 169  TESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
             +    Q      Y    W         +N+ PT+ GW  V    +P+    +  P  L 
Sbjct: 1336 CKDVLVQKLARCNYVAYLWKHAHIQNPLENIKPTDHGWKDVNGVFLPVWFTGNQMPSMLS 1395

Query: 222  SLV 224
             ++
Sbjct: 1396 QII 1398


>gi|449689845|ref|XP_002154924.2| PREDICTED: uncharacterized protein LOC100207801 [Hydra
           magnipapillata]
          Length = 460

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 166 LPPTESAASQHALNKFYQVQQWL----GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
           LPPT+ AA  H L  + QV +W+      N+ P EWGWN     L+ I T+K     ++L
Sbjct: 119 LPPTDRAAYFHGLRVYLQVIEWIMLDESFNLDPKEWGWNLDNGCLLTIPTIKDVVSPNIL 178

Query: 222 SLVAC 226
            ++ C
Sbjct: 179 KVIRC 183


>gi|328721000|ref|XP_003247183.1| PREDICTED: hypothetical protein LOC100570608 [Acyrthosiphon pisum]
          Length = 135

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
           WLG    P EWGW+   + L+PI  LK P  P+SLL+L+ C+C KGC
Sbjct: 2   WLGNEKSPIEWGWHVKNDLLLPIP-LKGPLIPESLLNLITCNCTKGC 47


>gi|405969476|gb|EKC34445.1| hypothetical protein CGI_10012724 [Crassostrea gigas]
          Length = 101

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 181 FYQVQQWLGK---NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
           + Q Q W GK   ++ PTEWGW      L+PI     PAPD LL ++ C+CK
Sbjct: 4   YLQCQYWKGKSEADLDPTEWGWTLKTGKLLPIEMSNPPAPDFLLKIIHCNCK 55


>gi|443732835|gb|ELU17399.1| hypothetical protein CAPTEDRAFT_207062 [Capitella teleta]
          Length = 627

 Score = 50.4 bits (119), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 67/166 (40%), Gaps = 14/166 (8%)

Query: 76  IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS--TTREITEAGERFLVV 133
             A+SGCD+ S+ F +GK  A +     P L   +     P S  + + +    E F+  
Sbjct: 236 FHALSGCDTVSSFFGKGKRSAWQAWQACPGLTSALLEVSSPVSHDSVKRVLPIIETFVTR 295

Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ-QWLG--- 189
           LYG   E+  L     K   T  L        +PP+  A   H L   +Q    W G   
Sbjct: 296 LYGV--ESVDLVNAARK---TLFLNKGKQFVQIPPSSDALQLHLLRAVHQSAFVWGGLLI 350

Query: 190 --KNVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
               VP P +WGW + G+  +P      P   SL  L  CSCK  C
Sbjct: 351 RDSLVPSPEKWGWQRSGSAFMPHYISLPPLSSSLPELSFCSCKSVC 396


>gi|156364755|ref|XP_001626511.1| predicted protein [Nematostella vectensis]
 gi|156213389|gb|EDO34411.1| predicted protein [Nematostella vectensis]
          Length = 1373

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 51/112 (45%), Gaps = 11/112 (9%)

Query: 124  TEAG-ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFY 182
            T AG E+ +  LY  N   T +  LR+  +     K  +    LPPT +A  +      Y
Sbjct: 1205 TYAGIEKLVCKLYQPNTRITKVKDLRWLLF----RKKQAESERLPPTLAALREATKRAHY 1260

Query: 183  QVQQW-----LGKNVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
            Q   W         +P P+++GW    +  +P+ T   PAPD++L LV C C
Sbjct: 1261 QCMVWNSDILANPELPSPSDYGWKLEADEWVPVMTSLKPAPDAVLHLVKCGC 1312


>gi|449668498|ref|XP_004206798.1| PREDICTED: uncharacterized protein LOC100212644 [Hydra
           magnipapillata]
          Length = 429

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 66/167 (39%), Gaps = 14/167 (8%)

Query: 76  IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT-EAGERFLVVL 134
             AI+GCD  S    + K  + +T     DL   ++   + +S    I     ERF+V+L
Sbjct: 242 FHAITGCDQVSFFAGKEKKLSWKTWENFEDLTNALQSISNCSSKEDIIIFHLIERFVVLL 301

Query: 135 YGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ----QWLGK 190
           Y      +S N+ R   +     K      A+PPT+     H     YQ      Q L K
Sbjct: 302 YDRTSSYSSTNECRKDLFS----KKGRLPDAMPPTQDNLQLHISRAVYQASFCWAQSLCK 357

Query: 191 N--VP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK--KGC 232
           N  +P P EWGW    N    + T           L+ C CK  KGC
Sbjct: 358 NPSLPNPCEWGWVMEDNVYKIVWTTIREVSKMCSELIRCGCKIEKGC 404


>gi|390339454|ref|XP_003725007.1| PREDICTED: uncharacterized protein LOC100893398 [Strongylocentrotus
            purpuratus]
          Length = 1226

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 31/231 (13%)

Query: 14   ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGK-----VVPVLYHPQTTID 66
            A E  D  +++   D D+ +IL+G   RS      +  G G       V  +   Q    
Sbjct: 962  AREISDGNILVRSSDTDVFIILLGLCGRSTGMNIIMDYGSGNNRRYIDVSNVAALQENKQ 1021

Query: 67   TSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA 126
                E ++ + A++GCD TS  F  GK+K L+ +  +   +  +R           IT  
Sbjct: 1022 PGSTEALIGLHALTGCDFTSCFFRNGKMKPLQKMEASTTHMQALRSLTSEEVDVPAITS- 1080

Query: 127  GERFLVVLYGGNHETTSLNKLRYKSY--VTSA--------LKVTSYIAALPPTESAASQH 176
               F+  LYG    T+ +N+ R K++  ++S         LK  +  ++LPP       H
Sbjct: 1081 ---FVCSLYGC--VTSDINEARCKAFKRISSGGEKGPLAKLKKIN-CSSLPPCAKTLGNH 1134

Query: 177  ALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIPITTLKSPAPDSL 220
                 Y  + W   +         P ++GW    N   P     S  P+SL
Sbjct: 1135 IKRAHYVARMWKRADQAEPTGGSRPLDFGWKSTNNCFEPEWYPGSAVPESL 1185


>gi|291245182|ref|XP_002742470.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 1470

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 100/231 (43%), Gaps = 34/231 (14%)

Query: 22   VVLVGEDVDLLVILI----GTRSPSNVYFLKPGR-------GKVVPVLYHPQTTIDTSLA 70
            +V+   D D++V+LI      ++ S V+ ++ GR        +++P+  H       SLA
Sbjct: 1210 LVVQSPDTDVMVLLIHYMPQMQAVSTVW-METGRITRSLDLRRMIPI--HKIADKIGSLA 1266

Query: 71   EHIL-FIRAISGCDSTSALFNQGKVKALRTVVK-NPDLLPYIRRFLDPASTTREITEAGE 128
              ++  + +++GCD+ S+ F  GK   ++ ++K +   L +++              A  
Sbjct: 1267 CSVMPAVHSLTGCDTVSSFFGIGKKTVIQRMLKLSKTELTHLQAL--SKGNEDSAVNASR 1324

Query: 129  RFLVVLYGGNHE-----TTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ 183
            RF+V LY   ++      +SLN L  +   T ++ V    A  PP E+A  QH     +Q
Sbjct: 1325 RFVVSLYDPKNKYSRDNQSSLNSLSVRLAATRSVSV----AKFPPCEAAFIQHVKRASWQ 1380

Query: 184  VQQWLGKNVP------PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
             + W   ++       P   GW    + L+P+   + P     L    C C
Sbjct: 1381 TKTWTSAHLAKPILPKPAGHGWIIKNDILVPV-FFEGPTALDKLKDYFCGC 1430


>gi|291234141|ref|XP_002737008.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1288

 Score = 47.0 bits (110), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 79/176 (44%), Gaps = 22/176 (12%)

Query: 21   VVVLVGEDVDLLVILIGTRSPSNV-----YFLKPGRGKVVPVLYHPQTTIDTSLAEHILF 75
            +VVL G D D+ V+L+      N       +++ G G      Y P   +   + + + +
Sbjct: 1126 IVVLSG-DTDVFVLLMHYWDIFNSEGLRELWIRAGVGD--STRYFPIHILAPRIGKDLCY 1182

Query: 76   ----IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFL 131
                +  ++GCD TS +   G   A  T+  NP  L Y+  F    S T + T + E +L
Sbjct: 1183 LLPLVHTLTGCDYTSKV---GTKHA--TLSANP--LEYLSDFDSAPSCTDDFTASCETYL 1235

Query: 132  VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
            V +   N   T++++LR  +Y  S  K  S+   LPPT  A  QH    +Y   +W
Sbjct: 1236 VQVLKRNTTCTTMDRLRDDTYHHS--KGVSF-DQLPPTSHAVQQHIRRAYYATLRW 1288


>gi|260800823|ref|XP_002595296.1| hypothetical protein BRAFLDRAFT_128105 [Branchiostoma floridae]
 gi|229280541|gb|EEN51308.1| hypothetical protein BRAFLDRAFT_128105 [Branchiostoma floridae]
          Length = 1434

 Score = 46.2 bits (108), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 5/126 (3%)

Query: 51   RGKVVPVLYHPQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYI 110
            + + +PV    Q+ +   L + +    AI+GCDST +    GK KAL  + +NP+    +
Sbjct: 1293 KARYIPVHSLSQS-LGPQLCQALPAFHAITGCDSTGSFHGIGKKKALSVLRQNPEHQSNL 1351

Query: 111  RRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTE 170
              F        E   + E F+  LY       + ++LRY  +     +      ALPPT 
Sbjct: 1352 AVFGQEPKLGEECFRSSENFVCDLYESGKAPCTTDELRYFIFCQKKQRN----EALPPTS 1407

Query: 171  SAASQH 176
            ++   H
Sbjct: 1408 NSLRHH 1413


>gi|291244877|ref|XP_002742320.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1507

 Score = 45.8 bits (107), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 50/221 (22%), Positives = 84/221 (38%), Gaps = 38/221 (17%)

Query: 22   VVLVGEDVDLLVILIGT---------RSPSNVYFLKPGRGK-----VVPVLYHPQTTIDT 67
            VV+   D D++VILIG           +  N   ++ G G       V  +     ++  
Sbjct: 1232 VVVRASDTDVIVILIGMLGRHLESHRETSYNRIIMECGSGNNHRHIDVSSIATALESMQN 1291

Query: 68   SLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE-A 126
             LA  +  + A +GCD T++ + +GKVK L  + KN      + +F    S+     +  
Sbjct: 1292 GLAAAMPGLHAFTGCDFTASFYRKGKVKPLEVLQKNT--TGTLTQFFSKLSSEDAPDQRK 1349

Query: 127  GERFLVVLYGGNHETTSLNKLRY------------KSYVTSALKVTSYIAALPPTESAAS 174
             E F+  LYG   +   +N+ RY            ++ +T+  KV  + A LPP      
Sbjct: 1350 AEEFICSLYGMKGDVKDVNEARYAKLLQMTGNKNQEASMTNVKKV--HCALLPPCAKTVH 1407

Query: 175  QHALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIP 208
                   +    W   + P       P ++GW        P
Sbjct: 1408 NKLQRAHFISILWGNADSPHPGHGLDPLKYGWKDKNGYYAP 1448


>gi|156381299|ref|XP_001632203.1| predicted protein [Nematostella vectensis]
 gi|156219255|gb|EDO40140.1| predicted protein [Nematostella vectensis]
          Length = 1141

 Score = 45.4 bits (106), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 55/243 (22%), Positives = 93/243 (38%), Gaps = 29/243 (11%)

Query: 14   ASEEEDEVVVLVGEDV-DLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTS--- 68
            + EE D  V+L  +D  D     I  R+  ++V  L      ++P+        DT    
Sbjct: 875  SHEEADSRVILHVKDAHDQGFTKIAVRTVDTDVLVLSVAVLPLLPIQPKLWVAFDTGANF 934

Query: 69   --LAEHILFIRA-ISGCDSTSALFN------QGKVKALRTVVKNPDLLPYIRRFLDPAST 119
              +A H+ F++  +S C     +F       Q K K  +T  +   L P +       + 
Sbjct: 935  RCIAAHMPFLQTLVSKCLERFRIFMPSQAAIQFKGKGKKTAFETWKLYPEVTDVFIAPTV 994

Query: 120  TREITEAG----ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQ 175
              E+++      ERF V+LY       ++N+ + + +          + ALPP+++A  +
Sbjct: 995  PEEVSDIHMATIERFTVLLYDRTSSKLTVNEAKKQLFAQKGRP----LEALPPSKAALLE 1050

Query: 176  HALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
            H     YQ     G+++        P  WGW        P  T           L+ C C
Sbjct: 1051 HTKRAAYQAGHCWGQSLARSPVLPSPEHWGWALSDGKWEPYWTALPDVTRVCQELIRCGC 1110

Query: 229  KKG 231
            KKG
Sbjct: 1111 KKG 1113


>gi|156371271|ref|XP_001628688.1| predicted protein [Nematostella vectensis]
 gi|156215671|gb|EDO36625.1| predicted protein [Nematostella vectensis]
          Length = 616

 Score = 45.1 bits (105), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 23/49 (46%)

Query: 184 VQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           VQ W G  V P EWGW       +P+   +  AP  LL  + C+C   C
Sbjct: 524 VQAWKGNGVSPEEWGWPVTCTGFVPVQMSEPAAPAQLLRNIKCNCGGHC 572


>gi|291230800|ref|XP_002735353.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1507

 Score = 45.1 bits (105), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 75/182 (41%), Gaps = 28/182 (15%)

Query: 67   TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
            T+LA HI  +         A++GCD TS+ F +GKV  L+  VKNP  L  +   L    
Sbjct: 1290 TALANHIGPVVCKSLPGHHALTGCDYTSSFFRKGKVNPLKKAVKNPLYLEGL-SILGENI 1348

Query: 119  TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA-LPPT 169
            T  +     E+++  LYG     TS+ + R K ++         S L++    A  LPP 
Sbjct: 1349 TFADDENLVEQYVCSLYGQG-ALTSVYEARLKIFLQKYRPTNQESPLQIKGIDAGMLPPC 1407

Query: 170  ESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLS 222
            +    Q      Y    W         +N+ PT+ GW       +P+    +  P  L  
Sbjct: 1408 KDVLVQKLARCNYVAYLWKHAHIQNPLENIKPTDHGWKD--GVFLPVWFTGNQMPSMLSQ 1465

Query: 223  LV 224
            ++
Sbjct: 1466 II 1467


>gi|156368670|ref|XP_001627815.1| predicted protein [Nematostella vectensis]
 gi|156214736|gb|EDO35715.1| predicted protein [Nematostella vectensis]
          Length = 1195

 Score = 44.7 bits (104), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 9/136 (6%)

Query: 22   VVLVGEDVDLLVILIG------TRSPSNVYFLKP-GRGKVV-PVLYHPQTTIDTSLAEHI 73
            VVL   D D+LVI +         S ++V  L   G G  +  +  H  T   T     +
Sbjct: 1032 VVLRTVDTDVLVIAVAYFQKLQMLSQASVGLLVAFGTGAYLRDIAAHEVTGSFTGNVAAL 1091

Query: 74   LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLD-PASTTREITEAGERFLV 132
            L   A +GCD+ S  + +GK KAL T    P+++P      + P+    E     ERF++
Sbjct: 1092 LTFHAFTGCDTVSCFYGKGKKKALETWHIYPEIIPVFAAIANAPSEMLDEWMPILERFVI 1151

Query: 133  VLYGGNHETTSLNKLR 148
            +LY      +S+N  R
Sbjct: 1152 LLYDRTSSASSVNDAR 1167


>gi|405972735|gb|EKC37485.1| hypothetical protein CGI_10018098 [Crassostrea gigas]
          Length = 117

 Score = 43.9 bits (102), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 46/99 (46%), Gaps = 10/99 (10%)

Query: 75  FIRAISGCDSTSALFNQG---KVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFL 131
           FI AISGCD+T  +F  G     K  ++ V   DL   I  +    S+   + +AGE  +
Sbjct: 23  FIHAISGCDTTFRMFGIGMGAAYKKFKSSVYIQDLAQRIMTY----SSKENVVQAGEEIV 78

Query: 132 VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAA--LPP 168
             L  G  E   L+ LRY+ + +  +    Y+    LPP
Sbjct: 79  ACL-NGYVENEGLDLLRYRKFASKVVTGNMYVQVQTLPP 116


>gi|443692893|gb|ELT94391.1| hypothetical protein CAPTEDRAFT_204036 [Capitella teleta]
          Length = 478

 Score = 43.9 bits (102), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 46/85 (54%), Gaps = 9/85 (10%)

Query: 22  VVLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKVVPV--------LYHPQTTIDTSLAEH 72
           V +VGED DL+V+L+  T +      LK    + +          ++  Q ++ + +  H
Sbjct: 379 VTVVGEDTDLIVLLLHHTPNSCQTIVLKSDIHRSIKSHKQAKQWQIHSVQRSLGSEMCRH 438

Query: 73  ILFIRAISGCDSTSALFNQGKVKAL 97
           +LFI A+ GCD+T++++  GK  A+
Sbjct: 439 LLFIHALLGCDTTASIYGLGKGNAI 463


>gi|291235271|ref|XP_002737568.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
          Length = 1293

 Score = 43.5 bits (101), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 17/127 (13%)

Query: 22   VVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAISG 81
            VV+   D D++VIL+G              G +   L     ++   LA  +  + A +G
Sbjct: 1146 VVVRASDTDVIVILLGML------------GSIATAL----ESMQNGLAAAMPGLHAFTG 1189

Query: 82   CDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHET 141
            CD T++ + +GKVK L  + KN  +   I+ F   +S         E F+  LYG   + 
Sbjct: 1190 CDFTASFYRKGKVKPLEVLQKNT-MRTLIQFFSKLSSEDAPDQRKAEEFICSLYGMKGDE 1248

Query: 142  TSLNKLR 148
             S+  ++
Sbjct: 1249 ASMTNVK 1255


>gi|345490613|ref|XP_001599794.2| PREDICTED: hypothetical protein LOC100114943 [Nasonia vitripennis]
          Length = 511

 Score = 42.0 bits (97), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLV 224
           Y  + WLG ++ P  WGW +  N L+PI   K  AP  LL +V
Sbjct: 427 YNHEIWLGFDLDPVLWGWKRTSNMLLPIMNPKPVAPVELLEMV 469


>gi|156342193|ref|XP_001620906.1| hypothetical protein NEMVEDRAFT_v1g222583 [Nematostella vectensis]
 gi|156206361|gb|EDO28806.1| predicted protein [Nematostella vectensis]
          Length = 545

 Score = 41.6 bits (96), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 24/140 (17%)

Query: 22  VVLVGEDVDLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIR--- 77
           +V+  ED D+LVI +  +S  ++  F+K  +          +T +D S    ++ ++   
Sbjct: 115 IVISSEDTDVLVIALSLKSFIASPLFIKTTK--------QSRTYVDVSKVVLVIGVQMCI 166

Query: 78  ------AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPA---STTREITEAGE 128
                 A +GC+  SA F +GK KAL  + +N     ++  FL+         E+ +  +
Sbjct: 167 ALPGFHAFTGCERASAFFGKGKAKALELLHQNSS---FVNLFLEMGMYWQLDEELFQKIQ 223

Query: 129 RFLVVLYGGNHETTSLNKLR 148
            F   +Y     T+ +NKLR
Sbjct: 224 EFTCTMYSYLAGTSDVNKLR 243


>gi|156363424|ref|XP_001626044.1| predicted protein [Nematostella vectensis]
 gi|156212905|gb|EDO33944.1| predicted protein [Nematostella vectensis]
          Length = 1194

 Score = 41.2 bits (95), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 69   LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
            + ++ILF  AI GCD+TSAL+      +L+ +  +         F    +   +I  AGE
Sbjct: 1042 ICKNILFAHAILGCDTTSALYGL----SLKMLTSDATSRQQADIFHQADAAKNDIAAAGE 1097

Query: 129  RFLVVLYGGNHETTSLNKLRYKSY 152
              L+ LY G  + T L+ LRY  +
Sbjct: 1098 TALLCLYKGLKDET-LDSLRYARF 1120


>gi|405970723|gb|EKC35601.1| hypothetical protein CGI_10016597 [Crassostrea gigas]
          Length = 87

 Score = 40.8 bits (94), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 1/50 (2%)

Query: 184 VQQWLGK-NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
           ++ W+ K N+ P EWGW      L+P+    + AP  LL+++  +CK  C
Sbjct: 1   MKMWMNKDNLNPCEWGWKVANGNLVPVKCTMNAAPSKLLNIIRYNCKTNC 50


>gi|291239442|ref|XP_002739632.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 172

 Score = 40.8 bits (94), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 12/118 (10%)

Query: 120 TREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALN 179
           + E+    E  +  LYG     + +N+ R   +     +  S+   LPP +    +H   
Sbjct: 21  SEELLSQLEALVCALYGATKGVSDVNQCRCGVFCAKKGEAESH--QLPPCKDCLYKHCQR 78

Query: 180 KFYQVQQWLGK----NVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
             YQ   W        VP P   GW    + +  +     PAP +++ L++C CKK C
Sbjct: 79  ANYQASVWRNSLQNAEVPTPVGKGWALEHDGMTGL-----PAPIAVIELMSCMCKKKC 131


>gi|332027826|gb|EGI67889.1| hypothetical protein G5I_03478 [Acromyrmex echinatior]
          Length = 89

 Score = 40.4 bits (93), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 180 KFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPD 218
           ++YQVQ+WLG  + P +W W  + N L PI T  S  PD
Sbjct: 26  RYYQVQKWLGYELNPEDWRWIMITNILEPIQTFLS-TPD 63


>gi|156372302|ref|XP_001628977.1| predicted protein [Nematostella vectensis]
 gi|156215967|gb|EDO36914.1| predicted protein [Nematostella vectensis]
          Length = 604

 Score = 38.1 bits (87), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 8/74 (10%)

Query: 166 LPPTESAASQHALNKFYQVQQWLGKNVP------PTEWG--WNQVGNTLIPITTLKSPAP 217
           L PT +A  Q  L   YQ+  W    V       P+ +G  W       +P+     PAP
Sbjct: 474 LAPTSAALKQAILRSHYQLLVWNNDIVANPVLPSPSVYGRKWQNAKKIWVPVMRDLPPAP 533

Query: 218 DSLLSLVACSCKKG 231
           D+++ LV C C+ G
Sbjct: 534 DAIIHLVRCGCEYG 547


>gi|291221207|ref|XP_002730614.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1516

 Score = 38.1 bits (87), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 76   IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
            + A++GCD   +   +GKVK    + K    + ++   L  A  T +  +A E+F+  LY
Sbjct: 1319 LHAMTGCDYIPSFLGKGKVKPFELMRKQACFIAFMAA-LGNAPLTTDTVKACEKFVCHLY 1377

Query: 136  GGNHETTSLNKLRYKSY 152
            G + + +S+N+ RY  +
Sbjct: 1378 GQS-KLSSVNQARYMMF 1393


>gi|339483351|ref|YP_004695137.1| hypothetical protein Nit79A3_1940 [Nitrosomonas sp. Is79A3]
 gi|338805496|gb|AEJ01738.1| hypothetical protein Nit79A3_1940 [Nitrosomonas sp. Is79A3]
          Length = 651

 Score = 37.4 bits (85), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 32/140 (22%), Positives = 57/140 (40%), Gaps = 14/140 (10%)

Query: 79  ISGCDSTSALFNQGKVKALRTVVKNPDLL--------PYIRRFLDPASTTREITEAGERF 130
            S  D  S   N   ++A+RT+  N  +L         Y+ + L   ST R +   G  F
Sbjct: 123 FSKSDYESLQLNNAYLEAMRTIFSNTTVLFLGYGLRDEYVFQNLQTCSTIRPLFGTGPHF 182

Query: 131 LVVLYGGNHETTSLNKLRYKSYVT---SALKVTSYIAALPPTESAASQHALNKFYQVQQ- 186
           +V     N     + ++RY++  T     +++   IA   P++S  + +   K    QQ 
Sbjct: 183 IVTSEERNELPAIVRRIRYEADATDHRDVIQILEVIADFTPSQSIQTANEDEKTISSQQS 242

Query: 187 --WLGKNVPPTEWGWNQVGN 204
             ++   + P  W  +Q  N
Sbjct: 243 IYYIADLITPGTWTTSQTFN 262


>gi|156355296|ref|XP_001623606.1| predicted protein [Nematostella vectensis]
 gi|156210323|gb|EDO31506.1| predicted protein [Nematostella vectensis]
          Length = 416

 Score = 37.4 bits (85), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 56/126 (44%), Gaps = 12/126 (9%)

Query: 15  SEEEDEVVVLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPVLYHPQTT---IDTSL 69
           ++  ++  +L+GED DLLV+L+      P         R   V  ++  + T   +    
Sbjct: 291 AKAREQTSILIGEDTDLLVLLLYHAEMEPRQ-------RATAVRKVWDIEKTKAALGRDS 343

Query: 70  AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
              ILF+ A+ GC++TS +    +  AL+    +         F    S+  EI EAGE 
Sbjct: 344 TSGILFVHALLGCNTTSRIHGIWEGVALKRAKISTQFRELAGVFSSADSSRVEIIEAGEM 403

Query: 130 FLVVLY 135
            L+ ++
Sbjct: 404 ALLNIF 409


>gi|219110080|ref|XP_002176792.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411327|gb|EEC51255.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 791

 Score = 37.0 bits (84), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 13/111 (11%)

Query: 65  IDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT 124
           ID    +H L   A +G  S  + ++  +V+ LRT+ ++P  L Y +     A +  E+ 
Sbjct: 336 IDNDQTKHDLA--ATAGHQSNRSRWSDDRVRQLRTLARDP--LIYEKLTASLAPSIWELD 391

Query: 125 EAGERFLVVLYGGNHETTSLNKLRYKSYV---------TSALKVTSYIAAL 166
              +  L +L+GGNH T + +KL  +  V         TS  ++ SY+  L
Sbjct: 392 NVKKGILCMLFGGNHGTGATSKLNKRGDVNILLCGDPGTSKSQLLSYVHKL 442


>gi|156353856|ref|XP_001623126.1| predicted protein [Nematostella vectensis]
 gi|156209789|gb|EDO31026.1| predicted protein [Nematostella vectensis]
          Length = 703

 Score = 37.0 bits (84), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 39/107 (36%), Gaps = 10/107 (9%)

Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
           ERF V+LY        +N  R + +     K    +  LPPT+ A  +H     YQ   W
Sbjct: 601 ERFTVILYDKLSPFACINDARMELFC----KNNRAMDKLPPTKDALLKHVKRCIYQAGIW 656

Query: 188 LGKNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
               +       P ++ W +   T +P          +   L  CSC
Sbjct: 657 TSSEISQPVLPSPDQFAWIKEAGTWVPRWITIPEVSKACSELTKCSC 703


>gi|405978391|gb|EKC42788.1| hypothetical protein CGI_10015106 [Crassostrea gigas]
          Length = 411

 Score = 36.6 bits (83), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 13/131 (9%)

Query: 22  VVLVGEDVDLLVILIGTRSPSN-------VYFLKPGRGKVVPVLYHPQTTIDTSLAEHIL 74
           +V+   D D+L++L+   S  N       V   KP    +  V+     ++D    +H++
Sbjct: 275 IVISASDTDVLLLLVSFYSEINRRLWMVAVSSSKPKNIPIHAVVEKNFPSLDRR--KHLV 332

Query: 75  FIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVL 134
             RA++GCD+TS  +   K  AL+    N +LL      L     T ++ +  E+F+ ++
Sbjct: 333 AFRALTGCDTTSFFYGISKKSALKVYRDNYELLEG----LGEGDLTDQMIKDCEKFVYLV 388

Query: 135 YGGNHETTSLN 145
                E+T+ N
Sbjct: 389 LIVLDESTAFN 399


>gi|255033875|ref|YP_003084496.1| hypothetical protein Dfer_0060 [Dyadobacter fermentans DSM 18053]
 gi|254946631|gb|ACT91331.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 368

 Score = 36.6 bits (83), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 12/115 (10%)

Query: 74  LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
           L +R IS  D  +A FN   VKA    V +P LL   + ++D   T+ +I   G  F  +
Sbjct: 173 LSVREISAVDLCNAHFN---VKAEH--VLDPTLLIERQTYIDLFQTSNKIHSGGTLFSYI 227

Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL 188
           L   + +T+++       ++   L VTS+  + P  +       LN  Y V+QWL
Sbjct: 228 LDMNDEKTSAI------KHIEKQLNVTSFSPSKPSIKKNVPIE-LNIEYSVEQWL 275


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,659,751,083
Number of Sequences: 23463169
Number of extensions: 144749385
Number of successful extensions: 339012
Number of sequences better than 100.0: 104
Number of HSP's better than 100.0 without gapping: 51
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 338838
Number of HSP's gapped (non-prelim): 118
length of query: 232
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 94
effective length of database: 9,121,278,045
effective search space: 857400136230
effective search space used: 857400136230
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)