BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy18235
(232 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|390356505|ref|XP_003728811.1| PREDICTED: uncharacterized protein LOC100888155 [Strongylocentrotus
purpuratus]
Length = 1585
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/222 (38%), Positives = 132/222 (59%), Gaps = 11/222 (4%)
Query: 12 SLASEEEDEV-VVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTI-DTS 68
+LA E ++V VV+VG D DLLV+L+ + S +++ L V ++ Q I DT
Sbjct: 737 ALAVAESEQVPVVVVGTDTDLLVMLVARATTSTDMHMLCRSNPVTVFNIHEIQHVIGDTR 796
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
+ H++F+ A++GCD+ SA++ QGK KA V K D + F D ST E+ AGE
Sbjct: 797 I--HLMFLHAVTGCDTVSAIYRQGKRKAFNMVHKKRDY-DLLDTFTDSGSTHDEVKRAGE 853
Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQV 184
F++ LYG + SL+ R+ +Y S+L + + +LPPT +AA+QH+ + V
Sbjct: 854 AFILKLYGASR-FESLDDYRHIAYKRAIGRSSLSSSFQLESLPPTSAAANQHSYRTYLTV 912
Query: 185 QQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVAC 226
Q+W+G ++PPTEWGW TL P+ T + APD+LL++VAC
Sbjct: 913 QEWMGNSLPPTEWGWRFQDGTLTPVETDIAVAPDTLLNMVAC 954
>gi|443716458|gb|ELU07980.1| hypothetical protein CAPTEDRAFT_206230 [Capitella teleta]
Length = 269
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 110/231 (47%), Gaps = 19/231 (8%)
Query: 7 SSGRRSLASEEEDEVVVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTID 66
+ GR+SL + + D L L SP NV+ Y +TTI
Sbjct: 21 AKGRKSLLA------ATVARGDKQLHTELESIGSPCNVH-------AACYKDYTRETTIV 67
Query: 67 TSLAEHILFIRAISGCDSTS-ALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE 125
+ + I DS L+ +GK A + + DL P + F P + EI
Sbjct: 68 AAKRKVIAADECTGNVDSGDRTLYGKGKKTAFH-LARKTDLFPILESFEKPETPKAEIEN 126
Query: 126 AGERFLVVLYGGNHETTSLNKLRYKSYV----TSALKVTSYIAALPPTESAASQHALNKF 181
AGE FL+ YG N ++L+KLR+ Y S+L T + +LPPT +AA+QH+L +
Sbjct: 127 AGEEFLLKFYGSNKLVSTLDKLRHYKYKQAIRKSSLTSTIKLESLPPTSAAAAQHSLRAY 186
Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+QVQ W GK V T WGW L P+ T K AP++LL +VAC CK C
Sbjct: 187 HQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVAPENLLKMVACGCKTQC 237
>gi|443691138|gb|ELT93079.1| hypothetical protein CAPTEDRAFT_217788 [Capitella teleta]
Length = 1299
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 108/226 (47%), Gaps = 44/226 (19%)
Query: 12 SLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLA 70
+L + + VV++G D DLLV+++ +PS N+Y +G P L T
Sbjct: 1081 ALTHSTDGQTVVVIGNDTDLLVMMVALATPSMNMYICDTTKG---PRL----CTAKVKRL 1133
Query: 71 EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
I ++ IS F KV LR+ ++N AGE F
Sbjct: 1134 RFIWLVKLIS--------FPFWKV--LRSRIEN----------------------AGEEF 1161
Query: 131 LVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQ 186
+ LYG N ++L+KLRY Y S+L T + +LPPT +AA+QH+L ++QVQ
Sbjct: 1162 FLKLYGSNKLVSTLDKLRYYKYKQAKRKSSLTPTIKLESLPPTSAAAAQHSLRAYHQVQT 1221
Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
W GK V T WGW L P+ T K AP +LL +VAC CK C
Sbjct: 1222 WRGKMVDATAWGWQIRDGILAPVETTKGVAPQNLLKMVACGCKTQC 1267
>gi|156371522|ref|XP_001628812.1| predicted protein [Nematostella vectensis]
gi|156215798|gb|EDO36749.1| predicted protein [Nematostella vectensis]
Length = 373
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 115/219 (52%), Gaps = 10/219 (4%)
Query: 23 VLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFI 76
LVGED DLL++L + ++ F+ + K + +++ + T++ + ++I+F
Sbjct: 118 ALVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFA 177
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA-GERFLVVLY 135
AI GCD+TSAL+ GK +L+ + + F + +I A GE L+ LY
Sbjct: 178 HAILGCDTTSALYGLGKGLSLKMLTSDATFRQQADIFHQADAAKNDIAAAAGETDLLCLY 237
Query: 136 GGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVP 193
G + T L+ LRY + + + +LP +AA H+L ++QVQQW G +P
Sbjct: 238 KGLKDET-LDSLRYARFCQKISTGNTQVRPESLPQISAAAIYHSLRVYHQVQQWRGIALP 296
Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
P +WGW +V L+P T +S A SLL L+ C CK GC
Sbjct: 297 PEDWGWKEVDGKLLPQRTDQSAAHPSLLELIRCKCKTGC 335
>gi|156378368|ref|XP_001631115.1| predicted protein [Nematostella vectensis]
gi|156218149|gb|EDO39052.1| predicted protein [Nematostella vectensis]
Length = 315
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 116/219 (52%), Gaps = 10/219 (4%)
Query: 23 VLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFI 76
LVGED DLL++L + ++ F+ + K + +++ + T++ + ++I+F
Sbjct: 60 ALVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFA 119
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA-GERFLVVLY 135
AI GCD+TSAL+ GK +L+ + + F + +I A GE L+ LY
Sbjct: 120 HAILGCDTTSALYGLGKGLSLKMLTSDVTFRQQADIFHQADAAKNDIAAAAGETALLCLY 179
Query: 136 GGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVP 193
G + T L+ LRY + + + +LP +AA H+L ++QVQQW G +P
Sbjct: 180 KGLKDET-LDSLRYARFCQKISTGNTQVRPESLPQISAAAIYHSLRVYHQVQQWRGIALP 238
Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
P +WGW +V L+P T +S A SLL L+ C+CK GC
Sbjct: 239 PEDWGWKEVDGKLLPQRTDQSAAHPSLLELIRCNCKTGC 277
>gi|328698807|ref|XP_003240738.1| PREDICTED: hypothetical protein LOC100571634 [Acyrthosiphon pisum]
Length = 1377
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 78/193 (40%), Positives = 109/193 (56%), Gaps = 17/193 (8%)
Query: 22 VVLVGEDVDLLVILIGTRSPSN-VYFLKPGRGKVVPVLYHPQTTIDTSLAEHI------- 73
VV++GED+DLL++L+ P N + F KPGRGK+ Y SL EH
Sbjct: 1190 VVVIGEDIDLLILLLTLSPPKNTIIFEKPGRGKIETRSYAV-----GSLQEHFKNEIKYF 1244
Query: 74 LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
+FI AI GCD+TSALF QGK+K L+TV K+ +L + F + +S+ EI AGE++L+
Sbjct: 1245 MFIHAIGGCDTTSALFQQGKIKHLKTVKKHQELHDALLIFNNESSSPEEIERAGEKYLLA 1304
Query: 134 LYGGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGKN 191
LY TSLNKLR+ + +A K +A LPPT AA +H + Q+Q W G
Sbjct: 1305 LYKAPAHITSLNKLRHDVFQKTAASNKKQVQLARLPPTIDAAREHLHRVYLQIQLWRGNK 1364
Query: 192 VP--PTEWGWNQV 202
+ TE G ++
Sbjct: 1365 LEFRSTELGLERI 1377
>gi|449667181|ref|XP_002159051.2| PREDICTED: uncharacterized protein LOC100200337, partial [Hydra
magnipapillata]
Length = 977
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 89/169 (52%), Gaps = 13/169 (7%)
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
+H+LF+ A SGCDSTSA+F +GK L +V K+ + F+D +T +EI ++
Sbjct: 777 FKDHLLFVHAWSGCDSTSAIFGKGKAMFLNSVKKSESMKEISETFMDYWATNKEIAKSSV 836
Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL 188
LY G H+ TSL KLRY Y+ + K LPPT+ AA H Y+V L
Sbjct: 837 NAFKELYNG-HQQTSLAKLRYSKYLEALCKGIVVPEKLPPTDRAAHFHG----YRVHLQL 891
Query: 189 GK--------NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
+ N+ PTEWGW L PI T K AP +LL ++ C+C+
Sbjct: 892 IELKMLDEELNLKPTEWGWKSTDGHLEPIPTDKEIAPPNLLKVIRCNCR 940
>gi|443689573|gb|ELT91946.1| hypothetical protein CAPTEDRAFT_216478 [Capitella teleta]
Length = 449
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 110/236 (46%), Gaps = 47/236 (19%)
Query: 2 HYRVPSSGRRSLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYH 60
HY V S+ + + + V+++G D DLLV+++ +PS +VY +G V + H
Sbjct: 159 HYPVNST----FIVDGDGQTVIVIGNDTDLLVMMVALATPSMHVYLCDTTKGPRVFSISH 214
Query: 61 PQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTT 120
+ K R V + L Y + P + T
Sbjct: 215 --------------------------------REKQRRNVELSSSL--YAEKPETPKAET 240
Query: 121 REITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQH 176
+ E FL+ LYG N ++L+KLR+ Y S+L T + LPPT +AA+QH
Sbjct: 241 ENVRE----FLLKLYGSNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLECLPPTSAAAAQH 296
Query: 177 ALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+L ++QVQ W GK V T WGW G L P+ T K P++LL +VACSCK C
Sbjct: 297 SLRAYHQVQTWCGKRVDATAWGWQIGGGILAPVETTKGVVPENLLKMVACSCKTQC 352
>gi|443706049|gb|ELU02310.1| hypothetical protein CAPTEDRAFT_209473 [Capitella teleta]
Length = 242
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 103/220 (46%), Gaps = 43/220 (19%)
Query: 18 EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFI 76
+ + V+++ D DLLV+++ +PS NVY +G V + H +
Sbjct: 58 DRQTVIVIDNDTDLLVMMVALATPSMNVYICDTTKGPRVFSISHRE-------------- 103
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
K R V + L Y + P + T + E FL+ LYG
Sbjct: 104 ------------------KQRRNVELSSSL--YAEKPETPKAETENVRE----FLLKLYG 139
Query: 137 GNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNV 192
N ++L+KLR+ Y S+L T + LPPT +AA+QH+L ++QVQ W GK V
Sbjct: 140 SNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLECLPPTSAAAAQHSLRAYHQVQTWRGKRV 199
Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
T WGW G L P+ T K P++LL +VACSCK C
Sbjct: 200 DATAWGWQIGGGILAPVETTKGVVPENLLKMVACSCKTQC 239
>gi|443725032|gb|ELU12774.1| hypothetical protein CAPTEDRAFT_192602 [Capitella teleta]
Length = 773
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 106/226 (46%), Gaps = 44/226 (19%)
Query: 12 SLASEEEDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLA 70
+L + + VV++G D LLV+++ +PS N+Y +G P L T
Sbjct: 555 ALTHSTDGQTVVVIGNDTYLLVMMVALATPSMNMYICDTTKG---PRL----CTAKVKRL 607
Query: 71 EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
I ++ IS F KV LR+ ++N AGE F
Sbjct: 608 RFIWLVKLIS--------FPFWKV--LRSRIEN----------------------AGEEF 635
Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKV----TSYIAALPPTESAASQHALNKFYQVQQ 186
+ LYG N ++L+KLRY Y + K T + +LPPT +AA+QH+L ++QVQ
Sbjct: 636 FLKLYGSNKLVSTLDKLRYYKYKQAKRKSPLTPTIKLESLPPTSAAAAQHSLRAYHQVQT 695
Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
W GK V T WGW L P+ T K AP +LL +VAC CK C
Sbjct: 696 WRGKMVDATAWGWQIRDGILAPVETTKGVAPQNLLKMVACGCKTQC 741
>gi|405954377|gb|EKC21837.1| hypothetical protein CGI_10003232 [Crassostrea gigas]
Length = 372
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 111/238 (46%), Gaps = 27/238 (11%)
Query: 11 RSLASEEEDEVVVLVGEDVDLLVILIG-TRSPSNVYF-------LKPGR----GKVVPVL 58
++ ++ +D VV+ GED DLLV+L + ++F +K R K PVL
Sbjct: 79 QTAVNKADDHTVVVYGEDTDLLVLLCHYAKEGRQIFFTTDKHTSMKNHRVWDISKAKPVL 138
Query: 59 YHPQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
+ +LFI A++GCD+TS L GK AL+ ++ L FL S
Sbjct: 139 -------GSDSCRQLLFIHALTGCDTTSCLHGIGKPAALKKIMTEIYLNSQGAVFLQENS 191
Query: 119 TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL----KVTSYIAALPPTESAAS 174
+ +I +AGE LV LY G L+ ++ + T + + +LPPT AA
Sbjct: 192 SKEDIIKAGEEALVNLYAGV-LLEGLDIHIWRKFTTKTMSSKRNAVVQVQSLPPTSDAAI 250
Query: 175 QHALNKFYQVQQWLGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
H+ + Q Q W GK+V PTEW W L+PI K P PD L + C+CK
Sbjct: 251 FHSTRVYLQCQYWKGKSVADLDPTEWDWTLKTGKLLPIEMSKPPVPDFLFKIAHCNCK 308
>gi|443691218|gb|ELT93134.1| hypothetical protein CAPTEDRAFT_194406 [Capitella teleta]
Length = 250
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 109/225 (48%), Gaps = 48/225 (21%)
Query: 18 EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFI 76
+ + V+++G D DLLV+++ +PS NVY +G +V I
Sbjct: 36 DGQTVIVIGNDTDLLVMMVALATPSMNVYICDTTKGPIV------------------FSI 77
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
R I N G V+ ++ Y + P + E A E FL+ LYG
Sbjct: 78 RVIEK--------NNGNVEQSSSL--------YAEK---PETPKAETENAREEFLLKLYG 118
Query: 137 GNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNV 192
N ++L+KLR+ Y S+L T + +LPPT AA+QH+L ++QVQ W GK +
Sbjct: 119 SNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLESLPPTSVAAAQHSLRAYHQVQTWRGKMI 178
Query: 193 PPTEWGWNQVGN-TLIPITTLKSPAPDSLLSLVAC----SCKKGC 232
T WGW Q+G+ L P+ T K P++LL +VAC C+K C
Sbjct: 179 DATAWGW-QIGDGILAPVETTKKVVPENLLKMVACGYKTQCRKSC 222
>gi|443682857|gb|ELT87293.1| hypothetical protein CAPTEDRAFT_189847 [Capitella teleta]
Length = 173
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 73/129 (56%), Gaps = 5/129 (3%)
Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVT 160
DL P + F P + EI E E F + LYG N ++L+KLR+ Y S+L T
Sbjct: 16 DLFPILESFEKPETPKAEI-ENAEEFPLKLYGSNKLVSTLDKLRHYKYKQAIEKSSLTST 74
Query: 161 SYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSL 220
+ + PPT +AA+QH+L ++QVQ W GK V T WGW L P+ T K APD+L
Sbjct: 75 IKLESQPPTSAAAAQHSLRAYHQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVAPDNL 134
Query: 221 LSLVACSCK 229
L +VAC CK
Sbjct: 135 LKMVACGCK 143
>gi|321449045|gb|EFX61703.1| hypothetical protein DAPPUDRAFT_338406 [Daphnia pulex]
Length = 296
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 104/220 (47%), Gaps = 11/220 (5%)
Query: 23 VLVGEDVDLLVILIG--TRSPSNVYFL---KPGRGKVVPVLYHPQTTIDTSLAEHILFIR 77
V++GED DLL++L+ +Y K +G + + + + +L + +LF
Sbjct: 60 VVIGEDTDLLILLLHYCNEKIEEIYMRIEPKAQKGGKIWNIAKLKKMLGKNLCDSLLFGH 119
Query: 78 AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
A CD+TS F +GK AL+ + N + F P S+ EI AGE + ++YGG
Sbjct: 120 AFLRCDTTSKPFGKGKAMALKLLNTNVQFQRLSKVFYLPTSSIEEIDAAGESAMCIVYGG 179
Query: 138 NHETTSLNKLRY---KSYVTSALKVTSYI-AALPPTESAASQHALNKFYQVQQWLG-KNV 192
T + LRY K V++A S LPPT+ + H+ + Q+Q WLG +
Sbjct: 180 -LPTDDIAFLRYTIFKKKVSNASVAKSIKPEELPPTKGSVKFHSRRVYLQIQMWLGVTTM 238
Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
T+WGW P TT PAP +L ++ C C C
Sbjct: 239 KETDWGWFMNDRLFFPNTTDLPPAPKEILKMIKCGCNGSC 278
>gi|405968047|gb|EKC33150.1| hypothetical protein CGI_10014062 [Crassostrea gigas]
Length = 399
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 12/215 (5%)
Query: 23 VLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKVVPV------LYHPQTTIDTSLAEHILF 75
VL+GE+ DLLV+L+ SN K G V + + + L +
Sbjct: 136 VLLGENSDLLVLLLHHANVTSNSLIFKSGNVSKVNTHIKIWDILKTKLLLGEELCTLLPL 195
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
IRAISGCD+TS +F + L+ ++ D L + L A+ ++ AGE + +Y
Sbjct: 196 IRAISGCDTTSRMFGVSEASTLKKFGEH-DFLKTQAQLLCNANAKDDVISAGESIISSVY 254
Query: 136 GGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGK-NV 192
G LN LRY+ + L K I LPPT +AAS ++ + + Q++ W+ K N+
Sbjct: 255 KG-APYEGLNVLRYRKFAARVLTNKTCVQIHTLPPTSNAASFYSQSAYLQMKMWMNKDNL 313
Query: 193 PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACS 227
P EWGW L+P+ A LL+++ C+
Sbjct: 314 NPCEWGWKVANGNLVPVKCTMDAALSKLLNIIRCN 348
>gi|405973593|gb|EKC38297.1| hypothetical protein CGI_10000780 [Crassostrea gigas]
Length = 449
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 98/204 (48%), Gaps = 14/204 (6%)
Query: 23 VLVGEDVDLLVILIGTRS-PSNVYFLKPGRGKVVPV------LYHPQTTIDTSLAEHILF 75
VL+GED DLLV+L+ SN K G V + + + L +
Sbjct: 246 VLLGEDTDLLVLLLHHADVTSNSLIFKSGNVSKVNTHIKIWDILKTKVLLGEELCTLLPL 305
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
I AISGCD+TS +F K L+ ++ D L R+ L A+ ++ AGE + LY
Sbjct: 306 IHAISGCDTTSRMFGVSKAATLKKFAEH-DFLK-TRQLLCNANAKDDVISAGENIISSLY 363
Query: 136 GGNHETTSLNKLRYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGK-NV 192
G LN LRY+ + L K I LPPT +AAS H+ + Q++ W+ + N+
Sbjct: 364 NG-APYEELNVLRYRKFAARVLTNKTCVQIHTLPPTSNAASFHSQRAYLQMKMWMNEDNL 422
Query: 193 PPTEWGWNQVGNTLIPI-TTLKSP 215
P EWGW L+P+ T+K P
Sbjct: 423 NPCEWGWKVANGNLVPVKCTVKLP 446
>gi|443709383|gb|ELU04056.1| hypothetical protein CAPTEDRAFT_206657 [Capitella teleta]
Length = 493
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 67/121 (55%), Gaps = 4/121 (3%)
Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTES 171
P + EI A E FL+ LY N ++ +KLR+ Y S+L T +LPPT +
Sbjct: 341 PETPKAEIENAREEFLLKLYSSNKLGSTHDKLRHYKYKQAIEKSSLTSTIKHESLPPTSA 400
Query: 172 AASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKG 231
AA+QH+L ++QVQ W GK V T WGW L P+ T K P++LL +VAC CK
Sbjct: 401 AAAQHSLRVYHQVQTWRGKMVDATAWGWQIGDGILAPVETTKGVVPENLLKMVACGCKTQ 460
Query: 232 C 232
C
Sbjct: 461 C 461
>gi|357612321|gb|EHJ67915.1| hypothetical protein KGM_18541 [Danaus plexippus]
Length = 707
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 1/111 (0%)
Query: 123 ITEAGERFLVVLYGGNHETTSLNKLRYKSYVT-SALKVTSYIAALPPTESAASQHALNKF 181
+ E G R L+ Y +++ RY ++ + L ++ LPPT AA QH +
Sbjct: 517 LLENGVRILLAFYNAPKSEDNIDHFRYTQFIKFTKLNKPVQLSTLPPTSVAAHQHIKRVY 576
Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
YQ+Q WLGK++ P EWGW L PI TL PAP LL+++ C+CK GC
Sbjct: 577 YQIQTWLGKDLEPQEWGWMLENEILEPIRTLLPPAPAELLNVIFCNCKNGC 627
>gi|321455975|gb|EFX67094.1| hypothetical protein DAPPUDRAFT_331399 [Daphnia pulex]
Length = 222
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 84/166 (50%), Gaps = 3/166 (1%)
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
+ +ILF + GCD+TS + K K+L+ + N +F + +T I +AGE
Sbjct: 15 ICNNILFAHTVLGCDTTSKPYGVCKAKSLKLLPTNTTFQMMAAKFYEQDATIDVIKKAGE 74
Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQ 186
++++YGG+ ++ LRYK + TS++ +PPT +A H+ ++Q+Q
Sbjct: 75 VAMILIYGGS-TNEGIDTLRYKEFQRKISIATSFVNPQEIPPTSAALQFHSQRVYFQIQS 133
Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
WLG ++ +WGW + L P T +P LL ++ CK C
Sbjct: 134 WLGFSLDACDWGWVMKHDMLWPRLTNIEASPKDLLQIIKGGCKGVC 179
>gi|405970451|gb|EKC35353.1| hypothetical protein CGI_10012495 [Crassostrea gigas]
Length = 218
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 84/166 (50%), Gaps = 2/166 (1%)
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
L + + I +++G D+TS F GK AL+ + + + F D +++T I ++G
Sbjct: 7 LCQLLPAIHSVTGFDTTSKPFGIGKAAALKKIKSTNYIQETLNTFCDTSASTESILQSGS 66
Query: 129 RFLVVLYGG-NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
+ LYGG +E + +LR +A ++ + +LPPT AA+ H + Q + W
Sbjct: 67 EVISSLYGGVPYEGLDILRLRKFGSKVAAGSLSVQVQSLPPTADAANFHIKRAYLQYRYW 126
Query: 188 L-GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+ +N+ P +WGW L P+ T K AP LL V C+CK C
Sbjct: 127 MTDENLDPCQWGWYLSSGELFPVKTSKPVAPAILLKTVKCNCKTNC 172
>gi|443692109|gb|ELT93782.1| hypothetical protein CAPTEDRAFT_190961 [Capitella teleta]
Length = 394
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 86/167 (51%), Gaps = 16/167 (9%)
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
+ ++F R + G S + +A + + K D+ FLDP + ++ +AGE
Sbjct: 197 IQSEVMFQRYMLGAQSM-----ESPAEAFKRLFKEADI------FLDPTANINDVRQAGE 245
Query: 129 RFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQ 186
+ +++G T+S N LRY+ + Y+ LPPT +A+ H+ + QVQQ
Sbjct: 246 SAMAIVFGAKSRTSS-NDLRYQLLCKKIARKNKYVQPCTLPPTSAASKFHSDRVYLQVQQ 304
Query: 187 WL-GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
W +++ P EWGW ++ ++++PI T AP LLS++ SC+ C
Sbjct: 305 WRKNESLSPCEWGW-KIVDSVMPIMTDVPAAPAVLLSMICYSCQGDC 350
>gi|321470191|gb|EFX81168.1| hypothetical protein DAPPUDRAFT_317920 [Daphnia pulex]
Length = 392
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 90/175 (51%), Gaps = 10/175 (5%)
Query: 22 VVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHP---QTTIDTSLAEHILFI 76
VV+VGED DLL++LI S N+Y + K L++ Q ++ L ILF
Sbjct: 219 VVVVGEDTDLLILLIHYCKESNHNMYMKSESKSKKCGKLWNIKKIQDSLGKELCSSILFC 278
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
A GCD+TS F + K +L+ N D + F + ST ++I AGE + ++YG
Sbjct: 279 HAFLGCDTTSKPFGKEKCASLKLQNTNSDFKIVSKIFYESESTKQDIDTAGENAMCIVYG 338
Query: 137 GNHETTSLNKLRYKSY---VTSALKVTSYI-AALPPTESAASQHALNKFYQVQQW 187
G +++LRY+ + V +A S I LPPT++A H+ ++QV ++
Sbjct: 339 G-LVIDGIDRLRYQIFQKKVNNAKLTKSIIPEELPPTQAALKFHSRRAYFQVFKF 392
>gi|260808991|ref|XP_002599290.1| hypothetical protein BRAFLDRAFT_64352 [Branchiostoma floridae]
gi|229284567|gb|EEN55302.1| hypothetical protein BRAFLDRAFT_64352 [Branchiostoma floridae]
Length = 1620
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 18/233 (7%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTS 68
A+E E VV+ ED D+++I + R P VY K G R + + + +
Sbjct: 1338 AAESGSEAVVITAEDTDVMIISLAFAKRIPCKVY-QKCGTKNRTRFIDI-DKLADALGEE 1395
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EA 126
+ + ++ + A +GCD+ SA +GK+ A + ++KN + ++ + + + + T +
Sbjct: 1396 VCKALVGLHAFTGCDTVSAFSGRGKLGAFKLMLKNEEYQNAFQKLGESWTVSPDDTLFKR 1455
Query: 127 GERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQ 186
ERF +Y + +N++R+ ++ V S +ALPP H YQ
Sbjct: 1456 IERFTCQMYVSSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGI 1513
Query: 187 WLG--KNVP----PTEWGWNQVGNTLIPITTLKS-PAPDSLLSLVACSCKKGC 232
W G +N P P + GW + + IT L+S PAP ++L L+ CSC + C
Sbjct: 1514 WRGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPAAVLELLTCSCSRSC 1566
>gi|443716490|gb|ELU07992.1| hypothetical protein CAPTEDRAFT_216620 [Capitella teleta]
Length = 1395
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 77/164 (46%), Gaps = 10/164 (6%)
Query: 75 FIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVL 134
A++GCD+ SA + +GK A T P++ ++ + + ++ ERF+++L
Sbjct: 1213 LFHAMTGCDTVSAFYGRGKRTAWETWQSYPEVTEAFQKCVSSDTVSKTCMSLLERFVILL 1272
Query: 135 YGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN--- 191
Y + + T +N+ R + A + + +PPT++A QH Q + W
Sbjct: 1273 YDKSSKATDVNEARKHIFTQKARSLEN----IPPTQAALEQHVKRAVLQAKIWNNSTEAV 1328
Query: 192 ---VPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+ P++WGW + G+ P+ T A L+ C CKKGC
Sbjct: 1329 PSAIDPSKWGWVKEGSQWKPLWTSLPEAAKCCYELIHCGCKKGC 1372
>gi|260819138|ref|XP_002604894.1| hypothetical protein BRAFLDRAFT_77273 [Branchiostoma floridae]
gi|229290223|gb|EEN60904.1| hypothetical protein BRAFLDRAFT_77273 [Branchiostoma floridae]
Length = 1407
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 109/232 (46%), Gaps = 16/232 (6%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHPQ--TTIDTSL 69
A+E E VV+ ED D++VI + R P +Y K G + Q T+ +
Sbjct: 1125 AAETGSEAVVITAEDTDVMVISLAFAKRIPCKLY-QKCGTKSRTRFIAIDQLGDTLGEEV 1183
Query: 70 AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EAG 127
+ ++ + A +GCD+ SA +GK+ A + +++N + ++ + + + + T +
Sbjct: 1184 CKALVGLHAFTGCDTVSAFSGRGKLGAFKLMLRNAEYQDAFQQLGESWTVSPDDTLFKRI 1243
Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
ERF +Y + +N++R+ ++ V S +ALPP H YQ W
Sbjct: 1244 ERFTCQMYVSSTSVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 1301
Query: 188 LG--KNVP----PTEWGWNQVGNTLIPITTLKS-PAPDSLLSLVACSCKKGC 232
G +N P P + GW + + I L+S PAP ++L L+ CSC + C
Sbjct: 1302 RGCLQNDPQVPSPVDAGWKLDEDGNLSIRWLQSPPAPAAVLELLTCSCSRSC 1353
>gi|156356217|ref|XP_001623825.1| predicted protein [Nematostella vectensis]
gi|156210558|gb|EDO31725.1| predicted protein [Nematostella vectensis]
Length = 605
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 64/122 (52%), Gaps = 3/122 (2%)
Query: 113 FLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIA--ALPPTE 170
F + +I AGE L+ LY G + T L+ LRY + + + +LPPT
Sbjct: 419 FHQADAAKNDIAAAGETDLLCLYKGLKDET-LDSLRYARFCQKISTGNTQVQPESLPPTS 477
Query: 171 SAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
+AA H+L ++QVQQW G +PP +WGW +V L P T +S A SLL L+ C+
Sbjct: 478 AAAIYHSLRVYHQVQQWRGIALPPEDWGWKEVDGKLQPQRTDQSAAHPSLLELIRCNSLC 537
Query: 231 GC 232
C
Sbjct: 538 SC 539
>gi|260817557|ref|XP_002603652.1| hypothetical protein BRAFLDRAFT_98594 [Branchiostoma floridae]
gi|229288974|gb|EEN59663.1| hypothetical protein BRAFLDRAFT_98594 [Branchiostoma floridae]
Length = 1607
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 99/222 (44%), Gaps = 15/222 (6%)
Query: 22 VVLVGEDVDLLVILIGTRS--PSNVYFLKPG---RGKVVPVLYHPQTTIDTSLAEHILFI 76
V++ ED D+ ++ + + P+++ F+K G R K V + Q + +L +
Sbjct: 1340 VIVSSEDTDVFLLCLAFKQSIPASI-FVKCGTHSRIKYVSITNAAQV-WGQDICSSLLGM 1397
Query: 77 RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYG 136
A +GCDS SA +GK+ ALR V +N D + + E+ + E F +Y
Sbjct: 1398 HAFTGCDSVSAFAGRGKLGALRLVKENRDFQEMFKLVGMDWELSNELFKKLEEFTCHMYS 1457
Query: 137 GNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGK-----N 191
T+ +N+LRY+ + + S LPP HA Y W
Sbjct: 1458 SRPGTSDVNELRYRLFCAKRGSIDS--VQLPPCADCLYNHAKRANYVAAIWKKSLESHPV 1515
Query: 192 VP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+P P GW + G+ L+ PAP ++L L++CSC K C
Sbjct: 1516 IPSPIGLGWCKDGDQLVIDWMDGEPAPTAVLELLSCSCSKAC 1557
>gi|405973743|gb|EKC38436.1| hypothetical protein CGI_10011190 [Crassostrea gigas]
Length = 183
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 4/131 (3%)
Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL--KVTSY 162
D L + L A+ AGE + LY G LN LRY+ + L K
Sbjct: 17 DFLKTQAQLLCNANAKDYAISAGENIISSLYNG-APYEGLNVLRYRKFAARVLTNKTCVQ 75
Query: 163 IAALPPTESAASQHALNKFYQVQQWLGK-NVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
I LPPT +AAS H+ + Q++ W+ K N+ P EWGW L+P+ AP L
Sbjct: 76 IHTLPPTSNAASFHSQRAYLQMKMWMNKDNLNPCEWGWKVANGNLVPVKCTMDAAPSKLR 135
Query: 222 SLVACSCKKGC 232
+++ C+CK C
Sbjct: 136 NIIRCNCKTNC 146
>gi|443728947|gb|ELU15065.1| hypothetical protein CAPTEDRAFT_192160 [Capitella teleta]
Length = 480
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 51/81 (62%), Gaps = 4/81 (4%)
Query: 150 KSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPI 209
KS +TS +K+ S LPPT +AA+QH+L ++QVQ W GK V T WGW L P+
Sbjct: 336 KSSLTSTIKLES----LPPTSAAAAQHSLRAYHQVQTWHGKMVDATAWGWQIRDGILAPV 391
Query: 210 TTLKSPAPDSLLSLVACSCKK 230
+ K P++LL +VAC CK+
Sbjct: 392 ESTKGVDPENLLKIVACGCKR 412
>gi|443724546|gb|ELU12506.1| hypothetical protein CAPTEDRAFT_214834 [Capitella teleta]
Length = 463
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 64/121 (52%), Gaps = 10/121 (8%)
Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTES 171
P + EI A E FL+ LYG N ++ +KLR+ Y S+L T + +LPPT +
Sbjct: 317 PKTPKAEIENAREEFLLKLYGSNKLGSTFDKLRHYKYKQAIEKSSLTSTIRLESLPPTSA 376
Query: 172 AASQHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKG 231
AA+QH+ ++QV ++ T WGW L P T K P++LL +VAC CK
Sbjct: 377 AAAQHS---YHQVH---STDLHATAWGWQMGDGILAPEETTKGVVPENLLKMVACGCKTQ 430
Query: 232 C 232
C
Sbjct: 431 C 431
>gi|321469250|gb|EFX80231.1| hypothetical protein DAPPUDRAFT_103743 [Daphnia pulex]
Length = 210
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 75/150 (50%), Gaps = 3/150 (2%)
Query: 85 TSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSL 144
+S + GK K+L+ + N + +F + +T I EAGE ++++YGG+ +
Sbjct: 19 SSKPYGVGKAKSLKLLSTNTTIQMMAAKFYEQDATIDVIQEAGEVAMILIYGGST-NKGI 77
Query: 145 NKLRYKSYVTSALKVTSYI--AALPPTESAASQHALNKFYQVQQWLGKNVPPTEWGWNQV 202
+ LRYK + TS++ +PPT +A ++ ++QVQ WLG ++ +WGW
Sbjct: 78 DTLRYKEFQRKISIATSFMNPQEIPPTSAALQFNSQRVYFQVQSWLGFSLDDCDWGWVMK 137
Query: 203 GNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+ L P AP LL ++ C K C
Sbjct: 138 HDMLWPRLNNIEAAPKDLLQIIKCGGKGDC 167
>gi|291238911|ref|XP_002739369.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1049
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 99/233 (42%), Gaps = 17/233 (7%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIGTRSPSNV-YFLKPG---RGKVVPVLYHPQTTIDTSL 69
A+EE + VV+ ED D+ ++ + F K G R K++ + I +
Sbjct: 772 ATEEGYQAVVICSEDTDVFIMSLAFYDKIGASVFQKYGTRTRTKIIDI-GKVSAAIGIDV 830
Query: 70 AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
++ + A +GCD+ SA +GK ALR + N + + + ++E+ E
Sbjct: 831 CRALIGMHAFTGCDTVSAFAGKGKASALRFMTSNRETQDIFLQLGQEWTLSQELMNKLEA 890
Query: 130 FLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW-- 187
F LYG ++ +N LRY + ++ S+ LPP +HA YQ W
Sbjct: 891 FTCTLYGPKASSSKVNDLRYHLFCAKKGEIESH--QLPPCRDCLVKHAQRANYQAGIWRR 948
Query: 188 ---LGKNVP-PTEWGWN----QVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+P P GW + L+ PAP ++L L+AC+C + C
Sbjct: 949 CLEQDPQIPSPIGKGWKIEKFEGTEQLVVHWMDGQPAPQAILDLLACNCTRKC 1001
>gi|328698809|ref|XP_003240739.1| PREDICTED: hypothetical protein LOC100571724 [Acyrthosiphon pisum]
Length = 442
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 4/119 (3%)
Query: 88 LFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKL 147
LF QGK+K L+TV K+ +L + F + +S+ EI AGE++L+ LY TSLNKL
Sbjct: 324 LFQQGKIKHLKTVKKHQELHDALLIFNNESSSPEEIERAGEKYLLALYKAPAHITSLNKL 383
Query: 148 RYKSYVTSAL--KVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVP--PTEWGWNQV 202
R+ + +A K +A LPPT AA +H + Q+Q W G + TE G ++
Sbjct: 384 RHDVFQKTAASNKKQVQLARLPPTIDAAREHLHRVYLQIQLWRGNKLEFRSTELGLERI 442
>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
purpuratus]
Length = 1742
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 94/218 (43%), Gaps = 15/218 (6%)
Query: 22 VVLVGEDVDLLVILIGTRSPSNVYFL-KPGRG--KVVPVLYHPQTTIDTSLAEHILFIRA 78
V++ D D+ VI + L GRG K L ++ + S+A ++ +
Sbjct: 1478 VIIKSPDTDVFVIGVAKADQIQAQLLFHTGRGNNKRTLNLTAIRSHLGDSVANALIGLHC 1537
Query: 79 ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
SGCDSTS + + K K L+ + ++ D ++F S + + E+F+ LY +
Sbjct: 1538 FSGCDSTSCFYGRSKKKPLKLMTESVDFQAAFQKFGATFSVEESLVDTMEKFVCRLY--D 1595
Query: 139 HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQWLGKNVP--- 193
+ TS+N RY ++ T +PPT A +H + YQ + + P
Sbjct: 1596 QDCTSVNTARYNKFLMG----TKAEMNMPPTHDALVKHLMRANYQSAIHTRCLEQYPVIP 1651
Query: 194 -PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
P GW + + PAP +LL L CSCKK
Sbjct: 1652 SPHNHGWKVTDTNIEVVWGDLPPAPSTLLELTYCSCKK 1689
>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
purpuratus]
Length = 1617
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 15/218 (6%)
Query: 22 VVLVGEDVDLLVILIGTRSPSNVYFL-KPGRG--KVVPVLYHPQTTIDTSLAEHILFIRA 78
V++ D D+ VI + L GRG K L ++ + +A ++ +
Sbjct: 1353 VIIKSPDTDVFVIGVAKADQIQAQLLFHTGRGNNKRTLNLTAIRSHLGDGVANALIGLHC 1412
Query: 79 ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
SGCDSTS + + K K L+ + ++ D ++F S + + E+F+ LY +
Sbjct: 1413 FSGCDSTSCFYGRSKKKPLKLMTESVDFQAAFQKFGATFSVEESLVDTMEKFVCRLY--D 1470
Query: 139 HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQWLGKNVP--- 193
+ TS+N RY ++ T +PPT A +H + YQ + + P
Sbjct: 1471 QDCTSVNTARYNKFLMG----TKAEMNMPPTHDALVKHLMRANYQSAIHTRCLEQYPVIP 1526
Query: 194 -PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
P GW + + PAP +LL L CSCKK
Sbjct: 1527 SPHNHGWKVTDTNIEVVWGDLPPAPSTLLELTYCSCKK 1564
>gi|390365574|ref|XP_003730848.1| PREDICTED: uncharacterized protein LOC100891077 [Strongylocentrotus
purpuratus]
Length = 636
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 93/216 (43%), Gaps = 18/216 (8%)
Query: 22 VVLVGEDVDLLVILIGTR--SPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAI 79
VV+V +D D+L++L R + +N+Y G LY ++ID E LF
Sbjct: 383 VVVVAQDTDILILLCYHRPVNCTNLYLQADFDG-----LYDI-SSIDIGDQEEFLFKYGW 436
Query: 80 SGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNH 139
SG D+ S + G K K P I F ST + AG R + + YG
Sbjct: 437 SGNDTVSCI--HGHTKCALYKCKFP--ASVITAFTSNTSTDSTVRTAGLRAMQITYGCG- 491
Query: 140 ETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN---VPPTE 196
T L K RY + A K LPPTE A +QH+L QV W + + P
Sbjct: 492 -DTPLEKSRYLKFKKQAAKGKIDPDRLPPTEDATAQHSLRVHLQVVVWKHLDTAILDPKG 550
Query: 197 WGWNQVGN-TLIPITTLKSPAPDSLLSLVACSCKKG 231
GW N L P APD+LL + C+CK+G
Sbjct: 551 RGWELDSNRKLRPKMLSVGIAPDNLLKGICCNCKEG 586
>gi|340385049|ref|XP_003391023.1| PREDICTED: hypothetical protein LOC100633611, partial [Amphimedon
queenslandica]
Length = 842
Score = 70.1 bits (170), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 90/217 (41%), Gaps = 17/217 (7%)
Query: 28 DVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLA----EHILFIRAISGCD 83
D D+ V+L+ + + L G Y P I TS+ + A +GCD
Sbjct: 606 DTDVFVLLLAAAAKMKIEKLWIAFGTGKNFRYIPVHDIVTSIGAVKCRALPMFHAYTGCD 665
Query: 84 STSALFNQGKVKALRTV-VKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETT 142
+ S+ +GK A T V + +I PA + E ERF ++LY +
Sbjct: 666 TVSSFSTRGKKSAWDTWKVYDKVTETFIALSTGPAQISDGDLEVLERFTILLYDRTSKAI 725
Query: 143 SLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGK------NVP-PT 195
+N+ R + K + A+PPT++A QH YQ GK N+P P
Sbjct: 726 CINEARKHLFS----KKGRSMDAIPPTKAALIQHIRRAVYQGGHCWGKVYQAAMNLPSPE 781
Query: 196 EWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
WGW N P+ T A S L+ C+CKKGC
Sbjct: 782 NWGWTDT-NDWKPLWTTLPEAGASSRELLCCNCKKGC 817
>gi|291223052|ref|XP_002731527.1| PREDICTED: scavenger receptor class B, member 2-like [Saccoglossus
kowalevskii]
Length = 639
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 12/155 (7%)
Query: 85 TSALFN-QG-----KVKALRTVVK-NPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
T A+F QG K A R +VK +P + F+ + +I+ + +V L+GG
Sbjct: 3 TQAVFELQGFSASEKKSAFRKLVKSDPVMKSCASAFILQNKSQEDISGLAKDMMVNLFGG 62
Query: 138 NHETTSLNKLRYKSYVTSALKVTSYIAA--LPPTESAASQHALNKFYQVQQWLG--KNVP 193
+ T L+ LR + S++ LPPT A H+ ++Q+ W+G +
Sbjct: 63 KYNDT-LSSLRLIIFTKKVATAESFVTLEWLPPTLPATRFHSQCVYFQIMVWMGMANEMN 121
Query: 194 PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
P EW Q + LIPI T K+ APD LL ++ C C
Sbjct: 122 PIEWRLKQENDQLIPIMTPKNAAPDKLLKIIHCDC 156
>gi|443686811|gb|ELT89964.1| hypothetical protein CAPTEDRAFT_191939 [Capitella teleta]
Length = 321
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 85/175 (48%), Gaps = 18/175 (10%)
Query: 18 EDEVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPV--------LYHPQTTIDTS 68
++ V V ED DL+V+L+ S LK + + ++ Q ++ +
Sbjct: 122 KENHVTAVDEDTDLIVLLLHHAPNSCQTIVLKSDIHRSIKSHKPAKQWHIHSAQRSLGSE 181
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRR----FLDPASTTREIT 124
+ H+LFI + GCD+TS+++ G K + +KN L + + FLDP + ++
Sbjct: 182 MCRHLLFIHGLLGCDTTSSIYGLG--KGMPLALKNASLCRRLFKEADIFLDPTANINDVQ 239
Query: 125 EAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYI--AALPPTESAASQHA 177
+A E + +++G + SLN L Y+ + ++ Y+ LPPT +A+ H+
Sbjct: 240 QARECAMAIVFGVKNR-PSLNDLTYQLFCKKIARINKYVQPCTLPPTSAASKFHS 293
>gi|328699454|ref|XP_003240937.1| PREDICTED: KRAB-A domain-containing protein 2-like [Acyrthosiphon
pisum]
Length = 525
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 4/108 (3%)
Query: 74 LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
+FI A+SGCD+TS++F QGK+K ++T K+ +L + F + +S+ +I G+ FL+
Sbjct: 1 MFIHALSGCDTTSSIFQQGKLKYVKTFQKHSELQDSLLIFNNESSSADDILSVGQEFLLK 60
Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKF 181
L+ TSLN+ SY++ Y+ + + LNKF
Sbjct: 61 LFNAPKFITSLNQEDNSSYLSDE----KYLNIINELKGNHKSRRLNKF 104
>gi|405966034|gb|EKC31359.1| hypothetical protein CGI_10019217 [Crassostrea gigas]
Length = 233
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 9/164 (5%)
Query: 18 EDEVVVLVGEDVDLLVILIG-TRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTSLAEHI 73
+D VV+ GED DLL +L + ++F + V + ++ + + +
Sbjct: 50 DDHTVVVYGEDTDLLNLLCHYAKEGRQIFFTDKQTSMKNHRVWDISKAKSVLGSDSCRQL 109
Query: 74 LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
LFI A++GCD+ S L GK AL+ ++ + L FL S+ +I +AGE LV
Sbjct: 110 LFIHALTGCDTASRLHGIGKPAALKKIMTDIYLKSQGAVFLQENSSKEDIIKAGEEALVN 169
Query: 134 LYGGNHETTSLNKLRYKSYVTSAL----KVTSYIAALPPTESAA 173
L GG L+ LR++ + T + + LPPT AA
Sbjct: 170 LCGGVL-LEGLDILRWRKFTTKTMSSKRNAVVQVQPLPPTSDAA 212
>gi|291228946|ref|XP_002734438.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 760
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 60/128 (46%), Gaps = 10/128 (7%)
Query: 65 IDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRF-LDPASTTREI 123
I L +L A +GCDS S+ ++GKV LR + +NP +P A TT E
Sbjct: 627 IGNELCNTLLSFHAFTGCDSISSFVHKGKVGLLRVLQRNPQFIPVFTSLGTSQALTTDEF 686
Query: 124 TEAGERFLVVLYGGNHETTSLNKLRYKSYVT------SALKVTSYI--AALPPTESAASQ 175
E E F ++YG N +NKLRY ++T +L S I + LPP S+
Sbjct: 687 DEL-EAFTCLMYGANTGVKDINKLRYTMFMTCYTPKNKSLSNESGIDLSLLPPCRSSLRM 745
Query: 176 HALNKFYQ 183
H + YQ
Sbjct: 746 HIVRANYQ 753
>gi|156390576|ref|XP_001635346.1| predicted protein [Nematostella vectensis]
gi|156222439|gb|EDO43283.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 79/173 (45%), Gaps = 17/173 (9%)
Query: 23 VLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTT------------IDTSLA 70
V+ +D D+LVIL+G LK G G ++ Q T I+ +L
Sbjct: 356 VVNSDDTDVLVILLGHAQYLGKRHLKKGTGTKTRIIELDQITRKLARLAAQDIAIEDALC 415
Query: 71 EHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERF 130
+ F A++GCDS SA +GK + L+ V+KN + ++ S T E+ A E
Sbjct: 416 GLVGF-HALTGCDSVSAFSQKGKWRPLQIVLKNKKYMEAMKEIGRQWSVTEELFSATEEL 474
Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ 183
+ +YG T +N+L Y+ + +K+ ALPP +S+ H YQ
Sbjct: 475 VFHIYGKR--GTRVNRLIYELHCAKGVKIEP--NALPPYQSSLKLHVSRANYQ 523
>gi|405970677|gb|EKC35562.1| hypothetical protein CGI_10014486 [Crassostrea gigas]
Length = 381
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 69/132 (52%), Gaps = 5/132 (3%)
Query: 11 RSLASEEEDEVVVLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKV----VPVLYHPQTTI 65
++ ++ +D +V+ GED DLLV+L + ++F + + V + ++ +
Sbjct: 161 ETVVNKADDHTMVVYGEDPDLLVLLCHYAKEGRQIFFTTDKQTSMKNHRVGDISKAKSVL 220
Query: 66 DTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE 125
+ +LFI A++GCD+TS L GK AL+ ++ + L FL S+ +I +
Sbjct: 221 GSDSCRQLLFIHALTGCDTTSRLHGIGKPAALKKIMTDIYLKSQGAVFLQENSSKEDIIK 280
Query: 126 AGERFLVVLYGG 137
AGE LV LYGG
Sbjct: 281 AGEEALVNLYGG 292
>gi|405978040|gb|EKC42457.1| hypothetical protein CGI_10004686 [Crassostrea gigas]
Length = 133
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 6/73 (8%)
Query: 163 IAALPPTESAASQHALNKFYQVQQWLGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDS 219
+ +LPPT AA H++ + Q Q W GK+V PTEWGW L+PI +K PAPD
Sbjct: 11 VLSLPPTSDAAIVHSMRVYLQCQYWKGKSVADLDPTEWGWTLKTGKLLPI--VKPPAPD- 67
Query: 220 LLSLVACSCKKGC 232
L ++ C+CK C
Sbjct: 68 FLKIIHCNCKTNC 80
>gi|390356858|ref|XP_003728872.1| PREDICTED: uncharacterized protein LOC100891119 [Strongylocentrotus
purpuratus]
Length = 1654
Score = 59.7 bits (143), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 88/212 (41%), Gaps = 15/212 (7%)
Query: 22 VVLVGEDVDLLVILIGTRSP---SNVYFLKPGRGKVVPV-LYHPQTTIDTSLAEHILFIR 77
+++ D D+ VI IG + S +YF + K + L Q + +++ I+ +
Sbjct: 1374 IIIKSPDTDVFVIGIGIAAQLEGSKLYFHTGKQDKERTINLNAIQCHLGDQISDAIVGLH 1433
Query: 78 AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
+GCDS SAL+ +GK K + + + + + T ++ E F+ LYG
Sbjct: 1434 PFTGCDSVSALYGRGKTKPFTLMSQTTKFIKAFQELGKAFTLTDDLVSTLEEFVCKLYGM 1493
Query: 138 NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL-----GKNV 192
E + +N+ RY + + T +PP + A +H YQ W +V
Sbjct: 1494 K-EISKVNEARYAFFSMA----TRQEDIMPPNKDALMKHIQRANYQTAIWRRCLESQPDV 1548
Query: 193 P-PTEWGWNQVGNTLIPITTLKSPAPDSLLSL 223
P P GW V L PA S+L L
Sbjct: 1549 PSPVGRGWQSVDGGLRIDWMDMQPAQQSILEL 1580
>gi|405973424|gb|EKC38141.1| hypothetical protein CGI_10019306 [Crassostrea gigas]
Length = 117
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 105 DLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSAL--KVTSY 162
D L + L A+ ++ AGE + LY G LN LRY+ + L K
Sbjct: 17 DFLKTQAQLLCNANAKDDVISAGENIISSLYNG-APYEGLNVLRYRKFAARVLTNKTCVQ 75
Query: 163 IAALPPTESAASQHALNKFYQVQQWLGK-NVPPTEWGW 199
I LPPT +AAS H+ + Q++ W+ K N+ P EWGW
Sbjct: 76 IHTLPPTLNAASFHSQRAYLQMKMWMNKDNLNPCEWGW 113
>gi|156398608|ref|XP_001638280.1| predicted protein [Nematostella vectensis]
gi|156225399|gb|EDO46217.1| predicted protein [Nematostella vectensis]
Length = 1216
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 92/228 (40%), Gaps = 16/228 (7%)
Query: 14 ASEEEDEVVVLVGE-DVDLLVILIGTRSPSNVYFLKPG---RGKVVPVLYHPQTTIDTSL 69
AS+E VV+ + DV +L I R + K G R +P+ + + +
Sbjct: 943 ASQEMTRVVIQSPDTDVAILAIHAFERMGCRELWFKTGYKDRVCFIPI-HDAARKLGPKV 1001
Query: 70 AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
I + A++GCD+TS L N GK K + N D ++ + ++ E
Sbjct: 1002 CAAIPGLHALTGCDTTSGLANIGKTKPWNKFMANTDSQAHLETLGSVIPPHEDTIKSAEC 1061
Query: 130 FLVVLYGGNHET-TSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW- 187
F+ LY + + + +K+RY + LPPT + QH + YQ W
Sbjct: 1062 FVCSLYTTSTKAGMTADKIRYWMFCQKHQSSDR----LPPTSDSLKQHIIRANYQTFIWK 1117
Query: 188 --LGKNVP---PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
L VP P G L P+ + PAP SLL L C C K
Sbjct: 1118 YCLQPMVPLPSPDGNGGKITNGRLEPLLMSQDPAPLSLLKLTTCRCVK 1165
>gi|390343919|ref|XP_003725994.1| PREDICTED: uncharacterized protein LOC100891484 [Strongylocentrotus
purpuratus]
Length = 556
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 88/221 (39%), Gaps = 26/221 (11%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIG------TRSPSNVYFLKPGRGKVVPVLYHPQTTIDT 67
A+E + +V+ D D++ IL+G P ++ G GK ++ I
Sbjct: 327 AAERGSKKIVIRTVDTDIIAILVGQLPSLIVEYPDIDIWVAFGMGK--NFCHYNINNICR 384
Query: 68 SLAEH----ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLD----PAST 119
+L E + A +GCD+TS +GK A R P++ F+D
Sbjct: 385 NLGEDKSLALPLFHAYTGCDTTSCFLAKGKKSAWRVWKSYPEVTQAFLHFVDHPFRAVDV 444
Query: 120 TREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALN 179
+ E ERF V+LY S+N+ R + + K + +PPT+ A QH
Sbjct: 445 SCEHFRHLERFTVLLYDITSNLLSVNEARRELFC----KKKRSLENIPPTQDALLQHIKR 500
Query: 180 KFYQVQQW-----LGKNVPPTE-WGWNQVGNTLIPITTLKS 214
YQ W +VPP E WGW N LKS
Sbjct: 501 VLYQGGIWTTCRQAQPSVPPPEGWGWTMEDNHRGVFHQLKS 541
>gi|291221333|ref|XP_002730676.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 836
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 92/205 (44%), Gaps = 29/205 (14%)
Query: 13 LASEEEDEVVVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEH 72
+AS D+ +V+ D D+ V+L+ SN + +++ L+ +
Sbjct: 589 IASSSPDKTIVVRSPDTDVFVLLLKFGQFSN-------KRRLID-LHKVIAETGKDICSA 640
Query: 73 ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAST---TREITEAGER 129
+ + A SGCD+ SA +GK+ L+ + ++P+ +I FL +T + E+ E
Sbjct: 641 LPALHAFSGCDTISAFVRKGKLLPLKILKQHPE---FIETFLTLGTTPDVDDRVYESLEH 697
Query: 130 FLVVLYGGNHETTSLNKLRYKSYVT-----SALKVTSY----IAALPPTESAASQHALNK 180
F +LY + + +NKLR + ++ ++SY ++ LPP ++A H
Sbjct: 698 FTCILYKRTYASCDINKLRLELFMQKFNPKQGTVLSSYNGVDMSILPPCKAALQMHIRRA 757
Query: 181 FYQVQQWLGKN-----VP-PTEWGW 199
YQ W N VP P + GW
Sbjct: 758 NYQALIWCRANQAHPQVPTPDDHGW 782
>gi|443685664|gb|ELT89202.1| hypothetical protein CAPTEDRAFT_223408 [Capitella teleta]
Length = 636
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 69/169 (40%), Gaps = 14/169 (8%)
Query: 73 ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS--TTREITEAGERF 130
+L A+SGCD+ S+ F +GK A + PDL + P S + + + E F
Sbjct: 447 LLAFHALSGCDTVSSFFGKGKRSAWQAWQACPDLTSALLELSSPVSHDSVKRVLPIIETF 506
Query: 131 LVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ-QWLG 189
+ LYG E+ L K T L +PP+ A H L +Q W G
Sbjct: 507 VTRLYGV--ESVDLVNAARK---TLFLNKGKQFVQIPPSSDALQLHLLRAVHQSAFVWGG 561
Query: 190 KNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
+ P EWGW + G+ +P P SL L CSCK C
Sbjct: 562 LLIRDPLVPSPEEWGWQRSGSAFVPHYISLPPLSSSLPELSFCSCKSVC 610
>gi|291227245|ref|XP_002733597.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
Length = 1578
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 13/167 (7%)
Query: 73 ILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPA---STTREITEAGER 129
+L +SGCD+ S + +GK +A + ++ L + ++ +P + EA
Sbjct: 1375 LLCAYVLSGCDTVSYPYKKGKKRAAKVALEVASKLLPLAQYAEPGEGFDVDENVIEAARL 1434
Query: 130 FLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ--VQQW 187
+++ LYG + ++LR + T+ + + PPTE A QH L +Q V +
Sbjct: 1435 YIMALYGRADFCGNFDELRAHLFGTTKGDLRCF----PPTEDAFLQHILRGLHQIVVNKS 1490
Query: 188 LGKNVP----PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
VP T +G V L+PI LK P C CKK
Sbjct: 1491 AHDPVPEYPIATMFGRKVVDGKLVPIRMLKGAKPTEATHKNYCRCKK 1537
>gi|357631811|gb|EHJ79278.1| hypothetical protein KGM_15523 [Danaus plexippus]
Length = 155
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 175 QHALNKFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
QH +N +YQ+Q WLG + T+WGW ++P T K P+ LL + CSC+ GC
Sbjct: 5 QH-INAYYQLQTWLGNELTATDWGWKLYQRGIMPKFTQKELIPEILLKTICCSCEIGC 61
>gi|260804463|ref|XP_002597107.1| hypothetical protein BRAFLDRAFT_76362 [Branchiostoma floridae]
gi|229282370|gb|EEN53119.1| hypothetical protein BRAFLDRAFT_76362 [Branchiostoma floridae]
Length = 1477
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 71/168 (42%), Gaps = 19/168 (11%)
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE----AGERFL 131
A +GCD+ S + GK A + + + + F D S+ I++ A E F
Sbjct: 1293 FHAFTGCDTVSQFSHIGKKTAWKVWETHDE---FTATFYDLHSSPHHISDEAETALEYFT 1349
Query: 132 VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN 191
++L+ +S+N+LR + + +++LPPT++A QH Q G+N
Sbjct: 1350 ILLFDRTSTCSSINQLRKHLFT----RKNRPMSSLPPTQAALHQHMRRAILQGGHHWGRN 1405
Query: 192 V-------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
P EWGW P+ T A +S L+ C C+ C
Sbjct: 1406 TTPCRQLPSPAEWGWTG-EEEWRPLWTTLPEAVESCPELLKCKCRTRC 1452
>gi|156408397|ref|XP_001641843.1| predicted protein [Nematostella vectensis]
gi|156228983|gb|EDO49780.1| predicted protein [Nematostella vectensis]
Length = 604
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 81/195 (41%), Gaps = 54/195 (27%)
Query: 24 LVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPV-LYHPQTTIDT---SLAEHILFIR 77
LVGED DLL++L + ++ F+ + K + +++ + T++ + ++I+F
Sbjct: 438 LVGEDTDLLILLCHYADANDHDILFMPQTKQKSGTMRVWNIKNTVEALGPDICKNIMFAH 497
Query: 78 AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGG 137
AI GCD+TSAL+ GK +L+ + + LL IR
Sbjct: 498 AILGCDTTSALYGLGKGLSLKMLTSDATLLELIR-------------------------C 532
Query: 138 NHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKNVPPTEW 197
N + LN + A H+L + QVQQW G +PP +
Sbjct: 533 NSLCSCLNGI-----------------------VGAVYHSLRVYQQVQQWRGLALPPEDC 569
Query: 198 GWNQVGNTLIPITTL 212
G ++ L P TL
Sbjct: 570 GLKEMDGKLPPQRTL 584
>gi|260816964|ref|XP_002603357.1| hypothetical protein BRAFLDRAFT_80350 [Branchiostoma floridae]
gi|229288676|gb|EEN59368.1| hypothetical protein BRAFLDRAFT_80350 [Branchiostoma floridae]
Length = 1516
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/182 (24%), Positives = 81/182 (44%), Gaps = 9/182 (4%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGKVVPVLYHPQ--TTIDTSL 69
A+E E VV+ ED D++VI + R P +Y K G + Q T+ +
Sbjct: 565 AAETGSEAVVITAEDTDVMVISLAFAKRIPCKLY-QKCGTKSRTRFIAIDQLGDTLGEEV 623
Query: 70 AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT--EAG 127
+ ++ + A +GCD+ S +GK+ + ++KN + ++ + + + + T +
Sbjct: 624 CKALVGLHAFTGCDTVSTFSGRGKLGVFKLMLKNAEYQDAFQQLGESWTVSPDGTLFKRI 683
Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
ERF +Y + +N++R+ ++ V S +ALPP H YQ W
Sbjct: 684 ERFTCQMYVSSTSVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 741
Query: 188 LG 189
G
Sbjct: 742 RG 743
>gi|156398132|ref|XP_001638043.1| predicted protein [Nematostella vectensis]
gi|156225160|gb|EDO45980.1| predicted protein [Nematostella vectensis]
Length = 1142
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 95/242 (39%), Gaps = 26/242 (10%)
Query: 10 RRSLASEEEDEVVVLVGEDVDLLVILIGT-----RSPSNVYFLKPGRGKVVPVLYHPQTT 64
R ++ + EV+V D D++VIL G P ++ G G+ Y T
Sbjct: 885 RHAVETSHAKEVIVRTV-DTDIVVILAGKFHQLKSQPELHLWVAYGMGR--HFRYLSINT 941
Query: 65 IDTSLAEHILF----IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRF-LDP--- 116
I L E A+SGCD+TS+ + +GK+ A +T D+ P ++ L+P
Sbjct: 942 ICEGLGEARFIALPVFHALSGCDTTSSFYGKGKLTAWQTWDIYKDVTPTLQFLALNPFHQ 1001
Query: 117 ASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQH 176
+ E + ERF V+LY +N R + + K + LPPT+ QH
Sbjct: 1002 LTLDSEHFKRIERFTVILYDKLSPFACINDARMELFC----KNNRAMDKLPPTKDTLLQH 1057
Query: 177 ALNKFYQVQQWLGKNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKK 230
YQ W + P ++ W + T +P + L C+ K
Sbjct: 1058 VKRCIYQAGIWTSSEISQPVLPSPDQFAWIKEAGTWVPGWITIPEVSKACSELTKCNSKG 1117
Query: 231 GC 232
C
Sbjct: 1118 VC 1119
>gi|443731762|gb|ELU16757.1| hypothetical protein CAPTEDRAFT_196996 [Capitella teleta]
Length = 569
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 77/176 (43%), Gaps = 54/176 (30%)
Query: 20 EVVVLVGEDVDLLVILIGTRSPS-NVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRA 78
+ VV++G DLLV+++ +PS NVY
Sbjct: 412 QTVVVIGNGTDLLVMMVALATPSMNVYI-------------------------------- 439
Query: 79 ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFL-------DPASTTREITEAGERFL 131
CD+T +V ++R + KN + I +L P + EI A FL
Sbjct: 440 ---CDTTKG----PRVFSIRVIEKN---IGEISSYLLAWGYAEKPETPKAEIENARGGFL 489
Query: 132 VVLYGGNHETTSLNKLRYKSYVT----SALKVTSYIAALPPTESAASQHALNKFYQ 183
+ LYG N ++L+KLR+ Y S+L T + +LPPT +AA+QH+L ++Q
Sbjct: 490 LKLYGSNKLGSTLDKLRHYKYKQAIEKSSLTSTIKLESLPPTSAAAAQHSLRAYHQ 545
>gi|443722574|gb|ELU11371.1| hypothetical protein CAPTEDRAFT_211864 [Capitella teleta]
Length = 264
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 7/73 (9%)
Query: 116 PASTTREITEAGERFLVVLYGGNHETTSLNKLRY---KSYVTSALKVTSYIAALPPTESA 172
P + EI A E FL+ LYG N ++L+KL+ KS +TS +K+ S LPPT +A
Sbjct: 187 PETPKAEIENAREEFLLKLYGFNKLGSTLDKLKQAIEKSSLTSTIKLES----LPPTSAA 242
Query: 173 ASQHALNKFYQVQ 185
A+QH+L ++QVQ
Sbjct: 243 AAQHSLRAYHQVQ 255
>gi|291238319|ref|XP_002739077.1| PREDICTED: tryptophan hydroxylase-like protein [Saccoglossus
kowalevskii]
Length = 726
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 83/209 (39%), Gaps = 16/209 (7%)
Query: 27 EDVDLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAISGCDST 85
+D D+L++ R +L+ G + + +TID S E LF SG +
Sbjct: 81 QDTDILILFCYHRPIDCTGLYLQADFGGLYDI-----STIDNSDREEFLFKYGWSG--NG 133
Query: 86 SALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHETTSLN 145
+ L G K K P I F S +I AG + L + YG T L
Sbjct: 134 TILCIHGHTKCALYKCKFP--ASVITTFTSNTSPVSKIRTAGLKALQITYGC--RDTPLE 189
Query: 146 KLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN---VPPTEWGWNQV 202
K RY + A K LPPTE+A +QH+L QV W N + P GW
Sbjct: 190 KSRYLKFTKQAAKGKIDPDRLPPTENATAQHSLRVHLQVVIWKHLNTSILEPKGRGWELD 249
Query: 203 GN-TLIPITTLKSPAPDSLLSLVACSCKK 230
N L P A D LL + C+ ++
Sbjct: 250 SNRKLKPKMLSGGIASDILLKGICCTARQ 278
>gi|390337691|ref|XP_790384.2| PREDICTED: LMBR1 domain-containing protein 2-like, partial
[Strongylocentrotus purpuratus]
Length = 1013
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 171 SAASQHALNKFYQVQQWLG---KNVPPTEWGWNQ-VGNTLIPITTLKSPAPDSLLSLVAC 226
+AA H+L +YQV+QW G + P +WGW + G ++ + T PA + LL ++ C
Sbjct: 398 AAAKHHSLRVYYQVRQWKGIAADELQPDDWGWRERSGGVIVLVRTDLPPALEELLKMIRC 457
Query: 227 SCKKGC 232
+C+ C
Sbjct: 458 NCQIDC 463
>gi|443700686|gb|ELT99539.1| hypothetical protein CAPTEDRAFT_208935 [Capitella teleta]
Length = 129
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/132 (23%), Positives = 58/132 (43%), Gaps = 10/132 (7%)
Query: 79 ISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGN 138
++GCD+ SA + +GK A ++ + + ++ E F+++LY +
Sbjct: 1 MTGCDTVSAFYGRGKRTAWEAWKSYLEVTEAYQDCVSSDRVSKTCMALSEGFVILLYDKS 60
Query: 139 HETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWLGKN------V 192
+ T +NK R + A + + +PPT +A QH Q + W +
Sbjct: 61 SKATDVNKARKHIFTQKARSLEN----IPPTHAALEQHVKRAVLQAKIWNNSTEAVPSAI 116
Query: 193 PPTEWGWNQVGN 204
P++WGW + GN
Sbjct: 117 DPSKWGWVKEGN 128
>gi|260782789|ref|XP_002586464.1| hypothetical protein BRAFLDRAFT_106666 [Branchiostoma floridae]
gi|229271576|gb|EEN42475.1| hypothetical protein BRAFLDRAFT_106666 [Branchiostoma floridae]
Length = 276
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 9/112 (8%)
Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
ERF +Y + +N++R+ ++ V S +ALPP H YQ W
Sbjct: 113 ERFTCQMYVPSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVQRANYQAGIW 170
Query: 188 LG--KNVP----PTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
G +N P P + GW + + IT L+SP AP ++L L+ CSC + C
Sbjct: 171 RGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPAAVLELLTCSCSRSC 222
>gi|260827565|ref|XP_002608735.1| hypothetical protein BRAFLDRAFT_73956 [Branchiostoma floridae]
gi|229294087|gb|EEN64745.1| hypothetical protein BRAFLDRAFT_73956 [Branchiostoma floridae]
Length = 255
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 9/112 (8%)
Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
ERF +Y + +N++R+ ++ V S +ALPP H YQ W
Sbjct: 113 ERFTCQMYVPSTPVADVNEMRHHLFIAKKGNVES--SALPPCRDCLHLHVKQANYQAGIW 170
Query: 188 LG--KNVP----PTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
G +N P P + GW + + IT L+SP AP ++L L+ CSC + C
Sbjct: 171 RGCLQNDPQVPSPVDAGWKLDEDGNLSITWLQSPPAPATVLELLTCSCSRSC 222
>gi|291226324|ref|XP_002733143.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1155
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 27/183 (14%)
Query: 67 TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
T+LA HI + A++GCD TS+ F +GKV L+ VKN L + L
Sbjct: 935 TALANHIGPVVCKSLPGYHALTGCDYTSSFFRKGKVNPLKKAVKNTLYLEGL-SILGENI 993
Query: 119 TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA--LPP 168
T + E+++ LYG TS+N+ R K ++ S L+ I A LPP
Sbjct: 994 TFADDENLVEQYVCSLYGQG-ALTSVNEARLKIFLQKYRPTNQESPLQKIKGIDAIMLPP 1052
Query: 169 TESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
+ Q Y W +N+ PT+ GW V +P+ + P L
Sbjct: 1053 CKDVLVQKLARCNYVAYLWKHAHVQNPLENIKPTDHGWKDVNGVFLPVWFTDNQMPSMLS 1112
Query: 222 SLV 224
++
Sbjct: 1113 QII 1115
>gi|291224113|ref|XP_002732051.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1436
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 27/183 (14%)
Query: 67 TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
T+LA HI + A++GCD TS+ F +GKV L+ VKN L + L
Sbjct: 1218 TALANHIGPVVCKSLPGYHALTGCDYTSSFFRKGKVNPLKKAVKNTLYLEGL-SILGENI 1276
Query: 119 TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA--LPP 168
T + E+++ LYG TS+N+ R K ++ S L+ I A LPP
Sbjct: 1277 TFADDENLVEQYVCSLYGQG-ALTSVNEARLKIFLQKYRPTNQESPLQKIKGIDAGMLPP 1335
Query: 169 TESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
+ Q Y W +N+ PT+ GW V +P+ + P L
Sbjct: 1336 CKDVLVQKLARCNYVAYLWKHAHIQNPLENIKPTDHGWKDVNGVFLPVWFTGNQMPSMLS 1395
Query: 222 SLV 224
++
Sbjct: 1396 QII 1398
>gi|449689845|ref|XP_002154924.2| PREDICTED: uncharacterized protein LOC100207801 [Hydra
magnipapillata]
Length = 460
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 166 LPPTESAASQHALNKFYQVQQWL----GKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLL 221
LPPT+ AA H L + QV +W+ N+ P EWGWN L+ I T+K ++L
Sbjct: 119 LPPTDRAAYFHGLRVYLQVIEWIMLDESFNLDPKEWGWNLDNGCLLTIPTIKDVVSPNIL 178
Query: 222 SLVAC 226
++ C
Sbjct: 179 KVIRC 183
>gi|328721000|ref|XP_003247183.1| PREDICTED: hypothetical protein LOC100570608 [Acyrthosiphon pisum]
Length = 135
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 187 WLGKNVPPTEWGWNQVGNTLIPITTLKSP-APDSLLSLVACSCKKGC 232
WLG P EWGW+ + L+PI LK P P+SLL+L+ C+C KGC
Sbjct: 2 WLGNEKSPIEWGWHVKNDLLLPIP-LKGPLIPESLLNLITCNCTKGC 47
>gi|405969476|gb|EKC34445.1| hypothetical protein CGI_10012724 [Crassostrea gigas]
Length = 101
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 181 FYQVQQWLGK---NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK 229
+ Q Q W GK ++ PTEWGW L+PI PAPD LL ++ C+CK
Sbjct: 4 YLQCQYWKGKSEADLDPTEWGWTLKTGKLLPIEMSNPPAPDFLLKIIHCNCK 55
>gi|443732835|gb|ELU17399.1| hypothetical protein CAPTEDRAFT_207062 [Capitella teleta]
Length = 627
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 67/166 (40%), Gaps = 14/166 (8%)
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS--TTREITEAGERFLVV 133
A+SGCD+ S+ F +GK A + P L + P S + + + E F+
Sbjct: 236 FHALSGCDTVSSFFGKGKRSAWQAWQACPGLTSALLEVSSPVSHDSVKRVLPIIETFVTR 295
Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ-QWLG--- 189
LYG E+ L K T L +PP+ A H L +Q W G
Sbjct: 296 LYGV--ESVDLVNAARK---TLFLNKGKQFVQIPPSSDALQLHLLRAVHQSAFVWGGLLI 350
Query: 190 --KNVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
VP P +WGW + G+ +P P SL L CSCK C
Sbjct: 351 RDSLVPSPEKWGWQRSGSAFMPHYISLPPLSSSLPELSFCSCKSVC 396
>gi|156364755|ref|XP_001626511.1| predicted protein [Nematostella vectensis]
gi|156213389|gb|EDO34411.1| predicted protein [Nematostella vectensis]
Length = 1373
Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%), Gaps = 11/112 (9%)
Query: 124 TEAG-ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFY 182
T AG E+ + LY N T + LR+ + K + LPPT +A + Y
Sbjct: 1205 TYAGIEKLVCKLYQPNTRITKVKDLRWLLF----RKKQAESERLPPTLAALREATKRAHY 1260
Query: 183 QVQQW-----LGKNVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
Q W +P P+++GW + +P+ T PAPD++L LV C C
Sbjct: 1261 QCMVWNSDILANPELPSPSDYGWKLEADEWVPVMTSLKPAPDAVLHLVKCGC 1312
>gi|449668498|ref|XP_004206798.1| PREDICTED: uncharacterized protein LOC100212644 [Hydra
magnipapillata]
Length = 429
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 66/167 (39%), Gaps = 14/167 (8%)
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT-EAGERFLVVL 134
AI+GCD S + K + +T DL ++ + +S I ERF+V+L
Sbjct: 242 FHAITGCDQVSFFAGKEKKLSWKTWENFEDLTNALQSISNCSSKEDIIIFHLIERFVVLL 301
Query: 135 YGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQ----QWLGK 190
Y +S N+ R + K A+PPT+ H YQ Q L K
Sbjct: 302 YDRTSSYSSTNECRKDLFS----KKGRLPDAMPPTQDNLQLHISRAVYQASFCWAQSLCK 357
Query: 191 N--VP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCK--KGC 232
N +P P EWGW N + T L+ C CK KGC
Sbjct: 358 NPSLPNPCEWGWVMEDNVYKIVWTTIREVSKMCSELIRCGCKIEKGC 404
>gi|390339454|ref|XP_003725007.1| PREDICTED: uncharacterized protein LOC100893398 [Strongylocentrotus
purpuratus]
Length = 1226
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 31/231 (13%)
Query: 14 ASEEEDEVVVLVGEDVDLLVILIG--TRSPSNVYFLKPGRGK-----VVPVLYHPQTTID 66
A E D +++ D D+ +IL+G RS + G G V + Q
Sbjct: 962 AREISDGNILVRSSDTDVFIILLGLCGRSTGMNIIMDYGSGNNRRYIDVSNVAALQENKQ 1021
Query: 67 TSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEA 126
E ++ + A++GCD TS F GK+K L+ + + + +R IT
Sbjct: 1022 PGSTEALIGLHALTGCDFTSCFFRNGKMKPLQKMEASTTHMQALRSLTSEEVDVPAITS- 1080
Query: 127 GERFLVVLYGGNHETTSLNKLRYKSY--VTSA--------LKVTSYIAALPPTESAASQH 176
F+ LYG T+ +N+ R K++ ++S LK + ++LPP H
Sbjct: 1081 ---FVCSLYGC--VTSDINEARCKAFKRISSGGEKGPLAKLKKIN-CSSLPPCAKTLGNH 1134
Query: 177 ALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIPITTLKSPAPDSL 220
Y + W + P ++GW N P S P+SL
Sbjct: 1135 IKRAHYVARMWKRADQAEPTGGSRPLDFGWKSTNNCFEPEWYPGSAVPESL 1185
>gi|291245182|ref|XP_002742470.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 1470
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 100/231 (43%), Gaps = 34/231 (14%)
Query: 22 VVLVGEDVDLLVILI----GTRSPSNVYFLKPGR-------GKVVPVLYHPQTTIDTSLA 70
+V+ D D++V+LI ++ S V+ ++ GR +++P+ H SLA
Sbjct: 1210 LVVQSPDTDVMVLLIHYMPQMQAVSTVW-METGRITRSLDLRRMIPI--HKIADKIGSLA 1266
Query: 71 EHIL-FIRAISGCDSTSALFNQGKVKALRTVVK-NPDLLPYIRRFLDPASTTREITEAGE 128
++ + +++GCD+ S+ F GK ++ ++K + L +++ A
Sbjct: 1267 CSVMPAVHSLTGCDTVSSFFGIGKKTVIQRMLKLSKTELTHLQAL--SKGNEDSAVNASR 1324
Query: 129 RFLVVLYGGNHE-----TTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQ 183
RF+V LY ++ +SLN L + T ++ V A PP E+A QH +Q
Sbjct: 1325 RFVVSLYDPKNKYSRDNQSSLNSLSVRLAATRSVSV----AKFPPCEAAFIQHVKRASWQ 1380
Query: 184 VQQWLGKNVP------PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
+ W ++ P GW + L+P+ + P L C C
Sbjct: 1381 TKTWTSAHLAKPILPKPAGHGWIIKNDILVPV-FFEGPTALDKLKDYFCGC 1430
>gi|291234141|ref|XP_002737008.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1288
Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 79/176 (44%), Gaps = 22/176 (12%)
Query: 21 VVVLVGEDVDLLVILIGTRSPSNV-----YFLKPGRGKVVPVLYHPQTTIDTSLAEHILF 75
+VVL G D D+ V+L+ N +++ G G Y P + + + + +
Sbjct: 1126 IVVLSG-DTDVFVLLMHYWDIFNSEGLRELWIRAGVGD--STRYFPIHILAPRIGKDLCY 1182
Query: 76 ----IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFL 131
+ ++GCD TS + G A T+ NP L Y+ F S T + T + E +L
Sbjct: 1183 LLPLVHTLTGCDYTSKV---GTKHA--TLSANP--LEYLSDFDSAPSCTDDFTASCETYL 1235
Query: 132 VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
V + N T++++LR +Y S K S+ LPPT A QH +Y +W
Sbjct: 1236 VQVLKRNTTCTTMDRLRDDTYHHS--KGVSF-DQLPPTSHAVQQHIRRAYYATLRW 1288
>gi|260800823|ref|XP_002595296.1| hypothetical protein BRAFLDRAFT_128105 [Branchiostoma floridae]
gi|229280541|gb|EEN51308.1| hypothetical protein BRAFLDRAFT_128105 [Branchiostoma floridae]
Length = 1434
Score = 46.2 bits (108), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 5/126 (3%)
Query: 51 RGKVVPVLYHPQTTIDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYI 110
+ + +PV Q+ + L + + AI+GCDST + GK KAL + +NP+ +
Sbjct: 1293 KARYIPVHSLSQS-LGPQLCQALPAFHAITGCDSTGSFHGIGKKKALSVLRQNPEHQSNL 1351
Query: 111 RRFLDPASTTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTE 170
F E + E F+ LY + ++LRY + + ALPPT
Sbjct: 1352 AVFGQEPKLGEECFRSSENFVCDLYESGKAPCTTDELRYFIFCQKKQRN----EALPPTS 1407
Query: 171 SAASQH 176
++ H
Sbjct: 1408 NSLRHH 1413
>gi|291244877|ref|XP_002742320.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1507
Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 84/221 (38%), Gaps = 38/221 (17%)
Query: 22 VVLVGEDVDLLVILIGT---------RSPSNVYFLKPGRGK-----VVPVLYHPQTTIDT 67
VV+ D D++VILIG + N ++ G G V + ++
Sbjct: 1232 VVVRASDTDVIVILIGMLGRHLESHRETSYNRIIMECGSGNNHRHIDVSSIATALESMQN 1291
Query: 68 SLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITE-A 126
LA + + A +GCD T++ + +GKVK L + KN + +F S+ +
Sbjct: 1292 GLAAAMPGLHAFTGCDFTASFYRKGKVKPLEVLQKNT--TGTLTQFFSKLSSEDAPDQRK 1349
Query: 127 GERFLVVLYGGNHETTSLNKLRY------------KSYVTSALKVTSYIAALPPTESAAS 174
E F+ LYG + +N+ RY ++ +T+ KV + A LPP
Sbjct: 1350 AEEFICSLYGMKGDVKDVNEARYAKLLQMTGNKNQEASMTNVKKV--HCALLPPCAKTVH 1407
Query: 175 QHALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIP 208
+ W + P P ++GW P
Sbjct: 1408 NKLQRAHFISILWGNADSPHPGHGLDPLKYGWKDKNGYYAP 1448
>gi|156381299|ref|XP_001632203.1| predicted protein [Nematostella vectensis]
gi|156219255|gb|EDO40140.1| predicted protein [Nematostella vectensis]
Length = 1141
Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 93/243 (38%), Gaps = 29/243 (11%)
Query: 14 ASEEEDEVVVLVGEDV-DLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTS--- 68
+ EE D V+L +D D I R+ ++V L ++P+ DT
Sbjct: 875 SHEEADSRVILHVKDAHDQGFTKIAVRTVDTDVLVLSVAVLPLLPIQPKLWVAFDTGANF 934
Query: 69 --LAEHILFIRA-ISGCDSTSALFN------QGKVKALRTVVKNPDLLPYIRRFLDPAST 119
+A H+ F++ +S C +F Q K K +T + L P + +
Sbjct: 935 RCIAAHMPFLQTLVSKCLERFRIFMPSQAAIQFKGKGKKTAFETWKLYPEVTDVFIAPTV 994
Query: 120 TREITEAG----ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQ 175
E+++ ERF V+LY ++N+ + + + + ALPP+++A +
Sbjct: 995 PEEVSDIHMATIERFTVLLYDRTSSKLTVNEAKKQLFAQKGRP----LEALPPSKAALLE 1050
Query: 176 HALNKFYQVQQWLGKNVP-------PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
H YQ G+++ P WGW P T L+ C C
Sbjct: 1051 HTKRAAYQAGHCWGQSLARSPVLPSPEHWGWALSDGKWEPYWTALPDVTRVCQELIRCGC 1110
Query: 229 KKG 231
KKG
Sbjct: 1111 KKG 1113
>gi|156371271|ref|XP_001628688.1| predicted protein [Nematostella vectensis]
gi|156215671|gb|EDO36625.1| predicted protein [Nematostella vectensis]
Length = 616
Score = 45.1 bits (105), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 23/49 (46%)
Query: 184 VQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
VQ W G V P EWGW +P+ + AP LL + C+C C
Sbjct: 524 VQAWKGNGVSPEEWGWPVTCTGFVPVQMSEPAAPAQLLRNIKCNCGGHC 572
>gi|291230800|ref|XP_002735353.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1507
Score = 45.1 bits (105), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 75/182 (41%), Gaps = 28/182 (15%)
Query: 67 TSLAEHILFI--------RAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPAS 118
T+LA HI + A++GCD TS+ F +GKV L+ VKNP L + L
Sbjct: 1290 TALANHIGPVVCKSLPGHHALTGCDYTSSFFRKGKVNPLKKAVKNPLYLEGL-SILGENI 1348
Query: 119 TTREITEAGERFLVVLYGGNHETTSLNKLRYKSYVT--------SALKVTSYIAA-LPPT 169
T + E+++ LYG TS+ + R K ++ S L++ A LPP
Sbjct: 1349 TFADDENLVEQYVCSLYGQG-ALTSVYEARLKIFLQKYRPTNQESPLQIKGIDAGMLPPC 1407
Query: 170 ESAASQHALNKFYQVQQWLG-------KNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLS 222
+ Q Y W +N+ PT+ GW +P+ + P L
Sbjct: 1408 KDVLVQKLARCNYVAYLWKHAHIQNPLENIKPTDHGWKD--GVFLPVWFTGNQMPSMLSQ 1465
Query: 223 LV 224
++
Sbjct: 1466 II 1467
>gi|156368670|ref|XP_001627815.1| predicted protein [Nematostella vectensis]
gi|156214736|gb|EDO35715.1| predicted protein [Nematostella vectensis]
Length = 1195
Score = 44.7 bits (104), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 9/136 (6%)
Query: 22 VVLVGEDVDLLVILIG------TRSPSNVYFLKP-GRGKVV-PVLYHPQTTIDTSLAEHI 73
VVL D D+LVI + S ++V L G G + + H T T +
Sbjct: 1032 VVLRTVDTDVLVIAVAYFQKLQMLSQASVGLLVAFGTGAYLRDIAAHEVTGSFTGNVAAL 1091
Query: 74 LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLD-PASTTREITEAGERFLV 132
L A +GCD+ S + +GK KAL T P+++P + P+ E ERF++
Sbjct: 1092 LTFHAFTGCDTVSCFYGKGKKKALETWHIYPEIIPVFAAIANAPSEMLDEWMPILERFVI 1151
Query: 133 VLYGGNHETTSLNKLR 148
+LY +S+N R
Sbjct: 1152 LLYDRTSSASSVNDAR 1167
>gi|405972735|gb|EKC37485.1| hypothetical protein CGI_10018098 [Crassostrea gigas]
Length = 117
Score = 43.9 bits (102), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 46/99 (46%), Gaps = 10/99 (10%)
Query: 75 FIRAISGCDSTSALFNQG---KVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFL 131
FI AISGCD+T +F G K ++ V DL I + S+ + +AGE +
Sbjct: 23 FIHAISGCDTTFRMFGIGMGAAYKKFKSSVYIQDLAQRIMTY----SSKENVVQAGEEIV 78
Query: 132 VVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAA--LPP 168
L G E L+ LRY+ + + + Y+ LPP
Sbjct: 79 ACL-NGYVENEGLDLLRYRKFASKVVTGNMYVQVQTLPP 116
>gi|443692893|gb|ELT94391.1| hypothetical protein CAPTEDRAFT_204036 [Capitella teleta]
Length = 478
Score = 43.9 bits (102), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 46/85 (54%), Gaps = 9/85 (10%)
Query: 22 VVLVGEDVDLLVILIG-TRSPSNVYFLKPGRGKVVPV--------LYHPQTTIDTSLAEH 72
V +VGED DL+V+L+ T + LK + + ++ Q ++ + + H
Sbjct: 379 VTVVGEDTDLIVLLLHHTPNSCQTIVLKSDIHRSIKSHKQAKQWQIHSVQRSLGSEMCRH 438
Query: 73 ILFIRAISGCDSTSALFNQGKVKAL 97
+LFI A+ GCD+T++++ GK A+
Sbjct: 439 LLFIHALLGCDTTASIYGLGKGNAI 463
>gi|291235271|ref|XP_002737568.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
Length = 1293
Score = 43.5 bits (101), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 17/127 (13%)
Query: 22 VVLVGEDVDLLVILIGTRSPSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIRAISG 81
VV+ D D++VIL+G G + L ++ LA + + A +G
Sbjct: 1146 VVVRASDTDVIVILLGML------------GSIATAL----ESMQNGLAAAMPGLHAFTG 1189
Query: 82 CDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLYGGNHET 141
CD T++ + +GKVK L + KN + I+ F +S E F+ LYG +
Sbjct: 1190 CDFTASFYRKGKVKPLEVLQKNT-MRTLIQFFSKLSSEDAPDQRKAEEFICSLYGMKGDE 1248
Query: 142 TSLNKLR 148
S+ ++
Sbjct: 1249 ASMTNVK 1255
>gi|345490613|ref|XP_001599794.2| PREDICTED: hypothetical protein LOC100114943 [Nasonia vitripennis]
Length = 511
Score = 42.0 bits (97), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 182 YQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLV 224
Y + WLG ++ P WGW + N L+PI K AP LL +V
Sbjct: 427 YNHEIWLGFDLDPVLWGWKRTSNMLLPIMNPKPVAPVELLEMV 469
>gi|156342193|ref|XP_001620906.1| hypothetical protein NEMVEDRAFT_v1g222583 [Nematostella vectensis]
gi|156206361|gb|EDO28806.1| predicted protein [Nematostella vectensis]
Length = 545
Score = 41.6 bits (96), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 24/140 (17%)
Query: 22 VVLVGEDVDLLVILIGTRS-PSNVYFLKPGRGKVVPVLYHPQTTIDTSLAEHILFIR--- 77
+V+ ED D+LVI + +S ++ F+K + +T +D S ++ ++
Sbjct: 115 IVISSEDTDVLVIALSLKSFIASPLFIKTTK--------QSRTYVDVSKVVLVIGVQMCI 166
Query: 78 ------AISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPA---STTREITEAGE 128
A +GC+ SA F +GK KAL + +N ++ FL+ E+ + +
Sbjct: 167 ALPGFHAFTGCERASAFFGKGKAKALELLHQNSS---FVNLFLEMGMYWQLDEELFQKIQ 223
Query: 129 RFLVVLYGGNHETTSLNKLR 148
F +Y T+ +NKLR
Sbjct: 224 EFTCTMYSYLAGTSDVNKLR 243
>gi|156363424|ref|XP_001626044.1| predicted protein [Nematostella vectensis]
gi|156212905|gb|EDO33944.1| predicted protein [Nematostella vectensis]
Length = 1194
Score = 41.2 bits (95), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 69 LAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGE 128
+ ++ILF AI GCD+TSAL+ +L+ + + F + +I AGE
Sbjct: 1042 ICKNILFAHAILGCDTTSALYGL----SLKMLTSDATSRQQADIFHQADAAKNDIAAAGE 1097
Query: 129 RFLVVLYGGNHETTSLNKLRYKSY 152
L+ LY G + T L+ LRY +
Sbjct: 1098 TALLCLYKGLKDET-LDSLRYARF 1120
>gi|405970723|gb|EKC35601.1| hypothetical protein CGI_10016597 [Crassostrea gigas]
Length = 87
Score = 40.8 bits (94), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 184 VQQWLGK-NVPPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
++ W+ K N+ P EWGW L+P+ + AP LL+++ +CK C
Sbjct: 1 MKMWMNKDNLNPCEWGWKVANGNLVPVKCTMNAAPSKLLNIIRYNCKTNC 50
>gi|291239442|ref|XP_002739632.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 172
Score = 40.8 bits (94), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 12/118 (10%)
Query: 120 TREITEAGERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALN 179
+ E+ E + LYG + +N+ R + + S+ LPP + +H
Sbjct: 21 SEELLSQLEALVCALYGATKGVSDVNQCRCGVFCAKKGEAESH--QLPPCKDCLYKHCQR 78
Query: 180 KFYQVQQWLGK----NVP-PTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSCKKGC 232
YQ W VP P GW + + + PAP +++ L++C CKK C
Sbjct: 79 ANYQASVWRNSLQNAEVPTPVGKGWALEHDGMTGL-----PAPIAVIELMSCMCKKKC 131
>gi|332027826|gb|EGI67889.1| hypothetical protein G5I_03478 [Acromyrmex echinatior]
Length = 89
Score = 40.4 bits (93), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 180 KFYQVQQWLGKNVPPTEWGWNQVGNTLIPITTLKSPAPD 218
++YQVQ+WLG + P +W W + N L PI T S PD
Sbjct: 26 RYYQVQKWLGYELNPEDWRWIMITNILEPIQTFLS-TPD 63
>gi|156372302|ref|XP_001628977.1| predicted protein [Nematostella vectensis]
gi|156215967|gb|EDO36914.1| predicted protein [Nematostella vectensis]
Length = 604
Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 166 LPPTESAASQHALNKFYQVQQWLGKNVP------PTEWG--WNQVGNTLIPITTLKSPAP 217
L PT +A Q L YQ+ W V P+ +G W +P+ PAP
Sbjct: 474 LAPTSAALKQAILRSHYQLLVWNNDIVANPVLPSPSVYGRKWQNAKKIWVPVMRDLPPAP 533
Query: 218 DSLLSLVACSCKKG 231
D+++ LV C C+ G
Sbjct: 534 DAIIHLVRCGCEYG 547
>gi|291221207|ref|XP_002730614.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1516
Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 76 IRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVLY 135
+ A++GCD + +GKVK + K + ++ L A T + +A E+F+ LY
Sbjct: 1319 LHAMTGCDYIPSFLGKGKVKPFELMRKQACFIAFMAA-LGNAPLTTDTVKACEKFVCHLY 1377
Query: 136 GGNHETTSLNKLRYKSY 152
G + + +S+N+ RY +
Sbjct: 1378 GQS-KLSSVNQARYMMF 1393
>gi|339483351|ref|YP_004695137.1| hypothetical protein Nit79A3_1940 [Nitrosomonas sp. Is79A3]
gi|338805496|gb|AEJ01738.1| hypothetical protein Nit79A3_1940 [Nitrosomonas sp. Is79A3]
Length = 651
Score = 37.4 bits (85), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 57/140 (40%), Gaps = 14/140 (10%)
Query: 79 ISGCDSTSALFNQGKVKALRTVVKNPDLL--------PYIRRFLDPASTTREITEAGERF 130
S D S N ++A+RT+ N +L Y+ + L ST R + G F
Sbjct: 123 FSKSDYESLQLNNAYLEAMRTIFSNTTVLFLGYGLRDEYVFQNLQTCSTIRPLFGTGPHF 182
Query: 131 LVVLYGGNHETTSLNKLRYKSYVT---SALKVTSYIAALPPTESAASQHALNKFYQVQQ- 186
+V N + ++RY++ T +++ IA P++S + + K QQ
Sbjct: 183 IVTSEERNELPAIVRRIRYEADATDHRDVIQILEVIADFTPSQSIQTANEDEKTISSQQS 242
Query: 187 --WLGKNVPPTEWGWNQVGN 204
++ + P W +Q N
Sbjct: 243 IYYIADLITPGTWTTSQTFN 262
>gi|156355296|ref|XP_001623606.1| predicted protein [Nematostella vectensis]
gi|156210323|gb|EDO31506.1| predicted protein [Nematostella vectensis]
Length = 416
Score = 37.4 bits (85), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 56/126 (44%), Gaps = 12/126 (9%)
Query: 15 SEEEDEVVVLVGEDVDLLVILI--GTRSPSNVYFLKPGRGKVVPVLYHPQTT---IDTSL 69
++ ++ +L+GED DLLV+L+ P R V ++ + T +
Sbjct: 291 AKAREQTSILIGEDTDLLVLLLYHAEMEPRQ-------RATAVRKVWDIEKTKAALGRDS 343
Query: 70 AEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGER 129
ILF+ A+ GC++TS + + AL+ + F S+ EI EAGE
Sbjct: 344 TSGILFVHALLGCNTTSRIHGIWEGVALKRAKISTQFRELAGVFSSADSSRVEIIEAGEM 403
Query: 130 FLVVLY 135
L+ ++
Sbjct: 404 ALLNIF 409
>gi|219110080|ref|XP_002176792.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411327|gb|EEC51255.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 791
Score = 37.0 bits (84), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 13/111 (11%)
Query: 65 IDTSLAEHILFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREIT 124
ID +H L A +G S + ++ +V+ LRT+ ++P L Y + A + E+
Sbjct: 336 IDNDQTKHDLA--ATAGHQSNRSRWSDDRVRQLRTLARDP--LIYEKLTASLAPSIWELD 391
Query: 125 EAGERFLVVLYGGNHETTSLNKLRYKSYV---------TSALKVTSYIAAL 166
+ L +L+GGNH T + +KL + V TS ++ SY+ L
Sbjct: 392 NVKKGILCMLFGGNHGTGATSKLNKRGDVNILLCGDPGTSKSQLLSYVHKL 442
>gi|156353856|ref|XP_001623126.1| predicted protein [Nematostella vectensis]
gi|156209789|gb|EDO31026.1| predicted protein [Nematostella vectensis]
Length = 703
Score = 37.0 bits (84), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 39/107 (36%), Gaps = 10/107 (9%)
Query: 128 ERFLVVLYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQW 187
ERF V+LY +N R + + K + LPPT+ A +H YQ W
Sbjct: 601 ERFTVILYDKLSPFACINDARMELFC----KNNRAMDKLPPTKDALLKHVKRCIYQAGIW 656
Query: 188 LGKNV------PPTEWGWNQVGNTLIPITTLKSPAPDSLLSLVACSC 228
+ P ++ W + T +P + L CSC
Sbjct: 657 TSSEISQPVLPSPDQFAWIKEAGTWVPRWITIPEVSKACSELTKCSC 703
>gi|405978391|gb|EKC42788.1| hypothetical protein CGI_10015106 [Crassostrea gigas]
Length = 411
Score = 36.6 bits (83), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 13/131 (9%)
Query: 22 VVLVGEDVDLLVILIGTRSPSN-------VYFLKPGRGKVVPVLYHPQTTIDTSLAEHIL 74
+V+ D D+L++L+ S N V KP + V+ ++D +H++
Sbjct: 275 IVISASDTDVLLLLVSFYSEINRRLWMVAVSSSKPKNIPIHAVVEKNFPSLDRR--KHLV 332
Query: 75 FIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVVL 134
RA++GCD+TS + K AL+ N +LL L T ++ + E+F+ ++
Sbjct: 333 AFRALTGCDTTSFFYGISKKSALKVYRDNYELLEG----LGEGDLTDQMIKDCEKFVYLV 388
Query: 135 YGGNHETTSLN 145
E+T+ N
Sbjct: 389 LIVLDESTAFN 399
>gi|255033875|ref|YP_003084496.1| hypothetical protein Dfer_0060 [Dyadobacter fermentans DSM 18053]
gi|254946631|gb|ACT91331.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 368
Score = 36.6 bits (83), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 12/115 (10%)
Query: 74 LFIRAISGCDSTSALFNQGKVKALRTVVKNPDLLPYIRRFLDPASTTREITEAGERFLVV 133
L +R IS D +A FN VKA V +P LL + ++D T+ +I G F +
Sbjct: 173 LSVREISAVDLCNAHFN---VKAEH--VLDPTLLIERQTYIDLFQTSNKIHSGGTLFSYI 227
Query: 134 LYGGNHETTSLNKLRYKSYVTSALKVTSYIAALPPTESAASQHALNKFYQVQQWL 188
L + +T+++ ++ L VTS+ + P + LN Y V+QWL
Sbjct: 228 LDMNDEKTSAI------KHIEKQLNVTSFSPSKPSIKKNVPIE-LNIEYSVEQWL 275
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,659,751,083
Number of Sequences: 23463169
Number of extensions: 144749385
Number of successful extensions: 339012
Number of sequences better than 100.0: 104
Number of HSP's better than 100.0 without gapping: 51
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 338838
Number of HSP's gapped (non-prelim): 118
length of query: 232
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 94
effective length of database: 9,121,278,045
effective search space: 857400136230
effective search space used: 857400136230
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)