BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy16584
(202 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|322801276|gb|EFZ21963.1| hypothetical protein SINV_07046 [Solenopsis invicta]
Length = 252
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 119/202 (58%), Gaps = 2/202 (0%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D+P +H GR+RSF H+R +W TL+YI T+ + + L+ + + +I H+S
Sbjct: 51 IDDPLDHNGRVRSFKHERGNWVTLIYINY-TSSDCFHTWINSVLSKLPVEGSIISNLHIS 109
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
LS+TLV+ +HWI++ VE++ + R N+ ++F I ++CNEE+TR+F+ + L
Sbjct: 110 LSRTLVLKFHWIESFVESIKQSCRKFNKFVLQFTDIRVYCNEERTRTFLGIYCRDEDGML 169
Query: 121 TSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTS-D 179
+ D E++LP++Y++ ++H S WCL DK A LK +L L + +F + +
Sbjct: 170 KCLTDVFDNVLAEYQLPSFYKDTSYHISFFWCLGDKRACLKEILPSLTSSLNKFLAENME 229
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
E++ V I K GNK Y+F L
Sbjct: 230 EAYMHVNDIQCKIGNKCYTFEL 251
>gi|149640758|ref|XP_001508138.1| PREDICTED: UPF0406 protein C16orf57-like [Ornithorhynchus anatinus]
Length = 233
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 120/204 (58%), Gaps = 3/204 (1%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPH 58
+D+ +HGGR+R+FPH+R +WAT VY+P + + L +L + V + + H
Sbjct: 29 VDDVTKHGGRVRTFPHERGNWATHVYVPYEAKEDFFELLDLLVSRAKASVPHVVTMEKFH 88
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ YHWI V++L + +R + ++I+ NE KTR+FI L + T
Sbjct: 89 LSLSQSVVLRYHWITPFVQSLKERVASFHRFFFSADRVKIYTNEGKTRTFIGLEVTAGHT 148
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
L +V VD+ +EF LPT+Y+EP+FH S+AWC+ D T L+ + +L + F+ +
Sbjct: 149 QLLRLVSEVDRVMEEFDLPTFYKEPSFHISLAWCVGDATGELEGQCVRELQDTVNGFEDS 208
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
S I K+GNKF+SFPL
Sbjct: 209 SVFLRVPSEQIRCKSGNKFFSFPL 232
>gi|332021123|gb|EGI61510.1| UPF0406 protein C16orf57-like protein [Acromyrmex echinatior]
Length = 253
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 117/202 (57%), Gaps = 2/202 (0%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D+P +H GRIRSF H+R +WATL+YI + L+ +K LN + + +I H+S
Sbjct: 52 IDDPLDHDGRIRSFKHERGNWATLIYINYIPSDC-LHTWMKSVLNKLPVEGNIISSLHIS 110
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
LS+TLV+ HWI++ VE + R N+ I+ + ++CNEE+TR+F+ + L
Sbjct: 111 LSRTLVLKLHWIESFVEDIKLACRSFNKFIIQLTDVRVYCNEERTRTFLGIYCQDEDKML 170
Query: 121 TSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDE 180
+ + D E++LP++Y++ ++H S WCL DK A LK +L L + +F + E
Sbjct: 171 KCLTEIFDNLLAEYQLPSFYKDTSYHISFFWCLGDKQACLKEILPPLTSSLNKFLAENME 230
Query: 181 SFHV-VTHIHMKTGNKFYSFPL 201
+V V I K GNK Y+F L
Sbjct: 231 DAYVHVNDIQCKIGNKCYTFKL 252
>gi|187607690|ref|NP_001120615.1| uncharacterized protein LOC100145778 [Xenopus (Silurana)
tropicalis]
gi|171846494|gb|AAI61753.1| LOC100145778 protein [Xenopus (Silurana) tropicalis]
Length = 250
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 7/206 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV----GISVEVIPE 56
+D+ +HGGRIRSF H+R +WAT VYIP Q + L +EL SV G+ + + E
Sbjct: 46 LDDRTKHGGRIRSFTHERGNWATYVYIPFQPQDE--FLDLVDELLSVAAEHGVFLTKMSE 103
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
H+S S+T+V+ +HWI+ +E+L + L + R + I+++ N+EKTR+F+ L +
Sbjct: 104 FHISQSQTVVLRHHWINPFIESLKDKLHCMYRFLCIADRIKVYTNQEKTRTFLGLEVSVG 163
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
L +V VD+S +EF L T+YEEP+FH S+AWC+ DK L+ L +L + +F+
Sbjct: 164 SEHLLEVVSEVDQSLKEFNLQTFYEEPSFHMSLAWCVGDKAGKLEGSCLVELQKVIDRFE 223
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + I K GNK + PL
Sbjct: 224 DSDILTRFNAEEIRCKAGNKTFCIPL 249
>gi|307192126|gb|EFN75454.1| UPF0406 protein C16orf57-like protein [Harpegnathos saltator]
Length = 257
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 117/202 (57%), Gaps = 2/202 (0%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D+P H GRIRSF H+R +WATLVYI + L L L + + +VI + H+S
Sbjct: 56 IDDPLNHDGRIRSFKHERGNWATLVYINYAASDC-LRTWLNSVLKDLPVKGDVISKLHIS 114
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
LS+TLV+ YHWI++ E L R + ++ I ++CNEEKTR+F+ + + +L
Sbjct: 115 LSRTLVLKYHWIESFTENLKLLCRKFSPFIVQLTDIRVYCNEEKTRTFLGIYCQNDDGTL 174
Query: 121 TSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQF-KLTSD 179
+++A+D E++LP YY++ ++H S WCL D+ LK +L L + F S+
Sbjct: 175 KCLIEALDNLLAEYQLPLYYKDTSYHISFFWCLGDQRICLKKILPSLTHSLNVFLAENSE 234
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
+++ V I K GNK Y+F L
Sbjct: 235 DNYLNVNEIQCKIGNKCYAFDL 256
>gi|126305197|ref|XP_001376322.1| PREDICTED: UPF0406 protein C16orf57 homolog [Monodelphis domestica]
Length = 268
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 120/204 (58%), Gaps = 3/204 (1%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPH 58
+D+ +HGGRIR+FPH+R WAT V++P + + L ++L + V + + + H
Sbjct: 64 VDDIEKHGGRIRTFPHERGIWATHVWVPYEAREDFFDLLSLLMTQARRVLPRLVKMEDFH 123
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ YHWI V++L +L +R N ++++ N+EKTR+FI L +
Sbjct: 124 LSLSQSVVLRYHWISPFVQSLKKHLASFHRFLFTANQVKVYTNQEKTRTFIGLEVTAGHA 183
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLT 177
+V VDK +EF L T+Y+EP+FH S+AWC+ D + L+ + +L I F+ +
Sbjct: 184 QFLDLVSEVDKVMEEFDLSTFYKEPSFHISLAWCVGDGRSKLQAQAIQELQEIVDGFEDS 243
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNK++SFPL
Sbjct: 244 ARLLCVGAEQVRCKSGNKYFSFPL 267
>gi|345487031|ref|XP_001603072.2| PREDICTED: UPF0406 protein C16orf57 homolog [Nasonia vitripennis]
Length = 258
Score = 146 bits (368), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 117/202 (57%), Gaps = 2/202 (0%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D+P EH GRIRSF H+R +WATLVYI + + +KE L+ + + + E H+S
Sbjct: 57 VDDPLEHDGRIRSFKHERGNWATLVYIDYIPS-EEMQLWMKEVLDQITQTGNIFQEFHVS 115
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
L++TLV+ +HWID+ +E + + ++ ++EI+ NEEKTR+FI + S SL
Sbjct: 116 LTRTLVLKFHWIDSFIEAVKSLTASYQSFVLELGNMEIYSNEEKTRTFIGIKIQSANDSL 175
Query: 121 TSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDE 180
+ A+DK E++LP++YE+ ++H S W L D+ TL+ LL + FT F E
Sbjct: 176 QRLTGALDKLLDEYQLPSFYEDASYHISFLWFLGDQRKTLEALLPTFTSSFTDFLDDHPE 235
Query: 181 SFHV-VTHIHMKTGNKFYSFPL 201
+ V + K GNK YS L
Sbjct: 236 QRSMQVKKLKCKIGNKLYSLKL 257
>gi|260798388|ref|XP_002594182.1| hypothetical protein BRAFLDRAFT_201261 [Branchiostoma floridae]
gi|229279415|gb|EEN50193.1| hypothetical protein BRAFLDRAFT_201261 [Branchiostoma floridae]
Length = 222
Score = 145 bits (367), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 115/204 (56%), Gaps = 6/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEV--IPEPHL 59
DNP EHGGRIRSFPH +WAT VY+ + + A +K L S+ V++ + + H+
Sbjct: 20 DNPEEHGGRIRSFPHVAGNWATHVYVSFEPD-ADFTDGVKHLLGSLEADVQLQQVQDYHI 78
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+T+ + +HWI +++L + LRH R F+ +++ N+EKTRSF+ L
Sbjct: 79 SLSRTVPLTFHWIQPFIDSLRDALRHTERFFCTFHGAKVYTNDEKTRSFLGLEVTGGHNH 138
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIF--TQFKLT 177
L +V VD +EF+L YY+ P+FH ++ WC+ D + KL Q +
Sbjct: 139 LLQLVLGVDSCLEEFRLQKYYKNPSFHMTVGWCVGDVHEDTQQFNQKLQASLAEAQEEFP 198
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
S F +V I K+GN+F+SFPL
Sbjct: 199 SLGGF-LVQEIKCKSGNRFFSFPL 221
>gi|148227076|ref|NP_001079479.1| putative U6 snRNA phosphodiesterase [Xenopus laevis]
gi|82176830|sp|Q7ZYI9.1|USB1_XENLA RecName: Full=Putative U6 snRNA phosphodiesterase
gi|27694652|gb|AAH43765.1| MGC52944 protein [Xenopus laevis]
Length = 250
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 118/206 (57%), Gaps = 7/206 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV----GISVEVIPE 56
+D +H GRIRSF H+R +WAT VYIP Q + L +EL SV G+ + + E
Sbjct: 46 LDENTKHEGRIRSFKHERGNWATYVYIPFQPQ--EEFLDLLDELVSVAAENGVLLTKMSE 103
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
H+S S+T+V+ +HWI+ VE+L + L + R I+++ N+EKTR+F+ L +
Sbjct: 104 FHISQSQTVVLRHHWINPFVESLKDKLHCMYRFLCIAERIKVYTNQEKTRTFLGLEVSVG 163
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
L +V VD+S QEF L T+Y+EP+FH S+AWC+ DK LK L +L + +F+
Sbjct: 164 MEHLLEVVSEVDRSLQEFNLQTFYQEPSFHVSLAWCVGDKYEKLKGSCLLELQKVIDRFE 223
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + I K GNK + PL
Sbjct: 224 DSDTLTRFNAEEIRCKAGNKTFCIPL 249
>gi|327291127|ref|XP_003230273.1| PREDICTED: UPF0406 protein C16orf57 homolog, partial [Anolis
carolinensis]
Length = 230
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 120/203 (59%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNSVGISVEVIPEPHL 59
D +HGGR+R+FPH+R +WAT VY+P + + L L + S+ +PE H+
Sbjct: 27 DESEKHGGRLRTFPHERGNWATHVYMPYVAEEDFQGLLQALLCCARTYVPSLSPLPEFHI 86
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+ +V+ YHWI V++L LR ++R + + ++++ N+ KTR+F+ L +S +
Sbjct: 87 SLSQVVVLRYHWIAPFVQSLKERLRSVSRFICRADQVKVYTNQTKTRTFLGLEVSSGYSK 146
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL-KPLLTKLDNIFTQFKLTS 178
L+ +V VDK +EF L T+Y++P+FH S+AW L D + L L +L F+ +S
Sbjct: 147 LSELVSEVDKVMEEFSLATFYKDPSFHLSLAWGLGDLSEALGGQRLQELQETVDGFEDSS 206
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
T + K+GNKF+SFPL
Sbjct: 207 SFLRIPGTEVRCKSGNKFFSFPL 229
>gi|193676554|ref|XP_001949239.1| PREDICTED: UPF0406 protein C16orf57-like [Acyrthosiphon pisum]
Length = 235
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 117/201 (58%), Gaps = 9/201 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSL 61
D P+ H GR R F H+R +WAT VY+P + L +++++L G+ E+I PH+SL
Sbjct: 44 DEPDSHQGRSRLFEHERGNWATYVYVP--CPVMDLVEIVQDQLTKFGL--EIIKHPHISL 99
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
+KT+V+ YHWI + + + L + R T+ F +E+FCNE +R+FI + ++
Sbjct: 100 TKTVVLQYHWIQRFINDVKSKLSTILRFTVTFGDLEVFCNENCSRTFIGFLVHPAGV-IS 158
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDES 181
V +D ++ LP YYE+P FH S+AWCL D++ L+ TKL ++ + ++ + S
Sbjct: 159 QCVDQLDNVLSDYNLPKYYEDPKFHLSVAWCLGDQSTQLR---TKLKSLELEIEVDTCRS 215
Query: 182 FHVVTHIHMKTGNKFYSFPLT 202
V + KTGNK Y+F L
Sbjct: 216 LR-VDKLMCKTGNKKYNFDLV 235
>gi|345793897|ref|XP_853879.2| PREDICTED: UPF0406 protein C16orf57 homolog [Canis lupus
familiaris]
Length = 273
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 117/205 (57%), Gaps = 5/205 (2%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEP 57
+D+P +HGGR+R+FPH+R +WAT VY+P +T L L +L V V+ +
Sbjct: 69 VDDPEKHGGRVRTFPHERGNWATHVYVPYETREEFLDLLDVLLPHAQTYVPRLVQ-MEAF 127
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
HLSLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L S
Sbjct: 128 HLSLSQSVVLRHHWILPFVQALKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVTSGH 187
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKL 176
+V VD+ +EF L T+Y+ P+FH S+AWC+ D L+ L +L NI +F+
Sbjct: 188 NQFLDLVSEVDRVMEEFDLTTFYQAPSFHISLAWCVDDAHLQLEGQCLQELQNIVDEFED 247
Query: 177 TSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 248 SEMVLRAHAEQVRCKSGNKFFSMPL 272
>gi|255683453|ref|NP_598715.2| putative U6 snRNA phosphodiesterase [Mus musculus]
gi|74144469|dbj|BAE36080.1| unnamed protein product [Mus musculus]
gi|74183292|dbj|BAE22567.1| unnamed protein product [Mus musculus]
gi|74219896|dbj|BAE40531.1| unnamed protein product [Mus musculus]
Length = 267
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 117/203 (57%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGRIR+FPH+R +WAT +YIP + + L L + ++ E H+
Sbjct: 64 DDSAKHGGRIRTFPHERGNWATHIYIPYEAKEDFRDLLDALLPRAQMFVPRLVLMEEFHV 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L +S
Sbjct: 124 SLSQSVVLRHHWILPFVQVLKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVSSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
+V VD++ +EF L T+Y++P+FH S+AWC+ D + L+ L +L I +F+ +
Sbjct: 184 FLDLVSEVDRAMKEFDLTTFYQDPSFHVSLAWCVGDASLQLEGQCLQELQEIVDEFEDSE 243
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 244 MLLRVLANQVRCKSGNKFFSMPL 266
>gi|426243552|ref|XP_004015616.1| PREDICTED: putative U6 snRNA phosphodiesterase [Ovis aries]
Length = 250
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 117/207 (56%), Gaps = 9/207 (4%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIP 55
+D+ +HGGR+R+FPH+R +WAT VY+P + L L A+L V + +E
Sbjct: 46 VDDSAQHGGRVRTFPHERGNWATHVYVPYEAREEFLDLLDALLCHAQTYVPRLVRMEAF- 104
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
HLSLS+++V+ +HWI V+ L + + R + I+I+ N+EKTR+F+ L S
Sbjct: 105 --HLSLSQSVVLRHHWILPFVQALKDRVASFQRFCFTADQIKIYTNQEKTRTFVGLEVTS 162
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQF 174
+V VD+ +EF L T+Y++P+FH S+AWC+ D ++ P L +L I +F
Sbjct: 163 GHAHFLDLVAEVDRVMEEFDLSTFYQDPSFHISLAWCVGDARLQMEGPCLQELQGIVDEF 222
Query: 175 KLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + I K+GNKF+S PL
Sbjct: 223 EDSEMLLRAYAEQIRCKSGNKFFSMPL 249
>gi|81879558|sp|Q91W78.1|USB1_MOUSE RecName: Full=Putative U6 snRNA phosphodiesterase
gi|16741131|gb|AAH16418.1| Expressed sequence AA960436 [Mus musculus]
gi|148679221|gb|EDL11168.1| expressed sequence AA960436, isoform CRA_b [Mus musculus]
Length = 267
Score = 142 bits (358), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 117/203 (57%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGRIR+FPH+R +WAT +YIP + + L L + ++ E H+
Sbjct: 64 DDSAKHGGRIRTFPHERGNWATHIYIPYEAKEDFRDLLDALLPRAQMFVPRLVLMEEFHV 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L +S
Sbjct: 124 SLSQSVVLRHHWILPFVQVLKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVSSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
+V VD++ +EF L T+Y++P+FH S+AWC+ D + L+ L +L I +F+ +
Sbjct: 184 FLDLVSEVDRAMKEFDLTTFYQDPSFHISLAWCVGDASLQLEGQCLQELQEIVDEFEDSE 243
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 244 MLLRVLANQVRCKSGNKFFSMPL 266
>gi|354495448|ref|XP_003509842.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Cricetulus
griseus]
gi|344256614|gb|EGW12718.1| UPF0406 protein C16orf57-like [Cricetulus griseus]
Length = 268
Score = 142 bits (358), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 116/204 (56%), Gaps = 5/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGRIR+FPH+R +WAT +YIP + L L +L V V+ + E H
Sbjct: 65 DDSTKHGGRIRTFPHERGNWATHIYIPYEAKEEFLDLLDVLLSRAQTFVPRLVQ-MEEFH 123
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L +++ R N ++I+ N+EKTR+FI L S
Sbjct: 124 LSLSQSVVLRHHWILPFVQALKDHMASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSGHA 183
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y+ P+FH S+AWC+ D L+ L +L I +F+ +
Sbjct: 184 QFLDLVSEVDRVMEEFDLTTFYQNPSFHVSLAWCVGDACLQLEGQCLQELQEIVDEFEDS 243
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 244 EMLLRVLAEQVRCKSGNKFFSMPL 267
>gi|62078755|ref|NP_001014035.1| putative U6 snRNA phosphodiesterase [Rattus norvegicus]
gi|81882988|sp|Q5I0I5.1|USB1_RAT RecName: Full=Putative U6 snRNA phosphodiesterase
gi|56971346|gb|AAH88280.1| Similar to expressed sequence AA960436 [Rattus norvegicus]
gi|149032398|gb|EDL87289.1| rCG39094, isoform CRA_a [Rattus norvegicus]
Length = 267
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 113/203 (55%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ HGGRIR+FPH+R +WAT +YIP + N L +L + + E HL
Sbjct: 64 DDSARHGGRIRTFPHERGNWATHIYIPYEANEEFQDLLDVLLPRAQMFAPRLVQMEEFHL 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L +S
Sbjct: 124 SLSQSVVLRHHWILPFVQVLKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVSSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTK-LDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ + L I +F+ +
Sbjct: 184 FLDMVSEVDRVMKEFDLTTFYQDPSFHVSLAWCVGDARLQLEGQCQQELQEIVDEFEDSE 243
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 244 MLLRVLAEQVRCKSGNKFFSMPL 266
>gi|115495463|ref|NP_001068945.1| putative U6 snRNA phosphodiesterase [Bos taurus]
gi|122145270|sp|Q0II50.1|USB1_BOVIN RecName: Full=Putative U6 snRNA phosphodiesterase
gi|113912219|gb|AAI22808.1| Chromosome 16 open reading frame 57 ortholog [Bos taurus]
gi|296477941|tpg|DAA20056.1| TPA: hypothetical protein LOC510934 [Bos taurus]
Length = 265
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 118/207 (57%), Gaps = 9/207 (4%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIP 55
+D+ +HGGR+R+FPH+R +WAT VYIP + L L A+L V + +E
Sbjct: 61 VDDSAKHGGRVRTFPHERGNWATHVYIPYEAREEFLDLLDALLCHAQTYVPRLVRMEAF- 119
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
HLSLS+++V+ +HWI V+ L + + +R + ++I+ N+EKTR+F+ L S
Sbjct: 120 --HLSLSQSVVLRHHWILPFVQALKDRVASFHRFCFTTDQVKIYTNQEKTRTFVGLEVTS 177
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQF 174
+V VD+ +EF L T+Y++P+FH S+AWC+ D ++ P L +L I +F
Sbjct: 178 GHAHFLDLVAEVDRVMEEFDLSTFYQDPSFHISLAWCVGDARLQMEGPCLQELQGIVDEF 237
Query: 175 KLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + I K+GNKF+S PL
Sbjct: 238 EDSEMLLRAYAEQIRCKSGNKFFSMPL 264
>gi|301752964|ref|XP_002912318.1| PREDICTED: UPF0406 protein C16orf57 homolog [Ailuropoda
melanoleuca]
Length = 269
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 118/207 (57%), Gaps = 11/207 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VY+P +T L L +L V V+ + H
Sbjct: 66 DDREKHGGRVRTFPHERGNWATHVYVPYETREEFLGLLDVLLPHAQTYVPRLVQ-MEAFH 124
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+F+ L S
Sbjct: 125 LSLSQSVVLRHHWILPFVQALKDRVASFQRFFFTANRVKIYTNQEKTRTFVGLEVTSGHA 184
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L NI +F+
Sbjct: 185 QFLDLVSEVDRVMEEFDLTTFYQDPSFHVSLAWCVGDARLQLEGRCLRELQNIVDEFE-- 242
Query: 178 SDESFHVVTH---IHMKTGNKFYSFPL 201
D + H + K+GNKF+S PL
Sbjct: 243 -DPEMVLRAHAEQVRCKSGNKFFSMPL 268
>gi|281346655|gb|EFB22239.1| hypothetical protein PANDA_000050 [Ailuropoda melanoleuca]
Length = 231
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 118/207 (57%), Gaps = 11/207 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VY+P +T L L +L V V+ + H
Sbjct: 28 DDREKHGGRVRTFPHERGNWATHVYVPYETREEFLGLLDVLLPHAQTYVPRLVQ-MEAFH 86
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+F+ L S
Sbjct: 87 LSLSQSVVLRHHWILPFVQALKDRVASFQRFFFTANRVKIYTNQEKTRTFVGLEVTSGHA 146
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L NI +F+
Sbjct: 147 QFLDLVSEVDRVMEEFDLTTFYQDPSFHVSLAWCVGDARLQLEGRCLRELQNIVDEFE-- 204
Query: 178 SDESFHVVTH---IHMKTGNKFYSFPL 201
D + H + K+GNKF+S PL
Sbjct: 205 -DPEMVLRAHAEQVRCKSGNKFFSMPL 230
>gi|440902642|gb|ELR53412.1| hypothetical protein M91_00070 [Bos grunniens mutus]
Length = 265
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 117/207 (56%), Gaps = 9/207 (4%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIP 55
+D+ +HGGR+R+FPH+R +WAT VY+P + L L A+L V + +E
Sbjct: 61 VDDSAKHGGRVRTFPHERGNWATHVYVPYEAREEFLDLLDALLCHAQTYVPRLVRMEAF- 119
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
HLSLS+++V+ +HWI V+ L + + R + ++I+ N+EKTR+F+ L S
Sbjct: 120 --HLSLSQSVVLRHHWILPFVQALKDRVASFQRFCFTTDQVKIYTNQEKTRTFVGLEVTS 177
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQF 174
+V VD+ +EF L T+Y++P+FH S+AWC+ D ++ P L +L I +F
Sbjct: 178 GHAHFLDLVAEVDRVMEEFDLSTFYQDPSFHISLAWCVGDARLQMEGPCLQELQGIVDEF 237
Query: 175 KLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + I K+GNKF+S PL
Sbjct: 238 EDSEMLLRAYAEQIRCKSGNKFFSMPL 264
>gi|395508703|ref|XP_003758649.1| PREDICTED: UPF0406 protein C16orf57 homolog [Sarcophilus harrisii]
Length = 257
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 117/204 (57%), Gaps = 3/204 (1%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPH 58
+D+ +HGGR+R+FPH+R WAT V++P + + L +L + V + + E H
Sbjct: 53 VDDAEKHGGRVRTFPHERGIWATHVWVPYEAKEDFFDLLDILMTQAQKVLPRLVKMKEFH 112
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
+SLS+++V+ YHWI +++L + R N ++++ N+EKTR+F+ L + +
Sbjct: 113 ISLSQSVVLRYHWITPFMQSLKGRMAPFYRFLFTANQVKVYTNQEKTRTFLGLEVTAGHS 172
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y+EP+FH S+AWC+ D + L+ ++ +L F +
Sbjct: 173 HFLDLVSEVDRVMEEFDLATFYKEPSFHISLAWCVGDARSRLEEGVVRELQETVNSFDDS 232
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+SFPL
Sbjct: 233 APLLRVRAEQVRCKSGNKFFSFPL 256
>gi|149699601|ref|XP_001494731.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Equus
caballus]
Length = 265
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 115/206 (55%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYVPYEAGEDFLELLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
H+SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+F+ L S
Sbjct: 120 -HVSLSQSVVLRHHWILPFVQALKDRVASFQRFCFTANQVKIYTNQEKTRTFVGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF LPT+Y++P+FH S+AWC+ D L+ L +L I +F+
Sbjct: 179 HAQFLDLVSEVDRVMEEFDLPTFYQDPSFHISLAWCVGDARLQLEGQCLRELQAIVDEFE 238
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 239 DSEILLRVRAGQVRCKSGNKFFSMPL 264
>gi|340720887|ref|XP_003398860.1| PREDICTED: UPF0406 protein C16orf57 homolog [Bombus terrestris]
Length = 253
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 119/205 (58%), Gaps = 9/205 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP- 57
DNP +H GR+RSF H+R +WATL+YI P + L+ ++++L E V + + E
Sbjct: 52 DNPLQHEGRVRSFKHERGNWATLIYIGYKPSEDMLSWMFSVLGE----VPVKCNIFSEEF 107
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SLS+TL++ +HWI++ VE + ++ ++ ++ + NEEKTR+F+ + CK
Sbjct: 108 HISLSRTLILKFHWIESFVEEIKKLCEQTDQFNLELLNVRAYTNEEKTRTFLGIECIDCK 167
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
L V+ V+K E+ LP +YE+ ++H S WCL ++ + L L QF +
Sbjct: 168 GVLGHFVKNVNKFLAEYDLPAFYEDSSYHVSFLWCLGNELSVLDDQAHFLTIKLNQFLIE 227
Query: 178 SDESFHV-VTHIHMKTGNKFYSFPL 201
E+ ++ VT I++K GNK Y F L
Sbjct: 228 HAEARYINVTKIYLKIGNKLYVFKL 252
>gi|449472398|ref|XP_004175234.1| PREDICTED: uncharacterized protein LOC101234041 [Taeniopygia
guttata]
Length = 453
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 117/203 (57%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP--LQTNLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ + HGGR+R FPH+R +WAT VY+P Q L +L + S+ + E HL
Sbjct: 250 DDSSRHGGRVRGFPHERGNWATHVYLPYIAQEEFLELLELLVSRARTYVPSLAAMEEFHL 309
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+ +V+ YHWI+ V +L L +R + ++++ N+ KTR+FI L ++
Sbjct: 310 SLSQCVVLRYHWIEPFVRSLRERLAAFHRFFCVADQVKVYTNQNKTRTFIGLEVSAGHFQ 369
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLTS 178
L +V VD +EF LPT+Y++P+FH S+AWC+ D + L+ L +L +I F+ ++
Sbjct: 370 LLELVSEVDSVLEEFDLPTFYKDPSFHISLAWCVGDLSGRLEGQCLQELQDIVDGFEESA 429
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
I K+GNK++SFPL
Sbjct: 430 LLLRIQWEQIRCKSGNKYFSFPL 452
>gi|291390210|ref|XP_002711593.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 263
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 114/204 (55%), Gaps = 5/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--HL 59
D+ +HGGR+RSFPH+R +WAT VYIP L + + ++ HL
Sbjct: 60 DDSAKHGGRVRSFPHERGNWATHVYIPYAAREEFLDLLDVLLRRAQACVPRLVGMEGFHL 119
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS ++V+ +HWI V+ L + L R N ++I+ N+EKTR+F+ L S
Sbjct: 120 SLSHSVVLRHHWILPFVQALKDRLAAFQRFFFTANRVKIYTNQEKTRTFVGLEVTSGHAQ 179
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
L +V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ +
Sbjct: 180 LLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDARVQLEGQCLRELQEIVDEFE-DA 238
Query: 179 DESFHVVT-HIHMKTGNKFYSFPL 201
+ V+T + K+GNKF+S PL
Sbjct: 239 EVPLRVLTEQVRCKSGNKFFSMPL 262
>gi|431914174|gb|ELK15433.1| hypothetical protein PAL_GLEAN10011107 [Pteropus alecto]
Length = 265
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 114/206 (55%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYLPYDAREEFMDLLDTLLFHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + + R N I+I+ N+EKTR+F+ L S
Sbjct: 120 -HLSLSQSVVLRHHWIHPFVQALKDRMASCQRFCFTANQIKIYTNQEKTRTFVGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
T +V VD+ +EF L T+Y++P+FH S+AWC+ D L+ P L +L I +F+
Sbjct: 179 HTQFLDLVSEVDRVMEEFDLATFYQDPSFHISLAWCVGDAQLQLEGPCLRELQEIVDEFE 238
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+ PL
Sbjct: 239 DSEMLLRMHAEQVRCKSGNKFFLMPL 264
>gi|351697752|gb|EHB00671.1| hypothetical protein GW7_12014 [Heterocephalus glaber]
Length = 267
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 114/204 (55%), Gaps = 5/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGR+R+FPH+R +W T VY+P + L ML + + + + HL
Sbjct: 64 DDSAKHGGRVRTFPHERGNWVTHVYVPYEAREEFPDLLDMLLPQAQAYVPRLVKMEAFHL 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+F+ L S
Sbjct: 124 SLSQSVVLRHHWILPFVQALKDRMASFERFLFTANQVKIYTNQEKTRTFVGLEVTSGYAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ S
Sbjct: 184 FLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDACLQLEGQCLQELQEIVDEFE-DS 242
Query: 179 DESFHV-VTHIHMKTGNKFYSFPL 201
+ V + K+GNKF+S PL
Sbjct: 243 EMLLRVQAMQVRCKSGNKFFSMPL 266
>gi|51011009|ref|NP_001003460.1| putative U6 snRNA phosphodiesterase [Danio rerio]
gi|82182651|sp|Q6DEF6.1|USB1_DANRE RecName: Full=Putative U6 snRNA phosphodiesterase
gi|50417398|gb|AAH77163.1| Zgc:91896 [Danio rerio]
Length = 276
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 115/206 (55%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLK--EELNSVGISVEVIPEPHL 59
D EHGGR+RSF H+R +WAT V+ P A L + K + I + V E HL
Sbjct: 73 DKSEEHGGRLRSFQHERGNWATYVFFPYDPEEAFLEVLNKMMAAAEAHDIPLTVSEEFHL 132
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLSKT+V+ +HWI V+++ +L H + ++++ N EKTR+F+ + ++
Sbjct: 133 SLSKTVVLRHHWIQPFVQSIRTSLTHFQKFYCVAYKLKVYSNAEKTRTFLGMEVSTGTPH 192
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL-KPLLTKLDNIFTQFKLTS 178
L + + VD++ +EF L T+YE+P+FH S+AWC+ D+T L K L +L + +
Sbjct: 193 LLELSKIVDETMKEFNLSTFYEDPSFHISLAWCVGDQTERLKKACLLELQGLIDAHE--- 249
Query: 179 DESFHV---VTHIHMKTGNKFYSFPL 201
D FH + KTGNK + FPL
Sbjct: 250 DGPFHARLNCNELRCKTGNKVFVFPL 275
>gi|195572238|ref|XP_002104103.1| GD20784 [Drosophila simulans]
gi|194200030|gb|EDX13606.1| GD20784 [Drosophila simulans]
Length = 258
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 114/204 (55%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P EHGGRIRSF H+R +WAT VY+P + +L E + + +E+ P H
Sbjct: 58 VDDPAEHGGRIRSFKHERGNWATYVYVPATACVDQLEEFQSEAIARLEPHLELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L + L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQSALNSSAGFAATLQGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +DK +++L +Y+ +FH S+ WC+ D+ A LK LT+L +
Sbjct: 178 EKMTAILQPIDKVMLDYRLQQFYDPASFHVSLLWCVGDQEALLKEKLTELQELLED---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V +H+K GNK +++ L
Sbjct: 234 QDTLCLAVNEVHLKCGNKDFTYTL 257
>gi|410928833|ref|XP_003977804.1| PREDICTED: UPF0406 protein C16orf57 homolog [Takifugu rubripes]
Length = 214
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 113/204 (55%), Gaps = 16/204 (7%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV----GISVEVIPEPHLSLS 62
HGGRIRSF H+R +WAT VY P Q + L +++ SV G+++ E HLS+S
Sbjct: 16 HGGRIRSFKHERGNWATYVYFPYQPEEE--FVELLDQMVSVAKAHGVALSPQEEFHLSVS 73
Query: 63 KTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTS 122
+T+V+ +HWI +L L R ++++CN ++TR+F+ + + L
Sbjct: 74 QTVVLRHHWIQPFTRSLRAGLTLRKRFCCSAERLKVYCNADRTRTFLGMEVCTGHAQLLD 133
Query: 123 IVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQ-----FKLT 177
+VQ VD++ EF L T+Y++P+FH S+AWC+ D T ++ L +L ++F F L
Sbjct: 134 LVQIVDRTMSEFLLETFYKDPSFHVSLAWCVGDMTEPMRECLKELQSVFDGCEEGPFLLR 193
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D + +TGN+ + FPL
Sbjct: 194 LD-----CRELRCRTGNRIFHFPL 212
>gi|350397950|ref|XP_003485041.1| PREDICTED: UPF0406 protein C16orf57 homolog [Bombus impatiens]
Length = 253
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 117/205 (57%), Gaps = 9/205 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP- 57
DNP +H GR+RSF H+R +WATL+YI P + L+ ++++L E V + + E
Sbjct: 52 DNPLQHEGRVRSFKHERGNWATLIYIDYEPSEDMLSWMFSVLGE----VPVKCNIFSEQF 107
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SLS+TL++ +HWI++ VE ++ ++ ++ + NEEKTR+F+ + CK
Sbjct: 108 HISLSRTLILKFHWIESFVEETKKLCEQTDQFNLELLNVRAYTNEEKTRTFLGIECIDCK 167
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
L V+ V+K E+ LP +YE+ ++H S WCL ++ + L L QF +
Sbjct: 168 GVLGHFVKNVNKLLAEYDLPAFYEDSSYHVSFLWCLGNELSVLDDQARFLTIKLNQFLIE 227
Query: 178 SDESFHV-VTHIHMKTGNKFYSFPL 201
E+ ++ V I++K GNK Y F L
Sbjct: 228 HAEARYINVNKIYLKIGNKLYVFKL 252
>gi|328783219|ref|XP_001120854.2| PREDICTED: UPF0406 protein C16orf57 homolog [Apis mellifera]
Length = 215
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 118/205 (57%), Gaps = 9/205 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP- 57
D+P++H GR+RSF H+R +WATL+YI P + + ++++L+E + I + E
Sbjct: 14 DDPSQHDGRVRSFKHERGNWATLIYINYEPSEAIFSWIFSVLEE----INIKCNIFSEQF 69
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SL+KTL++ +HWI++ ++ ++ +K +++ + NEE TR+F+ + C
Sbjct: 70 HISLTKTLILKFHWIESFIKETKKLCEQTDQFDLKLLNVKAYTNEENTRTFLGIECVDCN 129
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
L V+ ++K E+ LP +YE+ ++H S WCL ++ L L QF +
Sbjct: 130 GVLACFVENINKFLAEYDLPPFYEDSSYHISFLWCLGNEFVVLNNYTHSLTTKLNQFLVE 189
Query: 178 -SDESFHVVTHIHMKTGNKFYSFPL 201
S+E + +T IH+K GNK Y+F L
Sbjct: 190 HSEERYIHITKIHLKIGNKLYAFKL 214
>gi|195108141|ref|XP_001998651.1| GI24089 [Drosophila mojavensis]
gi|193915245|gb|EDW14112.1| GI24089 [Drosophila mojavensis]
Length = 255
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 116/204 (56%), Gaps = 9/204 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--HL 59
DNP HGGRIRSF H+R +WAT VY+P +L +E + ++ +E+ P HL
Sbjct: 56 DNPTLHGGRIRSFKHERGNWATFVYVPALACAEQLEDFQREAIETLAPHLELQPNESIHL 115
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLSKT+V+ YH ID +L + L NS+E++ NEE+TR+F+A+ ++ T+
Sbjct: 116 SLSKTVVLQYHQIDEFQRSLQHALHSCVGFNSTLNSLEVYTNEERTRTFLAVQLDAAYTT 175
Query: 120 -LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTS 178
++ ++ VD ++++L +Y+ P+FH S+ WCL D+ A L L +L + L
Sbjct: 176 KMSGLLHCVDSVMRDYRLEQFYKNPSFHVSLLWCLGDQKAMLHAKLQELQQL-----LED 230
Query: 179 DESFHVVTH-IHMKTGNKFYSFPL 201
E+ + H + K GNK + + L
Sbjct: 231 HETLKLSVHELRCKCGNKDFIYKL 254
>gi|402908573|ref|XP_003917012.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 3 [Papio
anubis]
Length = 214
Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 111/204 (54%), Gaps = 5/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V V + H
Sbjct: 11 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVR-MEAFH 69
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 70 LSLSQSVVLRHHWILPFVQALKARMASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSGHA 129
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F+
Sbjct: 130 QFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGFEDA 189
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 190 EVLLRVYIEQVRCKSGNKFFSMPL 213
>gi|348504170|ref|XP_003439635.1| PREDICTED: UPF0406 protein C16orf57 homolog [Oreochromis niloticus]
Length = 279
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 113/200 (56%), Gaps = 8/200 (4%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKT 64
HGGRIRSF H+R +WA+ +Y P + L + + G+ + E HLSLS+T
Sbjct: 81 HGGRIRSFKHERGNWASYIYFPYHPEEEFGELLDGILSAAGARGVVLTAQDEFHLSLSQT 140
Query: 65 LVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIV 124
+V+ +HWI ++L ++L R + ++ N EKTR+F+ + ++ L ++
Sbjct: 141 VVLRHHWIQPFTQSLKSSLTGCKRFVCSAGRLRVYSNAEKTRTFLGMEVSTGHAQLLDLI 200
Query: 125 QAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHV 184
+ VD++ EF+L T+Y++P+FH S+AWC+ D+T ++ + +L ++ + D F +
Sbjct: 201 RTVDRTMTEFRLETFYKDPSFHVSLAWCVGDQTVQMEECMQELQSLVDDHE---DGPFVL 257
Query: 185 ---VTHIHMKTGNKFYSFPL 201
+ + +TGNK + FPL
Sbjct: 258 RLDCSELRCRTGNKTFRFPL 277
>gi|402908569|ref|XP_003917010.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Papio
anubis]
gi|355710247|gb|EHH31711.1| hypothetical protein EGK_12838 [Macaca mulatta]
gi|380816552|gb|AFE80150.1| hypothetical protein LOC79650 isoform 1 [Macaca mulatta]
gi|383412867|gb|AFH29647.1| hypothetical protein LOC79650 isoform 1 [Macaca mulatta]
gi|384943636|gb|AFI35423.1| hypothetical protein LOC79650 isoform 1 [Macaca mulatta]
Length = 265
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 112/206 (54%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F+
Sbjct: 179 HAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGFE 238
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 239 DAEVLLRVYIEQVRCKSGNKFFSMPL 264
>gi|195330368|ref|XP_002031876.1| GM26244 [Drosophila sechellia]
gi|194120819|gb|EDW42862.1| GM26244 [Drosophila sechellia]
Length = 258
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 113/204 (55%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P EHGGRIRSF H+R +WAT VY+P + +L E + + +E+ P H
Sbjct: 58 LDDPAEHGGRIRSFKHERGNWATYVYVPATACVDQLEEFQSEAIARLETHMELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L + L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQSALNISAGFAATLQGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +DK +++L +Y+ +FH S+ WC+ D+ LK LT+L +
Sbjct: 178 EKMTAILQPIDKVMLDYRLQQFYDPASFHVSLLWCVGDQETLLKDKLTELQELLED---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V +H+K GNK +++ L
Sbjct: 234 QDTLCLAVNEVHLKCGNKDFTYTL 257
>gi|355756824|gb|EHH60432.1| hypothetical protein EGM_11789 [Macaca fascicularis]
Length = 265
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 112/206 (54%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F+
Sbjct: 179 HAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGFE 238
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 239 DAEVLLRVYIEQVRCKSGNKFFSMPL 264
>gi|348605198|ref|NP_001231732.1| chromosome 16 open reading frame 57 [Sus scrofa]
Length = 269
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 114/209 (54%), Gaps = 15/209 (7%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V + +E
Sbjct: 66 DDSAKHGGRVRTFPHERGNWATHVYIPYEAREEFLDLLDVLLPHAQTYVPRLVRMEAF-- 123
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + + R + ++I+ N+EKTR+F+ L S
Sbjct: 124 -HLSLSQSVVLRHHWILPFVQVLKDRVASFQRFCFTADQVKIYTNQEKTRTFVGLEVTSG 182
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFK 175
+V VDK +EF L T+Y++P+FH S+ WC+ D L+ L +L I F+
Sbjct: 183 HAQFLDLVSEVDKVMEEFDLTTFYQDPSFHVSLTWCVGDARLQLEGRCLQELQEIVDAFE 242
Query: 176 LTSDESFHVVTH---IHMKTGNKFYSFPL 201
D + H I K+GNKF+S PL
Sbjct: 243 ---DSEMLLRMHAEQIRCKSGNKFFSMPL 268
>gi|403306028|ref|XP_003943548.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Saimiri
boliviensis boliviensis]
Length = 265
Score = 134 bits (338), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 114/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFDLSTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRMHTEQVRCKSGNKFFSMPL 264
>gi|10435046|dbj|BAB14469.1| unnamed protein product [Homo sapiens]
Length = 265
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 116/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + ++V
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMKVF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHPSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 264
>gi|55643949|ref|XP_511000.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 3 [Pan
troglodytes]
gi|410207098|gb|JAA00768.1| chromosome 16 open reading frame 57 [Pan troglodytes]
gi|410261756|gb|JAA18844.1| chromosome 16 open reading frame 57 [Pan troglodytes]
gi|410297898|gb|JAA27549.1| chromosome 16 open reading frame 57 [Pan troglodytes]
gi|410329579|gb|JAA33736.1| chromosome 16 open reading frame 57 [Pan troglodytes]
Length = 265
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 115/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 264
>gi|432119386|gb|ELK38464.1| hypothetical protein MDA_GLEAN10014282 [Myotis davidii]
Length = 214
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 114/205 (55%), Gaps = 7/205 (3%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLA---RLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VY+P + L A+L V V + H
Sbjct: 11 DDSAKHGGRVRTFPHERGNWATHVYVPYEAREEFPDLLDALLPHAQTRVPRLVR-MEAFH 69
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + R + ++I+ NEEKTR+F+ L S
Sbjct: 70 LSLSQSVVLRHHWILPFVQALKERVASRQRFCFTADRVKIYTNEEKTRTFVGLEVTSGHA 129
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+
Sbjct: 130 QFLDLVSEVDRVMEEFDLTTFYKDPSFHISLAWCVGDARLQLEGQCLRELQEIVDEFE-D 188
Query: 178 SDESFHV-VTHIHMKTGNKFYSFPL 201
S+ V + K+GNKF+S PL
Sbjct: 189 SETLLRVHAEQVRCKSGNKFFSVPL 213
>gi|42716283|ref|NP_078874.2| putative U6 snRNA phosphodiesterase isoform 1 [Homo sapiens]
gi|74732815|sp|Q9BQ65.1|USB1_HUMAN RecName: Full=Putative U6 snRNA phosphodiesterase; Short=hUsb1
gi|13325194|gb|AAH04415.1| Chromosome 16 open reading frame 57 [Homo sapiens]
gi|13623389|gb|AAH06291.1| C16orf57 protein [Homo sapiens]
gi|14043592|gb|AAH07774.1| Chromosome 16 open reading frame 57 [Homo sapiens]
gi|18204368|gb|AAH21554.1| Chromosome 16 open reading frame 57 [Homo sapiens]
gi|119603368|gb|EAW82962.1| chromosome 16 open reading frame 57, isoform CRA_a [Homo sapiens]
gi|119603370|gb|EAW82964.1| chromosome 16 open reading frame 57, isoform CRA_a [Homo sapiens]
gi|325464229|gb|ADZ15885.1| chromosome 16 open reading frame 57 [synthetic construct]
Length = 265
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 116/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + ++V
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMKVF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 264
>gi|397506498|ref|XP_003823764.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 3 [Pan
paniscus]
Length = 214
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 114/209 (54%), Gaps = 15/209 (7%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT +Y+P + L L +L V V + H
Sbjct: 11 DDSTKHGGRVRTFPHERGNWATHIYVPYEAKEEFLDLLDVLLPHAQTYVPRLVR-MEAFH 69
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 70 LSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSGHA 129
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 130 QFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF--- 186
Query: 178 SDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 187 --EDAEVLLRVHTEQVRCKSGNKFFSMPL 213
>gi|296231217|ref|XP_002761061.1| PREDICTED: UPF0406 protein C16orf57 isoform 1 [Callithrix jacchus]
Length = 265
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 114/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARVASFQRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRMHTEQVRCKSGNKFFSMPL 264
>gi|444725647|gb|ELW66208.1| Matrix metalloproteinase-15 [Tupaia chinensis]
Length = 891
Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats.
Identities = 73/207 (35%), Positives = 114/207 (55%), Gaps = 11/207 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VYIP + L L A+L + V + H
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAEEEFLDLLDALLSHAQTYIPRLVR-MEAFH 120
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + + N ++I+ N+EKTR+F+ L S
Sbjct: 121 LSLSQSVVLRHHWILPFVQALKDRMASCQGFFFTANQVKIYTNQEKTRTFVGLEVTSGHA 180
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L +I F+
Sbjct: 181 QFLHLVSEVDRVMEEFDLTTFYQDPSFHVSLAWCVGDARLQLEGQCLQELQDIVDGFE-- 238
Query: 178 SDESFHVVTH---IHMKTGNKFYSFPL 201
D + H + K+GNKF+ PL
Sbjct: 239 -DSEMLLRMHADQVRCKSGNKFFFMPL 264
>gi|397506494|ref|XP_003823762.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Pan
paniscus]
Length = 265
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 115/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT +Y+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHIYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 179 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 237
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 238 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 264
>gi|417398062|gb|JAA46064.1| Hypothetical protein [Desmodus rotundus]
Length = 265
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 112/207 (54%), Gaps = 11/207 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP---H 58
D+ +HGGR+R+FPH+R +WAT VY+P + +L E L V + H
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYVPYEAG-EEFLDLLDEVLPHAQTYVPRLVRMEAFH 120
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 121 LSLSQSVVLRHHWILPFVQALKDRTASRQRFCFTANRVKIYTNQEKTRTFIGLEVTSGHA 180
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L + QF+
Sbjct: 181 QFLDLVSEVDRVMEEFDLTTFYQDPSFHVSLAWCVGDARLQLEGQCLRELQEMVDQFE-- 238
Query: 178 SDESFHVVTH---IHMKTGNKFYSFPL 201
D + H + K+GNK +S PL
Sbjct: 239 -DSEMLLRVHAEQVRCKSGNKCFSMPL 264
>gi|426382354|ref|XP_004057772.1| PREDICTED: putative U6 snRNA phosphodiesterase isoform 1 [Gorilla
gorilla gorilla]
Length = 265
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 114/211 (54%), Gaps = 19/211 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R N ++I+ N+EKTR+FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFQRFFFTANQVKIYTNQEKTRTFIGLEVTSG 178
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I
Sbjct: 179 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAI----- 233
Query: 176 LTSDESFHVVTHIHM-----KTGNKFYSFPL 201
+ E V+ +H K+GNKF+S PL
Sbjct: 234 VDGSEDAEVLLRVHTEQVRCKSGNKFFSMPL 264
>gi|348572650|ref|XP_003472105.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Cavia
porcellus]
Length = 266
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 112/203 (55%), Gaps = 3/203 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGR+R+FPH+R +WAT VY+P + L +L + + + HL
Sbjct: 63 DDSAKHGGRVRTFPHERGNWATHVYVPYEAKEEFPDLLDVLLPHAQTYVPRLVRMEAFHL 122
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++++ +HWI V+ L + + R + ++I+ N+EKTR+F+ L S
Sbjct: 123 SLSQSVILRHHWILPFVQALKDRVASFERFLFTTDRVKIYTNQEKTRTFVGLEVTSGHAQ 182
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ +
Sbjct: 183 FLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDACLQLEGRCLQELQEIVDEFEDSE 242
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
+ K+GNKF+S PL
Sbjct: 243 MLLRVQAKQVRCKSGNKFFSMPL 265
>gi|291238248|ref|XP_002739047.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 282
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 118/206 (57%), Gaps = 5/206 (2%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNSVGISVEVIPEPH 58
+D+ ++H GR RSF HQ +WAT V+IP+ + L L L I ++++ + H
Sbjct: 78 VDDKHKHEGRSRSFEHQAGNWATFVHIPVPPSDDFYDLCTALVHAL-PTDIPMKLMDDFH 136
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
+SLS+T+V+ +HWI++ +E L + T+ ++E + N+EKTRSF+ L +
Sbjct: 137 VSLSRTVVLQHHWINSFIEALRQCFIDCSSFTLVLETLEFYTNDEKTRSFLGLKVSIGHD 196
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL-KPLLTKLDNIFTQFKLT 177
L +V+ VD+ EF+LP +YE+P+FH SIAWCL + A + K L L +F F
Sbjct: 197 YLLELVKLVDECLLEFRLPIFYEKPSFHVSIAWCLGNMRAKVTKDLTNDLKRLFDDFIDQ 256
Query: 178 SDESFHV-VTHIHMKTGNKFYSFPLT 202
+ V V I +GNK +SFPL+
Sbjct: 257 NPAIDRVEVKEIQCNSGNKHFSFPLS 282
>gi|443721372|gb|ELU10712.1| hypothetical protein CAPTEDRAFT_226950 [Capitella teleta]
Length = 239
Score = 132 bits (332), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 111/202 (54%), Gaps = 3/202 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSL 61
DNP+ H GRIRSF H+R +WA+ V+I + +L + E L S+ + + HLSL
Sbjct: 36 DNPDHHDGRIRSFGHERGNWASHVFIATDAS-DQLRQLTCELLKSLPGEFRCMEDFHLSL 94
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
SKT ++ +HWI L+ +L L R + F ++ NEE TR+F+ L S LT
Sbjct: 95 SKTFIVRHHWIKDLLSSLKKQLASCQRCCLCFQGVDFLVNEEGTRTFMVLKTCSGFNILT 154
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD-KTATLKPLLTKLDNIFTQF-KLTSD 179
V +VD + E+KLP YY++P+FH S+AW L D K + LL+ L +F + +SD
Sbjct: 155 QYVASVDAALSEYKLPAYYQDPSFHISLAWALGDVKKQISEELLSCLRETTAEFLEDSSD 214
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
+ + +TGN+ Y L
Sbjct: 215 CAILEANTLVFRTGNRQYDIEL 236
>gi|410983609|ref|XP_003998131.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Felis catus]
Length = 267
Score = 132 bits (332), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 113/204 (55%), Gaps = 5/204 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGR+R+FPH+R +WAT VY+P + L ML + + + HL
Sbjct: 64 DDSAKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDMLLPRAQTYVPRLVRMEAFHL 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + N ++I+ N+EKTR+F+ L S
Sbjct: 124 SLSQSVVLRHHWILPFVQALKDRVASFQGFFFIANRVKIYTNQEKTRTFVGLEVTSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ S
Sbjct: 184 FLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDARLQLEGQCLRELQTIVDEFE-DS 242
Query: 179 DESFHV-VTHIHMKTGNKFYSFPL 201
+ V + K+G+KF+S PL
Sbjct: 243 EMVLRVHAEQVRCKSGHKFFSMPL 266
>gi|24645417|ref|NP_649911.1| CG16790, isoform A [Drosophila melanogaster]
gi|74869088|sp|Q9VHB3.1|USB1_DROME RecName: Full=Putative U6 snRNA phosphodiesterase
gi|7299208|gb|AAF54405.1| CG16790, isoform A [Drosophila melanogaster]
gi|209417982|gb|ACI46529.1| FI06459p [Drosophila melanogaster]
Length = 258
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 113/204 (55%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P EHGGR+RSF H+R +WAT VY+P + +L E + + +E+ P H
Sbjct: 58 VDDPAEHGGRMRSFKHERGNWATYVYVPATACVDQLEEFQTEAIARLEPHLELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L + L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQSALNSSAGFAATLQGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +D+ +++L +Y+ +FH S+ WC+ D+ L LT+L +
Sbjct: 178 EKMTAILQPIDQVMLDYRLQQFYDPASFHVSLLWCVGDQETLLNEKLTELKELLDD---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D+ V +H+K GNK +++ L
Sbjct: 234 QDKLCLAVNEVHLKCGNKDFTYSL 257
>gi|68051333|gb|AAY84930.1| IP09928p [Drosophila melanogaster]
Length = 258
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 112/204 (54%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P EHG R+RSF H+R +WAT VY+P + +L E + + +E+ P H
Sbjct: 58 VDDPAEHGSRMRSFKHERGNWATYVYVPATACVDQLEEFQTEAIARLEPHLELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L + L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQSALNSSAGFAATLQGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +D+ +++L +Y+ +FH S+ WC+ D+ L LT+L +
Sbjct: 178 EKMTAILQPIDQVMLDYRLQQFYDPASFHVSLLWCVGDQETLLNEKLTELKELLDD---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D+ V +H+K GNK +++ L
Sbjct: 234 QDKLCLAVNEVHLKCGNKDFTYSL 257
>gi|195153144|ref|XP_002017489.1| GL22328 [Drosophila persimilis]
gi|194112546|gb|EDW34589.1| GL22328 [Drosophila persimilis]
Length = 255
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 109/204 (53%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIP-EP-H 58
+D+P HGGRIRSF H+R +WAT VYIP RL +E + + V++ EP H
Sbjct: 55 IDDPALHGGRIRSFKHERGNWATYVYIPALACAERLEEFQEEAIACLAPDVQMQANEPLH 114
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+T+V+ YH ID +L L ++I+ NEE+TR+F+A+ + T
Sbjct: 115 LSLSRTVVLQYHQIDEFSRSLQVALNSSTGFASTLQGLKIYTNEERTRTFLAVQLDGAFT 174
Query: 119 S-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+ SI+Q +DK +++LP +Y P+FH S+ WC+ D L L +L +
Sbjct: 175 EKVLSILQPIDKVMHDYRLPKFYAPPSFHVSLLWCVGDHEDLLNKKLKELRVLLED---- 230
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V IH K G K +++ L
Sbjct: 231 QDTLELAVNEIHCKCGKKDFTYKL 254
>gi|321455849|gb|EFX66971.1| hypothetical protein DAPPUDRAFT_302251 [Daphnia pulex]
Length = 259
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 112/198 (56%), Gaps = 11/198 (5%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQ--TNLARLYAMLKEELNSVGISVEVIPEPH 58
+D+P++HG RIRSFPH+R +WA+ +Y+P + +N ++ + + GI +++ + H
Sbjct: 64 VDDPSQHGNRIRSFPHERGNWASFIYLPWEGDSNFVNSVELITQCFQNYGIELQICDDFH 123
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNR--LTIKFNSIEIFCNEEKTRSFIALGANSC 116
+SL+KT ++ +HWI+ V ++ L ++R + N + ++ NEE+ R+F+A+
Sbjct: 124 ISLTKTFILRHHWIEGFVNSVKKQLNGISRPFQLLGTNVLSVYTNEERNRTFLAINIEDP 183
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKL 176
L + Q +D E++LPT+Y+E H SIAWC+ DK +L+ I KL
Sbjct: 184 SGMLNVLTQKMDSCMIEYQLPTFYKEACHHVSIAWCVCDKK-------EELEKIAAGLKL 236
Query: 177 TSDESFHVVTHIHMKTGN 194
+++ + + K GN
Sbjct: 237 EIEQTACNMAEVRCKIGN 254
>gi|195444757|ref|XP_002070015.1| GK11826 [Drosophila willistoni]
gi|194166100|gb|EDW81001.1| GK11826 [Drosophila willistoni]
Length = 263
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 116/205 (56%), Gaps = 9/205 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELN---SVGISVEVIPEP- 57
DNP+ HGGRIRSF H+R +WAT VY+P +T + +L L+ + + ++ +P
Sbjct: 62 DNPDCHGGRIRSFKHERGNWATYVYVPAETCVEQLEDFQNAALSLLEPLQLDMQPNEQPF 121
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC- 116
HLSLSKT+V+ +H I+ +L ++L + + I+ NEE+TR+FIA+ ++
Sbjct: 122 HLSLSKTIVLQHHQIEEFSRSLKDSLESQCGFFVCLQDLRIYTNEERTRTFIAVEVSAAY 181
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKL 176
K L I++ +D+ +++LP +Y+ P+FH S+ WC+ D + LK L +L ++
Sbjct: 182 KEKLCLILKPIDRVMLDYRLPQFYDPPSFHVSLLWCVGDHESVLKEKLNELSSLLETEDT 241
Query: 177 TSDESFHVVTHIHMKTGNKFYSFPL 201
V +H K GNK + + L
Sbjct: 242 LLLN----VNKLHCKCGNKQFIYKL 262
>gi|170058598|ref|XP_001864989.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877665|gb|EDS41048.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 246
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 116/209 (55%), Gaps = 11/209 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI------PLQTNLARLYAMLKEELNSVGISVEVIP 55
D+P +H GR+RSF H+R WA+ V++ PL ++ +EL+ + + +
Sbjct: 41 DDPAKHQGRVRSFAHERGIWASYVFVDYNDVEPLNDLQQQIIDKASKELS---LELNRVD 97
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
H+SL+KT VI +H I VE + N + R T+ +++ I+ NEE+TR+F+A+ +
Sbjct: 98 NLHMSLTKTFVIRHHNITAFVENIRNAVSGSKRFTVLPSNLAIYVNEEQTRTFLAVKIDE 157
Query: 116 CKTS-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQF 174
L +V A+D +E+KLP +Y+E +FH SI W L ++ L+ +L +LD +F
Sbjct: 158 TSFGPLEQLVDALDGCMREYKLPVFYQERSFHVSILWTLGNQRDKLEGILPELDELFNAI 217
Query: 175 KLTSDESFHV-VTHIHMKTGNKFYSFPLT 202
+V V +H+K GNKFY F L
Sbjct: 218 YEEEYCDMNVNVKRLHLKCGNKFYDFGLV 246
>gi|195499478|ref|XP_002096965.1| GE24763 [Drosophila yakuba]
gi|194183066|gb|EDW96677.1| GE24763 [Drosophila yakuba]
Length = 258
Score = 125 bits (315), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 108/204 (52%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P HGGRIRSF H+R +WAT VY+P +L E + + VE+ P H
Sbjct: 58 VDDPALHGGRIRSFKHERGNWATYVYVPATACADQLEEFQSEAIARLEPHVELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQAALNSSTGFAATLQGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +D+ +++L +Y +FH S+ WC+ D+ L L+ L +
Sbjct: 178 EKMTAILQPIDQVMLDYRLQQFYVPASFHVSLLWCVGDQEKVLNEKLSDLQELLED---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V +H+K+GNK +++ L
Sbjct: 234 QDTLSLAVNEVHLKSGNKDFTYTL 257
>gi|346468683|gb|AEO34186.1| hypothetical protein [Amblyomma maculatum]
Length = 279
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 114/203 (56%), Gaps = 5/203 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI--PLQTNLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ + H GR+R+FPH+ WA+ +I Q+++ L A L +++ + + + H+
Sbjct: 76 DSQSLHDGRVRTFPHEPGVWASYAFISGAEQSHIESLIAFLCRDIDY--LKPQQLSSCHV 133
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+T+ + +HWI LVE+L L + TI+F S++++ N EKTR+F++L +
Sbjct: 134 SLSRTVKLRHHWIQPLVESLRAVLTPHRKFTIRFGSLDVYTNAEKTRTFLSLKVHKGIEH 193
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSD 179
L +V VD +E+ LP +YE+P+FH S+AWC + L+ L +L QF +
Sbjct: 194 LKRMVVEVDGCLKEYDLPLFYEDPSFHMSVAWCDTSEEDRLQQSLHELQVKLEQFSIRHP 253
Query: 180 ESFHV-VTHIHMKTGNKFYSFPL 201
S V + +TGNK + PL
Sbjct: 254 SSLVTEVASVWFRTGNKLFELPL 276
>gi|194902989|ref|XP_001980801.1| GG17359 [Drosophila erecta]
gi|190652504|gb|EDV49759.1| GG17359 [Drosophila erecta]
Length = 258
Score = 125 bits (314), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 106/204 (51%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P HGGRIRSF H+R +WAT VY+P +L E + + VE+ P H
Sbjct: 58 VDDPELHGGRIRSFKHERGNWATYVYVPSTACADQLEEFQSEAIARLEPHVELQPNESLH 117
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ YH ID +L L + I+ NEE+TR+FIA ++
Sbjct: 118 LSLSRTVVLQYHQIDEFSRSLQTALNSSTGFAATLRGLRIYTNEERTRTFIAAPLDAAFV 177
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+T+I+Q +D+ E++ +Y+ +FH S+ WC+ D+ L L L +
Sbjct: 178 EKMTAILQPIDRVMLEYRQQQFYDPASFHVSLLWCVGDQEKVLNEKLKDLQELLED---- 233
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V +H+K GNK +++ L
Sbjct: 234 QDTLSLAVNEVHLKCGNKDFTYTL 257
>gi|125777323|ref|XP_001359569.1| GA14152 [Drosophila pseudoobscura pseudoobscura]
gi|54639316|gb|EAL28718.1| GA14152 [Drosophila pseudoobscura pseudoobscura]
Length = 255
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 109/204 (53%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIP-EP-H 58
+D+P HGGRIRSF H+R +WAT VY+P +L +E + + V++ EP H
Sbjct: 55 VDDPALHGGRIRSFKHERGNWATYVYVPALACAEQLEEFQEEAIACLAPDVQMQANEPLH 114
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+T+V+ YH ID +L L ++I+ NEE+TR+F+A+ + T
Sbjct: 115 LSLSRTVVLQYHQIDEFSRSLQVALNSSTGFASTLQGLKIYTNEERTRTFLAVQLDGAFT 174
Query: 119 S-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+ SI+Q +DK +++LP +Y P+FH S+ WC+ D L L +L +
Sbjct: 175 EKVLSILQPIDKVMHDYRLPKFYAPPSFHVSLLWCVGDHEDLLNKKLKELRVLLED---- 230
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D V IH K G K +++ L
Sbjct: 231 QDTLPLAVNEIHCKCGKKDFTYKL 254
>gi|194391132|dbj|BAG60684.1| unnamed protein product [Homo sapiens]
Length = 213
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 111/206 (53%), Gaps = 27/206 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSL 61
D+ +HGGR+R+FPH+R +WAT VY+P Y +E L+ + + + PH
Sbjct: 28 DDSTKHGGRVRTFPHERGNWATHVYVP--------YEAKEEFLDLLDVLL-----PH--- 71
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
++++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 72 AQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSGHAQFL 131
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTSDE 180
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L + I F E
Sbjct: 132 DLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQESQAIVDGF-----E 186
Query: 181 SFHVVTHIH-----MKTGNKFYSFPL 201
V+ +H K+GNKF+S PL
Sbjct: 187 DAEVLLRVHTEQVRCKSGNKFFSMPL 212
>gi|256080768|ref|XP_002576649.1| hypothetical protein [Schistosoma mansoni]
gi|353232053|emb|CCD79408.1| hypothetical protein Smp_053550.1 [Schistosoma mansoni]
Length = 246
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 14/206 (6%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEEL---NSVGISVEVIPEPH 58
D+P++H R R+FPH+ SWAT VYIP +R+ +K+ + N V + PH
Sbjct: 40 DDPSKHDFRSRTFPHEPGSWATSVYIPCYHLHSRILEAIKDPVIQSNPVMADCYTVDSPH 99
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGAN-SCK 117
+SLSKT I +HWI+ LV L + ++ + I + +E+ NEE TRSF L A+ +
Sbjct: 100 ISLSKTWPIYFHWIENLVGNLRSAVKSFGKFWIALDGVEVLVNEENTRSFFTLVASEESR 159
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD-----KTATLKPLLTKLDN-IF 171
L S++ +VD F+ P YYE P FH S WC + T TL L IF
Sbjct: 160 IVLISLLNSVDPCVTAFRGPKYYENPKFHMSFLWCNGNVRKKYSTETLNNFSMDLQKAIF 219
Query: 172 TQFKLTSDESFHVVTHIHMKTGNKFY 197
+ S + FH VT I K+G+K +
Sbjct: 220 VE----SKKIFHEVTDIVCKSGSKHF 241
>gi|383853928|ref|XP_003702474.1| PREDICTED: UPF0406 protein C16orf57 homolog [Megachile rotundata]
Length = 254
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 118/206 (57%), Gaps = 9/206 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP- 57
D+P +H GR RSF H+R +WATL YI P T ++ ++++L+E + + + E
Sbjct: 53 DDPLQHEGRTRSFKHERGNWATLAYINYEPSDTMISWMFSILEE----IPVKSNIFSEQF 108
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+S+++TL++ +HWI++ VE + ++ ++ +I + NE+ TR+F+ + K
Sbjct: 109 HISVTRTLILKFHWIESFVEEVKKLCEQTHQFNLELLNIRAYTNEDTTRTFLGIECFESK 168
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
L + V ++ E++LP +YE+ ++H S WCL ++ A L L L QF
Sbjct: 169 GILNNFVNNLNNILAEYELPPFYEDSSYHVSFLWCLGNEVANLNNHLYNLTTKLNQFLAD 228
Query: 178 SDESFHV-VTHIHMKTGNKFYSFPLT 202
+E ++ VT IH+K GNK Y+F L+
Sbjct: 229 HEEERNINVTTIHLKIGNKLYAFKLS 254
>gi|194744751|ref|XP_001954856.1| GF16534 [Drosophila ananassae]
gi|190627893|gb|EDV43417.1| GF16534 [Drosophila ananassae]
Length = 252
Score = 122 bits (307), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 111/204 (54%), Gaps = 7/204 (3%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--H 58
+D+P +HGGRIRSF H+R +WAT +Y+P +L E + + VE+ H
Sbjct: 52 VDDPAQHGGRIRSFKHERGNWATYIYVPAGACAEQLEDFQAEAIAKLAPQVELTANESLH 111
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-K 117
LSLS+T+V+ +H ID +L L + N + I+ N+E TR+F+A+ ++ +
Sbjct: 112 LSLSRTVVLQHHQIDEFSRSLKTALHSCTGFSASLNGLRIYTNDESTRTFVAVQLDAAFR 171
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+ ++ +D E++LP +Y+ P+FH S+ WC+ D+T +LT+ +
Sbjct: 172 DKASMLLNPIDSVMLEYRLPPFYDPPSFHVSLLWCVGDQTK----VLTEKLEELQELLAD 227
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D VT +H+K GN+ +S+ L
Sbjct: 228 QDTLPLPVTDVHLKCGNRDFSYIL 251
>gi|157120655|ref|XP_001659707.1| hypothetical protein AaeL_AAEL009090 [Aedes aegypti]
gi|108874836|gb|EAT39061.1| AAEL009090-PA [Aedes aegypti]
Length = 272
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 119/208 (57%), Gaps = 13/208 (6%)
Query: 3 NPNEHGGRIRSFPHQRNSWATLVYIPLQ-----TNLARLYAMLKEELNSVGISVEVIPEP 57
+P++H GR+RSF H+R WA+ V+I +L + ++++ + + + + + +
Sbjct: 68 DPSKHQGRVRSFAHERGIWASYVFIDYNEIDAFDDLQK--QLIEKSMKDLDLELNRVDDL 125
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG-ANSC 116
HLSL+KT V+ +H I VE + + + R I + + ++ NEE TR+F+A+ A+
Sbjct: 126 HLSLTKTFVLRHHNIAAFVENVRSAISGTKRFRISLSDLAVYTNEENTRTFLAVKVADQS 185
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKL 176
L+ +V+ +D S +E+KLPT+Y++P+FH S+ WCL ++ L + L F+ L
Sbjct: 186 CGPLSFLVEKLDTSMREYKLPTFYKDPSFHVSLLWCLGNRRQLLDDNMPTLQETFS--AL 243
Query: 177 TSDESFHVVTHIHM---KTGNKFYSFPL 201
+E + ++ M K GNK+YSF L
Sbjct: 244 YEEEYCDMNINVKMLNFKCGNKYYSFDL 271
>gi|195037825|ref|XP_001990361.1| GH18281 [Drosophila grimshawi]
gi|193894557|gb|EDV93423.1| GH18281 [Drosophila grimshawi]
Length = 256
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 114/204 (55%), Gaps = 9/204 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--HL 59
+N HGGRIRSF H+R +WAT VY+ +L E + + +E+ HL
Sbjct: 57 ENAALHGGRIRSFKHERGNWATFVYVSAGDCAEQLEDFQLEAIERLAPQIELHANESLHL 116
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLSKT+V+ YH ID +L L + + + ++ NEE+TR+F+A+ ++ TS
Sbjct: 117 SLSKTVVLQYHQIDEFQRSLQQALHCCAGFSSTLHLLSVYTNEERTRTFLAVQLDAAFTS 176
Query: 120 -LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTS 178
++S+++ VD ++++LP +YE+P+FH S+ WC+ D L+ L +L + L
Sbjct: 177 KMSSLLRPVDLVMRDYRLPQFYEKPSFHVSLLWCVGDHQTLLQDKLKELQQL-----LDD 231
Query: 179 DESFHV-VTHIHMKTGNKFYSFPL 201
E+ + V ++ K GNK +++ L
Sbjct: 232 HETLQLAVNEVNCKCGNKDFTYKL 255
>gi|405978511|gb|EKC42891.1| UPF0406 protein C16orf57-like protein [Crassostrea gigas]
Length = 275
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 112/207 (54%), Gaps = 10/207 (4%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV-GISVEVIPEPHLS 60
DNP H G IRSF H +WAT + + + R+ ++ E L + + + + + HLS
Sbjct: 67 DNPEVHEGIIRSFEHLEGNWATHIGVSYDPD-ERMIELIDELLKCLRPLEFKPMKDLHLS 125
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
LS+T+ I +HWI L + L + L + + +S++++ N+EKTR+F++L ++ L
Sbjct: 126 LSRTVAIRHHWIQPLTDRLRRRFKLLPKTCCEISSVKLYTNDEKTRTFLSLTVSAPGDIL 185
Query: 121 TSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD-----KTATLKPLLTKLDNIFTQFK 175
+AVD+ +E+KLP YYE P+FH SI WCL+D TL L +DN ++
Sbjct: 186 QQYTKAVDECFEEYKLPKYYENPSFHISIGWCLKDVIPQISEETLNKLQDLVDNAMEEY- 244
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPLT 202
+ V K+GNK + PL+
Sbjct: 245 --PELRLFPVEEAICKSGNKQFPLPLS 269
>gi|357620390|gb|EHJ72601.1| hypothetical protein KGM_18449 [Danaus plexippus]
Length = 720
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 67/181 (37%), Positives = 103/181 (56%), Gaps = 3/181 (1%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI--PLQTNLARLYAMLKEELNSVGISVEVIPEPH 58
+D P+ HGGR+RSFPH R +W + +YI P Q +L + L ++S+ I H
Sbjct: 48 IDEPSLHGGRLRSFPHVRGNWPSFIYIEYPEQDHLHKTINKLSNFVSSLNILCNRCDGIH 107
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLSKT I YH I L L L ++ + F+SIE++CNEEKTR+FIA+ A+ +
Sbjct: 108 LSLSKTFTIQYHMIKPLSSALQEVLGYIESFELFFDSIEVYCNEEKTRTFIAIKADIYSS 167
Query: 119 S-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
L +I +D +++KLP +Y++P+FH SI +K + ++ L+ I + T
Sbjct: 168 KILANITDKIDGILEDYKLPKFYKDPSFHISILSVNGNKKNDILRIVEDLNKILIRNPQT 227
Query: 178 S 178
S
Sbjct: 228 S 228
>gi|347967416|ref|XP_307963.4| AGAP002220-PA [Anopheles gambiae str. PEST]
gi|333466306|gb|EAA03693.4| AGAP002220-PA [Anopheles gambiae str. PEST]
Length = 242
Score = 119 bits (297), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 111/204 (54%), Gaps = 8/204 (3%)
Query: 5 NEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAM--LKEELNSVGISVEVIP--EPHLS 60
++H GR RSFPH+R WA+ V+I + L K LN +VE P HLS
Sbjct: 39 SKHQGRTRSFPHERGIWASYVFIDYGDSDGWLEMQNECKTLLNDASTTVEFNPIDRMHLS 98
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG-ANSCKTS 119
L+KT I +H I+ V L L L R ++F+ ++++ NEE+TR+F+ + A +
Sbjct: 99 LTKTFTIRHHNINPFVANLREQLAGLRRFRLEFSGVQVYVNEERTRTFLGVRVAEESYGA 158
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQF--KLT 177
L ++V ++D+ +E+KLP YY + FH SI WCL D+ A ++ L L +F + +
Sbjct: 159 LNALVTSLDECLREYKLPLYYTDRAFHVSILWCLGDREAEVRKELPALAAVFERVYEEEY 218
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
D S V T + K GNK + F L
Sbjct: 219 CDISQQVKT-LWFKCGNKSFHFNL 241
>gi|395839514|ref|XP_003792634.1| PREDICTED: UPF0406 protein C16orf57 homolog [Otolemur garnettii]
Length = 247
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 110/205 (53%), Gaps = 25/205 (12%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V V + E H
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAKEEFLDLLNVLLPHAQTYVPRLVR-MEEFH 120
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L N + +R +F+ L S T
Sbjct: 121 LSLSQSVVLRHHWILPFVQALKNRMASFHR------------------TFVGLEVTSGHT 162
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y++P+FH S+AWC+ D + L+ L +L I +F+
Sbjct: 163 QFLDLVSEVDRVMEEFNLTTFYQDPSFHISLAWCVGDASLQLEGKCLRELQEIVDEFE-D 221
Query: 178 SDESFHVVT-HIHMKTGNKFYSFPL 201
++ V T H+ K+GNKF+S PL
Sbjct: 222 AEMLLRVHTEHVRCKSGNKFFSIPL 246
>gi|390366342|ref|XP_001183592.2| PREDICTED: UPF0406 protein C16orf57 homolog [Strongylocentrotus
purpuratus]
Length = 297
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 91/163 (55%), Gaps = 2/163 (1%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT-NLARLYAMLKEELNSVGISVEVIPEPHL 59
M++P H GRIRSF H +WAT VYIP T +L+ L L L ++ + HL
Sbjct: 69 MNDPTNHHGRIRSFAHTPGNWATFVYIPADTPSLSSLTETLMTCLPQ-DLTFHPSDDLHL 127
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+T+ + +HWID ++ + + ++++ N+E TR+F+ L + +
Sbjct: 128 SLSRTVCLQFHWIDPFTQSFRERVSGMRSFQCHIEQVDVYANDEGTRTFLGLKIGAGHGT 187
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP 162
L +VQ ++ +EF LP +YE+P+FH S WC+ D ++ + P
Sbjct: 188 LCDLVQLTNECLEEFSLPVFYEDPSFHVSFGWCVGDVSSRIGP 230
>gi|448262564|pdb|4H7W|A Chain A, Crystal Structure Of Human C16orf57
Length = 193
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 105/198 (53%), Gaps = 15/198 (7%)
Query: 13 SFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIPY 69
+FPH+R +WAT VY+P + L L +L V V + HLSLS+++V+ +
Sbjct: 1 TFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVR-MKVFHLSLSQSVVLRH 59
Query: 70 HWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDK 129
HWI V+ L + +R N ++I+ N+EKTR+FI L S +V VD+
Sbjct: 60 HWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSGHAQFLDLVSEVDR 119
Query: 130 SAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTSDESFHVVTHI 188
+EF L T+Y++P+FH S+AWC+ D L+ L +L I F E V+ +
Sbjct: 120 VMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF-----EDAEVLLRV 174
Query: 189 H-----MKTGNKFYSFPL 201
H K+GNKF+S PL
Sbjct: 175 HTEQVRCKSGNKFFSMPL 192
>gi|427798955|gb|JAA64929.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 270
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 111/208 (53%), Gaps = 19/208 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP------LQTNLARL---YAMLKEELNSVGISVE 52
D+ + H GR+R+F H+ WA+ V++ +QT ++RL + LK + S
Sbjct: 72 DDRSRHDGRVRTFAHEPGVWASYVFVSVAKESEIQTLISRLCHGFDFLKPQQPSTC---- 127
Query: 53 VIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG 112
H+SLS+T+ + +HWI +VE+L R I+F S++++ N EKTR+F++L
Sbjct: 128 -----HVSLSRTVKLRHHWIQPMVESLKAVTAPYCRFVIRFGSLDVYTNAEKTRTFLSLK 182
Query: 113 ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFT 172
+ L +V VD +E+ LP +YE+P+FH S+AWC + + L L +L
Sbjct: 183 VHKGVEHLERLVSEVDSCLKEYDLPLFYEDPSFHMSVAWCDASEESRLLQCLHELQIRLE 242
Query: 173 QFKLTSDESFHV-VTHIHMKTGNKFYSF 199
QF + + V V+ + ++GNK +
Sbjct: 243 QFVIGHPSALAVDVSSVWFRSGNKLFEL 270
>gi|354495450|ref|XP_003509843.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Cricetulus
griseus]
Length = 250
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEPH 58
D+ +HGGRIR+FPH+R +WAT +YIP + L L +L V V+ + E H
Sbjct: 65 DDSTKHGGRIRTFPHERGNWATHIYIPYEAKEEFLDLLDVLLSRAQTFVPRLVQ-MEEFH 123
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT 118
LSLS+++V+ +HWI V+ L +++ R +FI L S
Sbjct: 124 LSLSQSVVLRHHWILPFVQALKDHMASFQR------------------TFIGLEVTSGHA 165
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLT 177
+V VD+ +EF L T+Y+ P+FH S+AWC+ D L+ L +L I +F+ +
Sbjct: 166 QFLDLVSEVDRVMEEFDLTTFYQNPSFHVSLAWCVGDACLQLEGQCLQELQEIVDEFEDS 225
Query: 178 SDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 226 EMLLRVLAEQVRCKSGNKFFSMPL 249
>gi|149032399|gb|EDL87290.1| rCG39094, isoform CRA_b [Rattus norvegicus]
Length = 248
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 104/202 (51%), Gaps = 20/202 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ HGGRIR+FPH+R +WAT +YIP + N L +L + + E HL
Sbjct: 64 DDSARHGGRIRTFPHERGNWATHIYIPYEANEEFQDLLDVLLPRAQMFAPRLVQMEEFHL 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L +S
Sbjct: 124 SLSQSVVLRHHWILPFVQVLKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVSSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSD 179
+V VD+ +EF L T+Y+ L LK ++ + ++ ++ ++
Sbjct: 184 FLDMVSEVDRVMKEFDLTTFYQ-----------LSSPCCWLKEIVDEFEDSEMLLRVLAE 232
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 233 Q-------VRCKSGNKFFSMPL 247
>gi|338723124|ref|XP_003364660.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Equus
caballus]
Length = 247
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 104/206 (50%), Gaps = 27/206 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYVPYEAGEDFLELLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
H+SLS+++V+ +HWI V+ L + + R +F+ L S
Sbjct: 120 -HVSLSQSVVLRHHWILPFVQALKDRVASFQR------------------TFVGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFK 175
+V VD+ +EF LPT+Y++P+FH S+AWC+ D L+ L +L I +F+
Sbjct: 161 HAQFLDLVSEVDRVMEEFDLPTFYQDPSFHISLAWCVGDARLQLEGQCLRELQAIVDEFE 220
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 221 DSEILLRVRAGQVRCKSGNKFFSMPL 246
>gi|68051359|gb|AAY84943.1| IP09828p [Drosophila melanogaster]
Length = 191
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 104/194 (53%), Gaps = 7/194 (3%)
Query: 11 IRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--HLSLSKTLVIP 68
+RSF H+R +WAT VY+P + +L E + + +E+ P HLSLS+T+V+
Sbjct: 1 MRSFKHERGNWATYVYVPATACVDQLEEFQTEAIARLEPHLELQPNESLHLSLSRTVVLQ 60
Query: 69 YHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-KTSLTSIVQAV 127
YH ID +L + L + I+ NEE+TR+FIA ++ +T+I+Q +
Sbjct: 61 YHQIDEFSRSLQSALNSSAGFAATLQGLRIYTNEERTRTFIAAPLDAAFVEKMTAILQPI 120
Query: 128 DKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTH 187
D+ +++L +Y+ +FH S+ WC+ D+ L LT+L + D+ V
Sbjct: 121 DQVMLDYRLQQFYDPASFHVSLLWCVGDQETLLNEKLTELKELLDD----QDKLCLAVNE 176
Query: 188 IHMKTGNKFYSFPL 201
+H+K GNK +++ L
Sbjct: 177 VHLKCGNKDFTYSL 190
>gi|312386025|gb|EFR30398.1| hypothetical protein AND_00058 [Anopheles darlingi]
Length = 228
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 103/199 (51%), Gaps = 5/199 (2%)
Query: 9 GRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNSVGI-SVEVIPEPHLSLSKTL 65
GR RSFPH+R WA+ VYI Q L R+ +EL S GI + + HLSL+KT
Sbjct: 30 GRQRSFPHERGIWASYVYIDYEGQEGLERMQQEWNQELRSAGIKDFQPVGHLHLSLTKTF 89
Query: 66 VIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG-ANSCKTSLTSIV 124
I +H I V +L L R ++ + + ++ NEE+TR+FI + A L +V
Sbjct: 90 TIRHHNIVPFVGSLQELLGVHRRFRLELDGVGVYVNEERTRTFIGVRIAEVSYPPLDRLV 149
Query: 125 QAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQ-FKLTSDESFH 183
+D+ E+KLP YYE+ +FH S+ W L D++ + L KL F + ++ +
Sbjct: 150 TELDECLLEYKLPRYYEDRSFHISLLWTLGDQSEIVTSRLPKLTEQFEEIYEDEYTDLTQ 209
Query: 184 VVTHIHMKTGNKFYSFPLT 202
+ + K GNK Y F L
Sbjct: 210 TIRKLWFKCGNKLYPFQLA 228
>gi|395747915|ref|XP_003778683.1| PREDICTED: LOW QUALITY PROTEIN: UPF0406 protein C16orf57 homolog
[Pongo abelii]
Length = 328
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 9/160 (5%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVI-P 55
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E P
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAFHP 121
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
+P +++V+ +HWI L L + +R N ++I+ N+EKTR+FI L S
Sbjct: 122 QP---CPQSVVLRHHWILPLCRALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTS 178
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
+V VD+ +EF L T+Y++P+FH S+AWC+ D
Sbjct: 179 GHAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGD 218
>gi|344289328|ref|XP_003416396.1| PREDICTED: UPF0406 protein C16orf57 homolog [Loxodonta africana]
Length = 245
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 27/201 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSL 61
D+ +HGGR+R+FPH+R +WAT VYIP L ++ + EE S+ + P P
Sbjct: 70 DDSAKHGGRVRTFPHERGNWATHVYIPC--TLLTVFPTMTEEATSIS-NTNRDPPP---- 122
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
+P +L+H R N ++I+ N+EKTR+F+ L S
Sbjct: 123 -----LP-------------SLKH-QRFFFTANRVKIYTNQEKTRTFVGLEVTSGHAQFL 163
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLTSDE 180
S+V VD+ +EF L T+Y+ P+FH S+AWC+ D L+ L L I +F+ +
Sbjct: 164 SLVSEVDRVMEEFALTTFYQNPSFHVSLAWCVGDARPQLEGQCLHNLQEIVDEFEDSELL 223
Query: 181 SFHVVTHIHMKTGNKFYSFPL 201
I K+GNKF+S PL
Sbjct: 224 LRVYAEQIRCKSGNKFFSLPL 244
>gi|148679220|gb|EDL11167.1| expressed sequence AA960436, isoform CRA_a [Mus musculus]
Length = 248
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 106/202 (52%), Gaps = 20/202 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQT--NLARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGRIR+FPH+R +WAT +YIP + + L L + ++ E H+
Sbjct: 64 DDSAKHGGRIRTFPHERGNWATHIYIPYEAKEDFRDLLDALLPRAQMFVPRLVLMEEFHV 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + R N ++I+ N+EKTR+FI L +S
Sbjct: 124 SLSQSVVLRHHWILPFVQVLKDRMASFQRFFFTANRVKIYTNQEKTRTFIGLEVSSGHAQ 183
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSD 179
+V VD++ +EF L T+Y+ L LK ++ + ++ ++ ++
Sbjct: 184 FLDLVSEVDRAMKEFDLTTFYQ-----------LSSPCCWLKEIVDEFEDSEMLLRVLAN 232
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 233 Q-------VRCKSGNKFFSMPL 247
>gi|242006412|ref|XP_002424044.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507350|gb|EEB11306.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 250
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 97/168 (57%), Gaps = 2/168 (1%)
Query: 8 GGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV-GISVEVIPEPHLSLSKTLV 66
G++R+FPH+ WAT VYIP++ L +K+ +N +++++ + H+SLS+T++
Sbjct: 53 DGKVRNFPHEPGIWATFVYIPVKP-CYELSDFVKKLINYFEQFNLKLVDDYHISLSRTVI 111
Query: 67 IPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQA 126
+ +HWI +E L N L+ + I ++++ NEEKTR+F++L + L +IV
Sbjct: 112 LKHHWIQGFMELLKNFLKDIPSFKIHVKGLKVYTNEEKTRTFLSLMVDENVEMLKTIVNQ 171
Query: 127 VDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQF 174
VD +E+ L +YE+P FH S+ WC +K + +L L +F
Sbjct: 172 VDLCLKEYNLGKFYEDPLFHLSLLWCPDNKYEEINSILMNLPTKLKEF 219
>gi|449268865|gb|EMC79702.1| hypothetical protein A306_12808, partial [Columba livia]
Length = 174
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 90/153 (58%), Gaps = 1/153 (0%)
Query: 50 SVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFI 109
S+ + + HLSLS+ +V+ YHWID V L L +R + ++++ N+ KTR+FI
Sbjct: 21 SLAAMEQFHLSLSQCVVLRYHWIDPFVRCLRERLATFHRFFCVADQVKVYTNQNKTRTFI 80
Query: 110 ALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLD 168
L ++ L +V VD+ +EF LPT+Y++P+FH S+AWC+ D + L+ L +L
Sbjct: 81 GLEVSTGHFQLLELVSEVDRVLEEFDLPTFYKDPSFHISLAWCVGDMSGRLEGQCLRELQ 140
Query: 169 NIFTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+I F+ ++ I K+GNK++SFPL
Sbjct: 141 DIVDGFEDSALLLRVQWEQIRCKSGNKYFSFPL 173
>gi|348572652|ref|XP_003472106.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Cavia
porcellus]
Length = 248
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 101/203 (49%), Gaps = 21/203 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGR+R+FPH+R +WAT VY+P + L +L + + + HL
Sbjct: 63 DDSAKHGGRVRTFPHERGNWATHVYVPYEAKEEFPDLLDVLLPHAQTYVPRLVRMEAFHL 122
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++++ +HWI V+ L + + R +F+ L S
Sbjct: 123 SLSQSVILRHHWILPFVQALKDRVASFER------------------TFVGLEVTSGHAQ 164
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP-LLTKLDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ +
Sbjct: 165 FLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDACLQLEGRCLQELQEIVDEFEDSE 224
Query: 179 DESFHVVTHIHMKTGNKFYSFPL 201
+ K+GNKF+S PL
Sbjct: 225 MLLRVQAKQVRCKSGNKFFSMPL 247
>gi|402908571|ref|XP_003917011.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Papio
anubis]
Length = 247
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 101/206 (49%), Gaps = 27/206 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMASFQR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F+
Sbjct: 161 HAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGFE 220
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + K+GNKF+S PL
Sbjct: 221 DAEVLLRVYIEQVRCKSGNKFFSMPL 246
>gi|403306030|ref|XP_003943549.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Saimiri
boliviensis boliviensis]
Length = 247
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 103/211 (48%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMASFQR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 161 HAQFLDLVSEVDRVMEEFDLSTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 219
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 220 ----EDAEVLLRMHTEQVRCKSGNKFFSMPL 246
>gi|332846036|ref|XP_003315166.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 1 [Pan
troglodytes]
gi|410261754|gb|JAA18843.1| chromosome 16 open reading frame 57 [Pan troglodytes]
Length = 247
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 104/211 (49%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 161 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 219
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 220 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 246
>gi|306035177|ref|NP_001182231.1| putative U6 snRNA phosphodiesterase isoform 2 [Homo sapiens]
gi|194381716|dbj|BAG64227.1| unnamed protein product [Homo sapiens]
Length = 247
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 105/211 (49%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + ++V
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMKVF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 161 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 219
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 220 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 246
>gi|296231219|ref|XP_002761062.1| PREDICTED: UPF0406 protein C16orf57 isoform 2 [Callithrix jacchus]
Length = 247
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 103/211 (48%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VYIP + L L +L V + +E
Sbjct: 62 DDSAKHGGRVRTFPHERGNWATHVYIPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARVASFQR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 161 HAQFLDLVSEVDRVMEEFDLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 219
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 220 ----EDAEVLLRMHTEQVRCKSGNKFFSMPL 246
>gi|397506496|ref|XP_003823763.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Pan
paniscus]
Length = 247
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 104/211 (49%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT +Y+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHIYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + +R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFHR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 161 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF- 219
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 220 ----EDAEVLLRVHTEQVRCKSGNKFFSMPL 246
>gi|426382356|ref|XP_004057773.1| PREDICTED: putative U6 snRNA phosphodiesterase isoform 2 [Gorilla
gorilla gorilla]
Length = 247
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 103/211 (48%), Gaps = 37/211 (17%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC 116
HLSLS+++V+ +HWI V+ L + R +FI L S
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQALKARMTSFQR------------------TFIGLEVTSG 160
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I
Sbjct: 161 HAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAI----- 215
Query: 176 LTSDESFHVVTHIH-----MKTGNKFYSFPL 201
+ E V+ +H K+GNKF+S PL
Sbjct: 216 VDGSEDAEVLLRVHTEQVRCKSGNKFFSMPL 246
>gi|410983611|ref|XP_003998132.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Felis catus]
Length = 249
Score = 106 bits (265), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--LARLYAMLKEELNSVGISVEVIPEPHL 59
D+ +HGGR+R+FPH+R +WAT VY+P + L ML + + + HL
Sbjct: 64 DDSAKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDMLLPRAQTYVPRLVRMEAFHL 123
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
SLS+++V+ +HWI V+ L + + +F+ L S
Sbjct: 124 SLSQSVVLRHHWILPFVQALKDRVASFQG------------------TFVGLEVTSGHAQ 165
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTS 178
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I +F+ S
Sbjct: 166 FLDLVSEVDRVMEEFDLTTFYQDPSFHISLAWCVGDARLQLEGQCLRELQTIVDEFE-DS 224
Query: 179 DESFHV-VTHIHMKTGNKFYSFPL 201
+ V + K+G+KF+S PL
Sbjct: 225 EMVLRVHAEQVRCKSGHKFFSMPL 248
>gi|195395480|ref|XP_002056364.1| GJ10908 [Drosophila virilis]
gi|194143073|gb|EDW59476.1| GJ10908 [Drosophila virilis]
Length = 257
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 89/154 (57%), Gaps = 3/154 (1%)
Query: 3 NPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP--HLS 60
N HGGRIRSF H+R +WAT VY+P+ +L E + + +E+ HLS
Sbjct: 58 NSELHGGRIRSFKHERGNWATFVYMPVLACAEQLEDFQIEAIKRLSPDLELRANESLHLS 117
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSC-KTS 119
LSKT+V+ YH ID +L L N + ++ NEE+TR+F+A+ ++ T
Sbjct: 118 LSKTVVLQYHQIDEFHRSLQQALHSCVGFNSTLNLLRVYTNEERTRTFLAVQLDAAYGTK 177
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
++++++ VD ++++L +Y+ P+FH S+ WC+
Sbjct: 178 MSALLRPVDLVMRDYRLAQFYDNPSFHVSLLWCV 211
>gi|307187998|gb|EFN72856.1| UPF0406 protein C16orf57-like protein [Camponotus floridanus]
Length = 170
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 92/161 (57%), Gaps = 1/161 (0%)
Query: 36 LYAMLKEELNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNS 95
L+ +K LN + + E+I H+SLS+TLV+ +HWI++ VE+L + ++
Sbjct: 10 LHTWMKSMLNELPVQGEIISNLHISLSRTLVLKFHWIESFVESLKLLCSGFHPFVVQLTD 69
Query: 96 IEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
++++CNEEKTR+F+ + + +L + A+D E++LP++YE+ ++H S WCL D
Sbjct: 70 VKMYCNEEKTRTFLGIYCQNEDGTLKHLTDAIDCLLAEYQLPSFYEDISYHISFFWCLGD 129
Query: 156 KTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNK 195
K LK +L L ++ + E ++ V I K GNK
Sbjct: 130 KQTYLKKILPSLTCSLNKYLAENVEDTYIHVKEIQCKIGNK 170
>gi|76154778|gb|AAX26198.2| SJCHGC02690 protein [Schistosoma japonicum]
Length = 191
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 85/154 (55%), Gaps = 4/154 (2%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKE---ELNSVGISVEVIPEPH 58
++P+ H R R+FPH+ SWAT +YI +R+ +K +LN + + H
Sbjct: 38 EDPSRHNYRSRTFPHEPGSWATSIYIACPHFYSRIQEAIKSPIIQLNPIMNDCCAVNFLH 97
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIAL-GANSCK 117
+SLSKT I +HWI+ L L + + + + I F+++E+F NEE TRSF L + +
Sbjct: 98 ISLSKTWPIYFHWIENLACNLRSAVSSIEKFCIAFDNVEVFVNEENTRSFFTLITSEESR 157
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAW 151
+LT ++ +VD F+ P YY+ P FH S W
Sbjct: 158 VALTPLLSSVDSCVTAFRGPAYYKNPKFHMSFLW 191
>gi|198427776|ref|XP_002131286.1| PREDICTED: similar to CG16790 CG16790-PA [Ciona intestinalis]
Length = 262
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 120/212 (56%), Gaps = 15/212 (7%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPL-QTNLARLYAMLKEELNSVGIS-VEVIPEPHL 59
++P +H GR+RSF H + +WAT +YIP + L + +++ L G+ +++ + HL
Sbjct: 52 NDPTKHQGRVRSFQHVKGNWATFLYIPYPNPHCNELRSYVEDLLQFQGLDHWKIVDDFHL 111
Query: 60 SLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN-SIEIFCNEEKTRSFIALGANS--C 116
S+S+T IP+H+I+ LVE + + L+ + + F+ ++ + N+EKTRSF S
Sbjct: 112 SVSRTSAIPHHFIEPLVEGIQGCAQKLSPVILNFSCDLKFYVNDEKTRSFCGFEVTSPFI 171
Query: 117 KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL-----QDKTATLKPLLTKLDNIF 171
L ++V ++K +++K +YYE P+FH SI+W L Q+ L+ T +F
Sbjct: 172 LAKLQTLVDHINKPLKDYKCDSYYENPSFHISISWTLGNIFEQNWKKQLEIFQTHWSEVF 231
Query: 172 TQF-KLTSDESFHVVTHIHMKTGNKFYSFPLT 202
+ +L S E+ ++V K GN+ ++F ++
Sbjct: 232 IESPELFSFEANNLVC----KCGNRLFTFEIS 259
>gi|427797819|gb|JAA64361.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 272
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 105/200 (52%), Gaps = 27/200 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP------LQTNLARLY-----------AMLKE-E 43
D+ + H GR+R+F H+ WA+ V++ +Q+ ++RL ++ KE E
Sbjct: 72 DDRSRHDGRVRTFAHEPGVWASYVFVSVAKESEIQSLISRLXXXXXWASYVFVSVAKESE 131
Query: 44 LNSV------GISVEVIPEP---HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN 94
+ ++ G +P H+SLS+T+ + +HWI +VE+L R I+F
Sbjct: 132 IQTLISRLCHGFDFLKPQQPSTCHVSLSRTVKLRHHWIQPMVESLKAVTAPYCRFVIRFG 191
Query: 95 SIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQ 154
S++++ N EKTR+F++L + L +V VD +E+ LP +YE+P+FH S+AWC
Sbjct: 192 SLDVYTNAEKTRTFLSLKVHKGVEHLERLVSEVDSCLKEYDLPLFYEDPSFHMSVAWCDA 251
Query: 155 DKTATLKPLLTKLDNIFTQF 174
+ + L L +L QF
Sbjct: 252 SEESRLLQCLHELQIRLEQF 271
>gi|156398088|ref|XP_001638021.1| predicted protein [Nematostella vectensis]
gi|156225138|gb|EDO45958.1| predicted protein [Nematostella vectensis]
Length = 233
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 115/213 (53%), Gaps = 17/213 (7%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNSVGI-SVEVIP--EPHLS 60
+H GR+R+F H +WA VYIP+ ++LA L L S + + + P E HLS
Sbjct: 22 DHQGRVRTFEHFPGNWALHVYIPMFGPSDLASLIESFVMSLPSDLVPKMHLFPCNELHLS 81
Query: 61 LSKTLVIPYHWIDTLVETLGNN--LRHLNRLTIKFNSIEIFCNEEKTR-SFIALGANSCK 117
LS+T+ I ++WI+++V+ L + L+H ++ + ++ + + T+ SFI L S
Sbjct: 82 LSRTVPIRHYWINSIVDQLKSAFALKHRYKIFLARHTQQTYPIARHTQQSFIGLKILSGH 141
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD--------KTATLKPLLTKLDN 169
L+S+V VD+ +EF LPT+Y++P+FH SIAWCL D ++ + D+
Sbjct: 142 KKLSSLVCNVDEVLEEFALPTFYQDPSFHVSIAWCLGDIHEHLMEKHIKQIQVITDAFDS 201
Query: 170 IFTQFKLTSDESFHVVTHIHMKTGNKFYSFPLT 202
+ + +F +H + GNK ++FP +
Sbjct: 202 LVMEMSAADCLAF-TPNQVHCRIGNKLFNFPFS 233
>gi|194375628|dbj|BAG56759.1| unnamed protein product [Homo sapiens]
Length = 150
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 83/150 (55%), Gaps = 11/150 (7%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
HLSLS+++V+ +HWI V+ L + +R N ++I+ N+EKTR+FI L S
Sbjct: 5 HLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSGH 64
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKL 176
+V VD+ +EF L T+Y++P+FH S+AWC+ D L+ L +L I F
Sbjct: 65 AQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGF-- 122
Query: 177 TSDESFHVVTHIH-----MKTGNKFYSFPL 201
E V+ +H K+GNKF+S PL
Sbjct: 123 ---EDAEVLLRVHTEQVRCKSGNKFFSMPL 149
>gi|391344655|ref|XP_003746611.1| PREDICTED: UPF0406 protein C16orf57 homolog [Metaseiulus
occidentalis]
Length = 283
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 97/209 (46%), Gaps = 19/209 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV-GISVEVIP----- 55
D P +HGGR R+F H++ WA+ ++ QT KE ++ V I+ + +P
Sbjct: 62 DEPEKHGGRRRAFEHEQGVWASHFFV--QT-------CDKEYMDDVQKIACDTVPFLRRS 112
Query: 56 ---EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG 112
+ H+SLSKTL YHWI L + ++ + F+ ++ NEEK F+AL
Sbjct: 113 DTAQSHISLSKTLKCRYHWIKPLYSYVRQGIQTRKPFLVSFSRFSVYENEEKNTCFLALD 172
Query: 113 ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFT 172
A+ L + VD + F LP +Y +P FH SI W + L L L
Sbjct: 173 ADIGANDLKDLSGRVDNALDHFNLPHFYHQPRFHVSIGWVPMHRKNDLLKSLADLQRELD 232
Query: 173 QFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ L S E VV + ++G K Y PL
Sbjct: 233 DYCL-SVELAGVVDRLCFRSGCKLYEIPL 260
>gi|380031018|ref|XP_003699135.1| PREDICTED: UPF0406 protein C16orf57 homolog, partial [Apis florea]
Length = 188
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 84/141 (59%), Gaps = 2/141 (1%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP-HLS 60
D+P +H GR+RSF H+R +WATL+YI + + A +++ + L + I + E H+S
Sbjct: 49 DDPLQHDGRVRSFKHERGNWATLIYINYEPSEA-IFSWISSVLEEINIKCNIFSEQFHIS 107
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSL 120
L+KTL++ +HWI++ ++ ++ +K +++ + NEE TR+F+ + C L
Sbjct: 108 LTKTLILKFHWIESFIKETKKLCEQTDQFDLKLLNVKAYINEENTRTFLGIECVDCNGVL 167
Query: 121 TSIVQAVDKSAQEFKLPTYYE 141
V+ ++K E+ LP++YE
Sbjct: 168 ARFVENINKFLAEYDLPSFYE 188
>gi|170072388|ref|XP_001870168.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167868637|gb|EDS32020.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 248
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 84/147 (57%), Gaps = 2/147 (1%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SL+KT VI +H I VE + N + R + +++ I+ NEE+TR+F+A+ +
Sbjct: 102 HMSLTKTFVIRHHNITAFVEDIRNAVSGSKRFIVLPSNLAIYVNEEQTRTFLAVKIDETS 161
Query: 118 -TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKL 176
L +V A+D +E+KLP +Y+E +FH SI W L ++ L+ +L +LD +F
Sbjct: 162 FRPLEQLVDALDGCMREYKLPVFYQERSFHVSILWTLGNQRDKLEGILPELDELFNAIYE 221
Query: 177 TSDESFHV-VTHIHMKTGNKFYSFPLT 202
+V V +H+K GNKFY F L
Sbjct: 222 EEYCDMNVNVKRLHLKCGNKFYDFGLV 248
>gi|428184559|gb|EKX53414.1| hypothetical protein GUITHDRAFT_48966, partial [Guillardia theta
CCMP2712]
Length = 150
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 85/150 (56%), Gaps = 7/150 (4%)
Query: 9 GRIRSFPHQRNSWATLVYIPL------QTNLARLYAMLKEELNS-VGISVEVIPEPHLSL 61
GRIRSFPH ++A V++ L QT +++ +++L S V + + E H+SL
Sbjct: 1 GRIRSFPHVEGNYAGFVFVSLKQVPELQTLAMQVFDNARKKLPSEVVLHRITLDEMHISL 60
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
S+TLV+ H I LV L L + +K +++E+ NEE TRSF+ + + +
Sbjct: 61 SRTLVVKRHQIQPLVNRLRKALHSIPSFHMKMSAVELLSNEEGTRSFVCMTVRPVEDEML 120
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAW 151
IV+ D+ + FKL YYE+ +FHAS+AW
Sbjct: 121 KIVKTTDEVLRSFKLQPYYEDMHFHASVAW 150
>gi|432863753|ref|XP_004070165.1| PREDICTED: LOW QUALITY PROTEIN: putative U6 snRNA
phosphodiesterase-like [Oryzias latipes]
Length = 206
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 90/202 (44%), Gaps = 38/202 (18%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV--GISVEVIPEPHLSLSKT 64
HGGRIRSF H+R +WAT VY + + S G+++ E HLSLS+T
Sbjct: 34 HGGRIRSFKHERGNWATYVYFAYHPEEEFEELLEELLSASTSHGVALTAQEEFHLSLSQT 93
Query: 65 LVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIV 124
+V+ +HWI +++L L H R + ++CN E+T A
Sbjct: 94 VVLRHHWIQPFMKSLRGGLTHSKRFVCSAGRLRVYCNAERTCCCSAY------------- 140
Query: 125 QAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQ-----FKLTSD 179
++ +P+FH S+AWC+ D T LK + +L ++ F L D
Sbjct: 141 -------------CFFXDPSFHVSLAWCVGDLTEELKEITQELQSLVDGREDGPFLLNLD 187
Query: 180 ESFHVVTHIHMKTGNKFYSFPL 201
+ +TGNK + FPL
Sbjct: 188 -----CAELRCRTGNKTFRFPL 204
>gi|363737962|ref|XP_413994.3| PREDICTED: UPF0406 protein C16orf57 homolog [Gallus gallus]
Length = 121
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 73/118 (61%), Gaps = 1/118 (0%)
Query: 54 IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGA 113
+ E H+SLS+++V+ YHWI +++L L +R + ++++ N+ KTR+F+ L
Sbjct: 1 MEEFHVSLSQSVVLRYHWISPFMQSLKERLAAFHRFFCVADRVKVYTNQNKTRTFVGLEV 60
Query: 114 NSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNI 170
++ L +V VDK +E+ LP +Y++P+FH S+AWC+ D +L+ L +L +I
Sbjct: 61 STGHFQLLELVSEVDKVMEEYDLPVFYKDPSFHISMAWCVGDLRGSLEGQCLQELQDI 118
>gi|168043938|ref|XP_001774440.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674292|gb|EDQ60803.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 303
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 106/216 (49%), Gaps = 24/216 (11%)
Query: 5 NEHGGRIRSFPHQRNSWATLVYIP------LQTNLA-------RLYAMLK----EELNS- 46
++HGGR+R+FPH ++A VYIP ++T +A L+ LK ++L++
Sbjct: 70 SDHGGRVRAFPHVEGNYALHVYIPVVLSASIRTKIALYLQKAVSLFPSLKSTEDDDLSAN 129
Query: 47 VGI--SVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEK 104
VG +++ E H+SL +T+ I H IDT+V L R ++F + E+F N+++
Sbjct: 130 VGSNQGIKLATEFHISLGRTVPIRIHQIDTMVHLLRRKFEGQKRFLVEFGTWEVFVNDDR 189
Query: 105 TRSFIALGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPL 163
TRSF++L + + V VD LPT+Y P H S+AW L D L +
Sbjct: 190 TRSFLSLEVVATGYAEIKKQVSLVDHVYTLHGLPTFYPNPRPHISLAWALGDVKDALATV 249
Query: 164 LTKLDNIFTQFKLTSDESF--HVVTHIHMKTGNKFY 197
+L N F+ F + T + + G K +
Sbjct: 250 AQEL-NTMACFREEKSACFFTSLATKVECRIGQKLH 284
>gi|440790135|gb|ELR11422.1| hypothetical protein ACA1_323190 [Acanthamoeba castellanii str.
Neff]
Length = 340
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 105/250 (42%), Gaps = 52/250 (20%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP--------LQTNLARLYAMLKEELNSVGISVEV 53
D+P H GR+RSFPH ++ T VYIP L RL ++ L G V
Sbjct: 92 DSPAAHLGRVRSFPHIEGNYPTFVYIPVYITAVDGLMPCADRLLQQFRDWLP--GRQVYS 149
Query: 54 IPE--------------P-----HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN 94
I E P H+S+S+T+ + H ID L++ L + L F
Sbjct: 150 ISELRDESSGDVTSSRRPLELIYHVSISRTVGVRQHQIDPLIDLLRSTLTPERSFEASFG 209
Query: 95 SIEIFCNEEKTRSFIALGANSCKTS---------------------LTSIVQAVDKSAQE 133
+ E+F N+E TRSF++L K+ + S++ VD ++
Sbjct: 210 AYEVFTNDEHTRSFLSLSVVDGKSKAALLSPLNLLFSLLSDPRWLCVCSLIAKVDGVFKQ 269
Query: 134 FKLPTYYEEPNFHASIAWCLQDKTATLK--PLLTKLDNIFTQFKLTSDESFHVVTHIHMK 191
F+LPTYYE+P H +I W L+D +T + P + D + V ++
Sbjct: 270 FRLPTYYEDPRPHMTIGWTLEDVLSTDEKGPPGLRPDRRHSLDPDMGGPFRFRVGNVQCT 329
Query: 192 TGNKFYSFPL 201
G Y FPL
Sbjct: 330 PGRLIYDFPL 339
>gi|170090019|ref|XP_001876232.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649492|gb|EDR13734.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 300
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 83/170 (48%), Gaps = 12/170 (7%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPL-----QTNLARLYAMLKEELNSVG------- 48
+D+P+ H GRIR+ PH +A+ +YI L Q L L +L+E SV
Sbjct: 49 LDDPSLHQGRIRTIPHVDGQFASHIYIALPLGREQPLLQLLRDILREAKESVPYLYEIGF 108
Query: 49 ISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSF 108
S E E H+SLS+ + + H + L + L R + T+ F + N+EKTR+F
Sbjct: 109 TSEEKTSELHISLSRPIFLRVHQREDLKKALRALARRQKKFTLSFATFSELVNDEKTRTF 168
Query: 109 IALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTA 158
+ + + L ++ +A+ + Q + YY P FHASIAW D+
Sbjct: 169 LTMEVGAGHHELQNLAEALSPALQGIRQKEYYTNPRFHASIAWAHLDRPG 218
>gi|255539937|ref|XP_002511033.1| conserved hypothetical protein [Ricinus communis]
gi|223550148|gb|EEF51635.1| conserved hypothetical protein [Ricinus communis]
Length = 288
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 26/216 (12%)
Query: 10 RIRSFPHQRNSWATLVYIPLQTNLA---RLYAMLKE--------ELNSVGISVEVI---- 54
R+RSF H ++A VYIP+ A + + LK+ + V I ++++
Sbjct: 66 RMRSFAHVEGNYALHVYIPVYIPPASKKEILSFLKKISSLVPGLHVVDVDIPLDILCKDD 125
Query: 55 ---------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
E H+SL +T+ I H ID++V L L+ NR I F+ E+F N++KT
Sbjct: 126 QKLEHVALGREFHISLGRTVPIRVHQIDSVVSMLRQRLQFKNRYWIDFSKWEVFVNDDKT 185
Query: 106 RSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLL 164
RSF++L N +T +++V++ + LP +Y++P H S+AW L D + +LK ++
Sbjct: 186 RSFLSLEVVNGGLAEITKQIESVNQVYKLHNLPEFYKDPRPHISLAWALGDISDSLKRVV 245
Query: 165 TKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSF 199
+ + L F I K GNK Y+F
Sbjct: 246 EEEIKKSSIAGLVQKRIFTCKCRGIECKIGNKTYNF 281
>gi|218191275|gb|EEC73702.1| hypothetical protein OsI_08295 [Oryza sativa Indica Group]
Length = 291
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 102/223 (45%), Gaps = 38/223 (17%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPL------QTNLA-----------RLYA---------ML 40
G R+RSFPH ++A VYIP+ + +LA LYA +
Sbjct: 68 QGSRVRSFPHVEGNYALHVYIPVVIPSDAKKHLALVMRRAASFVPDLYAIDADYALSELC 127
Query: 41 KEE--LNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
K+E L V +S E H+SL +T+ I H I++LV L R R + FN E
Sbjct: 128 KDEQKLEKVLLSREF----HVSLGRTVAIQVHQIESLVAMLRQKFRSQQRYWMDFNKWEH 183
Query: 99 FCNEEKTRSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F N++ TRSF++L S T L I + VD + LP +Y+ P H S+AW L D
Sbjct: 184 FVNDDCTRSFLSLEVTS--TGLPEISKQITMVDDVYRLHGLPEFYKNPRPHISLAWALGD 241
Query: 156 KTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFY 197
+ LK + +++ + + + +H+ K G K Y
Sbjct: 242 VSCKLKQAIKEIEKSQSSLGTSQKSNLRCKFSHVVCKIGKKVY 284
>gi|148907218|gb|ABR16750.1| unknown [Picea sitchensis]
Length = 388
Score = 82.8 bits (203), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 109/223 (48%), Gaps = 29/223 (13%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQTN---------LARLYAMLKEELNSV--------- 47
+ G RIRSFP +++ VYIP++ + + ML EL ++
Sbjct: 161 DSGNRIRSFPSVDGNYSLHVYIPVKISSIAEKQLAPFIKKAGMLVPELFAIDTGLPLCSA 220
Query: 48 ------GISVEV---IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
G V+ + H+SLS+T+ I +H ID++V L + L R I+F E+
Sbjct: 221 HLKSNDGAKVDARARAKQYHISLSRTVEIRHHQIDSIVSMLRHKLHSQKRYWIEFGKWEV 280
Query: 99 FCNEEKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKT 157
F N+++TRSF+A+ S S ++ ++ V++ LP YY+ P H S+A L D T
Sbjct: 281 FINDDQTRSFLAMEVRSGGLSEISKQIRLVNEVFILHNLPEYYKNPRPHISVAKGLGDIT 340
Query: 158 ATLKPLLTKLDNIFTQFKLTSDESF-HVVTHIHMKTGNKFYSF 199
+TLK + +L+ + + T + ++ + I K G + YS
Sbjct: 341 STLKLVADELNRLKGNVRPTEKSIWSYMFSGIGCKIGQRTYSI 383
>gi|357136807|ref|XP_003569995.1| PREDICTED: UPF0406 protein C16orf57 homolog [Brachypodium
distachyon]
Length = 275
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 101/220 (45%), Gaps = 30/220 (13%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQ---------TNLARLYAMLKEELNSVGIS------- 50
G RIRSFPH ++A VYIP+ T + R A L +L +V
Sbjct: 52 QGSRIRSFPHVEGNYALHVYIPVVIPFDARKQLTIVMRRAASLVPDLYAVDADYALAELC 111
Query: 51 --------VEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
V + E H+SL +T+ I H ID++V L + R + F E F N+
Sbjct: 112 KDEQKLEKVLLAREFHVSLGRTVAIQVHQIDSIVAMLRQKFQSQQRYWMDFTKWEHFVND 171
Query: 103 EKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK 161
+ TRSF++L S +T V VD+ + LP +Y+ P H S+ W L D ++ LK
Sbjct: 172 DSTRSFLSLEVTRTGLSEITKQVHMVDEVYRLHGLPEFYKNPRPHISLVWALGDISSKLK 231
Query: 162 PLLTKLDNIFTQFKLTSDESFHV---VTHIHMKTGNKFYS 198
+++N Q + S ++ ++ + + K GNK Y
Sbjct: 232 QATKEIENF--QNSVNSYKNCNLRCKFSRVVCKVGNKVYD 269
>gi|328769541|gb|EGF79585.1| hypothetical protein BATDEDRAFT_26031 [Batrachochytrium
dendrobatidis JAM81]
Length = 257
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 96/219 (43%), Gaps = 27/219 (12%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIP--LQTNL-ARLYAMLKEEL------------- 44
+D PN +R+ H +WAT +YIP LQ +L A L +K+
Sbjct: 48 VDKPNA-PVPVRTVAHIEGNWATYIYIPIELQEDLQAELKDWIKQATTQFPSWHSHIQLD 106
Query: 45 --NSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
+S+G+ H+SL +T + ID + L + +R T+ FN + + N+
Sbjct: 107 SNDSLGL--------HVSLCQTAYLKVFQIDRFMALLEQQIGLQSRFTVSFNGVSSYVND 158
Query: 103 EKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP 162
E+TRSF+ + L +VQ VD + F P +Y+ P FHASI W + ++
Sbjct: 159 EQTRSFVGMDIGHGHEELLELVQCVDLALTAFHQPPFYKNPRFHASIVWSPTTRASSFDN 218
Query: 163 LLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + + + +V + K+GNK F L
Sbjct: 219 PIESIPDSLRDIPKKLHNTLFLVRSVVCKSGNKLKQFDL 257
>gi|222623351|gb|EEE57483.1| hypothetical protein OsJ_07746 [Oryza sativa Japonica Group]
Length = 291
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 100/223 (44%), Gaps = 38/223 (17%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPL-----------------QTNLARLYA---------ML 40
G R+RSFPH ++A VYIP+ + + LYA +
Sbjct: 68 QGSRVRSFPHVEGNYALHVYIPVVIPSDAKKHLVLVMRRAASFVPDLYAIDADYALSELC 127
Query: 41 KEE--LNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
K+E L V +S E H+SL +T+ I H I++LV L R R + FN E
Sbjct: 128 KDEQKLEKVLLSREF----HVSLGRTVAIQVHQIESLVAMLRQKFRSQQRYWMDFNKWEH 183
Query: 99 FCNEEKTRSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F N++ TRSF++L S T L I + VD + LP +Y+ P H S+AW L D
Sbjct: 184 FVNDDCTRSFLSLEVTS--TGLPEISKQITMVDDVYRLHGLPEFYKNPRPHISLAWALGD 241
Query: 156 KTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFY 197
+ LK + +++ + + + +H+ K G K Y
Sbjct: 242 VSCKLKQAIKEIEKSQSSLGTSQISNLRCKFSHVVCKIGKKVY 284
>gi|225454880|ref|XP_002278790.1| PREDICTED: UPF0406 protein C16orf57 homolog [Vitis vinifera]
gi|147791224|emb|CAN70131.1| hypothetical protein VITISV_030399 [Vitis vinifera]
gi|297737378|emb|CBI26579.3| unnamed protein product [Vitis vinifera]
Length = 283
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 101/215 (46%), Gaps = 26/215 (12%)
Query: 9 GRIRSFPHQRNSWATLVYIPL---QTNLARLYAMLKEELNSV-GISVEVIPEP------- 57
GR+RSFPH ++A V+IP+ + L LK+ ++ V G+ V + P
Sbjct: 61 GRVRSFPHVEGNYALHVFIPVYIPSSPKKELVQYLKKVMSLVPGLHVVDVDIPLNILCKD 120
Query: 58 -------------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEK 104
H+SL +T+ I H ID++V L L+ R I FN E+F N+++
Sbjct: 121 DNKLEQVALGREFHISLGRTVPIRVHQIDSIVTMLRQKLQFQRRYWIDFNKWEVFVNDDQ 180
Query: 105 TRSFIALGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPL 163
TRSF+++ + +T +QAV++ + LP +YE+P H S+ W L + + +LK
Sbjct: 181 TRSFLSVEVIAGGLAEITRQIQAVNEVYRLHNLPEFYEDPRPHISLVWALGNISDSLKRA 240
Query: 164 LTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFY 197
+ ++ F I K GNK Y
Sbjct: 241 VEEMRRHINVGGSVQKRIFTCKFNGIECKIGNKTY 275
>gi|326504100|dbj|BAK02836.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 283
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 98/221 (44%), Gaps = 30/221 (13%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQ---------TNLARLYAMLKEELNSVGIS------- 50
G RIRSFPH ++A VYIP+ T + R A L +L +V
Sbjct: 60 QGSRIRSFPHVEGNYALHVYIPVVIPFNARKHLTLVMRRVASLVPDLYAVDADYALSELC 119
Query: 51 --------VEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
V + E H+SL +T+ I H ID+LV L + R + N E F N+
Sbjct: 120 KDEQKLEKVLLGREFHVSLGRTVGIQVHQIDSLVAMLRQKFQSQQRYWMDLNKWEHFVND 179
Query: 103 EKTRSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTAT 159
+ TRSF++L +T L I + VD+ + LP +Y+ P H S+AW L D ++
Sbjct: 180 DSTRSFLSLEVT--RTGLPEISKQIHMVDEVYRLHGLPEFYKNPRPHISLAWALGDVSSK 237
Query: 160 LKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSF 199
LK +++ L+ + + + I K G K Y
Sbjct: 238 LKQATKEIEKFENSINLSKNYNLRCNFSRILCKVGKKVYDI 278
>gi|326927192|ref|XP_003209777.1| PREDICTED: UPF0406 protein C16orf57 homolog [Meleagris gallopavo]
Length = 137
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 73/125 (58%), Gaps = 2/125 (1%)
Query: 78 TLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLP 137
TL L H + + ++++ N+ KTR+F+ L ++ L +V VDK +E+ LP
Sbjct: 13 TLKLFLSHFRFFCVA-DRVKVYTNQNKTRTFVGLEVSTGHFQLLELVSEVDKVMEEYDLP 71
Query: 138 TYYEEPNFHASIAWCLQDKTATLK-PLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKF 196
+Y++P+FH S+AWC+ D + +L+ L +L +I +F+ ++ I K+GNKF
Sbjct: 72 LFYKDPSFHISMAWCVGDLSGSLEGQCLQELQDIVDRFEDSARILRVQWEQIRCKSGNKF 131
Query: 197 YSFPL 201
+SFPL
Sbjct: 132 FSFPL 136
>gi|299743609|ref|XP_002910683.1| hypothetical protein CC1G_15014 [Coprinopsis cinerea okayama7#130]
gi|298405734|gb|EFI27189.1| hypothetical protein CC1G_15014 [Coprinopsis cinerea okayama7#130]
Length = 307
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 80/168 (47%), Gaps = 17/168 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQ-TNLARLYAMLKEELNSVGISVEVIP----- 55
D+P H GRIR+ PH +A VY+ L LY +LKE L + E++P
Sbjct: 52 DDPALHQGRIRTTPHVEGQFAAHVYVSLSLKGNPTLYKLLKEVLRD---AKELVPTLHEL 108
Query: 56 --------EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRS 107
+ H+SLS+ + + H + ++ N + + I F + N+EKTR+
Sbjct: 109 VAFDTGYSDLHVSLSRPVFLRAHQREEFKRSVRNIAKEQSPFAISFAAFSELTNDEKTRT 168
Query: 108 FIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F+ L + +L ++ ++ + F+ YYE P FHASI W L D
Sbjct: 169 FLVLEVGAGHHNLRTLASSLTSVMKSFRQKEYYESPRFHASIGWALLD 216
>gi|348688599|gb|EGZ28413.1| hypothetical protein PHYSODRAFT_469983 [Phytophthora sojae]
Length = 243
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 93/211 (44%), Gaps = 46/211 (21%)
Query: 12 RSFPHQRNSWATLVYIPLQT-----NLARLYAMLKEELNSVGISVEVIP----------- 55
R+FPH +W + V I + +A+ +EL VG +V ++P
Sbjct: 57 RAFPHVDGNWPSHVRIDIPVTEELREMAKCAIDRAQEL--VGENVTLVPFEELRLGESSS 114
Query: 56 -----EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIA 110
HLSLS+ ++ Y I+ V++L L+ R ++ + N++KTRSF+A
Sbjct: 115 TGSGGGLHLSLSRAFILTYDQIEGFVDSLHTALKWRQRFSVTLEGALVLVNDDKTRSFLA 174
Query: 111 LGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNI 170
L ++ +++ VD+ FK PTYY++P H SIA + ++ A L P
Sbjct: 175 LRVSAGAQQFNQVLRCVDQCLACFKQPTYYQDPIPHVSIASSMGEELAQLTP-------- 226
Query: 171 FTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ FHV GNK Y PL
Sbjct: 227 ---------DQFHVA------IGNKHYDIPL 242
>gi|301117794|ref|XP_002906625.1| hypothetical protein PITG_03564 [Phytophthora infestans T30-4]
gi|262107974|gb|EEY66026.1| hypothetical protein PITG_03564 [Phytophthora infestans T30-4]
Length = 235
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 91/206 (44%), Gaps = 39/206 (18%)
Query: 12 RSFPHQRNSWATLVYIPLQT-----NLARLYAMLKEELNSVGISVEVIPEP--------- 57
R+FPH +W + V I + LAR +E+ +++ E
Sbjct: 52 RAFPHVDGNWPSHVRIDIPVTHELRELARHAIERAQEIVGDAVAMLAFEELGLTSTGATD 111
Query: 58 --HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
HLSLS+ V+ Y I+ V++L L+ R ++ + N+EKTRSF+AL
Sbjct: 112 YLHLSLSRPFVLTYDQINGFVDSLRAALKWQQRFSVTLRRALVLANDEKTRSFLALRVGE 171
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFK 175
+ T +++ VD+ + PTYY++P H SIA L ++ A +
Sbjct: 172 GEQQFTQVLRCVDQCLSRVEQPTYYKDPIPHVSIASSLGEELA----------------Q 215
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
LTSD+ H+ GNK Y PL
Sbjct: 216 LTSDQ-------FHVAIGNKHYDIPL 234
>gi|281206293|gb|EFA80482.1| UPF0406 family protein [Polysphondylium pallidum PN500]
Length = 260
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 80/149 (53%), Gaps = 9/149 (6%)
Query: 8 GGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGIS---VEVIPEPHLSLSKT 64
G+ R F H ++ T VY+ + T+ + AM+ EE+ S+ +EV+ + H+SLS+
Sbjct: 69 NGKKRLFEHVDGNYPTYVYVKIPTS-DDINAMI-EEVGSIAKEHDVLEVVDDYHVSLSRV 126
Query: 65 LVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKT--SLTS 122
+ H IDT + L+++ TI F +I F N+ ++R F + AN C+ L
Sbjct: 127 FTMREHHIDTFTSEMTTKLKNIKEFTINFETISTFVNDGQSRLFFS--ANVCRNVDKLND 184
Query: 123 IVQAVDKSAQEFKLPTYYEEPNFHASIAW 151
++ +D+ ++FK P +YE P H S+ W
Sbjct: 185 TIKKIDQVLKQFKFPVFYETPLPHLSVNW 213
>gi|56755705|gb|AAW26031.1| SJCHGC02691 protein [Schistosoma japonicum]
Length = 202
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 79/154 (51%), Gaps = 14/154 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKE---ELNSVGISVEVIPEPH 58
++P+ H R R+FPH+ SWAT +YI +R+ +K +LN + + H
Sbjct: 38 EDPSRHNYRSRTFPHEPGSWATSIYIACPHFYSRIQEAIKSPIIQLNPIMNDCCAVDFLH 97
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT----------RSF 108
+SLSKT I +HWI+ L L + + + + I F+++E+F NEE T RSF
Sbjct: 98 ISLSKTWPIYFHWIENLACNLRSAVSSIEKFCIAFDNVEVFVNEENTRYLSNAIFIFRSF 157
Query: 109 IAL-GANSCKTSLTSIVQAVDKSAQEFKLPTYYE 141
L + + +LT ++ VD F+ P +++
Sbjct: 158 FPLITSEESRVALTPLLSFVDSWVPAFRGPAFFK 191
>gi|224136392|ref|XP_002322318.1| predicted protein [Populus trichocarpa]
gi|222869314|gb|EEF06445.1| predicted protein [Populus trichocarpa]
Length = 290
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 89/183 (48%), Gaps = 27/183 (14%)
Query: 5 NEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGI--------------- 49
N RIRSFPH + ++A VYIP+ A L + + LN + +
Sbjct: 63 NGQSSRIRSFPHVQGNYALHVYIPVNIPPA-LKKEVVQFLNRISLVVPGLHVVDADVPLD 121
Query: 50 ----------SVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIF 99
V + + H+SL +T+ I H ID++V L L+ I FN E+F
Sbjct: 122 ILCKDDHKLEQVALGRDFHISLGRTVPIRVHQIDSVVAMLRQKLQFQKGYWIDFNKWEVF 181
Query: 100 CNEEKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTA 158
N++KTR+F++L + + +T +Q+V+ + LP +Y++P H S+AW L D +
Sbjct: 182 VNDDKTRTFLSLEVVTGGLAEITKQIQSVNDVYKLHNLPEFYKDPRPHISLAWALGDVSD 241
Query: 159 TLK 161
LK
Sbjct: 242 VLK 244
>gi|320168978|gb|EFW45877.1| hypothetical protein CAOG_03861 [Capsaspora owczarzaki ATCC 30864]
Length = 422
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 80/154 (51%), Gaps = 1/154 (0%)
Query: 13 SFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIPYHWI 72
+F + + A + +Q + + L A + + G + H+S+++ LV+ + +
Sbjct: 167 AFDAEDDDGADPTAVDMQASASALPAARQRNVAEHGWVAIDPADMHISVTRPLVVSHAKL 226
Query: 73 DTLVETLGNNLRHL-NRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSA 131
L++ + LRH + + +E F N+E+TR+F++L T L +V+ VD
Sbjct: 227 SPLLDRMRARLRHQHGGIVVDLADVEFFVNDERTRTFVSLMVRRGLTGLKPLVKQVDAVL 286
Query: 132 QEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLT 165
++F L +Y+EP FH S+AWC+ D L+ L+T
Sbjct: 287 RDFPLQEFYDEPRFHLSVAWCVGDCMPLLQRLIT 320
>gi|349806389|gb|AEQ18667.1| hypothetical protein [Hymenochirus curtipes]
Length = 101
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 59/98 (60%), Gaps = 1/98 (1%)
Query: 101 NEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL 160
N+EKTR+F+ L + +L IV VD S +EF L T+YE+P+FH S+AWC+ DK L
Sbjct: 2 NQEKTRTFLGLEVSVGMENLLGIVSEVDLSLKEFNLKTFYEDPSFHVSLAWCVGDKAGQL 61
Query: 161 K-PLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFY 197
+ L +L ++ +F+ + + V IH K GNK +
Sbjct: 62 EGSGLLELQDVLDRFEDSDALTRFCVEEIHCKAGNKSF 99
>gi|449548738|gb|EMD39704.1| hypothetical protein CERSUDRAFT_45798 [Ceriporiopsis subvermispora
B]
Length = 297
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 85/168 (50%), Gaps = 12/168 (7%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQ-TNLARLYAMLKEELNSVG--------ISVE 52
D+P H GR+R+ PH +A VY+PL+ A L +L++ L S IS++
Sbjct: 51 DDPARHQGRVRTAPHVEGQFAAYVYVPLRLEKGAPLARLLRDVLESAKARVPSLHPISIQ 110
Query: 53 VIP---EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFI 109
+ P E H+SL++ + + H D L + R + T F + N+E+TR+F+
Sbjct: 111 LPPGESELHISLTRPVYLRAHQRDELKTAVRAIARAHSAFTASFAAFAELTNDERTRTFL 170
Query: 110 ALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKT 157
A+ A + L ++ A+ + + + +Y P FHASIAW L D+
Sbjct: 171 AVEAGAGHAELKALSDALVPTLRLLRQKEFYSAPRFHASIAWALLDRA 218
>gi|325183645|emb|CCA18105.1| hypothetical protein PITG_03564 [Albugo laibachii Nc14]
Length = 242
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 93/212 (43%), Gaps = 28/212 (13%)
Query: 11 IRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGI----SVEVIPEP--------- 57
IR FPH W + +YI L N ++ K+E N + V +IP+
Sbjct: 37 IRRFPHVVGHWPSHIYISLVENESK--QSQKKEFNVLADEIIDGVRLIPDVTQVHLVALK 94
Query: 58 --------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFI 109
HLSLS+ V+ Y I+ V+ L L+ R + IF NE+ TRSF+
Sbjct: 95 TNAEENPYHLSLSRPFVLTYGMIEKFVQELRLCLKWRRRFLYTLQGLSIFINEDATRSFL 154
Query: 110 ALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDN 169
A+ + T I++ V+ F+LP YYE P H S+A TL ++T+ +
Sbjct: 155 AINMINDTTPFLHILRCVNSCMTRFQLPAYYENPRPHVSVA---SSPHGTLGSIVTQ--S 209
Query: 170 IFTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ + E + TH+ + GNK + L
Sbjct: 210 QLDRLPSAAVEWHGIATHVAVAIGNKRFDIIL 241
>gi|328865692|gb|EGG14078.1| Rab GTPase [Dictyostelium fasciculatum]
Length = 466
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 5 NEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEEL---NSVGISVEVIPEPHLSL 61
++H G+ R F H ++ T VY + + + +M+ E N +GI +V H+SL
Sbjct: 84 DKHKGKKRQFEHVEGNYPTFVYFDVPIDRKDMESMIDEVRDIGNEMGIGHQV-DHYHVSL 142
Query: 62 SKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
S+ + H ID L L+++ + +I+F SI F N++K+R F++ + S+
Sbjct: 143 SRVFPMREHHIDLFCNQLKLELKNIQKFSIQFESISTFFNDDKSRIFLSSNVTTGIKSVN 202
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
I+ VDK +FK P ++E P H SI
Sbjct: 203 RIIAMVDKCLDQFKFPLFHEIPLPHLSIC 231
>gi|255648012|gb|ACU24462.1| unknown [Glycine max]
Length = 279
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 89/182 (48%), Gaps = 29/182 (15%)
Query: 10 RIRSFPHQRNSWATLVYIPL---QTNLARLYAMLKE------ELNSVGISV--------- 51
R+RSFPH ++A VYIP+ + L A LK+ LN V + V
Sbjct: 58 RLRSFPHVDGNYALHVYIPIYISSPSKKELVAFLKKISSREPRLNVVDVDVPLNILCQND 117
Query: 52 ------EVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
+ E H+SL +T+ I H ID++V L L+ ++ I FN E+F N++ T
Sbjct: 118 EKLEQVTLGREFHISLGRTVPIRVHQIDSVVSMLRQKLQIQHQYWIDFNKWEVFVNDDHT 177
Query: 106 RSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP 162
RSF L A + L I ++ V+ + LP +Y++P H S+AW L D +LK
Sbjct: 178 RSF--LSAEVVQGGLVEITKQIEVVNAIYRLHNLPEFYKDPRPHISLAWALGDIAHSLKK 235
Query: 163 LL 164
++
Sbjct: 236 IV 237
>gi|356495919|ref|XP_003516818.1| PREDICTED: UPF0406 protein C16orf57-like isoform 1 [Glycine max]
gi|356495921|ref|XP_003516819.1| PREDICTED: UPF0406 protein C16orf57-like isoform 2 [Glycine max]
Length = 279
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 89/182 (48%), Gaps = 29/182 (15%)
Query: 10 RIRSFPHQRNSWATLVYIPL---QTNLARLYAMLKE------ELNSVGISV--------- 51
R+RSFPH ++A VYIP+ + L A LK+ LN V + V
Sbjct: 58 RLRSFPHVDGNYALHVYIPIYISSPSKKELVAFLKKISSREPRLNVVDVDVPLNILCQND 117
Query: 52 ------EVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
+ E H+SL +T+ I H ID++V L L+ ++ I FN E+F N++ T
Sbjct: 118 EKLEQVTLGREFHISLGRTVPIRVHQIDSVVSMLRQKLQIQHQYWIDFNKWEVFVNDDHT 177
Query: 106 RSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP 162
RSF L A + L I ++ V+ + LP +Y++P H S+AW L D +LK
Sbjct: 178 RSF--LSAEVVQGGLVEITKQIEVVNAIYRLHNLPEFYKDPRPHISLAWALGDIAHSLKK 235
Query: 163 LL 164
++
Sbjct: 236 IV 237
>gi|302783913|ref|XP_002973729.1| hypothetical protein SELMODRAFT_414038 [Selaginella moellendorffii]
gi|300158767|gb|EFJ25389.1| hypothetical protein SELMODRAFT_414038 [Selaginella moellendorffii]
Length = 303
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 93/198 (46%), Gaps = 33/198 (16%)
Query: 4 PNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKE----------- 42
P H GR R+FPH ++ + V IP L++ L + +M+ +
Sbjct: 65 PPSHAGRKRTFPHVEGNFPSYVNIPVPVTLGARNVLESLLGKASSMMPQLRGMDDDIVVS 124
Query: 43 -------ELNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNS 95
++ +++ E H+SLS+T+ I H IDTL+ L + R I+F
Sbjct: 125 FKRRDTTSRSNADSGIQLAKEFHISLSRTVPIRVHQIDTLIPLLRHKFESQKRFWIEFGR 184
Query: 96 IEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFK---LPTYYEEPNFHASIAWC 152
EI+ N++++R+F LG L+ I + + + +K LP +Y+ P H S+AW
Sbjct: 185 WEIYLNDDRSRTF--LGLEVVSGGLSDIRRQIALVTEAYKLHGLPPFYDTPRPHISLAWA 242
Query: 153 LQDKTATLKPLLTKLDNI 170
L D ++ + + +L+ +
Sbjct: 243 LGDVSSDAQRVADELNEL 260
>gi|449488623|ref|XP_004158117.1| PREDICTED: putative U6 snRNA phosphodiesterase-like [Cucumis
sativus]
Length = 289
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 90/178 (50%), Gaps = 24/178 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNS---------VGIS 50
D P + R+RSFPH + ++A VYIP+ TN + A+ ++++S + I
Sbjct: 61 DLPIDQATRVRSFPHVQGNYALHVYIPVYVPTNARKEVALFMKKISSLVPALHLVDIDIP 120
Query: 51 VEVI------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
++V+ E H+SLS+T+ I H ID++V L L+ R I F+ E
Sbjct: 121 LDVLCKDDQKLEQALAREFHISLSRTVPIRVHQIDSIVTMLRQKLQSPRRYWIDFSKWET 180
Query: 99 FCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F N++ +R+F+++ + +Q V++ + LP +Y+E H S+AW L D
Sbjct: 181 FVNDDLSRTFLSMEIITGGLMEIRKQIQVVNEVYKLHNLPEFYKEARPHISVAWALGD 238
>gi|449451850|ref|XP_004143673.1| PREDICTED: putative U6 snRNA phosphodiesterase-like [Cucumis
sativus]
Length = 289
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 90/178 (50%), Gaps = 24/178 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPL--QTNLARLYAMLKEELNS---------VGIS 50
D P + R+RSFPH + ++A VYIP+ TN + A+ ++++S + I
Sbjct: 61 DLPIDQATRVRSFPHVQGNYALHVYIPVYVPTNARKEVALFMKKISSLVPALHLVDIDIP 120
Query: 51 VEVI------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
++V+ E H+SLS+T+ I H ID++V L L+ R I F+ E
Sbjct: 121 LDVLCKDDQKLEQAWAREFHISLSRTVPIRVHQIDSIVTMLRQKLQSPRRYWIDFSKWET 180
Query: 99 FCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F N++ +R+F+++ + +Q V++ + LP +Y+E H S+AW L D
Sbjct: 181 FVNDDLSRTFLSMEIITGGLMEIRKQIQVVNEVYKLHNLPEFYKEARPHISVAWALGD 238
>gi|302689367|ref|XP_003034363.1| hypothetical protein SCHCODRAFT_14810 [Schizophyllum commune H4-8]
gi|300108058|gb|EFI99460.1| hypothetical protein SCHCODRAFT_14810 [Schizophyllum commune H4-8]
Length = 324
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 73/165 (44%), Gaps = 12/165 (7%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPL----QTNLARLYAMLKEELNSVGISV----- 51
+DNP H GRIRS PH WA VY+P+ ++ L + + + G+S
Sbjct: 73 VDNPALHQGRIRSVPHVDGQWACHVYVPVTLKPRSALRDVLEDAIQRAKTSGMSAFHTFW 132
Query: 52 ---EVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSF 108
PE H+SLS+ + + H + L + R + F S + N+E TR+F
Sbjct: 133 DAETPRPELHISLSRPIFLRAHQREDLKRAVKRVARETQGFSTSFTSFSVLTNDENTRAF 192
Query: 109 IALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
+ + + L ++ + + + YY P +HASI W L
Sbjct: 193 LTVDVGAGHPELAAMTSKLTPFLRSVRQQEYYSCPKYHASIGWTL 237
>gi|302788019|ref|XP_002975779.1| hypothetical protein SELMODRAFT_442940 [Selaginella moellendorffii]
gi|300156780|gb|EFJ23408.1| hypothetical protein SELMODRAFT_442940 [Selaginella moellendorffii]
Length = 303
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 93/198 (46%), Gaps = 33/198 (16%)
Query: 4 PNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKE----------- 42
P H GR R+FPH ++ + V IP L++ L + +M+ +
Sbjct: 65 PPSHAGRKRTFPHVEGNFPSYVNIPVPVTLGARNVLESLLGKASSMMPQLRGMDDDIVVS 124
Query: 43 -------ELNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNS 95
++ +++ E H+SLS+T+ I H IDTL+ L + R I+F
Sbjct: 125 FKRRDTTSRSNADSGIQLAKEFHISLSRTVPIRVHQIDTLIPLLRHKFESQKRFWIEFGR 184
Query: 96 IEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFK---LPTYYEEPNFHASIAWC 152
E++ N++++R+F LG L+ I + + + +K LP +Y+ P H S+AW
Sbjct: 185 WEVYLNDDRSRTF--LGLEVVSGGLSDIRRQIALVTEAYKLHGLPPFYDTPRPHISLAWA 242
Query: 153 LQDKTATLKPLLTKLDNI 170
L D ++ + + +L+ +
Sbjct: 243 LGDVSSDAQRVADELNEL 260
>gi|195998229|ref|XP_002108983.1| hypothetical protein TRIADDRAFT_52524 [Trichoplax adhaerens]
gi|190589759|gb|EDV29781.1| hypothetical protein TRIADDRAFT_52524 [Trichoplax adhaerens]
Length = 193
Score = 76.3 bits (186), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 77/147 (52%), Gaps = 3/147 (2%)
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
E H+SLSK + I +H + L+ + L R NSI ++ N++ TR+F+A+
Sbjct: 47 ELHISLSKNVHIGFHIMQPLMNDIKAALTTKTRFNCYMNSIAVYPNDDYTRTFLAIEIYG 106
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL-KPLLTKLDNIFTQF 174
L S+ ++++ ++++LP +YE P+FH SI WC ++ + ++ KL ++
Sbjct: 107 GYEQLRSLTNSINECFRKYQLPPFYEPPSFHISIGWCPGNQVDKFNQSIMNKLQTKLEEY 166
Query: 175 KLTSDESFHVVTHIHMKTGNKFYSFPL 201
D + + ++ + GNK Y F L
Sbjct: 167 --LEDLNPVTIEYLECRCGNKLYKFSL 191
>gi|395324564|gb|EJF57002.1| hypothetical protein DICSQDRAFT_129768 [Dichomitus squalens
LYAD-421 SS1]
Length = 322
Score = 75.9 bits (185), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 84/184 (45%), Gaps = 25/184 (13%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIP--------LQTNLARLYAMLKE---ELNSVGI 49
+DNP H GR R+ PH +A VY+P L L R+Y+ K L +G
Sbjct: 53 VDNPALHQGRRRTTPHVEGQFAAYVYVPIVIPRVSKLLALLMRIYSAAKRLVPSLQPIGF 112
Query: 50 S-VEVIPEP-------------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNS 95
S + + EP H+SL++ + H + + + ++ + + F +
Sbjct: 113 SDGDALKEPGESSNLEADSIELHISLTRPTYLRAHQREEFKRAVRSAIKAKAKFSASFVT 172
Query: 96 IEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
N+E+TR+F+ L + L ++V+ + + + + +YE+P FHASIAW L D
Sbjct: 173 FSELTNDERTRTFLTLEIGAGHDHLKTLVENLTPALRAIRQKEFYEDPRFHASIAWALLD 232
Query: 156 KTAT 159
T
Sbjct: 233 GAKT 236
>gi|393212545|gb|EJC98045.1| hypothetical protein FOMMEDRAFT_137382 [Fomitiporia mediterranea
MF3/22]
Length = 296
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 84/167 (50%), Gaps = 20/167 (11%)
Query: 3 NPNEHGGRIRSFPHQRNSWATLVYIP--LQTNLARLYAMLKEELNSVGISVEVIP----- 55
+P+ H GRIR+ PH +A VY+P L+ N L ++L E VG S E+ P
Sbjct: 57 DPSRHQGRIRAIPHVEGQFAAYVYVPVCLKANTP-LRSLLDE---VVGRSKEIEPFLVCD 112
Query: 56 ---EP------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTR 106
P H+SL++ + + +H + L + R ++ T F+S +F N+E TR
Sbjct: 113 WQDSPNATYLLHISLTRPIYLRHHQREELRRAVKMAARAVDPFTASFSSFSVFENDEHTR 172
Query: 107 SFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
F+ + + + L + ++++ + + +Y EP +HASIAW L
Sbjct: 173 VFLGVDIGAGHSMLEVLSKSIEPTLKLLHQKQFYIEPRYHASIAWSL 219
>gi|226530062|ref|NP_001142745.1| uncharacterized protein LOC100275088 [Zea mays]
gi|195609042|gb|ACG26351.1| hypothetical protein [Zea mays]
Length = 280
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 89/194 (45%), Gaps = 37/194 (19%)
Query: 7 HGGRIRSFPHQRNSWATLVYIP----------LQTNLAR-------LYA---------ML 40
GGR+RSFPH ++A VYIP L ++ R LYA +
Sbjct: 57 QGGRVRSFPHVEGNYAVHVYIPVVIPSDARKQLALSMKRAASLVPDLYAVDADYALSELC 116
Query: 41 KEE--LNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
K+E L V +S E H+SL + + + H ID+ + L + + ++FN E
Sbjct: 117 KDEQKLEKVLLSREF----HVSLGRPVAVQVHQIDSFIAMLRQKFQPQQQYWMEFNKWEH 172
Query: 99 FCNEEKTRSFIALGANSCKTSLTSIVQA---VDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
F N++ TRSF++L +T L I + VD+ + LP +Y P H S+ W L D
Sbjct: 173 FVNDDCTRSFVSLEVT--RTGLPEITRQILMVDEVYRLHGLPEFYTNPRPHISLVWALGD 230
Query: 156 KTATLKPLLTKLDN 169
+ LK L ++
Sbjct: 231 VSGKLKQALKDIEK 244
>gi|355736960|gb|AES12165.1| hypothetical protein [Mustela putorius furo]
Length = 111
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 53/88 (60%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SLS+++V+ +HWI V+ L + L R N ++I+ N+EKTR+F+ L S
Sbjct: 24 HVSLSQSVVLRHHWIIPFVQALKDRLASFQRFFFTANRVKIYTNQEKTRTFVGLEVTSGH 83
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNF 145
+V VD+ +EF L T+Y++P+F
Sbjct: 84 PQFLDLVSEVDRVMEEFDLTTFYQDPSF 111
>gi|297795921|ref|XP_002865845.1| hypothetical protein ARALYDRAFT_495185 [Arabidopsis lyrata subsp.
lyrata]
gi|297311680|gb|EFH42104.1| hypothetical protein ARALYDRAFT_495185 [Arabidopsis lyrata subsp.
lyrata]
Length = 260
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 91/185 (49%), Gaps = 28/185 (15%)
Query: 3 NPNEHGGRIRSFPHQRNSWATLVYIPLQTN----------LARLYAMLKE-ELNSVGISV 51
+ E G R+R+FPH ++A VYIP+ L R+ +++ L + +
Sbjct: 33 DSTEPGVRVRNFPHVDGNYALHVYIPVSIPPLPKKEIVCFLKRVASVVPHLHLVEADVPL 92
Query: 52 EVI------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIF 99
++ E H+SL +++ + H I+++V L L+ R I FN E+F
Sbjct: 93 SILCKDDQKFERALGREFHISLGRSVPLRVHQINSVVSMLRQKLQLQKRYAIDFNKWEVF 152
Query: 100 CNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFK---LPTYYEEPNFHASIAWCLQDK 156
N++ TRSF++L + + L+ I + +D + +K LP +Y++P H S+ W L D
Sbjct: 153 VNDDCTRSFLSLEITT--SGLSEISKQIDAVNEVYKLHNLPEFYKDPRPHISLVWALGDI 210
Query: 157 TATLK 161
+LK
Sbjct: 211 RTSLK 215
>gi|389744728|gb|EIM85910.1| hypothetical protein STEHIDRAFT_98157 [Stereum hirsutum FP-91666
SS1]
Length = 333
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 79/185 (42%), Gaps = 30/185 (16%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT----NLARLY----AMLKEE-----LNSV 47
+D+P++H GR R+ PH WA VY+P++ N+ RL A +E L S+
Sbjct: 56 VDDPSKHQGRTRTVPHVDGQWAAYVYVPIKAGARDNIGRLVRRTLASARERESMGCLRSI 115
Query: 48 GISVEVI-----------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLT 90
G ++ E H+SLS+ + + + + + + T
Sbjct: 116 GFDIDSDELGKGKTRDENGIRTEDDELHISLSRPVFLRAYQREEFKRAVRIIAKSNKSFT 175
Query: 91 IKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
F +I N+EKTRSF+ L + L + + + + + YY P FHAS A
Sbjct: 176 GSFATIAALTNDEKTRSFLCLEVGAGHNELRKLSDDLTPTLRSMRQKEYYAYPRFHASFA 235
Query: 151 WCLQD 155
W L D
Sbjct: 236 WALLD 240
>gi|409039200|gb|EKM48878.1| hypothetical protein PHACADRAFT_179402 [Phanerochaete carnosa
HHB-10118-sp]
Length = 313
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 86/182 (47%), Gaps = 25/182 (13%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIP----- 55
+DNP H GR+RS PH +A VY+P++ + L N+ +V+++
Sbjct: 46 VDNPAIHQGRLRSSPHVEGQFAAYVYVPVRIPPTSMLGSLVN--NAFHHAVDIVSILHPI 103
Query: 56 -----EP-----HLSLSKTLVIPYHWID---TLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
EP H+SL++ + + H + T V +G + + F + N+
Sbjct: 104 GDTNIEPDERELHISLTRPMYLRAHQREEFRTAVRVVGGQQK---AFSASFANFAELTND 160
Query: 103 EKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL--QDKTATL 160
E+TR+F+ + + L ++ A+ + + ++ +YE+P FHASIAW L Q +
Sbjct: 161 ERTRTFLTIEVGAGHRELEALCNALAPTLRSYRQKEFYEKPRFHASIAWALLAQSTSTEF 220
Query: 161 KP 162
KP
Sbjct: 221 KP 222
>gi|392566479|gb|EIW59655.1| hypothetical protein TRAVEDRAFT_122213 [Trametes versicolor
FP-101664 SS1]
Length = 328
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 92/232 (39%), Gaps = 32/232 (13%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN--------LARLYAMLK---EELNSVGI 49
+DNP H GR+R+ PH +A VYIPL L R++A K L+ +G
Sbjct: 51 IDNPALHQGRVRTTPHVEGQFAAYVYIPLVVERNSKLHKLLLRIFAAAKLAVPSLHPIGF 110
Query: 50 SVEVI------------PEP-----HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIK 92
S + PE H+SLS+ + + H + + +
Sbjct: 111 SQSTLNTADETERDGNAPEDGAMELHISLSRPVYLRAHQRADFKRAVKAAAKSKRSFSAS 170
Query: 93 FNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWC 152
F ++ N+E+TR+F+ L + ++ + + + + +Y++P FHAS+AW
Sbjct: 171 FATLSELTNDERTRTFLTLEIGAGHDDFRALSDELTPTLKSLRQKEFYQDPRFHASVAWA 230
Query: 153 LQDKTATLKPLLTKLDNIFT----QFKLTSDESFHVVTHIHMKTGNKFYSFP 200
L D + L + + T +D + I T F S P
Sbjct: 231 LLDSAKSPPQELREAEPAPTLPTSDLSEKTDTPQTDIVEIQTSTAESFSSIP 282
>gi|18423246|ref|NP_568753.1| uncharacterized protein [Arabidopsis thaliana]
gi|8843853|dbj|BAA97379.1| unnamed protein product [Arabidopsis thaliana]
gi|21617972|gb|AAM67022.1| unknown [Arabidopsis thaliana]
gi|28393839|gb|AAO42327.1| unknown protein [Arabidopsis thaliana]
gi|28973371|gb|AAO64010.1| unknown protein [Arabidopsis thaliana]
gi|332008663|gb|AED96046.1| uncharacterized protein [Arabidopsis thaliana]
Length = 285
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 87/182 (47%), Gaps = 28/182 (15%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQ-------------TNLARLYAMLKEELNSVGISV- 51
E G R+R+FPH ++A VY+P+ +A + L V +S+
Sbjct: 61 EPGVRVRNFPHVDGNYALHVYVPVCIPPLPKKEIVCFLKKVASVVPHLHLVEADVPLSIL 120
Query: 52 ---------EVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
+ E H+SL + + + H I++++ L L+ R I FN E+F N+
Sbjct: 121 CKDDQKFERALGREFHISLGRNVPLRVHQINSVISMLRQKLQLQKRYLIDFNKWEVFVND 180
Query: 103 EKTRSFIALGANSCKTSLTSIVQAVDKSAQEFK---LPTYYEEPNFHASIAWCLQDKTAT 159
+ TRSF++L + + L+ I + +D + +K LP +Y++P H S+ W L D +
Sbjct: 181 DHTRSFLSLEITT--SGLSEISKQIDAVNEVYKLHNLPEFYKDPRPHISLVWALGDIRTS 238
Query: 160 LK 161
LK
Sbjct: 239 LK 240
>gi|66814510|ref|XP_641434.1| UPF0406 family protein [Dictyostelium discoideum AX4]
gi|74856040|sp|Q54W16.1|USB1_DICDI RecName: Full=Putative U6 snRNA phosphodiesterase
gi|60469462|gb|EAL67455.1| UPF0406 family protein [Dictyostelium discoideum AX4]
Length = 275
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 79/155 (50%), Gaps = 6/155 (3%)
Query: 5 NEHGGRIRSFPHQRNSWATLVYIPLQT----NLARLYAMLKEELNSVGISVEVIPEPHLS 60
+E + R F H ++ T +Y + T ++ L +KE N + I + H+S
Sbjct: 65 DETNKKTRLFEHVEGNYPTFIYFKVPTKSRNDIKELIEQVKEIGNEINIKQDTETCFHIS 124
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN-SIEIFCNEEKTRSFIALGAN-SCKT 118
+S+T I H I+T + L L++ + I+ + +F N+ ++R F+++ N S K+
Sbjct: 125 ISRTFPIREHHIETFTQELKKTLKNQRSIDIQLSKECCVFINDNQSRIFLSIPINQSFKS 184
Query: 119 SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
++ +++ +D FK P YY+ P H SI+W L
Sbjct: 185 NILKLIERIDSCLSLFKFPKYYDNPEPHLSISWSL 219
>gi|115447621|ref|NP_001047590.1| Os02g0651000 [Oryza sativa Japonica Group]
gi|113537121|dbj|BAF09504.1| Os02g0651000 [Oryza sativa Japonica Group]
Length = 188
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 6/146 (4%)
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
E H+SL +T+ I H I++LV L R R + FN E F N++ TRSF++L S
Sbjct: 38 EFHVSLGRTVAIQVHQIESLVAMLRQKFRSQQRYWMDFNKWEHFVNDDCTRSFLSLEVTS 97
Query: 116 CKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFT 172
T L I + VD + LP +Y+ P H S+AW L D + LK + +++ +
Sbjct: 98 --TGLPEISKQITMVDDVYRLHGLPEFYKNPRPHISLAWALGDVSCKLKQAIKEIEKSQS 155
Query: 173 QFKLTSDESFHV-VTHIHMKTGNKFY 197
+ + +H+ K G K Y
Sbjct: 156 SLGTSQISNLRCKFSHVVCKIGKKVY 181
>gi|441597741|ref|XP_003263042.2| PREDICTED: putative U6 snRNA phosphodiesterase-like [Nomascus
leucogenys]
Length = 150
Score = 69.7 bits (169), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 86 LNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNF 145
+R N ++I+ N+EKTR+FI L S +V VD+ +EF L T+Y++P+F
Sbjct: 43 FHRFFFTANQVKIYTNQEKTRTFIGLEVTSGHAQFLDLVSEVDRVMEEFDLTTFYQDPSF 102
Query: 146 HASIAWCLQDKTATLK-PLLTKLDNIFTQFK 175
H S+AWC+ D L+ L +L I F+
Sbjct: 103 HLSLAWCVGDARLQLEGQCLQELQAIVDGFE 133
>gi|118373048|ref|XP_001019718.1| hypothetical protein TTHERM_00136370 [Tetrahymena thermophila]
gi|89301485|gb|EAR99473.1| hypothetical protein TTHERM_00136370 [Tetrahymena thermophila
SB210]
Length = 289
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 87/203 (42%), Gaps = 11/203 (5%)
Query: 10 RIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGIS------VEVIPEP-HLSLS 62
+ R PH +A +YI + LA+L N V V++ P+ H+SLS
Sbjct: 86 KTRKVPHIDGQFACYIYIDVNVGLAQLVKAQSNFKNRVNRDYPDYQFVDIEPDNFHISLS 145
Query: 63 KTLVIPYHWIDTLVETLGNN-LRHLNR--LTIKFNSIEIFCNEEKTRSFIALGANSCKTS 119
KT + H I+ + +L L+ + L+I N +++F NE++ R F+A K
Sbjct: 146 KTFYLNRHQIEPFMSSLREKYLKSFQQISLSIDLNKLKVFSNEDRNRFFVAASVGQGKNI 205
Query: 120 LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSD 179
+ IV +D Q + L T+++ H S+ W T + L DN F T
Sbjct: 206 VKDIVNQIDNCLQAYGLDTFFKGNKHHVSLLWTNNKATQNAEFKLYVKDNRFNS-SGTDG 264
Query: 180 ESFHVVTHIHMKTGNKFYSFPLT 202
+ V+ + K G + + L
Sbjct: 265 QVIFDVSEVQCKIGKRINTIKLN 287
>gi|426195878|gb|EKV45807.1| hypothetical protein AGABI2DRAFT_152066 [Agaricus bisporus var.
bisporus H97]
Length = 293
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 72/158 (45%), Gaps = 4/158 (2%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQ-TNLARLYAMLKEELNSVGISVEVIP--EP 57
+DNP+ H GR RS PH +WA VY+ + T LY++L+ + + E
Sbjct: 44 IDNPSLHQGRTRSTPHTDGNWAAHVYVSITITKSHTLYSLLETAVAEAKRQTDRKHQLEL 103
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
H+SL++ I H + + + L + + F S N+E TR+F+ + +
Sbjct: 104 HISLTRPFSIRAHQKEEFRQAI-RKLAKCSPFALSFTSFAELRNDEHTRTFLVMEIGAGH 162
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
L + + + + YY P FHAS+AW L D
Sbjct: 163 HELNRLCCDLKPLIESLRQRAYYARPRFHASVAWALLD 200
>gi|336369780|gb|EGN98121.1| hypothetical protein SERLA73DRAFT_91348 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382552|gb|EGO23702.1| hypothetical protein SERLADRAFT_469918 [Serpula lacrymans var.
lacrymans S7.9]
Length = 328
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 76/180 (42%), Gaps = 29/180 (16%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIP----- 55
+DNP H GRIR+ PH +A +Y+PL +L E +++ + +V+P
Sbjct: 43 IDNPVLHQGRIRTVPHVDGQYAAYIYVPLILEPTNPLNLLIE--DALNYTKQVVPSSYEI 100
Query: 56 ----------------------EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF 93
E H+SLS+ + + H D L + + + F
Sbjct: 101 GIQDLRENVSTHHTPGTVLHHREFHISLSRPIFLRAHQRDELKRAIKTIAMNHSPFEASF 160
Query: 94 NSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
N+E+TR+F+ + L S+ QA+ + + +Y EP FHASIAW L
Sbjct: 161 AMFTELTNDERTRTFLTAEVGAGHQELRSMSQALTPVLKAIRQKEFYVEPRFHASIAWAL 220
>gi|452001968|gb|EMD94427.1| hypothetical protein COCHEDRAFT_1093137 [Cochliobolus
heterostrophus C5]
Length = 331
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 106/260 (40%), Gaps = 64/260 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY---IPLQT------NLARLYAMLKEELNSVGISVE 52
DNP+ HGGR R+ PH + +W + VY IP Q NL + + E N+ +
Sbjct: 60 DNPDLHGGRRRAVPHVQGNWPSHVYLEWIPTQRESIALLNLIQHVKSVLELENTKRVKKL 119
Query: 53 VIPEP---------------HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNS 95
+PE H+SLS+TL I +T ++TLG +LR + +F
Sbjct: 120 PVPEDITPSLQSDLGVPLPLHVSLSRTLQIKTEDRETFLDTLGTSLRRCAVPAFNFEFQG 179
Query: 96 IEIFCNEEKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYE------------- 141
++ N E+ R F+ L + L +++ A +++A+ P Y
Sbjct: 180 LKWVPNFERNRWFLVLAIKRPENDELNTLLHACNQAAKNTGHPALYTGGAGDGPMEDVDH 239
Query: 142 -------------------EPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESF 182
P FH S+AW L + A L+ K+D T++ T D F
Sbjct: 240 EHRPKRRKVDKSDPQMHDYSPYFHVSVAWNLTEPDAEWTALIQKID--ATEYIQTPDAVF 297
Query: 183 HVVTHIHMKTGNKFYSFPLT 202
V ++ GN ++ PL
Sbjct: 298 DAVK---VRIGNAVHNIPLA 314
>gi|358255935|dbj|GAA57538.1| UPF0406 protein, partial [Clonorchis sinensis]
Length = 124
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 54/97 (55%), Gaps = 13/97 (13%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLN------------RLTIKFNSIEIFCNEEKT 105
H+SLSKT + YHWID+L + L N ++++ R I+F+ +E+F NEE++
Sbjct: 28 HVSLSKTWPLRYHWIDSLADKLRNTFQNISRYLVTVMPLIFIRFDIRFSGLELFINEERS 87
Query: 106 RSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYE 141
RSFI L + L IV +VD+ F P YY+
Sbjct: 88 RSFIGLTLSQESSEHLKPIVASVDQCVHAFCGPGYYK 124
>gi|85113890|ref|XP_964600.1| hypothetical protein NCU02073 [Neurospora crassa OR74A]
gi|28926388|gb|EAA35364.1| predicted protein [Neurospora crassa OR74A]
Length = 481
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 111/252 (44%), Gaps = 54/252 (21%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI-----------------PLQTNLARLYAMLKEEL 44
D+P+ H GR R PH +W + VYI LQT +A A +L
Sbjct: 220 DDPSLHQGRQRQVPHIPGNWPSHVYIDWDPSSGDRELLSSLVDKLQTRVA-AAAQRYPDL 278
Query: 45 NSVGISVEV------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFN- 94
V IS + + +P H+SLS L + D ++ + LR ++ + F+
Sbjct: 279 EGVKISTALRDPELPVDKPLHISLSAPLTLTSKNKDAFLDDVTRALRSSGVSPFVVDFSG 338
Query: 95 SIEIFCNEEKTRSFIAL--------GANSCKTS----LTSIVQAVDKSAQEFKLPTYYEE 142
+ + +EE TRSF+ L G + +S LT+++Q +K+A+E+ P Y+
Sbjct: 339 GVNWYRSEESTRSFLVLRVREVQNTGMTTADSSPNPRLTTLLQRCNKTAKEYGQPPLYDS 398
Query: 143 PN----FHASIAWCLQDKTATLKPLLTKL---------DNIFTQFKLTSDESFHVVTHIH 189
+ FH +IAW + +LK L + +N+ + KL + SF V T +
Sbjct: 399 QDMGYRFHVTIAWTHARPSESLKQLTDSIFDDCKTMYSENMSIRDKLCTGSSFRVET-VK 457
Query: 190 MKTGNKFYSFPL 201
+K GN F L
Sbjct: 458 VKIGNHVTRFEL 469
>gi|442580992|sp|Q7SEZ0.2|USB1_NEUCR RecName: Full=Putative U6 snRNA phosphodiesterase
Length = 364
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 111/252 (44%), Gaps = 54/252 (21%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI-----------------PLQTNLARLYAMLKEEL 44
D+P+ H GR R PH +W + VYI LQT +A A +L
Sbjct: 103 DDPSLHQGRQRQVPHIPGNWPSHVYIDWDPSSGDRELLSSLVDKLQTRVA-AAAQRYPDL 161
Query: 45 NSVGISVEV------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFN- 94
V IS + + +P H+SLS L + D ++ + LR ++ + F+
Sbjct: 162 EGVKISTALRDPELPVDKPLHISLSAPLTLTSKNKDAFLDDVTRALRSSGVSPFVVDFSG 221
Query: 95 SIEIFCNEEKTRSFIAL--------GANSCKTS----LTSIVQAVDKSAQEFKLPTYYEE 142
+ + +EE TRSF+ L G + +S LT+++Q +K+A+E+ P Y+
Sbjct: 222 GVNWYRSEESTRSFLVLRVREVQNTGMTTADSSPNPRLTTLLQRCNKTAKEYGQPPLYDS 281
Query: 143 PN----FHASIAWCLQDKTATLKPLLTKL---------DNIFTQFKLTSDESFHVVTHIH 189
+ FH +IAW + +LK L + +N+ + KL + SF V T +
Sbjct: 282 QDMGYRFHVTIAWTHARPSESLKQLTDSIFDDCKTMYSENMSIRDKLCTGSSFRVET-VK 340
Query: 190 MKTGNKFYSFPL 201
+K GN F L
Sbjct: 341 VKIGNHVTRFEL 352
>gi|358055038|dbj|GAA98807.1| hypothetical protein E5Q_05495 [Mixia osmundae IAM 14324]
Length = 211
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 84/195 (43%), Gaps = 18/195 (9%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLV 66
H GR+R+ PH + ++ + + + P LSLSKTL
Sbjct: 30 HQGRVRTVPH----------------VDGIFPSQQCQAAHPDVRATDGPSLRLSLSKTLY 73
Query: 67 IPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQA 126
+ H D + + + + ++ I F+ + NE+KTRSF+A + +LT+I+
Sbjct: 74 LRAHETDKVRDAVRSIAAKISSFEISFDKLIRLTNEDKTRSFLARRVAAGAENLTAILAE 133
Query: 127 VDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLL--TKLDNIFTQFKLTSDESFHV 184
+ ++ +LP +Y+ P FHAS AW + + T+LD + + H
Sbjct: 134 LHVVLRKLRLPLFYDPPIFHASFAWRVLLNAEQVGNAFSDTELDEANAEHGSALRKDIHS 193
Query: 185 VTHIHMKTGNKFYSF 199
VT + K N SF
Sbjct: 194 VTELCFKCRNDVQSF 208
>gi|388579005|gb|EIM19335.1| hypothetical protein WALSEDRAFT_34109 [Wallemia sebi CBS 633.66]
Length = 248
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 81/169 (47%), Gaps = 20/169 (11%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY--IPLQTNLARLYAMLKEELNSVGISVEVIPEP-- 57
DNP+EH GR+RS P + + T V+ IPL +L + + ++ + P
Sbjct: 31 DNPSEHQGRVRSQPFREGVYYTHVHLSIPLDYKFKKLLEEIYNDFKGKYNNIHPLFNPDE 90
Query: 58 ---------HLSLSKTLVIPYHWI---DTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
++SLS+ L + I + +E++ NN + +T+ F+ I++ N+ KT
Sbjct: 91 DIDDDDEYFYISLSRPLGLRNFQIKPFNKAIESITNNHK---AITVSFSDIDVLFNDNKT 147
Query: 106 RSFIALGANSCKTSLTSIVQAVDKS-AQEFKLPTYYEEPNFHASIAWCL 153
R+FI L + L +++ +D +E K YY+ H SIAW L
Sbjct: 148 RAFIVLPIVAGYQDLNKLLRDIDNGPVRENKQEPYYDPAVLHTSIAWML 196
>gi|390370265|ref|XP_001189097.2| PREDICTED: UPF0406 protein C16orf57 homolog, partial
[Strongylocentrotus purpuratus]
Length = 207
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 62/139 (44%), Gaps = 34/139 (24%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQT-NLAR----LYAMLKEELN---SVGISVE 52
M+NP EH GRIRSF H +WAT VYIP T +LA L L ++L S + +
Sbjct: 69 MNNPTEHHGRIRSFAHTPGNWATFVYIPADTPSLASLTETLMTCLPQDLTFHPSDDLHLS 128
Query: 53 VIPEP--------------------------HLSLSKTLVIPYHWIDTLVETLGNNLRHL 86
+ P+ HLSLS+T+ + +HWID ++ + +
Sbjct: 129 LSPDTPSLASLTETLMTCLPQDLTFHPSDDLHLSLSRTVCLQFHWIDPFTQSFRERVSGM 188
Query: 87 NRLTIKFNSIEIFCNEEKT 105
++++ N+E T
Sbjct: 189 RSFQCHIEQVDVYANDEGT 207
>gi|330842346|ref|XP_003293141.1| hypothetical protein DICPUDRAFT_157936 [Dictyostelium purpureum]
gi|325076568|gb|EGC30344.1| hypothetical protein DICPUDRAFT_157936 [Dictyostelium purpureum]
Length = 319
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 77/152 (50%), Gaps = 6/152 (3%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPL---QTNLARLYAMLKEELNSVGISVEVIPEPHLSLSK 63
RIR F H ++ + +++ + + +++ L +KE N VGI E + + H+SLSK
Sbjct: 112 ENKRIRLFEHVDGNYPSFIFLKIPKYRNDISDLINQVKEIGNQVGIR-EDLNDYHISLSK 170
Query: 64 TLVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNEEKTRSFIALGAN-SCKTSLT 121
T + H I++ L L++ IK + F NE ++R F+++ + K +
Sbjct: 171 TFPMREHHIESFSNELRKILKNQRPFNIKLADHCCSFINENQSRIFLSVPIDIKFKNPVL 230
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
+++ +D Q FK P YY EP H SI+ L
Sbjct: 231 KLIERIDNCLQLFKFPKYYNEPEPHLSISSDL 262
>gi|336463436|gb|EGO51676.1| hypothetical protein NEUTE1DRAFT_104714 [Neurospora tetrasperma
FGSC 2508]
Length = 436
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 108/251 (43%), Gaps = 53/251 (21%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI-----------------PLQTNLARLYAMLKEEL 44
D+P+ H GR R PH +W + VYI LQ +A A +L
Sbjct: 176 DDPSLHQGRQRQVPHIPGNWPSHVYIDWDPSSADRELLSSLVDKLQAEVA-AAAQRYPDL 234
Query: 45 NSVGISVEV------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFN- 94
V IS + + +P H+SLS + + D ++ + LR ++ + F+
Sbjct: 235 EGVKISTALRDPDLPVDKPLHISLSAPITLTSKNKDAFLDDVTRALRSSGVSPFVVDFSG 294
Query: 95 SIEIFCNEEKTRSFIAL-----------GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEP 143
++ + +EE TRSF+ L +S LT+++Q +K+ +E+ P Y+
Sbjct: 295 GVDWYRSEESTRSFLVLRVREVQNTGTTADSSPNPRLTTLIQRCNKTVKEYGQPPIYDSQ 354
Query: 144 N----FHASIAWCLQDKTATLKPLLTKL---------DNIFTQFKLTSDESFHVVTHIHM 190
+ FH +IAW + +LK L + +N+ + KL + SF V T + +
Sbjct: 355 DMGYRFHVTIAWTHARPSESLKQLTDSIFDDCKTMYSENMSIREKLRTGSSFRVET-VKV 413
Query: 191 KTGNKFYSFPL 201
K GN F L
Sbjct: 414 KIGNHVTRFEL 424
>gi|67481677|ref|XP_656188.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473375|gb|EAL50804.1| hypothetical protein EHI_012200 [Entamoeba histolytica HM-1:IMSS]
gi|449702646|gb|EMD43245.1| Hypothetical protein EHI5A_020570 [Entamoeba histolytica KU27]
Length = 227
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 5/143 (3%)
Query: 9 GRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIP 68
GRI+ PH+ + VYI + + + EE N + I H+SL+K +
Sbjct: 46 GRIQQRPHKIGEYPGSVYIEIPIEIKN---KIMEETNQFSQDFKPIQSLHISLTKEFSLR 102
Query: 69 YHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVD 128
H I V+ + ++ L TI F +E+ N E+ F+++ S + + S++ +D
Sbjct: 103 EHQIPLFVQEVRKKIKRLPTFTITFGQLELLLNPEQNTEFLSIQVTSPE--ILSLIDLLD 160
Query: 129 KSAQEFKLPTYYEEPNFHASIAW 151
F L YYEE H+S+ +
Sbjct: 161 TVMMSFNLEKYYEERKIHSSLMY 183
>gi|389641389|ref|XP_003718327.1| hypothetical protein MGG_11511 [Magnaporthe oryzae 70-15]
gi|351640880|gb|EHA48743.1| hypothetical protein MGG_11511 [Magnaporthe oryzae 70-15]
Length = 294
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 97/226 (42%), Gaps = 28/226 (12%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTN---LARLYAMLKEELNSVGISVEVI- 54
D+P H GR R PH +W + +YI P Q L+ L + L+ + + VG +
Sbjct: 57 DDPALHHGRSRQIPHVVGNWPSHIYIEWFPSQAQCNVLSSLVSALRSDDHQVGAELHSFL 116
Query: 55 ------PEP-HLSLSKTLVIPYHWIDTLVETLGN--NLRHLNRLTIKFNSIEIFCNEEKT 105
P P H+SLS+ V+ D + + LR +R ++ +++ + +
Sbjct: 117 TSDLGTPLPLHISLSRPFVLTTEQKDVFLSRVPGALRLRSFSRFAVRPSALSWHRSPDSN 176
Query: 106 RSFIALGAN-SCKT---SLTSIVQAVDKSAQEFKLPTYYEEPN-----FHASIAWCLQDK 156
R+F+ L SC LT +++ ++ +EF P Y + FH S+AW D
Sbjct: 177 RAFLVLRVQESCDGGNLGLTRLLERCNEVVREFGQPELYSDGGRVMDRFHLSVAWSFVDV 236
Query: 157 TATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFYSFPLT 202
T +L+ ++ D + F+ + + +K GN S L
Sbjct: 237 TGSLQ---SRTDVAYENFRDQVAAMEIPIESVKVKIGNIITSLSLA 279
>gi|451853687|gb|EMD66980.1| hypothetical protein COCSADRAFT_110326 [Cochliobolus sativus
ND90Pr]
Length = 343
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 104/260 (40%), Gaps = 64/260 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY---IPLQT------NLARLYAMLKEELNSVGISVE 52
DNP+ HGGR R+ PH + +W + VY IP Q NL + + E N+ +
Sbjct: 72 DNPDLHGGRRRAVPHIQGNWPSHVYLEWIPTQRESIALLNLIQHVKTVFELENTKRVKKL 131
Query: 53 VIPEP---------------HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNS 95
+PE H+SLS+TL I +T ++TLG +LR + +F
Sbjct: 132 PVPEDITPSLQSDLGVPLPLHVSLSRTLQIKTEDRETFLDTLGASLRRCAVPAFNFEFQG 191
Query: 96 IEIFCNEEKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYE------------- 141
++ N E+ R F+ L + L +++ A +++A+ P Y
Sbjct: 192 LKWVPNFERNRWFLVLAIKRPENDELNTLLHACNQAAKNTGHPALYTSGAGDGPMEDVDH 251
Query: 142 -------------------EPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESF 182
P FH SIAW L + A L+ ++D T++ T
Sbjct: 252 NHRPKRRKVDKNDPQTHDYSPYFHVSIAWNLTEPDAEWTALIERID--ATEYIQTPGAML 309
Query: 183 HVVTHIHMKTGNKFYSFPLT 202
V ++ GN ++ PL
Sbjct: 310 DAVK---VRIGNAVHNIPLA 326
>gi|350297347|gb|EGZ78324.1| hypothetical protein NEUTE2DRAFT_124833 [Neurospora tetrasperma
FGSC 2509]
Length = 483
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 108/251 (43%), Gaps = 53/251 (21%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI-----------------PLQTNLARLYAMLKEEL 44
D+P+ H GR R PH +W + VYI LQ +A A +L
Sbjct: 223 DDPSLHQGRQRQVPHIPGNWPSHVYIDWDPSSADRELLSSLVDKLQAEVA-AAAQRYPDL 281
Query: 45 NSVGISVEV------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFN- 94
V IS + + +P H+SLS + + D ++ + LR ++ + F+
Sbjct: 282 EGVKISTALRDPDLPVDKPLHISLSAPITLTPKNKDAFLDDVTRALRSSGVSPFVVDFSG 341
Query: 95 SIEIFCNEEKTRSFIAL-----------GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEP 143
++ + +EE TRSF+ L +S LT+++Q +K+ +E+ P Y+
Sbjct: 342 GVDWYRSEESTRSFLVLRVREVQNTGTTADSSPNPRLTTLIQRCNKTVKEYGQPPLYDSQ 401
Query: 144 N----FHASIAWCLQDKTATLKPLLTKL---------DNIFTQFKLTSDESFHVVTHIHM 190
+ FH +IAW + +LK L + +N+ + KL + SF V T + +
Sbjct: 402 DMGYRFHVTIAWTHARPSESLKQLTDSIFDDCKTMYSENMSIREKLRTGSSFRVET-VKV 460
Query: 191 KTGNKFYSFPL 201
K GN F L
Sbjct: 461 KIGNHVTRFEL 471
>gi|384253602|gb|EIE27076.1| hypothetical protein COCSUDRAFT_45724 [Coccomyxa subellipsoidea
C-169]
Length = 202
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 71/145 (48%), Gaps = 9/145 (6%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCK 117
HLSLS+ + + Y I L+ +L +L R + F +E F N++KTRSF+++ +
Sbjct: 56 HLSLSRVVAVHYPQIQPLIASLKQHLSKTQRFKVSFGRLEAFENDDKTRSFLSILVDQGF 115
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLT 177
+ V+ +++ E L ++++P H S+ W L +T + L+T + T L
Sbjct: 116 DQVCRAVRRTNRAFAEHGLQQFHKDPRPHVSLMWAL---GSTSQRLVTLSQEVQTGLGLA 172
Query: 178 SDESFHV----VTHIHMKTGNKFYS 198
E H V+ I + G + Y+
Sbjct: 173 LQE--HPWECNVSKIECRVGQRIYA 195
>gi|145344938|ref|XP_001416981.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577207|gb|ABO95274.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 210
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 70/156 (44%), Gaps = 12/156 (7%)
Query: 7 HGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPE------PHLS 60
R R+F H ++AT V L + + G +E + + H+S
Sbjct: 2 RAARKRTFEHVEGNYATHVRARAGAATRATEKTLDAFVEAFG-ELEKVSDLKREDGAHVS 60
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG-----ANS 115
LS+T + L L LR + + F+++ +F N++ +R+F+A G ++
Sbjct: 61 LSRTFACVKGDWERLFGNLRRELRDMEAFEVVFDAMRVFTNDDGSRAFVAAGFREGSTSA 120
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAW 151
K +L ++ VDK+ P YY++P+ H S+ W
Sbjct: 121 SKVALVRAIERVDKAIVPLGFPKYYDDPDPHVSLLW 156
>gi|407036577|gb|EKE38242.1| hypothetical protein ENU1_172740 [Entamoeba nuttalli P19]
Length = 227
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 5/143 (3%)
Query: 9 GRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIP 68
GRI+ PH+ + VYI + + + EE N + I H+SL+K +
Sbjct: 46 GRIQQRPHKIGEYPGSVYIEIPIEIKN---KIMEETNQFSQDFKPIQSLHISLTKEFSLR 102
Query: 69 YHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVD 128
H I V+ + ++ TI F +E+ N E+ F+++ S + + S++ +D
Sbjct: 103 EHQIPLFVQEVRKKIKRFPTFTITFGQLELLLNPEQNTEFLSIQVTSPE--ILSLIDLLD 160
Query: 129 KSAQEFKLPTYYEEPNFHASIAW 151
F L YYEE H+S+ +
Sbjct: 161 TVMMSFNLEKYYEERKIHSSLMY 183
>gi|296421635|ref|XP_002840370.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636585|emb|CAZ84561.1| unnamed protein product [Tuber melanosporum]
Length = 326
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 81/191 (42%), Gaps = 38/191 (19%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQ---------------------TNLARLYAM 39
D+P+ HGGR RS PH + +W T ++I T RL +
Sbjct: 72 QDDPSLHGGRKRSIPHIQGNWPTHIFIEWHLSRSEFDVLNGAYHTASTITATAGVRLESH 131
Query: 40 LKEELNSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSI 96
LK +L S +P HLSLS+ ++ + +E L + L + +++F
Sbjct: 132 LKSDLGS--------EQPLHLSLSRPNILTTAQREGFLELLKDRLDKTRIKPFSVEFTGF 183
Query: 97 EIFCNEEKTRSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEE-----PNFHASIA 150
E N ++TR F L A + L ++Q + + + F+ P Y E FH S+A
Sbjct: 184 EFVSNNDRTRWFFVLRATKGDDAQLPRLLQLANHTFEAFEQPPLYTENGDKLDGFHVSLA 243
Query: 151 WCLQDKTATLK 161
W L + +K
Sbjct: 244 WSLTEPDKDVK 254
>gi|390594253|gb|EIN03666.1| hypothetical protein PUNSTDRAFT_139379 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 337
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 87/181 (48%), Gaps = 26/181 (14%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY----IPLQTNLAR-LYAMLKEELNSVGI------- 49
D+P+ H GR R+ P + VY IP+++ LA+ L A +K+ +V I
Sbjct: 46 DDPSLHQGRKRTVPFVEGQFCAYVYVPIFIPVKSGLAKVLKAAIKDAKTAVPILHPIDRL 105
Query: 50 SVEVIP--------EPHLSLSKTLVIPYHWIDTL---VETLGNNLRHLNRLTIKFNSIEI 98
S E E H+SL++ + + H + L V++L ++ R + KF+ ++
Sbjct: 106 SSESDAASSGGDELELHVSLTRPIYLRAHQREELKCAVKSLAHSHRKFHASFAKFDELQ- 164
Query: 99 FCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTA 158
N+E TR+F+ + +L ++ + + + T+Y++P +H SIAW L D
Sbjct: 165 --NDEATRTFVVAEVGAGWDNLKALTSVLSPAISALRQSTFYDQPRYHISIAWALLDGAK 222
Query: 159 T 159
T
Sbjct: 223 T 223
>gi|440466776|gb|ELQ36020.1| hypothetical protein OOU_Y34scaffold00669g5 [Magnaporthe oryzae
Y34]
gi|440480260|gb|ELQ60934.1| hypothetical protein OOW_P131scaffold01213g6 [Magnaporthe oryzae
P131]
Length = 280
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 92/213 (43%), Gaps = 16/213 (7%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP-HLS 60
D+P H GR R PH +W + +YI + L + EL+S S P P H+S
Sbjct: 57 DDPALHHGRSRQIPHVVGNWPSHIYIEF-SALRSDDHQVGAELHSFLTSDLGTPLPLHIS 115
Query: 61 LSKTLVIPYHWIDTLVETLGN--NLRHLNRLTIKFNSIEIFCNEEKTRSFIALGAN-SCK 117
LS+ V+ D + + LR +R ++ +++ + + R+F+ L SC
Sbjct: 116 LSRPFVLTTEQKDVFLSRVPGALRLRSFSRFAVRPSALSWHRSPDSNRAFLVLRVQESCD 175
Query: 118 T---SLTSIVQAVDKSAQEFKLPTYYEEPN-----FHASIAWCLQDKTATLKPLLTKLDN 169
LT +++ ++ +EF P Y + FH S+AW D T +L+ ++ D
Sbjct: 176 GGNLGLTRLLERCNEVVREFGQPELYSDGGRVMDRFHLSVAWSFVDVTGSLQ---SRTDV 232
Query: 170 IFTQFKLTSDESFHVVTHIHMKTGNKFYSFPLT 202
+ F+ + + +K GN S L
Sbjct: 233 AYENFRDQVAAMEIPIESVKVKIGNIITSLSLA 265
>gi|159463854|ref|XP_001690157.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284145|gb|EDP09895.1| predicted protein [Chlamydomonas reinhardtii]
Length = 261
Score = 59.3 bits (142), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 56/112 (50%), Gaps = 2/112 (1%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
H+SLS+++ I I+ L L L + + + F N+E +RSF++ +
Sbjct: 63 HISLSRSVPITRAQIEPLTTQLAARLEAAGIGAFPLTLCGLRTFANDEGSRSFVSAMVAT 122
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKL 167
+ + +V+AVD + + LP +Y+EP H S+ W + D+ ++ L +
Sbjct: 123 GEREVVGLVRAVDGAFEAHGLPPFYQEPLPHVSVGWLVGDQRPRIQAALDRF 174
>gi|393243051|gb|EJD50567.1| hypothetical protein AURDEDRAFT_160468 [Auricularia delicata
TFB-10046 SS5]
Length = 282
Score = 59.3 bits (142), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 87/229 (37%), Gaps = 30/229 (13%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQ----TNLARLYAMLKEELNS---------- 46
+D+P EH GRIR+ H +A VY ++ LA A + +
Sbjct: 52 VDDPTEHQGRIRTHRHVDGLYAAYVYAAVRLEDAPRLAEFIARATQHAKACLPDLHFECA 111
Query: 47 ----VGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
+ + E H+SLS+ + I + D L + + F + N+
Sbjct: 112 TDADLHADDKGAAELHISLSRPIYISANQRDALKRVVRDIAAKHRPFRASFAEFAVLEND 171
Query: 103 EKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKP 162
E+ R+F+ + I + +D S YY EP FHAS AW L + L
Sbjct: 172 ERARAFLVAEIGAGHQDFVKITREIDASLVSMGHEPYYAEPRFHASFAWWLPQCSTELPS 231
Query: 163 LLTK-----LDNIFTQFKLTSDESFHV----VTHIHMKTGNKFYSFPLT 202
T LD + +F SD+ V V I ++ G S+ LT
Sbjct: 232 STTHTGSELLDELRARF---SDDLRKVGTVDVGGISLRIGKAVDSWALT 277
>gi|426382358|ref|XP_004057774.1| PREDICTED: putative U6 snRNA phosphodiesterase isoform 3 [Gorilla
gorilla gorilla]
Length = 186
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 49/83 (59%), Gaps = 8/83 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETL 79
HLSLS+++V+ +HWI V+ L
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQAL 141
>gi|242063078|ref|XP_002452828.1| hypothetical protein SORBIDRAFT_04g033290 [Sorghum bicolor]
gi|241932659|gb|EES05804.1| hypothetical protein SORBIDRAFT_04g033290 [Sorghum bicolor]
Length = 195
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 58/117 (49%), Gaps = 5/117 (4%)
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
E H+SL + + I H ID+ + L + R ++FN E F N++ TRSF++L
Sbjct: 45 EFHVSLGRPVAIQVHQIDSFIAMLRQKFQTQQRYWMEFNKWEHFVNDDCTRSFLSLEVT- 103
Query: 116 CKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDN 169
+T L I + VD+ + LP +Y P H S W L D ++ LK + ++
Sbjct: 104 -RTGLPEIRKQILMVDEVYRLHGLPEFYTNPRPHISFVWALGDVSSKLKQAIKDIEK 159
>gi|332846038|ref|XP_003315167.1| PREDICTED: UPF0406 protein C16orf57 homolog isoform 2 [Pan
troglodytes]
Length = 186
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 49/83 (59%), Gaps = 8/83 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + +E
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMEAF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETL 79
HLSLS+++V+ +HWI V+ L
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQAL 141
>gi|413938026|gb|AFW72577.1| hypothetical protein ZEAMMB73_211143 [Zea mays]
Length = 188
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 58/117 (49%), Gaps = 5/117 (4%)
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
E H+SL + + + H ID+ + L + + ++FN E F N++ TRSF++L
Sbjct: 38 EFHVSLGRPVAVQVHQIDSFIAMLRQKFQPQQQYWMEFNKWEHFVNDDCTRSFVSLEVT- 96
Query: 116 CKTSLTSIVQA---VDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDN 169
+T L I + VD+ + LP +Y P H S+ W L D + LK L ++
Sbjct: 97 -RTGLPEITRQILMVDEVYRLHGLPEFYTNPRPHISLVWALGDVSGKLKQALKDIEK 152
>gi|325995161|ref|NP_001191840.1| putative U6 snRNA phosphodiesterase isoform 3 [Homo sapiens]
gi|194376888|dbj|BAG63005.1| unnamed protein product [Homo sapiens]
Length = 186
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 50/83 (60%), Gaps = 8/83 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVG--ISVEVIPE 56
D+ +HGGR+R+FPH+R +WAT VY+P + L L +L V + ++V
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMKVF-- 119
Query: 57 PHLSLSKTLVIPYHWIDTLVETL 79
HLSLS+++V+ +HWI V+ L
Sbjct: 120 -HLSLSQSVVLRHHWILPFVQAL 141
>gi|307105830|gb|EFN54078.1| hypothetical protein CHLNCDRAFT_136195 [Chlorella variabilis]
Length = 361
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 57/150 (38%), Gaps = 49/150 (32%)
Query: 55 PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIAL--- 111
P H+SLS+T+ + H I+ LV L LR I+ E+FCN++++R+F++L
Sbjct: 155 PRYHISLSRTVPLRLHQIEPLVADLRKRLRQQETFCIRMGRAEVFCNDDRSRTFLSLRAC 214
Query: 112 -----GANSC-----------------------------------------KTSLTSIVQ 125
G + C L +
Sbjct: 215 GVGGVGDSGCCPVADAAQAADAAQQQQREGGEEAGGEVRGPAEAGAGPQPYSRQLVGLSH 274
Query: 126 AVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
AV + LP +Y EP HAS+AW L D
Sbjct: 275 AVSRVFAAHALPRFYAEPRPHASVAWVLGD 304
>gi|409078970|gb|EKM79332.1| hypothetical protein AGABI1DRAFT_128490 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 320
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 77/185 (41%), Gaps = 31/185 (16%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQ-TNLARLYAMLKEELNSVGISVEVIP---- 55
+D+P+ H GR RS PH +WA VY+ + T LY++L+ + +V +
Sbjct: 44 IDDPSLHQGRTRSTPHTDGNWAAHVYVSITITKSHTLYSLLETAVAEAKRAVPTLNSFTS 103
Query: 56 ---------EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTR 106
E H+SL++ I H + + + L + + F S N+E TR
Sbjct: 104 GQTDRKHQLELHISLTRPFFIRAHQKEEFRQAI-RKLAKCSPFALSFTSFAELHNDEHTR 162
Query: 107 SFIAL--GANSCK--------------TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
+F+ + GA + + L + + + + YY P FHAS+A
Sbjct: 163 TFLVMEIGAGHHEVRFIPPPGLWPYHFSQLNRLCCDLKPLIESLRQRAYYARPRFHASVA 222
Query: 151 WCLQD 155
W L D
Sbjct: 223 WALLD 227
>gi|297459234|ref|XP_002684567.1| PREDICTED: UPF0406 protein C16orf57 homolog [Bos taurus]
gi|297489828|ref|XP_002697877.1| PREDICTED: UPF0406 protein C16orf57 homolog [Bos taurus]
gi|296473800|tpg|DAA15915.1| TPA: hypothetical protein BOS_22396 [Bos taurus]
Length = 179
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 1/89 (1%)
Query: 107 SFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLK-PLLT 165
+F+ L S +V VD+ +EF L T+Y++P+FH S+AWC+ D ++ P L
Sbjct: 75 TFVGLEVTSGHAHFLDLVAEVDRVMEEFDLSTFYQDPSFHISLAWCVGDARLQMEGPCLQ 134
Query: 166 KLDNIFTQFKLTSDESFHVVTHIHMKTGN 194
+L I +F+ + I K+GN
Sbjct: 135 ELQGIVDEFEDSEMLLRAYAEQIRCKSGN 163
>gi|298708847|emb|CBJ30805.1| similar to Chromosome 16 open reading frame 57 [Ectocarpus
siliculosus]
Length = 292
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 68/180 (37%), Gaps = 49/180 (27%)
Query: 11 IRSFPHQRNSWATLVYIPLQTNLA--------------RLYAMLKE-------------- 42
+R FPH+R +W T V+IP+ + A RL +E
Sbjct: 74 VRRFPHERGNWPTHVFIPVPNSAAFKSMAEASVSHFRQRLATAWREGHGRAEGGKGGKKR 133
Query: 43 ------ELNSVGISVEVIPEP--------------HLSLSKTLVIPYHWIDTLVETLGNN 82
N S +P HLSLSKT + H I+ V L
Sbjct: 134 TRDGRQRSNQASASSAGKLDPPEVVMNDMDPSGRQHLSLSKTTALRAHQINPFVRGLKEA 193
Query: 83 LRHLNRLTIKF-NSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYE 141
++ T F + ++ NE+KTRSF+ L + + S++ VD + FK YYE
Sbjct: 194 VKSSRSFTASFVSGYDVLVNEDKTRSFVCLRVRGGRQMVLSLIAKVDPLMRRFKQSEYYE 253
>gi|308812510|ref|XP_003083562.1| unnamed protein product [Ostreococcus tauri]
gi|116055443|emb|CAL58111.1| unnamed protein product [Ostreococcus tauri]
Length = 207
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/97 (24%), Positives = 52/97 (53%), Gaps = 5/97 (5%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG----- 112
H+S+SK +T+ + LR + + + F+++ +F NE++TR+F+A G
Sbjct: 55 HVSMSKPFETRAEDWETMRAGVRKELRGMEAIEVTFDALRVFVNEDETRAFVAAGFREGS 114
Query: 113 ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASI 149
K +L ++ V+++ + P Y+++P+ H S+
Sbjct: 115 ERGDKRALVRAIERVNRALEPLGFPRYFDDPDPHVSL 151
>gi|453085689|gb|EMF13732.1| hypothetical protein SEPMUDRAFT_41855 [Mycosphaerella populorum
SO2202]
Length = 286
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 105/233 (45%), Gaps = 37/233 (15%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAML---------KEELNSVGI 49
D+P+ H GR R PH +W T VY+ P + L ++ K++++S+
Sbjct: 52 DDPSLHEGRKRVTPHIAGNWPTHVYLDWTPQPHDYRSLQDLITHAQKTTNCKDDIHSLLE 111
Query: 50 SVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGN--NLRHLNRLTIKFNSIEIFCNEEKTR 106
+ + +P H+SLS+ L + DT + L +L ++ +++ + NE++TR
Sbjct: 112 NDLGVRQPLHVSLSRPLALKTEHKDTFFDRLKTSISLAGVSAFSVRPLDLVWHPNEDQTR 171
Query: 107 SFIALGAN--SCKTSLTSIVQAVDKSAQEFKLPTYY--------------EEPNFHASIA 150
F+ L SC L +++ + A+++ P Y E NFH SIA
Sbjct: 172 WFLVLRLRRPSCD-ELRILLERSNDLARQYNQPLLYGHRTFSSSHVPEAWEHDNFHISIA 230
Query: 151 WCLQ--DKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
W +Q K+A+ P+ + + Q SD +F V ++ G +S PL
Sbjct: 231 WSIQPHQKSASAGPVEVVISDQLLQKLKASDLTFREVK---VRIGQDVHSIPL 280
>gi|226484596|emb|CAX74207.1| hypothetical protein [Schistosoma japonicum]
Length = 127
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 3/90 (3%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKE---ELNSVGISVEVIPEPH 58
++P+ H R R+FPH+ SWAT +YI +R+ +K +LN + + H
Sbjct: 38 EDPSRHNYRSRTFPHEPGSWATSIYIACPHFYSRIQEAIKSPIIQLNPIMNDCCAVDFLH 97
Query: 59 LSLSKTLVIPYHWIDTLVETLGNNLRHLNR 88
+SLSKT I +HWID L L + + + +
Sbjct: 98 ISLSKTWPIYFHWIDNLACNLRSAVSSIEK 127
>gi|387169517|gb|AFJ66178.1| hypothetical protein 11M19.24 [Arabidopsis halleri]
Length = 313
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 80/162 (49%), Gaps = 28/162 (17%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQTN----------LARLYAMLKE-ELNSVGISVEVI 54
E G R+R+FPH ++A VYIP+ L R+ +++ L + + ++
Sbjct: 153 EPGVRVRNFPHVDGNYALHVYIPVSIPPLPKKEIVCFLKRVASVVPHLHLVEADVPLSIL 212
Query: 55 ------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
E H+SL +++ + H I+++V L L+ R +I FN ++F N+
Sbjct: 213 CKDDQKFERALGREFHISLGRSVPLRVHQINSVVSMLRQKLQFQKRYSIDFNKWQVFVND 272
Query: 103 EKTRSFIALGANSCKTSLTSIVQAVDKSAQEFK---LPTYYE 141
+ TRSF++L + + L+ I + +D + +K LP +Y+
Sbjct: 273 DCTRSFLSLEITT--SGLSEISKQIDAVNEVYKLHNLPEFYK 312
>gi|167377788|ref|XP_001734542.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903909|gb|EDR29299.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 227
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 73/169 (43%), Gaps = 11/169 (6%)
Query: 9 GRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIP 68
GRI+ PH+ + VYI + + + E+ N + I H+SL+K +
Sbjct: 46 GRIQQRPHKIGEYPGSVYIEIPIEIR---DKIMEKTNQFSQDFKPIQSLHISLTKEFSLR 102
Query: 69 YHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVD 128
H I V+ + ++ + I F +E+ N E+ F+++ S + L +V +D
Sbjct: 103 EHQIPLFVQEVRKKMKTIPTFNITFGQLELLLNPEQNTEFLSIQVTSPEILL--LVDLLD 160
Query: 129 KSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPL--LTKLDNIFTQFK 175
F L YYEE H+S+ + +T LK L+ D FK
Sbjct: 161 TIMTSFNLEKYYEERKIHSSLMY----RTEHLKDSYDLSSFDKCLVSFK 205
>gi|71032703|ref|XP_765993.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352950|gb|EAN33710.1| hypothetical protein, conserved [Theileria parva]
Length = 260
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 73/161 (45%), Gaps = 16/161 (9%)
Query: 6 EHGGRIRSFPHQRNSWATLVYI------------PLQTNLARLYAMLKEELNSVGISVEV 53
+ +R+ PH ++ TL YI P Q + ++ + +L+S S +
Sbjct: 3 DSNSNVRNVPHVDGNYHTLCYIKEKLNNVLFSEQPTQADSDYTHSDIDMDLDSQVGSTRL 62
Query: 54 IPE-PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNEEKTRSF-IA 110
P HLSL K L + +ID+ +E L N L++L + N + I NE R F ++
Sbjct: 63 SPSYAHLSLCKPLYLRRQFIDSFLEKLKNTLQNLKPFYLILENRVSICANENLNRYFAVS 122
Query: 111 LGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
+C+ ++ I+ VD+ + F YYE+ H S A
Sbjct: 123 FVDKTCRDDTVLPIIDRVDRVLESFGFDKYYEQRKPHVSFA 163
>gi|443917557|gb|ELU38253.1| F-box-like domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 1073
Score = 53.9 bits (128), Expect = 3e-05, Method: Composition-based stats.
Identities = 43/183 (23%), Positives = 79/183 (43%), Gaps = 26/183 (14%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGIS-------VEV 53
+D+P++H GR R+ P+ + VY+P++ L A+L+ + S +E
Sbjct: 812 IDDPSKHQGRKRTIPYVEGQFIAHVYVPIKLG-GELLALLRSIVKSAQSDSTAWHSLLEP 870
Query: 54 IP---------------EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI 98
IP HLSLS+ + + H D + + + F I
Sbjct: 871 IPGTTSEQNTGPTFSIFRSHLSLSRPVPLRAHQRDDFRKEVRKAALERTQFVASFAQITT 930
Query: 99 FCNEEKTRSFIA--LGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
N++ +R+F+ +GA + S ++ Q++ + YY +P FH SIAW L
Sbjct: 931 LTNDDHSRAFLCAEVGAGHKEVSKFQALSQSLSAHLALLRQLPYYPQPRFHISIAWMLTH 990
Query: 156 KTA 158
+++
Sbjct: 991 ESS 993
>gi|326427426|gb|EGD72996.1| hypothetical protein PTSG_04705 [Salpingoeca sp. ATCC 50818]
Length = 234
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/206 (23%), Positives = 86/206 (41%), Gaps = 17/206 (8%)
Query: 10 RIRSFPHQRNSWATLVYIPLQTNLARL----YAMLKEELNSVGISVEVIP--EPHLSLSK 63
R+R FPH W T + L AM + + + V+ +P + H+SLS+
Sbjct: 24 RVRQFPHVEGQWPTFACLRLHDGEEEEDVLEAAMEVAQCITDTMKVKAVPCEDVHVSLSR 83
Query: 64 TLVIPYHWIDTLVETLGNNLRHLNRL--TIKFNSIEIFCNEEKTRSFIALGANSCKTSLT 121
T + ID+ V + + + + F + N++ +R+F +G N ++
Sbjct: 84 TFTLQLGEIDSFVSQVRQAVASVPAFYAPLGFRGY-VMLNDDGSRAFFCVGLNQRVPAID 142
Query: 122 SIVQAVDKSAQEFKLPTYYEEPNFHASIA-WCL-----QDKTATLKPLLTKLDNIFTQFK 175
++ +D+ F P +YE + H S+A W + + L P L LD I +
Sbjct: 143 LLLDRIDECLAAFAKPAFYEARDIHVSLASWVIPPAQRNGQPPQLPPNL--LDAIQGILQ 200
Query: 176 LTSDESFHVVTHIHMKTGNKFYSFPL 201
DE + + +GNK + PL
Sbjct: 201 THLDELVIPGDTVTVHSGNKTFRLPL 226
>gi|255071519|ref|XP_002499434.1| predicted protein [Micromonas sp. RCC299]
gi|226514696|gb|ACO60692.1| predicted protein [Micromonas sp. RCC299]
Length = 315
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 81/202 (40%), Gaps = 43/202 (21%)
Query: 9 GRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSV--------------------- 47
GR+R+FPH ++AT VY+PL + + ++ + EL +V
Sbjct: 64 GRVRAFPHVDGNFATHVYVPLALSRSAQWSRARAELGNVLARIAARVPGLRPIGDYGKIG 123
Query: 48 ----GISVEVIPEP--HLSLSKTLVIPYHWIDTLVETLGNNLR-HLNRLTIK--FNSIEI 98
+S VIP+ H+SLS T I D L L L + R + ++++
Sbjct: 124 DAATAVSSFVIPDGDLHVSLSHTFPIRAARRDGLFAALRRALSASMTRAWVARVGPNLDV 183
Query: 99 FCNEEKTRSFIAL-------------GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNF 145
N +T +F+A+ G S AVD P ++++P
Sbjct: 184 LVNANRTTTFLAMRVADAELSPGTGAGDGDVGGSFAGAYSAVDAVLTRGGYPRFHDDPKP 243
Query: 146 HASIAWCLQDKTATLKPLLTKL 167
HAS+A+ D A L + +L
Sbjct: 244 HASVAFAPGDVEAELVEAIKEL 265
>gi|156054662|ref|XP_001593257.1| hypothetical protein SS1G_06179 [Sclerotinia sclerotiorum 1980]
gi|154703959|gb|EDO03698.1| hypothetical protein SS1G_06179 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 372
Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKEELNSVGISV 51
D+P+ HGGR R PH +W T +YI L + ++++ ++ + E+ + S
Sbjct: 75 DDPSLHGGRKRVIPHIEGNWPTHIYIEWYPSTIEFRLLSSLISKVVSLKRFEIQTFLTSD 134
Query: 52 EVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKTRSF 108
+P P H+SLS+ + D+ + + ++ + I F+ + N EKTR F
Sbjct: 135 LGVPLPLHVSLSRAIGFSKDVKDSFLNSFEQAIKSSGIRPFEIGFSGLAWVPNYEKTRWF 194
Query: 109 IALGANSCKT-SLTSIVQAVDKSAQEFKLPTYYEE 142
+ L N K+ +L ++ +K +EF P Y +
Sbjct: 195 LVLRLNIPKSNALNKLLHVSNKVVEEFGQPPLYAD 229
>gi|336276203|ref|XP_003352855.1| hypothetical protein SMAC_04969 [Sordaria macrospora k-hell]
gi|380092973|emb|CCC09210.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 365
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 101/234 (43%), Gaps = 47/234 (20%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKEELNSV-------- 47
D+P+ H GR R PH +W + VYI + + L L LK E S
Sbjct: 104 DDPSLHHGRQRQVPHIPGNWPSHVYIDWEPSSGDRELLTSLVDKLKAEAKSAAQRSPDLE 163
Query: 48 GISVEV--------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFN-S 95
G+ + + P H+SLS + + + D ++ L ++ ++ + F+
Sbjct: 164 GVEIHTALRDSDLPVNRPLHISLSAPITLTSNNKDDFLDDLTKAMKSCRVSPFVLDFSGG 223
Query: 96 IEIFCNEEKTRSFIALGANSCKTS------------LTSIVQAVDKSAQEFKLPTYYEEP 143
+ + +EE TRSF+ L ++S LT++++ +K +E+ P YE
Sbjct: 224 VNWYRSEESTRSFLVLRVREVQSSTGNTANSSPNPRLTTLLEKCNKIVKEYGQPPLYESK 283
Query: 144 N----FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTG 193
+ FH +IAW +A+LK L D++F + E +V ++TG
Sbjct: 284 DMGYRFHVTIAWTHARPSASLKQL---TDSVFDDCETMHSE--NVSIRDKLRTG 332
>gi|345566707|gb|EGX49649.1| hypothetical protein AOL_s00078g138 [Arthrobotrys oligospora ATCC
24927]
Length = 294
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 98/245 (40%), Gaps = 57/245 (23%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEP---- 57
D+P+ HGGR R+ PH +W T +Y + + EL VG ++V E
Sbjct: 58 DDPSAHGGRHRAIPHVEGNWPTHLYFE--------WHLTTLELGKVGRLLQVAAEAIGNY 109
Query: 58 -------------------------HLSLSKTLVIPYHWIDTLVETL--GNNLRHLNRLT 90
H+S+S+ +V+ D V L G +
Sbjct: 110 EKFHHTEEFKLESFLYSDLGTQLPLHISMSRPIVLRTEERDGFVAELEKGVEASAVQSFD 169
Query: 91 IKFNSIEIFCNEEKTRSFIALGANS-----C-KTSLTSIVQAVDKSAQEFKLPTYY---- 140
++F S+E N+++TR F + A S C +T L+S++ + + P Y
Sbjct: 170 VRFTSLEWVPNQDRTRWFWVMRATSPMLGQCRRTLLSSLLTTCNHVVKNHNQPELYLTDS 229
Query: 141 ---EEPNFHASIAWCLQDKTATL-KPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKF 196
FH SI W L + + L + +LT + + T+ SD + T + +K GN
Sbjct: 230 RGATREGFHVSIGWSLTEPSKDLCEGILTAISSNRTEL---SDLTMKCDT-LKVKIGNIV 285
Query: 197 YSFPL 201
++ L
Sbjct: 286 HAVEL 290
>gi|219125875|ref|XP_002183196.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405471|gb|EEC45414.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 300
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 86/219 (39%), Gaps = 31/219 (14%)
Query: 12 RSFPHQRNSWATLVYIPLQTNL----------ARLYAMLKEELNSVGIS-VEVIPEP--- 57
R+ PH+ +WA V++ + + L + L G S V V P P
Sbjct: 83 RTVPHRVGNWAGHVFLDVTSTLRNPSDPDDQIGAFAERCRRVLQEFGQSGVLVAPHPTAS 142
Query: 58 -HLSLSKTLVIPYHWIDTLVETL-------------GNNLRHLNRLTIKFNSIEIFCNEE 103
H+SL++ + ID+ V L G N H + I + + NEE
Sbjct: 143 FHVSLARPFYVHIACIDSFVRQLRIRLESRIADAFRGEN-NHTALIPIVSSKPVVLVNEE 201
Query: 104 KTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPL 163
KTRSF A S L +V+ VD+ ++L YY+ P FH S+A D + + L
Sbjct: 202 KTRSFFAWSVVS-NHILRELVKVVDEVMDLYRLSHYYDPPTFHVSVASFPGDLSTIRERL 260
Query: 164 LTKLDNIFTQFKLTSDESFHVVTHIHMKTG-NKFYSFPL 201
L K +H V +H G K + PL
Sbjct: 261 EDTLRLDSGCDKTRPGILYHRVASVHCTFGTTKAFQIPL 299
>gi|387169554|gb|AFJ66213.1| hypothetical protein 34G24.14 [Capsella rubella]
Length = 247
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 63/129 (48%), Gaps = 23/129 (17%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKE-ELNSVGISVEVI 54
E G R+R+FPH ++A VYIP + L R+ +++ + L + + ++
Sbjct: 69 ESGVRVRNFPHVDGNYALHVYIPVIIPPLPKKEIVCFLKRVASVVPQLHLVEADVPLSIL 128
Query: 55 ------------PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
E H+SL +++ + H I ++V L L+ R I FN E+F N+
Sbjct: 129 CKDDHKFERALGREFHISLGRSVPLRVHQISSVVSMLKQKLQFHKRYLIDFNKWEVFVND 188
Query: 103 EKTRSFIAL 111
+ TRSF++L
Sbjct: 189 DCTRSFLSL 197
>gi|405123064|gb|AFR97829.1| hypothetical protein CNAG_01624 [Cryptococcus neoformans var.
grubii H99]
Length = 228
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 15/155 (9%)
Query: 11 IRSFPHQRNSWATLVYIPLQTN------LARLYAMLKEELNSVGISVEVIPEPHLSLSKT 64
+RS P+ + T VY+ L + L + A L N + ++P H+SL++
Sbjct: 1 MRSRPYVDGEYNTHVYLSLSISWKLSKILETIIAQLPPSPNPIH---SLLPNLHISLTRP 57
Query: 65 LVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNE----EKTRSFIALGANSCKTS 119
+ + H I + L + L + + S++ + NE R+F+AL +
Sbjct: 58 VPLRRHQIQPFRDELASRLGQICPFKLSLVGSVKAYYNEVTGGGSNRAFLALRVGAGARE 117
Query: 120 LTSIVQAV-DKSAQEFKLPTYYEEPNFHASIAWCL 153
L IV V D + ++ PTY++ P FH S AW L
Sbjct: 118 LKKIVDVVLDPTLKKIHRPTYHDNPEFHTSFAWTL 152
>gi|189203521|ref|XP_001938096.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985195|gb|EDU50683.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 335
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/259 (23%), Positives = 108/259 (41%), Gaps = 63/259 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY---IPLQTNLARLYAMLKEELNSVGIS----VEVI 54
DNP+ HGGR R+ PH +W + VY IP QT L+ +++ +S+ + ++ +
Sbjct: 67 DNPSLHGGRKRAVPHVEGNWPSHVYLEWIPTQTESDSLHRLIQHVRDSLELQNAKRLKKL 126
Query: 55 PEP----------------HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSI 96
P P H+SLS+TL I +T +ETL +LR + +F ++
Sbjct: 127 PIPDIVSSLQSELGAPLPLHVSLSRTLQIKAEDRETFLETLRLSLRRSTVCSFVFEFRNL 186
Query: 97 EIFCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYY---------EEPN-- 144
+ N ++ R F+ L L ++ A +++A+ P Y E+ +
Sbjct: 187 KWVPNFDRNRWFLVLSIKRPVNDELNHLLHACNEAARSSGHPGLYTGTEGDGPMEDHDSN 246
Query: 145 ---------------------FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFH 183
FH +IAW L++ L+ +LD T F + + F
Sbjct: 247 DSLKRRKVDKNERRRMDFSKYFHVTIAWNLEEPDTEWTGLVEQLDT--TTFIQSPEAEFD 304
Query: 184 VVTHIHMKTGNKFYSFPLT 202
V ++ G+ ++ L+
Sbjct: 305 TVK---VRIGSAVHNIALS 320
>gi|396472300|ref|XP_003839073.1| hypothetical protein LEMA_P027460.1 [Leptosphaeria maculans JN3]
gi|312215642|emb|CBX95594.1| hypothetical protein LEMA_P027460.1 [Leptosphaeria maculans JN3]
Length = 330
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 96/227 (42%), Gaps = 55/227 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY---IPLQTNLARLYAMLKEELNSVGIS-------- 50
DNP+ HGGR R+ PH + +W + VY IP QT L+ ++K +S+ +
Sbjct: 65 DNPSLHGGRKRAVPHVQGNWPSHVYLEWIPSQTEAYGLHKLIKHVKDSLELQNATRAKQL 124
Query: 51 --VEVIP-------EP---HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTI--KFNSI 96
E+IP P H+SLS+TL I +E+L +++R+ T F+ +
Sbjct: 125 PIPEIIPSLQSDLASPLPLHISLSRTLQIKTEDRSVFLESLEHSIRNSAVRTFHYAFHGL 184
Query: 97 EIFCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEF--------------------- 134
+ N +++R F+ L + L ++ A +++AQ+
Sbjct: 185 KWVPNFDRSRWFLVLNITRPPQDELNRLLNACNQAAQKHGHAGLYTGGEGDGPMEASETS 244
Query: 135 ------KLPTYYEEPN-FHASIAWCLQDKTATLKPLLTKLD-NIFTQ 173
K PT + FH SIAW L + + +D FTQ
Sbjct: 245 PESTKRKAPTLGDRSAFFHVSIAWNLSEPAPEWISWVKNIDVGAFTQ 291
>gi|63054558|ref|NP_593641.2| conserved eukaryotic protein [Schizosaccharomyces pombe 972h-]
gi|24638477|sp|O13915.2|USB1_SCHPO RecName: Full=Putative U6 snRNA phosphodiesterase
gi|21535774|emb|CAB11163.2| conserved eukaryotic protein [Schizosaccharomyces pombe]
Length = 265
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 89/221 (40%), Gaps = 23/221 (10%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVG------ISVEVI 54
D+P H GRIR H W Y+ + + ++ ++E LNS S +
Sbjct: 46 FDSPEFHEGRIRGQKHIEGLWFVQTYLEVDLS-KKVKKGIREFLNSQSRFQSLLCSEHNV 104
Query: 55 PEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF--NSIEIFCNEEKTRSFIAL 111
P HLS+S+ I Y + LV +LN T+KF + + N+EKTR F+A
Sbjct: 105 PRRLHLSISENYRINYSTKNQLVHKWEQYTNNLNYRTLKFRLGKMCLLFNDEKTRMFLAF 164
Query: 112 GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA-----------WCLQDKTATL 160
+ ++ +EF E+ H S A W QD+ +
Sbjct: 165 ECKFSDENYKDLISHASDCMKEFTNRNLREDFLLHISFASSLTNEDEYQNWVSQDRESHF 224
Query: 161 KPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFYSFPL 201
+ ++ N Q K ESF +V + + G+ ++FP
Sbjct: 225 FKTMNEIINTKIQ-KDQFSESF-IVDSLKLSIGHLIFTFPF 263
>gi|134109975|ref|XP_776373.1| hypothetical protein CNBC5890 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259047|gb|EAL21726.1| hypothetical protein CNBC5890 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 236
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 6/107 (5%)
Query: 53 VIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNE----EKTRS 107
++P H+SL++ + + H I + L + L + + S++ + NE R+
Sbjct: 54 LLPNLHISLTRPVPLRRHQIQPFKDELASRLGQICTFKLSLIGSVKAYYNEVTGGGSNRA 113
Query: 108 FIALGANSCKTSLTSIVQAV-DKSAQEFKLPTYYEEPNFHASIAWCL 153
F+AL + + L IV V D + ++ LPTY++ P FH S AW L
Sbjct: 114 FLALRVGAGVSELKKIVDVVLDPTLKKIHLPTYHDNPEFHTSFAWTL 160
>gi|440292125|gb|ELP85367.1| hypothetical protein EIN_086390 [Entamoeba invadens IP1]
Length = 225
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 81/194 (41%), Gaps = 18/194 (9%)
Query: 10 RIRSFPHQRNSWATLVYIPLQTNLAR--LYAMLKEELNSVGISVEVIPEPHLSLSKTLVI 67
RIR P + + + V I L T++ L L ++L G +I PH+++S+ +
Sbjct: 44 RIRQRPFRIGDFPSSVQIDLPTDVCAVVLNPDLTKQLE--GTFFTLIKSPHITISREFSL 101
Query: 68 PYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAV 127
H I V TL +R + + N E F+++ A S S + ++ V
Sbjct: 102 RDHQISVFVRTLRRLVRSIPEFQVSLTDFVYLKNPENNTEFLSVVAVS--ESFSKLLDVV 159
Query: 128 DKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTH 187
D F Y EE HAS ++ P +TKLD F+ F +VT
Sbjct: 160 DSVLLAFCKEKYNEERVVHASFFSRIKSLDIP-PPSITKLD--FSPF---------IVTQ 207
Query: 188 IHMKTGNKFYSFPL 201
I K G+ YS PL
Sbjct: 208 IRFKVGDVIYSLPL 221
>gi|84999142|ref|XP_954292.1| hypothetical protein [Theileria annulata]
gi|65305290|emb|CAI73615.1| hypothetical protein TA20665 [Theileria annulata]
Length = 281
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 37/182 (20%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQT----------------NLARLYAMLKEE------ 43
+H +R+ PH ++ TL YI ++ NL +L +L E
Sbjct: 3 DHNSNVRNVPHVDGNYHTLCYIKVEISKEISSIAKKAYNTLFNLEKLNNVLFSEQPTQAD 62
Query: 44 -----------LNSVGISVEVIPE-PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTI 91
L+S S + P HLSL K L + +ID+ +E L N L++L +
Sbjct: 63 SDYTHSDIDMDLDSQTNSTRLSPSYAHLSLCKPLYLRRQFIDSFLEKLKNTLQNLKPFYL 122
Query: 92 KF-NSIEIFCNEEKTRSF-IALGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHAS 148
N + I NE R F ++ +C+ ++ I+ VD+ + F YYE+ H S
Sbjct: 123 ILENRVSICANENLNRYFAVSFVDKACRDDTVLPIIDRVDQVVESFGFEKYYEQRKPHVS 182
Query: 149 IA 150
A
Sbjct: 183 FA 184
>gi|330915388|ref|XP_003297010.1| hypothetical protein PTT_07278 [Pyrenophora teres f. teres 0-1]
gi|311330543|gb|EFQ94890.1| hypothetical protein PTT_07278 [Pyrenophora teres f. teres 0-1]
Length = 337
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 63/259 (24%), Positives = 104/259 (40%), Gaps = 63/259 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVY---IPLQTNLARLYAMLK------EELNSVGISVE 52
DNP+ HGGR R+ PH +W + VY IP Q+ L+ +++ E N+ +
Sbjct: 70 DNPSLHGGRKRAVPHIEGNWPSHVYLEWIPAQSESDALHRLIQHVRHSLELQNAKRLKKL 129
Query: 53 VIPE--P------------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN--SI 96
IP+ P H+SLS+TL I + +ETL +LR T F ++
Sbjct: 130 AIPDIVPSLQSELGAPLPLHVSLSRTLQIKTEDREAFLETLRLSLRRSTVCTFSFGFRNL 189
Query: 97 EIFCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYY--------------- 140
+ N E+ R F+ L L ++ A +++A+ P Y
Sbjct: 190 KWVPNFERNRWFLVLSIERPANDELNHLLHACNEAARSSGHPGLYTGTEGDGPMEDHESN 249
Query: 141 ----------EEPN-------FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFH 183
EP FH +IAW L++ L+ +LD T F + + F
Sbjct: 250 DSLKRRKVDKNEPRRLDFSKYFHVTIAWNLEEPDTEWTKLVEQLDT--TTFIQSPEAEFD 307
Query: 184 VVTHIHMKTGNKFYSFPLT 202
V ++ G+ ++ L+
Sbjct: 308 TVK---VRIGSAVHNIALS 323
>gi|358390355|gb|EHK39761.1| hypothetical protein TRIATDRAFT_168536, partial [Trichoderma
atroviride IMI 206040]
Length = 304
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 104/236 (44%), Gaps = 39/236 (16%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PL---QTNLARLYAMLKEELNSVGI----- 49
+D+P H GR R PH +W + +Y+ PL Q L +L + + E++ + +
Sbjct: 72 VDDPALHQGRTRLIPHVVGNWPSHLYVEWHPLAKQQEKLIQLISRVNEDIGQIKLHSFMT 131
Query: 50 SVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGN--NLRHLNRLTIKFNSIEIFCNEEKTR 106
S P P H+SLS+ + + D ++ + + N ++++ ++ + F + + R
Sbjct: 132 SDLGTPLPLHISLSRPISLTTSNKDQFLDQISSAINTSNVHQFSVSPKRLIWFKSPDSNR 191
Query: 107 SFI------ALGANSCKT----SLTSIVQAVDKSAQEFKLPTYYEEPN-------FHASI 149
+F+ +L A S T L +++ + + F P Y+ P FH SI
Sbjct: 192 TFLVLQVAGSLAAESATTLSNPELMRLLKTCNDTVGSFGQPVLYQGPGKDAADEAFHISI 251
Query: 150 AWC--LQDKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSFPLT 202
W LQ A+ K L +F + S +S+ + V + +K GN PL+
Sbjct: 252 GWALNLQVDEASNKAL-----KVFDDAEFQSLKSWRIQVPGVKVKIGNVVSHLPLS 302
>gi|321253355|ref|XP_003192709.1| hypothetical protein CGB_C2180C [Cryptococcus gattii WM276]
gi|317459178|gb|ADV20922.1| hypothetical protein CNBC5890 [Cryptococcus gattii WM276]
Length = 231
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 53/107 (49%), Gaps = 6/107 (5%)
Query: 53 VIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNE----EKTRS 107
++P H+SL++ + + H I + L + L + + ++ + NE R+
Sbjct: 49 LLPNLHISLTRPVPLRRHQIQPFRDELASRLGQICAFKLSLVGAVNSYYNEVTGGGSNRA 108
Query: 108 FIALGANSCKTSLTSIVQAV-DKSAQEFKLPTYYEEPNFHASIAWCL 153
F+AL + + L IV V D + ++ LPTY++ P FH S AW L
Sbjct: 109 FLALRVGAGASELKKIVDGVLDPTLKKLHLPTYHDNPEFHTSFAWTL 155
>gi|403220816|dbj|BAM38949.1| conserved hypothetical protein [Theileria orientalis strain
Shintoku]
Length = 258
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 71/153 (46%), Gaps = 14/153 (9%)
Query: 12 RSFPHQRNSWATLVYI----------PLQTNLARLYAMLKEELNSVGISVEVIPEP-HLS 60
R+ PH ++ T+ Y+ P Q + Y+ + + +S +V P H+S
Sbjct: 9 RNVPHVDGNYHTICYLKVNNVLFCEQPTQADSDYTYSDTEMDTDSQTDRKKVNPSHLHVS 68
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKF-NSIEIFCNEEKTRSF-IALGANSCK- 117
L K L + +ID+ +E L +L+HL + N + I +E + R F ++L +C+
Sbjct: 69 LCKPLYLRRQFIDSFLENLRGSLQHLKPFYLMLENKVSICASENQNRYFAVSLVDKACRD 128
Query: 118 TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
++ I+ VDK F YY + H S+A
Sbjct: 129 NTILPIIDLVDKVVSSFGFEKYYSQRKPHVSLA 161
>gi|408396640|gb|EKJ75795.1| hypothetical protein FPSE_03975 [Fusarium pseudograminearum CS3096]
Length = 306
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 104/237 (43%), Gaps = 51/237 (21%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKEELNSVG------ 48
+D+P+ H GR R PH +W + VY + L L A ++E+++S
Sbjct: 64 VDDPSLHQGRKRHIPHVVGNWPSHVYTEWHPSTEQHGLLTTLIADIEEQVSSDTKLFNFL 123
Query: 49 ISVEVIPEP-HLSLSKTLVIPY----HWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEE 103
S P P H+SLS+ L + ++D + E+ N + T+K S+ F + +
Sbjct: 124 TSDLGSPLPLHISLSRPLSLTTSAKDEFLDKITESF--NSSGIAPFTVKPQSLAWFRSPD 181
Query: 104 KTRSFIALGANSCKTS------LTSIVQAVDKSAQEFKLPTYY----EEP---NFHASIA 150
R+F+ L S + LTS++ + A +F LP+ Y +EP FH SI
Sbjct: 182 SDRTFLILRVASGPDTKPLNPELTSLLLRSNSVAAQFGLPSLYARSPDEPVGGAFHVSIG 241
Query: 151 WC--LQDKTATLKPLL----TKLDNI--------FTQFKLTSDESFHVVTHIHMKTG 193
W L +LK L +K D+I + K+ +VV HI +KTG
Sbjct: 242 WTFHLPGDDMSLKTLRLFKQSKFDDIRKLEISVPGVKVKIG-----NVVNHIALKTG 293
>gi|171695014|ref|XP_001912431.1| hypothetical protein [Podospora anserina S mat+]
gi|170947749|emb|CAP59912.1| unnamed protein product [Podospora anserina S mat+]
Length = 312
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 33/230 (14%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI------PLQTNLARLYAMLKEELNSVGISVEV-- 53
D+P+ H GR R PH +W + VYI + L+ L ++ ++ + VE+
Sbjct: 68 DDPSLHQGRTRQTPHIPGNWPSHVYIEWHPPPEFKRMLSGLIYSVRSQIRKIDPEVEITS 127
Query: 54 -------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
+P P H+SLS+ L + D + L L I+ +E E
Sbjct: 128 FLESDLGVPLPLHISLSRPLNLSTFQKDQFLSDLQKILAGDQPFEIRLGRVEWHFTSESG 187
Query: 106 RSFIALGA--NSCKTSLTSIVQAVDKSAQEFKLPTYYE----------EPNFHASIAWCL 153
R+F+ L S L ++ +++ A + P Y FH SIAWCL
Sbjct: 188 RAFLVLRVVCPSRNNELVYLLSKINELANIYGQPQLYSWASAAEGKDVADAFHFSIAWCL 247
Query: 154 QDKTATLKPLLTKLDNIFTQFKLTSDESFHVVT--HIHMKTGNKFYSFPL 201
+ L+ +TK +F + ++ + +T I +K GN + PL
Sbjct: 248 GKPSDHLE-RITK--EVFAKPEIRTVIGMGKLTIHSIKVKIGNVVTNIPL 294
>gi|367052339|ref|XP_003656548.1| hypothetical protein THITE_2021557, partial [Thielavia terrestris
NRRL 8126]
gi|347003813|gb|AEO70212.1| hypothetical protein THITE_2021557, partial [Thielavia terrestris
NRRL 8126]
Length = 260
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 45/235 (19%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI------PLQTNLARLYAMLKEELNSVGISVEV-- 53
D+P+ H GR R PH +W + +YI + + L L + L+ + ++ +V+V
Sbjct: 22 DDPSLHQGRTRQNPHVPGNWPSHIYIEWHPPSAVHSLLVELVSSLQAQARTLSSAVQVTS 81
Query: 54 -------IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEE 103
P+P H+SLS+ +V+ DT + +R + T+ +++E E
Sbjct: 82 LLLSDLAAPQPLHISLSRPIVLSSAQKDTFSAEVEAAIRASGIAAFTLACSTVEWHRTAE 141
Query: 104 KTRSFIALGANSCKTS------------LTSIVQAVDKSAQEFKLPTYYEEPN------- 144
RSF+ L + + + LT +++ + ++ P Y
Sbjct: 142 SGRSFLVLRVHGTQGTREGDTEDNPNPELTELLRRCNAVVAKYGQPGLYRWAEGDDVSRV 201
Query: 145 ---FHASIAWCLQDKTATLKPLLTKLDNIFTQ--FKLTSDESFHVVTHIHMKTGN 194
FH SIAW + T LK ++T + +F Q K E V I +K GN
Sbjct: 202 WKAFHVSIAWSFAEPTEELK-MVT--ERVFGQTTAKGRIQEVRVPVEGIKVKIGN 253
>gi|353241150|emb|CCA72983.1| hypothetical protein PIIN_06938 [Piriformospora indica DSM 11827]
Length = 191
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 14/107 (13%)
Query: 56 EPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLT-----IKFN--SIEIFCNEEKTRSF 108
E HLSLS+ + + H D RH+ RL KF+ S I N+EKTRSF
Sbjct: 44 ELHLSLSRPIYLREHQRDMA-------RRHVKRLADTSNAFKFSLASFTILTNDEKTRSF 96
Query: 109 IALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
+A+ + + ++ E + P YY +H S AW L +
Sbjct: 97 LAVEVGAGHAEFEAFSNGLNPLLTELRQPLYYSSARYHISFAWILSE 143
>gi|255945507|ref|XP_002563521.1| Pc20g10290 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588256|emb|CAP86358.1| Pc20g10290 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 295
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 83/210 (39%), Gaps = 42/210 (20%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPE- 56
DNP+ HGGR R PH +W T +Y+ P + L+ L ++ + N V+
Sbjct: 55 QDNPSLHGGRKRVIPHVEGNWPTHLYLEWYPGKDELSLLADVISQSGNVPDEKAHVMHSL 114
Query: 57 ---------P-HLSLSKTLVIPYHWIDTLVETLGNNL--RHLNRLTIKFNSIEIFCNEEK 104
P H+SLS+ +V+ + E L + H++ ++ +S+ N EK
Sbjct: 115 LHSDLGAQLPLHISLSRPVVLRTEQRASFTEALQKAIDDAHVSSFHVQPDSLYWSPNYEK 174
Query: 105 TRSFIALGAN-SCKTSLTSIVQAVDKSAQEFKLPTYYEEPN------------------- 144
TR F+ LG L +++ + + F P Y +
Sbjct: 175 TRWFLVLGVQRPSNDGLNRLLKLSNDTLARFGQPPLYATSSTHEQHTSVSLRDRSSSLTG 234
Query: 145 ------FHASIAWCLQDKTATLKPLLTKLD 168
FH S+AWCL + + + + +D
Sbjct: 235 EDFSKCFHISLAWCLSEPSPKERERVAGID 264
>gi|452979210|gb|EME78972.1| hypothetical protein MYCFIDRAFT_120758, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 246
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 75/180 (41%), Gaps = 31/180 (17%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEEL---------------- 44
D+P H GR R PH +W T VY+ + + + Y++L E L
Sbjct: 25 QDDPALHAGRKRVTPHVVGNWPTHVYLDW-SPMPKEYSLLSEILEEIRKTSGEEHVQSLL 83
Query: 45 -NSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFC--N 101
N +G+ + + H+SLS+ L + D+ V L + + + +E+ N
Sbjct: 84 ENDLGVQLPL----HVSLSRPLALKTEQKDSFVSRLETAIEEASVRAFEVQPLELVWHPN 139
Query: 102 EEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYEE------PNFHASIAWCLQ 154
E +TR F+ L L ++++ + A+ P Y E FH SIAW L+
Sbjct: 140 EHRTRWFLVLRLQRPAGDELKNLLKTCNALAKSLDQPLLYAECHNEASDAFHISIAWSLK 199
>gi|358372673|dbj|GAA89275.1| hypothetical protein AKAW_07389 [Aspergillus kawachii IFO 4308]
Length = 303
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 74/157 (47%), Gaps = 17/157 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARL---YAMLKEELNSVGISVEVIP 55
D+P+ HGGR R PH +W T +Y+ P + L+ L A +K+ L+ I + +
Sbjct: 57 DDPSLHGGRKRVIPHVEGNWPTHIYLEWYPSKDELSILSDVIAQIKDRLDGSTIQLHSLL 116
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKT 105
H+SLS+ +V+ + ++ + L+ + + NS++ N EKT
Sbjct: 117 HSDLGAQLPLHISLSRPVVLRTEQRQSFLDMFQSQLKESQIPTFHVSTNSLDCVSNYEKT 176
Query: 106 RSFIALGANSC-KTSLTSIVQAVDKSAQEFKLPTYYE 141
R F L A + +L +++ +++ F+ P YE
Sbjct: 177 RWFYVLRAEKPEEDNLNRLLRLSNRALARFEQPPLYE 213
>gi|307190443|gb|EFN74479.1| UPF0406 protein CG16790 [Camponotus floridanus]
Length = 61
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 20/37 (54%), Positives = 29/37 (78%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLY 37
+DNP++H GRIRSF H+R +WATLVYI + ++ +Y
Sbjct: 4 IDNPSDHDGRIRSFKHERGNWATLVYIDCKYFISYIY 40
>gi|440640085|gb|ELR10004.1| hypothetical protein GMDG_00762 [Geomyces destructans 20631-21]
Length = 343
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/257 (23%), Positives = 99/257 (38%), Gaps = 62/257 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLA----------------RLYAMLKEELN 45
D+P+ HGGR R+ PH + +W T +YI N A R +L+
Sbjct: 81 DDPSLHGGRKRTTPHVQGNWPTHLYIEWYPNPAEHKLLTSLVDSFRGQSRYGTQYDGDLD 140
Query: 46 SVGISVEVIPEP---HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFC 100
+ + P H+SLS+ + D + ++ + + ++ I FN +
Sbjct: 141 VQSLLTSDLGSPLPLHISLSRPIGFSTATKDAFLSSIQTAISNSDIHPFPITFNGCDWAS 200
Query: 101 NEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYE-----EPN---------- 144
N E TR F+AL + + +L +++ + + Q P Y +PN
Sbjct: 201 NYEGTRWFLALRVSKPLQDNLNKLLRVCNGTVQSHGQPPLYASAVSPDPNTTGQSKLPPQ 260
Query: 145 ---------------FHASIAWCL--QDKTAT--LKPLLTKLDNIFTQFKLTSDESFHVV 185
FH SIAW L KTA P ++++ KL S + V
Sbjct: 261 PPSSSPSPKKDLSSSFHISIAWALTVPPKTAGSWQPPAASEIE------KLNSASTPIVA 314
Query: 186 THIHMKTGNKFYSFPLT 202
+ I K GN S PL+
Sbjct: 315 SEIKAKIGNVVTSIPLS 331
>gi|296805955|ref|XP_002843797.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845099|gb|EEQ34761.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 316
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 95/243 (39%), Gaps = 57/243 (23%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSV--GISVEVIP 55
D+P+ H GR R+ PH +W T +Y+ P + L L ++K S G+S+ +
Sbjct: 70 QDDPSLHNGRTRAVPHVVGNWPTHIYLEWYPNNSELGVLSEVIKRCERSALDGMSIHSLL 129
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRS 107
H+SLS+ +V+ ++ + NL H + K ++ N E TR
Sbjct: 130 YSDLGAQLPLHISLSRPVVLLTEQRESFL-----NLFHESIYKSKIQPLDWVSNFEHTRW 184
Query: 108 FIALGANSCKT-SLTSIVQAVDKSAQEFKLPTYYEEPN---------------------- 144
F+ L N K L +++ + + F P YE P
Sbjct: 185 FLVLRLNCPKNDGLNRLLRLSNTTLAHFSQPPLYENPTHRQKRKHGKQVGHQHDISPYSG 244
Query: 145 ---------FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNK 195
FH SIAW L++ A K L +D F KL ++ +K GN+
Sbjct: 245 PGDTDYTDCFHISIAWTLKEPGARDKERLLSID--FQPHKLN-----FKFNNVKVKVGNQ 297
Query: 196 FYS 198
+S
Sbjct: 298 IHS 300
>gi|317139668|ref|XP_003189190.1| hypothetical protein AOR_1_1108174 [Aspergillus oryzae RIB40]
Length = 302
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 101/251 (40%), Gaps = 63/251 (25%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAML----------KEELNSVG 48
D+P+ HGGR R+ PH +W T +Y+ P + L+ L ++ K +LNS+
Sbjct: 61 DDPSLHGGRKRAIPHVEGNWPTHIYLEWYPSKGELSTLSNVIAQIEGKLRKSKVKLNSLL 120
Query: 49 ISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFC--NEEKT 105
S P H+SLS+ +V+ + ++T + + N + ++C N EKT
Sbjct: 121 RSDLGAQLPLHISLSRPVVLRTEQRQSFLDTFKSAIEDSNIRAFNATTEGLYCVSNHEKT 180
Query: 106 RSFIALGANSCKT-SLTSIVQAVDKSAQEFKLPTYYE-EPN------------------- 144
R F L + +L +++ ++S F P YE P+
Sbjct: 181 RWFYVLRVKKPENDALNRLLKLSNRSLAFFNQPPLYEASPHILTADAGLSSPMKWQRGGS 240
Query: 145 ------FHASIAWCL-------QDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMK 191
FH S+AW L +D+ A +K L +L ++ F + K
Sbjct: 241 ADYSHCFHISLAWSLTEPSPDERDQIANIK--LRELSDLSVYFDC-----------VKAK 287
Query: 192 TGNKFYSFPLT 202
GN S PL
Sbjct: 288 IGNNITSMPLA 298
>gi|402223077|gb|EJU03142.1| hypothetical protein DACRYDRAFT_115389 [Dacryopinax sp. DJM-731
SS1]
Length = 325
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/181 (20%), Positives = 74/181 (40%), Gaps = 28/181 (15%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---------LAR-------LYAMLKEEL 44
+D P++H GR R+ P + + VYIPL+ + R L+++L
Sbjct: 45 VDEPHKHQGRTRTTPFVQGQYCAHVYIPLEVGELEEGFRYLMKRAKGLCPTLHSLLDPPA 104
Query: 45 NSVGISVE------------VIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIK 92
S+E I E H+SL++ + + + ++ + L +
Sbjct: 105 PPPTDSLEPSNPLESQDTDSAIQELHISLTRPIFLRSSERAAFLASVRSLLSSHKSFELS 164
Query: 93 FNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWC 152
++ N+ +TR F+ + L ++ +++ + + TYY EP +H S+ W
Sbjct: 165 LATLAGLENDHRTRGFLVAEVGAGFAELQALTTSLEPAITALRQETYYSEPRYHVSLGWA 224
Query: 153 L 153
L
Sbjct: 225 L 225
>gi|49387876|dbj|BAD26563.1| unknown protein [Oryza sativa Japonica Group]
Length = 116
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 6/113 (5%)
Query: 91 IKFNSIEIFCNEEKTRSFIALGANSCKTSLTSI---VQAVDKSAQEFKLPTYYEEPNFHA 147
+ FN E F N++ TRSF++L S T L I + VD + LP +Y+ P H
Sbjct: 1 MDFNKWEHFVNDDCTRSFLSLEVTS--TGLPEISKQITMVDDVYRLHGLPEFYKNPRPHI 58
Query: 148 SIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSF 199
S+AW L D + LK + +++ + + + +H+ K G K Y
Sbjct: 59 SLAWALGDVSCKLKQAIKEIEKSQSSLGTSQISNLRCKFSHVVCKIGKKVYDI 111
>gi|134083621|emb|CAL00536.1| unnamed protein product [Aspergillus niger]
Length = 285
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 76/168 (45%), Gaps = 20/168 (11%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE---ELNSVGISVEVIP 55
D+P+ HGGR R PH +W T +Y+ P + L+ L ++ + L+ I + +
Sbjct: 57 DDPSLHGGRKRVIPHVEGNWPTHIYLEWYPSKDELSILSDVIDQINDRLDGSTIQLHSLL 116
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLRHLN-----RLTIKFNSIEIFCNE 102
H+SLS+ +V+ + +E + L+ R + NS++ N
Sbjct: 117 HSDLGAQLPLHISLSRPVVLRTEQRQSFLELFQSQLKESQIPAYARFNVTTNSLDCVSNY 176
Query: 103 EKTRSFIALGANSC-KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASI 149
EKTR F L A + +L +++ +++ F+ P YE A++
Sbjct: 177 EKTRWFYVLRAEKPEENALNRLLRLSNRALARFEQPPLYESSQDVAAV 224
>gi|350633134|gb|EHA21500.1| hypothetical protein ASPNIDRAFT_45562 [Aspergillus niger ATCC 1015]
Length = 300
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 76/165 (46%), Gaps = 17/165 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE---ELNSVGISVEVIP 55
D+P+ HGGR R PH +W T +Y+ P + L+ L ++ + L+ I + +
Sbjct: 57 DDPSLHGGRKRVIPHVEGNWPTHIYLEWYPSKDELSILSDVIDQINDRLDGSTIQLHSLL 116
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKT 105
H+SLS+ +V+ + +E + L+ + + NS++ N EKT
Sbjct: 117 HSDLGAQLPLHISLSRPVVLRTEQRQSFLELFQSQLKESQIPAFNVTTNSLDCVSNYEKT 176
Query: 106 RSFIALGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASI 149
R F L A + +L +++ +++ F+ P YE A++
Sbjct: 177 RWFYVLRAEKPEGNALNRLLRLSNRALARFEQPPLYESSQDVAAV 221
>gi|317036804|ref|XP_001398063.2| hypothetical protein ANI_1_1976144 [Aspergillus niger CBS 513.88]
Length = 300
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 76/165 (46%), Gaps = 17/165 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLK---EELNSVGISVEVIP 55
D+P+ HGGR R PH +W T +Y+ P + L+ L ++ + L+ I + +
Sbjct: 57 DDPSLHGGRKRVIPHVEGNWPTHIYLEWYPSKDELSILSDVIDQINDRLDGSTIQLHSLL 116
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKT 105
H+SLS+ +V+ + +E + L+ + + NS++ N EKT
Sbjct: 117 HSDLGAQLPLHISLSRPVVLRTEQRQSFLELFQSQLKESQIPAFNVTTNSLDCVSNYEKT 176
Query: 106 RSFIALGANSC-KTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASI 149
R F L A + +L +++ +++ F+ P YE A++
Sbjct: 177 RWFYVLRAEKPEENALNRLLRLSNRALARFEQPPLYESSQDVAAV 221
>gi|169603271|ref|XP_001795057.1| hypothetical protein SNOG_04643 [Phaeosphaeria nodorum SN15]
gi|111067283|gb|EAT88403.1| hypothetical protein SNOG_04643 [Phaeosphaeria nodorum SN15]
Length = 336
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 84/225 (37%), Gaps = 58/225 (25%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP- 57
DNP HGGR R+ PH +W + VY+ P Q L+ +++ +++ + P+
Sbjct: 68 DNPELHGGRKRAMPHVEGNWPSHVYLEWNPSQAESDNLHGLIQHVKDAIDAQNKERPKKT 127
Query: 58 -------------------HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSI 96
H+SLS+TL I D ++ L + LR + +F +
Sbjct: 128 PVPKIIPSLQSELGAPLPLHVSLSRTLQIKTEDRDVFLDALRSRLRRASVAPFQFQFRGL 187
Query: 97 EIFCNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYY--------------- 140
+ N ++ R F+ L L ++ A +K+ + P Y
Sbjct: 188 KWVPNFQRNRWFLVLSIEKPANNELNRLLDACNKATRHCGHPGLYVGGHGDGPMESSNEN 247
Query: 141 -----------EEPN------FHASIAWCLQDKTATLKPLLTKLD 168
EE + FH SIAW L++ L+ +D
Sbjct: 248 TGNKRRKGQDTEEESVDRSDRFHVSIAWNLEEPDPEWTALIKNID 292
>gi|380494935|emb|CCF32778.1| hypothetical protein CH063_05100 [Colletotrichum higginsianum]
Length = 335
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 95/246 (38%), Gaps = 60/246 (24%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQT-------------------NLARLYAMLKE 42
D+PN H GR R+ PH +W + VYI N +LY +L
Sbjct: 88 DDPNLHQGRKRTIPHIAGNWPSHVYIEWHPTADQHSLLVALLDNIKPILNSQKLYPLLTS 147
Query: 43 ELNSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHL-NRLTIKFNSIEIFC 100
+LN+ P P H+SLS+ L +P D + L ++L I I F
Sbjct: 148 DLNA--------PLPLHISLSRPLSLPTAQKDHFLSALTHSLSAATGGFHISPRGIGFFK 199
Query: 101 NEEKTRSFI-------ALGANSCKTS-----LTSIVQAVDKSAQEFKLPTYYE------- 141
+ + R+F+ A+ ++ TS L +++ + A P Y+
Sbjct: 200 SPDSDRAFLVLRVADSAVSQDTTNTSGKNPQLQALLTKCNAVALRLNHPPLYQTDATELV 259
Query: 142 EPNFHASIAWCL----QDKTATLKPLLTKLD-NIFTQFKLTSDESFHVVTHIHMKTGNKF 196
+ FH SI W +D + LL + D Q+++ V+ + +K GN
Sbjct: 260 DDAFHMSIGWTFGLPPEDACLRTRALLKQPDFRDVGQWEIG-------VSGVKIKIGNII 312
Query: 197 YSFPLT 202
PLT
Sbjct: 313 THVPLT 318
>gi|145334795|ref|NP_001078743.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008664|gb|AED96047.1| uncharacterized protein [Arabidopsis thaliana]
Length = 238
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 60/129 (46%), Gaps = 23/129 (17%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPL-------------QTNLARLYAMLKEELNSVGISV- 51
E G R+R+FPH ++A VY+P+ +A + L V +S+
Sbjct: 61 EPGVRVRNFPHVDGNYALHVYVPVCIPPLPKKEIVCFLKKVASVVPHLHLVEADVPLSIL 120
Query: 52 ---------EVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
+ E H+SL + + + H I++++ L L+ R I FN E+F N+
Sbjct: 121 CKDDQKFERALGREFHISLGRNVPLRVHQINSVISMLRQKLQLQKRYLIDFNKWEVFVND 180
Query: 103 EKTRSFIAL 111
+ TRSF++L
Sbjct: 181 DHTRSFLSL 189
>gi|406700801|gb|EKD03964.1| hypothetical protein A1Q2_01734 [Trichosporon asahii var. asahii
CBS 8904]
Length = 201
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 55 PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN-SIEIFCNEEKT---RSFIA 110
P H+SLS L + T + LR + ++ +++ N K R+F+A
Sbjct: 52 PALHISLSHPLPLRRPLSQTFPGLVSRILRDMPAFSVGLAWPPKVYNNAPKNGPKRAFLA 111
Query: 111 LGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
L ++ L ++++ VD + LPTY+E P FH S AW L D
Sbjct: 112 LRTSAGTRELGTLLEKVDGLLRREHLPTYHENPEFHTSFAWWLDD 156
>gi|401882154|gb|EJT46426.1| hypothetical protein A1Q1_04975 [Trichosporon asahii var. asahii
CBS 2479]
Length = 201
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 55 PEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN-SIEIFCNEEKT---RSFIA 110
P H+SLS L + T + LR + ++ +++ N K R+F+A
Sbjct: 52 PALHISLSHPLPLRRPLSQTFPGLVSRILRDMPAFSVGLAWPPKVYNNAPKNGPKRAFLA 111
Query: 111 LGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQD 155
L ++ L ++++ VD + LPTY+E P FH S AW L D
Sbjct: 112 LRTSAGTRELGTLLEKVDGLLRREHLPTYHENPEFHTSFAWWLDD 156
>gi|270003500|gb|EEZ99947.1| hypothetical protein TcasGA2_TC002743 [Tribolium castaneum]
Length = 94
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 23/28 (82%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIP 28
+D+P H GR+RSFPH+R +WAT VYIP
Sbjct: 62 VDDPTLHEGRLRSFPHERGNWATYVYIP 89
>gi|346319830|gb|EGX89431.1| Unidentified protein family UPF0406 [Cordyceps militaris CM01]
Length = 305
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 74/180 (41%), Gaps = 27/180 (15%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKE------ELNSVG 48
+D+P+ H GR R PH W + VYI + + L RL ++E EL+
Sbjct: 64 VDDPSLHQGRKRQVPHIAGHWPSHVYIEWRPSYEQHAVLTRLLTKVEESLDRETELHPFL 123
Query: 49 ISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNR-LTIKFNSIEIFCNEEKTR 106
S P P H+SLS+ L + D ++ + +L R ++ + F + + R
Sbjct: 124 TSDLGAPLPLHVSLSRPLSLATADKDDFLQRVSTSLGGAVRPFAVRPRRLAWFASPDSNR 183
Query: 107 SFIALG-------ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPN------FHASIAWCL 153
F+ LG S L +++ + +A + P Y+ FH SIAW
Sbjct: 184 CFLVLGVAAATDNGGSDGGPLMELLRRSNAAASRARQPLLYQADEEAARTAFHVSIAWTF 243
>gi|340515386|gb|EGR45641.1| predicted protein [Trichoderma reesei QM6a]
Length = 328
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 97/240 (40%), Gaps = 44/240 (18%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PL---QTNLARLYAMLKEELNSVG------ 48
+D+P H GR R PH +W + VY+ PL LARL + E + S
Sbjct: 77 VDDPALHHGRKRLTPHVVGNWPSHVYVEWHPLADQHEKLARLVCSVNEAIGSRAKLHSFL 136
Query: 49 ISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKT 105
S P P H+SLS+ L + D ++ + + L +++ ++ + + + +
Sbjct: 137 TSDLGAPLPLHISLSRPLSLSTSNKDRFLDQISSELHASGVSQFSVSPRRLLWYNSPDSN 196
Query: 106 RSFIAL----------GANSCKTSLTSIVQAVDKSAQEFKLPTYYE---------EPNFH 146
R+F+ L G L ++ A + AQ F P Y+ + FH
Sbjct: 197 RTFLILQVASSSMLKSGGTLSNPELMRLLNACNAVAQRFDQPILYQSKGGEEGSADEAFH 256
Query: 147 ASIAWCLQ---DKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSFPLT 202
SI W L D+ + K +F + S E++ + V + +K GN PL+
Sbjct: 257 ISIGWALDLPVDEES------NKALEVFRDGEFESVETWKIPVPGVKVKIGNVVSHVPLS 310
>gi|115491179|ref|XP_001210217.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114197077|gb|EAU38777.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 282
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 93/244 (38%), Gaps = 62/244 (25%)
Query: 7 HGGRIRSFPHQRNSWATLVYI---PLQTNL------------------ARLYAMLKEELN 45
HGGR R PH +W T +Y+ P + L +++++L+ +L
Sbjct: 48 HGGRKRVIPHVEGNWPTHIYLEWYPSKQELRVLEDVVSQLEGICSKSCVKIHSLLRSDL- 106
Query: 46 SVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLN--RLTIKFNSIEIFCNEE 103
G+ + + H+SLS+ +V+ + +E ++L N + S+ N E
Sbjct: 107 --GVQLPL----HISLSRPVVLRTEQRHSFLELFQSSLEGSNIPVFDVSTESLRCVSNYE 160
Query: 104 KTRSFIALGANSCKT-SLTSIVQAVDKSAQEFKLPTYYEEPN------------------ 144
KTR F L ++ SL +++ ++S F P YE
Sbjct: 161 KTRWFFVLRVKKPESDSLNRLLKLSNRSLARFDQPPLYEGSRSNEAKGRQHPSNSQDSRA 220
Query: 145 -----FHASIAWCLQDKTATLKPLLTKLD-NIFTQFKLTSDESFHVVTHIHMKTGNKFYS 198
FH SIAW L +A L LD + ++ D + K GN S
Sbjct: 221 DYSDCFHISIAWSLTAPSAEDTERLANLDLQNLSGLRIGFD-------CVKAKIGNNITS 273
Query: 199 FPLT 202
PL+
Sbjct: 274 IPLS 277
>gi|46120524|ref|XP_385085.1| hypothetical protein FG04909.1 [Gibberella zeae PH-1]
Length = 306
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 97/227 (42%), Gaps = 40/227 (17%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKEELNSVGISVEVI 54
+D+P+ H GR R PH +W + VY + L L A ++E ++S +
Sbjct: 64 VDDPSLHQGRKRHIPHVVGNWPSHVYTEWHPSTEQHGLLTTLIADIEEHVSSNTKLFNFL 123
Query: 55 ------PEP-HLSLSKTLVIPY----HWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEE 103
P P H+SLS+ L + ++D + E+ N + +K S+ F + +
Sbjct: 124 TSDLGSPLPLHISLSRPLSLTTSAKDEFLDKITESF--NSSGIAPFAVKPQSLAWFRSPD 181
Query: 104 KTRSFIALGANSCKTS------LTSIVQAVDKSAQEFKLPTYY----EEP---NFHASIA 150
R+F+ L S + LTS++ + A +F LP+ Y +EP FH SI
Sbjct: 182 SDRTFLILRVASGPDTKPLNPELTSLLLRSNSVAAQFGLPSLYARSPDEPVGGAFHVSIG 241
Query: 151 WC--LQDKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGN 194
W L +LK L +F Q K + + V + +K GN
Sbjct: 242 WTFHLPGDELSLKTL-----QLFKQPKFDDIRKWEISVLGVKVKIGN 283
>gi|50553270|ref|XP_504045.1| YALI0E16951p [Yarrowia lipolytica]
gi|49649914|emb|CAG79638.1| YALI0E16951p [Yarrowia lipolytica CLIB122]
Length = 247
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 76/167 (45%), Gaps = 25/167 (14%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQTNLA-RLYAMLKEELNSVGISVEVIPEPHLSLSKT 64
+ G++R+ PH W T VY+ + ++A R ML + V + +V+ + SL+K+
Sbjct: 39 DKNGKVRALPHIEGQWPTHVYLEWKPDVAWRRLEMLNGAIAKVLETYQVV---YHSLAKS 95
Query: 65 LV---IPYH--WIDTLVETLGNNLRHLNRLT---------IKFN----SIEIFCNEEKTR 106
V +P H DTL+ T + + + + IK N +++ N +KTR
Sbjct: 96 DVGVRLPLHVSLSDTLMPTTESKQQVTDSICEAVTGWKGPIKINVSKTKLQVVLNRQKTR 155
Query: 107 SFIALGANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCL 153
+F+ L + ++ AV+ + + LP P H SI W L
Sbjct: 156 AFVVLALTD-NEPIVRLIAAVNAAVEPHGLPALAAHP--HVSIGWFL 199
>gi|14603278|gb|AAH10099.1| C16orf57 protein [Homo sapiens]
Length = 90
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 24/29 (82%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIPLQ 30
D+ +HGGR+R+FPH+R +WAT VY+P +
Sbjct: 62 DDSTKHGGRVRTFPHERGNWATHVYVPCE 90
>gi|70983578|ref|XP_747316.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66844942|gb|EAL85278.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|159123678|gb|EDP48797.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 306
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 99/243 (40%), Gaps = 48/243 (19%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE-ELNSVGISVEV---- 53
D+P+ HGGR R PH +W T +Y+ P + L L +L + E+ G + E+
Sbjct: 62 DDPSLHGGRKRVIPHVEGNWPTHIYLEWYPSKAELTVLDGILSQVEVKLGGDAGEIHSLL 121
Query: 54 -----IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKT 105
+ P H+SLS+ +V+ ++ L+ + ++ S++ N E+T
Sbjct: 122 RSDLGVQLPLHISLSRPVVLRTEQRQPFMDMFQTALQESVVPAFSVSPCSLDWVSNYERT 181
Query: 106 RSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYE--------------------EPN 144
R F+ L +L ++ ++S F P+ Y +P
Sbjct: 182 RWFLVLRVTKPTNDNLNRLLSLSNRSLAHFGQPSLYAGNPASPVHRLGRHENIKPHTQPE 241
Query: 145 -----FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTGNKFYSF 199
FH S+AW L + T K + +D + ++ S H + +K GN S
Sbjct: 242 ELSHCFHVSLAWSLIEPTTEQKERIDAVD--IRRLRIL---SIHFDC-VKVKIGNNISSI 295
Query: 200 PLT 202
PL+
Sbjct: 296 PLS 298
>gi|326472112|gb|EGD96121.1| hypothetical protein TESG_03579 [Trichophyton tonsurans CBS 112818]
gi|326477023|gb|EGE01033.1| hypothetical protein TEQG_00087 [Trichophyton equinum CBS 127.97]
Length = 323
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 98/251 (39%), Gaps = 60/251 (23%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEV---- 53
D+P+ H GR R+ PH +W T +Y+ P L L ++KE ++ G + +
Sbjct: 70 QDDPSLHNGRTRAIPHVVGNWPTHIYLEWYPNNAELGVLSNIIKECESTEGSGIAIHSLL 129
Query: 54 -------IPEPHLSLSKTLVIPYH----WIDTLVETLGNNLRHLNRLTIKFNSIEIFCNE 102
+P H+SLS+ +V+ ++D + + N+ + + ++ N
Sbjct: 130 YSDLGAQLPL-HISLSRPVVLLTEQRELFLDLFRDYIYNS--KIQPFEVSPQNLAWVSNF 186
Query: 103 EKTRSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEEPN----------------- 144
E TR F+ L N L +++ + S F P YE P
Sbjct: 187 EHTRWFLVLRLNRPANNGLNQLLRLSNSSLACFGQPLLYESPAGTQKRKHGRQAGHQRNS 246
Query: 145 --------------FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHM 190
FH SIAW L++ + K LT ++ L + E ++ +
Sbjct: 247 SSSPSSIDTDYTSCFHISIAWTLEEPSHDEKERLTSIE-------LPARELIIKFNNVKL 299
Query: 191 KTGNKFYSFPL 201
K GN+ YS L
Sbjct: 300 KIGNQIYSEAL 310
>gi|452842186|gb|EME44122.1| hypothetical protein DOTSEDRAFT_172151 [Dothistroma septosporum
NZE10]
Length = 304
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/203 (22%), Positives = 80/203 (39%), Gaps = 48/203 (23%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---------------------PLQTNLARLYAM 39
D+P+ H GR R PH +W T VY+ L+ + ++++
Sbjct: 59 QDDPSLHAGRKRVVPHIAGNWPTHVYLEWFPEAEAYNVLCLALRDVQHSLEHGVDNVHSL 118
Query: 40 LKEELNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIF 99
L+ N +G+ + + H+SLS++L + D+ + L + T + ++
Sbjct: 119 LQ---NHLGVQLPL----HVSLSRSLALKTEQKDSFLTDLEQAVASTAVRTFSVSPTQLI 171
Query: 100 --CNEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYE--EPN---------- 144
NE++TR F+ L L ++ ++ A F P Y+ +P
Sbjct: 172 WHPNEDRTRWFLVLKLQRPAGDELQKMLGGCNELAARFDQPLLYQTTDPGSAEGKFKATS 231
Query: 145 -----FHASIAWCLQDKTATLKP 162
FH S+AW LQ A P
Sbjct: 232 IPVKAFHISVAWSLQPPIAQQPP 254
>gi|315046320|ref|XP_003172535.1| hypothetical protein MGYG_05126 [Arthroderma gypseum CBS 118893]
gi|311342921|gb|EFR02124.1| hypothetical protein MGYG_05126 [Arthroderma gypseum CBS 118893]
Length = 323
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 85/217 (39%), Gaps = 51/217 (23%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEV---- 53
D+P+ H GR R+ PH +W T +Y+ P T L L ++K N+ G V +
Sbjct: 70 QDDPSLHNGRTRAVPHVAGNWPTHIYLEWYPDNTELGILSKIIKGCQNAHGKDVAIHSLL 129
Query: 54 -------IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLT---IKFNSIEIFCNEE 103
+P H+SLS+ + + + ++ + L H +R+ I +S+ N E
Sbjct: 130 YSDLGAQLPL-HISLSRPVALLTEQREPFLDLFRDYL-HKSRIKPFEISPHSLAWESNFE 187
Query: 104 KTRSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEEP------------------- 143
TR F+ L N L +++ + F P YE P
Sbjct: 188 STRWFLVLRLNRPANNGLNQLLRLSNSVLSHFSQPPLYENPVGTQKRKNRKQAGNRDDSV 247
Query: 144 ------------NFHASIAWCLQDKTATLKPLLTKLD 168
FH SIAW L++ + K LT +D
Sbjct: 248 QSSASIDTDHTDCFHISIAWTLEEPSDEEKERLTTID 284
>gi|429328303|gb|AFZ80063.1| hypothetical protein BEWA_029130 [Babesia equi]
Length = 268
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 69/171 (40%), Gaps = 30/171 (17%)
Query: 8 GGRIRSFPHQRNSWATLVYIPLQ-----TNLARLYAMLKEELNSVGISVEVIPE------ 56
GR+R+ PH ++ TLVYI Q N+ +L + K + S + + +
Sbjct: 2 AGRVRNLPHIDGNFHTLVYIKDQHLRKEENVQQL--LRKNDEFSQPVDSDNTEDSFSDSD 59
Query: 57 --------------PHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNS-IEIFCN 101
HLSL K L + +ID + L L H+ + + I I N
Sbjct: 60 VNSGQNGGKFDHNYAHLSLCKPLYLRKQFIDPFLTKLKQQLSHIKPFYLMLDKRIAICAN 119
Query: 102 EEKTRSFIALGANS-CKT-SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIA 150
E K R F L + C+ S+ I+ VD + F YY + H S+A
Sbjct: 120 EAKNRFFAVLPVDGMCRDQSILPIIDIVDNVVESFGFQRYYTQRLPHVSVA 170
>gi|154296374|ref|XP_001548618.1| hypothetical protein BC1G_13013 [Botryotinia fuckeliana B05.10]
Length = 346
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKEELNSVGISV 51
D+P+ HGGR R PH +W T +YI + + ++++ + ++ + +
Sbjct: 81 DDPSLHGGRKRVTPHIEGNWPTHIYIEWYPSTIEFDLVSSLISQVESFKTHDIQTFLSND 140
Query: 52 EVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKTRSF 108
+P P H+SLS+ + DT + + ++ + + F+ + N EKTR F
Sbjct: 141 LGVPLPLHVSLSRAIGFSKDVKDTFLTSFEQVIKSSGVRPFEMGFSGLAWVPNYEKTRWF 200
Query: 109 IALGANSCKT-SLTSIVQAVDKSAQEFKLPTYY 140
+ L N+ ++ +L ++ +K +EF P Y
Sbjct: 201 LVLRVNTPESNALNKLLHVSNKVVEEFGQPPLY 233
>gi|347839124|emb|CCD53696.1| hypothetical protein [Botryotinia fuckeliana]
Length = 346
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKEELNSVGISV 51
D+P+ HGGR R PH +W T +YI + + ++++ + ++ + +
Sbjct: 81 DDPSLHGGRKRVTPHIEGNWPTHIYIEWYPSTIEFDLVSSLISQVESFKTHDIQTFLSND 140
Query: 52 EVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKTRSF 108
+P P H+SLS+ + DT + + ++ + + F+ + N EKTR F
Sbjct: 141 LGVPLPLHVSLSRAIGFSKDVKDTFLTSFEQVIKSSGVRPFEMGFSGLAWVPNYEKTRWF 200
Query: 109 IALGANSCKT-SLTSIVQAVDKSAQEFKLPTYY 140
+ L N+ ++ +L ++ +K +EF P Y
Sbjct: 201 LVLRVNTPESNALNKLLHVSNKVVEEFGQPPLY 233
>gi|322694458|gb|EFY86287.1| hypothetical protein MAC_07668 [Metarhizium acridum CQMa 102]
Length = 306
Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 97/239 (40%), Gaps = 45/239 (18%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEV----- 53
D+P+ H GR R PH +W T +YI P + L A++K +G +++
Sbjct: 54 DDPSLHQGRKRLNPHVAGNWPTHLYIEWHPTEVQHDTLDALIKRAQQDLGDDIQLHTFLN 113
Query: 54 ------IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNR--LTIKFNSIEIFCNEEKT 105
+P H+SLS+ L + D + + +++R ++K + + + +
Sbjct: 114 SDLGTDLPL-HISLSRPLSLLTSTKDEYLSKVKHSIRSCGTGVFSVKPAGLAWYKSPDSD 172
Query: 106 RSFIAL---GANSCKTS----------LTSIVQAVDKSAQEFKLPTYYEEPN-------F 145
R+F L AN+ S L ++ + A F P Y++ F
Sbjct: 173 RTFFVLRIASANATHDSKHLRASTNPELMMLLTRCNTVAAHFDQPPLYQQSQGESADTAF 232
Query: 146 HASIAWCLQ--DKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSFPL 201
H SI W L D+ A L+ L N+F + +++V V + K GN PL
Sbjct: 233 HISIGWTLAAPDEEARLRAL-----NLFQDKEFKDIHAWNVKVDGVKAKIGNVVTHVPL 286
>gi|327305273|ref|XP_003237328.1| hypothetical protein TERG_02050 [Trichophyton rubrum CBS 118892]
gi|326460326|gb|EGD85779.1| hypothetical protein TERG_02050 [Trichophyton rubrum CBS 118892]
Length = 323
Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 98/248 (39%), Gaps = 54/248 (21%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE----ELNSVGISVEV 53
D+P+ H GR R+ PH +W T +Y+ P T L L ++K E N V I +
Sbjct: 70 QDDPSLHNGRTRAIPHVVGNWPTHIYLEWYPNNTELGVLSNIIKGCESTEGNGVAIHSLL 129
Query: 54 IPE-----P-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKT 105
+ P H+SLS+ +V+ ++ ++ + +++ + + ++ N E T
Sbjct: 130 YSDLGAQLPLHISLSRPVVLLTEQRESFLDLFRDYIKNSKIQPFEVSPQNLAWVSNFENT 189
Query: 106 RSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEEPN-------------------- 144
R F+ L N L +++ + S F P YE P
Sbjct: 190 RWFLVLRLNRPANNGLNQLLRLSNSSLACFGQPPLYESPGGTQKRKHGRQAGHQRNPSSS 249
Query: 145 -----------FHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTHIHMKTG 193
FH SIAW L++ + K LT ++ L + + ++ +K G
Sbjct: 250 TGSIDADYTSCFHISIAWTLEEPSDDEKERLTSIE-------LPARDLIIKFNNVKVKVG 302
Query: 194 NKFYSFPL 201
N+ + L
Sbjct: 303 NQIHGEAL 310
>gi|425781935|gb|EKV19869.1| hypothetical protein PDIG_00360 [Penicillium digitatum PHI26]
gi|425783974|gb|EKV21785.1| hypothetical protein PDIP_02620 [Penicillium digitatum Pd1]
Length = 312
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 56/260 (21%), Positives = 94/260 (36%), Gaps = 67/260 (25%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPE-- 56
DNPN HGGR R PH +W T +Y+ P + L+ L ++ E N V ++
Sbjct: 56 DNPNLHGGRTRVIPHVEGNWPTHLYLEWYPGKDELSLLTDVISESGNWVDEKAPIVHSLL 115
Query: 57 --------P-HLSLSKTLVIPYHWIDTLVETLGNNL--RHLN-----------------R 88
P H+SLS+ +V+ E L + H+ R
Sbjct: 116 HSDLGAQLPLHISLSRPVVLRTEQRALFTEALQKAIYDSHVTSYVLNLQISWSSSNARIR 175
Query: 89 LTIKFNSIEIFCNEEKTRSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEEPN--- 144
++ ++ N EKTR F+ LG L +++ + + F P Y +
Sbjct: 176 FAVQPETLYWSSNYEKTRWFLVLGVQRPSHDGLNRLLRLSNDTLARFGQPPLYATSSTRR 235
Query: 145 ----------------------FHASIAWCLQDKTATLKPLLTKLD-NIFTQFKLTSDES 181
FH S+AW L + + + + ++D + ++ D
Sbjct: 236 EQTSASLHKGSSSMSGEDFSGCFHISLAWSLSEPSVKERERVARVDLRALREIEVEFD-- 293
Query: 182 FHVVTHIHMKTGNKFYSFPL 201
++ K GN S PL
Sbjct: 294 -----NVKAKIGNMVGSIPL 308
>gi|406865172|gb|EKD18215.1| hypothetical protein MBM_03987 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 340
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 58/264 (21%), Positives = 99/264 (37%), Gaps = 68/264 (25%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYIP---------LQTNLARLYAMLKE--------EL 44
D P+ H GR R PH +W T +YI + +LA L+E ++
Sbjct: 73 DEPSLHEGRKRLIPHIEGNWPTHLYIECMLSTSCTTMGKDLADSIEALQERDSREPKSKV 132
Query: 45 NSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCN 101
+S+ S +P P H+SLS++L D V++L ++ + + FN ++ N
Sbjct: 133 HSLVTSDIGVPLPLHVSLSRSLGFLAPQKDDFVDSLRRAVKESGIRPFNLSFNGLKWVAN 192
Query: 102 EEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEP----------------- 143
E TR F+ L + L ++ + + +++ P Y +P
Sbjct: 193 FEGTRWFLVLRIPRPEEDGLNKLLHVCNSTVKQYGQPPLYPKPPSEPLTIARKKEEQRRS 252
Query: 144 -------------------------NFHASIAWCLQDKTATLKPLLTKLDNIFTQFKLTS 178
FH S+AW LQ + + +L + T+ L
Sbjct: 253 ARFKRILFDTPSRKTDWTQMQDATDAFHVSLAWTLQSPSDE----ILELTEMATKDHLDG 308
Query: 179 DESFHV-VTHIHMKTGNKFYSFPL 201
++ + V I K GN S PL
Sbjct: 309 IQAIQLRVEEIKCKVGNVVTSMPL 332
>gi|335345784|gb|AEH41472.1| conserved hypothetical protein [Endocarpon pusillum]
Length = 367
Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 72/158 (45%), Gaps = 17/158 (10%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVIPEP 57
+D+P H GR R PH SW T VY+ P T L LY ++ ++ S+ +
Sbjct: 92 LDDPTLHAGRTRQTPHIEGSWPTHVYLEWYPSTTELQTLYRLINSTGSAPSESIHTLLHS 151
Query: 58 --------HLSLSKTLVIPYH----WIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKT 105
H+SLS L H ++D+L +++ + ++I+ + ++ N + T
Sbjct: 152 PLGARTPLHISLSVPLAFQTHEKSLFLDSLAQSISTSNTKAFTVSIQ-DRLDWVPNFDST 210
Query: 106 RSFIALGANSCKTS-LTSIVQAVDKSAQEFKLPTYYEE 142
F+AL + L +++ ++ +EF+ P YE
Sbjct: 211 GWFLALRLGQPEGDELNKLLKMCNEVVKEFRQPLLYER 248
>gi|322704817|gb|EFY96408.1| hypothetical protein MAA_08115 [Metarhizium anisopliae ARSEF 23]
Length = 305
Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 37/189 (19%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEV---- 53
+D+P+ H GR R PH +W T +YI P + L A++K+ +G +++
Sbjct: 53 VDDPSLHQGRTRLNPHVAGNWPTHLYIEWHPTEVQHDTLDALIKKTQQELGDDIQLHTFL 112
Query: 54 -------IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNR--LTIKFNSIEIFCNEEK 104
+P H+SLS+ L +P D + + +++R ++K + + + +
Sbjct: 113 NSDLGTDLPL-HISLSRPLSLPTSIKDDYLTKVKHSIRSCGTGVFSVKPAGLAWYKSPDS 171
Query: 105 TRSFIAL-----GAN--------SCKTSLTSIVQAVDKSAQEFKLPTYYEEPN------- 144
R+F + AN S L ++ + A F P YE+
Sbjct: 172 DRTFFVVRIAPTDANDDSKHLRASTNPELMMLLTRCNTVAAHFDQPPLYEQTQGESADTA 231
Query: 145 FHASIAWCL 153
FH SI W L
Sbjct: 232 FHISIGWTL 240
>gi|303272821|ref|XP_003055772.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463746|gb|EEH61024.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 313
Score = 44.3 bits (103), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 81/202 (40%), Gaps = 44/202 (21%)
Query: 9 GRIRSFPHQRNSWATLVYIPLQ-------TNLAR--LYAMLKE--ELNSVGISVEVIP-- 55
GR+R+F H ++AT V P+ T+ R L AM + L ++G P
Sbjct: 64 GRVRAFEHVEGNFATHVRAPIAFPSDASTTDALRASLRAMRAKMPSLRAIGDDAPAAPAT 123
Query: 56 EP-------------------HLSLSKTLVIPYHWIDTLVETLGNNLRHLN--RLTIKFN 94
P H+SLS+ + LV L L N +
Sbjct: 124 RPRADADADADADADVLPASLHVSLSRVFPVRAEARKPLVAALRARLAAANVDAFFVAAT 183
Query: 95 SIEIFCNEEKTRSFIALGANSCKTS----------LTSIVQAVDKSAQEFKLPTYYEEPN 144
++F N+++T +F+AL TS ++ AVD + + PT+Y +P
Sbjct: 184 DFDVFTNDDETTTFLALRLEEASTSAAPTRERRGGFAGLIAAVDAALRAKGHPTFYADPK 243
Query: 145 FHASIAWCLQDKTATLKPLLTK 166
HAS+ W + + L+ ++++
Sbjct: 244 PHASVMWAPGNVSRALRKIVSE 265
>gi|300122493|emb|CBK23063.2| unnamed protein product [Blastocystis hominis]
Length = 234
Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 40/171 (23%), Positives = 73/171 (42%), Gaps = 14/171 (8%)
Query: 6 EHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLK-EELNS-VGISVEVIPEP------ 57
E RIR H +W + YI ++ A L+ K ELNS + +S P
Sbjct: 24 EQPKRIRQIQHVDGNWPSFAYILVEGEEATLFVNDKIRELNSQLNLSSSKQFRPLSSETQ 83
Query: 58 -----HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALG 112
H+SL++ + I++ V ++ + I + IF NE+K+RSF +L
Sbjct: 84 GFNRFHVSLTRLFFLRGFQIESFVNSVRQAV-ATPAFYISASQTRIFTNEDKSRSFCSLL 142
Query: 113 ANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPL 163
+ ++ +D+ ++K +Y+ P H ++ + D + K L
Sbjct: 143 IQRGFNKVVDLIHQLDEVLVKYKKEKFYDPPVPHCTVGSVVGDLSEEAKKL 193
>gi|398397415|ref|XP_003852165.1| hypothetical protein MYCGRDRAFT_26273, partial [Zymoseptoria
tritici IPO323]
gi|339472046|gb|EGP87141.1| hypothetical protein MYCGRDRAFT_26273 [Zymoseptoria tritici IPO323]
Length = 275
Score = 43.9 bits (102), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 80/205 (39%), Gaps = 58/205 (28%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI-----PLQTNLAR-------------------L 36
D+P+ HGGR R PH +W VY+ P Q L +
Sbjct: 26 QDDPSLHGGRKRVTPHVAGNWPGHVYLEWCPNPEQQALLEQAICEMQSTSPAEGSERQTI 85
Query: 37 YAMLKEELNSVGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSI 96
+++LK N +G+ + + H+SLS+ L + D ++ L + + +
Sbjct: 86 HSLLK---NDLGVDLPL----HVSLSRPLTLKTPQKDPFLDQLKSAITQTAVSAFATRPL 138
Query: 97 EIFC--NEEKTRSFIALG-ANSCKTSLTSIVQAVDKSAQEFKLPTYYE------------ 141
++ NE++TR F+ L L ++++A ++ A EF P Y+
Sbjct: 139 DLLWHPNEDRTRWFLVLRLERPAADELQTLLKACNQIAGEFGQPFLYQTQSAMAATGSKR 198
Query: 142 ------------EPNFHASIAWCLQ 154
+FH SIAW LQ
Sbjct: 199 SRAITATAQAPPHSSFHISIAWSLQ 223
>gi|212541056|ref|XP_002150683.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210067982|gb|EEA22074.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 295
Score = 43.9 bits (102), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 87/197 (44%), Gaps = 29/197 (14%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAML-KEELNS----------- 46
D+PN HGGR R PH +W T +Y+ P + LA L +L K E S
Sbjct: 57 DDPNLHGGRKRVIPHVEGNWPTHIYLEWYPRRDELAVLADVLSKCEAESNHGTSKICSLL 116
Query: 47 ---VGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNR--LTIKFNSIEIFCN 101
+G+ + + H+SLS+ +V+ +E + ++ N T+ ++++ N
Sbjct: 117 YSELGVQLPL----HVSLSRPVVLSTDQKQPFIEAFEHAIKASNTKPFTVVPDTLDWVSN 172
Query: 102 EEKTRSFIALGANSCK-TSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL 160
E+TR F+ + K +L +++ ++ + P Y A+ DKT
Sbjct: 173 TERTRWFLVIRLEKPKDDNLNRLLRISNRILDSYGQPPLYAT-EIGATNRLLKHDKT--- 228
Query: 161 KPLLTKLDNIFTQFKLT 177
KP + ++D+ F ++
Sbjct: 229 KPPVAEVDDYTNCFHIS 245
>gi|302924733|ref|XP_003053955.1| hypothetical protein NECHADRAFT_75622 [Nectria haematococca mpVI
77-13-4]
gi|256734896|gb|EEU48242.1| hypothetical protein NECHADRAFT_75622 [Nectria haematococca mpVI
77-13-4]
Length = 344
Score = 43.9 bits (102), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 97/249 (38%), Gaps = 70/249 (28%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEVI--- 54
+D+P+ H GR R PH +W + +Y+ P T A L +L + +E++
Sbjct: 87 VDDPSLHQGRKRQVPHVVGNWPSHLYVEWHPSTTQHALLTELLADIEKQASGEIELLNFL 146
Query: 55 ------PEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEI--------- 98
P P H+SLS+ L + + GN L+R+T F S I
Sbjct: 147 TSDLGSPLPLHISLSRPLSL----------STGNKDEFLDRITQTFTSSGIAPFIVRPRG 196
Query: 99 ---FCNEEKTRSFIAL---------GANSCKT------SLTSIVQAVDKSAQEFKLPTYY 140
+ + + R+F+ L G++S + LT+++ + A +F PT Y
Sbjct: 197 LAWYRSPDSDRTFLILRVASNVSKAGSDSEEAVRTPNPELTALLAKSNTVATQFGQPTLY 256
Query: 141 EE------------PNFHASIAWC--LQDKTATLKPLLTKLDNIFTQFKLTSDESFHV-V 185
+ FH SI W L +L+ L +F Q K + + V
Sbjct: 257 QRNTNDVDTRDAVGTAFHISIGWTFHLPGDELSLETL-----GLFKQSKFAGIRGWEINV 311
Query: 186 THIHMKTGN 194
T I K GN
Sbjct: 312 TGIKAKIGN 320
>gi|346971799|gb|EGY15251.1| hypothetical protein VDAG_06105 [Verticillium dahliae VdLs.17]
Length = 345
Score = 43.5 bits (101), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 96/259 (37%), Gaps = 75/259 (28%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI-----------------PLQTNLA---RLYAMLK 41
D+P+ H GR R PH +W + VYI Q L RL++ L
Sbjct: 69 DDPSLHNGRKRLTPHVAGNWPSHVYIEWHPTTEQHALLVEFLDAAQKQLGTTHRLHSFLT 128
Query: 42 EELNSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHLNR-----LTIKFNS 95
+LN+ P P H+SLSK+L + D ++ L ++LR + T+ +
Sbjct: 129 SDLNA--------PLPLHISLSKSLSLTTANKDAFLDALTSSLRPTSAAVKGPFTVSPSG 180
Query: 96 IEIFCNEEKTRSFIAL-----------GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPN 144
+ + + + R+F+ L S L +++ + A LPT Y +
Sbjct: 181 LAFYKSPDSDRAFLVLRVADPKAKPPIPGTSANPQLRELLRRCNTVASSRGLPTLYASRD 240
Query: 145 ---------------------FHASIAW--CLQDKTATLK-------PLLTKLDNIFTQF 174
FH SIAW L D+ ++ P KL N +
Sbjct: 241 QVEAANKGEADAEADALVDNAFHVSIAWTFALPDEEMCIRTYRIFRAPSFKKLRNWEVRV 300
Query: 175 KLTSDESFHVVTHIHMKTG 193
+ +VVTH+ + G
Sbjct: 301 SGVKVKIGNVVTHVLLGNG 319
>gi|358387838|gb|EHK25432.1| hypothetical protein TRIVIDRAFT_31794 [Trichoderma virens Gv29-8]
Length = 323
Score = 43.5 bits (101), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 97/238 (40%), Gaps = 42/238 (17%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKEELNSVGISVEV---- 53
+D+P H GR R PH +W + +YI PL +L ++ +G V++
Sbjct: 73 VDDPALHQGRKRLIPHVVGNWPSHLYIEWHPLADQHEKLVQLVCRVNEQIGSRVKLHSFL 132
Query: 54 -----IPEP-HLSLSKTLVIPYHWIDTLVETLGNNLR--HLNRLTIKFNSIEIFCNEEKT 105
P P H+SLS+ + + D ++ + + L+ + + ++ + + + +
Sbjct: 133 KSDLGAPLPLHISLSRPISLTTSNKDVFLDKISSALQASAVPQFSVSPKRLRWYKSPDSN 192
Query: 106 RSFIALGANSCKT----------SLTSIVQAVDKSAQEFKLPTYYE-------EPNFHAS 148
R+F+ L S +T L ++ + Q F P Y+ + FH S
Sbjct: 193 RTFLILQVASSRTLSSAGTLSNPELMRLLNTCNDMVQSFNQPILYQTKGTDSADEAFHIS 252
Query: 149 IAWCLQ---DKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSFPLT 202
I W L D+ + K + F + S +S+ V V + +K GN PL+
Sbjct: 253 IGWALDLPVDEES------NKALSAFGDEEFQSLKSWEVSVPGVKVKIGNVVSHLPLS 304
>gi|261198084|ref|XP_002625444.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239595407|gb|EEQ77988.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
Length = 332
Score = 43.5 bits (101), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 90/246 (36%), Gaps = 53/246 (21%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTN-------LARLYAMLKEELNSVGI- 49
D+P+ H GR R PH +W T +Y+ P +AR L+ L G+
Sbjct: 77 QDDPSLHDGRKRVMPHVPGNWPTHIYLEWYPAAAEIAILAEVIARCGQKLENGLKVHGLL 136
Query: 50 --SVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKT 105
+ H+SLS+ +V+ + L + LR + +K + ++ N EKT
Sbjct: 137 YSDLGAQAPLHISLSRPVVLVKEDRQPFRDLLNDALRESDIRPFHVKTDGLDWVSNFEKT 196
Query: 106 RSFIALGA-NSCKTSLTSIVQAVDKSAQEFKLPTYYEEPN-------------------- 144
R F+ L L ++ ++S F+ P Y+ P
Sbjct: 197 RWFLVLRVMKPANNELNRLLAISNRSLAAFQQPPLYQTPAPAYTRASDRNTEKPPKQRST 256
Query: 145 ------------FHASIAWCL-----QDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTH 187
FH SIAW L +DK L K+ I F + ++V +
Sbjct: 257 TSSVAVADYSDYFHISIAWSLTEPSREDKERVASVELDKVKKIGIPFGSVKLKIGNIVHN 316
Query: 188 IHMKTG 193
+ + TG
Sbjct: 317 LELPTG 322
>gi|242799799|ref|XP_002483454.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|242799803|ref|XP_002483455.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218716799|gb|EED16220.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218716800|gb|EED16221.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 500
Score = 43.5 bits (101), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 16/126 (12%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE---ELNSVGISVEVIP 55
D+PN HGGR R PH +W T +Y+ P + L L +LK+ EL ++ +
Sbjct: 56 DDPNLHGGRKRVIPHVEGNWPTHIYLEWYPRKDELIILEDILKKCETELAHGHSKIDTLL 115
Query: 56 EP--------HLSLSKTLVIPYHWIDTLVETLGNNLRHLNR--LTIKFNSIEIFCNEEKT 105
+ H+SLS+ +V+ +E + ++ N T+ ++++ N E+T
Sbjct: 116 QSDLNVQLPLHVSLSRPVVLSTDQKQPFIEGFEHAIKESNTKPFTVISDTLDWVSNTERT 175
Query: 106 RSFIAL 111
R F+ +
Sbjct: 176 RWFLVI 181
>gi|239607741|gb|EEQ84728.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3]
gi|327354587|gb|EGE83444.1| hypothetical protein BDDG_06388 [Ajellomyces dermatitidis ATCC
18188]
Length = 332
Score = 43.1 bits (100), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 90/246 (36%), Gaps = 53/246 (21%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYI---PLQTN-------LARLYAMLKEELNSVGI- 49
D+P+ H GR R PH +W T +Y+ P +AR L+ L G+
Sbjct: 77 QDDPSLHDGRKRVMPHVPGNWPTHIYLEWYPAAAEIAILAEVIARCGQKLENGLKVHGLL 136
Query: 50 --SVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKT 105
+ H+SLS+ +V+ + L + LR + +K + ++ N EKT
Sbjct: 137 YSDLGAQAPLHISLSRPVVLVTEDRQPFRDLLNDALRESDIRPFHVKTDGLDWVSNFEKT 196
Query: 106 RSFIALGA-NSCKTSLTSIVQAVDKSAQEFKLPTYYEEPN-------------------- 144
R F+ L L ++ ++S F+ P Y+ P
Sbjct: 197 RWFLVLRVMKPANNELNRLLAISNRSLAAFQQPPLYQTPAPAYTRASDRNTEKPPKQRST 256
Query: 145 ------------FHASIAWCL-----QDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTH 187
FH SIAW L +DK L K+ I F + ++V +
Sbjct: 257 TSSVAVADYSDYFHISIAWSLTEPSREDKERVASVELDKVKEIGIPFGSVKLKIGNIVHN 316
Query: 188 IHMKTG 193
+ + TG
Sbjct: 317 LELPTG 322
>gi|71013774|ref|XP_758661.1| hypothetical protein UM02514.1 [Ustilago maydis 521]
gi|46098412|gb|EAK83645.1| hypothetical protein UM02514.1 [Ustilago maydis 521]
Length = 350
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 58/133 (43%), Gaps = 16/133 (12%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNR----LTIKFNSIEIFCNEEKTRSFIALGA 113
H+SL++ + + D V+ + L F+ I N++ +R F+ L
Sbjct: 188 HISLTRPFTVRSYERDEYVKIATAQVHQLKENISSFPFTFSRIAYLSNDDASRHFMVLEV 247
Query: 114 NSCKTSLTSIVQAVDKSAQE-FKLPTYYEEPNFHASIAWCLQDKT----------ATLKP 162
S + L S+ A+ + F+ YYEE FHASIA C+ + T A +P
Sbjct: 248 GSGREKLRSLSTALSTELRRAFRAKVYYEEARFHASIA-CVMNMTSVSDDRIKLDAAPEP 306
Query: 163 LLTKLDNIFTQFK 175
+ ++L I + +
Sbjct: 307 ISSRLGTIIDKIE 319
>gi|399217287|emb|CCF73974.1| unnamed protein product [Babesia microti strain RI]
Length = 252
Score = 42.7 bits (99), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 48/90 (53%), Gaps = 4/90 (4%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN--SIEIFCNEEKTRSFIALG-AN 114
H+SLSK + + YH+ID + + + + + + +I I+ N+ K+R FI +G AN
Sbjct: 63 HISLSKGVNLRYHFIDPFLAVVSSIVAKFRSFPVILDTKAINIYANDRKSRYFIGIGVAN 122
Query: 115 -SCKTSLTSIVQAVDKSAQEFKLPTYYEEP 143
S K+ + ++ +D + ++F Y P
Sbjct: 123 VSAKSKIQQLLDQLDLAIEQFGFSKYKGIP 152
>gi|402085918|gb|EJT80816.1| hypothetical protein GGTG_00810 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 318
Score = 42.7 bits (99), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 45/200 (22%), Positives = 76/200 (38%), Gaps = 41/200 (20%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAMLKE---------------- 42
D+P+ H GR R PH +W + +YI P Q A L ++L E
Sbjct: 63 DDPSLHQGRTRQIPHVAGNWPSHIYIEWFPTQEECATLASLLDELRSELGGGGEGPSGGQ 122
Query: 43 ELNSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRH---LNRLTIKFNSIEI 98
+L+S S P P H+SLS+ V+ D + L + + R + +++
Sbjct: 123 QLHSFLASDLGTPLPLHISLSRPFVLTTGDKDDFLRRLTVAVEGQTAVPRFAVHPSALSW 182
Query: 99 FCNEEKTRSFIAL------GANSCKTSLTSIVQAVDKSAQEFKLPTYY------------ 140
+ + R+F+ L L +++ + ++F P Y
Sbjct: 183 HRSPDSNRAFLVLRVRERDDDGDTNPGLAALLARCNALVRQFGQPPLYASSSSSSSPPSP 242
Query: 141 EEPNFHASIAWCLQDKTATL 160
+ FH S+AW D T L
Sbjct: 243 ADDKFHVSVAWSFADVTEDL 262
>gi|400595475|gb|EJP63276.1| hypothetical protein BBA_07876 [Beauveria bassiana ARSEF 2860]
Length = 326
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 100/249 (40%), Gaps = 57/249 (22%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKE------ELNSVG 48
+D+P H GR R PH W + VYI + + L RL A +++ EL+
Sbjct: 71 VDDPALHQGRKRQVPHIVGQWPSHVYIEWRPSYEQHAVLTRLLAKVEDTLDGEIELHQFM 130
Query: 49 ISVEVIPEP-HLSLSKTLVIPY----HWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEE 103
S P P H+SLS+ L + ++ + E+LG+ + ++ S+ F + +
Sbjct: 131 TSDLGAPLPLHVSLSRPLSLSTTEKDDFLQRVSESLGSGA--VPPFIMQPRSLAWFTSPD 188
Query: 104 KTRSFIALGANSC---------KTSLTSIVQAVDKSAQEFKLPTYYEEPN---------- 144
RSF+ LG + L +++ + +A F Y+
Sbjct: 189 SNRSFLVLGVAATGGDGDDEGDNAPLMKLLRKSNAAAARFGQSLLYQHARDDDDDDDDDD 248
Query: 145 -----FHASIAWCL----QD-KTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTG 193
FH SIAW QD ATL ++F + L ++ + V ++ +K G
Sbjct: 249 EARTAFHVSIAWTFARPGQDISQATL--------DLFQRLPLDEVMAWRIAVDNVKVKIG 300
Query: 194 NKFYSFPLT 202
N S L+
Sbjct: 301 NVVTSVALS 309
>gi|320591775|gb|EFX04214.1| hypothetical protein CMQ_1142 [Grosmannia clavigera kw1407]
Length = 339
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 56/253 (22%), Positives = 95/253 (37%), Gaps = 52/253 (20%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARLYAML---------------KEE 43
D P+ H GR R+ PH W + +Y+ P A L A+L +
Sbjct: 60 DCPSLHQGRTRAIPHVAGQWPSHIYVEWFPTSPEHASLEALLTAVIDVLPQTGNPPGRPA 119
Query: 44 LNSVGISVEVIPEP-HLSLSKTLVIPYHWIDTLVETL--GNNLRHLNRLTIKFNSIEIFC 100
++S+ S +P P H+SLS+ V+ T +++L + + + FN ++ +
Sbjct: 120 VHSLLTSDLGVPLPLHISLSRPFVLRTAQKPTFLDSLIASVSSSRVVPSYVGFNGLDWYR 179
Query: 101 NEEKTRSFIALG---------ANSCKTS-------LTSIVQAVDKSAQEFKLPTYYEEPN 144
+ + R+F+ L A+ K+ L S++ + PT Y +
Sbjct: 180 SPDSARAFLVLRVAVTDKAAVADVSKSRGRGHNHPLVSLLARCNSQVASVGQPTLYGQST 239
Query: 145 -------------FHASIAWCLQDKTATLKPLLTKLDNIFTQ-FKLTSDESFHV-VTHIH 189
FH S+AW L D L + + Q + S E H V +
Sbjct: 240 ASESDEDDTVAFAFHVSVAWTLADDIHAWVKLTEQAYGTWQQNQRKESKEPLHFNVDSVK 299
Query: 190 MKTGNKFYSFPLT 202
+K GN PL
Sbjct: 300 IKIGNIVSDIPLA 312
>gi|429851268|gb|ELA26471.1| hypothetical protein CGGC5_1702 [Colletotrichum gloeosporioides
Nara gc5]
Length = 319
Score = 40.8 bits (94), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 13/123 (10%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---------PLQTN-LARLYAMLK-EELNSVGIS 50
D+P H GR R+ PH W + VYI L TN LA++ +LK ++L + S
Sbjct: 78 DDPTLHQGRKRAIPHIVGHWPSHVYIEWHPTASQHALLTNLLAKIAPLLKSQKLQPLLTS 137
Query: 51 VEVIPEP-HLSLSKTLVIPYHWIDTLVETLGNNLRHL-NRLTIKFNSIEIFCNEEKTRSF 108
P P H+SL++ L + D + L +++ H T+ I F + + R+F
Sbjct: 138 DLNAPLPLHVSLTRPLSLTTAQKDDFLSALTSSVSHATGAFTLSPRGIGFFKSPDSDRAF 197
Query: 109 IAL 111
+ L
Sbjct: 198 LIL 200
>gi|440910445|gb|ELR60241.1| F-box/LRR-repeat protein 21 [Bos grunniens mutus]
Length = 434
Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 33/147 (22%), Positives = 63/147 (42%), Gaps = 13/147 (8%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D +E+ G+I +R SW L+ N+ + + +EE+ + E P HL
Sbjct: 263 IDVVSENAGQIEFHSIKRQSWDALIKHSPGVNVVMYFFLYEEEMET--FFKEETPVTHLY 320
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSI-----EIFCNEEKTRSFIALGANS 115
+++ ++ LG N L L + N I E+ C E ++ ALG +
Sbjct: 321 FGRSVS------KEILGRLGLNCPRLTELVVCANGIQVIDTELICIAEHCKNLTALGLSE 374
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
C+ S ++ ++ V ++ + EE
Sbjct: 375 CEVSCSAFIEFVRLCGRKLTHLSIMEE 401
>gi|77735459|ref|NP_001029424.1| F-box/LRR-repeat protein 21 [Bos taurus]
gi|122140750|sp|Q3ZBA7.1|FXL21_BOVIN RecName: Full=F-box/LRR-repeat protein 21; AltName: Full=F-box and
leucine-rich repeat protein 21
gi|73586880|gb|AAI03469.1| F-box and leucine-rich repeat protein 21 [Bos taurus]
gi|296485310|tpg|DAA27425.1| TPA: F-box/LRR-repeat protein 21 [Bos taurus]
Length = 434
Score = 40.4 bits (93), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/147 (22%), Positives = 63/147 (42%), Gaps = 13/147 (8%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D +E+ G+I +R SW L+ N+ + + +EE+ + E P HL
Sbjct: 263 IDVMSENAGQIEFHSIKRQSWDALIKHSPGVNVVMYFFLYEEEMET--FFKEETPVTHLY 320
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSI-----EIFCNEEKTRSFIALGANS 115
+++ ++ LG N L L + N I E+ C E ++ ALG +
Sbjct: 321 FGRSVS------KEILGRLGLNCPRLTELVVCANGIQVIDTELICIAEHCKNLTALGLSE 374
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
C+ S ++ ++ V ++ + EE
Sbjct: 375 CEVSCSAFIEFVRLCGRKLTHLSIMEE 401
>gi|342877554|gb|EGU79004.1| hypothetical protein FOXB_10433 [Fusarium oxysporum Fo5176]
Length = 328
Score = 39.3 bits (90), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 96/242 (39%), Gaps = 48/242 (19%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN------LARLYAMLKEELNS------VG 48
+D+P+ H GR R PH +W + VY + L L A +++E++S
Sbjct: 76 VDDPSLHHGRKRQVPHVVGNWPSHVYTEWHPSTKQHGLLTSLMADIEKEVSSEIKLHNFL 135
Query: 49 ISVEVIPEP-HLSLSKTLVIPY----HWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEE 103
S P P H+SLS+ L + ++D + +TL N+ + ++ + + + +
Sbjct: 136 TSDLGSPLPLHISLSRPLSLTTGNKDEFLDKITDTLDNS--GIAPFVVRPQGLAWYRSPD 193
Query: 104 KTRSFIAL--------------GANSCKTSLTSIVQAVDKSAQEFKLPTYYE-------E 142
R+F+ L G LTS++ + A ++ P Y+
Sbjct: 194 SDRTFLILRVASGPRSPTNSKDGVKPLNPELTSLLTKSNTVATQYGQPPLYQGKAKVPVG 253
Query: 143 PNFHASIAWC--LQDKTATLKPLLTKLDNIFTQFKLTSDESFHV-VTHIHMKTGNKFYSF 199
FH SI W L +LK L +F Q K + + V I +K GN +
Sbjct: 254 DAFHISIGWTFHLPADEMSLKTL-----RLFRQPKFGDIRKWEISVASIKVKIGNAVHHV 308
Query: 200 PL 201
L
Sbjct: 309 AL 310
>gi|156083068|ref|XP_001609018.1| hypothetical protein [Babesia bovis T2Bo]
gi|154796268|gb|EDO05450.1| conserved hypothetical protein [Babesia bovis]
Length = 263
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 8/113 (7%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFN-SIEIFCNEEKTRSFIALGANS- 115
HLSL K + + +I + L L+ + + + +I I NEE+ F L +
Sbjct: 62 HLSLCKPIYLKRQFIRPFKDRLEETLKRIKPFYLILDKNIAICANEERNNFFAVLPVETQ 121
Query: 116 CKT-SLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKL 167
C S++ ++ VD AQ F YYE+ H S+A T L ++T+L
Sbjct: 122 CNARSISPLIDLVDDIAQIFGYQKYYEQRKPHVSLAV-----TGNLTSVMTEL 169
>gi|225561949|gb|EEH10229.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 330
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 69/159 (43%), Gaps = 22/159 (13%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---PLQTNLARL---YAMLKEELNS--------- 46
D+P+ H GR R PH +W T +Y+ P T LA L A ++L S
Sbjct: 79 DDPSLHHGRKRMMPHVAGNWPTHIYLEWFPAPTELAILEEAIARCDQKLPSTIHGLLHTD 138
Query: 47 VGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEK 104
+G V++ H+SLS+ +++ + L + LR + +K ++ N E+
Sbjct: 139 LGAQVQL----HISLSRPVMLLTEDRQPFRDLLTDALRESDIRPFHVKPVGLDWVSNFEE 194
Query: 105 TRSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
TR F+ L L ++ +KS F+ P Y +
Sbjct: 195 TRWFLVLRVTKPTNNELNRLLAISNKSLAAFQQPPLYHD 233
>gi|343429307|emb|CBQ72880.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 312
Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRHL----NRLTIKFNSIEIFCNEEKTRSFIALGA 113
H+SL++ + + D V+ ++ L F+ I N++ +R F+ L
Sbjct: 155 HISLTRPFTVRSYERDEYVKVAAAEVQRLKATVGSFPFTFSRIAYLANDDASRHFMVLEV 214
Query: 114 NSCKTSLTSIVQAVDKS-AQEFKLPTYYEEPNFHASIAWCLQDKT 157
+ + + A+ + F+ YY+E FHAS A C+ D T
Sbjct: 215 GAGRDKFHRLSTALSTELRRAFRAKAYYDEARFHASTA-CVLDST 258
>gi|310790448|gb|EFQ25981.1| vegetative cell wall protein gp1 [Glomerella graminicola M1.001]
Length = 655
Score = 38.5 bits (88), Expect = 1.8, Method: Composition-based stats.
Identities = 17/61 (27%), Positives = 28/61 (45%)
Query: 112 GANSCKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNIF 171
GAN C L +V A++F P + EEPNFH A + + + + +++
Sbjct: 150 GANPCPLGLDKFAGSVVTIAKDFASPHFLEEPNFHPHAAQLISSTAVCMDQIAGRFNDVM 209
Query: 172 T 172
T
Sbjct: 210 T 210
>gi|123476124|ref|XP_001321236.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121904058|gb|EAY09013.1| hypothetical protein TVAG_073790 [Trichomonas vaginalis G3]
Length = 180
Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 63/143 (44%), Gaps = 8/143 (5%)
Query: 11 IRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLSLSKTLVIPYH 70
IR FPH++ W T ++ L+ + Y ++ E + S + + H+SLS + Y
Sbjct: 3 IRHFPHKQGQWVT--HVCLEIEIDESYLVVPEGFIPLFDSSKD-QKLHVSLSPLFSLKYF 59
Query: 71 WIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKS 130
+ ++ N + +NR F + + + F AL +C +T+I DK
Sbjct: 60 QVKAFKKSCENFSKTINRQCFTFENGVFLDDSSNSSVFYALNIRNCD-EMTNIKNKFDKL 118
Query: 131 AQEFKLPTYYEEPN---FHASIA 150
+ PT+ E+ + H SIA
Sbjct: 119 VDSYN-PTFLEKFDDEIMHISIA 140
>gi|154283659|ref|XP_001542625.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150410805|gb|EDN06193.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 311
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 67/148 (45%), Gaps = 13/148 (8%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTN---LARLYAMLKEELNSVGISVEVIPEP 57
D+P+ H GR R PH +W T +Y+ + + + ++ +L +L G V++
Sbjct: 78 QDDPSLHHGRKRIMPHVAGNWPTHIYLESRCDQKLQSTIHGLLHTDL---GAQVQL---- 130
Query: 58 HLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEEKTRSFIALGANS 115
H+SLS+ +++ + L + LR + +K ++ N E+TR F+ L
Sbjct: 131 HISLSRPVMLLTEDRQPFRDLLTDALRESDIRPFHVKPVGLDWVSNFEETRWFLVLRVTK 190
Query: 116 -CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
L ++ +KS F+ P Y +
Sbjct: 191 PTNNELNRLLAISNKSLAAFQQPPLYHD 218
>gi|332234529|ref|XP_003266459.1| PREDICTED: F-box/LRR-repeat protein 21 [Nomascus leucogenys]
Length = 434
Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/147 (21%), Positives = 65/147 (44%), Gaps = 13/147 (8%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D +E+ G+I+ P +++SW L+ + N+ + + +EE+ + E P HL
Sbjct: 263 IDVVSENPGQIKFHPIKKHSWDALIKHSPRVNVVMYFFLYEEEVET--FFKEETPVTHLY 320
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSI-----EIFCNEEKTRSFIALGANS 115
+++ ++ +G N L L + N + E+ C E + ALG +
Sbjct: 321 FGRSVS------KAVLGQVGLNCPRLIELVVCANGLQPLDNELICIAEHCTNLTALGLSE 374
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
C+ S ++ ++ V + + EE
Sbjct: 375 CEVSCSAFIKFVRLCGRRLTQLSIMEE 401
>gi|443894603|dbj|GAC71951.1| hypothetical protein PANT_5d00146 [Pseudozyma antarctica T-34]
Length = 307
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 46/101 (45%), Gaps = 6/101 (5%)
Query: 58 HLSLSKTLVIPYHWIDTLVET----LGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGA 113
H+SL++ + H D V+T +G + + F+ I N+ ++ F+ L
Sbjct: 152 HISLTRPFTVRAHERDEYVKTALAEIGRLSTGMASFSFSFSRIACLANDNASKHFMVLEV 211
Query: 114 NSCKTSLTSIVQAVDKSA-QEFKLPTYYEEPNFHASIAWCL 153
+ +L + A+ + F+ +YY+E FHAS CL
Sbjct: 212 GPGRENLRKLSTALGAELHRAFRAKSYYQEARFHASTT-CL 251
>gi|388853555|emb|CCF52727.1| uncharacterized protein [Ustilago hordei]
Length = 327
Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 75 LVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGANSCKTSLTSIVQAVDKSA-QE 133
L ET+G N F+ I N++ +R F+ L S + + A+ K +
Sbjct: 189 LKETIGRN-----SFPFSFSRIAYLNNDDASRHFMVLEVGSGRDQFHKLSTALSKVLHRA 243
Query: 134 FKLPTYYEEPNFHASIAWCLQ 154
F+ YYEE FHAS + CL+
Sbjct: 244 FRAKAYYEEARFHASTS-CLE 263
>gi|325091392|gb|EGC44702.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 330
Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 37/160 (23%), Positives = 67/160 (41%), Gaps = 22/160 (13%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIP----------LQTNLARLYAMLKEELNS---- 46
D+P+ H GR R PH +W T +Y+ L+ +AR L ++
Sbjct: 78 QDDPSLHHGRKRIVPHVAGNWPTHIYLEWFPAPIELAILEEAIARCDQKLPSTIHGLLHT 137
Query: 47 -VGISVEVIPEPHLSLSKTLVIPYHWIDTLVETLGNNLRH--LNRLTIKFNSIEIFCNEE 103
+G V++ H+SLS+ +++ + L + LR + +K ++ N E
Sbjct: 138 DLGAQVQL----HISLSRPVMLLTEDRQPFRDLLTDALRESDIRPFLVKPVGLDWVSNFE 193
Query: 104 KTRSFIALGANS-CKTSLTSIVQAVDKSAQEFKLPTYYEE 142
+TR F+ L L ++ +KS F+ P Y +
Sbjct: 194 ETRWFLVLRVTKPTNNELNRLLAISNKSLAAFQQPPLYHD 233
>gi|440752584|ref|ZP_20931787.1| hypothetical protein O53_954 [Microcystis aeruginosa TAIHU98]
gi|440177077|gb|ELP56350.1| hypothetical protein O53_954 [Microcystis aeruginosa TAIHU98]
Length = 230
Score = 36.2 bits (82), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 17/118 (14%)
Query: 54 IPEPHLSLSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSIEIFCNEEKTRSFIALGA 113
+P P L+ S V P + T + T+ + L ++ I N F +G
Sbjct: 120 LPNPELNRSVDYVAPDNPTQTAIATIFGQVLKLEKVGINDN-------------FFEIGG 166
Query: 114 NSCKTSLTSIVQAVDKS-AQEFKLPTYYEEPNFHASIAWCLQDKTATLKPLLTKLDNI 170
NS + T ++ + +S A E L +E+P A +A + D ATL+ L T +DN+
Sbjct: 167 NSLQA--TQVISRLRESFALELPLRRLFEQPTV-ADLALAVTDIHATLQKLQTPIDNL 221
>gi|339252482|ref|XP_003371464.1| conserved hypothetical protein [Trichinella spiralis]
gi|316968306|gb|EFV52602.1| conserved hypothetical protein [Trichinella spiralis]
Length = 537
Score = 36.2 bits (82), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 116 CKTSLTSIVQAVDKSAQEFKLPTYYEEPNFHASIAWCLQDKTATL 160
CK + ++++ K FK+ ++ EEP + WC+ D TA L
Sbjct: 225 CKATNSNLISVTSKECPTFKMTSFLEEPAKPDTFTWCMGDDTAVL 269
>gi|310789915|gb|EFQ25448.1| hypothetical protein GLRG_00592 [Glomerella graminicola M1.001]
Length = 337
Score = 36.2 bits (82), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 95/248 (38%), Gaps = 67/248 (27%)
Query: 2 DNPNEHGGRIRSFPHQRNSWATLVYI---------PLQTNL----------ARLYAMLKE 42
D+PN H GR R+ PH +W + VYI L T+L ++Y +L
Sbjct: 90 DDPNLHQGRKRTIPHIAGNWPSHVYIEWHPDADQHSLLTSLLDQIKPLLGSQKMYPLLTS 149
Query: 43 ELNSVGISVEVIPEP-HLSLSKTLVI------PYHWIDTLVETLGNNLRHLNRLTIKFNS 95
+LN+ P P H+SLS+ L + P+ T + L+ + F
Sbjct: 150 DLNA--------PLPLHISLSRPLSLTTAQKGPFLSSLTSSLSSATGDFALSPRGVGF-- 199
Query: 96 IEIFCNEEKTRSFI-------ALGANSCKTS-----LTSIVQAVDKSAQEFKLPTYYE-- 141
F + + R+F+ A +S TS L +++ + A F P Y+
Sbjct: 200 ---FKSPDSDRAFLILRVADPAASRDSASTSGKNPHLRTLLTRCNAVALRFNHPALYQVH 256
Query: 142 -----EPNFHASIAW---------CLQDKTATLKPLLTKLDNIFTQFKLTSDESFHVVTH 187
+ FH SI W CLQ +P + + + +VVT+
Sbjct: 257 ATELVDDAFHVSIGWTFGLPPEDACLQTYALLKQPEFRLIRQWRIEVAGVKVKIGNVVTN 316
Query: 188 IHMKTGNK 195
+ +KT +K
Sbjct: 317 VPLKTPSK 324
>gi|193211378|ref|NP_001123210.1| F-box/LRR-repeat protein 21 [Ovis aries]
gi|313118242|sp|B3FL73.1|FXL21_SHEEP RecName: Full=F-box/LRR-repeat protein 21; AltName: Full=F-box and
leucine-rich repeat protein 21
gi|164653333|gb|ABY65115.1| F-box and leucine-rich repeat protein 21 long isoform [Ovis aries]
Length = 434
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 31/132 (23%), Positives = 57/132 (43%), Gaps = 13/132 (9%)
Query: 1 MDNPNEHGGRIRSFPHQRNSWATLVYIPLQTNLARLYAMLKEELNSVGISVEVIPEPHLS 60
+D +E+ G+I +R SW L+ N+ + + +EE+ + E P HL
Sbjct: 263 IDVVSENPGQIEFHSIKRQSWDALIKHSPGVNVVMYFFLYEEEMET--FFKEETPVTHLY 320
Query: 61 LSKTLVIPYHWIDTLVETLGNNLRHLNRLTIKFNSI-----EIFCNEEKTRSFIALGANS 115
+++ ++ L N L L + N I E+ C E ++ ALG +
Sbjct: 321 FGRSVS------KGILGRLSLNCPRLVELVVCANGIQVIDNELICIAEHCKNLTALGLSE 374
Query: 116 CKTSLTSIVQAV 127
C+ S T+ ++ V
Sbjct: 375 CEVSCTAFIEFV 386
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.133 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,033,024,012
Number of Sequences: 23463169
Number of extensions: 107622811
Number of successful extensions: 254267
Number of sequences better than 100.0: 300
Number of HSP's better than 100.0 without gapping: 188
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 253680
Number of HSP's gapped (non-prelim): 348
length of query: 202
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 67
effective length of database: 9,191,667,552
effective search space: 615841725984
effective search space used: 615841725984
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)