BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15697
(332 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|432955674|ref|XP_004085595.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 345
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 153/284 (53%), Gaps = 10/284 (3%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I I P ++ + AL++++ + + L + GS + IG S
Sbjct: 45 ERYRFSLQSITYIHNLIQPFISNISHHGRALTSEQILCAALRFFANGSFLYNIGDAEHIS 104
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CR++ +V + + F + +P + L I F R GFPNV GC+DGT + I
Sbjct: 105 KATVCRAVRKVC-LALKRFLHIFIVFPGHKPLRPIKEEFHRIAGFPNVVGCIDGTHIPII 163
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
APNE+E +VNR H++NV IC + V A WPGSVHD+R+ R S + + +
Sbjct: 164 APNENEADYVNRKSIHSINVQIICDAAH-ITNVEAKWPGSVHDSRIFRESTLSNRLECGE 222
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
D +LGD YP + L+ P P+ QRF H RTR +E + G+LK +F
Sbjct: 223 I----DGFLLGDRGYPCQPKLLTPYP-EPEQGPQQRFNWAHSRTRARVEMTIGLLKARFQ 277
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
CL ++R++P+ A +++A LHNIA I R ++ + I D E
Sbjct: 278 CLRHLRVTPERACDIIVACVVLHNIA--ILRGEQHPALHIQDPE 319
>gi|326668947|ref|XP_003198900.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 149/273 (54%), Gaps = 9/273 (3%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I+ + P + + T R+ AL++ + V L + G+ + IG S
Sbjct: 44 ERYRFSREGIMYIVNLLDPHIKSLTRRSRALTSAQTVCIGLRYFASGTFLYTIGDAENLS 103
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
KS++CR+I +V + + F +V +P + +F GFPNV G +D T + I
Sbjct: 104 KSAVCRAIRKVY-LALKQFLGSVVVFPSHLRPQVVKQSFFAIAGFPNVIGAIDCTHIQIK 162
Query: 127 AP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
AP +E FVNR G H++NV +C Y V A WPGSVHD+R+ R S +Y F+
Sbjct: 163 APPGPNEGDFVNRKGVHSINVQMVCDSMYHITNVEAKWPGSVHDSRIFRESHLYTSFERG 222
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
+ D ++LGD Y +++ + P +P+ + R+ RTR IE +FG LK +F
Sbjct: 223 DY----DGILLGDRGYACRQYFMTPFP-DPNPGPETRYNAALVRTRARIEMTFGHLKGRF 277
Query: 246 PCLNYMRLSPQFAGMVVLASTTLHNIALTIDRD 278
CL ++R++P A +++A T LHNIA TI R+
Sbjct: 278 QCLKHLRVAPDRACDIIVACTVLHNIA-TIRRE 309
>gi|432950594|ref|XP_004084518.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 346
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 149/284 (52%), Gaps = 9/284 (3%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + + I P +T + R AL++++ + + L + GS + IG S
Sbjct: 45 ERYRFSLQSITYTQNLIQPFITNISHRGRALTSEQILCAALRFFANGSFLYNIGDAEHIS 104
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CR++ +V + + F + +P + L DI F R GFPNV C+DGT + I
Sbjct: 105 KATVCRAVRKVC-LALKRFLHIFIMFPGHKPLRDIKEEFHRIAGFPNVVECIDGTHIPII 163
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP E+E +VNR H++NV IC + V A WPG VHD+R+ R S + + +
Sbjct: 164 APTENEADYVNRKSIHSINVQIICDAAHIITNVEAKWPGYVHDSRIFRESTLSNRLEYGE 223
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
D +LGD YP + L+ P P+ QRF H RT +E + G+LK F
Sbjct: 224 I----DGFLLGDRGYPCQPKLLTPYP-EPEQGPQQRFNWAHSRTGARVEMTIGLLKACFQ 278
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
CL ++R++P+ A +++A LHNIA I R ++ + I D E
Sbjct: 279 CLRHLRVTPERACDIIVACVVLHNIA--ILRGEQHPALHIQDPE 320
>gi|326675671|ref|XP_003200405.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 148/273 (54%), Gaps = 9/273 (3%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I+ + P + + T R+ AL++ + V L + G+ + IG S
Sbjct: 44 ERYRFSREGIMYIVNLLDPHIKSLTRRSRALTSAQTVCIGLRYFASGTFLYTIGDAENLS 103
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
KS++CR+I +V + + F +V +P + +F GFPNV G +D T + I
Sbjct: 104 KSAVCRAIRKVY-LALKQFLGSVVVFPSHLRPQVVKQSFFAIAGFPNVIGAIDCTHIQIK 162
Query: 127 AP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
AP +E FVNR G H++NV +C Y V A WPGSVHD+R+ R S +Y F+
Sbjct: 163 APPGPNEGDFVNRKGVHSINVQMVCDSMYHITNVEAKWPGSVHDSRIFRESHLYTSFERG 222
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
+ D ++LGD Y +++ + P +P+ R+ RTR IE +FG +K +F
Sbjct: 223 DY----DGILLGDRGYACRQYFMTPFP-DPNPGPQTRYNAALVRTRARIEMTFGHVKGRF 277
Query: 246 PCLNYMRLSPQFAGMVVLASTTLHNIALTIDRD 278
CL ++R++P A +++A T LHNIA TI R+
Sbjct: 278 QCLKHLRVAPDRACDIIVACTVLHNIA-TIRRE 309
>gi|432850588|ref|XP_004066823.1| PREDICTED: putative nuclease HARBI1-like, partial [Oryzias latipes]
Length = 326
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 142/267 (53%), Gaps = 6/267 (2%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG-ATHGP 66
E +R S + FI I P +T + AL++++ + + L + GS + IG A H
Sbjct: 2 ERYRFSLQSITFIHNLIQPYITNISHPGRALTSEQILCAALRFFANGSFLYNIGDAEHIR 61
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CR++ +V + +V + L I F R FPNV GC+DGT +LI
Sbjct: 62 KKATVCRAVRKVCLALKRFLRIFIVFLGHKPLRAIKEEFHRIVCFPNVVGCIDGTHILII 121
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP E+E ++VNR H++NV C + V A WPGSVHD+++ R S + + +
Sbjct: 122 APTENEAEYVNRKSIHSINVQIKCDAAHIITNVEAKWPGSVHDSQIFRESTLSNRLECGE 181
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
D +LGD YP L+ P P+ QRF H RTR +E + G+LK +F
Sbjct: 182 I----DGFLLGDRGYPYHPKLLTPHP-EPEQGPQQRFNLAHSRTRARVEMTIGLLKARFQ 236
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIAL 273
CL ++R++P+ A +++A LHNIA+
Sbjct: 237 CLRHLRVTPERACDIIVACVVLHNIAI 263
>gi|326675400|ref|XP_003200346.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 148/297 (49%), Gaps = 15/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
KE +R S + ++ + P + T R ALS + + L + G + +G
Sbjct: 41 LKERYRFSKESLVYLTRLLKPHIANVTNRGSALSTESILSIALRFFASGHFLYNVGDAEH 100
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+++CR++ V + + V++P + L I F R G PNV GCVDGT +
Sbjct: 101 VGKATVCRAVRTVC-LALKHLLPTFVQFPGHKPLRVIKNEFHRVAGLPNVIGCVDGTHIP 159
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I AP+ +E +VNR H++NV IC V A WPGSVHDAR+ R S + + F +
Sbjct: 160 IKAPSINEGDYVNRKSTHSINVQVICEATQIITNVEAKWPGSVHDARIFRESSLCQAFQQ 219
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
+ + +LGD YP +L+ P P+ R H RTR +E + GILK +
Sbjct: 220 GQY----NGYLLGDRGYPCLPYLMTPYP-EPEPGPQTRLNLAHSRTRAKVEMTIGILKSR 274
Query: 245 FPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVD 301
F CL +R+SP+ A +++A LHNIA TI ++ + E DG EE +D
Sbjct: 275 FQCLRGLRVSPERACDIIVACVVLHNIA-TIRGESLPPCI-------EGDGPEEHLD 323
>gi|326680520|ref|XP_003201537.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 494
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 147/297 (49%), Gaps = 15/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
KE +R S + ++ + P + T R ALS + + L + G + +G
Sbjct: 41 LKERYRFSKESLVYLTRLLKPHIANVTNRGSALSTESILSIALRFFASGHFLYNVGDAEH 100
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+++CR++ V + + V++P L I F R G PNV GCVDGT +
Sbjct: 101 VGKATVCRAVRTVC-LALKHLLPTFVQFPGHKPLRVIKNEFHRVAGLPNVIGCVDGTHIP 159
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I AP+ +E +VNR H++NV IC V A WPGSVHDAR+ R S + + F +
Sbjct: 160 IKAPSINEGDYVNRKSTHSINVQVICEATQIITNVEAKWPGSVHDARIFRESSLCQAFQQ 219
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
+ + +LGD YP +L+ P P+ R + RTR +E + GILK +
Sbjct: 220 GQY----NGYLLGDRGYPCLPYLMTPYP-EPEPGPQTRLNLAYSRTRAKVEMTIGILKSR 274
Query: 245 FPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVD 301
F CL +R+SP+ A +++A LHNIA TI ++ + E DG EE +D
Sbjct: 275 FQCLRGLRVSPERACDIIVACVVLHNIA-TIRGESLPPCI-------EGDGPEEHLD 323
>gi|166157907|ref|NP_001107363.1| uncharacterized protein LOC100135188 [Xenopus (Silurana)
tropicalis]
gi|163916031|gb|AAI57215.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
gi|213624369|gb|AAI71002.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
gi|213625629|gb|AAI71004.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
Length = 371
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 163/307 (53%), Gaps = 18/307 (5%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
K +RLS+ ++ + E++ L TER+ A+ K+L LH+ GS + + G
Sbjct: 60 KARYRLSSEAIQSLFEKVKENLNPKTERSHAIPGIVKLLCALHFFASGSFQNSVSTAGGI 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
++SS R + +V+ IV+ ++ + +P + +L + +FL GFPNV G +D T V
Sbjct: 120 TQSSFSRFLFQVIQAIVN-LYKEYISFPNDPASLKAVKQSFLSIAGFPNVIGAIDCTHVA 178
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P+E+E + N H+LN+ + N K V A +PGS D+ ++ S ++++F E
Sbjct: 179 LSPPSENEYIYRNEKHFHSLNMQVVVSNNMKIIDVVAKFPGSTEDSYILSQSGLHQRF-E 237
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
G F +LGDS Y LK WL+ P NP A++++ H TR +I+ +FG+LK +
Sbjct: 238 NG--EFGSGWLLGDSMYGLKPWLMTP-VSNPKTRAEKKYNHAHSATRSIIDRTFGMLKTR 294
Query: 245 FPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA--EDDGEEE 298
F CL+ + SP+ + LHNIA+ NE+ +E++++ A E+ E
Sbjct: 295 FGCLDKSKGVLLYSPEKVCKIFFVCCILHNIAM-----NESDYIEVNEEYAQHEEINMEP 349
Query: 299 AVDDPDE 305
DD DE
Sbjct: 350 FEDDTDE 356
>gi|148225801|ref|NP_001089350.1| uncharacterized protein LOC734400 [Xenopus laevis]
gi|62185695|gb|AAH92316.1| MGC115312 protein [Xenopus laevis]
Length = 371
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 97/310 (31%), Positives = 165/310 (53%), Gaps = 19/310 (6%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
K +RLS+ ++ + E++ L TER+ A+ K+L LH+ GS + + G
Sbjct: 60 KARYRLSSEAIQSLFEKVKENLNPKTERSHAIPGIVKLLCALHFFTSGSFQNSVSTAGGI 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
++SS R + +V+ IV+ ++ + +P + +L + +F+ GFPN+ G +D T V
Sbjct: 120 TQSSFSRFLFQVIQAIVN-LYKEYISFPNDPASLRAVKQSFMSVAGFPNIIGAIDCTHVA 178
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P+E+E + N H+LN+ + N K V A +PGS D+ ++ S ++++F E
Sbjct: 179 LSPPSENEYIYRNEKHFHSLNMQVVVSHNMKIIDVVAKFPGSTEDSYILSQSGLHQRF-E 237
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
G F +LGDS Y LK WL+ P NP A++++ H TR +I+ +FG+LK +
Sbjct: 238 NG--EFGSGWLLGDSMYGLKPWLMTP-INNPKTRAEKKYNHAHSATRSVIDRTFGMLKTR 294
Query: 245 FPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
F CL+ + SP+ + LHNIA+ NEN ++I+++ A+ EE
Sbjct: 295 FGCLDKSKGVLLYSPEKVCKIFFVCCILHNIAM-----NENDYMDINEEYAQ---HEEIN 346
Query: 301 DDPDEIDEEE 310
+P E D EE
Sbjct: 347 AEPFEDDTEE 356
>gi|328713516|ref|XP_003245100.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 358
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/335 (32%), Positives = 161/335 (48%), Gaps = 21/335 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T FK FRLS + ++++L + L T T+RN+ +S KVL L + +G+ +
Sbjct: 35 LTDEEFKYRFRLSKSCIDYLLSLLEGKLKTATDRNMTMSPMNKVLITLRFYALGTMLISV 94
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
G S S ++I +V VI + ++ P N+ D M FP V G +D
Sbjct: 95 ADMFGVSISCASKTIRDVSYVIAE-LSSIFIQIPVHNIKDTKMKMYEIARFPLVFGAIDC 153
Query: 121 TMVLIDAPNEHEDQ-FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T V I +P H+ + F NR G +LNV A+ +F + A PGS HD+ + RNS ++
Sbjct: 154 THVRIQSPGGHDSEMFRNRKGYFSLNVQALVNSELQFMDIVARCPGSAHDSHIFRNSRLF 213
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
+ E G F ILGDS Y LK +L+ NP + RTR ++E SFG
Sbjct: 214 ARL-ESG--EFQKMAILGDSGYALKPYLLTS-ISNPVGRIQMLYNESQIRTRNVVERSFG 269
Query: 240 ILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD--DEAEDDGE 296
+ K +FP L+ +RL + +++A+ LHNI + D N N E+ + +E ED E
Sbjct: 270 VWKRRFPVLSLGLRLQLKTVQAIIVATAVLHNICREMKEDLPNDNFELTEPHNETEDIIE 329
Query: 297 EEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYF 331
E DD R+ T R L NYF
Sbjct: 330 VEYNDD------------GRDQDSTMRDMLLNNYF 352
>gi|340382901|ref|XP_003389956.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 418
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 161/331 (48%), Gaps = 34/331 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
+ E FR+S T I E+ +L + T + +++V L +M G+ Y IG
Sbjct: 92 WMENFRMSKTTFVMICNELRSSLKKSSTTMRQPIPVEKRVAISLWFMATGTDYRTIGHLF 151
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G SK+S+C +I +V I+ T +R +KWP ENL +I F K GFP G VDGT +
Sbjct: 152 GVSKASVCLAIRQVCRAILTTLLERYIKWPSGENLKNIISGFKHKFGFPQCVGAVDGTHI 211
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I +P ++ + NR G H++ + F + WPG VHDARV NS +Y+K
Sbjct: 212 PIISPEDYPADYYNRKGWHSVLMQGTVDHLGIFIDIYIGWPGRVHDARVFVNSSLYKK-G 270
Query: 184 EEG-----WRPFPDA-----VILGDSAYPLKEWLIPPRTRNPDAAADQ-RFLRCHKRTRR 232
+EG W+ + V+LGD AYPL WL+ P + N DQ RF + R
Sbjct: 271 QEGTLLPNWKESIEGQEVPLVLLGDPAYPLLPWLMKPYSDNGHLTRDQKRFNYRLSKGRV 330
Query: 233 LIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
++E+++G LK ++ C L + +S +F VV A LHNI EIH
Sbjct: 331 VVEHAYGRLKGRWRCLLKRLDVSVEFVPDVVAACCVLHNIC------------EIH---- 374
Query: 292 EDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQ 322
GE + D ID+ R D ++ +Q
Sbjct: 375 ---GETFNNELLDGIDDTLVQRSDYSSDNSQ 402
>gi|432848600|ref|XP_004066426.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 340
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 149/302 (49%), Gaps = 20/302 (6%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + ++ + P ++ T R ALS + + L + GS + +G
Sbjct: 43 ERYRFSRDSLIYLTNLLKPFISNVTHRGAALSTENILCIALRFFATGSFLYSVGDAEHVG 102
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRK-GGFPNVAGCVDGTMVLID 126
K+++CR++ + +RM+ + + P+ +++ GFPNV GC+DGT V I
Sbjct: 103 KATVCRAVRTACLAL-----KRMLGYLVQFPGHKPLRVIKEYSGFPNVIGCMDGTHVPIK 157
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
+P+ +E +VNR H++NV IC + + A WPGSVHDAR+ R S + KF +
Sbjct: 158 SPSVNEGDYVNRKSMHSINVQVICDATHLVTNIEAKWPGSVHDARIFRESSLCPKFHQGH 217
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
+ + +LGD YP +L+ P P F H RTR +E + GILK +F
Sbjct: 218 F----NGYLLGDRVYPCLPFLMSPYP-EPQPGPQAHFNLAHSRTRARVEMTIGILKVRFQ 272
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEA-VDDPDE 305
CL +R+ P+ A ++ A LHNIA E E+DG E+A +D D+
Sbjct: 273 CLRGLRVKPERACDIIAACAVLHNIA--------TMRGERAPPPLEEDGPEDAPLDHADQ 324
Query: 306 ID 307
D
Sbjct: 325 RD 326
>gi|348543892|ref|XP_003459416.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 346
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 135/267 (50%), Gaps = 8/267 (2%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I + P + + T R+ AL+ + V L + G+ + +G
Sbjct: 44 ERYRFSREGIIYITTLLDPYVKSSTRRSRALTTAQTVCIALRFFASGTFLYAVGDAENIG 103
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
KS++CR+I +V + + F V +P + F GFPNV G +D T + I
Sbjct: 104 KSAVCRAIRKVY-LALKQFLGVFVVFPSHVRPQVVKQGFFAIAGFPNVIGAIDCTHIAIK 162
Query: 127 AP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
AP +E FVNR G H++NV +C V A WPGSVHD+R+ R S + F+
Sbjct: 163 APPGPNEGDFVNRKGVHSINVQMVCDSMCHITNVEAKWPGSVHDSRIFRESGLCTLFERG 222
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
+ D ++LGD Y + + + P +P+ R+ RTR IE +FG LKE+F
Sbjct: 223 AY----DGILLGDRGYACRRYFLTPFP-DPNPGPQTRYNAALARTRACIEMTFGQLKERF 277
Query: 246 PCLNYMRLSPQFAGMVVLASTTLHNIA 272
CL +R++P A +++A LHNIA
Sbjct: 278 QCLKGLRVAPDRACDIIVACAVLHNIA 304
>gi|432955676|ref|XP_004085596.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 280
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 141/272 (51%), Gaps = 9/272 (3%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
I A P R A+S+ + + L + GS + IG SK+++CR++ +V + +
Sbjct: 6 IPSATGLPVLRLRAISSAVILCAALRFFANGSFLYNIGDAEHISKATVCRAVGKVC-LAL 64
Query: 84 DTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDH 142
F + +P + L I F R GFPNV GC+DGT + I AP E+E +VNR H
Sbjct: 65 KRFLHIFIVFPGHKPLRPIKEEFHRIAGFPNVVGCIDGTHIPIIAPTENEADYVNRKSIH 124
Query: 143 ALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP 202
++NV IC + V A W GSVHD+R+ R S + + + D +LGD YP
Sbjct: 125 SINVQIICDAAHIITNVEAKWHGSVHDSRIFRESTLSNRLECGEI----DGFLLGDRGYP 180
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVV 262
+ L+ P P+ Q F H RTR +E + G+LK F CL ++R++P+ A ++
Sbjct: 181 CQPKLLTPYP-EPEQGPQQCFNWAHSRTRARVEMTIGLLKACFQCLRHLRVTPERACDII 239
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
+A LHNIA I R ++ + I D E + +
Sbjct: 240 VACVVLHNIA--ILRGEQHPALHIQDPEEDSN 269
>gi|72038947|ref|XP_792833.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 346
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/277 (33%), Positives = 144/277 (51%), Gaps = 12/277 (4%)
Query: 32 TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMV 91
T R++AL +V L + GS Y + G G SK+S+ R + ++ V+V +V
Sbjct: 59 TRRSMALPVALQVPMALRFYATGSFYAVTGDLQGVSKASVSRVVRDISRVLVREASNYIV 118
Query: 92 KWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
+P + +L + F G PN G VD T V I +P+ E FVNR G H++N+ I
Sbjct: 119 -FPRDRASLLQVKRGFTDMCGIPNTLGAVDCTHVRIRSPSIDEHLFVNRKGYHSINIQCI 177
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP 209
C KF V A +PGS HD+ + NS + R+F++ + +LGDS YPL+ +L+
Sbjct: 178 CDSQMKFLNVLARFPGSSHDSYIWANSGICRRFEDVPVQ----GHLLGDSGYPLRPFLLT 233
Query: 210 PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY----MRLSPQFAGMVVLAS 265
P NP ++R+ + HKR R +IE SFG+LK +F C++ ++ SP+ + +A
Sbjct: 234 P-LLNPTTRPEERYNQSHKRGRSVIERSFGVLKSRFRCIDVSGGGLQFSPERVCHIFVAV 292
Query: 266 TTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDD 302
LHNI +T + + V DD DD E D
Sbjct: 293 AVLHNICVTNNLPYDEPIVPHLDDNNHDDVHAEIAGD 329
>gi|270012680|gb|EFA09128.1| hypothetical protein TcasGA2_TC015989 [Tribolium castaneum]
Length = 358
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 156/311 (50%), Gaps = 11/311 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK FR S + ++ E I L T R ++SA K+L + ++ GS ++G T
Sbjct: 47 FKMRFRFSKATILWLHELIGHDLEPTTRRRKSISAINKILITMRYLATGSFQQLVGDTVA 106
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
KS++C I V+ I Q +K P E L+++ + F RK P V G +D + V
Sbjct: 107 VHKSTVCVVIKSVIQKIAQLKPQ-FIKMPNREELHNVQLKFYRKRRMPRVIGAIDCSHVR 165
Query: 125 IDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I++P + + F NR G ++NV A+C + + + A WPGSVHD+ + +S +
Sbjct: 166 IESPGGPNAEIFRNRKGFFSINVQAVCDADLQIRNIVARWPGSVHDSTIFNDSSLCAHL- 224
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
E G + + +LGDS Y + +L+ P NP AA++ + H+ TR IE FG+LK
Sbjct: 225 ERG--EYENGFLLGDSGYACRPFLLTP-VLNPRTAAEEAYNLSHRTTRNAIERCFGVLKR 281
Query: 244 KFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDD 302
+FPCL+ +R ++A LHNIA+ N+N+ D E + E E V
Sbjct: 282 RFPCLSLGLRTKMNTTLATIVACAVLHNIAIF---TNDNEPPRDPDVEVPQELEIEPVHR 338
Query: 303 PDEIDEEEQPR 313
+ I+E R
Sbjct: 339 NNNINENTAAR 349
>gi|270015185|gb|EFA11633.1| hypothetical protein TcasGA2_TC000011 [Tribolium castaneum]
Length = 561
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 155/307 (50%), Gaps = 17/307 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
AFK+ FRL+ V +++ I + T+RN +LSA+ ++L L + G ++G
Sbjct: 251 AFKKRFRLNKATVMHLVDIIGDRVAPKTQRNKSLSAQTQMLIALRFYATGGFLELLGDWI 310
Query: 65 GPSKSSICRSIHEVVPVI--VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTM 122
KS+ICR I V I + + +M + EE L F R GFP V G +D T
Sbjct: 311 HVHKSNICRVIQRVTHEIARLSPHYIKMPRTAEE-LMATKRKFFRICGFPRVVGAIDCTH 369
Query: 123 VLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V I +P + + + NR G ++NV IC + K + + WPGSVHD+ + NS +
Sbjct: 370 VGIQSPGGANAELYRNRKGYFSINVQTICDADLKLLHIISRWPGSVHDSTIFNNSPLPVD 429
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
F +LGD YP +++L+ P RNP+ A+ + + R + +TR IE FGIL
Sbjct: 430 FR------LGRGYLLGDGGYPCQQYLLIP-VRNPNNASQEAYNRAYIKTRNTIERFFGIL 482
Query: 242 KEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
K +FP L +RL V++A L+NI N ++E + DE ++ EE+ +
Sbjct: 483 KRRFPLLKSGLRLKIDTIVQVIVACGVLYNIC-----KERNDHIEEYFDEVMENDEEDFI 537
Query: 301 DDPDEID 307
+ I+
Sbjct: 538 YQENNIN 544
>gi|328705061|ref|XP_003242682.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 379
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 105/328 (32%), Positives = 162/328 (49%), Gaps = 32/328 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGS-----PYHII 60
F + FRL+ +++E ++P + + R+ A+ KVL L++ G GS Y+I
Sbjct: 44 FIKNFRLTKDLTRYLIELLSPFVEVKS-RSSAIDLSTKVLIALNFFGTGSYQSPVGYNIF 102
Query: 61 GATHGPSKSSICRSIHEVV--PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCV 118
A PS S + I + + P +++T+ VK+P N+ ++ GFP V GC+
Sbjct: 103 NAVSQPSVSRCVKEIVDALNQPQVINTW----VKFPS-NIQELNQVRDETTGFPGVIGCI 157
Query: 119 DGTMVLIDAP--------NEH-EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
D T V I P N H E ++NR H++NV IC K VNA +PGS HD
Sbjct: 158 DCTHVAIVPPSTNLNLVENHHPEYLYINRKNYHSINVQLICDSKLKILNVNALFPGSTHD 217
Query: 170 ARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKR 229
+ NS V E R D +LGDS YPL++WL+ P NP +AA+ +
Sbjct: 218 NHIWNNSNVLPVVQELHERNLNDYYLLGDSGYPLRQWLLTP-ILNPSSAAEIHYNTKQMC 276
Query: 230 TRRLIENSFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
TR LIE G+LK +F CL + +P+ + ++ A LHN+ +T N I
Sbjct: 277 TRSLIERCNGVLKARFRCLIKDRTLHYNPEKSSAIINACVVLHNLCITY-----NVPEYI 331
Query: 287 HDDEAEDD-GEEEAVDDPDEIDEEEQPR 313
+++ E+D G + V++ D I +Q R
Sbjct: 332 YEELEENDMGIYQEVENNDVIHNNQQNR 359
>gi|432950717|ref|XP_004084578.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 343
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 145/302 (48%), Gaps = 16/302 (5%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + ++ + P ++ T ALS + + L + GS + +G
Sbjct: 42 ERYRFSRDSLIYLTNLLKPFISNVTHWGAALSTENILCIALRFFATGSFLYSMGDAEHVG 101
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CR++ + + V++P + L I F R GFPNV GC+DGT V I
Sbjct: 102 KATVCRAVRTAC-LALKRLLGYFVQFPGHKPLRVIKEEFHRIAGFPNVIGCMDGTHVPIK 160
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
+P+ +E +VNR H++NV + + A WPGSVHDAR+ R S + KF +
Sbjct: 161 SPSVNEGDYVNRKSMHSINVQVTIYGSCLVTNIEAKWPGSVHDARIFRESSLCPKFHQGH 220
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
+ + +LGD YP +L+ P P F H RTR +E + GILK +F
Sbjct: 221 F----NGYLLGDRVYPCLPFLMSPYP-EPQPGPQAHFNLAHSRTRARVEMTIGILKVRFQ 275
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEA-VDDPDE 305
CL +R+ P+ A ++ A LHNIA E E+DG E+A +D D+
Sbjct: 276 CLRGLRVKPERACDIIAACAVLHNIA--------TMRGERAPPPLEEDGPEDAPLDHADQ 327
Query: 306 ID 307
D
Sbjct: 328 RD 329
>gi|432849892|ref|XP_004066664.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 344
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/314 (28%), Positives = 156/314 (49%), Gaps = 33/314 (10%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T F+F F L +++E + AL T+R+ A+S + +VL+ L + GS +
Sbjct: 36 LTHFSFPRDFIL------YLVELLREALGRRTQRSRAISPEVQVLAALGFYTSGSFQTSM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVD-----TFFQRMVKWPEENLNDIPMTFLRKGGFPNVA 115
G G S++S+ R + V +V+ F R E++L + F R G P V
Sbjct: 90 GDIIGISQASMSRCVSNVTKALVEKAPQFITFDREPSSREQSLQE----FQRVAGIPGVL 145
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +D V I APN + +VN+ G H++ +C + +WPG +HD V+
Sbjct: 146 GVLDCVQVAIKAPNSEDSSYVNKKGFHSVACQLVCDARGLLLSAETHWPGGLHDTEVLER 205
Query: 176 SMVYRKF--DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S +Y++ EEGW +LGD YPL++WL+ P P++ A+ R+ H T +
Sbjct: 206 SALYKQLQDSEEGW-------LLGDGRYPLRKWLMTP-VEAPESPAECRYNLAHTATHEI 257
Query: 234 IENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRD----NENQNVE 285
++ +F ++ +F CL+ Y++ SP+ + ++LA LHN++L D + +E
Sbjct: 258 VDRTFRAIQTRFRCLDGTKGYLQYSPERSSSILLACCVLHNVSLQSGLDAWTLERTEPLE 317
Query: 286 IHDDEAEDDGEEEA 299
H D + ++EA
Sbjct: 318 EHRDPKQRPADDEA 331
>gi|348545800|ref|XP_003460367.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 342
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 142/299 (47%), Gaps = 17/299 (5%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + ++ + P + + T R+ AL+ + V L + G+ + +G
Sbjct: 44 ERYRFSREGIIYLTNLLDPYVKSSTHRSRALTTAQTVCIALRFFASGTFLYAVGDAENID 103
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
KS++CRSI +V + + F V +P + F GFPNV G +D T + I
Sbjct: 104 KSAVCRSIRKVY-LALKHFLGVFVVFPSHLRPQVVKQGFFSIAGFPNVIGTIDCTHIAIK 162
Query: 127 AP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
AP +E FVNR G H++NV +C V A WPGSVHD+R+ R S + F+
Sbjct: 163 APPGPNEGDFVNRKGVHSINVQMVCDSMCHITNVEAKWPGSVHDSRIFRESGLCTLFERG 222
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
+ D ++LGD Y ++ +P+ R+ RTR IE +FG +K +F
Sbjct: 223 AY----DGILLGDRGYACRQNF-----PDPNPGPRTRYNAALARTRARIEMTFGQIKGRF 273
Query: 246 PCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
CL +R++P A +++ LHNIA R VE+ D DD + +D P
Sbjct: 274 QCLKSLRVAPDRACDIIVVCAVLHNIATI--RKERTPVVEVQPD---DDLQPVHLDQPS 327
>gi|50748127|ref|XP_421117.1| PREDICTED: putative nuclease HARBI1-like [Gallus gallus]
Length = 348
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 157/293 (53%), Gaps = 12/293 (4%)
Query: 17 VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIH 76
+ ++++ + +L+ PT+R+ A+S + +VL+ L + GS +G G S++S+ R +
Sbjct: 46 ICYLVDLLGASLSRPTQRSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQASMSRCVA 105
Query: 77 EVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
V +V+ Q + +PE+ + + F G P V G VD T V I APN +
Sbjct: 106 NVTEALVERAPQ-FIHFPEDEAAVQSLKDDFYALAGMPGVLGVVDCTHVAIKAPNAEDLS 164
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV 194
+VNR G H+LN + +C + +WPGS+ D V++ + + +F+ E ++ D
Sbjct: 165 YVNRKGLHSLNCLMVCDARGALLSAETHWPGSMPDCNVLQQAALTSQFENELYK---DGW 221
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----Y 250
+LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F ++ +F CL+
Sbjct: 222 LLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHNVIERTFRTIRSRFRCLDGSKGT 280
Query: 251 MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEE-EAVDD 302
++ SP+ + ++LA LHNI+L D + H + AE++ E+ E++D
Sbjct: 281 LQYSPEKSSHIILACCVLHNISLQHGLDVWSAPAAGHVEPAEEEYEQMESMDS 333
>gi|51230557|ref|NP_001003734.1| putative nuclease HARBI1 [Danio rerio]
gi|82181968|sp|Q6AZB8.1|HARB1_DANRE RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|50603827|gb|AAH78390.1| Harbinger transposase derived 1 [Danio rerio]
Length = 349
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 153/320 (47%), Gaps = 32/320 (10%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+ F F F + +++E + +L T+R+ A+S ++L+ L + GS +
Sbjct: 36 LNTFGFPREF------IYYLVELLKDSLLRRTQRSRAISPDVQILAALGFYTSGSFQSKM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTF-----FQRMVKWPEENLNDIPMTFLRKGGFPNVA 115
G G S++S+ R + V +++ F R E F R G PNV
Sbjct: 90 GDAIGISQASMSRCVSNVTKALIEKAPEFIGFTR----DEATKQQFKDEFYRIAGIPNVT 145
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G VD + I APN + +VN+ G H++N +C + +WPGS+ D V +
Sbjct: 146 GVVDCAHIAIKAPNADDSSYVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQ 205
Query: 176 SMVYRKF-----DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
S V + F D+EGW +LGD+ YPLK+WL+ P ++P++ AD R+ H T
Sbjct: 206 SNVAKLFEEQENDDEGW-------LLGDNRYPLKKWLMTP-VQSPESPADYRYNLAHTTT 257
Query: 231 RRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
+++ +F ++ +F CL+ Y++ SP+ ++ A LHNI+L D
Sbjct: 258 HEIVDRTFRAIQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNISLQSGLDAWTFERTE 317
Query: 287 HDDEAEDDGEEEAVDDPDEI 306
D++ +D + DDP+ +
Sbjct: 318 ATDQSGEDIDPSDTDDPEAL 337
>gi|270004721|gb|EFA01169.1| hypothetical protein TcasGA2_TC010492 [Tribolium castaneum]
Length = 319
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 137/278 (49%), Gaps = 10/278 (3%)
Query: 42 EKVLSCLHW-MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND 100
EK L W + Y IG G +KSS+ + V+ VI+ R +KWP E+
Sbjct: 35 EKALLVTLWTLATPDSYRSIGERFGIAKSSVYFCLQTVINVILHELTHRFIKWPTEDETV 94
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
K G P V G +DG + I P E+ + NR +++ + +C P+ F +
Sbjct: 95 YVANRFAKYGLPKVIGVIDGCHIPIKKPVENSIDYFNRKKFYSMVLQGVCKPDLTFIDCD 154
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD 220
WPGSVHD RV+R S +Y E+ R P+ ++GDSAYPLK+WLI P N
Sbjct: 155 IRWPGSVHDGRVLRTSDIY-PIAEQLCR--PNYYVIGDSAYPLKKWLITPYRNNGHLLPQ 211
Query: 221 QRFL-RCHKRTRRLIENSFGILKEKFPCL-NYM-RLSPQFAGMVVLASTTLHNIALTIDR 277
Q + R +TR +IEN+F +LK +F L +Y+ R P+ V+A+ TLHNI L
Sbjct: 212 QVYFNRMLAKTRVVIENTFALLKGRFRRLKDYLDRDKPEDIIQTVMAACTLHNICLVEGE 271
Query: 278 DNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQD 315
D Q + + +DD V ++ D E R+D
Sbjct: 272 DEVEQYIL---EGRDDDHAIPMVALYNQNDNEAAERRD 306
>gi|326920414|ref|XP_003206469.1| PREDICTED: putative nuclease HARBI1-like [Meleagris gallopavo]
Length = 348
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 156/293 (53%), Gaps = 12/293 (4%)
Query: 17 VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIH 76
+ ++++ + +L+ PT+R+ A+S + +VL+ L + GS +G G S++S+ R +
Sbjct: 46 ICYLVDLLGASLSRPTQRSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQASMSRCVA 105
Query: 77 EVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
V +V+ R + +PE+ + + F G P V G VD T V I APN +
Sbjct: 106 NVTEALVERA-PRFIHFPEDEAAVRSLKDDFYALAGMPGVLGVVDCTHVAIKAPNAEDLS 164
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV 194
+VNR G H+LN + +C + +WPGS+ D V++ + + +F+ E + D
Sbjct: 165 YVNRKGLHSLNCLMVCDARGALLSAETHWPGSMPDCTVLQQAALTSQFETELHK---DGW 221
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----Y 250
+LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F ++ +F CL+
Sbjct: 222 LLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHNIIERTFRTIRSRFRCLDGSKGT 280
Query: 251 MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEE-EAVDD 302
++ SP+ + ++LA LHNI+L D + H + AE++ E+ E++D
Sbjct: 281 LQYSPEKSSHIILACCVLHNISLQHGLDVWSAPAAGHVEPAEEEYEQMESMDS 333
>gi|348536895|ref|XP_003455931.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 346
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 156/318 (49%), Gaps = 43/318 (13%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T F F F L +++E + L T+R+ A+S + +VL+ L + GS +
Sbjct: 36 LTQFGFPRGFIL------YLVELLREGLCRRTQRSRAISPEVQVLAALGFYTSGSFQTSM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVD-----TFFQRMVKWPEENLNDIPMTFLRKGGFPNVA 115
G T G S++S+ R + V +V+ F R + E++ + F R GFP V
Sbjct: 90 GDTIGISQASMSRCVSNVTRALVEKAPQFITFNRDLSTIEQSFRE----FQRVAGFPGVL 145
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +D V I APN + +VN+ G H++ +C + +W G + D V+
Sbjct: 146 GVLDCVQVAIKAPNSEDSSYVNKKGFHSVACQLVCDARGLLLSAETHWAGGLQDTVVLER 205
Query: 176 SMVYRKFD--EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S +Y++ E+GW +LGDS YPL++WL+ P P+ A+ R+ H TR +
Sbjct: 206 SALYKELQDTEQGW-------LLGDSRYPLRKWLMTP-VDCPECPAEFRYNLAHAATREI 257
Query: 234 IENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL-------TIDR----- 277
++ +F ++ +F CL+ Y++ SP+ + ++LA LHN +L T++R
Sbjct: 258 VDRTFRAIQTRFRCLDGTKGYLQYSPERSSSILLACCVLHNASLQSGLDAWTLERTEPLE 317
Query: 278 --DNENQNVEIHDDEAED 293
+ NQ E D +AED
Sbjct: 318 QPKSLNQRPEDRDSQAED 335
>gi|426368148|ref|XP_004051074.1| PREDICTED: putative nuclease HARBI1 [Gorilla gorilla gorilla]
Length = 349
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHVPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|72679318|gb|AAI00117.1| Harbi1 protein [Danio rerio]
Length = 349
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 152/320 (47%), Gaps = 32/320 (10%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+ F F F + +++E + +L T+R+ A+S ++L+ L + GS +
Sbjct: 36 LNTFGFPHEF------IYYLVELLKDSLLRRTQRSRAISPDVQILAALGFYTSGSFQSKM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTF-----FQRMVKWPEENLNDIPMTFLRKGGFPNVA 115
G G S++S+ R + V +++ F R E F R G PNV
Sbjct: 90 GDAIGISQASMSRCVSNVTKALIEKAPEFIGFTR----DEATKQQFKDEFYRIAGIPNVT 145
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G VD + I APN + +VN+ G H++N +C + +WPGS+ D V +
Sbjct: 146 GVVDCAHIAIKAPNADDSSYVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQ 205
Query: 176 SMVYRKF-----DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
S V + F D+EGW +LGD+ YPLK+WL+ P ++P + AD R+ H T
Sbjct: 206 SNVAKLFEEQENDDEGW-------LLGDNRYPLKKWLMTP-VQSPASPADYRYNLAHTTT 257
Query: 231 RRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
+++ +F ++ +F CL+ Y++ SP+ ++ A LHNI+L D
Sbjct: 258 HEIVDRTFRAIQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNISLQSGLDAWTFERTE 317
Query: 287 HDDEAEDDGEEEAVDDPDEI 306
D++ +D + DDP+ +
Sbjct: 318 ATDQSGEDIDPSDTDDPEAL 337
>gi|149725130|ref|XP_001489949.1| PREDICTED: putative nuclease HARBI1 [Equus caballus]
Length = 349
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGTSLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPADESSMQALKDEFYELAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|301614025|ref|XP_002936498.1| PREDICTED: putative nuclease HARBI1 [Xenopus (Silurana) tropicalis]
Length = 357
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 89/293 (30%), Positives = 147/293 (50%), Gaps = 26/293 (8%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ +R S + + E+I + ERN A+ K+LS LH+ S + A G
Sbjct: 49 RSRYRFSTEVILELFEQIQQDIEPNCERNHAVPGMVKLLSALHYFASASFQGTVSALSGI 108
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S+ S R + +V+ I Q +V +P + L I F + FPNV G +D T V
Sbjct: 109 SQPSFSRHLTQVLKAINKLTPQYIV-FPSDKAALKTIKDGFFKISSFPNVMGVIDCTHVA 167
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P E D + N H+LNV +C + + + A +PGS HD+ ++++S ++
Sbjct: 168 LSPPTE--DIYRNSKNFHSLNVQMVCASDMRILNLVAGYPGSTHDSYILKHSSLHSILTS 225
Query: 185 ----EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
EGW ILGD AYPL E+L+ P ++ A++++ HK +IE++FG
Sbjct: 226 GNLPEGW-------ILGDDAYPLTEYLLTP-VKDAKTKAEKQYNAAHKLAHSIIESTFGS 277
Query: 241 LKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
+K +F CL+ ++ SP+ ++LA LHN+A++ N +V+I DD
Sbjct: 278 IKSRFRCLDRSGGVLQYSPEKGAQIILACCILHNLAVS-----RNLHVDILDD 325
>gi|395815624|ref|XP_003781325.1| PREDICTED: putative nuclease HARBI1 [Otolemur garnettii]
Length = 349
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 146/279 (52%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + ++++ + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVDLLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASMQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F+ ++ D+ LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQFEAGMYK---DSWFLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLYSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|27883880|ref|NP_776172.1| putative nuclease HARBI1 [Homo sapiens]
gi|74732341|sp|Q96MB7.1|HARB1_HUMAN RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|16552848|dbj|BAB71391.1| unnamed protein product [Homo sapiens]
gi|22477670|gb|AAH36925.1| Harbinger transposase derived 1 [Homo sapiens]
gi|119588398|gb|EAW67992.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
gi|119588399|gb|EAW67993.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
gi|119588400|gb|EAW67994.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
Length = 349
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|410220586|gb|JAA07512.1| harbinger transposase derived 1 [Pan troglodytes]
Length = 349
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLQTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|114637368|ref|XP_521904.2| PREDICTED: putative nuclease HARBI1 [Pan troglodytes]
gi|410248774|gb|JAA12354.1| harbinger transposase derived 1 [Pan troglodytes]
gi|410306074|gb|JAA31637.1| harbinger transposase derived 1 [Pan troglodytes]
gi|410333521|gb|JAA35707.1| harbinger transposase derived 1 [Pan troglodytes]
Length = 349
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLQTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|301623960|ref|XP_002941281.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 463
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 135/275 (49%), Gaps = 11/275 (4%)
Query: 4 FAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
F +RL+ + + E + PAL T R+ A+ K+L LH+ GS +
Sbjct: 118 FGEVRQYRLNKAAITSLYELLEPALEPRTRRSRAVPGLVKLLCSLHFFVSGSFQRVGEVY 177
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGT 121
G S+ + R + V+ I + ++ +P+ N + F G PNV +D T
Sbjct: 178 GGLSQPTFSRYLGHVLDTI-RSVSMNLISFPQHRNEWNTVKRDFYGVSGIPNVLAAIDCT 236
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V ++ P + E F NR G H+LNV +C ++ + +PGS HDA +R S +Y+
Sbjct: 237 HVALNPPQDREHIFRNRKGYHSLNVQVVCDGRMNILSIVSGFPGSSHDAYTLRQSGLYQS 296
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
F E G P +LGD+ YP WLI P R P + A++ F + H R R +IE +FG+L
Sbjct: 297 F-ETG--QMPHGWLLGDAGYPCCRWLITP-IRRPCSQAERAFNQAHVRARSVIEQTFGVL 352
Query: 242 KEKFPCLN----YMRLSPQFAGMVVLASTTLHNIA 272
K +F CL+ + SP ++ A LHN+A
Sbjct: 353 KSRFRCLDKSGGSLMYSPTKVANIIGACAVLHNLA 387
>gi|160773871|gb|AAI55411.1| LOC100127800 protein [Xenopus (Silurana) tropicalis]
Length = 352
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 89/293 (30%), Positives = 147/293 (50%), Gaps = 26/293 (8%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ +R S + + E+I + ERN A+ K+LS LH+ S + A G
Sbjct: 44 RSRYRFSTEVILELFEQIQQDIEPNCERNHAVPGMVKLLSALHYFASASFQGTVSALSGI 103
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S+ S R + +V+ I Q +V +P + L I F + FPNV G +D T V
Sbjct: 104 SQPSFSRHLTQVLKAINKLTPQYIV-FPSDKAALKTIKDGFFKISSFPNVMGVIDCTHVA 162
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P E D + N H+LNV +C + + + A +PGS HD+ ++++S ++
Sbjct: 163 LSPPTE--DIYRNSKNFHSLNVQMVCASDMRILNLVAGYPGSTHDSYILKHSSLHSILTS 220
Query: 185 ----EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
EGW ILGD AYPL E+L+ P ++ A++++ HK +IE++FG
Sbjct: 221 GNLPEGW-------ILGDDAYPLTEYLLTP-VKDAKTKAEKQYNAAHKLAHSIIESTFGS 272
Query: 241 LKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
+K +F CL+ ++ SP+ ++LA LHN+A++ N +V+I DD
Sbjct: 273 IKSRFRCLDRSGGVLQYSPEKGAQIILACCILHNLAVS-----RNLHVDILDD 320
>gi|403254691|ref|XP_003920094.1| PREDICTED: putative nuclease HARBI1 [Saimiri boliviensis
boliviensis]
Length = 349
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 144/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDSA+ L WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSAFFLHTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 QTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|355694124|gb|AER99563.1| harbinger transposase derived 1 [Mustela putorius furo]
Length = 339
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 27 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 80
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 81 GDAIGISQASMSRCVANVTEALVERATQ-FIRFPSDETSVQALKDEFYGLAGMPGVIGVV 139
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 140 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 199
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 200 SSHF-EAGIH--KDSWLLGDSSFSLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 255
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 256 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 294
>gi|224051095|ref|XP_002200142.1| PREDICTED: putative nuclease HARBI1 [Taeniopygia guttata]
Length = 348
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 154/293 (52%), Gaps = 12/293 (4%)
Query: 17 VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIH 76
+ ++++ + L+ PT+R+ A++ + ++L+ L + GS +G G +++S+ R +
Sbjct: 46 ICYLVDLLGATLSRPTQRSRAITPETQILAALGFYTSGSFQTRMGDAIGITQASMSRCVA 105
Query: 77 EVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
V +V+ Q + +P E L + F G P V G VD T V I APN +
Sbjct: 106 NVTEALVERASQ-FIHFPKDEATLQTLKDDFYGLAGMPGVLGVVDCTHVAIKAPNAEDLS 164
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV 194
+VNR G H+LN + +C + +WPGS+ D V+ + + +F+ E + D
Sbjct: 165 YVNRKGLHSLNCLMVCDSRGVLLSAETHWPGSLPDCTVLEQAALTSQFETELHK---DGW 221
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----Y 250
+LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F ++ +F CL+
Sbjct: 222 LLGDSSFLLRTWLMTP-LHIPETPAEYRYNMAHSATHNIIERTFRTIRSRFRCLDGSKGT 280
Query: 251 MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEE-EAVDD 302
++ SP+ + ++LA LHNI+L D + H +++E++ E+ E+VD
Sbjct: 281 LQYSPEKSSHIILACCVLHNISLQHGLDVWSAPAAGHMEQSEEEYEQMESVDS 333
>gi|296218013|ref|XP_002755264.1| PREDICTED: putative nuclease HARBI1 [Callithrix jacchus]
Length = 349
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 144/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDSA+ L WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSAFFLHTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|449274628|gb|EMC83706.1| Putative nuclease HARBI1 [Columba livia]
Length = 348
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 156/293 (53%), Gaps = 12/293 (4%)
Query: 17 VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIH 76
+ ++++ + +L+ PT+R+ A+S + ++L+ L + GS +G G S++S+ R +
Sbjct: 46 IYYLVDLLGASLSHPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVT 105
Query: 77 EVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
V +V+ Q + +PE+ + + F G P V G VD T V I APN +
Sbjct: 106 NVTEALVERASQ-FIHFPEDEATVQSLKDDFYGLAGMPGVLGVVDCTHVAIKAPNAEDLS 164
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV 194
+VNR G H+LN + +C + +WPGS+ D V++ + + +F+ E + D
Sbjct: 165 YVNRKGLHSLNCLMVCDARGVLLSAETHWPGSLPDCTVLQQAALTSQFETELHK---DGW 221
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----Y 250
+LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F ++ +F CL+
Sbjct: 222 LLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHNVIERTFRTIRSRFRCLDGSKGT 280
Query: 251 MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEE-EAVDD 302
++ SP+ + ++LA LHNI+L D + H ++ E++ E+ E++D
Sbjct: 281 LQYSPKKSSHIILACCVLHNISLEHGLDVWSSPATGHMEQPEEEYEQMESMDS 333
>gi|397488418|ref|XP_003815263.1| PREDICTED: putative nuclease HARBI1 [Pan paniscus]
Length = 349
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLEDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLQTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|332259854|ref|XP_003278999.1| PREDICTED: putative nuclease HARBI1 [Nomascus leucogenys]
Length = 349
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASVQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLQTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|402893678|ref|XP_003910018.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Papio
anubis]
Length = 349
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|311247913|ref|XP_003122875.1| PREDICTED: putative nuclease HARBI1 [Sus scrofa]
Length = 349
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 145/283 (51%), Gaps = 25/283 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGSSLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVTNVTEALVERASQ-FIRFPADEASVQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCVVLQQSSL 208
Query: 179 YRKFD----EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+F+ +E W +LGDS++ L+ WL+ P P+ A+ R+ H T +I
Sbjct: 209 SSQFEAGMHKESW-------LLGDSSFFLRSWLMTP-LHIPETPAEYRYNMAHSATHSVI 260
Query: 235 ENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
E +F L +F CL+ ++ SP+ ++LA LHNI+L
Sbjct: 261 EKTFRTLCSRFRCLDGSKGALQYSPEKCSHIILACCVLHNISL 303
>gi|395543725|ref|XP_003773764.1| PREDICTED: putative nuclease HARBI1 [Sarcophilus harrisii]
Length = 348
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 157/312 (50%), Gaps = 18/312 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M + F F + ++++ + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MAMYGFPRQF------IYYLVDLLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G T G S++S+ R + V +V+ Q + +P E +L + F G P V G V
Sbjct: 90 GDTIGISQASMSRCVANVTEALVERASQ-FIHFPADETSLQSLKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVGIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALLNVETHWPGSLQDWAVLQQSAL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G +LGDSA+ L+ WL+ P P A+ R+ H T +IE +F
Sbjct: 209 RSQF-EAGMH--KGCWLLGDSAFFLRTWLMTP-LHIPGTPAEYRYNMAHSATHNVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
++ +F CL+ ++ SP+ A ++LA LHNI+L D + V H ++ E++
Sbjct: 265 RTIQSRFRCLDGSKGALQYSPEKASHIILACCVLHNISLEHGMDVWSSPVTGHLEQPEEE 324
Query: 295 GEE-EAVDDPDE 305
E E +D E
Sbjct: 325 HERMETLDSEAE 336
>gi|301608541|ref|XP_002933847.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 386
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 150/310 (48%), Gaps = 26/310 (8%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL T + + ++I L T+R++ + K+L LH++ GS I A G S+
Sbjct: 54 YRLGRTAICELYDKIKHYLEPTTKRSLPIPGISKLLGTLHFLASGSFQTTISACTGFSQP 113
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S + + V+ I + ++K+ + NL + F PNV G +D T V +
Sbjct: 114 SFSKHLKFVLAGIC-SLAAELLKYDMDPANLAKMKSDFYSIAQMPNVIGAIDCTHVALIP 172
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE--- 184
P + E + NR G H++NV +C + + V + +PGS HD+ + RNS ++ +
Sbjct: 173 PADKERFYYNRKGFHSINVQVVCDASCRILDVVSKFPGSTHDSFIFRNSHLHERLQSGEA 232
Query: 185 -EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
GW +LGDS Y +K WLI P NP +++ F HK TR +IE +FGILK
Sbjct: 233 GSGW-------LLGDSGYSVKPWLITP-LLNPQNESEENFNSSHKATRCIIERTFGILKS 284
Query: 244 KFPCLNY----MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEA 299
+F CL+ + P+ ++ LHN AL +N+EI D E+
Sbjct: 285 RFRCLDKTGGALMYKPEKVCQIIFCCCILHNFALL-----HKENMEIAQDLTPQS--EDV 337
Query: 300 VDDPDEIDEE 309
++ EI++E
Sbjct: 338 IEPNSEINQE 347
>gi|193666845|ref|XP_001944531.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 342
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 143/275 (52%), Gaps = 17/275 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK+ FRLS V I+++I + T+RN A+S + +VL L + +G+ Y +G G
Sbjct: 45 FKKRFRLSKNAVVHIIDQIRDKIAHRTDRNNAVSPENQVLLTLRFYALGTIYQAVGDFVG 104
Query: 66 PSKSSICRSI----HEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVD 119
S ++ R I HE+ + D +++ P +E +N + F FP V G +D
Sbjct: 105 VSTATAGRIIPRVTHEIAKMRKD-----IIRMPSTQEEINQAKLNFFSIAKFPRVIGVID 159
Query: 120 GTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
T V I +P ++ + F NR G ++NV + G + K + A WPGS HD+ + NS +
Sbjct: 160 CTHVKILSPGGDNAEIFRNRKGYFSINVQIVAGADLKIKNIVARWPGSAHDSNIFSNSRI 219
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F++ F DA++LGDS Y + +++ P +Q + RTR +E F
Sbjct: 220 RAQFEDN---MFGDALLLGDSGYGVSNYMMTI-LNEPRTEGEQLYNESLIRTRNTVERLF 275
Query: 239 GILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
G+LK +FP ++ ++ S + +++A LHNIA
Sbjct: 276 GVLKRRFPIISLGIKSSLELTQGIIVACAVLHNIA 310
>gi|348543355|ref|XP_003459149.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 295
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 15/230 (6%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
L + GS + IG SK+++CR++ V V + V +P P F++
Sbjct: 32 LRFFANGSFLYNIGDAEHVSKATVCRAVRNVT-VALKRLLYSFVVFPGHR----PTRFIK 86
Query: 108 KG-----GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNAN 162
+G GFP V GC+DGT + I AP+ +E +VNR H++NV IC V A
Sbjct: 87 EGFHKIAGFPGVIGCIDGTHIPIIAPSVNEGDYVNRKSFHSINVQIICDAANIITNVEAK 146
Query: 163 WPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR 222
WPGSVHD+R+ R + KF + + + +LGD YP +L+ P +P QR
Sbjct: 147 WPGSVHDSRIFRECTLSTKFGQGEFTGY----LLGDRGYPCLPYLLTPYP-DPXXXPQQR 201
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
+ H RTR +E + G+LK +F CL +R++P+ A +++A LHNIA
Sbjct: 202 YNLAHCRTRARVEMTIGMLKARFQCLQRLRVTPERACDIIVACVILHNIA 251
>gi|387849125|ref|NP_001248435.1| putative nuclease HARBI1 [Macaca mulatta]
gi|355566570|gb|EHH22949.1| Putative nuclease HARBI1 [Macaca mulatta]
gi|355752181|gb|EHH56301.1| Putative nuclease HARBI1 [Macaca fascicularis]
gi|380786657|gb|AFE65204.1| putative nuclease HARBI1 [Macaca mulatta]
gi|383410017|gb|AFH28222.1| putative nuclease HARBI1 [Macaca mulatta]
Length = 349
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 GSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|410973651|ref|XP_003993261.1| PREDICTED: putative nuclease HARBI1 [Felis catus]
Length = 349
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 146/279 (52%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERATQ-FIRFPADEASMQAVKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGD+++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDNSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|73983355|ref|XP_540753.2| PREDICTED: putative nuclease HARBI1 [Canis lupus familiaris]
Length = 349
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERATQ-FIRFPADEASMQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 NSHF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|297688817|ref|XP_002821868.1| PREDICTED: putative nuclease HARBI1 [Pongo abelii]
Length = 349
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + +VL+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASIQALKDEFYGLAGMPGVMGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ ++GDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLVGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|301607685|ref|XP_002933439.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 425
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 150/310 (48%), Gaps = 26/310 (8%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL T + + ++I L T+R++ + K+L LH++ GS I A G S+
Sbjct: 54 YRLGRTAICELYDKIKHYLEPTTKRSLPIPGISKLLGTLHFLASGSFQTTISACTGYSQP 113
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S + + V+ I + ++K+ + NL + F PNV G +D T V +
Sbjct: 114 SFSKHLKFVLAGIC-SLAPELLKYDMDPANLAKMKSDFYSIAQMPNVIGAIDCTHVALIP 172
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE--- 184
P + E + NR G H++NV +C + + V + +PGS HD+ + RNS ++ +
Sbjct: 173 PADKERFYYNRKGFHSINVQVVCDASCRILDVVSKFPGSTHDSFIFRNSHLHERLQSGEA 232
Query: 185 -EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
GW +LGDS Y +K WLI P NP +++ F HK TR +IE +FGILK
Sbjct: 233 GSGW-------LLGDSGYSVKPWLITP-LLNPQNESEENFNSSHKATRCIIERTFGILKS 284
Query: 244 KFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEA 299
+F CL+ + P+ ++ LHN AL +N+EI D E+
Sbjct: 285 RFRCLDKTGGALMYKPEKVCQIIFCCCILHNFALL-----HKENMEIAQDLTPQ--SEDV 337
Query: 300 VDDPDEIDEE 309
++ EI++E
Sbjct: 338 IEPNSEINQE 347
>gi|308321290|gb|ADO27797.1| putative nuclease harbi1 [Ictalurus furcatus]
Length = 348
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 148/318 (46%), Gaps = 37/318 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T F F F + +++E + L T+R+ A+S ++L+ L + GS +
Sbjct: 36 LTTFGFPREF------IYYLVELLKDGLLRRTQRSRAISPDVQILAALGFYTSGSFQSKM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTF-----FQRMVKWPEENLNDIPMTFLRKGGFPNVA 115
G G S++S+ R + V +++ F R E F R G PNV
Sbjct: 90 GDAIGVSQASMSRCVSNVTKALIEKAPEFIGFNR----DEATKQQTKEEFYRVAGIPNVI 145
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G VD + I APN + +VN+ G H++N +C +V +WPGS+ D V
Sbjct: 146 GVVDCAHIAIKAPNAEDSSYVNKKGFHSINCQLVCDARGLLLSVETHWPGSLTDRAVFNQ 205
Query: 176 SMVYRKFDE----EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
S V + F+E EGW +LGD+ YPL++WL+ P NP+ A R+ H T
Sbjct: 206 SSVCKLFEEQVTHEGW-------LLGDNRYPLRKWLMTP-VPNPETPAGYRYNLAHMTTH 257
Query: 232 RLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
+++ +F L+ +F CL+ Y++ SP+ ++ A LHN++L D
Sbjct: 258 EIVDRTFRALQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNVSLQSGLDAWT----FE 313
Query: 288 DDEAEDDGEEEAVDDPDE 305
EA D EE +DP E
Sbjct: 314 RTEATDQSEES--NDPTE 329
>gi|26348669|dbj|BAC37974.1| unnamed protein product [Mus musculus]
Length = 349
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 157/318 (49%), Gaps = 20/318 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + F++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYFLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPVDEAAVQSLKDEFYGLAGMPGVIGVA 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E P D+ +LGDS++ L+ WL+ P P+ AA+ R+ R H T +IE +
Sbjct: 209 TSQF--ETGMP-KDSWLLGDSSFFLRSWLLTPLP-IPETAAEYRYNRAHSATHSVIERTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD---EA 291
L +F CL+ ++ SP+ ++LA LHNI+L D + V D E
Sbjct: 265 QTLCCRFQCLDGSKGALQYSPEKCSHIILACCVLHNISLDHGMDVWSSPVPGPIDQPPEG 324
Query: 292 EDDGEEEAVDDPDEIDEE 309
ED+ E + D I +E
Sbjct: 325 EDEHMESLDLEADRIRQE 342
>gi|126332706|ref|XP_001369586.1| PREDICTED: putative nuclease HARBI1 [Monodelphis domestica]
Length = 348
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 162/317 (51%), Gaps = 19/317 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M + F F + ++++ + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MAMYGFPRQF------IYYLVDLLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G T G S++S+ R + V +V+ Q + +P E ++ ++ F G P V G V
Sbjct: 90 GDTIGISQASMSRCVANVTEALVERASQ-FIHFPTDETSIQNLKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVGIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGSLLNVETNWPGSLQDWAVVQQSAL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D +LGD ++ L+ WL+ P P A+ R+ H T +IE +F
Sbjct: 209 RSQF-EAGMH--KDCWLLGDGSFFLRTWLMTP-LHIPGTPAEYRYNMAHSATHNVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
++ +F CL+ ++ SP+ + ++LA LHNI+L D + H ++ E++
Sbjct: 265 RTIQSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHGMDVWSSPGTGHMEQPEEE 324
Query: 295 GEE-EAVD-DPDEIDEE 309
E EA+D + D + +E
Sbjct: 325 CEHMEALDSEADRVRQE 341
>gi|154759331|ref|NP_848839.2| putative nuclease HARBI1 [Mus musculus]
gi|81913518|sp|Q8BR93.1|HARB1_MOUSE RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|26337267|dbj|BAC32318.1| unnamed protein product [Mus musculus]
gi|62871735|gb|AAH94315.1| Harbi1 protein [Mus musculus]
gi|74151001|dbj|BAE27631.1| unnamed protein product [Mus musculus]
gi|148695628|gb|EDL27575.1| RIKEN cDNA D230010M03 [Mus musculus]
Length = 349
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 157/318 (49%), Gaps = 20/318 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + F++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYFLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPVDEAAVQSLKDEFYGLAGMPGVIGVA 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E P D+ +LGDS++ L+ WL+ P P+ AA+ R+ R H T +IE +
Sbjct: 209 TSQF--ETGMP-KDSWLLGDSSFFLRSWLLTPLP-IPETAAEYRYNRAHSATHSVIERTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD---EA 291
L +F CL+ ++ SP+ ++LA LHNI+L D + V D E
Sbjct: 265 QTLCCRFRCLDGSKGALQYSPEKCSHIILACCVLHNISLDHGMDVWSSPVPGPIDQPPEG 324
Query: 292 EDDGEEEAVDDPDEIDEE 309
ED+ E + D I +E
Sbjct: 325 EDEHMESLDLEADRIRQE 342
>gi|327259629|ref|XP_003214638.1| PREDICTED: putative nuclease HARBI1-like [Anolis carolinensis]
Length = 348
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 143/267 (53%), Gaps = 19/267 (7%)
Query: 17 VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIH 76
+ ++++ + L+ PT+R+ A+S + ++L+ L + GS +G G S++S+ R +
Sbjct: 46 IYYLVDLLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVA 105
Query: 77 EVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
V +V+ Q + +PE++ + + F G P V G VD T V I APN +
Sbjct: 106 NVTEALVERSSQ-FIHFPEDDASVQQLKDDFYTLAGMPGVLGLVDCTHVAIKAPNAEDLS 164
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD----EEGWRPF 190
+VNR G H+LN + +C + +WPGS+ D V++ S + +F+ ++GW
Sbjct: 165 YVNRKGLHSLNCLMVCDARGLLLSAETHWPGSLQDCTVLQQSGLRNQFEAGMHKDGW--- 221
Query: 191 PDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN- 249
+LGD+++ L+ WL+ P P+ A+ R+ H T +IE +F ++ +F CL+
Sbjct: 222 ----LLGDNSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHNIIEQTFWSIRSRFRCLDG 276
Query: 250 ---YMRLSPQFAGMVVLASTTLHNIAL 273
++ SP+ A ++LA LHNIAL
Sbjct: 277 SKGTLQYSPEKASHIILACCVLHNIAL 303
>gi|449683707|ref|XP_004210434.1| PREDICTED: uncharacterized protein LOC101239557 [Hydra
magnipapillata]
Length = 264
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 120/210 (57%), Gaps = 2/210 (0%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
++ ++ R+ ++++ ++ L T+RN LS +E++L ++G + YH++
Sbjct: 39 VSDLVYRHQNRVPRAVIDYLELRLSNILQHQTKRNHPLSPREQILLFSQFLGTSAFYHLL 98
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
G +I ++H V I + +++KWP++ + + F + GGF VAG +DG
Sbjct: 99 RDARGVDTKTIHSTVHRVANSI-QSLKNKVIKWPDD-CSRLVDEFFKLGGFSCVAGAIDG 156
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
T+V + P E FV+R +HA++++A+ GPN FY VN N P HD+R+++ S ++
Sbjct: 157 TLVQVMPPKIDEALFVDRYQNHAISILAVAGPNMAFYYVNTNNPSRCHDSRIVKESKLWE 216
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ G RPF AV +G+S Y L++WLI P
Sbjct: 217 SWETNGLRPFQGAVNIGNSGYSLRDWLITP 246
>gi|432090387|gb|ELK23813.1| hypothetical protein MDA_GLEAN10019053 [Myotis davidii]
Length = 349
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 147/279 (52%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q ++++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-IIRFPADEASIQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P + + A+ R+ H T +IE +F
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRAWLMTPLHIH-ETPAEYRYNVAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|291384923|ref|XP_002709124.1| PREDICTED: harbinger transposase derived 1 [Oryctolagus cuniculus]
Length = 349
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 144/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGTSLSRPTQRSRAISPETQILAALGFYTSGSFQTRL 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E + + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIRFPADEASTQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D+ V++ S +
Sbjct: 149 DCLHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLRDSAVLQQSAL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
R E G + +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 SRHL-EAGLH--KGSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|427798307|gb|JAA64605.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 346
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/293 (31%), Positives = 140/293 (47%), Gaps = 32/293 (10%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T F++ FRLS V + +E+ P + +R LS + KVL L + GS +
Sbjct: 34 LTEEEFRQCFRLSKRTVRSLCDELEPIIGC--QRASGLSTERKVLCALRFFATGSFQRSV 91
Query: 61 GATH--GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT----------FLRK 108
G G ++S++ +IHEV I+ +R L D +T F R+
Sbjct: 92 GREEHIGMAQSAVSNTIHEVTEAIITVSARR-------KLVDFSLTPAAKEEAKAAFARR 144
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHE----DQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
G P V CVDGT+V I P F++R G +ALNVM +C + V+ +P
Sbjct: 145 GDIPGVLACVDGTLVAIRKPEGFNLPDTASFMSRKGYYALNVMIVCNAQLRILVVDPRFP 204
Query: 165 GSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN-PDAAADQRF 223
GS HD+ V ++ + + + P +LGDS YPL+ WL+ P + P + R+
Sbjct: 205 GSCHDSWVWEHNPLRGRLAAQLQ---PGEYVLGDSGYPLEPWLLTPVLGSPPRNTPEGRY 261
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMR---LSPQFAGMVVLASTTLHNIAL 273
R H R ++E G+LK KF CL + R +P A ++ A LHNIAL
Sbjct: 262 NREHASMRNVVERCIGVLKSKFRCLQHFRTMLYNPDRAARIIYACVALHNIAL 314
>gi|328697693|ref|XP_001952791.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 616
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 159/328 (48%), Gaps = 16/328 (4%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
FRLS V F+L I L PT RN +++ ++L L + G I G S
Sbjct: 296 FRLSKDAVLFLLNWIEKDLEYPTNRNQSIAPVNQILCFLRFCATGKHLISIADFMGCHVS 355
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
++ R ++ V VI + + P+ ++ D F FP V G +D T V + +
Sbjct: 356 TVSRIVYRVARVI-SIRCKNFILMPKTDIEMADTTKEFYNIAKFPKVIGVIDCTHVKVQS 414
Query: 128 PNEHEDQ-FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
+ + + NR G ++NV +C N V A WPGSVHD + NS ++ +F+ E
Sbjct: 415 LGGGDAEIYRNRKGYFSMNVQCVCRANLTISNVVARWPGSVHDLTIFNNSRLHTEFETEV 474
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
+R + +ILGDS YPL +L+ P +P +Q + H RTR IE FG+L +FP
Sbjct: 475 YRNY---IILGDSGYPLLNYLMTP-LLHPATVNEQLYNESHIRTRNCIERCFGVLNRRFP 530
Query: 247 CLNY-MRLSPQFAGMVVLAST-TLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
L Y +RL A M V+ ST LHNIA+ + D ++++ D + + +++
Sbjct: 531 ILAYGIRLQNIDAIMAVITSTCILHNIAILFN-DQVQDSIDLEDPQHLN----LIINNGH 585
Query: 305 EIDEEEQPRQDRNNPGTQRLARLL-NYF 331
+I+ + + G ++ L+ NYF
Sbjct: 586 QINGRDNRQHGDEVVGLNQITNLINNYF 613
>gi|426245379|ref|XP_004016489.1| PREDICTED: putative nuclease HARBI1 [Ovis aries]
Length = 349
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 145/283 (51%), Gaps = 25/283 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPGDEASVQALKDEFYGLAGIPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCMHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQSSL 208
Query: 179 YRKFD----EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+F+ +E W +LGDS++ L+ WL+ P P+ A+ R+ H T +I
Sbjct: 209 SSQFEAGMHKESW-------LLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVI 260
Query: 235 ENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
E +F L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 261 EKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|193610737|ref|XP_001943491.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 380
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 157/344 (45%), Gaps = 27/344 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F +RL+ V +L+ + P + + N LS + K+L+ L + GS IG G
Sbjct: 47 FIGTYRLNKILVRELLDILTPFMVSSVT-NAGLSIQRKLLTALRFFASGSYQQDIGENRG 105
Query: 66 P--SKSSICRSIHEVV-----PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCV 118
S+ S+ R I EVV P I++ + E LND+ + F K G P V G +
Sbjct: 106 SAVSQPSVSRCITEVVNAFNRPEILNKYIHFPSSLGE--LNDVRLGFYEKFGIPGVIGVI 163
Query: 119 DGTMVLIDAPNEH-----EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
DGT + I P E ++NR G H++N IC N K V A +PGS HD+ +
Sbjct: 164 DGTHIAIVPPKSEDIIYPEHVYINRKGYHSINTQLICDSNMKILNVCAKFPGSTHDSHIW 223
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
R S V +LGDS Y L+ WL+ P T + R+ R R L
Sbjct: 224 RVSPVLGLLKHLHSIGHSSYFLLGDSGYGLRPWLLTPLTEYQPNTPEARYNTWLCRARSL 283
Query: 234 IENSFGILKEKFPCLNYMRL---SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
IE G+LK +F CL R+ +P+ A ++ A T LHNI + N+ I D++
Sbjct: 284 IERCNGVLKMRFRCLLKHRVLHYAPEKASSIINACTVLHNICI-------GNNLPIIDND 336
Query: 291 AEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLL--NYFR 332
E + + + + I+E R + +R+ + + +YF+
Sbjct: 337 GEFEENDYGILNNTAINEPNATRANTELIAGKRMQQQIINSYFK 380
>gi|115496966|ref|NP_001069136.1| putative nuclease HARBI1 [Bos taurus]
gi|119368340|sp|Q17QR8.1|HARB1_BOVIN RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|109659164|gb|AAI18218.1| Harbinger transposase derived 1 [Bos taurus]
gi|296479654|tpg|DAA21769.1| TPA: putative nuclease HARBI1 [Bos taurus]
gi|440903445|gb|ELR54100.1| Putative nuclease HARBI1 [Bos grunniens mutus]
Length = 349
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 145/283 (51%), Gaps = 25/283 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPADEASVQALKDEFYGLAGIPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCMHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQSSL 208
Query: 179 YRKFD----EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+F+ +E W +LGDS++ L+ WL+ P P+ A+ R+ H T +I
Sbjct: 209 SSQFEAGMHKESW-------LLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVI 260
Query: 235 ENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
E +F L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 261 EKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|166795899|ref|NP_001107265.2| putative nuclease HARBI1 [Rattus norvegicus]
gi|205831681|sp|B0BN95.1|HARB1_RAT RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|165971427|gb|AAI58735.1| Harbi protein [Rattus norvegicus]
Length = 349
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 159/318 (50%), Gaps = 20/318 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPADEAAIQSLKDEFYGLAGMPGVIGAV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E P D+ +LGDS++ L WL+ P P+ A+ R+ R H T +IE +
Sbjct: 209 SSQF--ETGMP-KDSWLLGDSSFFLHTWLLTP-LHIPETPAEYRYNRAHSATHSVIEKTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVE--IHDDEAE 292
L +F CL+ ++ SP+ + ++LA LHNI+L D + V I
Sbjct: 265 RTLCCRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHGMDVWSSPVTGPIEQPPEG 324
Query: 293 DDGEEEAVD-DPDEIDEE 309
+D + E++D + D I +E
Sbjct: 325 EDEQMESLDLEADRIRQE 342
>gi|301632320|ref|XP_002945236.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 349
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 133/269 (49%), Gaps = 11/269 (4%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ + + E P L T R+ + K+ LH+ GS + G G S+
Sbjct: 56 YRLNKAAIYTLYELFEPGLEPRTRRSRGIRGIVKLPCSLHFFASGSFQRVGGVYGGVSQP 115
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ + + +V+ I + + + +P+ + F R G PNV G +D T V ++
Sbjct: 116 TFSQCLGQVLDAI-RSVSRTFISFPQNQNEWGTVKRDFYRVSGIPNVLGAIDCTHVALNP 174
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P + E + NR G H++N+ +C ++ + +PGS HDA ++R S +Y+ F+
Sbjct: 175 PQDREHVYRNRKGYHSINIQVVCDAKMNILSIVSGFPGSSHDAYILRQSGLYQAFETG-- 232
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
P +LGD+ YP WLI P R P + A+ F + H RTR +IE +FG+LK +F C
Sbjct: 233 -QMPHGWLLGDAGYPCGRWLITPIHR-PRSQAECAFNQSHVRTRSVIERTFGVLKSRFRC 290
Query: 248 LN----YMRLSPQFAGMVVLASTTLHNIA 272
L+ + SP ++ A L+N+A
Sbjct: 291 LDRSGGSLLYSPTKVANIIGACAVLNNLA 319
>gi|301772494|ref|XP_002921667.1| PREDICTED: putative nuclease HARBI1-like [Ailuropoda melanoleuca]
gi|281340474|gb|EFB16058.1| hypothetical protein PANDA_010579 [Ailuropoda melanoleuca]
Length = 348
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 144/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGANLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERATQ-FIRFPADEASVQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDYAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
F E G D+ +LGDS++ L+ WL+ P P+ A+ R+ H T +IE +F
Sbjct: 209 NSHF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHIPETPAEYRYNMAHSATHSVIEKTF 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|354469884|ref|XP_003497342.1| PREDICTED: putative nuclease HARBI1-like [Cricetulus griseus]
gi|344247825|gb|EGW03929.1| Putative nuclease HARBI1 [Cricetulus griseus]
Length = 349
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 143/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVAHVTEALVERASQ-FIHFPADEAAVQSLKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLIVCDIRGALMTVETSWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P + A+ R+ R H T +IE +
Sbjct: 209 SSQF-ETGMH--KDSWLLGDSSFFLRTWLMTP-LHIAETPAEYRYNRAHSATHSVIEKTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++LA LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISL 303
>gi|390360974|ref|XP_794543.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 234
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 114/206 (55%), Gaps = 12/206 (5%)
Query: 90 MVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNV 146
++K+PE E + + F G PNV G VDGT V L AP + E + NR G H++NV
Sbjct: 10 VIKFPETMEEVRTAQVDFFNLAGMPNVVGAVDGTHVELHGAPLDDEYIYTNRKGKHSINV 69
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEW 206
IC YK V A WPGS HD+RV+RNS + +F +G P +++GDS YPL+ W
Sbjct: 70 QLICNARYKITNVCARWPGSTHDSRVLRNSRIGERF-ADGELP---GILVGDSGYPLQPW 125
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL--NYMRLSPQFAGMVVLA 264
LI P R+P A++ + R H RTR IE G LK KF CL +++S A +++A
Sbjct: 126 LITP-LRDPQGNAERNYNRAHCRTRVTIEQLNGQLKNKFRCLMGQGIQMSAPRACDIIIA 184
Query: 265 STTLHNIALTIDRDNENQNVEIHDDE 290
L NIA D Q +E+ DE
Sbjct: 185 CAVLFNIAK--DLKEPEQAIEVEPDE 208
>gi|301610538|ref|XP_002934801.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 328
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 138/272 (50%), Gaps = 15/272 (5%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ + + E + P L T R+ A+ K+L LH+ GS + G G S+
Sbjct: 56 YRLNKAAIYTLYELLEPGLEPRTRRSRAIPGMVKLLCSLHFFASGSFQRVGGVYGGVSQP 115
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEENLND---IPMTFLRKGGFPNVAGCVDGTMVLID 126
+ R + +V+ I + + + +P+ N N+ + F R G PNV G +D T V ++
Sbjct: 116 TFSRCLSQVLDAI-RSASRTFISFPQ-NRNEWGTVKRDFYRVSGIPNVLGEIDCTHVALN 173
Query: 127 AP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
P + E + NR G H++NV +C ++ + +PGS H+A ++R S +Y+ F+
Sbjct: 174 PPPPQDREHVYRNRKGYHSINVQVVCDAKMNILSIVSGFPGSSHNAYILRQSGLYQTFET 233
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
P +LGD+ YP WLI P R P + A+ F + H RT +IE +FG+LK +
Sbjct: 234 G---QMPHGWLLGDAGYPCGRWLITPIHR-PRSRAECDFNQAHVRTLSVIERTFGVLKSR 289
Query: 245 FPCLN----YMRLSPQFAGMVVLASTTLHNIA 272
F CL+ + SP ++ A LHN+A
Sbjct: 290 FRCLDRSGGSLLYSPTKVANIIGACAVLHNLA 321
>gi|328700311|ref|XP_003241214.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 287
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 146/298 (48%), Gaps = 16/298 (5%)
Query: 36 VALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE 95
+AL+ +VL L + +G+ + G S SS R+I V I F +K P
Sbjct: 1 MALAPINQVLLTLRFYALGTMLISVADMFGVSVSSASRTIKNVSYAIAG-FSGLFLKIPT 59
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNY 154
+L + M + FP V G +D T V I +P E + F NR G +LNV A+ +
Sbjct: 60 NDLVETKMKMFKIARFPLVFGAIDCTHVRIQSPGGEFAESFRNRKGYFSLNVQALVNSDL 119
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
KF + A WPGS HD+ + RNS +Y + E G F + ILGDS Y LK +++ P N
Sbjct: 120 KFMDIVAIWPGSAHDSNIFRNSRLYARL-ESG--EFNNNAILGDSGYALKPYMLTP-ILN 175
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
P + + RTR +IE FG+ K +FP L+ MRL + +++A+ LHNI
Sbjct: 176 PVGRIEMLYNESQIRTRNIIERCFGVWKRRFPVLSLGMRLQLKTVQAIIVATAILHNIC- 234
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYF 331
+ N ++ +D ++D + +++ +++ E + D R + L NYF
Sbjct: 235 ------RDMNEDLPEDSSDDVLHQ--LNEAEDMTETDHRHDDNGEDSITRNSLLNNYF 284
>gi|348558866|ref|XP_003465237.1| PREDICTED: putative nuclease HARBI1-like [Cavia porcellus]
Length = 350
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 144/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + P+L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGPSLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +++ Q + +P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALMERATQ-FIHFPADEASVQALKDEFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G D+ +LGDS++ L+ WL+ P P A+ R+ H T +++ +
Sbjct: 209 SSQF-EAGMH--KDSWLLGDSSFFLRTWLMTPLPL-PKTPAEDRYNLAHSATHSVMQKTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + +++A LHNI+L
Sbjct: 265 RTLCSRFRCLDGSKGALQYSPEKSSHIIMACCVLHNISL 303
>gi|351697362|gb|EHB00281.1| Putative nuclease HARBI1 [Heterocephalus glaber]
Length = 349
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 143/279 (51%), Gaps = 17/279 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + P+L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYYLVELLGPSLSRPTQRSRAISPETQILAALGFYTSGSFQMRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +++ Q +++P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALMERASQ-FIRFPADEASMQALKDEFYGLAGMPGVVGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G + +LGDS++ L+ WL+ P P A+ R+ H T ++E +
Sbjct: 209 SSQF-EAGM--LKNNWLLGDSSFFLRTWLMTPLPL-PKTPAEDRYNLAHSATHSVMEKTL 264
Query: 239 GILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIAL 273
L +F CL+ ++ SP+ + ++ A LHNI+L
Sbjct: 265 QTLCSRFRCLDGSKGALQYSPEKSSHIITACCVLHNISL 303
>gi|328711563|ref|XP_003244572.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 335
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 140/271 (51%), Gaps = 10/271 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F++ FRL V+ ILEE+ + PT N ++S ++L L + G+ G G
Sbjct: 20 FQKRFRLKKRTVKMILEELQDQIKYPTNSN-SISPMTQLLLTLRFFATGNFLITAGDFSG 78
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMV 123
S ++ + + V V+ T + +K P+ + ++ + F F V G +D T +
Sbjct: 79 VSVAAAGQIVKRVSYVLA-TKSDKYIKMPKTTAEIMELKVKFYGLARFLKVIGAIDCTHI 137
Query: 124 LIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ +P+ E+ +Q+ NRNG +LN+ A+ N +F V A WPGS HD+ V NS + +
Sbjct: 138 KLQSPSREYGEQYRNRNGYFSLNLQALVNANLEFLDVVARWPGSAHDSNVFANSRLRARM 197
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+ F D VILGD Y L +L+ P NP A++ + RTR ++E +FG+ K
Sbjct: 198 E---LHEFKDCVILGDPGYALSHYLLTP-IANPTTKAERLYNESQIRTRNVVERTFGVWK 253
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+FP L + +RL + + V+ + LHNIA
Sbjct: 254 RRFPVLFFGLRLKMETSMAVIQSCAILHNIA 284
>gi|72126314|ref|XP_795950.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 293
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 124/245 (50%), Gaps = 8/245 (3%)
Query: 32 TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMV 91
TERN L +VL+ L + VGS + G S+SS+CR I +V I Q M
Sbjct: 10 TERNDPLPVYLQVLTALRFYAVGSFQKMHGDEATVSQSSVCRIIKDVSEAIAGRKRQFM- 68
Query: 92 KWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMA 148
K+P + + F GFP V G +DGT V I +P + F+NR +++NV
Sbjct: 69 KFPTTRDEIETTQQQFYEYCGFPGVIGAIDGTHVYIRSPGGDQALYFMNRKNRYSVNVQV 128
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+C K ++ A WPGS HD+R+ + + + R ++LGDS YP +L+
Sbjct: 129 VCDHAGKMTSIVARWPGSTHDSRIFSECTLKEQLEA---RADGSELLLGDSGYPCLPYLL 185
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTL 268
P R P + R+ HKR R +IE +FG K +FPCLN +R+ +++A + L
Sbjct: 186 TPLLR-PQGRPNVRYNTAHKRGRCVIERTFGRWKRRFPCLNDLRVKVDTTFTIIVACSVL 244
Query: 269 HNIAL 273
NI+L
Sbjct: 245 WNISL 249
>gi|328696997|ref|XP_003240200.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 356
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 154/327 (47%), Gaps = 28/327 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK FRLS V IL+ I P++++ T+RN A++ +K+L L + GS G G
Sbjct: 44 FKTRFRLSKETVLMILDLIGPSISSNTDRNNAVTTTQKLLLALRFYATGSFLISAGDVVG 103
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMV 123
SKS+ C + + V V + + +K PE N + ++ F FP V G +D T +
Sbjct: 104 VSKSTACVIVRD-VSVALAQLRPQFIKMPETNDEIKELQKQFYGIAKFPLVIGAIDCTHI 162
Query: 124 LIDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I +P + + F NR G +LNV + K + WPGS HD+ + S +
Sbjct: 163 KIQSPGGPNAEYFRNRKGWFSLNVQTVVSAKLKIMDIVVRWPGSTHDSTIFSRSKINNDL 222
Query: 183 D-EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
E+ W +++I+ DS Y + ++ P NP A + + RTR +E +G+L
Sbjct: 223 HVEQKW---GNSLIVADSGYANTKHIVTPFL-NPQAGPENLYNESQIRTRNPVERCYGVL 278
Query: 242 KEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
K +FP L+ MRL V++A + LHNIA ++ +D DD V
Sbjct: 279 KRRFPVLSLGMRLQISNIQNVIIACSVLHNIA-----------IDCNDVMPMDD-----V 322
Query: 301 DDPDEIDE--EEQPRQDRNNPGTQRLA 325
PDE+ E E ++R N RL
Sbjct: 323 QLPDELTELINEPTIENRQNNARTRLV 349
>gi|410905903|ref|XP_003966431.1| PREDICTED: putative nuclease HARBI1-like [Takifugu rubripes]
Length = 346
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 161/321 (50%), Gaps = 49/321 (15%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
++ F F F L +++E + AL T+R+ A+S + +VL+ L + GS +
Sbjct: 36 LSQFGFPRQFIL------YLIELLQEALCRRTQRSRAISPEVQVLAALGFYTSGSFQTSM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI-PMT-------FLRKGGFP 112
G T G S++S+ R + +V +V+ K P+ D+ P++ F R G P
Sbjct: 90 GTTIGISQASMSRCVSDVTRALVE-------KAPQFITFDLDPLSREQSFQEFQRVAGLP 142
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
V G +D V I AP + +VN+ G H++ +C + NWPG + D +
Sbjct: 143 GVLGVLDCVQVTIKAPTIEDLSYVNKKGFHSVGCQLVCNAQGLLLSAETNWPGGLKDTDI 202
Query: 173 MRNSMVYR--KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
+++S + + ++ EEGW +LGD YPL++W++ P +P+++A+ ++ H T
Sbjct: 203 LQSSNLSKQMQYTEEGW-------LLGDHRYPLRKWMMTP-VDSPESSAELQYNLAHAAT 254
Query: 231 RRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNI-------ALTIDR-- 277
+++ +F ++ +F CL+ Y++ SP+ + ++LA LHN A ++R
Sbjct: 255 CEIVDRTFRAIQSRFKCLDGSKGYLQYSPERSSAILLACCVLHNAFLQSGFDAWAVERVA 314
Query: 278 -----DNENQNVEIHDDEAED 293
D+ + E++D +AE+
Sbjct: 315 PVEQPDSTEERTEVNDSQAEE 335
>gi|192455694|ref|NP_001122281.1| uncharacterized protein LOC560936 [Danio rerio]
Length = 351
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 136/281 (48%), Gaps = 21/281 (7%)
Query: 3 PFAFK-----EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPY 57
P AF E +R SA + ++ + P + PT R+ A + + + L + G+
Sbjct: 33 PLAFSDEHLYERYRFSAEGMLYLCRLLEPHIKNPTRRSHATTVPQMICIALRFFTSGTFL 92
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG-----GFP 112
+ +G SK+++CR+I VV + + + V +P +P +++G G P
Sbjct: 93 YEVGDAEKLSKNTVCRTIRRVV-IALQRYINTFVAFP----GHLPTQAIKEGFSQIAGLP 147
Query: 113 NVAGCVDGTMVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
V G +D + I P E E F+NRN H++NV C +++A WPGS+ D +
Sbjct: 148 GVIGAIDCIHIPISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQ 207
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ S + ++F + + D V++GD Y + +L+ P P F +TR
Sbjct: 208 IFEKSKLCQRFQQGLF----DGVLVGDGTYACQSFLLTPYP-EPKTKPQHEFNIALSQTR 262
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
I+N+ ILK +F CL +R+SP+ A +V A LHNIA
Sbjct: 263 LKIDNTLAILKARFNCLRDLRVSPERASQIVGACAVLHNIA 303
>gi|432951686|ref|XP_004084885.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 360
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 136/279 (48%), Gaps = 19/279 (6%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
FR+ E + ++P L+ T A+ +++V L W+ G+ Y + G S
Sbjct: 37 FRMRQETFEMLSVRLSPMLSYEDTTFRQAIPVQKRVGVGLWWLATGAGYRTLAHLFGISD 96
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+S+C + E + + +K PE E L + + F + GFP AG +DG+ + + A
Sbjct: 97 ASVCLIVREFCHAVRHERMREYIKLPEGEELQTVLLGFKNRWGFPQCAGAIDGSHIPVIA 156
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P+E+ + NR G H++ + A+ Y F +N WPGSVHD+RV+RNS +Y K E
Sbjct: 157 PHENHADYFNRKGCHSVILQAVVDHKYCFKNINIGWPGSVHDSRVLRNSEMYEK--AESG 214
Query: 188 RPFPDA-----------VILGDSAYPLKEWLIP--PRTRNPDAAADQRFLRCHKRTRRLI 234
FP+ ++LGD AYPL+ WL+ P T N + Q F R
Sbjct: 215 VLFPNTTEEIQGTQVPIMLLGDPAYPLRSWLMKGYPETGNLNEQQRQ-FNNRLSGARMTA 273
Query: 235 ENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
E +FG LK ++ CL+ + + V+ A TLHNI
Sbjct: 274 ECAFGRLKGRWRCLSKRLDVDISLVPTVISACCTLHNIC 312
>gi|126631710|gb|AAI34162.1| Zgc:162945 protein [Danio rerio]
Length = 351
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 136/281 (48%), Gaps = 21/281 (7%)
Query: 3 PFAFK-----EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPY 57
P AF E +R SA + ++ + P + PT R+ A + + + L + G+
Sbjct: 33 PLAFSDEHLYERYRFSAEGMLYLCRLLEPHIKNPTRRSHATTVPQMICIALRFFTSGTFL 92
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG-----GFP 112
+ +G SK+++CR+I VV + + + V +P +P +++G G P
Sbjct: 93 YEVGDAEKLSKNTVCRTIRRVV-IALQRYINTFVAFP----GHLPAQAIKEGFSQIAGLP 147
Query: 113 NVAGCVDGTMVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
V G +D + I P E E F+NRN H++NV C +++A WPGS+ D +
Sbjct: 148 GVIGAIDCIHIPISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQ 207
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ S + ++F + + D V++GD Y + +L+ P P F +TR
Sbjct: 208 IFEKSKLCQRFQQGLF----DGVLVGDGTYACQSFLLTPYP-EPKTKPQHEFNIALSQTR 262
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
I+N+ ILK +F CL +R+SP+ A +V A LHNIA
Sbjct: 263 LKIDNTLAILKARFNCLRDLRVSPERASQIVGACAVLHNIA 303
>gi|390364877|ref|XP_794452.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 291
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 122/235 (51%), Gaps = 22/235 (9%)
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVD 119
H + IC + +++ VI K+PE E + + F G PNV G VD
Sbjct: 47 GKHSINVQLICNARYKITNVI---------KFPETMEEVRTAQVDFFNLAGMPNVVGAVD 97
Query: 120 GTMVLIDAPNEHEDQFV--NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
GT V + +D+++ NR G H++NV IC YK V A WPGS HD+RV+RNS
Sbjct: 98 GTHVELHGAPLLDDEYIYTNRKGKHSINVQLICNARYKITNVCARWPGSTHDSRVLRNSR 157
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ +F +G P +++GDS YPL+ WLI P R+P A++ + R H RTR IE
Sbjct: 158 IGERF-ADGELP---GILVGDSGYPLQPWLITP-LRDPQGNAERNYNRAHCRTRVTIEQL 212
Query: 238 FGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
G LK KF CL +++S A +++A L NIA D Q +E+ DE
Sbjct: 213 NGQLKNKFRCLMGQGIQMSAPRACDIIIACAVLFNIAK--DLKEPEQAIEVEPDE 265
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 2/80 (2%)
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
E + + F G PNV G VDGT V L AP + E + NR G H++NV IC Y
Sbjct: 2 EEVRTAQVDFFNLAGMPNVVGAVDGTHVELHGAPLDDEYIYTNRKGKHSINVQLICNARY 61
Query: 155 KFYAVNANWPGSVHDARVMR 174
K V +P ++ + R +
Sbjct: 62 KITNV-IKFPETMEEVRTAQ 80
>gi|449691294|ref|XP_002166584.2| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 226
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 120/207 (57%), Gaps = 9/207 (4%)
Query: 91 VKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+K+P + LN+ + F FP V G +D T + I P+EHE +V+ + +H++NV A
Sbjct: 7 IKYPTNQHVLNESRVEFYESAEFPQVTGVIDCTHICIQKPHEHEYAYVDSSSNHSINVQA 66
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+C KF V A WPGS HDAR++R S + +KF + ++ +++GDS YP WL+
Sbjct: 67 VCDNKGKFIDVVAKWPGSTHDARILRESKLGKKFMDGTFK----GLLIGDSGYPCFRWLL 122
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTT 267
P NP A+ R+ ++TR +IE FG K +F L+ +R++P+ +V A
Sbjct: 123 TPYL-NPTTASQHRYSISLRKTRVIIEQVFGRWKRRFHLLHGEIRMTPERTCTLVAACAV 181
Query: 268 LHNIALTIDRDNENQN-VEIHDDEAED 293
LHN+A+ ++ + ++N +E +++ ED
Sbjct: 182 LHNLAIQLNDGDMDENPIENDENDNED 208
>gi|390366728|ref|XP_003731101.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 426
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 152/330 (46%), Gaps = 21/330 (6%)
Query: 10 FRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
FR+S F+ ++ P + T+ VA+ + +V L + Y I G
Sbjct: 102 FRMSWQTFNFLCGQLRPIIQKMDTKFRVAIRVEHRVAITLWRLATNVEYRTIAQMFGVGT 161
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S++C +H+V IV T M++ P + D+ F K GFP G +DG+ + + +
Sbjct: 162 STVCCIVHQVCRAIVVTLANDMIRIPRGDAAADVVREFEVKWGFPQCFGAIDGSHIPVLS 221
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE-- 185
P E + NR G +++ + + Y+F +N +PGSVHDARV NS V+R +E
Sbjct: 222 PKEFRADYYNRKGFYSMVLQGLVDHRYRFMNINFGYPGSVHDARVFTNSRVFRLGNEGEL 281
Query: 186 --------GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIEN 236
G P A ILGDSAYPL WL+ P N ++R R R ++EN
Sbjct: 282 CPPLLREIGETQVPVA-ILGDSAYPLLPWLMKPFHDNGQLTREKRHFNYRLSRARMVVEN 340
Query: 237 SFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
FG LK ++ L ++ G +V+A LHN+ ++ +N ++ D + G
Sbjct: 341 GFGRLKGRWRILLKRQDTHVKYLGDLVVACCVLHNLC-----ESAGENFDL-DLLGANGG 394
Query: 296 EEEAVDDPDEIDEEEQPRQDRNNPGTQRLA 325
+ +A + P DE+ + N Q L
Sbjct: 395 DGQAPNVPGMNDEQNRDATRIRNALLQHLT 424
>gi|328704555|ref|XP_001947392.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 370
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 159/332 (47%), Gaps = 22/332 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FRL + V+ +L EI L T+RN +++ ++L L + G+ +G G
Sbjct: 52 FVRRFRLKKSTVDKVLNEIDDQLKYSTDRNNSIAPMTQLLITLRFYATGNFLITVGDFGG 111
Query: 66 PSKSSICRSIHEVVPVI--VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S +S + + V + + + + R+ + PEE + ++ + F FP V G +D T +
Sbjct: 112 ISVASAGKIVKRVSYALAFLSSRYIRLPETPEEKM-ELKVQFYGLARFPKVIGAIDCTHI 170
Query: 124 LIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ P+ E+ + + NR G +LNV A+ N +F V WPGS HD+ + NS + +
Sbjct: 171 KLQCPSKEYGELYRNRKGYFSLNVQALVNANLEFMDVVVRWPGSAHDSNIFANSRLKARI 230
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+ F D +ILGDS Y L +L+ P + A++ + RTR ++E SFG+ K
Sbjct: 231 E---LPEFSDCIILGDSGYALSHYLLTPLA-HTTTNAERLYNESQIRTRNVVERSFGVWK 286
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVD 301
+FP L + +RL + V+ A LHN+A QN D + DD E
Sbjct: 287 RRFPVLFFGLRLKMETTMAVIQACAVLHNMA-------RLQN----DPQPPDDIENIKQL 335
Query: 302 DPDEIDEEEQPRQDRNNPGTQ--RLARLLNYF 331
EI E E Q + N Q R A ++ +F
Sbjct: 336 LSAEISEIEAATQPQINSPAQALRTALIMEHF 367
>gi|328701629|ref|XP_003241664.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 348
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 131/275 (47%), Gaps = 16/275 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FRL+ V+F+L I P + + T RN A++ +K+L L + GS G
Sbjct: 50 FLNRFRLNKDTVKFLLRIIKPKIVSQTSRNNAVTHSQKLLMTLRYYATGSFIATCADFAG 109
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
K++ + I EV I + +P ++ + + F FP+ G +D T +
Sbjct: 110 IHKTTGGKIIIEVSKAIA-ALRPDFIHFPTTDDEIRTVKQDFFNIAKFPSCIGAIDCTHI 168
Query: 124 LIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I + ++ D F NR ++NV IC + A WPGS HDA + RNS + + F
Sbjct: 169 KIRSLGGNDADIFRNRKQFFSMNVQTICDSKLIIQNIVARWPGSSHDANIFRNSAIKQHF 228
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
D R F D V++ DS YP++ ++I P NP + F RTR IE S+G+ K
Sbjct: 229 DN---REFKDCVLVADSGYPMQSYMITPML-NPITNVENTFNESQIRTRNPIERSYGVWK 284
Query: 243 EKFPCLNYMRLSPQFAGM-----VVLASTTLHNIA 272
+FP L+ L M +++A+ LHNIA
Sbjct: 285 RRFPILS---LEINVRNMDTVQAIIVATAVLHNIA 316
>gi|301609160|ref|XP_002934124.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 347
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 154/302 (50%), Gaps = 27/302 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T + F F ++++++ + +L+ PT+R+ A+S + ++++ L + GS +
Sbjct: 36 VTTYGFPRPF------IDYLVDLLGGSLSRPTQRSRAISPETQIMAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G T G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDTIGISQASMSRCVTNVTEALVERASQ-FISFPRDERSVQRLKDEFYNLAGVPGVLGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D T V I APN + +VNR G H+LN + +C + GS+ D V+ S +
Sbjct: 149 DCTQVNIKAPNSEDLSYVNRRGLHSLNCLLVCDARGSLLWAETSRLGSMQDNAVLHQSEL 208
Query: 179 YRKFD----EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
Y F+ ++GW +L D+A+ L+ WL+ P + P++ +D R+ H T ++
Sbjct: 209 YSLFETKMHKQGW-------LLADNAFILRPWLMTP-VQLPESPSDYRYNMAHTATHSVM 260
Query: 235 ENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRD--NENQNVEIHD 288
E + L+ +F CL+ ++ SP+ + +VLA LHNIAL D D +E+ +
Sbjct: 261 ERTQRSLRLRFRCLDGSRATLQYSPEKSAQIVLACCILHNIALQHDLDIVSESGTSSLEP 320
Query: 289 DE 290
DE
Sbjct: 321 DE 322
>gi|157423504|gb|AAI53379.1| Zgc:162945 [Danio rerio]
Length = 351
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 135/281 (48%), Gaps = 21/281 (7%)
Query: 3 PFAFK-----EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPY 57
P AF E +R SA + ++ + P + PT R+ A + + + L + G+
Sbjct: 33 PLAFSDEHLYERYRFSAEGMLYLCRLLEPHIKNPTRRSHATTVPQMICIALRFFTSGTFL 92
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG-----GFP 112
+ +G SK+++CR+I VV + + + V +P +P +++G G P
Sbjct: 93 YEVGDAEKLSKNTVCRTIRRVV-IALQRYINTFVAFP----GHLPTQAIKEGFSQIAGLP 147
Query: 113 NVAGCVDGTMVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
V G +D + I P E E F+NRN H++NV C +++A WPGS+ D +
Sbjct: 148 GVIGAIDCIHIPISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQ 207
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ S + ++F + + D V++GD + +L+ P P F +TR
Sbjct: 208 IFEKSKLCQRFQQGLF----DGVLVGDGTSACQSFLLTPYP-EPKTKPQHEFNIALSQTR 262
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
I+N+ ILK +F CL +R+SP+ A +V A LHNIA
Sbjct: 263 LKIDNTLAILKARFNCLRDLRVSPERASQIVGACAVLHNIA 303
>gi|328700543|ref|XP_003241296.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 377
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 152/319 (47%), Gaps = 18/319 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F + FRLS + + P L ERN + + +VL +++ GS Y H
Sbjct: 43 FMDIFRLSPELSMDLTNTLRPFLQR--ERNSGIPVEIQVLVAVYFYAKGS-YQRATGDHF 99
Query: 65 --GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL--NDIPMTFLRKGG-FPNVAGCVD 119
S+ S+ R +H V I + ++ V++P NL N+ F F G +D
Sbjct: 100 DLNVSQPSVSRCLHAVTDAINNNLLRQWVRFPMTNLERNNAREEFCNAPQPFEGAIGAID 159
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T + I AP +HE+ FVN +G+H+LNV A+ P K +NA +PG+ +D+ + S +
Sbjct: 160 CTYINILAPKDHEEAFVNHHGNHSLNVQAVVSPKLKILNINARYPGARNDSYIWSTSPIR 219
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
R + + ++GDS YPL+ WL+ P + ++ H + R ++E FG
Sbjct: 220 RAMEFHYNKGERHTWLIGDSGYPLEPWLMTPLPHYREGTRQFKYTMKHCKARNVVERFFG 279
Query: 240 ILKEKFPCLNYMRL---SPQFAGMVVLASTTLHN--IALTIDRDNENQNVEIHDDEAEDD 294
+ K + CL+Y R+ +P AG +V A LHN I I D EN +D+ A +
Sbjct: 280 VFKSVWRCLSYQRVLMYAPDMAGKIVNACAVLHNMRIHYRIPIDVENIENGNNDEYAIRN 339
Query: 295 GEEEAVDDPDEIDEEEQPR 313
++ + DEI E PR
Sbjct: 340 FQQ----NEDEIAERGGPR 354
>gi|301617460|ref|XP_002938161.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 332
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 119/242 (49%), Gaps = 7/242 (2%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ + + E P L T R+ A+ K+L LH++ GS + G G S+
Sbjct: 80 YRLNRAAIYSLYEVQEPYLQPLTRRSHAVPGMVKLLCSLHFLATGSFQKVGGVYGGVSQP 139
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ R + +V+ I + + +P N I F G PNV G +D T + +++
Sbjct: 140 TFSRCLGQVLDAI-HSVSANYISFPTNRNEWNTIKRQFYGVSGIPNVLGDIDCTNMALNS 198
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P + E + NR H+LN+ +C ++ + +PGS HDA ++R S Y F E G
Sbjct: 199 PQDREHVYRNRKSYHSLNIQMVCDATMNIRSIMSGFPGSSHDAYILRQSGHYDGF-ETG- 256
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
FP I GD+ YP WLI P R P A+ F H R R +IE +FG+LK +F C
Sbjct: 257 -KFPSYFISGDAGYPCSRWLITPIHR-PRTEAECAFNEAHVRARSVIERTFGVLKSRFCC 314
Query: 248 LN 249
L+
Sbjct: 315 LD 316
>gi|115683787|ref|XP_001179790.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 219
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 115/212 (54%), Gaps = 13/212 (6%)
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA----PNEHEDQFVNRNGDHALNVMAICG 151
E +N + F GFP V G VDGT + ++ P EH ++NR G +++N IC
Sbjct: 2 EEVNRAQVDFFNISGFPQVIGVVDGTHIRLNGAPLGPGEHV--YMNRKGYYSINTQIICD 59
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR 211
NYK + A WPGS HD+R+ +NS V + F++ ++LGDS Y L+ +L+ P
Sbjct: 60 TNYKIINILARWPGSTHDSRIFQNSRVGQTFEDLQQH----GLLLGDSGYALRPYLMTP- 114
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHN 270
NP A+Q + R H TR IE G LK KF C LN M+++P+ A ++ A LHN
Sbjct: 115 VLNPRTPAEQAYNRAHATTRVRIEQVNGQLKHKFRCLLNGMQMAPRRACKIITACAVLHN 174
Query: 271 IALTIDRDNENQNVEIHDDEAEDDGEEEAVDD 302
+A + + +E + I E E D ++ VD+
Sbjct: 175 VAKDLKQPDE-VDFGIEQHEHEPDRDDHIVDN 205
>gi|328700912|ref|XP_003241424.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 429
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 132/271 (48%), Gaps = 9/271 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+ FRLS VE +L+ I ++ + N A++ +K+L L + GS G G
Sbjct: 144 FRMRFRLSKVVVEQVLDLIRDNISVENQWNCAIAPIDKLLLTLRFYATGSFLITAGDFLG 203
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
SKSS C + V I Q + P +E + + +F + FP G +D T +
Sbjct: 204 VSKSSACVIVRTVSTAIARLCHQ-FIYMPTTDEEVYTLQRSFYKIARFPRAIGAIDCTHI 262
Query: 124 LIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I +P H + F NR G +LNV + P+ K + A WPGS HD + RNS ++ +
Sbjct: 263 RIQSPGGHNAEYFRNRKGYFSLNVQTVVSPDLKIMDIVARWPGSCHDQTIFRNSNIHSQL 322
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
W +++I+ DS Y ++ P NP ++ + RTR +E ++G+LK
Sbjct: 323 VNGKW---GNSLIVADSGYKNTSHIVTPFI-NPRGNIEELYNESIIRTRNPVERTYGVLK 378
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+FP L+ +RL + V++ + LHNIA
Sbjct: 379 RRFPILSLGLRLKLTTSQAVIVTCSILHNIA 409
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD 192
+ F NR G +LNV + P+ K + A WPGS HD + RNS ++ + W +
Sbjct: 16 EYFRNRKGYFSLNVQTVVSPDLKIMDIVARWPGSCHDQTIFRNSNIHSQLVNGKW---GN 72
Query: 193 AVILGDSAYPLKEWLIPP 210
++I+ DS Y ++ P
Sbjct: 73 SLIVADSGYKNTSHIVTP 90
>gi|270016593|gb|EFA13039.1| hypothetical protein TcasGA2_TC010570 [Tribolium castaneum]
Length = 393
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 144/304 (47%), Gaps = 14/304 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVG-SPYHIIGATH 64
F++ FR++ E +L+ + P L + EK L + W+ V Y +G
Sbjct: 70 FQQHFRVTIEAYEQLLQTVGPYLMRQAATGRSTVQVEKQLLSVIWILVTPDSYRSVGERF 129
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G +KSS+ + VV I++ ++KWPE L ++ F P+ G +DGT V
Sbjct: 130 GLAKSSLSQCFKRVVK-ILNAVAPTIIKWPEGMQLQNVERRFHAFARMPHCIGAIDGTYV 188
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I AP E ++ R ++A + AI P+ KF +PGSV D+R+ +NS Y +
Sbjct: 189 EIKAPKEDPQSYITRKCNYAFTLQAIAVPSLKFTDAFIGYPGSVSDSRIFKNSDFYNAVN 248
Query: 184 EEGWRPFPDA-VILGDSAYPLKEWLIPPRT-RNPDAAADQRFLRCHKRTRRLIENSFGIL 241
F + ++GD AYP W I P R AA F H +TR++IE SF +L
Sbjct: 249 ANMGHYFAEEHYLIGDKAYPNLPWCIAPYINRGNLNAAQVYFNTVHAQTRQVIERSFALL 308
Query: 242 KEKFPCLNYMRLS-PQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
+F L ++ ++ + V+A+ LHN+ L ++H D+ ++G + V
Sbjct: 309 FGRFRRLKFLDMNDTKLIPATVIAACVLHNVCLDFH--------DLHIDDYINNGIDYVV 360
Query: 301 DDPD 304
++ D
Sbjct: 361 NNND 364
>gi|328697463|ref|XP_003240344.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 279
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/240 (31%), Positives = 120/240 (50%), Gaps = 8/240 (3%)
Query: 4 FAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
F FK F+LS +L I + T RN +LSA+ ++L L + G+ ++G
Sbjct: 45 FDFKIRFKLSKETFMILLHMIGENIEHKTLRNFSLSAEVQILIALRYYATGTFQAVLGDH 104
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGT 121
K+++CR + V I Q +++P L I F + GFP V G +D T
Sbjct: 105 IHVHKATVCRIVKRVSLQIAQLRPQ-YIQFPNNTVQLQQIQARFYKLHGFPRVIGAIDCT 163
Query: 122 MVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ I +P N+ ++F NR G +LN+ AIC K + A WP SVHD+ + NS +
Sbjct: 164 HIRIQSPKNDIGEKFRNRKGYFSLNIQAICDSQLKIMNIVARWPRSVHDSTIFDNSFIRA 223
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
KF+ F + +LGD YP + +L+ P NP A++++ + TR ++E FG+
Sbjct: 224 KFENN---EFGNTFLLGDGGYPCRSYLLTP-LLNPRTDAERKYQKAQIGTRNVVERLFGV 279
>gi|350536511|ref|NP_001233059.1| uncharacterized protein LOC100569528 [Acyrthosiphon pisum]
gi|239789800|dbj|BAH71500.1| hypothetical protein [Acyrthosiphon pisum]
Length = 353
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 146/307 (47%), Gaps = 20/307 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+ FR+S V +L+ I +++ T+RN A+++ K+L L + G+ G G
Sbjct: 43 FRIRFRISKVVVLQVLDHINDQISSLTDRNHAVTSINKLLLALRFYATGNFLITSGDFLG 102
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
SK++ + +V + + R ++ P E ++ + +F + FP G +D T V
Sbjct: 103 VSKTTASLIVRDV-SIAIAKLRPRFIQMPTTEREISKLQRSFYQIARFPRTIGAIDCTHV 161
Query: 124 LIDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P + + F NR G ++NV I PN K V A WPGS HD + + S +Y
Sbjct: 162 KIQNPGGPNAEYFRNRKGYFSINVQTIACPNLKIMDVVARWPGSCHDQTIFKKSQIYYNL 221
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
W +++I+ DS Y L+ P NP ++ + RTR +E S+G+LK
Sbjct: 222 INGKW---GNSLIVADSGYANSRHLVTPFL-NPRNDIEELYNESIIRTRNPVERSYGVLK 277
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL-----------TIDRDNENQNVEIHDDE 290
+FP L+ +RL + V++A LHNIA + +NEN N+E E
Sbjct: 278 RRFPVLSLGLRLKLETTQAVIVACCVLHNIACDNNDMDPPALDIVLPENENINIEEQQPE 337
Query: 291 AEDDGEE 297
E+ ++
Sbjct: 338 GENARQQ 344
>gi|349604650|gb|AEQ00142.1| Putative nuclease HARBI1-like protein, partial [Equus caballus]
Length = 286
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 125/243 (51%), Gaps = 11/243 (4%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-- 94
A+S + ++L+ L + GS +G G S++S+ R + V +V+ Q + +P
Sbjct: 3 AISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQ-FIHFPAD 61
Query: 95 EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
E ++ + F G P V G VD V I APN + +VNR G H+LN + +C
Sbjct: 62 ESSMQALKDEFYELAGMPGVIGVVDCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDIRG 121
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
V NWPGS+ D V++ S + +F E G D+ +LGDS++ L+ WL+ P
Sbjct: 122 ALMTVETNWPGSLQDYAVLQQSSLSSQF-EAGMH--KDSWLLGDSSFFLRTWLMTP-LHI 177
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHN 270
P+ + R+ H T +IE +F L +F CL+ ++ SP+ + ++LA LHN
Sbjct: 178 PETPVEYRYNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHN 237
Query: 271 IAL 273
I+L
Sbjct: 238 ISL 240
>gi|328697485|ref|XP_003240353.1| PREDICTED: hypothetical protein LOC100574226 [Acyrthosiphon pisum]
Length = 681
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 134/271 (49%), Gaps = 9/271 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FRL T V+ +L +I L + T RN A+ K+L L + +G+ G G
Sbjct: 43 FFMRFRLKKTTVKELLFKIEFHLKSKTNRNRAILPITKLLLTLRFYALGTILLANGDFVG 102
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
SK+S C + V I + +K PE N+ + + F FP + G +D T V
Sbjct: 103 VSKTSACNIVRTVTEAIA-SLRPLYIKMPEHHCNIQETRLKFYNIARFPRIIGAIDCTHV 161
Query: 124 LIDAPNEH-EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ +P + + + NR G +LNV + G + + + A WPGS HD + NS ++ KF
Sbjct: 162 KLQSPGGNIAEVYRNRKGYFSLNVQVVGGRSLEILDIVARWPGSTHDQVIFNNSTIHFKF 221
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+ D ++LGD Y + +++ P +P+ A+ + RTR IE FG+ K
Sbjct: 222 ETN---EMGDNILLGDGGYECRPYILVPLI-SPNTNAELLYNESQIRTRNTIERLFGVWK 277
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+FP L+ +R +P A +++A+ LHN+A
Sbjct: 278 RRFPILSLGIRTTPDRAQAIIVATAVLHNLA 308
>gi|301623992|ref|XP_002941295.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 309
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 98/173 (56%), Gaps = 16/173 (9%)
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
G FPN G +D T V + P H+++++NR H++NV +C + + +V + +PGSVH
Sbjct: 105 GHFPNCLGAIDCTHVPLTPPRAHQERYLNRKRSHSINVQVVCDSHLRIMSVRSGFPGSVH 164
Query: 169 DARVMRNSMVYRKFDE----EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFL 224
DA ++R S +Y +F + GW ++GD+ Y + WL+ P R P A +R+
Sbjct: 165 DAHILRQSALYERFTQGEMPRGW-------LVGDAGYGVLPWLMTP-VRFPRTPAQRRYN 216
Query: 225 RCHKRTRRLIENSFGILKEKFPCLNY----MRLSPQFAGMVVLASTTLHNIAL 273
R H++TR +IE FG+LK +F CL+ + SP +++ LHN+AL
Sbjct: 217 RAHRKTRNVIERLFGVLKSRFRCLSVTGGALLYSPIKVSEIIVVCAMLHNVAL 269
>gi|47210238|emb|CAF92077.1| unnamed protein product [Tetraodon nigroviridis]
Length = 373
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 132/258 (51%), Gaps = 23/258 (8%)
Query: 27 ALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTF 86
AL T+R+ A+S + +VL+ L + GS +G T G S++S+ R + +V +V+
Sbjct: 84 ALCRRTQRSRAISPEVQVLAALGFYTSGSFQTSMGTTIGISQASMSRCVSDVTRALVE-- 141
Query: 87 FQRMVKWPEENLNDIP-----MTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
+ ++ NL+ + F R GFP V G +D V I AP + +VN+ G
Sbjct: 142 --KAPQFITFNLDPLSREQSFQEFQRVAGFPGVLGVLDCVQVTIKAPTIEDMSYVNKKGF 199
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE--EGWRPFPDAVILGDS 199
H++ +C + NWPG + ++ S + ++ E EGW +LGD
Sbjct: 200 HSVACQLVCNAQGLLLSAETNWPGGLRATDILERSSLNKQMQEAAEGW-------LLGDR 252
Query: 200 AYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSP 255
YPL++W++ P +P+++A+ ++ H T +++ +F ++ +F CL+ Y++ SP
Sbjct: 253 RYPLRKWMMTP-VDSPESSAELQYNAAHAATHEIVDKTFRAIRSRFKCLDGTKGYLQYSP 311
Query: 256 QFAGMVVLASTTLHNIAL 273
+ + ++LA LHN L
Sbjct: 312 ERSAAILLACCVLHNAFL 329
>gi|301611155|ref|XP_002935119.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 320
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 101/177 (57%), Gaps = 9/177 (5%)
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+P +L G FPN G +D T V + P H+++++NR H++NV +C + + +V
Sbjct: 97 LPAVYL-IGHFPNCLGAIDCTHVPLTPPRAHQERYLNRKRSHSINVQVVCDSHLRIMSVR 155
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD 220
+ +PGSVHDA ++R S +Y +F + P ++GD+ Y + WL+ P R P A
Sbjct: 156 SGFPGSVHDAHILRQSALYERFTQG---EMPRGWLVGDAGYGVLPWLMTP-VRFPRTPAQ 211
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLNY----MRLSPQFAGMVVLASTTLHNIAL 273
+R+ R H++TR +IE FG+LK +F CL+ + SP +++ LHN+A+
Sbjct: 212 RRYNRAHRKTRNVIERLFGVLKSRFRCLSVTGGALLYSPIKVSEIIVVCAMLHNVAM 268
>gi|148228010|ref|NP_001088509.1| putative nuclease HARBI1 [Xenopus laevis]
gi|82180108|sp|Q5U538.1|HARB1_XENLA RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|54311411|gb|AAH84846.1| LOC495378 protein [Xenopus laevis]
Length = 347
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 144/288 (50%), Gaps = 25/288 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T + F F ++++++ + +L+ PT R+ A+S + ++++ L + GS +
Sbjct: 36 VTTYGFPRPF------IDYLVDLLGASLSRPTHRSRAISPETQIMAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G T G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDTIGISQASMSRCVTNVTEALVERASQ-FISFPRDERSVQGLKDEFYNLAGVPGVLGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS-- 176
D T V I APN + +VN G H+LN + +C + GS+ D V+ S
Sbjct: 149 DCTQVNIKAPNSEDLSYVNSRGLHSLNCLLVCDARGSLLWAETSRLGSMQDNAVLHQSEL 208
Query: 177 --MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+ K ++GW +L D+A+ L+ WL+ P + P++ +D R+ H T ++
Sbjct: 209 SGLFETKMHKQGW-------LLADNAFILRPWLMTP-VQIPESPSDYRYNMAHTATHSVM 260
Query: 235 ENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRD 278
E + L+ +F CL+ ++ SP+ + +VLA LHNIAL D D
Sbjct: 261 ERTQRSLRLRFRCLDGSRATLQYSPEKSAQIVLACCILHNIALQHDLD 308
>gi|193592115|ref|XP_001950353.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 375
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 136/289 (47%), Gaps = 12/289 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F FRLS + + P L ERN + + +VL+ ++ GS Y H
Sbjct: 43 FMNVFRLSPELSMDLTNTLRPFLQR--ERNSGIPVEIQVLAAVYVYAKGS-YQRATGDHF 99
Query: 65 --GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG---FPNVAGCVD 119
S+ S+ R +H V VI D ++ V +P N + K F G +D
Sbjct: 100 DLNISQPSVSRCLHSVTNVINDRLLRQWVTFPMTNADRNKAREEFKNAPQPFEGAIGAID 159
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T + I AP +HE+ +VN +G+H+LNV AI P K +N +PG+ +D+ V S +
Sbjct: 160 CTYINILAPKDHEEAYVNHHGNHSLNVQAIVSPKLKILNINPRYPGARNDSYVWSTSPIR 219
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
R + + ++GDS YPL+ WL+ P + ++ H + R ++E FG
Sbjct: 220 RAMEYHYNQGERHTWLIGDSGYPLEPWLMTPLLHYRERTRQFKYTLKHCKARNVVERFFG 279
Query: 240 ILKEKFPCLNYMRL---SPQFAGMVVLASTTLHNIALTIDRDNENQNVE 285
+ K + CL+Y R+ +P AG +V A LHN+ + E +N++
Sbjct: 280 VFKSVWRCLSYQRVLMYAPVMAGKIVNACAVLHNMRIHYRIPIEEENID 328
>gi|322786153|gb|EFZ12759.1| hypothetical protein SINV_12000 [Solenopsis invicta]
Length = 313
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 146/316 (46%), Gaps = 29/316 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT----PTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
FKE FR+ E +L+ I P L P + +SA++++L + +M Y +
Sbjct: 1 FKEHFRMFPETFEEVLQLIGPGLKAINALPGRK--PISAEKQLLIAIWFMSTPDSYRSVS 58
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDG 120
G K++ R++ V + + ++WP+ + ++ F + GFPNV GC+DG
Sbjct: 59 TKFGVGKATAFRALRRVTYAL-HCIAPQFIQWPKGAIATNVMRKFEQVCGFPNVIGCIDG 117
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
T + I AP E ++NR G H++NV +C F A GSVHDARV RNS V
Sbjct: 118 THIKIRAPKEDPVSYINRKGFHSINVQVVCESRGLFTHCYAGHVGSVHDARVFRNSPV-A 176
Query: 181 KFDEEGWRPFPD-AVILGDSAYPLK-EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
F + FPD + I+GD+AY + ++P R A + F C TR IE +
Sbjct: 177 DFLQLPETYFPDNSHIIGDAAYSIHPHCMVPFRDNGHLTNAQKNFNYCLSSTRMTIERAI 236
Query: 239 GILKEKFPC------LNYMRLSPQFAGMVVLASTTLHNIA--------LTIDRDNENQNV 284
G LK +F L +R P+F +LA LHNI L I + E
Sbjct: 237 GHLKVRFRILLDCLPLTNIRKIPEF----ILACCVLHNICILKNDIIELGIYPNEEEVRP 292
Query: 285 EIHDDEAEDDGEEEAV 300
+HD+ E + V
Sbjct: 293 ILHDNAVELGNAKRTV 308
>gi|390338881|ref|XP_003724868.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 345
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 133/276 (48%), Gaps = 13/276 (4%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+ P +++ FR VE++ P +R L+ E + + LH++ +
Sbjct: 21 IAPETYRKLFRFDKENVEWLAHRFIPE--HDEKRGGCLTTVESMEATLHYLADLGYQTTV 78
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCV 118
G G S++++ R + + + +I +K+P N + + K GFP G +
Sbjct: 79 GEVMGISQTTVSRHVAKNIDLIAAQH-PTWIKFPTSNADVTHAKERWAEKLGFPFTIGAI 137
Query: 119 DGTMVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
D T V ID P + D+F+NR + NV A C NY F +V+ WPGSVHD+R+ + S
Sbjct: 138 DCTHVRIDKPRGQFGDEFINRKNYPSFNVQATCDENYIFTSVDVGWPGSVHDSRIFQTSD 197
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ R R +LGDS Y + +++ P +PD + F R H R R +IE +
Sbjct: 198 L-RDVVSRNVR----GCLLGDSGYGISPYMMTPFA-DPDTPVKRSFNRAHTRNRVVIEQA 251
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
FG LK + P L Y +RL + ++A LHN++
Sbjct: 252 FGQLKRRSPILRYGVRLKLENIPKCIVACFVLHNVS 287
>gi|301627133|ref|XP_002942730.1| PREDICTED: putative nuclease HARBI1-like, partial [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 128/263 (48%), Gaps = 46/263 (17%)
Query: 18 EFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHE 77
+ + +E+ P T R+ AL K+L+ LH++G GS + G S+ + R + +
Sbjct: 2 DLVRQELDPV----TARSQALPGISKLLAVLHFLGSGSFQQVSARLVGMSQPTFSRILKQ 57
Query: 78 VVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVN 137
V+ ++ P G +D T V + P H+++++N
Sbjct: 58 VLRALL----------PHSQ----------------SLGAIDCTHVPLTPPRAHQERYLN 91
Query: 138 RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE----EGWRPFPDA 193
R H++NV +C + + +V + +PGSVHDA ++R S +Y +F + GW
Sbjct: 92 RKRSHSINVQVVCDSHLRIMSVRSGFPGSVHDAHILRQSALYERFTQGEMLRGW------ 145
Query: 194 VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY--- 250
++GD+ Y + WL+ P R P A +R+ R H++TR +IE FG+LK +F CL+
Sbjct: 146 -LVGDAGYGVLPWLMTP-VRFPRTPAQRRYNRAHRKTRNVIERLFGVLKLRFRCLSVTGG 203
Query: 251 -MRLSPQFAGMVVLASTTLHNIA 272
+ SP +++ LHN+A
Sbjct: 204 ALLYSPIKVSEIIVVCAMLHNVA 226
>gi|301611277|ref|XP_002935169.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 427
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 145/309 (46%), Gaps = 26/309 (8%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
+ +RL+ + + + E I + T+R+ A+ K+L+CLH+ GS + G S
Sbjct: 78 QRYRLNRSAINELYELIKGDIDPLTQRSHAIPGMVKLLNCLHFFASGSFQTRTSSIGGVS 137
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMVLI 125
+S+ R + V+ I + + +P++ + F R G P+V G +D + +
Sbjct: 138 QSAFSRFMGPVIDSI-KKHLKAYIYFPKDKAGWQRVKRGFYRIAGMPHVMGVLDCMHIAL 196
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE- 184
P+E E+ + N G H++NV IC K ++ + +PGS D+ +++ + VY F+
Sbjct: 197 SPPHEREELYRNSKGFHSVNVQVICDCKGKILSIYSAFPGSSQDSVILKQTSVYEAFENG 256
Query: 185 ---EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
GW ++G Y + WL+ NP A+ F H RT +IE +F L
Sbjct: 257 KLTGGW-------LIGGPGYTCQPWLLTA-VANPTTTAEDSFNEAHARTHSVIERTFSSL 308
Query: 242 KEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDN--ENQNVEIHDDEAED-- 293
K +F CL+ ++ +P +++ LHNI + + N EN E DDE E
Sbjct: 309 KSQFKCLDKPGGVLQYNPTRVADIIVVCCILHNIGI---KHNVLENFACEADDDELEALL 365
Query: 294 DGEEEAVDD 302
GE E D+
Sbjct: 366 SGEPELGDN 374
>gi|340381674|ref|XP_003389346.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 368
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 128/283 (45%), Gaps = 13/283 (4%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P ++E R+S ++ ++ P + T+ A++ +E+V + + Y +
Sbjct: 29 PSWWRENLRMSYETFTMLVNQLRPYIFKRDTQLRTAVTVEERVAITVWRLATNVEYRTLS 88
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDG 120
A G S++C + E VI Q VK P L + F + GFP V G +DG
Sbjct: 89 ALFGVGISTVCTIVMETCTVISQHLLQHYVKIPTGSKLRETVDGFKTRWGFPQVVGAIDG 148
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + + P+E + NR +++ + + +F WPG +HDARV NS Y
Sbjct: 149 SHIPVLRPSECATDYYNRKCFYSIIIQGVVDYQGQFIDTYIGWPGKLHDARVFYNSSFYN 208
Query: 181 KFDEEGWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK-RT 230
K + P +ILGD AYPL WL+ P +P A +++ + R
Sbjct: 209 KGRQGTLFPSMSVRIEQTDIPLLILGDPAYPLLPWLMKPYPDSPSATTEEKHYNYRQSRA 268
Query: 231 RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIA 272
R ++EN+FG LK ++ C L M ++V A TLHNI
Sbjct: 269 RMVVENAFGRLKGRWRCLLKRMDCHLTNVPVIVAACVTLHNIC 311
>gi|328715942|ref|XP_003245784.1| PREDICTED: hypothetical protein LOC100572478 [Acyrthosiphon pisum]
Length = 732
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 127/246 (51%), Gaps = 11/246 (4%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
N A+SA+ ++L L + GS + G +S+ R + V + T +K+P
Sbjct: 445 NHAVSAQNQLLIALRYYATGSFLRVSADFTGVERSTCGRIVRRVSKALA-TLHPHFIKFP 503
Query: 95 E--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDH--ALNVMAIC 150
+ + + F R FP G +D T + I +P ++ +NRN ++N IC
Sbjct: 504 TTADEVETVKQGFYRIAKFPRCIGAIDCTHIKICSPGG-DNAELNRNRKQFFSINFQTIC 562
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
N + + WPGS HDA + +NS++ KF E G + +++GDS YPLK++L+ P
Sbjct: 563 DSNLQIQNIVCRWPGSAHDANIFKNSIIRSKF-EHGL--MGNNLLVGDSGYPLKKYLMTP 619
Query: 211 RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLH 269
+ NP A++R+ RTR IE+S+ + K +FP L + + + A V++A+ LH
Sbjct: 620 LS-NPQTIAEERYNESQIRTRNPIEHSYSVWKRRFPILAIGINVDLETAKTVIVATAVLH 678
Query: 270 NIALTI 275
NIA T+
Sbjct: 679 NIANTL 684
>gi|340377501|ref|XP_003387268.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 414
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 126/286 (44%), Gaps = 15/286 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHI 59
M +K+ FR++ E I E+ P + T +S E+V+ L + Y
Sbjct: 82 MGEVWWKDNFRVTQQTFEEICREVQPYIVRKNTYLRQPISVDERVVVTLWRLATNVDYRT 141
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCV 118
I A G S++C + E V+ + F V P E + F + GFP G +
Sbjct: 142 ISALFGIGCSTVCTIVIETCEVLSEHLFHMYVYIPTGEGQQKVVDGFQSRWGFPQAIGAI 201
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG+ + I P + NR G +++ V + KF WPG HDARV +NS +
Sbjct: 202 DGSHIPIIKPCHCPSDYYNRKGFYSIIVQGLVDHTGKFLDAYIGWPGKCHDARVFQNSSL 261
Query: 179 YRKFDEEGWRPFPD---------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK- 228
Y+K + P VILGD+AYPL WL+ P P + D + +
Sbjct: 262 YKKGIKGSLLPCLTRKLGSVDVPLVILGDAAYPLLSWLMKPYIETPSSPDDIKLYNYRQS 321
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIA 272
R R ++ENSFG LK ++ L RL + +V A TLHN+
Sbjct: 322 RARMVVENSFGRLKGRWRIL-LKRLDCHLDNVPSIVSACITLHNMC 366
>gi|432865843|ref|XP_004070641.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 266
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 7/233 (3%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
F +++I + + P L T R+ +LS +++VL L + G+ Y + G +KS
Sbjct: 40 FHFGNADIKYIADLVMPKLQRRTRRSHSLSLEQQVLIGLRFYASGTFYQAVSDNIGVNKS 99
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
++ + + V + + + V +P + + F G PN G +D T V I AP
Sbjct: 100 TVSDVV-KAVSIALASLVNPFVSFPNDVQIAQTKHKFFTLGNMPNTIGVIDCTHVHIQAP 158
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR 188
+E + +++NR G ++N+ + + WPGSVHDAR++R S +YR+
Sbjct: 159 HERDWEYINRKGRRSINIQLVGDADLIITNCVVKWPGSVHDARILRESALYRELQTNR-- 216
Query: 189 PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
P +ILGDSAYPL WLI P F H +TR IE G+L
Sbjct: 217 --PHGIILGDSAYPLLPWLITPFL-AASTPVQAHFNTAHCKTRCAIERLNGVL 266
>gi|390352427|ref|XP_003727898.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 201
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 105/194 (54%), Gaps = 13/194 (6%)
Query: 114 VAGCVDGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
V G +D T V +D E +VNR G H++NV + NY V A WPGSVHD R
Sbjct: 2 VVGAIDCTHVWLDGSPLKTTEYAYVNRKGWHSINVQFVTDANYNIINVCARWPGSVHDCR 61
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
V+ NS V + F R + V+L DS YP + WL+ P RNP A++ + R H R R
Sbjct: 62 VLENSYVGQDFQ----RGELEGVLLSDSGYPQRSWLMTP-FRNPQTHAERTYNRAHMRGR 116
Query: 232 RLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL-----TIDRDNENQNVE 285
++E + G +K+KFPCL +R+ P+ A +++A T L+ ++ + RD+E +V
Sbjct: 117 VVVEQTNGQIKKKFPCLRRGLRVKPKKACQIIIACTVLYRLSKDWKEPYLGRDHELPDVG 176
Query: 286 IHDDEAEDDGEEEA 299
+ D + + E A
Sbjct: 177 VDDFDGPVNNEGYA 190
>gi|328705831|ref|XP_003242919.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 326
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 144/303 (47%), Gaps = 22/303 (7%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVI--VDTFFQRMVK 92
N +++ ++L L + G+ +G G S +S + + V + + + + R+ +
Sbjct: 37 NNSIAPMTQLLITLRFYATGNFLITVGDFGGISVASAGKIVKRVSYALAFLSSRYIRLPE 96
Query: 93 WPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICG 151
PEE + ++ + F FP V G +D T + + P+ E+ + + NR G +LNV A+
Sbjct: 97 TPEEKM-ELKVQFYGLARFPKVIGAIDCTHIKLQCPSKEYGELYRNRKGYFSLNVQALVN 155
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR 211
N +F V WPGS HD+ + NS + + + F D +ILGDS Y L +L+ P
Sbjct: 156 ANLEFMDVVVRWPGSAHDSNIFANSRLKARIE---LPEFSDCIILGDSGYALSHYLLTPL 212
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
R A++ + RTR ++E SFG+ K +FP L + +RL + V+ A LHN
Sbjct: 213 ART-TTNAERLYNESQIRTRNVVERSFGVWKRRFPVLFFGLRLKMETTMAVIQACAVLHN 271
Query: 271 IALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQ--RLARLL 328
+A QN D + DD E EI E E Q + N Q R A ++
Sbjct: 272 MA-------RLQN----DPQPPDDKENIKQLLSAEISEIEAATQPQINSPAQALRTALIM 320
Query: 329 NYF 331
+F
Sbjct: 321 EHF 323
>gi|449669223|ref|XP_004206967.1| PREDICTED: uncharacterized protein LOC101235795 [Hydra
magnipapillata]
Length = 305
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 134/290 (46%), Gaps = 15/290 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S + V LH++ Y
Sbjct: 1 MEECEFKEHFRVNRNPFNFLVNELHPHLGKTTTTMRE-PISVVKCVAVALHYLASCEEYR 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ ++E + + D + VK+P ENLN F GFP G
Sbjct: 60 VVSSLFGIGKSTAILIVYEFINAVNDILLPKYVKFPLSVENLNKYSRDFEAILGFPQCVG 119
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VD + I AP + + N G +++ + A+ Y F + PG +D+ +++NS
Sbjct: 120 AVDRCHIPISAPKDQAISYYNYKGWYSIVLFAVVDCRYHFIYTSVGSPGRNNDSYILQNS 179
Query: 177 M--------VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCH 227
++ K +E ++GDSA+PL L+ P N + + Q+ F +
Sbjct: 180 SLKAILESNLFDKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPENLELSKIQKNFSKIL 239
Query: 228 KRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
RR++EN+FG LK +F + M FA +V A TLHNI D
Sbjct: 240 CGARRVVENAFGRLKARFRVICKRMECDINFATRIVNACVTLHNICENYD 289
>gi|72158323|ref|XP_794409.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 321
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 110/221 (49%), Gaps = 11/221 (4%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+R S ++ ++ + T+RN AL +V L + GS HG S
Sbjct: 39 YRFSRQGCLHLINRLSRRIQHRTQRNHALPPSLQVFVALRFFATGSVMDCTSCIHGIHIS 98
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ R++ V + + ++K+PE E + + F G P+V G VDGT V +
Sbjct: 99 TTSRTVRRVA-LALGELRDEVIKFPETLEEVGTAQVDFFNLAGMPHVVGAVDGTHVELHG 157
Query: 128 PNEHEDQ--FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+D+ F NR G H++NV IC YK V A WPGS HD+RV+R+ +R+ +
Sbjct: 158 APLLDDEYIFTNRKGKHSINVQLICNARYKITNVCARWPGSTHDSRVLRSH--WRERFAD 215
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
G P +++GDS YPL+ WLI P R+P A++ + RC
Sbjct: 216 GELP---GILVGDSGYPLQPWLITP-LRDPQGNAERNYNRC 252
>gi|328711999|ref|XP_003244703.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 418
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 134/268 (50%), Gaps = 10/268 (3%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RLS V +LE+I AL T+RN +S ++L L + G G G S S
Sbjct: 89 YRLSKPTVLSMLEKIEDALEFETDRNNCISPINQLLCTLRYYATGCFQTTGGDLCGFSSS 148
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV-LID 126
++ R +H+V I + + +P+ E + + F R+ FP V G +D T + L
Sbjct: 149 TMNRIVHKVSCAIA-LLRSQYIHFPDNPEEIRRTQLEFYRRAKFPRVVGAIDCTHIKLWQ 207
Query: 127 APN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+P + ++F NR G ++LNV AIC N + V A + GS HD+R+ R S R E+
Sbjct: 208 SPGGDTAERFRNRKGYYSLNVQAICNANLEVMDVVARYDGSTHDSRIFRESK-RRALFEQ 266
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
G + DA+++ DS Y +++ P A+Q + RTR IE FG+ K +F
Sbjct: 267 G--VYGDALLVADSGYACTSYMMTP-LHECHTPAEQLYNESQIRTRNPIERFFGVWKRRF 323
Query: 246 PCLNY-MRLSPQFAGMVVLASTTLHNIA 272
P + +R+ + ++ A+ L+NIA
Sbjct: 324 PIMALGLRVKLKRVFPIITATLVLNNIA 351
>gi|270000989|gb|EEZ97436.1| hypothetical protein TcasGA2_TC011267 [Tribolium castaneum]
Length = 364
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 143/294 (48%), Gaps = 23/294 (7%)
Query: 11 RLSATQVEFILEEIAPALTTPTE--RNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
RL T ++L+ I+PALT T+ R L+ K+ L + G+ ++G++ S+
Sbjct: 54 RLDRTVFNYVLQLISPALTKNTKKGRYSHLTPMHKLYVALQFYATGTYQWMVGSSSRISQ 113
Query: 69 SSICRSIHEVVPVIVDTF-----FQRMVKWPEENLNDIPMTFLRKG-GFPNVAGCVDGTM 122
S+ ++IHEV +V+ F ++P + R G GFPNV G +D T
Sbjct: 114 SATSKAIHEVTAALVEVAPQFIRFPAAAEYPTVKQAFYALGMTRHGAGFPNVLGAIDCTH 173
Query: 123 VLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V I P+ E +Q++NR+ +++NV + + K + V A +PGS HD+ + RNS V
Sbjct: 174 VRIQKPSTEAPEQYLNRHLYYSINVQLVVSSSLKIFNVFAAFPGSNHDSFIWRNSAVRDG 233
Query: 182 FD----EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
EGW ++GDS YP + WL+ P +Q + H TR +E +
Sbjct: 234 IVAGNFTEGW-------LVGDSGYPQEPWLM-TPVTVPTTVPEQLYNAVHISTRNPVERT 285
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNEN-QNVEIHDD 289
G+LK ++ L+ + SP + A LHNI + + E+ Q +HDD
Sbjct: 286 NGVLKSRWHVLDSPLAYSPAKVCAITTACCVLHNICTEYNLEVEDLQEHPLHDD 339
>gi|332374146|gb|AEE62214.1| unknown [Dendroctonus ponderosae]
Length = 353
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 141/299 (47%), Gaps = 22/299 (7%)
Query: 6 FKEAFRLSATQVE-FILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++ FR S V+ +L + T R + + +L L + G GS I G
Sbjct: 44 FRKRFRFSKEIVQNVLLPLVGHQREQITNRGLPVPLIIGLLLTLQFYGSGSLRIICGELK 103
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G +S++ R I +V I + + +K+PE NLN + F FPNV GC+ +
Sbjct: 104 GFHQSTVSRVITKVTRSIYEQS-RNYIKFPE-NLNTVQTQFQTIRNFPNVIGCIGCAQIP 161
Query: 125 IDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I +P + + F N +LNV + GP+ + Y + A PGS + V S V +F+
Sbjct: 162 IMSPGGPNAELFRNGKRSFSLNVQIVAGPDLEIYDIVATNPGSYRNDHVFNKSAVKTRFE 221
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
++ + +LGDS YP +L+ P R+P ++R+ H + R +++ +FG+L+
Sbjct: 222 KKQLPGY----LLGDSGYPSLTYLLTP-FRDPCTNEEKRYNEAHAQIRNIVDRTFGVLRH 276
Query: 244 KFPCL-NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVD 301
+F CL +R + +++A LHNI++ H + E DGE+ D
Sbjct: 277 RFSCLRRLLRHKHETICCIIVACAVLHNISIA------------HREVVEPDGEDGVED 323
>gi|432951335|ref|XP_004084780.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 254
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 108/233 (46%), Gaps = 27/233 (11%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I I P +T + R AL++++ + + L + GS + IG S
Sbjct: 46 ERYRFSLQSITYIHNLIQPYITNISHRGRALTSEQILCAALRFFANGSFLYNIGDAEHIS 105
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CRS+ +V + F + + +P + L I F R G PNV GC+DGT + I
Sbjct: 106 KATVCRSVRKVC-FALKRFLRIFIVFPGHKPLRAIKEEFHRIAGLPNVVGCIDGTHIPII 164
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P E+E +VNR H++NV IC + V A WPG +
Sbjct: 165 TPTENEADYVNRKSIHSINVQIICDAAHFITNVEAKWPGEI------------------- 205
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
D +LGD YP + L+ P P+ QRF H RTR +E + G
Sbjct: 206 -----DGFLLGDRGYPCQPKLLTPYP-EPEQGPQQRFNLAHSRTRARVEMTLG 252
>gi|270005023|gb|EFA01471.1| hypothetical protein TcasGA2_TC007020 [Tribolium castaneum]
Length = 270
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 116/227 (51%), Gaps = 13/227 (5%)
Query: 47 CLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFL 106
CL ++ IG G +S++ R++ VV I++ +++P N + +
Sbjct: 4 CLRYLADPGFQKGIGEELGVEQSTVSRTVKYVVQKIIEQT-ANWIQFPASNQKNCGKANI 62
Query: 107 RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
GFP VD T V I+ P H D+++NR G +NV A C F +V +WPGS
Sbjct: 63 ---GFP--LQLVDCTHVEIEKPKNHGDEYINRKGKPTINVQATCDAREIFTSVEVSWPGS 117
Query: 167 VHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
VHD+R+ +NS + + + V++GD Y ++ WL+ P NP+ +R+ +
Sbjct: 118 VHDSRIWKNSEIRQIMMRS-----TNTVLMGDDGYGIEPWLMTPFP-NPNDNTKKRYNKI 171
Query: 227 HKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIA 272
K+ R +IE FG +K +FP L Y+ R+ + +++A LHNIA
Sbjct: 172 FKKERVIIERCFGQVKRRFPILKYVCRVKLENISNIIIACFVLHNIA 218
>gi|115767324|ref|XP_791763.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 250
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 110/211 (52%), Gaps = 10/211 (4%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S+SS+ R + +V + T R +++P + + + F FP V G VD T V
Sbjct: 2 SQSSVSRVVLDVSTALA-TLRPRYIQFPRTPQEVLRVQQRFFDYSQFPGVIGAVDCTHVP 60
Query: 125 IDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I P E +F+NR +LNV IC + + V A WPG HD+R+ R S + + +
Sbjct: 61 IQNPGGEQAQRFINRKNTSSLNVQMICDCDGRILNVVARWPGGSHDSRIFRESAIKHQLE 120
Query: 184 EEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
E DA +LGDS Y + +++ P + P A+QR+ HKR R +IE +FG +K
Sbjct: 121 AERQ----DAKWLLGDSGYGCQPYVMTPLLQ-PANPAEQRYNTAHKRGRCIIERTFGQMK 175
Query: 243 EKFPCLNYMRLSPQFAGMVVLASTTLHNIAL 273
+FPCL +RL + ++A T L NI+L
Sbjct: 176 RRFPCLKGLRLKLETTMTTIVAVTVLWNISL 206
>gi|328699306|ref|XP_003240898.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 395
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 134/297 (45%), Gaps = 14/297 (4%)
Query: 5 AFKEAFRLSATQVEFILE---EIAPALTTPTERNVA---LSAKEKVLSCLHWMGVGSPYH 58
F+ FRLS T L EI+ T E+ + +++LS L + Y
Sbjct: 63 VFQSHFRLSRTVFYNTLLPMIEISQNNMTNKEKKTGRPEIPIVKQLLSVLWILATPDSYR 122
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGC 117
+G KSS+ S ++ +++ ++ WP E + F R PNV G
Sbjct: 123 SVGEKFDMGKSSLSVSFFRIINLLIMNA-SNVIIWPHGEEMERQKELFFRMASIPNVIGA 181
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
VDGT + I AP + + +V R ++A+ + AI KF V +PGSV D R+ RNS
Sbjct: 182 VDGTFIPIKAPKQDAEVYVTRKCNYAITLQAITNAELKFTDVFVGYPGSVSDTRIFRNSD 241
Query: 178 VYRKFDEEGWRPFP-DAVILGDSAYPLKEWLIPPR-TRNPDAAADQRFLRCHKRTRRLIE 235
++ FP D ILGD AYP W + P R A + F RTR+ +E
Sbjct: 242 IFINIMRNQKFYFPNDEHILGDKAYPPLTWCVEPYINRGRLTEAQKNFNFNISRTRQTVE 301
Query: 236 NSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
SF + +F Y+ +S F VLA+ LHN+ L ++ + N++ + +E
Sbjct: 302 RSFALFFGRFRRFKYLDMSRTDFIPSTVLAACVLHNLCL---KNKSDANIKDYIEEG 355
>gi|328701623|ref|XP_003241661.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 191
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/195 (39%), Positives = 103/195 (52%), Gaps = 20/195 (10%)
Query: 65 GPSKSSICRSIHEVVPVI-VDTFFQRMVKWPEE--NLNDIPMTFLRKG--GFPNVAGCVD 119
G S+ S+ R I EVV + + F VK+P+ LNDI R G GFP V GC+D
Sbjct: 2 GLSQPSVSRCISEVVTALNLPEIFNAWVKFPKNINELNDI-----RNGNTGFPGVVGCID 56
Query: 120 GTMVLIDAP----NEHEDQ-----FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
T V I P N +E+Q +VNR G H++NV IC N K VNA +PGS HD
Sbjct: 57 CTHVAITPPSTNLNLNENQHPEYIYVNRKGYHSINVQLICDSNLKILNVNALFPGSTHDT 116
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
V NS V E R + + +LGDS Y L++WL+ P +P A++ + R T
Sbjct: 117 HVWNNSKVLPILQELHRRNYNNFYLLGDSGYALRQWLLTP-IPDPSTEAEENYNRRQMST 175
Query: 231 RRLIENSFGILKEKF 245
R +IE G+LK +F
Sbjct: 176 RSIIERCNGVLKMRF 190
>gi|432962500|ref|XP_004086700.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 387
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 143/326 (43%), Gaps = 64/326 (19%)
Query: 1 MTPFAF-----KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGS 55
+ P AF E R SA + ++ + P + T + ALS ++ V L + G+
Sbjct: 31 LDPLAFPDDHLYERCRFSADGIRYLCRLLGPRIKHRTAWSHALSVEQMVCVALRFFASGA 90
Query: 56 PYHIIGATHGPSKSSICRSIHEV---VPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGF 111
+ +G +K++ICR+I V + + D F + +P L D+ F R GF
Sbjct: 91 FLYSVGDAEQLNKATICRTIRSVCLAIKALADVF----ISFPGHRRLCDVKEEFYRIAGF 146
Query: 112 PNVAGCVDGTMVLIDA------PNEHEDQFVNRN------GDH----------------- 142
PNV G VD T + I P E V R G H
Sbjct: 147 PNVIGAVDCTHIRIKRGHTQHPPLTIEGAAVERVSSTKFLGVHFSEDLSWTTNTASLAKK 206
Query: 143 ---------------ALNV-MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AL++ + +C + V A WPGSVHD+R+ R S +Y+ +
Sbjct: 207 AQQRSSAPALPDSPTALSIRLMVCNADCVISNVVAKWPGSVHDSRIFRASEIYQCLSQGE 266
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
+ V+LGD Y + +L+ P T +P A Q + H RTR +E +FG+LK +F
Sbjct: 267 F----SGVLLGDRGYGCQPFLLTPFT-DPQEA-QQAYNHAHARTRTRVEMTFGLLKARFH 320
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIA 272
CL+ +R++P A + +A LHN+A
Sbjct: 321 CLHKLRVNPVTACDITVACAVLHNVA 346
>gi|449688839|ref|XP_004211862.1| PREDICTED: uncharacterized protein LOC101235141 [Hydra
magnipapillata]
Length = 310
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 135/285 (47%), Gaps = 18/285 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y ++ +
Sbjct: 6 FKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVTVALHYLASCEEYRVVTSL 64
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ ++E + + D + VK+P ENLN F GFP G VDG
Sbjct: 65 FGIGKSTANLIVYEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAILGFPQCIGAVDGR 124
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM---- 177
+ I AP + + N G +++ + A+ Y+F + PG +D+ +++NS
Sbjct: 125 HIPISAPKDQAISYYNYKGWYSIVLFAVVDCRYRFIYTSVGSPGRNNDSYILQNSSLKAT 184
Query: 178 ----VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRR 232
++ K ++ P ++GDS +PL L+ P N + Q+ F + RR
Sbjct: 185 LELNLFNKCCKDSLVPL---CLIGDSVFPLTLHLLKPYPENLVLSEIQKNFNKILCGARR 241
Query: 233 LIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
++EN+FG +K +F + M FA +V A TLHNI D
Sbjct: 242 VVENAFGRVKARFRVVCKRMECDINFATRIVNACVTLHNICKYYD 286
>gi|291227593|ref|XP_002733771.1| PREDICTED: transposase, putative-like, partial [Saccoglossus
kowalevskii]
Length = 214
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 111/216 (51%), Gaps = 12/216 (5%)
Query: 28 LTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFF 87
L P+ +LS ++K+LS L +G Y + G KSS+ H VV +
Sbjct: 5 LLAPSIPERSLSIEKKMLSTLWVLGNQESYRGVADRFGIGKSSL----HYVVMTVCQALV 60
Query: 88 QRM---VKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHA 143
+ + WP+ + I +F +K GFP V G VDGT + I P+ H + ++NR G +
Sbjct: 61 AKQSDYICWPKGIEVQQICESFRQKTGFPGVIGAVDGTHIYIPGPSHHRNSYINRKGFPS 120
Query: 144 LNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPL 203
+ + A+C N +F V WPGSV+DARV +NS V E P P+ +LGDSAY L
Sbjct: 121 IQLQAVCDSNLRFTDVYTGWPGSVNDARVFKNSPVRNVLHNE--LP-PNLHLLGDSAYAL 177
Query: 204 KEWLIPPRTRNPDA-AADQRFLRCHKRTRRLIENSF 238
+++ P N A +++F + H TR IE +F
Sbjct: 178 STYVLTPYRDNGHLNAVEKQFNKYHSSTRVDIERAF 213
>gi|397787611|gb|AFO66517.1| putative kinase [Brassica napus]
Length = 1266
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 138/296 (46%), Gaps = 13/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+S + E I EE+ A+ T A+ +++V C+ + G P ++
Sbjct: 178 FKKAFRMSKSTFELICEELNAAVAKEDTALRNAIPVRQRVAVCVWRLATGEPLRLVSKKF 237
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + ++WP+ E+L +I T+ G PNV G + T +
Sbjct: 238 GLGISTCHKLVLEVCKAIKEVLMPKYLQWPDDESLRNIRETYESISGIPNVVGSMYTTHI 297
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ + A+ PN F + WPGS+ D +V+ S
Sbjct: 298 PIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPNGVFTDLCIGWPGSMPDDKVLEKS 357
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++Y++ + G + G + +PL +W++ P T+ F +R+ +
Sbjct: 358 LLYQRANNGGL--LKGLWVAGGAGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQRVAKE 415
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+FG LK ++ CL + Q V+ A LHNI + +R VE+ DDE
Sbjct: 416 AFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEIRGERMEPELMVEVVDDE 471
>gi|301617451|ref|XP_002938158.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 276
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 124/258 (48%), Gaps = 29/258 (11%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ T + + E + PAL T + A+ K+L LH+ GS + G G S+
Sbjct: 24 YRLNKTAINTLYELLEPALEPRTCPSRAVPGMVKLLCSLHFFASGSFQRVGGVYGGVSQP 83
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEENLNDI--PMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ + + +V+ I + + + +P+ + PM T V ++
Sbjct: 84 TFSQCLGQVLDAI-RSASRNFISFPQHRNEWVASPMC----------------THVALNP 126
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P + E NR G H+LNV +C N ++ + +PGS HDA ++R S +Y+ F+
Sbjct: 127 PQDKEHISRNRKGYHSLNVQVVCDANMNIMSIVSGFPGSSHDAYILRQSGLYQAFETG-- 184
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
P +LGD+ YP WLI P R P + A+ F + H RTR +IE +FG+LK +F C
Sbjct: 185 -QMPHGWLLGDAGYPCGRWLITPIHR-PRSRAECAFNQAHVRTRSVIERTFGVLKSRFRC 242
Query: 248 L------NYMRLSPQFAG 259
L N + L P AG
Sbjct: 243 LSPTKVANIIALPPPPAG 260
>gi|294951847|ref|XP_002787148.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239901812|gb|EER18944.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 210
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 101/183 (55%), Gaps = 16/183 (8%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
NV GCVDGT V + P ED ++NR+ H+LN +C + + ++ W G VHD+RV
Sbjct: 3 NVWGCVDGTHVKVHPPQLGEDSYLNRHHFHSLNCQVVCTSSLEICSLTTRWGGRVHDSRV 62
Query: 173 MRNSMVYRKFDEEGWRP-FPDAVILGDSAYPLKEWLIPPRTRNPD---AAADQRFLRCHK 228
RNS + R+ E GWRP D V+LGDSAY ++L P ++ +A ++ + H+
Sbjct: 63 FRNSTLGRRL-ESGWRPDNRDVVLLGDSAYRASDFLKTPVSKAGGRVLSAQERAYNERHR 121
Query: 229 RTRRLIENSFGILKEKFPCLNYMRL-SPQFAG---MV-----VLASTTLHN--IALTIDR 277
RTR ++E G+LK + CL +R+ P+ G MV + A L N IA I+R
Sbjct: 122 RTRSVVERCIGVLKSQLRCLKDLRIRGPRLTGETSMVLTSNAIRACVALRNFFIANNIER 181
Query: 278 DNE 280
+ E
Sbjct: 182 EEE 184
>gi|449665223|ref|XP_004206095.1| PREDICTED: uncharacterized protein LOC101236818 [Hydra
magnipapillata]
Length = 336
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 135/286 (47%), Gaps = 15/286 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 1 MEECEFKEHFRVNRNTFNFLVNELHPHLGQTTTTMRE-PISVVKRVAVALHYLASCEEYR 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ HE + + D + VK+P ENLN+ F GFP G
Sbjct: 60 VVSSLFGIGKSTANLIAHEFINAVNDILLPKYVKFPLSVENLNNHSRDFEAILGFPQCFG 119
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I A + + N G +++ + +I Y+F + PG +D+ ++ NS
Sbjct: 120 AVDGCYIPILASKDQAISYYNYKGWYSIVLFSIVDCRYRFIYTSVGSPGKNNDSYILHNS 179
Query: 177 ----MVYRKFDEEGWRPFPDAVI----LGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCH 227
++ ++ + D+++ + DSA PL L+ P N + + Q+ F +
Sbjct: 180 SLKAILESSLFDKCCKELGDSLVPLCLISDSASPLTRHLLKPYPENSELSEIQKNFNKIL 239
Query: 228 KRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
RR++EN+FG +K +F + M FA +V A TLHNI
Sbjct: 240 YGARRVVENAFGRVKARFGVICKRMECDINFATRIVNACVTLHNIC 285
>gi|345496226|ref|XP_003427676.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 384
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/318 (28%), Positives = 140/318 (44%), Gaps = 37/318 (11%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE-RNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FKE FR+S T E++L I P L E N+ ++ +++ L+ +G Y +
Sbjct: 66 FKETFRMSRTTFEYLLLLIRPNLEGLNEFGNMPINPDKQLYITLYVLGTPDSYRSVTTKF 125
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
K++ R++ VV I + ++WP + M R+ V G VDGT +
Sbjct: 126 NVGKATAWRAVRRVVRSICQ-YRNYFIRWPSAREAAETSMRIARRRRLHGVIGAVDGTHI 184
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I A ++NR G H++ + IC +F A PGSVHD RV R S V ++
Sbjct: 185 RIAALRVDSQAYINRKGVHSIQLQVICNDKLEFIHCYAGLPGSVHDTRVFRYSGVQQRCT 244
Query: 184 EEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQ-RFLRCHKRTRRLIENSFGIL 241
+E FP+ +LGDSAY L+ +I P N ++ F TR ++E S G+L
Sbjct: 245 DEY---FPNNTHLLGDSAYTLQNHVIVPYRDNGHLTVEEVHFNHVLSGTRMMVERSIGLL 301
Query: 242 K-------EKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
K +K P + L P + ++ + LHNI L + +D
Sbjct: 302 KVRWRYFLDKLP-MRRTDLIPYY----IVCACVLHNICLKV-----------------ED 339
Query: 295 GEEEAVDDPDEIDEEEQP 312
E V PD ID +P
Sbjct: 340 TFEYPVIIPDTIDVNPEP 357
>gi|449692693|ref|XP_004213135.1| PREDICTED: uncharacterized protein LOC101236914 [Hydra
magnipapillata]
Length = 336
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 15/286 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 1 MEECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLVSREEYR 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +H+ + + + VK+P ENLN F GFP G
Sbjct: 60 VVSSLFGIGKSTANLIVHKFINAVKYILLPKYVKFPLSVENLNKHSRDFEAILGFPQCVG 119
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN- 175
VDG + I AP + N G + + + ++ Y+F N PG +D+ +++N
Sbjct: 120 AVDGCHISILAPKSQAISYYNYKGWYFIVLFSVVDCRYRFIYTNVGSPGRNNDSYILQNS 179
Query: 176 -------SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCH 227
S ++ K +E ++GDSA+PL L+ P N + + Q+ F +
Sbjct: 180 SLKAFLESSLFDKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPENLELSEIQKNFNKIL 239
Query: 228 KRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
RR++EN+FG +K +F + M FA +V A TL NI
Sbjct: 240 CGARRVVENAFGRVKARFRVICKRMECDINFATRIVNACVTLQNIC 285
>gi|328698115|ref|XP_003240546.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 241
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 97/191 (50%), Gaps = 15/191 (7%)
Query: 90 MVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+ WP EE + I F FPNV G +DGT + I AP++H++ +VNR G H++ + A
Sbjct: 6 FITWPNEEKVEKIKNGFFSYSTFPNVFGAIDGTHINILAPHDHQEAYVNRKGHHSIQLQA 65
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWL 207
+C +F A GSVHD RV R S + D+ FP+ ++GDSAY L E L
Sbjct: 66 VCDHKCRFIHCYAGNVGSVHDQRVFRLSELKNYLDDA--TKFPNNTHLIGDSAYTLHEHL 123
Query: 208 IPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCL------NYMRLSPQFAGM 260
+ P N Q+ F CH R IE SFG LK +F L + L P++
Sbjct: 124 MVPYRDNGHLTQKQKNFNFCHSSARMAIERSFGFLKGRFRSLLTTLDMKRVDLIPKY--- 180
Query: 261 VVLASTTLHNI 271
++A LHNI
Sbjct: 181 -IIACCILHNI 190
>gi|301616303|ref|XP_002937601.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 269
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 107/218 (49%), Gaps = 32/218 (14%)
Query: 32 TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMV 91
T R+ AL K+L+ LH++G GS + G S+ + R + +V+
Sbjct: 24 TARSQALPGISKLLAVLHFLGSGSFQQVSARLVGMSQPTFSRILRKVL------------ 71
Query: 92 KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
+ FPN G +D T V + P H++ ++NR H++NV +C
Sbjct: 72 ----------------RALFPNCLGAIDCTHVPLTPPRAHQEHYLNRKRSHSINVQVVCD 115
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR 211
+ + ++ + +PGSVHDA ++R S +Y +F + P ++GD+ Y + W++ P
Sbjct: 116 SHLRIMSMRSGFPGSVHDAHILRQSALYERFTQG---EMPHGWLVGDAGYGVLPWMMTP- 171
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
R P A +R+ H++TR +IE F +L +F CL+
Sbjct: 172 VRFPRTPAQRRYNCAHRKTRNVIERLFVVLMSRFRCLS 209
>gi|350536071|ref|NP_001234228.1| salt responsive protein 2 [Solanum lycopersicum]
gi|195549553|gb|ACG50004.1| salt responsive protein 2 [Solanum lycopersicum]
Length = 499
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 134/297 (45%), Gaps = 15/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+S + I EE+ +T T A+ +++V C+ + G P +
Sbjct: 172 FKKAFRVSRATFDMICEELESVVTKKDTMLRQAIPVRQRVAVCIWRLATGEPLREVSKRF 231
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + V+WP E+ +N+I F G PNV G + T
Sbjct: 232 GLGISTCHKLVLEVCTAIKGVLMPKFVQWPNEDYKMNEIKSEFQMLSGMPNVGGSIYTTH 291
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
V I AP + N+ +++ V + P F V WPGS+ D +V+
Sbjct: 292 VPIIAPKVSVAAYFNKRHTERNQKTSYSVTVQGVVDPKGVFTDVCIGWPGSMSDDKVLEK 351
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y++ + + D ++G+S YPL +W++ P TR F +++ +
Sbjct: 352 SALYQRANRGQLK---DTWVVGNSGYPLMDWVLAPYTRQNLTWTQHAFNEKVGEVQKVAK 408
Query: 236 NSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+F +K ++ CL + Q +V+ A LHNI + ++ N ++ DDE
Sbjct: 409 EAFMRMKARWSCLRKRTEVKLQDLPVVLGACCVLHNICEMRGEQLNPGLRFDLFDDE 465
>gi|340372975|ref|XP_003385019.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 386
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 132/308 (42%), Gaps = 23/308 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
+KE FR+S + +E+ + T + S +++V L+++ +
Sbjct: 64 WKENFRMSRNSFHQLCDELRTYIERQDTVMRPSTSVEKQVAITLYYLSDEGRMRKTANSF 123
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+S++ + V I + + P ++ + + F + GFP G VDGT
Sbjct: 124 GVSRSTVSIIVRLVTSAITQYLGPKYISLPLTANDVKEKTVGFFKAFGFPQCIGTVDGTH 183
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ I AP + ++NR +LN+ A C Y+F V WPGSVHDARV NS +
Sbjct: 184 IEIKAPRRNPTDYINRKSKFSLNIQACCDYRYQFIDVVIKWPGSVHDARVFANSKLNNML 243
Query: 183 DEEGW-----------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+E P P I+GD AYPL +++ T +Q F R
Sbjct: 244 RDETIPSCKVQLVENEDPIP-IFIIGDPAYPLMPYVMKEYTGGGATVHEQYFGYKLCSAR 302
Query: 232 RLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
+IE SFG LK +F CL M + + V+ A LHN E++N + DD
Sbjct: 303 NVIECSFGRLKARFACLKRAMDTNLEDLPSVIYACFVLHNFC-------ESRNENVQDDL 355
Query: 291 AEDDGEEE 298
E E
Sbjct: 356 VRSTIEYE 363
>gi|449684794|ref|XP_004210718.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 277
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 96/179 (53%), Gaps = 9/179 (5%)
Query: 44 VLSCLHWMGVGSPYHIIG-ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLND 100
+L L + GS ++ +T S+ +I R I V + + +K+P + LN+
Sbjct: 105 LLCALRYYATGSFQIVVSDSTAALSQPTISRIIRRVSLSLAKRI-NKYIKYPTNQHVLNE 163
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+ F FP V G +D T + I P+EHE +V+R+ +H++NV A+C KF V
Sbjct: 164 SKVKFYEIAKFPKVTGVIDCTHICIQKPHEHEYAYVDRSSNHSINVQAVCDNKGKFIGVV 223
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA 219
A WPGS HDAR++R S + +KF + ++ +++GDS YP WL+ P NP A+
Sbjct: 224 AKWPGSTHDARILRESKLGKKFMDGTFK----GLLIGDSGYPCFRWLLTPYL-NPTTAS 277
>gi|449684266|ref|XP_004210586.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 247
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 9/200 (4%)
Query: 43 KVLSCLHWMGVGSPYHIIG-ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLN 99
++L L + GS ++G +T S+ +I R I V + + +K+P LN
Sbjct: 53 QLLCALRFYATGSFQIVVGNSTAALSQPTISRIIRRV-SLSLAKRINEYIKYPTNLHVLN 111
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
+ F FP V G +D T + I P+EHE +V+R+ +H++NV A+C KF V
Sbjct: 112 KSRVKFYEIAEFPKVTGVIDCTHICILKPHEHEYAYVDRSSNHSINVEAVCDNKGKFIDV 171
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA 219
A W GS HDAR +R S + +K +E ++ +++GD YP WL+ P NP A+
Sbjct: 172 VAKWSGSTHDARTLRESKLGKKIMDETFK----GLLIGDYGYPCFCWLLTPYL-NPTTAS 226
Query: 220 DQRFLRCHKRTRRLIENSFG 239
R+ ++TR +IE FG
Sbjct: 227 QHRYNISLRKTRVIIEQVFG 246
>gi|326320058|ref|NP_001191896.1| nuclease HARBI1-like [Acyrthosiphon pisum]
gi|239790602|dbj|BAH71852.1| ACYPI004890 [Acyrthosiphon pisum]
Length = 369
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 132/275 (48%), Gaps = 10/275 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FR+ T +IL +I L+T +R +++ +K+L L + G+ G G
Sbjct: 48 FFRRFRMKKTTFMYILGKIENTLST-AKRMTSITPMQKLLVALRFYATGNFLITAGELVG 106
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S+ S CR + +V + + VK P+ E + + F + P V G +D T +
Sbjct: 107 ISEPSACRIVQQVTHALAE-LRPDYVKMPQNVEEIRQVSYHFYKIAKMPRVIGAIDCTHI 165
Query: 124 LIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I +P E+ + F NR G ++NV + N + + WPGSVHD + NS + K
Sbjct: 166 RIHSPGGENAEYFRNRKGYFSMNVQTVADANLQILDIVVRWPGSVHDQTIFLNSKL--KT 223
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
D E R F + +++ DS Y + +I P + + ++ + RTR ++E +G+ K
Sbjct: 224 DLENGR-FGNNLLVADSGYANSQHVITPLLQT-NNRVEELYNESLIRTRNVVERQYGVWK 281
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTID 276
+FP L++ R+ + ++A L NI ++D
Sbjct: 282 RRFPILSFGFRIKLETTMAAIVACAVLFNITRSLD 316
>gi|449688903|ref|XP_004211882.1| PREDICTED: uncharacterized protein LOC101238937 [Hydra
magnipapillata]
Length = 280
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 133/281 (47%), Gaps = 22/281 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V+ LH++ + Y ++ +
Sbjct: 6 FKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVVVALHYLALCEEYRVVSSL 64
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G +S+ +HE + + D + VK+P ENLN + GFP+ G VDG
Sbjct: 65 FGTGESTANLIVHEFINAVNDILLPKYVKFPLSVENLN-------KHIGFPHCVGAVDGC 117
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM---- 177
+ I AP + + N +++ + A Y+F + PG +D+ +++NS
Sbjct: 118 HIPILAPKDQAISYYNYKEWYSIVLFAFVDCRYRFIYTSVGSPGRNNDSYILQNSSLKAI 177
Query: 178 ----VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRR 232
++ K E ++GDSA+PL L+ P N + + + F + RR
Sbjct: 178 LESNIFDKCCNELGDSLVPLCLIGDSAFPLSRHLLKPYPENLELSKILKNFNKILCGARR 237
Query: 233 LIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
++EN+FG ++ +F + M FA +V A LHNI
Sbjct: 238 VVENAFGCVRARFRVICKRMEFDINFAKRIVNACVALHNIC 278
>gi|326673584|ref|XP_003199930.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 361
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 135/289 (46%), Gaps = 24/289 (8%)
Query: 8 EAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
E FR+S +I + P+L T + + ++V L+ + + Y +
Sbjct: 39 EDFRVSRETFRYICSTLKPSLQRLDTSFRLCIPLAKRVAIALYKLASTTEYRTVTNLFAV 98
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S++S+CR +H+ ++ +++ P++ + +I F K G P G +DG+ + I
Sbjct: 99 SRTSVCRCVHDFCKAVIAVLRPKLINTPDQAKMAEIADYFEDKFGIPQCVGAIDGSHIPI 158
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
P +++ F NR G H++ + A+ F+ +N PG HDA V++ S ++
Sbjct: 159 LKPPQYQSDFHNRKGWHSIILQAVVDGKGLFWDLNVGQPGREHDASVLKKSCLWTWATST 218
Query: 186 GWRPFPDAV-----------ILGDSAYPLKEWLIPPRTRNPDAA----ADQRFLRCHKRT 230
FP V ILGDSAYPL++WL+ P PD A + + R
Sbjct: 219 T--AFPGRVKNICGTEVGYFILGDSAYPLQKWLLKP---YPDTGRLTEAQELYNMRTSRA 273
Query: 231 RRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIA-LTIDR 277
R ++E++FG LK ++ CL+ + +V TLHN+ L +DR
Sbjct: 274 RCVVEHAFGRLKGRWKCLSKRNDCNVNVVVDMVETCCTLHNLCELHMDR 322
>gi|340368843|ref|XP_003382960.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 368
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 122/286 (42%), Gaps = 31/286 (10%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTF 105
L+++ S + G S++S+ I V VI ++ +K P EE++ D F
Sbjct: 100 LYYLADESRLRKTANSFGLSRASVSIIIRLVTTVISLYLGKKYIKLPVTEEDVQDKVKGF 159
Query: 106 LRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
P G VDGT + I P+ + ++NR ++LNV A C Y F V WPG
Sbjct: 160 YNAFNIPQCIGAVDGTHIEIKQPHSNPTDYINRKSRYSLNVQACCDYRYCFIDVVVKWPG 219
Query: 166 SVHDARVMRNSMVYRKFDEE-----------GWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
SVHDAR+ NS + + E G P P +LGD AYPL +++ T
Sbjct: 220 SVHDARMFANSKLNQMLKNEMIPPCKRRIDDGEIPVP-VYLLGDPAYPLMPYIMKEYTSG 278
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIAL 273
+Q F R +IE SFG LK +F CL M ++ MV+ A LHN
Sbjct: 279 GTTPREQYFGYKLCSARNVIECSFGRLKARFSCLKRAMDININDIPMVIYACFVLHNFC- 337
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNP 319
E N IH EE+V D P Q + +P
Sbjct: 338 ------ELNNEAIH---------EESVRSAISYDRIFSPLQQQASP 368
>gi|328710597|ref|XP_003244309.1| PREDICTED: hypothetical protein LOC100571044 [Acyrthosiphon pisum]
Length = 405
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 133/287 (46%), Gaps = 6/287 (2%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
R++ +L+++AP + T AL AK K+ L ++ G Y + + K
Sbjct: 64 LRINEEMFNVLLQKMAPKIQKQNTMMREALPAKLKLEVTLRYLATGDSYKTLQYIYRVGK 123
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
SSI + EV I++ + ++ P E N I F FPN G +DG ++++
Sbjct: 124 SSISEFVPEVFNAILEEL-KEYIEVPREKSKWNKIMDGFNSLWNFPNCIGAIDGKHIVME 182
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P + N G ++ ++A+ NY F ++ GS D + S + + +E
Sbjct: 183 CPANSGSNYFNYKGTFSIVLLALVDHNYNFTCIDIGSYGSNSDGGIFAKSALKKAIEEHV 242
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRT-RNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
D+VILGD A+PL +L+ P RN ++ + H R RR++EN FGIL +F
Sbjct: 243 LHTPTDSVILGDDAFPLLPYLMKPYARRNHLTEREKVYNYRHCRARRIVENGFGILSSRF 302
Query: 246 PCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ L+P+ +V A+ LHN ++ + +V+I D E
Sbjct: 303 RVFRRPITLTPENTIQLVKAACALHNWIRKSGKNPNSISVDIEDTET 349
>gi|255546077|ref|XP_002514098.1| conserved hypothetical protein [Ricinus communis]
gi|223546554|gb|EEF48052.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 131/296 (44%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+S E I E+ A+T T A+ +++V C+ + G P ++
Sbjct: 215 FKKAFRMSKATFETICNELDSAVTKKNTMLRDAIPVRQRVAVCIWRLATGEPLRLVSKRF 274
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + R ++WP EE L I + G PNV G + T +
Sbjct: 275 GLGISTCHKLVLEVCSAIRNVLMPRFLQWPDEEKLRMIKSEYEAVSGIPNVGGSMYTTHI 334
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + P F V WPGS+ D +V+ S
Sbjct: 335 PIIAPKISVAAYFNKRHTERNQKTSYSMTVQGVVDPRGVFTDVCIGWPGSMTDDQVLEKS 394
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + D I+G+S YPL +W++ P T F +R+ +
Sbjct: 395 ALFQRANRGA---LKDVWIVGNSGYPLMDWVLVPYTHQNLTWTQHAFNEKIGEVQRVGKE 451
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNE-NQNVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI + + E EI DDE
Sbjct: 452 AFARLKARWSCLQKRTEVKLQDLPVVLGACCVLHNICEMRNEEIEPGHRYEIFDDE 507
>gi|449452530|ref|XP_004144012.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449500453|ref|XP_004161101.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 483
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 132/296 (44%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEI-APALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S + + I +E+ + + T VA+ +++V C+ + G P ++
Sbjct: 156 FRRAFRMSKSTFDMICKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRF 215
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + WP+E+ L I F G P V G + T +
Sbjct: 216 GLGISTCHKLVLEVCSAIRKVLMPKFLNWPDESKLTKIKQEFESISGIPKVGGSIYTTHI 275
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + P+ F V WPGS+ D +V+ S
Sbjct: 276 PIIAPKNNVAAYFNKRHTERNQKTSYSITVQGVVDPSGVFTDVCIGWPGSMPDDQVLEKS 335
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++Y + D I+G+S YPL +WL+ P T F + +
Sbjct: 336 LLYERASMGS---LNDVFIVGNSGYPLMDWLLVPYTVQNLTWTQHGFNEKVGEIQAAAKA 392
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+FG LK ++ CL + Q +V+ A LHNI + ++ + E++DDE
Sbjct: 393 AFGRLKGRWTCLQKRTEVKLQELPVVLGACCVLHNICEMRKEKFDPELKFEVYDDE 448
>gi|449691647|ref|XP_004212750.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 182
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 99/168 (58%), Gaps = 7/168 (4%)
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P+EHE +V+R+ +H++NV A+C KF V A WPGS HDAR++R S + +KF + +
Sbjct: 2 PHEHEYAYVDRSSNHSINVQAVCDNKGKFIDVVAKWPGSTHDARILRESKLGKKFIDGTF 61
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
+ +++GDS YP WL+ P NP A+ R+ ++ R +IE FG K +
Sbjct: 62 K----GLLIGDSGYPCFRWLLTPYL-NPTTASQHRYNISLRKIRVIIEQVFGRGKRRLYL 116
Query: 248 LN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
L+ +R++P+ +V A LHN+A+ +D ++N I +DE +++
Sbjct: 117 LHGEIRMTPEKTCTLVAACAVLHNLAIQLDDGFMDEN-PIENDENDNE 163
>gi|328705664|ref|XP_001951832.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 408
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 134/289 (46%), Gaps = 17/289 (5%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
R++ + +L ++ ++ T AL+ + K+ L ++ G Y + + SK
Sbjct: 65 LRINESMFNILLGKVKDQISKQDTAMREALTPRIKLEIALRFLATGDSYTSLQYLYRVSK 124
Query: 69 SSICRSIHEVVPVIVDTFF---QRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S+I E +P + D F + ++ P+ + + I F FPN G +DG +
Sbjct: 125 SAI----SEFMPDVFDAIFAGLKEFIQVPKTTNDWDSIVHGFNLSWNFPNCFGAIDGKHI 180
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+I+ P F N G ++ ++A+ +Y F ++ GS D + + +K
Sbjct: 181 IIECPANSGSNFYNYKGSFSIVLLALVDHSYNFTCIDVGAYGSASDGGIFSKCTL-KKAI 239
Query: 184 EEGWRPFPD-AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC-HKRTRRLIENSFGIL 241
EE PD AV+LGD A+PL ++L+ P R Q+ H R RR++ENSFGIL
Sbjct: 240 EENQLNLPDEAVMLGDEAFPLTKYLMKPYPRRNILTKKQKIYNYRHCRARRIVENSFGIL 299
Query: 242 KEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
+F +RL P +V A+ +LHN I + NQ +EI D
Sbjct: 300 SSRFRVFRRPLRLLPSTVVKLVKAACSLHN---WIRKSGLNQTIEIIQD 345
>gi|15239835|ref|NP_196762.1| uncharacterized protein [Arabidopsis thaliana]
gi|7573368|emb|CAB87674.1| putative protein [Arabidopsis thaliana]
gi|16604352|gb|AAL24182.1| AT5g12010/F14F18_180 [Arabidopsis thaliana]
gi|25141199|gb|AAN73294.1| At5g12010/F14F18_180 [Arabidopsis thaliana]
gi|332004367|gb|AED91750.1| uncharacterized protein [Arabidopsis thaliana]
Length = 502
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 134/296 (45%), Gaps = 13/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+S + E I +E+ A+ T A+ +++V C+ + G P ++
Sbjct: 175 FKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKF 234
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I D + ++WP+ E+L +I F G PNV G + T +
Sbjct: 235 GLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHI 294
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ + A+ P F + WPGS+ D +V+ S
Sbjct: 295 PIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKS 354
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++Y++ + G + G +PL +W++ P T+ F + + +
Sbjct: 355 LLYQRANNGG--LLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKE 412
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQ-NVEIHDDE 290
+FG LK ++ CL + Q V+ A LHNI + E + VE+ DDE
Sbjct: 413 AFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKMEPELMVEVIDDE 468
>gi|291233883|ref|XP_002736880.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 389
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 20/285 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV--------ALSAKEKVLSCLHWMGVGSPY 57
F+ AFR+ + + E +A LT P N A+ +K+ L +G
Sbjct: 59 FRRAFRVPHS----VFEYLADILTGPLIENEPIHCGGRPAILPDKKIAIFLKCVGSMETI 114
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTF--LRKGGFPNV 114
I ++SS+ + +V VI+ + WP L I F + PN+
Sbjct: 115 LDIAQLFNITESSVIKVRRQVTNVILIHLLNNTIHWPLRRELQGISACFNDMHTSNLPNI 174
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G VDG+ + I P+E D + NR ++ + +C + +F V+ PG +HDARV+R
Sbjct: 175 IGAVDGSHIPISTPHEQPDAYYNRKKFRSVVLQGVCREDLQFIDVSVGCPGRMHDARVLR 234
Query: 175 NSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRL 233
NS ++ R ++GD+AYPL WL+ P N QR R R +
Sbjct: 235 NSTLWNTGMGNCQR--GQYHVIGDAAYPLTNWLMTPYRDNGHLDEHQRLFNTSLSRRRVI 292
Query: 234 IENSFGILKEKF-PCLNYMRLSP-QFAGMVVLASTTLHNIALTID 276
IE +FG LK +F LN + ++ +VLA+ +HNI + D
Sbjct: 293 IERAFGSLKRRFRRLLNGIDITDVNEINKIVLAACIIHNICIKHD 337
>gi|326667245|ref|XP_003198536.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 349
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 141/343 (41%), Gaps = 28/343 (8%)
Query: 2 TPFAFKEAFRLSATQVEFILEEIAPALTTPTER-NVALSAKEKVLSCLHWMGVGSPYHII 60
T + + R+ + + + + P LT R + + +V C+ + Y I
Sbjct: 19 TDHEWLQNLRMERSSFILLCDTLRPWLTRQNTRYRKPVPVEIRVAICIWRLATNLEYRSI 78
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVD 119
G S+ C EVV I + +K P I F + GFP VAG +D
Sbjct: 79 SHLFGVGVSTCCIITQEVVTAINVIMKPQYIKKPSAAEFKMIVQGFRDRWGFPQVAGAID 138
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
GT + I AP+ + NR G++++ + A+ KF+ +N PG VHDARV S ++
Sbjct: 139 GTHINIKAPSNTPADYYNRKGNYSIVLQAVVDNKMKFWDINVGQPGKVHDARVFCLSSLF 198
Query: 180 RKFDEE----GWRPFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQ-RFLRCHKR 229
W +A+ +LGDSAYPL WL+ P +Q +F +
Sbjct: 199 DGGSSGTLLPTWTETFEAIDVPLFLLGDSAYPLSHWLMKPYPEGRGVTPEQIKFNHRLSQ 258
Query: 230 TRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
R +E +FG LK ++ C L +V A LHN E +N E +
Sbjct: 259 ARMTVERAFGRLKGRWRCLLKQCEAHITLVSRIVSACCVLHNFC-------EVRNEEFY- 310
Query: 289 DEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYF 331
DD EE +E ++ +P++ N G L YF
Sbjct: 311 --GRDDVREE-----EEQSQDARPQEQENVQGNNVRDALCRYF 346
>gi|156546775|ref|XP_001605652.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 363
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 131/278 (47%), Gaps = 17/278 (6%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG--A 62
F++ +R+ ++E + P L E + +VLS L ++ GS +
Sbjct: 44 VFRKHYRMIPAIAIHLIELLRPQLI---EHEQGIPVHLQVLSVLRFLAEGSYQKGVAHDC 100
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
H +S+ + +HEVVP IV + +V +P E ++ F GFP V G +D
Sbjct: 101 YHPMGQSTFSKYLHEVVPAIVRLADEYIV-FPRNMEERQEMSRRFEESLGFPGVIGAIDC 159
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
T+ I P+EHE+ +VN N DH+LNV + +Y + GS +D V R+S +
Sbjct: 160 TLPKIHTPHEHEEAYVNHNQDHSLNVQVVSDMDYNILNIRIT-NGSSNDKFVWRHSQMRE 218
Query: 181 KFDEEGWRPFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
+ P P + ++GD Y L+ P + + R+ +RTR +E
Sbjct: 219 TMYQLRNNPDPAEINNQYYLIGDGGYTPSPVLLTPCQEAAPGSPEFRYSDRVRRTRCRVE 278
Query: 236 NSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHN 270
+FGI K+ F C+N + +P+F+ +VLA+ L+N
Sbjct: 279 QTFGIWKQVFRCINGDRSLHYTPEFSSQIVLATAVLYN 316
>gi|322785647|gb|EFZ12298.1| hypothetical protein SINV_06104 [Solenopsis invicta]
Length = 365
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTER--NVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK FRL +IL+ I P L ER +SA +++ L + Y +
Sbjct: 61 FKSNFRLRRDTYGYILDLIRPDLEKWPERFGRYPISASKQLYVALWMLATPDAYRSVCTK 120
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFL-RKGGFPNVAGCVDGTM 122
K++ RS+ VV + + WP + T L ++ FP + G VDGT+
Sbjct: 121 FDIGKATAWRSVLRVVKALYK-LRNVFISWPTREQAENTWTKLEQQYQFPGIIGAVDGTL 179
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I AP H + ++ R HA+ + +C +F A P SVHD R S + +K
Sbjct: 180 VKITAPKVHPEAYICRKHYHAIQLQVVCDAELRFIHCYAGQPESVHDMRTFMYSGLKQKC 239
Query: 183 DEEGWRPFP-DAVILGDSAYPL-KEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
+E FP D+ +LGD+AY + + ++P R AA+ F + R ++E S G+
Sbjct: 240 NEYF---FPEDSHLLGDAAYTIQRHIMVPYRDNGHLTAAENLFNKKLSSARMIVERSIGL 296
Query: 241 LKEKFPCL------NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
LK ++ CL L P++ ++A LHNI L + RD + I D
Sbjct: 297 LKGRWRCLLDKLPMTRTDLIPRY----IIACCVLHNICL-LRRDEIEIPILIED 345
>gi|328709487|ref|XP_003243975.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 332
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 138/306 (45%), Gaps = 31/306 (10%)
Query: 28 LTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFF 87
L+ T N A++ +K+L L + GS G G SKS+ C +H+V V +
Sbjct: 43 LSKETVINNAITPAQKLLLALRFYATGSFSISAGDFMGVSKSAACVIVHDV-SVALAKLR 101
Query: 88 QRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNE-HEDQFVNRNGDHAL 144
++VK PE N + ++ F F V G +D T + I +P + + F NR G +L
Sbjct: 102 PQIVKMPETNDEIKELHKQFYGLAKFLLVIGAIDCTHIKIQSPGGPNAEYFRNRKGWFSL 161
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK 204
NV + P K + WP S HD+ + +S + R D + + +++I+ DS Y
Sbjct: 162 NVQTVVSPKLKIMDIVVRWPDSTHDSTIFSHSKINR--DLHVTQKWANSLIVADSGYANN 219
Query: 205 EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVL 263
++ P NP A + + RTR +E + + K +FP L+ MRL V++
Sbjct: 220 MHIVTPFL-NPHAGPENLYNESQIRTRNPVERCYVVWKRRFPVLSLGMRLQICNIQTVIV 278
Query: 264 ASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDD---PDEIDEEEQPR-QDRNNP 319
A + LHNIA+ D EE +DD PDE+ E P ++R N
Sbjct: 279 ACSVLHNIAI-------------------DCNEEMPMDDIQLPDELTELMTPTIENRQNN 319
Query: 320 GTQRLA 325
RL
Sbjct: 320 ARTRLV 325
>gi|449687721|ref|XP_004211522.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 173
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 7/178 (3%)
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+D T + I P+EHE +V+R+ H +NV A+C KF V A WP S HDAR++ S
Sbjct: 1 MDCTHICIQKPHEHEYVYVDRSSKHLINVRAVCDNKEKFNDVVAKWPRSTHDARILGESK 60
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ +KF + +++GDS YP L+ P NP A+ R+ ++TR +IE
Sbjct: 61 LGKKFMNVTLK----GLLIGDSGYPCFRCLLTPYL-NPTTASQHRYNISLRKTRVIIEQV 115
Query: 238 FGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
FG K +F L+ +RL+P+ V A T LHN+A+ ++ + ++N I DE +++
Sbjct: 116 FGRWKRRFHMLHGEIRLTPERTCTFVTAYTVLHNLAIQLNHGDMDEN-PIESDEIDNE 172
>gi|115465799|ref|NP_001056499.1| Os05g0593000 [Oryza sativa Japonica Group]
gi|48475235|gb|AAT44304.1| unknown protein [Oryza sativa Japonica Group]
gi|113580050|dbj|BAF18413.1| Os05g0593000 [Oryza sativa Japonica Group]
gi|222632770|gb|EEE64902.1| hypothetical protein OsJ_19762 [Oryza sativa Japonica Group]
Length = 423
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 131/300 (43%), Gaps = 14/300 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S F+ + +A A+ T A+ +++V C+ + G P ++
Sbjct: 96 FRRAFRMSRPTFHFLCDALAAAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRVVSKRF 155
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT-FLRKGGFPNVAGCVDGTMV 123
G S+ + I EV I + R + WP+ + T F G P V G + T +
Sbjct: 156 GLGISTCHKLILEVCAAIRNLLMPRFLHWPDHPTSTAYKTRFEATSGVPGVVGAMYTTHI 215
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP ++NR +++ + + GP+ F V WPGS+ D +V+R S
Sbjct: 216 PIIAPKVSVAAYLNRRHTERNHKTSYSITLQGVVGPDGTFTDVCIGWPGSMSDEQVLRKS 275
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + ++G ++YPL EW++ P + F RR+
Sbjct: 276 ALHQRASAAA---GSMSWVVGGASYPLTEWMLVPYAQRNLTWTQHAFNEKVGEVRRVATE 332
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRD-NENQNVEIHDDEAEDD 294
+F LK ++ CL + Q V+ A LHNI T D + + ++ DE EDD
Sbjct: 333 AFVRLKGRWACLQKRTEVKLQDLPAVLAACCVLHNICETRGEDMDPDLRCDLPPDEEEDD 392
>gi|357437265|ref|XP_003588908.1| hypothetical protein MTR_1g015070 [Medicago truncatula]
gi|355477956|gb|AES59159.1| hypothetical protein MTR_1g015070 [Medicago truncatula]
Length = 1056
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 133/296 (44%), Gaps = 13/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+ + + I EE+ A+ T A+ +++V CL + G P I+
Sbjct: 183 FKKAFRMGKSTFDLICEELNSAIVKEDTTLRTAIPVRQRVAVCLWRLATGDPLRIVSKRF 242
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP E +L I F G PNV G + + V
Sbjct: 243 GLGISTCHKLVLEVCTAIKTVLMPKYLQWPNETSLRKIKNEFEGMSGIPNVVGSMYTSHV 302
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + N F V WPGS+ D +V+ S
Sbjct: 303 PIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDTNGVFTDVCIGWPGSMPDDQVLEKS 362
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + G I+G S YPL +W++ P T+ F +++ ++
Sbjct: 363 ALFQRANNGG--LLKGVWIVGSSGYPLMDWVLVPYTQQNLTWTQHGFNEKIGEIQKVAKD 420
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI + ++ + V+I DDE
Sbjct: 421 AFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNICEMKGEKMDPELMVDIVDDE 476
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 133/296 (44%), Gaps = 13/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+ + +FI E++ A+ T A+ +++V CL + G P I+
Sbjct: 729 FKKAFRMGKSTFDFICEKLNSAIVKEDTTLRTAIPVRQRVAVCLWRLATGDPLSIVSKRF 788
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP E +L I F G PNV G + + V
Sbjct: 789 GLGISTCHKLVLEVCTAIKTVLMPKYLQWPNETSLRKIKNEFEGMSGIPNVVGSMYTSHV 848
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ V + N F V WPGS+ D +V+ S
Sbjct: 849 PIIAPKISVADYFNRRHTERNQKTSYSITVQGVVDTNGVFTDVCIGWPGSMPDDQVLEKS 908
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + G I+G S YPL +W++ P T+ F +++ ++
Sbjct: 909 ALFQRANNGG--LLRGVWIVGSSGYPLMDWVLVPYTQQNLTWTQHGFNEKIGEIQKVAKD 966
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+F LK ++ CL + Q + + A LHNI + ++ + V+I DDE
Sbjct: 967 AFARLKGRWSCLQKRTEVKLQDLPVELCACCVLHNICEMKGEKMDPELMVDIVDDE 1022
>gi|449687590|ref|XP_004211494.1| PREDICTED: uncharacterized protein LOC101237280 [Hydra
magnipapillata]
Length = 450
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 126/278 (45%), Gaps = 27/278 (9%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 143 MEECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYR 201
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +HE + + D + VK+P ENLN F GFP G
Sbjct: 202 VVSSLFGIGKSTPNLIVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAILGFPQCVG 261
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I AP + + N G +++ + A+ Y+F + P + +S
Sbjct: 262 AVDGCHIPISAPKDQATSYYNYKGWYSIVLFAVADSRYRFIYTSVGSPE-------LGDS 314
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIE 235
+V P ++GDS +PL L+ P N + + Q+ F + RR++E
Sbjct: 315 LV----------PL---CLIGDSVFPLTRHLLKPYPENLELSEIQKNFNKILCGARRVVE 361
Query: 236 NSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
N+FG +K +F + FA +V A TLHNI
Sbjct: 362 NAFGRVKARFRVICKRTECDINFATRIVNACVTLHNIC 399
>gi|449676027|ref|XP_002158178.2| PREDICTED: uncharacterized protein LOC100199123 [Hydra
magnipapillata]
Length = 521
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 123/278 (44%), Gaps = 32/278 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ ++ P L TT T R +S ++V LH++ Y
Sbjct: 219 MEECEFKEHFRVNRNTFNFLVNKLHPHLGKTTKTMRE-PISVVKRVAVALHYLASCEEYR 277
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +HE + + D + VK+P ENLN F GFP G
Sbjct: 278 VVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAILGFPQCVG 337
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I AP + + N G +++ + A+ Y+F
Sbjct: 338 AVDGCHIPISAPKDQATSYYNYKGWYSIVLFAVVDSRYRF-------------------- 377
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIE 235
+Y K + + GDSA+PL L+ P N + + Q+ F + RR++E
Sbjct: 378 -IYTKLGDS----LVPLCLKGDSAFPLTRHLLKPYPENLELSEIQKNFNKILCGARRVVE 432
Query: 236 NSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
N+FG +K +F + M FA +V A TLHNI
Sbjct: 433 NAFGRVKARFRVICKRMECDINFATRIVNACVTLHNIC 470
>gi|255550093|ref|XP_002516097.1| conserved hypothetical protein [Ricinus communis]
gi|223544583|gb|EEF46099.1| conserved hypothetical protein [Ricinus communis]
Length = 498
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 133/297 (44%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK+AFR+S + I EE+ + T RN A+ +++V C+ + G P ++
Sbjct: 172 FKKAFRMSKATFDLICEELHSCIQKEDTTLRN-AIPVRQRVAVCIWRLATGEPLRLVSKR 230
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + + ++WP+E+ L + F G PNV G + T
Sbjct: 231 FGLGISTCHKLVLEVCSAIKNVLMPKYLQWPDEDSLKKVKNEFESISGIPNVVGSMYTTH 290
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + N+ +++ V + P F V WPGS+ D +V+
Sbjct: 291 IPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEK 350
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y++ + D I+G S YPL +W++ P T+ F + + +
Sbjct: 351 SALYQRANGG---LLKDVWIVGSSGYPLMDWVLVPYTQQHLTWTQHAFNEKIGEVQTVAK 407
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI L + + VE+ DDE
Sbjct: 408 EAFTRLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNICELRKEEIDPKLRVELVDDE 464
>gi|449691682|ref|XP_004212760.1| PREDICTED: uncharacterized protein LOC100208553 [Hydra
magnipapillata]
Length = 492
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 122/272 (44%), Gaps = 34/272 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ YH++ +
Sbjct: 199 FKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYHVVSSL 257
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G +S+ +HE V + D + VK+P ENLN F FP G VDG
Sbjct: 258 FGIGESTANLIVHEFVNAVNDILLPKYVKFPLSVENLNKYSRDFEAILDFPQCVGAVDGC 317
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ I AP + + N G ++ + A+ Y+F + PG +D+ +++NS +
Sbjct: 318 HIPISAPKDQAISYYNYKGLCSIVLFAVVDCRYRFIYTSVGSPGRNNDSYILQNSSL--- 374
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
+ ++ + G L + L+P +RR++EN+FG +
Sbjct: 375 ------KAILESNLFGKCCKELGDSLVP-------------------LSRRVVENAFGRV 409
Query: 242 KEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
K +F + M FA +V A TLHNI
Sbjct: 410 KARFRVICKRMECDINFATRIVNACVTLHNIC 441
>gi|356507015|ref|XP_003522267.1| PREDICTED: uncharacterized protein LOC100809836 [Glycine max]
Length = 441
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 134/309 (43%), Gaps = 15/309 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+ E I EE+ + T A+ K++V CL + G P ++
Sbjct: 108 FKKAFRMRRLTFEAICEELNSVIAKEDTTLRSAIPVKQRVAVCLWRLATGDPLSVVSRRF 167
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP L DI F G PNV G + + V
Sbjct: 168 GLGISTCHKLVLEVCTAIKTVLMPKYLQWPNVVALRDIKGEFENISGIPNVVGSMLTSHV 227
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + F V WPGS+ D +V+ S
Sbjct: 228 PIIAPKMNVSAYFNKRQTERNHKTSYSVTVQGVVDHRGVFTDVCIGWPGSMGDDQVLEKS 287
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++ + + G ++G YPL +W++ P T+ F RR+ +
Sbjct: 288 ALFHRANNAGLN-LSGVWVVGGCGYPLLDWVLVPYTQQNLTWTQHAFNEKIGEVRRVAKG 346
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+FG LK ++ CL + Q V+ A LHNI + NE + E+ ++ +DDG
Sbjct: 347 AFGRLKGRWGCLQKRTEVKLQDLPFVLGACCVLHNIC---ELKNEEMDPELLEN-IDDDG 402
Query: 296 EEEAVDDPD 304
+ V +P+
Sbjct: 403 DYAEVMEPE 411
>gi|328722179|ref|XP_003247501.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 338
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 119/277 (42%), Gaps = 46/277 (16%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F + +RLS V+ ++ I P + P R+ ++ EKV L++ G
Sbjct: 42 FLKNYRLSKDLVQQLIALITPYIE-PERRSSSIKLSEKVFLALNFFATGC---------- 90
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
+ P I F FP V GC+D T + I
Sbjct: 91 -------------------------YQTPIGFYFFIFFRFFINTSFPGVVGCIDCTHIAI 125
Query: 126 DAPNEH-----EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
PN E +VNR G H+LNV IC +VNA +PGS D+ + ++ +
Sbjct: 126 VPPNNQNAEIPEYIYVNRKGYHSLNVQLICDSKLYILSVNAKFPGSTADSHIWNSTTDIK 185
Query: 181 KFDEEGW-RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
EE + R + +LGDS Y L+ WL+ P + ++++ + TR +IE G
Sbjct: 186 NILEELYRREYKGIYLLGDSGYALRPWLLTPYSET-TTNEEEKYNKMQMSTRSIIERCNG 244
Query: 240 ILKEKFPCLNYMRL---SPQFAGMVVLASTTLHNIAL 273
+LK +F CL R+ +P+ +G +V A LHN+ +
Sbjct: 245 VLKARFRCLLKHRVLHYTPKKSGKIVNACVILHNMCI 281
>gi|340375949|ref|XP_003386496.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 435
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/329 (27%), Positives = 133/329 (40%), Gaps = 45/329 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL---------TTPTERNVALSAKEKVLSCLHWMGVGSP 56
+K+ FR++ + +E+ P + TER VAL+ L+++
Sbjct: 102 WKDNFRMNKANFFKLCDELRPFIEKKSTNMRQCIETERQVALT--------LYYLSDEGC 153
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNV 114
G ++SS I V +I R +K P EE +N+ F G P
Sbjct: 154 LRKTANAFGIARSSASVVIRRVCYIISIHLGPRYIKLPLTEEEVNEKVTGFYIAFGVPQC 213
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DGT + I AP + ++NR ++LNV A C Y F V WPGSVHDARV
Sbjct: 214 IGAIDGTHIDIKAPKSNPTDYINRKNRYSLNVQACCDHKYCFLDVVVKWPGSVHDARVFS 273
Query: 175 NSMV-----------YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
NS + R+ E P P ++GD AYPL+ +L+ +Q +
Sbjct: 274 NSTLNNLLKTAQIPPCRRSLVEDRDPIP-VFLIGDPAYPLQTYLMKEYANGGSTVQEQYY 332
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN------------ 270
R +IE +FG LK F L + ++ V+ A LHN
Sbjct: 333 GYKLCSARMVIECAFGRLKAHFGILKRPLDININEVAHVIYACFVLHNYCELNHESIAEE 392
Query: 271 -IALTIDRDNENQNVEIHDDEAEDDGEEE 298
+ + I DN Q + + EEE
Sbjct: 393 RVQVAIHYDNRFQPPTVRNRSTTHCNEEE 421
>gi|328698221|ref|XP_003240585.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 326
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 111/220 (50%), Gaps = 12/220 (5%)
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNV 114
+H+ T+ +I R++ E + + + +K PE N+ + + F FP +
Sbjct: 63 FHLKSKTNRYLTCNIVRTVTEAIASLRPLY----IKMPEHHCNIQETRLKFYNIARFPRI 118
Query: 115 AGCVDGTMVLIDAPNEH-EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G +D T V + +P + + + NR G +LNV + G + + + A WPGS HD +
Sbjct: 119 IGAIDCTHVKLQSPGGNIAEVYRNRKGYFSLNVQVVGGRSLEILDIVARWPGSTHDQVIF 178
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
NS ++ KF+ D ++LGD Y + +++ P +P+ A+ + RTR
Sbjct: 179 NNSTIHFKFETN---EMGDNILLGDGGYECRPYILVPLI-SPNTNAELLYNESQIRTRNT 234
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
IE FG+ K +FP L+ +R +P A +++A+ +HN+A
Sbjct: 235 IERLFGVWKRRFPILSLGIRTTPDRAQAIIVATAVVHNLA 274
>gi|449667873|ref|XP_004206663.1| PREDICTED: uncharacterized protein LOC101235010 [Hydra
magnipapillata]
Length = 351
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 121/254 (47%), Gaps = 19/254 (7%)
Query: 29 TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQ 88
TT T R +S ++V LH++ Y ++ + G KS+ +HE + + D
Sbjct: 67 TTTTMRE-PISVVKRVAVALHYLASCEEYCVVSSLFGIGKSTANLIVHEFINAVNDILLP 125
Query: 89 RMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+ VK+P + F GFP G VDG + I AP + + N G +++ + A
Sbjct: 126 KYVKFP--------LDFEAILGFPQCVGAVDGCHIPISAPKDQAISYYNYKGWYSIVLFA 177
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNS----MVYRKFDEEGWRPFPDAVI----LGDSA 200
I Y+F + PG +D+ +++NS ++ ++ + D+++ +GDSA
Sbjct: 178 IVDCRYRFIYTSVGLPGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLSLIGDSA 237
Query: 201 YPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFA 258
+PL L+ P N + + Q+ F + RR++EN+FG +K +F + M FA
Sbjct: 238 FPLTRHLLKPYPENLELSEIQKHFNKILCGARRVVENAFGRVKARFRVICKRMECDINFA 297
Query: 259 GMVVLASTTLHNIA 272
+V A TLHNI
Sbjct: 298 TRIVNACVTLHNIC 311
>gi|195390819|ref|XP_002054065.1| GJ24230 [Drosophila virilis]
gi|194152151|gb|EDW67585.1| GJ24230 [Drosophila virilis]
Length = 347
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 135/304 (44%), Gaps = 19/304 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F + FR+ ++IL +I P T L+ ++ + LH++ GS Y + + +
Sbjct: 41 FIKLFRVDKETFKWILRKIQPDFTLSY-----LTPALQLAATLHFLATGS-YKLSVSRNR 94
Query: 66 P---SKSSICRSIHEVVPVIVDTFFQRMV--KWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
P +S+ R +H V+P++ + + + ++ + F P V CVDG
Sbjct: 95 PVNIGRSTFSRILHNVIPLMEAVLCREAITLRMSQQQIKQSRDYFYENYQLPRVVACVDG 154
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ I P E + N +++N M +C N + A++ PGS D + NS
Sbjct: 155 MQIKILKPVRDETIYFNDKDYYSINAMLVCNYNMEILAIDCTHPGSCSDHYIWSNSQA-- 212
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA-AADQRFLRCHKRTRRLIENSFG 239
E R +L D+AY L+ +++ P RN D + +QRF H R +++ +
Sbjct: 213 --REYLSRNVKCHYVLADTAYALENYVLTP-YRNADQDSVEQRFNERHAAARSVVQRTIN 269
Query: 240 ILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
+LK +F C+ + +F VV LHNI R ++ +H+ E DGE E
Sbjct: 270 LLKSRFGCMQRALMYDARFVANVVNVCCALHNICRR-RRTPLDEREMLHEQMDEFDGEIE 328
Query: 299 AVDD 302
DD
Sbjct: 329 YEDD 332
>gi|390360486|ref|XP_001201441.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 304
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 130/296 (43%), Gaps = 49/296 (16%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+R + ++ I P L T+RN AL + +V L++ G+ HG ++S
Sbjct: 41 YRFTRDGCMHVINLIRPLLQHDTQRNQALPPELQVFIALNFYATGAVLDSTATIHGITRS 100
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ R IH V V + T ++K+P E +N + F GFP V G VDGT + ++
Sbjct: 101 TASRVIHRV-SVTLGTLKHEIIKFPTTMEEVNRAQVDFFNISGFPQVIGVVDGTHIRLNG 159
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
A GP H++RV + + G
Sbjct: 160 --------------------APLGPGE-------------HNSRV---GQTFEDLQQHG- 182
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
++LGDS Y L+ +L+ P NP A+Q + R H TR IE G LK KF C
Sbjct: 183 ------LLLGDSGYALRPYLMTP-VLNPRTPAEQAYNRAHATTRVRIEQVNGQLKHKFRC 235
Query: 248 -LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDD 302
LN M+++P+ A ++ A LHN+A + + +E + I E E D ++ VD+
Sbjct: 236 LLNGMQMAPRRACKIITACAVLHNVAKDLKQPDE-VDFGIEQHEHEPDRDDHIVDN 290
>gi|209863095|ref|NP_001129460.1| uncharacterized protein LOC100191015 [Danio rerio]
gi|190337069|gb|AAI62733.1| Similar to ENSANGP00000010363 [Danio rerio]
gi|190338874|gb|AAI62738.1| Similar to ENSANGP00000010363 [Danio rerio]
Length = 394
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 126/282 (44%), Gaps = 16/282 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FRL Q + +L ++ P + T ++ E++ CL ++ G Y I ++
Sbjct: 53 FQRYFRLDREQFDSLLSKVGPQIARQDTNYRQSIEPAERLAICLRFLATGDSYRTIAFSY 112
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S++ + V I DT Q ++ P E+ +I FL + FPN G +DG V
Sbjct: 113 RVGVSTVAGIVAAVTRAIWDTLAQEVMPVPTTEDWRNISTDFLHRWNFPNCLGSIDGKHV 172
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+I AP+ F N G +++ ++A+ Y+F V+ G + D V+ NS+ +
Sbjct: 173 VIKAPDNSGSLFYNYKGTYSVVLLAVVDSQYRFRVVDVGSYGRMSDGGVLANSIFGQALR 232
Query: 184 EEGWRPFPDAVILG-------------DSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
+ DA++ G D A+PL+ L+ P + + + F R
Sbjct: 233 DGALGLPQDALLSGAEHFGPQPHVFVADEAFPLRRDLMRPFPGHNLSGRQRIFNYRLSRA 292
Query: 231 RRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNI 271
R ++EN+FGIL ++ + +SP V A+ LHN
Sbjct: 293 RLIVENTFGILTAQWRMYRGAIEISPANVDACVKATCVLHNF 334
>gi|449691348|ref|XP_004212640.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 296
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 113/237 (47%), Gaps = 43/237 (18%)
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCV 118
A P+ S I R + V+ + +K+P + LN+ + F FP + G +
Sbjct: 79 AALSQPTISKIIRQFSMSLAKRVNEY----IKYPTDPHVLNESRVNFYNVAEFPKITGLI 134
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DGT V I P + E +VNR+ +H++NV A+C N KFY + A WPGS HDARV+R S
Sbjct: 135 DGTHVCIQKPRKREYIYVNRSSNHSINVQAVCAYNGKFYDIVAKWPGSTHDARVLRESN- 193
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
LG + AA +R+ +RTR LIE F
Sbjct: 194 -----------------LGST-----------------TAAQRRYNIFLRRTRVLIEQVF 219
Query: 239 GILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTI-DRDNENQNVEIHDDEAED 293
G K +F L+ ++++P+ ++ A L N+A+ + D D ++ +E +++ ED
Sbjct: 220 GRWKRQFHLLHGEVQMTPERTCTIIAACAVLQNLAIELNDVDIDDNPIENYENNNED 276
>gi|218197366|gb|EEC79793.1| hypothetical protein OsI_21220 [Oryza sativa Indica Group]
Length = 423
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 130/300 (43%), Gaps = 14/300 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S F+ + +A A+ T A+ +++V C+ + G P ++
Sbjct: 96 FRRAFRMSRPTFHFLCDALAAAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRVVSKRF 155
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT-FLRKGGFPNVAGCVDGTMV 123
G S+ + I EV I + R + WP+ + T F G V G + T +
Sbjct: 156 GLGISTCHKLILEVCAAIRNLLMPRFLHWPDHPTSTAYKTRFEATSGVSGVVGAMYTTHI 215
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP ++NR +++ + + GP+ F V WPGS+ D +V+R S
Sbjct: 216 PIIAPKVSVAAYLNRRHTERNHKTSYSITLQGVVGPDGTFTDVCIGWPGSMSDEQVLRKS 275
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + ++G ++YPL EW++ P + F RR+
Sbjct: 276 ALHQRASAAA---GSMSWVVGGASYPLTEWMLVPYAQRNLTWTQHAFNEKVGEVRRVATE 332
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNE-NQNVEIHDDEAEDD 294
+F LK ++ CL + Q V+ A LHNI T D + + ++ DE EDD
Sbjct: 333 AFVRLKGRWACLQKRTEVKLQDLPAVLAACCVLHNICETRGEDMDPDLRCDLPPDEEEDD 392
>gi|224113883|ref|XP_002332479.1| predicted protein [Populus trichocarpa]
gi|222832470|gb|EEE70947.1| predicted protein [Populus trichocarpa]
Length = 465
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 136/308 (44%), Gaps = 27/308 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK+AFR+S + + I EE+ + T RN A+ +++V C+ + G P ++
Sbjct: 139 FKKAFRMSKSTFDSICEELNSVIAKEDTTLRN-AIPVRQRVAVCIWRLATGEPLRLVSKR 197
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + + ++WP E+NL F G PNV G + T
Sbjct: 198 FGLGISTCHKLVLEVCSAIRNVLMPKYLQWPDEDNLRKTKNEFESISGIPNVVGSMYTTH 257
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + N+ +++ V + P F V WPGS+ D +V+
Sbjct: 258 ISIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEK 317
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S ++++ + D ++G S YPL +W++ P T+ F + + +
Sbjct: 318 SALFQRANGG---LLKDVWLVGTSGYPLMDWVLVPYTQQNLTWTQHAFNEKIGEIQSVAK 374
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
++F LK ++ CL + Q +V+ A LHNI E+ ++E + +
Sbjct: 375 DAFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNIC------------EMQNEEMDPE 422
Query: 295 GEEEAVDD 302
E VDD
Sbjct: 423 LRMELVDD 430
>gi|432950744|ref|XP_004084590.1| PREDICTED: uncharacterized protein LOC101165370 [Oryzias latipes]
Length = 676
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 131/284 (46%), Gaps = 50/284 (17%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + +I I P +T + R AL++++ + + L + GS + IG S
Sbjct: 416 ERYRFSLQSITYIHNLIQPYITNISHRGRALTSEQILCAALRFFANGSFLYNIGDAEHIS 475
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
K+++CR+ ++ + + F + + +P + L I F R +++ D
Sbjct: 476 KATVCRAFRKMC-LALKRFLRIFIVFPVHKPLRAIKEEFHR--------------IIICD 520
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
A + D+ A WPGSVHD+R+ R S + + +
Sbjct: 521 AAHITNDE--------------------------AKWPGSVHDSRIFRESALSNRLECGE 554
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
D + GDS YP + L+ P T P+ QRF H RTR +E + G+LK +F
Sbjct: 555 I----DGFLQGDS-YPCQHKLLTPYT-EPEQGPQQRFNLAHSRTRARVEMTIGLLKARFQ 608
Query: 247 CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
CL +++++P+ A +++A LHNIA I R ++ + I D E
Sbjct: 609 CLRHLKVTPERACDIIVACVVLHNIA--IFRGEQHPALHIQDPE 650
>gi|390367614|ref|XP_785880.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 248
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 103/221 (46%), Gaps = 33/221 (14%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ +R S +++ + L PT R+ AL A ++ L + G+ +G HG
Sbjct: 34 LRKRYRFSRRGCIYLIHLLRIDLEHPTGRSRALPASLQIFLALAFFASGALLTTVGTMHG 93
Query: 66 PSKSSICRSIHEVVPVI---------------------VDTFFQRMVKWP--EENLNDIP 102
S +S R+I V + +F + +K+P E ++ I
Sbjct: 94 ISIASTSRAIRRVTKALFRRRNQSNYSYRHLVFALFGTAHSFLYQFIKFPKSERDVLHIQ 153
Query: 103 MTFLRKGGFPNVAGCVDGTMVLIDA----PNEHEDQFVNRNGDHALNVMAICGPNYKFYA 158
F GFP V G +D T V + P EH +VNR H++NV ICGP++
Sbjct: 154 EQFFAVAGFPKVVGVIDCTHVHLHGSKLKPIEH--LYVNRKNRHSINVQLICGPDFLISN 211
Query: 159 VNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDS 199
V A WPGS HD+R+++ S + RKF++ ++ ++LGDS
Sbjct: 212 VVARWPGSTHDSRILQMSRLARKFEQREYK----GILLGDS 248
>gi|432952358|ref|XP_004085075.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 249
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 115/247 (46%), Gaps = 35/247 (14%)
Query: 44 VLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPM 103
+ + L + GS + IG H SK+++CR++ +V + + F + +
Sbjct: 12 LCAALRFFANGSFLYNIGDAH-ISKATVCRAVRKVC-LALKRFLHIFIVF---------- 59
Query: 104 TFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
G P + A E + VNR H++NV IC + V A W
Sbjct: 60 ----SGHKP------------LRAIKEEFHRIVNRKSIHSINVQIICDAAHIITNVEAKW 103
Query: 164 PGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
PGSVHD+R+ R S + + + D +LGD YP + L+ P P+ QRF
Sbjct: 104 PGSVHDSRIFRESTLSNRLECGEI----DGFLLGDRGYPCQPKLLTPYP-EPEQGPQQRF 158
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQN 283
H RTR +E + G+L+ +F CL +++++P+ A ++A LHNIA I R ++
Sbjct: 159 NLTHSRTRARVEMTIGLLRARFQCLLHLKVTPERACDFIVACVVLHNIA--IFRGEQHPA 216
Query: 284 VEIHDDE 290
+ I D E
Sbjct: 217 LHIQDPE 223
>gi|328724776|ref|XP_003248249.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 275
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 123/261 (47%), Gaps = 13/261 (4%)
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTF--LRKGGFPNVAG 116
IG S+ CR+I + + V V WP+ + + I F LR P+ G
Sbjct: 5 IGDRFDLSRGLACRTILKFIRVF-SKLLDNFVLWPKGSSDAAIIKDFKSLRFNYMPSTIG 63
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
C+DG + I AP + + NR ++ ++A+C ++F + + WPGS HDARV +NS
Sbjct: 64 CIDGCHIRIHAPRDKRSDYTNRKMFQSIVLLAVCNAKFEFTYMFSGWPGSSHDARVFKNS 123
Query: 177 MVYRKFDEEGWRPFP-DAVILGDSAYPLKE-WLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+ + ILGDSA+PL E ++P + + ++ F R TR +I
Sbjct: 124 SLGHTLINSPQEIISKNQHILGDSAFPLLENLMVPYKATHILTEKEKSFNRRLSSTRVVI 183
Query: 235 ENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTTLHNIALT------IDRDNENQNVEIH 287
E +FG+L +F L + S + + V ++ LHN+AL ID D+ E
Sbjct: 184 EQAFGLLLGRFWRLKILEAKSIELMSLTVTSACILHNLALQNNDFIEIDNDHNTDLTEFT 243
Query: 288 DDEAEDDGEEEAVDDPDEIDE 308
++ ++ A++ + +D+
Sbjct: 244 EENNQECSVSLAINQRNGVDK 264
>gi|301624428|ref|XP_002941507.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 350
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 126/273 (46%), Gaps = 9/273 (3%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
+ +RL + + + L T R+ AL KV + L + G+ G G S
Sbjct: 40 QRYRLHPALIRDLCHLLEHDLQPSTGRSHALPVYVKVTAALDFYTSGTFQTPAGDAAGIS 99
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT--FLRKGGFPNVAGCVDGTMVLI 125
++S+ R + +V IV R +++P + F R GFP G V T V +
Sbjct: 100 QASMSRCVTQVTEAIVRRA-HRFIRFPRGCIQRRAAARDFHRLHGFPGAVGAVGCTHVAL 158
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
P++HE ++ NR H++++ A+C + A +PGSV + ++ S + F
Sbjct: 159 KPPSDHESRYRNRWRYHSMHMQAVCDARGALTHIVAEYPGSVSEEDILVQSSLGHMFQNR 218
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
G + DA ++G Y K WL+ P +P + A+ ++ H ++ +FG LK +F
Sbjct: 219 G-QDDDDAWLIGGRCYTQKPWLMTP-IEDPQSPAELQYNDSHTAALSVLSAAFGSLKSRF 276
Query: 246 PCLN----YMRLSPQFAGMVVLASTTLHNIALT 274
CL+ ++ SP + +A LHN+AL+
Sbjct: 277 RCLSRRGGSLQYSPLKVSQIFVACCVLHNMALS 309
>gi|242056125|ref|XP_002457208.1| hypothetical protein SORBIDRAFT_03g003310 [Sorghum bicolor]
gi|241929183|gb|EES02328.1| hypothetical protein SORBIDRAFT_03g003310 [Sorghum bicolor]
Length = 470
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/277 (25%), Positives = 118/277 (42%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S E + EE+ A+ T A+ +++V C+ + G P ++
Sbjct: 137 FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 196
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + V+WPE + + TF G P V G + T
Sbjct: 197 GLGISTCHKLVLEVCAAIKAVLMPKAVQWPEAPDAAAGVSATFEAASGIPGVVGAMYTTH 256
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + + NR +++ V + F V WPGS+ DA V+
Sbjct: 257 IPIIAPKANVAAYYNRRHTERNQKTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDR 316
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y + G ++G + YPL +WL+ P T + A F R +
Sbjct: 317 SALYAQRGAAGL--LQGQWVVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVAR 374
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
++F LK ++ CL + Q +V+ A LHNI
Sbjct: 375 DAFQRLKSRWGCLQKRTEVKLQDLPVVLGACCVLHNI 411
>gi|356514661|ref|XP_003526022.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 440
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 133/310 (42%), Gaps = 16/310 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+AFR+ E I +E+ + T A+ K++V CL + G P ++
Sbjct: 105 FKKAFRMGRLTFEAICQELNSVIAKEDTTLRTAIPVKQRVAVCLWRLATGDPLSVVSRRF 164
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP L DI F G PNV G + + V
Sbjct: 165 GLGISTCHKLVLEVCTAIKTVLMPKYLQWPNVVTLRDIKGEFENISGIPNVVGSMFTSHV 224
Query: 124 LIDAPNEHEDQFVNRN-------GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + F V WPGS+ D +V+ S
Sbjct: 225 PIIAPKMSVSAYFNKKHTERNHKTSYSVTVQGVVDHRGVFTDVCIGWPGSMGDDQVLEKS 284
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++ + + G ++G YPL +W++ P T+ F RR+ +
Sbjct: 285 SLFHRANNGGIN-LSGVWVVGGCGYPLLDWVLVPYTQQNLTWTQHSFNEKIGEVRRVAKG 343
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+FG LK ++ CL + Q +V+ A LHNI + NE + E+ + DDG
Sbjct: 344 AFGRLKGRWGCLQKRTEVKLQDLPLVLGACCVLHNIC---ELKNEEMDHELLKN--IDDG 398
Query: 296 EEEAVDDPDE 305
+++ V + E
Sbjct: 399 DDDYVGEVME 408
>gi|115694524|ref|XP_797119.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 378
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 137/309 (44%), Gaps = 20/309 (6%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P AF+ R+ + +++ + P LT T +L KV L + G+ Y +
Sbjct: 68 PRAFQHFMRMPPAVFDEVVQRLTPRLTKQDTNYRASLEPGLKVAITLRHLASGNTYRNMQ 127
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVD 119
++I + EVV I++ + ++ P E+ D+ + ++ FP+ G +D
Sbjct: 128 YAWRVPHNTISVVVREVVKAIIEEYTDELLFCPTTEQGWRDLADQWYQRWNFPHTVGAID 187
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
G V AP +F N G +++ + A+ +YKF ++ + GS DA + S ++
Sbjct: 188 GKHVACKAPPNSGSEFYNYEGFYSVILFAMVDADYKFTYIDVSGNGSSSDAHIYNESDLH 247
Query: 180 RKFDEEGWRPFPD------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
R D+ FP I+GD A+ L+ +L+ P + ++ F
Sbjct: 248 RGLDQNRIHAFPQPDPLPNDNQDVPYFIIGDDAFSLRTYLMKPYSTRNLTHEERIFNYRL 307
Query: 228 KRTRRLIENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
R RR++EN+FGIL +F L M+ + +V A LHN+ T +N+ V
Sbjct: 308 SRARRVVENTFGILANRFQILLTTMQHHHETVRTIVEACCILHNLMRTRYPVLQNRLV-- 365
Query: 287 HDDEAEDDG 295
D A+ DG
Sbjct: 366 --DRAQPDG 372
>gi|340382450|ref|XP_003389732.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 423
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 121/278 (43%), Gaps = 17/278 (6%)
Query: 10 FRLSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
R++ + + +AP L + T + +S E++ L ++ G I +
Sbjct: 61 LRVTKETFDVLCRLVAPYLQKRRSYTNKRPHISPGEQLAMTLRFLASGDSQTSISYSFRV 120
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+S+C I++ V+ ++ + +P E+ I F FPN G +DG V
Sbjct: 121 GKASVCHIIYKTCCVLWKVLHKKYLPFPSTEDEWKKISHQFWMLWQFPNCLGAIDGKHVR 180
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I APN F N G ++ +MAIC +Y+F V+ G D V N ++ E
Sbjct: 181 IQAPNNSGSMFFNYKGTFSIVLMAICDAHYRFIMVDIGEGGKESDGGVFSNCTFGQRLME 240
Query: 185 EGWR-PFPDA----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
+ + P P+A V +GD A+PL+ L+ P + + F R RRL
Sbjct: 241 QSLQLPLPNALPGTSVVAPYVFVGDEAFPLRPDLLRPFPGSNLSEKQSVFNYRLSRARRL 300
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
IENSFGIL +F + SP V A+ LHN
Sbjct: 301 IENSFGILASRFRIYRQPIHSSPAKVVAYVKATIVLHN 338
>gi|449447145|ref|XP_004141329.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 496
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 132/296 (44%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+ FR+ + I EE+ A+ T A+ +++V CL + G P ++
Sbjct: 170 FKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKF 229
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP EE L I + G PNV G + T +
Sbjct: 230 GLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHI 289
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + P F V WPGS+ D +V+ S
Sbjct: 290 PIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKS 349
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + I+G S+YPL +W++ P T+ F +++ ++
Sbjct: 350 ALFQRANGG---LLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKD 406
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNV-EIHDDE 290
+F LK ++ CL + Q +V+ A LHNI +++ + + + E+ DDE
Sbjct: 407 AFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDE 462
>gi|224114577|ref|XP_002332342.1| predicted protein [Populus trichocarpa]
gi|222831909|gb|EEE70386.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 132/299 (44%), Gaps = 16/299 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S + I E+ +T T A+ +++V C+ + G P ++
Sbjct: 130 FRKAFRMSKATFDMICVELDSVVTKKNTMLRDAIPVRQRVAVCIWRLATGEPLRLVSKRF 189
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + ++WP E+ L I F G PNV G + T V
Sbjct: 190 GLGISTCHKLVLEVCSAIRNVLMPKFLQWPNEDKLKMIKGEFESMSGIPNVGGSMYTTHV 249
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + P F V +PGS+ D RV+ S
Sbjct: 250 PIIAPKINVAAYYNKRHTERNQKTSYSMTVQGVVSPKGVFTDVCIGYPGSMPDDRVLEES 309
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + + I+G+S +PL +W++ P T F +R+ +
Sbjct: 310 ALFKRANRGA---LKNVCIVGNSGHPLMDWVLVPYTHQNLTWTQHAFNEKIGEIQRVSKE 366
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
+F LK ++ CL + Q +V+ A LHNI + NE E+ D +DD
Sbjct: 367 AFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNIC---EMRNEEMGPELKFDIFDDD 422
>gi|432924657|ref|XP_004080663.1| PREDICTED: uncharacterized protein LOC101167745 [Oryzias latipes]
Length = 707
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 129/308 (41%), Gaps = 16/308 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S ++ E +L+++AP+LT T + ++++ CL ++ G Y I ++
Sbjct: 62 FRGYFRMSVSEFENLLQQLAPSLTKEQTHYRKPIDPEQRLAVCLRFLSTGDSYRTIASSF 121
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S++ + E + + P EE F + FPN G +DG +
Sbjct: 122 RLGVSTVASIVRETCDALWHCLRDEHLPVPTEEMWRSTARRFHERWNFPNCLGAMDGKHI 181
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I AP + N G ++ ++A+ +Y+F V+ GS D + NS + +
Sbjct: 182 FIQAPANSSSLYFNYKGTFSVVLLALVDADYRFLMVDVGSYGSNSDGGIFANSGLGKALR 241
Query: 184 EEGWRPFP-------------DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
+ P + VI+ D A+PLK +L+ P + F R
Sbjct: 242 DGTLNVPPPSELPGAPELGKLNHVIVADEAFPLKPYLLRPYPGRRLPTEKRIFNYRLSRA 301
Query: 231 RRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
RR+ EN FGIL ++F +++ P VV A+ L N D + + HD
Sbjct: 302 RRISENVFGILSQRFRVFQRTLQVQPSVVDKVVKAACVLCNYLRPKGEDQNHATEDDHDC 361
Query: 290 EAEDDGEE 297
E G E
Sbjct: 362 EQPLQGFE 369
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 102/264 (38%), Gaps = 15/264 (5%)
Query: 49 HWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLR 107
H++ G Y I ++ S++ + E + + P EE F
Sbjct: 401 HFLSTGDSYRTIASSFRLGVSTVASIVRETCDALWHCLRDEHLPVPTEEMWRSTARRFHE 460
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
+ FPN G +DG + I A + N G ++ ++A+ +Y+F V+ GS
Sbjct: 461 RWNFPNCLGAMDGKRISIQATANSGSLYFNYKGTFSVVLLALVDADYRFLMVDVGSYGSN 520
Query: 168 HDARVMRNSMVYRKFDEEGWRPFP-------------DAVILGDSAYPLKEWLIPPRTRN 214
D + NS + + + P + VI+ D A+PLK +L+ P
Sbjct: 521 SDGGIFANSGLGKALRDGTLNVPPPSELPGAPELGKVNHVIVADEAFPLKPYLLRPYPGR 580
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIAL 273
+ F R RR+ EN FGIL ++F +++ P VV A+ L N
Sbjct: 581 RLPTDKRIFNYRLSRARRISENVFGILSQRFRVFQRTLQVQPSVVDKVVKAACVLCNYLR 640
Query: 274 TIDRDNENQNVEIHDDEAEDDGEE 297
D + + HD E G E
Sbjct: 641 PKGEDQNHATEDDHDCEQPLQGFE 664
>gi|224085922|ref|XP_002307739.1| predicted protein [Populus trichocarpa]
gi|222857188|gb|EEE94735.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 123/277 (44%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S + I E+ A+T T A+ ++++ C+ + G P ++
Sbjct: 129 FRKAFRMSKATFDLICMELDSAVTKKNTMLRDAIPVRQRIAVCIWRLATGEPLRLVSKRF 188
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + ++WP E+ L I F G PNV G + T V
Sbjct: 189 GLGISTCHKLVLEVCSAIRNVLMPKFLQWPNEDKLKMIKGEFESMSGIPNVGGSMYTTHV 248
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + P F V +PGS+ D RV+ S
Sbjct: 249 PIIAPKINVAAYYNKRHTERNQKTSYSMTVQGVVSPKGVFTDVCIGYPGSMPDDRVLEES 308
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ + + I+G+S +PL +W++ P T F +R+ +
Sbjct: 309 ALFKRANRGA---LKNVCIVGNSGHPLMDWVLVPYTHQNLTWTQHAFNEKIGEIQRVSKE 365
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+F LK ++ CL + Q V+ A LHNI
Sbjct: 366 AFARLKGRWSCLQKRTEVKLQDLPAVLGACCVLHNIC 402
>gi|301611294|ref|XP_002935178.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 378
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 80/168 (47%), Gaps = 8/168 (4%)
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
G PNV G +D T V I P + E F N H+LN+ +C ++ + +PGS
Sbjct: 205 SGIPNVLGAIDRTHVAISHPQDREHVFRNCKSYHSLNIQVVCDATMNIRSIVSGFPGSSQ 264
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
DA ++R S +Y F P +LGD+ YP WLI P R P A+ F H
Sbjct: 265 DAYILRQSGLYEGFQAG---KMPHWWLLGDAGYPCSRWLITPIPR-PHTRAECAFNEAHI 320
Query: 229 RTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIA 272
+IE + G+LK + CL+ + SP +V+A LHN+A
Sbjct: 321 TAWSVIERTLGVLKSRSHCLDKSGGSLMYSPTKVAQIVVAYAVLHNVA 368
>gi|72015146|ref|XP_783499.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 429
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 121/271 (44%), Gaps = 23/271 (8%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE- 95
ALS K+ L + G Y + + + ++I + EV I++ + +VK P
Sbjct: 107 ALSPALKLAITLRHLATGDSYKSLMYSFRVAHNTISLIVREVCTAIIEEYGDELVKCPTS 166
Query: 96 -ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
+ + F + P+ G +DG V I P + F N G +++ +MA+ NY
Sbjct: 167 PQEWKAVAHGFEDRWNSPHTIGALDGKHVAIKCPKDSGSIFYNYKGFYSIVLMALVDANY 226
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-PFPDAV----------ILGDSAYPL 203
KF V+ + GS DA++ + + ++ + P P+ + + D A+ L
Sbjct: 227 KFLWVDVGYEGSTSDAQLFNSCELKQRLKSDTLGVPLPEPMTNDDVDTPYFFVADEAFAL 286
Query: 204 KEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVV 262
+ WL+ P R A ++ + R RR++EN+FGI +F C L +R +P +V
Sbjct: 287 RTWLMKPYARTAMARHERIYNYRLSRARRVVENAFGIHANRFQCLLTTLRQNPSTVRTMV 346
Query: 263 LASTTLHNIALT---------IDRDNENQNV 284
A LHN+ T +DR++ N+
Sbjct: 347 QACVCLHNLMRTRYPCIQNSALDREDAEHNI 377
>gi|301604055|ref|XP_002931683.1| PREDICTED: hypothetical protein LOC100493199 [Xenopus (Silurana)
tropicalis]
Length = 617
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 101/224 (45%), Gaps = 51/224 (22%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
+ +RLS T + + + L T R+ AL K+L+ LH++G GS
Sbjct: 146 QMYRLSRTAILQVFHLVRQDLDPVTARSQALPGISKLLAVLHFLGSGS------------ 193
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
F Q ++ G FPN G +D T V +
Sbjct: 194 ------------------FQQYLI-----------------GHFPNCLGAIDCTHVPLTP 218
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P H+++++NR H++NV +C + + +V + +PGSVHDA ++R S +Y +F +
Sbjct: 219 PRAHQERYLNRKRSHSINVQVVCDSHLRIISVRSGFPGSVHDAHILRQSALYERFTQG-- 276
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
P ++GD+ Y + WL+ P P A +R+ R H++TR
Sbjct: 277 -EMPQGWLVGDAGYGVLPWLMTPVC-FPRTPAQRRYNRAHRKTR 318
>gi|449690755|ref|XP_004212446.1| PREDICTED: uncharacterized protein LOC101235855 [Hydra
magnipapillata]
Length = 315
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 118/272 (43%), Gaps = 26/272 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F + E+ P L TT T R +S ++V LH++ Y + +
Sbjct: 14 FKEYFRVNRNTFNFPVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYRAVSSL 72
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ +HE + + D + VK+P E LN + F GFP G VDG
Sbjct: 73 FGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVEILNKHSIDFEAILGFPQCVGAVDGC 132
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ I AP + + N G +++ + A+ Y+F + P
Sbjct: 133 HIPISAPKDQAISYYNYKGWYSIVLFAVVDCRYRFIYTSVGLP----------------- 175
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
E G P +I GDSA+PL L+ P N + + Q+ R +EN+FG +
Sbjct: 176 --ELGDSLVPLCLI-GDSAFPLTRHLLKPYPENLEPSEIQKNFNKILCGARRVENAFGCV 232
Query: 242 KEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
K +F + M FA +V A TLHNI
Sbjct: 233 KARFRVICKRMECDINFATKIVNACVTLHNIC 264
>gi|241573389|ref|XP_002403173.1| transposase, putative [Ixodes scapularis]
gi|215500182|gb|EEC09676.1| transposase, putative [Ixodes scapularis]
Length = 275
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 97/205 (47%), Gaps = 9/205 (4%)
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAIC 150
++W + L + F RK FP V G VDG + I AP E +D + NR H++ + C
Sbjct: 77 IRWLMQ-LQRVSRAFTRKWRFPGVVGAVDGCHIAIKAPEEEQDAYYNRKEFHSIILQGCC 135
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIP 209
F V+ PG +HDARV S + D P ++ +LGDSAY L+ L+
Sbjct: 136 NSEMVFTHVHVGSPGRMHDARVFSISGLDEIIDN-----LPLSLHVLGDSAYSLRLGLMR 190
Query: 210 PRTRNPDAAADQ-RFLRCHKRTRRLIENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTT 267
P N + +Q F R +IE +F LK KF L Y+ + + + VLAS
Sbjct: 191 PYRNNGHLSEEQVTFNETLSAARSVIERAFAQLKGKFRRLKYLDMEATEMMSKYVLASCV 250
Query: 268 LHNIALTIDRDNENQNVEIHDDEAE 292
LHNI L NVEI D+ E
Sbjct: 251 LHNIILQSKGPFVVDNVEIERDDDE 275
>gi|86155935|gb|ABC86705.1| R111 [Coffea arabica]
Length = 498
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 131/298 (43%), Gaps = 16/298 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S + I +E+ +T T +A+ +++V C+ + G P +
Sbjct: 172 FRKAFRMSKATFDMICDELESVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLREVSKRF 231
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + ++WP EEN+ ++ F G +VAG + + V
Sbjct: 232 GLGISTCHKLVLEVCSAIRNVLMPKFLQWPNEENMREVKRGFEMISGVADVAGSIYTSHV 291
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + P F + WPGS+ D +V+ S
Sbjct: 292 PIIAPKVSVASYFNKRHTERNQKTSYSITVQGVVDPKGVFTDICVGWPGSMTDDKVLEQS 351
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y++ + D ++G++ +PL +W++ P TR + + +
Sbjct: 352 ALYQRANRG---LLKDVWVVGNAGFPLMDWVLVPYTRQNLTWTQHALNEKVGEVQNIAKE 408
Query: 237 SFGILKEKFPCLNYMRLSPQFAGMVVL--ASTTLHNIALTIDRDNENQ-NVEIHDDEA 291
+F LK ++ CL R + + +L A LHNI D + E+ DDE
Sbjct: 409 AFMRLKARWRCLQK-RTEVKLQDLPILLGACCVLHNICEIRDEGMSPELRFELFDDEV 465
>gi|224092160|ref|XP_002309488.1| predicted protein [Populus trichocarpa]
gi|222855464|gb|EEE93011.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 132/297 (44%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK+AFR+S + + I EE+ + T RN A+ +++V C+ + G P ++
Sbjct: 181 FKKAFRMSKSTFDMICEELNSVIAKEDTTLRN-AIPVRQRVAVCIWRLATGEPLRLVSKR 239
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + ++WP+E+ L I F G PN G + T
Sbjct: 240 FGLGISTCHKLVLEVCSAIRSVLMPKYLQWPDEDGLRKIKNEFESISGIPNAVGSMYTTH 299
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + N+ +++ V + P F V WPGS+ D +V+
Sbjct: 300 IPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPKGIFTDVCIGWPGSMPDDQVLEK 359
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S ++++ + D I+G S YPL +W++ P + F + + +
Sbjct: 360 SALFQRANGG---LLKDVWIVGTSGYPLMDWVLVPYAQQNLTWTQHAFNEKIGEIQLVAK 416
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
++F LK ++ CL + Q +V+ A LHNI + + + VE+ DDE
Sbjct: 417 DAFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNICEMQNEEIDPELRVELVDDE 473
>gi|291228639|ref|XP_002734285.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 429
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 135/301 (44%), Gaps = 19/301 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F R+ A +L+ I P + L K+ L ++ G+ Y +
Sbjct: 76 FLNYMRMPAETFLVLLQRITPRIEKSYRYRQPLDPGLKLAITLRYLATGNSYKTLQYAFR 135
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ ++I I EV I+ + + P + +++ T+ F +V G +DG +
Sbjct: 136 VAHNTISLFIPEVCLAIISEYQDEVFSCPITTDEWSEVAQTYADTWNFHHVCGALDGKHI 195
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I P + N G ++L ++A+ NY+F + PGS DA+V +S + R
Sbjct: 196 AIRNPPGSGTIYYNYKGFYSLTLLALVDGNYEFLWADVGNPGSSLDAQVFNHSPL-RHGL 254
Query: 184 EEGWRPFPDA------------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
E G PD ++GD +PL+ W+ P + ++ F R R
Sbjct: 255 ENGTLGLPDPDPLPDDDRDTPYFLIGDDVFPLRTWMQKPYSNREQTDEERIFNYRLSRAR 314
Query: 232 RLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI-HDD 289
R++ENSFGIL ++ C L+ ++L P+ A V++A LHN L DR QN+++ H+D
Sbjct: 315 RVVENSFGILAHRWRCLLSTLQLDPEKARTVIMACMCLHN--LMRDRFPGLQNIDVDHED 372
Query: 290 E 290
E
Sbjct: 373 E 373
>gi|225429570|ref|XP_002280033.1| PREDICTED: uncharacterized protein LOC100262760 [Vitis vinifera]
gi|296081670|emb|CBI20675.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 119/277 (42%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S + I +E+ + T A+ +++V C+ + G P ++
Sbjct: 158 FRKAFRMSRATFDMICDELNSVVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKKF 217
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP EE L + F G PNV G + T +
Sbjct: 218 GLGISTCHKLVLEVCSAIRTVLMPKYLQWPDEETLRRMKDEFESISGIPNVVGSMYTTHI 277
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ V + P F V WPGS+ D +V+ S
Sbjct: 278 PIIAPKISVAAYFNRRHTERNQKTSYSITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKS 337
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y++ + I+G S YPL +W++ P T+ F +R+
Sbjct: 338 ALYQRANGG---LLKGVWIVGGSGYPLMDWVMVPYTQQNLTWTQHAFNEKIGEIQRVSRE 394
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+F LK ++ CL + Q +V+ A LHNI
Sbjct: 395 AFARLKARWCCLQKRTEVKLQDLPVVLGACCVLHNIC 431
>gi|15233248|ref|NP_191095.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
gi|7076778|emb|CAB75893.1| putative protein [Arabidopsis thaliana]
gi|21593300|gb|AAM65249.1| unknown [Arabidopsis thaliana]
gi|26449824|dbj|BAC42035.1| unknown protein [Arabidopsis thaliana]
gi|32362295|gb|AAP80175.1| At3g55350 [Arabidopsis thaliana]
gi|332645850|gb|AEE79371.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
Length = 406
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 139/304 (45%), Gaps = 29/304 (9%)
Query: 3 PFAFKEAFRLSATQVEFILEEIA------PALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
P F+ F++S ++I + PA + + N LS ++V L +G G
Sbjct: 69 PKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGN-PLSLNDRVAVALRRLGSGES 127
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+IG T G ++S++ + V + + + WP + L++I F + G PN G
Sbjct: 128 LSVIGETFGMNQSTVSQITWRFVESMEERAIHHL-SWPSK-LDEIKSKFEKISGLPNCCG 185
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDH--ALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
+D T ++++ P V +G+ ++ + A+ P+ +F V A WPGS++D V++
Sbjct: 186 AIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDVVLK 245
Query: 175 NSMVYRKFDEEGWR------PFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
NS Y K E+G R P + I+GDS +PL WL+ P P + F
Sbjct: 246 NSGFY-KLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSLPQTEF 304
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
+ H + + + LK+++ +N + P + ++ LHNI + D E+
Sbjct: 305 NKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIII----DMED 360
Query: 282 QNVE 285
Q ++
Sbjct: 361 QTLD 364
>gi|343887274|dbj|BAK61820.1| hypothetical protein [Citrus unshiu]
Length = 515
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 132/300 (44%), Gaps = 18/300 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK+ FR+ + I EE+ + T RN A+ +++V C+ + G P ++
Sbjct: 189 FKKWFRMRRQTFDMICEELNSVIAKEDTTLRN-AIPVRQRVAVCIWRLATGEPLRLVSKR 247
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + ++WP+++ L I F G PNV G + T
Sbjct: 248 FGLGISTCHKLVLEVCTAIRSVLMPKYLQWPDDDALRKIKDEFEVISGIPNVVGSMYTTH 307
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
V I AP + N+ +++ V + PN F V WPGS+ D +V+
Sbjct: 308 VPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVNPNGVFTDVCIGWPGSMPDDQVLEK 367
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y++ I+G S YPL +W++ P T+ F +R+ +
Sbjct: 368 SALYQRASGG---LLKGVWIVGGSGYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQRVSK 424
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
++F LK ++ CL + Q +V+ A LHNI + NE + E+ D +D+
Sbjct: 425 DAFARLKGRWCCLQKRTEVKLQDLPVVLGACCVLHNICEMM---NEEMDPELAFDLVDDE 481
>gi|328696952|ref|XP_003240188.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 371
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 144/305 (47%), Gaps = 20/305 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV--ALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK +FR+ ++++ + P + +++ +LS + + S +
Sbjct: 55 FKGSFRMERATAYYLIKTFEDSSFFPQQHKYEPGQTSENYILSYIWFSANKSCLRDVAER 114
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G ++ R V+ ++D +++ + +E + ++ F + G P+V GC+DGT +
Sbjct: 115 FGNGLTTQFRINDRVMNFLID-ISPKVINF-DEGITNLAREFKKVSGMPHVIGCIDGTSI 172
Query: 124 LIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P ++ + + NR+ ++ + +C KF V P +HDARV S + +
Sbjct: 173 PIRTPAHKIKSTYTNRHDMPSITLQGVCDYKKKFIDVFTGIPAKIHDARVFVLSDLSKDL 232
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGIL 241
+ + +LGD AYP++EWL+ P Q+ F + TR LIEN+FG+L
Sbjct: 233 PSMCEKKYN---LLGDGAYPIREWLLVPYKDYGRLTESQKTFNKALSSTRVLIENTFGLL 289
Query: 242 KEKFPCLNYMRL-SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
K +F L + + S +++S LHN+ + +D ++E+ D+E EE +
Sbjct: 290 KSRFRQLLQLDIHSVDKITKFIISSCVLHNLCIDMD-----DHIELRDEE-----NEELL 339
Query: 301 DDPDE 305
+ PD+
Sbjct: 340 NGPDD 344
>gi|21902022|dbj|BAC05572.1| unknown protein [Oryza sativa Japonica Group]
Length = 551
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 127/296 (42%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I E + A+ T A+ +++V C+ + G P ++
Sbjct: 225 FRRAFRMGRETFDMICEALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 284
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F G P V G + T +
Sbjct: 285 GLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQAAYGVPGVIGAMYTTHI 344
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 345 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLEKS 404
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ A ++G ++YPL +W++ P T F RR+ +
Sbjct: 405 MLHQR---AAAGMMHSACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDLRRVAVD 461
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQ-NVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI T + E + E+ DDE
Sbjct: 462 AFARLKARWACLQKRTEVKLQDLPVVLGACCVLHNICETRGEELEPELRFELVDDE 517
>gi|328699930|ref|XP_003241093.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 371
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 144/305 (47%), Gaps = 20/305 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV--ALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK +FR+ ++++ + P + +++ +LS + + S +
Sbjct: 55 FKGSFRMERATAYYLIKTFEDSSFFPQQHKYEPGQTSENYILSYIWFSANKSCLRDVAER 114
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G ++ R V+ ++D +++ + +E + ++ F + G P+V GC+DGT +
Sbjct: 115 FGNGLTTQFRINDRVMNFLID-ISPKVINF-DEGITNLAREFKKVSGMPHVIGCIDGTSI 172
Query: 124 LIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P ++ + + NR+ ++ + +C KF V P +HDARV S + +
Sbjct: 173 PIRTPAHKIKSTYTNRHDMPSITLQGVCDYKKKFIDVFTGIPAKIHDARVFVLSDLSKDL 232
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGIL 241
+ + +LGD AYP++EWL+ P Q+ F + TR LIEN+FG+L
Sbjct: 233 PSMCEKKYN---LLGDGAYPIREWLLVPYKDYGRLTESQKTFNKTLSSTRVLIENTFGLL 289
Query: 242 KEKFPCLNYMRL-SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
K +F L + + S +++S LHN+ + +D ++E+ D+E EE +
Sbjct: 290 KSRFRQLLQLDIHSVDKITKFIISSCVLHNLCIDMD-----DHIELRDEE-----NEELL 339
Query: 301 DDPDE 305
+ PD+
Sbjct: 340 NGPDD 344
>gi|222618738|gb|EEE54870.1| hypothetical protein OsJ_02361 [Oryza sativa Japonica Group]
Length = 551
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 127/296 (42%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I E + A+ T A+ +++V C+ + G P ++
Sbjct: 225 FRRAFRMGRETFDMICEALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 284
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F G P V G + T +
Sbjct: 285 GLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQAAYGVPGVIGAMYTTHI 344
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 345 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLEKS 404
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ A ++G ++YPL +W++ P T F RR+ +
Sbjct: 405 MLHQR---AAAGMMHSACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDLRRVAVD 461
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQ-NVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI T + E + E+ DDE
Sbjct: 462 AFARLKARWACLQKRTEVKLQDLPVVLGACCVLHNICETRGEELEPELRFELVDDE 517
>gi|225444848|ref|XP_002281001.1| PREDICTED: uncharacterized protein LOC100247440 [Vitis vinifera]
Length = 509
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 129/296 (43%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S + I +E+ +T T +A+ +++V C+ + G P ++
Sbjct: 183 FRRAFRMSRATFDMICDELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRF 242
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPM-TFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP++ + I F G PNV G + T +
Sbjct: 243 GLGISTCHKLVLEVCAAIKTVLMPKFLQWPDDEKSKIVKDEFESISGIPNVGGSMYTTHI 302
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + P F V+ WPGS+ D +V+ S
Sbjct: 303 PIIAPKVSVAAYFNKRHTERNQKTSYSVTVQGVVDPKGVFTDVSIGWPGSMSDEKVLEQS 362
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y++ + G+ D ++G S YPL +W++ P T F + + +
Sbjct: 363 SLYQRAN-SGF--LKDVWVVGGSGYPLLDWVMVPYTHQNLTWTQHAFNEKIGDIQNVAKE 419
Query: 237 SFGILKEKFPCLNYMRLSPQFAGMVVL-ASTTLHNIALTIDRDNE-NQNVEIHDDE 290
+F LK ++ CL VVL A LHNI + + + E+ DDE
Sbjct: 420 AFARLKRRWSCLQKRTEVKLLDLPVVLGACCVLHNICQMRNEEMDPALQFELFDDE 475
>gi|449667893|ref|XP_004206668.1| PREDICTED: uncharacterized protein LOC101236189 [Hydra
magnipapillata]
Length = 254
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 120/250 (48%), Gaps = 15/250 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y ++ +
Sbjct: 6 FKEHFRVNRNTFIFLVNELHPHLGKTTTTMRE-HISVVKRVAVALHYLASCEEYRVVSSL 64
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ +HE + + D + VK+P ENLN F GFP G VDG
Sbjct: 65 FGIGKSTTNLIVHEFINAVNDISLPKYVKFPLSVENLNKHSRDFEAILGFPQCVGAVDGC 124
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM---- 177
+ I AP + ++ + G ++ + A+ Y+F PG +D+ +++NS
Sbjct: 125 HIPISAPKDQAISYIYK-GWYSNVLFAVVDCRYRFIYTIVGLPGRNNDSYILQNSSLKAI 183
Query: 178 ----VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRR 232
++ K +E ++GDSA+PL L+ P N + + Q+ F + R+
Sbjct: 184 LESNLFNKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPENLELSEIQKNFNKILCGARK 243
Query: 233 LIENSFGILK 242
++EN+FG +K
Sbjct: 244 VVENAFGRVK 253
>gi|344280766|ref|XP_003412153.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Loxodonta africana]
Length = 305
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 119/275 (43%), Gaps = 53/275 (19%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSVYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E ++ + F G P V G V
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPADETSMQALKDGFYGLAGMPGVIGVV 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V NWPGS+ D+ V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLMVCDLRGALMTVETNWPGSLQDSTVLQQSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
+F E G WL+ R R D + S
Sbjct: 209 CSQF-EAGMHK--------------DSWLLXXRFRCLDGSX----------------GST 237
Query: 239 GILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIAL 273
+L EK + ++LA LHNI+L
Sbjct: 238 AVLTEK-------------SSHIILACCVLHNISL 259
>gi|221110397|ref|XP_002154191.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 320
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 129/278 (46%), Gaps = 40/278 (14%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG-ATHGPSK 68
FR + ++ + + L + + + ++ ++L L + +GS ++G +T S+
Sbjct: 50 FRFRRASIFYLTDLLQIDLQPKSNQTLVVAPILQLLCALRFYAIGSFQIVVGDSTAALSE 109
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+I R I V + +K+P + LN+ + F FP V G +D T + I
Sbjct: 110 PTISRIIRRVSLSLAKRI-NEYIKYPTNQHVLNESRVKFYEIAEFPKVTGVIDCTHICIQ 168
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P+EH+ +V+ + +H++NV A+C KF V A WP S HDAR++R S + +KF +
Sbjct: 169 KPHEHKYAYVDSSSNHSINVQAVCDNKGKFIDVVAKWPWSTHDARILRESKLGKKFMDGT 228
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
++ +++G +TR +IE FG +F
Sbjct: 229 FK----GLLIG-------------------------------KTRVIIEQVFGRWIRRFH 253
Query: 247 CLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQN 283
L+ +R++P+ +V LHN+A+ ++ + ++N
Sbjct: 254 LLHGEIRMTPERTCTLVAVCAVLHNLAIQLNDGDMDEN 291
>gi|301607129|ref|XP_002933172.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 286
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 117/273 (42%), Gaps = 46/273 (16%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ + + E + P L T R+ ++ K+L LH+ GS + G G S+
Sbjct: 40 YRLNRAAIYSLYEVLEPYLQALTRRSHSVPGMVKLLCSLHFFATGSFQKVGGVYGGVSQP 99
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
+ R + +V+ I + + +P N I F G PNV G +D T V ++
Sbjct: 100 TFSRCLCQVLDTI-RSVSANYISFPTNRNEWNTIKRQFYGVSGIPNVLGAIDCTHVALNL 158
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD---- 183
+ E F NR H+LN+ +C ++ + +PGS HDA ++R S +Y F
Sbjct: 159 AQDREHVFRNRKSYHSLNIQVVCDATMNIRSIVSRFPGSSHDAYILRQSRLYDGFQTGQM 218
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
GW +LG R H R R +I+ +FG+LK
Sbjct: 219 SHGW-------LLG---------------------------RIH-RARPVIKRTFGVLKS 243
Query: 244 KFPCLNYMRLSPQFA----GMVVLASTTLHNIA 272
+F CL+ S ++ +V+A LHN+A
Sbjct: 244 RFHCLDKSGGSLMYSRTKVAQIVVACAVLHNLA 276
>gi|449688568|ref|XP_004211777.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 194
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 87/156 (55%), Gaps = 2/156 (1%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
++ +++ R+ ++++ ++ L PT+RN LS +E++L ++G + YH++
Sbjct: 39 VSDLVYRQQNRVPRAVIDYLELRLSNILQHPTKRNHPLSTREQILLFSQFLGTSAFYHLL 98
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
G +I ++H V I + ++KWP++ + + F + GGFP VAG +DG
Sbjct: 99 RDARGVDTKTIHSTVHRVANAI-QSVKNEVIKWPDD-CSRLADEFFKLGGFPCVAGAIDG 156
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF 156
T+V + P E FVNR +HA+N++A+ GPN F
Sbjct: 157 TLVQVMPPKIDEASFVNRYQNHAINILAVAGPNMAF 192
>gi|326528651|dbj|BAJ97347.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 521
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 138/334 (41%), Gaps = 28/334 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I + + A+ T A+ +++V C+ + G P ++
Sbjct: 195 FRRAFRMGRETFDMICDALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 254
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F R G P V G + T +
Sbjct: 255 GLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFERSFGVPGVIGAMYTTHI 314
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 315 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDVCIGWPGSMPDDQVLEKS 374
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ DA ++G ++YPL +W++ P T F R +
Sbjct: 375 MLHQR---AAAGMMHDACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDIRNVAVE 431
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI E+ +E E +
Sbjct: 432 AFARLKTRWACLQKRTEVKLQDLPVVLGACCVLHNI------------CEMRREELEPEV 479
Query: 296 EEEAVDD---PDEIDEEEQPRQDRNNPGTQRLAR 326
+ DD P+ E +Q+R++ L R
Sbjct: 480 QFALFDDDTTPETPVRSETAKQERDSIAHNLLHR 513
>gi|326665390|ref|XP_002661050.2| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 426
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 143/321 (44%), Gaps = 21/321 (6%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTPTER-NVALSAKEKVLSCLHWMGVGSPYHIIG 61
P + E FR++ + ++ P L R AL +++V L + Y I
Sbjct: 95 PSDWLEKFRMTKETFFLLCGKLKPRLNRQDTRLRPALPLEKRVAVALWRLASNVEYRTIS 154
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDG 120
G +S++C+ + +V IV ++ P E+ L D F + GFP+ G V
Sbjct: 155 TLFGVGRSTVCKCVRDVCHAIVLLLRPLYLRTPSEQELEDAARLFATRWGFPHCVGAVGS 214
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
V I AP+ + D + N G ++ +F+ V A +PGS + +++NS ++
Sbjct: 215 LHVPIIAPSSNTDNYWNSRGWLSVVTQGAVNGLGQFWDVCAGFPGSTEHSAILQNSTLWA 274
Query: 181 KFDEEGW---RPFPD-------AVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKR 229
+ + G+ +P D ++LGD+ YPLK WL+ + A QR F R +R
Sbjct: 275 RGCDGGFLLRQPPLDFMGHPLGFLMLGDAGYPLKSWLLKGYPESSALTAGQRAFNRRLER 334
Query: 230 TRRLIENSFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVE 285
R +++ +F L+ ++ CL N R+ ++LA LHN+ L D E
Sbjct: 335 ARSVVDQAFLRLRARWQCLLKRNDCRMD--VVPTMILACCVLHNVCELHGDAFAERWEEA 392
Query: 286 IHDDEAEDDGEE--EAVDDPD 304
+ +E+ +E A DDP
Sbjct: 393 VRHEESPQPADEVSPAADDPQ 413
>gi|193641226|ref|XP_001951909.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 329
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 131/288 (45%), Gaps = 39/288 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK FRLS +L I ++ R+ +LS ++L L + G+ ++G
Sbjct: 43 FKCRFRLSKETFMILLHMIGDSIK---HRSSSLSPVLQLLIALRYYATGAFQMVLGDHIQ 99
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
KS++ R I + V + +++P + + F R FP V G +D T +
Sbjct: 100 VHKSTVYRVI-KTVSTEIARLRPHFIEFPSTVAEQRRVQLGFFRLHQFPRVIGALDCTHI 158
Query: 124 LIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I +P ++ +QF NR G WPGSVHD+ + NS++ KF
Sbjct: 159 RIQSPKSDIGEQFRNRKG----------------------WPGSVHDSTIFDNSLIRAKF 196
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+ F + +LGD YP + + + P NP A++++ + +R ++E FG+LK
Sbjct: 197 ENN---EFGNTFLLGDGGYPCRNYHLTP-LLNPRTEAERKYQKAQIGSRNVVEGLFGVLK 252
Query: 243 EKFPCLNY---MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
+FP L +LS A +V A+ L+N L ++D Q ++ H
Sbjct: 253 RRFPVLAVGIRTKLSTTMATIV--ATAVLYNFLLK-EKDEIPQEIDNH 297
>gi|242053391|ref|XP_002455841.1| hypothetical protein SORBIDRAFT_03g026050 [Sorghum bicolor]
gi|241927816|gb|EES00961.1| hypothetical protein SORBIDRAFT_03g026050 [Sorghum bicolor]
Length = 559
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 136/330 (41%), Gaps = 36/330 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I + + ++ T A+ +++V C+ + G P ++
Sbjct: 233 FRRAFRMGRETFDMICDALGSSVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 292
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E +F R G P V G + T +
Sbjct: 293 GLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEPAAARFKESFERSSGVPGVIGAMYTTHI 352
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 353 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLEKS 412
Query: 177 MVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
M++++ E W ++G ++YPL +W++ P T F RR
Sbjct: 413 MLHQRAAAGMMHESW-------LVGGASYPLMDWVLVPYTHPNLTWTQHAFNEKVGELRR 465
Query: 233 LIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ +F LK ++ CL + Q +V+ A LHNI E+ +E
Sbjct: 466 VAVEAFARLKARWACLQKRTEVKLQDLPVVLGACCVLHNI------------CELRGEEV 513
Query: 292 EDDGEEEAVDD---PDEIDEEEQPRQDRNN 318
+ E VDD P+ E + R+N
Sbjct: 514 DPAIRCELVDDETTPENPVRSETAKHARDN 543
>gi|449682371|ref|XP_004210059.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 170
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 96/185 (51%), Gaps = 30/185 (16%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP V G +D T + I ++HE +V+R+ +H++NV A+C KF V A WPGS
Sbjct: 4 FPKVTGVIDCTHICIQKTHKHEYAYVDRSSNHSINVQAVCDNKGKFINVVAKWPGS---- 59
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
+++GDS+YP WL+ P NP A+ +R+ ++T
Sbjct: 60 -----------------------LLIGDSSYPCFRWLLTPYL-NPTTASQRRYNISLRKT 95
Query: 231 RRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTI-DRDNENQNVEIHD 288
R +IE FG K +F L+ +R++P+ ++ A L+N+A+ + D D + ++E +
Sbjct: 96 RVIIEQVFGRWKRRFHLLHGEIRMTPERTCTLIAACAVLYNLAIQLNDGDRDENSIENDE 155
Query: 289 DEAED 293
++ ED
Sbjct: 156 NDNED 160
>gi|326526371|dbj|BAJ97202.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 532
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 136/326 (41%), Gaps = 28/326 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I + + A+ T A+ +++V C+ + G P ++
Sbjct: 206 FRRAFRMGRETFDMICDALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 265
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F R G P V G + T +
Sbjct: 266 GLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFERSFGVPGVIGAMYTTHI 325
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 326 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDVCIGWPGSMPDDQVLEKS 385
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ DA ++G ++YPL +W++ P T F R +
Sbjct: 386 MLHQR---AAAGMMHDACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDIRNVAVE 442
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI E+ +E E +
Sbjct: 443 AFARLKTRWACLQKRTEVKLQDLPVVLGACCVLHNI------------CEMRREELEPEV 490
Query: 296 EEEAVDD---PDEIDEEEQPRQDRNN 318
+ DD P+ E +Q+R++
Sbjct: 491 QFALFDDDTTPETPVRSETAKQERDS 516
>gi|357449945|ref|XP_003595249.1| hypothetical protein MTR_2g040130 [Medicago truncatula]
gi|355484297|gb|AES65500.1| hypothetical protein MTR_2g040130 [Medicago truncatula]
Length = 512
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 127/307 (41%), Gaps = 25/307 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S I E+ ++T T A+ +++V C++ + G P ++
Sbjct: 186 FRRCFRMSKQTFNMICNELDSSVTKKNTTLRDAIPVRQRVAVCIYRLATGEPLRLVSKKF 245
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I Q+ ++WP EE + F G PN+ G + T V
Sbjct: 246 GLGISTCHKLVLEVCAAIKSVLMQKFLRWPDEETTKETKQAFEGSFGIPNIGGVMYTTHV 305
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + F V WPGS+ D V+ S
Sbjct: 306 PIIAPKVNVGLYFNKKLTERNQKTSYSVTVQGVVNSKGVFTDVCIGWPGSMQDNTVLEKS 365
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y + + I+G+S Y L +W++ P F ++ ++
Sbjct: 366 ALYER---AKLGNLKNIWIVGNSGYSLMDWVLVPYKHQNLTWTQHGFNERIGEIEKIGKD 422
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI EI ++E +D+
Sbjct: 423 AFSRLKGRWGCLKKRTEIKLQDLPVVLGACCVLHNIC------------EIMNEEMDDEW 470
Query: 296 EEEAVDD 302
+ E DD
Sbjct: 471 KFEVFDD 477
>gi|326494082|dbj|BAJ85503.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 532
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 136/326 (41%), Gaps = 28/326 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I + + A+ T A+ +++V C+ + G P ++
Sbjct: 206 FRRAFRMGRETFDMICDALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 265
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F R G P V G + T +
Sbjct: 266 GLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFERSFGVPGVIGAMYTTHI 325
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 326 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDVCIGWPGSMPDDQVLEKS 385
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ DA ++G ++YPL +W++ P T F R +
Sbjct: 386 MLHQR---AAAGMMHDACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDIRNVAVE 442
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI E+ +E E +
Sbjct: 443 AFARLKTRWACLQKRTEVKLQDLPVVLGACCVLHNI------------CEMRREELEPEV 490
Query: 296 EEEAVDD---PDEIDEEEQPRQDRNN 318
+ DD P+ E +Q+R++
Sbjct: 491 QFALFDDDTTPETPVRSETAKQERDS 516
>gi|346473213|gb|AEO36451.1| hypothetical protein [Amblyomma maculatum]
Length = 423
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 127/294 (43%), Gaps = 29/294 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F R+S Q +F+ + P + T +LS+ E++ L ++ G+ Y + +
Sbjct: 68 FYNFVRMSPQQFDFLESLVKPLIQVQETPLRESLSSAERLAITLRFLASGNSYQSLAYSF 127
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
KS+I + + EV I R+++ P+ L I F+ K FPN G +DG +
Sbjct: 128 RVGKSTISQLVPEVCIAIWKVLKPRVLRQPDHRQLKAIADEFMWKWNFPNCVGAIDGKHI 187
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+DAP + N ++N+MA C Y+F +V GS D V ++ +
Sbjct: 188 HLDAPPNSGSMYFNYKRSFSINLMASCDAGYRFTSVYIGAFGSESDGGVF-SACGLGEIV 246
Query: 184 EEGWRPFPDAVIL------------GDSAYPLKEWLI-------PPRTRNPDAAADQRFL 224
E + FP V L D A+PLK L+ PP T P + R
Sbjct: 247 ESAPQTFPSPVALSNNGPELPFVMVADDAFPLKHNLMKPYPGVQPPNT--PKRVYNYRL- 303
Query: 225 RCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTIDR 277
R RR IEN+FGIL ++ L N M L P+ A V A LHN + R
Sbjct: 304 ---SRARRCIENAFGILCARWRVLRNRMSLLPKNAVAVSAACCCLHNYLMQEGR 354
>gi|357135326|ref|XP_003569261.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 539
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 118/276 (42%), Gaps = 13/276 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I E + A+ T A+ +++V C+ + G P ++
Sbjct: 210 FRRAFRMGRETFDMICEALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 269
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F R G P V G + T +
Sbjct: 270 GLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFERSFGVPGVIGAMYTTHI 329
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D V+ S
Sbjct: 330 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDVCIGWPGSMADDVVLEKS 389
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ DA ++G ++YPL +W++ P T F R + +
Sbjct: 390 MLHQR---AAAGMMHDACLVGGASYPLMDWVMVPYTHQNLTWTQHAFNEKVGDIRNVAVD 446
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
+F LK ++ CL + Q +V+ A LHNI
Sbjct: 447 AFLRLKTRWACLQKRTEVKLQDLPVVLGACCVLHNI 482
>gi|301623609|ref|XP_002941106.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 348
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 123/272 (45%), Gaps = 12/272 (4%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + +L+++ P + T ++SA++++++ L ++ G + + G S +
Sbjct: 1 MSESCFRSLLQKVGPLIQKDDTIMRKSISAEQRLIATLRFLATGRSFADLKFLTGISAQA 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ I E IV+T +K+PE E I F FPN G +DG + I+ P
Sbjct: 61 LGHIIPETCNAIVETLKGEYLKFPETSEEWQVIAQQFNDYWNFPNCGGAIDGKHIRINPP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR 188
F N G ++ ++AI NY+F V+ G + D V+ + +K +
Sbjct: 121 PNSGSYFFNYKGFFSIVLLAIVNANYEFIMVDIGKNGRLSDGGVIEQTHFNQKLKSKQLN 180
Query: 189 PFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
+A V +GD A+ L E L+ P + + F R RR++EN+FGI
Sbjct: 181 LPTNAETKEGLNFVFVGDEAFGLHENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGI 240
Query: 241 LKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
L +F + + L P+ MVVL+ LHN
Sbjct: 241 LANRFRIFHTSINLCPEKIDMVVLSCCVLHNF 272
>gi|390363706|ref|XP_003730429.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 163
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 73/125 (58%), Gaps = 6/125 (4%)
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+C +K V A WPGS HD+R++ +S + F E G + + VI+GDS YPLK WLI
Sbjct: 1 MCTTRFKITNVVARWPGSTHDSRILTHSNIAADF-EAGRK---EGVIIGDSGYPLKPWLI 56
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTT 267
P +NP A+ + R H RTR IE G +K KFPCL +R++P+ A ++A
Sbjct: 57 TP-IQNPQTPAEYAYNRAHPRTRVFIEQVNGQIKAKFPCLAVGLRVAPKHACRTIVACAV 115
Query: 268 LHNIA 272
L N+A
Sbjct: 116 LFNMA 120
>gi|241669308|ref|XP_002411397.1| transposase, putative [Ixodes scapularis]
gi|215504029|gb|EEC13523.1| transposase, putative [Ixodes scapularis]
Length = 411
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 148/316 (46%), Gaps = 20/316 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+E FR++ + +L+ + T + S + ++L L W+G G+ + ++ A
Sbjct: 74 FREHFRITRSSFRNLLKTLWDDTDTKSH---GWSHETELLIFLFWLGCGAVFKVVSACFN 130
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFL---RKGGFPNVAGCVDGT 121
+++ ++ V+ I+ RMV +P E+L +I +F R F + AG V
Sbjct: 131 TPRTTTFSVVNRVLERIISKL-DRMVYFPTSEDLREIKASFASLARDNKFRSCAGVVGSC 189
Query: 122 MVLIDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + AP H D + + +++ + A+C F V A +PGSV + V+ NS ++
Sbjct: 190 HIQVQAPESMHMDYYCRKRASYSIQMQAVCDHRGVFLDVFAGYPGSVPCSCVLENSPLFV 249
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRT--RNPDAAADQRFLRCHKRTRRLIENSF 238
P + +LGDS+YP + + T P A + F H R ++E +F
Sbjct: 250 G----ALYPPQGSTLLGDSSYPCIDAPVAIATPYGAPRDAVQRHFNAVHARACCVVEQAF 305
Query: 239 GILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVE---IHDDEAED 293
++K ++ + +++S Q A VV A +HN+ + + VE + DEA+
Sbjct: 306 ALMKGRWKSVFTKPLQVSIQKAPQVVAACAAMHNVCTCLGDFSLEAVVEQCSLQGDEADG 365
Query: 294 DGEEEAVDDPDEIDEE 309
+ EE + P+ ++++
Sbjct: 366 EAVEETDEAPEAVEDD 381
>gi|356563940|ref|XP_003550215.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 536
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 130/297 (43%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FK+AFR+ + I EE+ A+ T RN A+ +++V CL + G P I+
Sbjct: 210 FKKAFRMGRETFDMICEELNSAIVKEDTTLRN-AIPVRQRVAVCLWRLATGDPLRIVSKR 268
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + + WP+E L + F G PNV G + +
Sbjct: 269 FGLGISTCHKLVLEVCTAIKSVLMPKYLNWPDEVALRRVKSEFEGVSGIPNVVGSMYTSH 328
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
V I AP + N+ +++ V + F V WPGS+ D +V+
Sbjct: 329 VPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDHRGVFTDVCIGWPGSMPDDQVLEK 388
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S ++++ + I+G S YPL +W++ P ++ F +++
Sbjct: 389 SALFQRANGG---LLKGDWIVGSSGYPLMDWVLVPYSQQNLTWTQHAFNEKIGEVQKVAR 445
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
++F LK ++ CL + Q +V+ A LHNI L ++ + V++ DDE
Sbjct: 446 DAFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNICELKGEKIDPELKVDLMDDE 502
>gi|294939454|ref|XP_002782478.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239894084|gb|EER14273.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 319
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 126/269 (46%), Gaps = 25/269 (9%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
+++L L ++ GS I G S+ + RS+ EVV I++ +R +++P +
Sbjct: 46 KRLLVALRYLANGSFQLISGDALDCSQEYVSRSVTEVVDAIIENR-ERFIRFPSDE---- 100
Query: 102 PMTFLRKGG-----FPNVAGCVDGTMVLID--APNEHEDQFVNRNGDHALNVMAICGPNY 154
T +R P G +DGT +L+ P + FV+ + +LNV A+ G N
Sbjct: 101 --TIVRGDADVDPRLPRCIGLIDGTHILLKDVVPIDQRADFVDSKKNISLNVQAVLGANN 158
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVIL-GDSAYPLKEWLIPPRTR 213
+ +VN++ PGSVHD+RV ++S ++ + R P L GDSAY L L+ P
Sbjct: 159 LWVSVNSSNPGSVHDSRVYQSSRLFLRM-----RSIPAGYWLCGDSAYGLSPSLLTPYND 213
Query: 214 NPDAAADQRFLRCHKRTRRLIENSFGILKEKFP-CLNYMRLS-PQFAGMVVLASTTLHNI 271
A+ F HK R +E F LK + C +RL+ P A VLA LHN
Sbjct: 214 PIPGTAEASFNIAHKSVRSTVERGFRKLKRSWAICNGSLRLTPPGKAAKAVLACFCLHNY 273
Query: 272 ALTIDR---DNENQNVEIHDDEAEDDGEE 297
+ D +++ V D+E + G E
Sbjct: 274 QKMMGEPEPDVDDEVVVPEDEEGVNFGAE 302
>gi|149537657|ref|XP_001519824.1| PREDICTED: putative nuclease HARBI1-like, partial [Ornithorhynchus
anatinus]
Length = 223
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 100/193 (51%), Gaps = 13/193 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
++ + F F + ++++ + +L PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 ISTYGFSREF------IRYLVDLLGASLCRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ R +++PE+ ++ + F G P V G +
Sbjct: 90 GDAVGISQASMSRCVANVTEALVERA-SRFIRFPEDEMSVQGLKGDFYGLAGMPGVVGVL 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D T V I APN + +VNR G H+LN + +C V +WPGS+ D+ V++ S +
Sbjct: 149 DCTHVTIKAPNAEDLSYVNRKGLHSLNCLMVCDIRGTLLHVETHWPGSLRDSAVLQRSAL 208
Query: 179 YRKFD----EEGW 187
+ + ++GW
Sbjct: 209 AAQLEAGMPKDGW 221
>gi|356532740|ref|XP_003534929.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 527
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 72/296 (24%), Positives = 129/296 (43%), Gaps = 14/296 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S + + I +E+ A+T T A+ +++V C+ + G P ++
Sbjct: 201 FRRWFRMSRSTFDMICDELDAAVTKKNTMLRDAIPVRQRVAVCIWRLATGDPLRLVSKRF 260
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + ++WP EE I F + G PNV G + T V
Sbjct: 261 GLGISTCHKLVLEVCSAIRTVLMPKFLQWPNEEKTKQIKDEFEKISGIPNVGGAMYTTHV 320
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + N+ +++ V + F V WPGS+ D RV+ S
Sbjct: 321 PIIAPKISVSAYFNKRHTERNQKTSYSITVQGVVDSKGVFNDVCIGWPGSMPDDRVLEKS 380
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
++++ I+G+S +PL +W++ P T F + + + +
Sbjct: 381 ALFQRASRGN---LKGVWIVGNSGHPLMDWVLVPYTHANLTWTQHAFNEKIEEIQGVAKE 437
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQ-NVEIHDDE 290
+F LK ++ CL + Q +V+ A LHNI + + +++ +I DDE
Sbjct: 438 AFARLKGRWGCLQKRTEVKLQDLPVVLGACCVLHNICEMRNEEMDDEWRFQIFDDE 493
>gi|449683828|ref|XP_004210468.1| PREDICTED: uncharacterized protein LOC101239328 [Hydra
magnipapillata]
Length = 302
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 120/277 (43%), Gaps = 31/277 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ YH
Sbjct: 1 MEECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYH 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +H+ + + D VK+P ENLN F GFP G
Sbjct: 60 VVSSLFGIGKSTANLIVHKFINAVNDILLPIYVKFPLSVENLNKHSRDFEAILGFPQCVG 119
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I AP + + N +++ + A+ Y+F + PG +D+ +++NS
Sbjct: 120 AVDGCHIPISAPKDQAISYYNYKRWYSIVLFAVVDCRYRFIYTSVGSPGRNNDSYILQNS 179
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+ + + + LGDS F+ RR+ EN
Sbjct: 180 SLKAILESKLFDKCCKE--LGDS-----------------------FVLLCFIARRVAEN 214
Query: 237 SFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
+FG +K +F + +M FA +V A LHNI
Sbjct: 215 AFGCVKARFRVICEHMECDINFATRIVNACVILHNIC 251
>gi|328714499|ref|XP_003245376.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 180
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 82/161 (50%), Gaps = 5/161 (3%)
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV 194
F R +LNV A+ +F + A WPGS HD+ + RNS ++ + E G F
Sbjct: 8 FGKRIKYFSLNVQALVNSELQFMDIVARWPGSAHDSHIFRNSRLFARL-ESG--EFQKMA 64
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRL 253
ILGDS Y LK +L+ P NP + RTR ++E SFG+ K +FP L+ +RL
Sbjct: 65 ILGDSGYALKPYLLTP-ISNPVGRIQMLYNESQIRTRNVVERSFGVWKRRFPVLSLGLRL 123
Query: 254 SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
+ +++A+ LHNI + D N N E+ + E +
Sbjct: 124 QLKTVQAIIVATAVLHNICREMKEDLPNDNFELTEPHNETE 164
>gi|328704981|ref|XP_003242661.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 242
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 108/210 (51%), Gaps = 18/210 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +D T + I AP HE +VN +G+H+LNV AI + K +NA +PG+ +D+
Sbjct: 20 FESTLGAIDCTHINILAPVVHEGAYVNHHGNHSLNVQAIVDSDLKILNINARYPGARNDS 79
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
+ S V R + ++GD+ YPL+ WL+ P + + + + +
Sbjct: 80 YIWSTSAVRRGMEFHYNNGERRTWLIGDAGYPLEPWLMTPLPHYQEGTREFEYTKNLCKA 139
Query: 231 RRLIENSFGILKEKFPCLNYMRL---SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R ++E FG+ K CL+Y R+ +P AG +V A TLHN+ + + N+ I
Sbjct: 140 RNVVERFFGVFKR---CLSYQRVLMYAPDKAGRIVNACATLHNMRI-------HYNLPIP 189
Query: 288 DDE--AEDDGEE--EAVDDPDEIDEEEQPR 313
++E E G+ E ++ P+ ID++ PR
Sbjct: 190 EEELAGEQIGQNIYENIEIPN-IDQQGGPR 218
>gi|357131373|ref|XP_003567313.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 455
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 118/274 (43%), Gaps = 12/274 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+ + I +E+ A+ T A+ ++V C+ + G +H+I
Sbjct: 134 FRSHFRMCRETFDMICDELGSAIAKEDTPSRPAIPVCQRVAVCIWRLATGDAFHVISNRF 193
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G + S+ + + EV I R ++WP + + +F G PNV G + T +
Sbjct: 194 GLANSTCQKIVLEVCVAIESILMPRFIRWPGQ-ATALEASFKAISGIPNVIGSMYTTHIS 252
Query: 125 IDAPNEHEDQFVN----RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
I AP + N R +++ + P+ F V+ WPGS+ D RV+ S +
Sbjct: 253 IIAPRFFPLHYCNHICIRGSFYSIIFQGVVDPDGTFIHVSIGWPGSMSDDRVVDQSHLSH 312
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
+ + + ++G + YPL +WL+ P T Q F + + +++F
Sbjct: 313 LYGNDA---MAGSWVVGGTNYPLNDWLLVPYTHQNLTREQQIFNENVTKLNHVAKDAFAR 369
Query: 241 LKEKFPCLNYMRLSPQFAGMVVLAS--TTLHNIA 272
LK ++ CL R V++A + LHNI
Sbjct: 370 LKGRWACLQ-KRADVNIKNFVLMAGACSVLHNIC 402
>gi|356561879|ref|XP_003549204.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 451
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 129/297 (43%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S + I + + A+T T A+ +++V C+ + G P +
Sbjct: 124 FRLHFRMSKATFDMICQHLDSAVTKKNTMLREAIPVRQRVAVCIWRLATGDPLREVSKRF 183
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + WP E + I F G PNV G + T +
Sbjct: 184 GLGISTCHKLVLEVCSAIKSVLMPKFLHWPNEAEMKPITEKFESLSGIPNVGGSMYTTHI 243
Query: 124 LIDAPNEHEDQFVNRNGD-------HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + + N+ +++ V + P F V WPGS+ D +V+ S
Sbjct: 244 PIIAPKSNVNAYFNKRHTERNQKTCYSITVQGVVDPKGIFSDVCIGWPGSLSDDQVLEKS 303
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y++ + D ++G+S +PL E ++ P T F + + + ++
Sbjct: 304 ALYQRATMGNLK---DVWVVGNSGHPLMEGVLVPYTHQNLTWTQHAFNQKVGEIQSIAKD 360
Query: 237 SFGILKEKFPCLNYMRLSPQFAGMVVL--ASTTLHNIALTIDRDNENQ-NVEIHDDE 290
+F LK ++ CL R + + VL A LHNI D + + Q N +I DD+
Sbjct: 361 AFARLKGRWSCLQ-KRTEVKLEDLPVLLGACCVLHNICEMRDEEMKPQWNFDIFDDQ 416
>gi|449671020|ref|XP_004207411.1| PREDICTED: uncharacterized protein LOC101237154 [Hydra
magnipapillata]
Length = 311
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 121/281 (43%), Gaps = 30/281 (10%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M KE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 1 MEECELKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYR 59
Query: 59 ---IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPN 113
++ + G KS+ +HE + + D + VK+P E LN F FP
Sbjct: 60 NVRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVEYLNKHSRDFKDILDFPQ 119
Query: 114 VAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDG + I AP + + N G +++ + A+ Y+F + P
Sbjct: 120 CVGAVDGCHIPISAPKDQAISYYNYKGWYSIVLFAVVDCRYRFIYTSVGSP--------- 170
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRR 232
E G P + GDSA+PL L+ P N + + Q+ F + RR
Sbjct: 171 ----------ELGDSLVP-LCLTGDSAFPLTRHLLKPYPENLELSEIQKNFNKILCGARR 219
Query: 233 LIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
++EN+FG ++ +F + M FA +V A TLHNI
Sbjct: 220 VVENAFGCVRARFRVICKRMECDINFATRIVNACVTLHNIC 260
>gi|345496435|ref|XP_001603548.2| PREDICTED: hypothetical protein LOC100119837 [Nasonia vitripennis]
Length = 331
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 141/301 (46%), Gaps = 17/301 (5%)
Query: 12 LSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSI 71
+++ Q + ILE + P LT ++R AL+ + ++ + L+++ G+ +S++
Sbjct: 1 MTSEQFDEILELVRPQLTKRSKRR-ALTPEMRLAAVLNFLAHGNSIQKSAWMFLIGRSTM 59
Query: 72 CRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNE 130
R + EV I + ++ V +P +++L+ I + R PN G +DG + + AP
Sbjct: 60 YRLVTEVCTAICNVLEEKYVSFPSQDDLSVIANMYWRIWHMPNCFGAIDGKHIRVKAPPN 119
Query: 131 HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE----- 185
F N ++ +M + +F VN GS +DA + + S + + D E
Sbjct: 120 SGSYFFNYKKHFSIVLMGLSDAFCRFIWVNIGDFGSSNDAGIFQRSDLRQALDNEEIDIP 179
Query: 186 --GWRPFPDAV----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSF 238
+ P D + I+GD A+PLK +L+ P TR + +++ + R R IE +F
Sbjct: 180 APTYLPRTDVLCPYFIIGDGAFPLKNYLMKPYTRANNLTHEEKIVNYRLSRARLTIERAF 239
Query: 239 GILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALT--IDRDNENQNVEIHDDEAEDDG 295
GIL +K+ L + + V++A LHN +T + +D + H E++
Sbjct: 240 GILTKKWRILESPVDWKLKNIETVIMALICLHNFLITEEMSKDEAERKYVFHPYNIENEA 299
Query: 296 E 296
E
Sbjct: 300 E 300
>gi|403182852|gb|EJY57671.1| AAEL017220-PA [Aedes aegypti]
Length = 367
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 124/287 (43%), Gaps = 24/287 (8%)
Query: 6 FKEAFRLSATQVE--FILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE F +S + F L E A T + K K+ + ++ G + +
Sbjct: 8 FKENFHMSKNSFDQLFKLVEKDLAPKRNTRPKDGIPPKLKLGLVIEYLASGGLQRHLASC 67
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ SK + I +V I + +E+ ++ F + FPN G +DG V
Sbjct: 68 YRVSKQHMGSIIDQVCDAICRALSAYVADPCQESFLEVANGFNSRWNFPNCIGAIDGKHV 127
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS-----MV 178
I AP F N G H+L +MAIC +Y+F ++ GS D + + S ++
Sbjct: 128 SIKAPPNAGSIFYNYKGFHSLALMAICDASYRFTYLDVGAYGSEGDCNIFKESKFGTDVL 187
Query: 179 YRKFDEEGWRPFPDAVI----------LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
+ + D FP+ + D A+PL + +I P ++ +A ++ F
Sbjct: 188 HDRLD------FPENATVNGVKLPFFYVADDAFPLCKRIIKPYSKKNLSAEERIFNYRLS 241
Query: 229 RTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALT 274
R RR IEN+FG+L K+ CL + SP A ++ A LHN +T
Sbjct: 242 RARRCIENAFGLLCSKWACLKKTLYCSPDRAQKIISACCMLHNFLIT 288
>gi|356552466|ref|XP_003544588.1| PREDICTED: uncharacterized protein LOC100804219 [Glycine max]
Length = 534
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 131/297 (44%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
F++AFR+ + I +E+ A+ T RN A+ +++V CL + G P I+
Sbjct: 208 FRKAFRMGRETFDMICDELNSAIVKEDTTLRN-AIPVRQRVAVCLWRLATGDPLRIVSKR 266
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + + WP+E +L + F G PNV G + +
Sbjct: 267 FGLGISTCHKLVLEVCTAIKSVLMPKYLNWPDEGSLRRVKSEFEGVSGIPNVVGSMYTSH 326
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
V I AP + N+ +++ V + F V WPGS+ D +V+
Sbjct: 327 VPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDHRGVFTDVCIGWPGSMPDDQVLEK 386
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S ++++ + I+G S YPL +W++ P ++ F +++
Sbjct: 387 SALFQRANGG---LLKGVWIVGSSGYPLMDWVLVPYSQQNLTWTQHAFNEKIGEVQKVAR 443
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
++F LK ++ CL + Q +V+ A LHNI L ++ + V++ DDE
Sbjct: 444 DAFARLKGRWSCLQKRTEVKLQDLPVVLGACCVLHNICELKGEKIDPELKVDLVDDE 500
>gi|328700758|ref|XP_003241371.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 394
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 126/303 (41%), Gaps = 14/303 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FR+S + +L ++ + ++S E++ L ++ G+ + + ++
Sbjct: 57 FFNYFRMSIKSFDELLSKLESGIKVSNSGRPSISPTERLCVTLRYLASGNTFTDLQYSYR 116
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S+I + +V I + + EE +I + F + FPN G +DG + +
Sbjct: 117 MGISTISGIVEDVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFPNCIGALDGKHIRV 176
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
P + F N +++ +MAIC NY F V+ G D+ V +N + K + E
Sbjct: 177 IKPIKSGSLFYNYKHYYSIVLMAICDANYCFTFVDVGAYGKFSDSSVFKNGKFFEKLENE 236
Query: 186 GW-----RPFPD--------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
+P P VIL D A+ + + ++ P R+ + F R RR
Sbjct: 237 TLSIPQPKPLPGDNENGPLPYVILADEAFGVSKTILRPYARSNLNYKKKIFNYRLSRARR 296
Query: 233 LIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
IE+SFGIL K+ + M S + + +V A LHN D + I E
Sbjct: 297 YIESSFGILSNKWRIFHRPMNTSVELSTKIVKACCVLHNFVRVRDGYKFAHTLSIQGFET 356
Query: 292 EDD 294
+D
Sbjct: 357 SND 359
>gi|449672868|ref|XP_004207811.1| PREDICTED: uncharacterized protein LOC101239908 [Hydra
magnipapillata]
Length = 312
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 105/239 (43%), Gaps = 24/239 (10%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
+S ++V LH++ Y ++ + G KS+ +HE + + D + VK+P
Sbjct: 5 ISVVKRVAVALHYLASCEEYRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSV 64
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
ENLN F GFP G VDG + I AP + + N G +++ + A+ Y+
Sbjct: 65 ENLNKHSRDFEAILGFPQCVGAVDGCHIPISAPKDQATSYYNYKGWYSIVLFAVVDSRYR 124
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNP 215
F + P E G P +I GDSA+PL L+ P N
Sbjct: 125 FIYTSVGSP-------------------ELGDSLVPLCLI-GDSAFPLTRHLLKPYPENL 164
Query: 216 DAAADQR-FLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
+ + Q+ F + RR++EN FG +K +F + M FA +V A TLHNI
Sbjct: 165 ELSEIQKNFNKILCGARRVVENVFGRVKARFRVICKRMECDINFATRIVNACVTLHNIC 223
>gi|242091581|ref|XP_002441623.1| hypothetical protein SORBIDRAFT_09g030610 [Sorghum bicolor]
gi|241946908|gb|EES20053.1| hypothetical protein SORBIDRAFT_09g030610 [Sorghum bicolor]
Length = 457
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/320 (22%), Positives = 133/320 (41%), Gaps = 19/320 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + + + A+ T A+ +++V C+ + G P ++
Sbjct: 131 FRAAFRMGRATFAMLCDALGSAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRLVSKRF 190
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+ D F + G P V G + T +
Sbjct: 191 GLGISTCHKLVLEVCAAIRGLLMPRFLRWPDAAAADAFKARFQAESGIPGVVGAIYTTHI 250
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 251 PIIAPKTSVAAYFNRRHTERNHKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLERS 310
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+ ++ + ++G +++PL EW++ P + A F RR+ +
Sbjct: 311 ALQQRAAAG---MMAGSWVVGGASFPLTEWVLTPYAQANLTWAQHAFNEKVAEVRRVAVD 367
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI + E + + D +D
Sbjct: 368 AFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNI---CESRGEGMDAALRCDITDD-- 422
Query: 296 EEEAVDDPDEIDEEEQPRQD 315
E ++P D+ + R D
Sbjct: 423 -ETVPENPVRSDDATKARDD 441
>gi|388495434|gb|AFK35783.1| unknown [Medicago truncatula]
Length = 512
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 126/307 (41%), Gaps = 25/307 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S I E+ ++T T A+ +++V C++ + G P ++
Sbjct: 186 FRRCFRMSKQTFNMICNELDSSVTKKNTTLRDAIPVRQRVAVCIYRLATGEPLRLVSKKF 245
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I Q+ ++WP EE + F G PN+ G + T V
Sbjct: 246 GLGISTCHKLVLEVCAAIKSVLMQKFLRWPDEETTKETKQAFEGSFGIPNIGGVMYTTHV 305
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + N+ +++ V + F V WP S+ D V+ S
Sbjct: 306 PIIAPKVNVGLYFNKKLTERNQKTSYSVTVQGVVNSKGVFTDVCIGWPDSMRDNTVLEKS 365
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y + + I+G+S Y L +W++ P F ++ ++
Sbjct: 366 ALYER---AKLGNLKNIWIVGNSGYSLMDWVLVPYKHQNLTWTQHGFNERIGEIEKIGKD 422
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI EI ++E +D+
Sbjct: 423 AFSRLKGRWGCLKKRTEIKLQDLPVVLGACCVLHNIC------------EIMNEEMDDEW 470
Query: 296 EEEAVDD 302
+ E DD
Sbjct: 471 KFEVFDD 477
>gi|328724899|ref|XP_001949872.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 325
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 105/219 (47%), Gaps = 14/219 (6%)
Query: 61 GATHGPSKSS---ICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAG 116
G G SK++ I R + + + F VK P E + + F + FP G
Sbjct: 96 GDFMGVSKTTASLIVRDVSTAIAKLRPIF----VKMPIESEIPTMQKRFYKIARFPRTIG 151
Query: 117 CVDGTMVLIDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T + I P + + + NR G +LNV + P+ K V A WPGS HD V++
Sbjct: 152 AIDCTHLKIQNPGGPNAEYYRNRKGYFSLNVQTVSCPDLKIMDVVARWPGSCHDYTVLKK 211
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S + + + W +++I+ DS Y ++ P NP ++ + RTR ++
Sbjct: 212 SRFFNRLNSGEW---GNSLIVADSGYANSLRIVTPFI-NPRNYIEELYNESIIRTRNPVK 267
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
S+G+LK +FP L++ RL Q V++A LHNIA+
Sbjct: 268 RSYGVLKRRFPILSFGSRLKLQTTQAVIVACCVLHNIAI 306
>gi|328702103|ref|XP_003241802.1| PREDICTED: hypothetical protein LOC100570146 [Acyrthosiphon pisum]
Length = 358
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 123/301 (40%), Gaps = 38/301 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++ FRLS T E L AP L T E + ++LS + + + +
Sbjct: 70 FQQHFRLSPTSFERCLLICAPLLQKTCVEGRKGSDERLQLLSVIWLLATPDSFRSVSDRF 129
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
KSS+ S V V+ ++ WPE L D+ +L +
Sbjct: 130 DIGKSSLHDSFVRVTTVL-HRLSPTIINWPEVLLMDLIFQYLHQSA-------------- 174
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ ++ R HA+ + A+C + A +PGSV D R+ NS++Y + +
Sbjct: 175 --------NSYLTRKCFHAITLQAVCDDKLRIIDAFAGYPGSVGDRRIFTNSLIYERILQ 226
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSFGILKE 243
R AYP+ EW IPP + + ++F R R IE F +LK
Sbjct: 227 NKNRY---------QAYPILEWCIPPYIDRGNLSEQEKFFNVSLSRARLSIERCFALLKS 277
Query: 244 KFPCLNYMRLSP-QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAE--DDGEEEAV 300
+F L Y+ + + ++A LHNI L D DNE N I + ++ DD E +
Sbjct: 278 RFRRLKYLDMKKVELIPQTIIACCVLHNICLQFD-DNEWMNNLISEQQSNIGDDITAEQL 336
Query: 301 D 301
+
Sbjct: 337 N 337
>gi|212276182|ref|NP_001130453.1| hypothetical protein [Zea mays]
gi|194689168|gb|ACF78668.1| unknown [Zea mays]
gi|223943395|gb|ACN25781.1| unknown [Zea mays]
gi|224029997|gb|ACN34074.1| unknown [Zea mays]
gi|224030945|gb|ACN34548.1| unknown [Zea mays]
gi|414881571|tpg|DAA58702.1| TPA: hypothetical protein ZEAMMB73_471184 [Zea mays]
Length = 568
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 137/337 (40%), Gaps = 28/337 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I + + ++ T A+ +++V C+ + G P ++
Sbjct: 242 FRRAFRMGRGTFDMICDALGSSVAKEDTMLRAAIPVRQRVAVCVWRLATGEPLRLVSKRF 301
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+ + F R G P V G + T +
Sbjct: 302 GLGISTCHKLVLEVCAAIKSVLMPRFLQWPDADAAARFKEGFERASGVPGVIGAMYTTHI 361
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 362 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLEKS 421
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ D+ ++ ++YPL +W++ P T F RR+
Sbjct: 422 MLHQRAAAG---MMHDSWLVAGASYPLMDWVLVPYTHPNLTWTQHAFNEKVAELRRVAVE 478
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDG 295
+F LK ++ CL + Q +V+ A LHNI EI +E +
Sbjct: 479 AFARLKARWACLQKRTEVKLQDLPVVLGACCVLHNI------------CEIRGEEVDPAI 526
Query: 296 EEEAVDD---PDEIDEEEQPRQDRNNPGTQRLARLLN 329
E VDD P+ E + R+N L R L
Sbjct: 527 RCELVDDETTPENPVRSETASRARDNIAHNLLHRGLG 563
>gi|328698183|ref|XP_003240572.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 240
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/215 (31%), Positives = 97/215 (45%), Gaps = 13/215 (6%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQ--------FVNRNGDHALNVMAICGPNYKFYAVNAN 162
FP V G +DG + I AP E + F NR + + I + KF A
Sbjct: 23 FPGVFGAIDGCHISILAPWEKRTKMPKLNRTMFYNRKQVPTVLLQGIVDSDLKFIDCFAG 82
Query: 163 WPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQ 221
WPGS HDARV R S++ K + P ILGD AYPL L+ P N + Q
Sbjct: 83 WPGSSHDARVFRRSIIGEKLLSQPCVILPPGCHILGDGAYPLTSTLMVPFKDNGHLSDSQ 142
Query: 222 -RFLRCHKRTRRLIENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTTLHNIALTIDRDN 279
+F +C +R +IE +F L +F L +M + + V+ A+ LHN L ID +
Sbjct: 143 LKFNKCLSSSRVVIEQAFRKLIGRFRKLKHMDIYHKENCSKVITAACCLHN--LCIDNSD 200
Query: 280 ENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQ 314
+ + EI+ E ED E + +E +Q
Sbjct: 201 DFNSTEIYTSEMEDGNHHEDDSLSGNVKRQELCQQ 235
>gi|449690961|ref|XP_002170619.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 209
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 68/124 (54%), Gaps = 5/124 (4%)
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
LN+ +TF FP + G +DGT V I P ++E +VNR+ +H++NV +C N KF
Sbjct: 91 LNESRVTFYNVAEFPRITGLIDGTHVSIQKPRKYEYVYVNRSSNHSINVQVVCDYNGKFI 150
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA 217
+ A WPG HDAR++R S + +K + + ++L D YP WL+ P +P
Sbjct: 151 DIVAKWPGITHDARILRESKLSKKMIDGTLK----GLLLSDFGYPCFRWLLTPYL-SPTT 205
Query: 218 AADQ 221
AA
Sbjct: 206 AAQH 209
>gi|26351017|dbj|BAC39145.1| unnamed protein product [Mus musculus]
Length = 246
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 108/237 (45%), Gaps = 29/237 (12%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + F++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYFLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPVDEAAVQSLKDEFYGLAGMPGVIGVA 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSL 208
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
+F ++ P WL+ A+ + +F+ K RR+ E
Sbjct: 209 TSQF---------------ETGMPKDSWLL-----GKFASINFQFMSIEKLCRRVGE 245
>gi|449679794|ref|XP_004209423.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 225
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 65/111 (58%), Gaps = 6/111 (5%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP + G +DGT V I P EHE +VNR+ +H +NV A+CG N +F + A W S H+A
Sbjct: 121 FPRITGLIDGTHVCIQKPREHEYVYVNRSSNHLINVQAVCGYNGEFIDIVAKWHESTHNA 180
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ 221
R++R S + +K ++ ++LGDS YP WL+ P +P AA +
Sbjct: 181 RILRESKLGKKMIDD-----LKGLLLGDSGYPCFRWLLTPYL-SPTTAAQR 225
>gi|149022650|gb|EDL79544.1| rCG26755 [Rattus norvegicus]
Length = 302
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 93/185 (50%), Gaps = 9/185 (4%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + +++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 59 MSMYGFPRQF------IYYLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 112
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G V
Sbjct: 113 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPADEAAIQSLKDEFYGLAGMPGVIGAV 171
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 172 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSL 231
Query: 179 YRKFD 183
+F+
Sbjct: 232 SSQFE 236
>gi|403182853|gb|EJY57672.1| AAEL017220-PB [Aedes aegypti]
Length = 319
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 110/246 (44%), Gaps = 22/246 (8%)
Query: 45 LSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT 104
+S L ++ G + + + SK + I +V I + +E+ ++
Sbjct: 1 MSLLRYLASGGLQRHLASCYRVSKQHMGSIIDQVCDAICRALSAYVADPCQESFLEVANG 60
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F + FPN G +DG V I AP F N G H+L +MAIC +Y+F ++
Sbjct: 61 FNSRWNFPNCIGAIDGKHVSIKAPPNAGSIFYNYKGFHSLALMAICDASYRFTYLDVGAY 120
Query: 165 GSVHDARVMRNS-----MVYRKFDEEGWRPFPDAVI----------LGDSAYPLKEWLIP 209
GS D + + S +++ + D FP+ + D A+PL + +I
Sbjct: 121 GSEGDCNIFKESKFGTDVLHDRLD------FPENATVNGVKLPFFYVADDAFPLCKRIIK 174
Query: 210 PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTL 268
P ++ +A ++ F R RR IEN+FG+L K+ CL + SP A ++ A L
Sbjct: 175 PYSKKNLSAEERIFNYRLSRARRCIENAFGLLCSKWACLKKTLYCSPDRAQKIISACCML 234
Query: 269 HNIALT 274
HN +T
Sbjct: 235 HNFLIT 240
>gi|26334701|dbj|BAC31051.1| unnamed protein product [Mus musculus]
Length = 244
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 92/185 (49%), Gaps = 9/185 (4%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M+ + F F + F++E + +L+ PT+R+ A+S + ++L+ L + GS +
Sbjct: 36 MSMYGFPRQF------IYFLVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRM 89
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G G S++S+ R + V +V+ Q + +P E + + F G P V G
Sbjct: 90 GDAIGISQASMSRCVANVTEALVERASQ-FIHFPVDEAAVQSLKDEFYGLAGMPGVIGVA 148
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D V I APN + +VNR G H+LN + +C V +WPGS+ D V++ S +
Sbjct: 149 DCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSL 208
Query: 179 YRKFD 183
+F+
Sbjct: 209 TSQFE 213
>gi|328721185|ref|XP_003247234.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 244
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 92/203 (45%), Gaps = 16/203 (7%)
Query: 89 RMVKWP---EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHE--------DQFVN 137
+ ++WP E + LR FP V G VDG + I AP E + F N
Sbjct: 38 KFIRWPRGQEAIITVQKFNCLRPNAFPGVLGAVDGCHISILAPWEKRTVMEKLDRNMFYN 97
Query: 138 RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-IL 196
R ++ + I + KF V + WPGS HDARV R S++ +K P+ IL
Sbjct: 98 RKQVPSVLLQGIVDSDLKFIEVFSGWPGSSHDARVFRRSLIGQKLLSHDLSILPENCHIL 157
Query: 197 GDSAYPLKE-WLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL-- 253
GD AYPL E +IP R A + F RC +R ++E +FG L +F L +M +
Sbjct: 158 GDGAYPLSENVMIPYRDNGNLTLAQKHFNRCLSSSRVVVEQAFGKLYCRFRKLKHMDVYH 217
Query: 254 -SPQFAGMVVLASTTLHNIALTI 275
S Q + S + LT+
Sbjct: 218 KSLQLVAYITYVSICKTILMLTL 240
>gi|449678529|ref|XP_002155205.2| PREDICTED: uncharacterized protein LOC100210225, partial [Hydra
magnipapillata]
Length = 338
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 109/230 (47%), Gaps = 13/230 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L +T T R +S ++V LH++ Y
Sbjct: 97 MEECEFKEHFRVNRNTFNFLVNELHPHLGKSTTTMRE-PISVVKRVAVALHYLASCEEYR 155
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +HE + + D + VK P ENLN F GFP G
Sbjct: 156 VVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKIPLSVENLNKHSRDFEAILGFPQCVG 215
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG +LI A + + N G +++ + ++ Y+F + PG +D+ +++NS
Sbjct: 216 AVDGCHILILALKDQAISYYNYKGWYSIVLFSVVDCRYRFIYTSVGSPGRNNDSYILQNS 275
Query: 177 ----MVYRKFDEEGWRPFPDAVI----LGDSAYPLKEWLIPPRTRNPDAA 218
++ ++ + D+++ +GDSA+PL L+ P N + +
Sbjct: 276 SLKAILESSLFDKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPENLELS 325
>gi|328710225|ref|XP_003244198.1| PREDICTED: hypothetical protein LOC100573332 [Acyrthosiphon pisum]
Length = 423
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 127/282 (45%), Gaps = 17/282 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK R+S +L + P + T+ A+ + E++ L ++ G + +
Sbjct: 65 FKNCCRMSYETFSQLLTLVDPKIRKEDTKFRKAIPSNERLALTLRYLATGDSFSSLSLIF 124
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
SKSSI I EV I++ Q ++ P +E + + FP+ G +DG
Sbjct: 125 KISKSSISHIIPEVCTAIIEVL-QDYIQVPKSQEEWLSVSKKYEDVWNFPHCIGAIDGKH 183
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V + AP F N ++ +MA+ +Y F + G + D V RN+ +++
Sbjct: 184 VQLQAPIGSGSNFYNYKSTFSIVLMAVVDADYNFLYADVGCQGRISDGGVFRNTSFFKQL 243
Query: 183 DEEGWR-PFPDAVI----------LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRT 230
+E+ P P+ +I + D+A+PLKE ++ P + + +D+R R
Sbjct: 244 EEQKLEIPPPEKLIGREKAVPYVFVADAAFPLKENILRPYPGSHEKGSDKRIFNYRLSRA 303
Query: 231 RRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
RR++EN FGIL F M L P+ A +VV+A LHN
Sbjct: 304 RRVVENVFGILSAVFRVFRKPMLLEPKKAELVVMACVYLHNF 345
>gi|297597079|ref|NP_001043414.2| Os01g0582600 [Oryza sativa Japonica Group]
gi|255673396|dbj|BAF05328.2| Os01g0582600 [Oryza sativa Japonica Group]
Length = 500
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/252 (24%), Positives = 108/252 (42%), Gaps = 12/252 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + I E + A+ T A+ +++V C+ + G P ++
Sbjct: 225 FRRAFRMGRETFDMICEALGSAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 284
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP+E F G P V G + T +
Sbjct: 285 GLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQAAYGVPGVIGAMYTTHI 344
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + GP+ F V WPGS+ D +V+ S
Sbjct: 345 PIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSMPDDQVLEKS 404
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
M++++ A ++G ++YPL +W++ P T F RR+ +
Sbjct: 405 MLHQRAAAG---MMHSACLVGGASYPLMDWVLVPYTHQNLTWTQHAFNEKVGDLRRVAVD 461
Query: 237 SFGILKEKFPCL 248
+F LK ++ CL
Sbjct: 462 AFARLKARWACL 473
>gi|226499058|ref|NP_001143085.1| hypothetical protein [Zea mays]
gi|195614032|gb|ACG28846.1| hypothetical protein [Zea mays]
gi|413948677|gb|AFW81326.1| hypothetical protein ZEAMMB73_227542 [Zea mays]
Length = 451
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/317 (22%), Positives = 132/317 (41%), Gaps = 26/317 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + + + A+ T A+ +++V C+ + G P ++
Sbjct: 125 FRSAFRMGRATFAMLCDALGAAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRLVSKRF 184
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND-IPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I R ++WP++ + F + G P+V G + T +
Sbjct: 185 GLGISTCHKLVLEVCAAIRGLLMPRFLRWPDDAAAEAFKSRFQAESGIPSVVGAIYTTHI 244
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + NR +++ + + P+ F V WPGS+ D +V+ S
Sbjct: 245 PIIAPKVSVTAYFNRRHTERNHKTSYSITLQGVVDPDGAFTDVCIGWPGSMPDDQVLERS 304
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+ ++ + ++G ++PL EW++ P + A F RR+ +
Sbjct: 305 ALQQRAAAGM---MAGSWVVGGVSFPLTEWVLAPYAQANLTWAQHAFNEKVAEVRRVAVD 361
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI----------ALTIDRDNENQNVE 285
+F LK ++ CL + Q +V+ A LHNI AL D D++
Sbjct: 362 AFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICESRGEGMDPALRCDIDDDE---T 418
Query: 286 IHDDEAEDDGEEEAVDD 302
+ ++ DG +A DD
Sbjct: 419 VPENPVRSDGATKARDD 435
>gi|72018042|ref|XP_786652.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 304
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 113/241 (46%), Gaps = 24/241 (9%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+SI + + EVV IV+ + +++ P E+ + + ++ FP+ G +DG V
Sbjct: 10 NSISKVVREVVEAIVEEYVDELLRCPTNEQGWRQLAEDWYQRWNFPHTVGAIDGKHVACK 69
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE- 185
AP + N G +++ + A+ +YKF ++A+ GS DA++ S + +
Sbjct: 70 APANSGSTYYNYKGFYSILLFAMVDADYKFIYIDASSKGSASDAQIYNASDLKDGLERNL 129
Query: 186 -----GWRPFPDAV------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
G P P+ I+GD A+ L+ +L+ P + A ++ F R RR++
Sbjct: 130 IMGFPGPDPLPNDTQDVPYFIVGDDAFSLRTYLMKPYSSRYLAREERIFNYRLSRARRVV 189
Query: 235 ENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIALT---------IDRDNENQNV 284
EN+FGIL +F L M+ P+ +V A LHN+ T +DR N ++
Sbjct: 190 ENAFGILANRFQILLTTMQHDPETVKSIVEARCILHNLMRTRYPVLQNRLVDRQRGNGDM 249
Query: 285 E 285
E
Sbjct: 250 E 250
>gi|301101403|ref|XP_002899790.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102792|gb|EEY60844.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 128/262 (48%), Gaps = 17/262 (6%)
Query: 39 SAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN- 97
S ++KV + L+++G Y +GA G S+S + EV+ V+ Q ++ +P +
Sbjct: 45 SYEKKVAAALYFLGSSGGYREVGAAMGMSRSYVKEITDEVIHVLKLVACQ-VISFPRDRR 103
Query: 98 -LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF 156
N I F + G+P V G +DG++V ++ P++ D F R ALNV AI + F
Sbjct: 104 RWNTIEDQFALRQGYPGVVGAIDGSLVEVERPDDF-DGFYCRKSYPALNVQAIVTMDNYF 162
Query: 157 YAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNP 215
+V PGS D + ++S++ R + P +GD+ Y L L+ P +
Sbjct: 163 LSVEVR-PGSWSDRKCWQHSVIARNV----FNIIPAGTHFVGDAGYALSPGLMVPYSDRE 217
Query: 216 DAAA----DQRFLRCHKRTRRLIENSFGILKEKFPCLNYM--RLSPQFAGMVVLASTTLH 269
+ A +F H TR ++E++FG K +F L + +P+ A V++A+ LH
Sbjct: 218 EGGALTERQNKFNYFHSSTRMVVESTFGHWKGRFKILQCTLNQDTPRDASDVIVATIVLH 277
Query: 270 NIALTI-DRDNENQNVEIHDDE 290
N+ ++ D + VE DDE
Sbjct: 278 NLIISFRDAAAIPRYVEEDDDE 299
>gi|72145861|ref|XP_795502.1| PREDICTED: uncharacterized protein LOC590821 [Strongylocentrotus
purpuratus]
Length = 372
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 127/297 (42%), Gaps = 17/297 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
AFK R+ + +L + P +T T AL K+ L + G+ Y +
Sbjct: 75 AFKNFMRVPQEMFDELLTRVGPRITKQKTNYRDALHPGLKLALTLRDLASGTKYRSMSYG 134
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGT 121
++I I EV I+ + M+K P E I F+ K FP G +DG
Sbjct: 135 WRVPHNTISLLIPEVCQAIIKEYRDEMMKCPTTPEEWRAISDKFMEKWNFPRTCGALDGK 194
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V P+ + N G +++ +MA+ +Y+F + GS DA++ N+ +
Sbjct: 195 HVNCKRPSNSGSLYYNYKGFYSVVLMALVDADYRFIWADIGGMGSASDAQIY-NASELKA 253
Query: 182 FDEEGWRPFPD------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKR 229
E G FPD +GD A+ L+ ++ P + ++ + R
Sbjct: 254 CVEGGSLGFPDPDPLPNDNQDMSYFFVGDDAFALRPTMMKPYSLRGLTRPERIYNHPLSR 313
Query: 230 TRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVE 285
RR++EN+FGIL +F L+ M+ P+ ++V A LHN+ T +NQ ++
Sbjct: 314 ARRVVENAFGILANRFQVLLSTMQQQPETVKLIVTACMILHNLLRTQYPGMQNQQLD 370
>gi|414875939|tpg|DAA53070.1| TPA: transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 472
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 116/277 (41%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S E + EE+ A+ T A+ +++V C+ + G P ++
Sbjct: 137 FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 196
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT--FLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + V+WPE + F G P V G + T
Sbjct: 197 GLGISTCHKLVLEVCAAIKAVLMPKAVQWPEAPDAAAAESARFEAASGIPGVVGAMYTTH 256
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + + NR +++ V + F V WPGS+ DA V+
Sbjct: 257 IPIIAPKANVAAYYNRRHTERNQKTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDR 316
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y + G ++G + YPL +WL+ P T + A F R +
Sbjct: 317 SALYLQRGAAGL--LQGQRLVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVAR 374
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
++F LK ++ CL + Q +V+ A LHNI
Sbjct: 375 DAFQRLKSRWGCLQKRTEVKLQDLPVVLGACCVLHNI 411
>gi|226493946|ref|NP_001146875.1| LOC100280483 [Zea mays]
gi|195604564|gb|ACG24112.1| transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 473
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 116/277 (41%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S E + EE+ A+ T A+ +++V C+ + G P ++
Sbjct: 137 FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAAIPVRQRVAVCIWRLATGEPLRLVSKRF 196
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT--FLRKGGFPNVAGCVDGTM 122
G S+ + + EV I + V+WPE + F G P V G + T
Sbjct: 197 GLGISTCHKLVLEVCAAIKAVLMPKAVQWPEAPDAAAAESARFEAASGIPGVVGAMYTTH 256
Query: 123 VLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I AP + + NR +++ V + F V WPGS+ DA V+
Sbjct: 257 IPIIAPKANVAAYYNRRHTERNQKTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDR 316
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S +Y + G ++G + YPL +WL+ P T + A F R +
Sbjct: 317 SALYLQRGAAGL--LQGQRLVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVAR 374
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
++F LK ++ CL + Q +V+ A LHNI
Sbjct: 375 DAFQRLKSRWGCLQKRTEVKLQDLPVVLGACCVLHNI 411
>gi|328701170|ref|XP_003241510.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 175
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 83/163 (50%), Gaps = 9/163 (5%)
Query: 90 MVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM- 147
+ WP EE +I F FPNV G +DGT + I AP +H++ +VNR G H++ +
Sbjct: 12 FITWPSEEKAGEIKNGFFSTCTFPNVLGAIDGTHINIHAPIDHQEAYVNRKGHHSIQLFT 71
Query: 148 ----AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP-DAVILGDSAYP 202
A+C +F A GSVHD +V R S + + + ++ FP + ++GD+AY
Sbjct: 72 DFSQAVCDNKCRFIHCYAGNVGSVHDQQVFRLSEL-KNYLDDATIYFPINTHLIGDAAYT 130
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
L E L+ P N ++F H +E SFG LK +F
Sbjct: 131 LHEHLLVPYHDNGH-LTQKKFNFYHSSASMAVERSFGFLKGQF 172
>gi|302786758|ref|XP_002975150.1| hypothetical protein SELMODRAFT_102485 [Selaginella moellendorffii]
gi|300157309|gb|EFJ23935.1| hypothetical protein SELMODRAFT_102485 [Selaginella moellendorffii]
Length = 433
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 137/327 (41%), Gaps = 32/327 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F AFR+S + I +++ A+ T A+ +++V C+ + G P ++
Sbjct: 110 FHRAFRMSKSTFNIICDQLGLAVAKEDTMLRQAIPVRQRVAVCIWRLATGEPLRLVSKRF 169
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I D + V+WP EE L + F G PN+ G + T +
Sbjct: 170 GLGISTCHKLVLEVCAAINDVLLSKYVQWPTEERLQAVMEEFEAISGIPNLVGAMYTTHI 229
Query: 124 LIDAP---------NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
I AP H D+ N+ +++ + + F ++ WPGS+ D R++
Sbjct: 230 PIIAPKMNVAAYFNKRHTDR--NQKTSYSITLQGVVDIRGIFTDLSIGWPGSMSDERILE 287
Query: 175 NSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
S + ++ + + G YPL WL+ P T++ A F ++
Sbjct: 288 KSNLAQRAADG---LLNGVWVSGGPGYPLLSWLLVPYTQHNLGWAHHAFNEKLSEVLKIA 344
Query: 235 ENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
++FG LK ++ L + Q V+ A + LHN+ E H++ ++
Sbjct: 345 RDAFGRLKGRWKFLQKRTEVKLQELPAVLGACSVLHNVC------------EHHNEGFDE 392
Query: 294 DGEEEAVDD---PDEIDEEEQPRQDRN 317
+ EAVDD PD + Q R+
Sbjct: 393 ELAYEAVDDLMFPDSSSSDPAAVQARD 419
>gi|449689700|ref|XP_004212115.1| PREDICTED: uncharacterized protein LOC101240871 [Hydra
magnipapillata]
Length = 169
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 66/187 (35%), Positives = 96/187 (51%), Gaps = 21/187 (11%)
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEW 206
MA+CGP+ + N PG HD+ V+RNS ++ KF E G PF VIL DSAYP +E
Sbjct: 1 MAVCGPDTYICYASTNSPGRWHDSHVIRNSNLWNKF-ENGEIPFNGNVILADSAYPCREC 59
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSP-QFAGMVVLA 264
LIP +PD +RF +++TR IE FGI+K +F L +RL + A +++
Sbjct: 60 LIPHFPGDPD-GTQKRFNIAYRKTRNTIERCFGIVKNRFYALKTGIRLHKVEDASKLIMC 118
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRL 324
+HN+ L DN N ++G+EE V E + ++N+ R
Sbjct: 119 GFVIHNLCLRYG-DNGND---------FENGDEEHV-------YESLTQANQNDDAFNRR 161
Query: 325 ARLLNYF 331
+LLN F
Sbjct: 162 NQLLNNF 168
>gi|72125597|ref|XP_788419.1| PREDICTED: uncharacterized protein LOC583419 [Strongylocentrotus
purpuratus]
Length = 366
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 123/300 (41%), Gaps = 15/300 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
F+ R++ ++ + P L T AL KV + ++ G YH +
Sbjct: 11 GFRNFTRITPAMFAEMVTRLTPRLQKYDTWYRKALPVGLKVAITMRYLASGDSYHSLMYL 70
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGT 121
++I + +V I + + + P E I TF + F + G +DG
Sbjct: 71 FYVPHNTISLLVLDVCQAIYAEYGEETISNPSTPEGWKGIAQTFSDRWNFHHCLGALDGK 130
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ I AP QF N G +++ ++A+ NYKF V G+ DA++ +
Sbjct: 131 HIRIKAPANCGSQFYNYKGYNSIVLLALVDGNYKFRWVEVGAGGASSDAQIWNTCSLKEA 190
Query: 182 FDEEGWRPFPD-----------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
D+E PD ++ D+A+ L+ L+ P P ++ F R
Sbjct: 191 IDDENIGIPPDEPLPHDDRDIPYFVIADAAFALRTTLMKPFGAKPLTMEERIFNYRLSRA 250
Query: 231 RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
RR +EN+FGIL +F C L+ M P +VLA +HN+ +N ++ DD
Sbjct: 251 RRCVENAFGILANRFRCLLSAMAQVPSTVETIVLACLCIHNLLRDQASAEQNGMMDREDD 310
>gi|241730710|ref|XP_002413838.1| transposase, putative [Ixodes scapularis]
gi|215507654|gb|EEC17146.1| transposase, putative [Ixodes scapularis]
Length = 413
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 129/301 (42%), Gaps = 20/301 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+E FRL + +++ T A+ +++V L+ + + + G
Sbjct: 97 FRENFRLDRSTFRYVVSVCECMRRNNTNMRQAIPLEKRVAIALYRLATSAEDRTVANLFG 156
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S+SS+ E V+V R VK+P N L + F GFP G +DG +
Sbjct: 157 VSRSSVNIIFREFCEVLVQRIEPRFVKFPRPNDLAEHLRQFAAVAGFPQGVGALDGCHLE 216
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P +H+ + N G ++ ++A+ Y+F N PG HD+ V + S +
Sbjct: 217 VCPPKDHDSDYHNYKGWYSTILLAVPDHAYRFLYTNVGSPGRNHDSAVFQRSRLPGVLAG 276
Query: 185 EGWRPFPDA--------VILGDSAYPLKEWLIPP-----RTRNPDAAADQRFLRCHKRTR 231
+ +R V+L D A+PL+ ++ P +P A + R R
Sbjct: 277 DLFRSEAKTFEGISVGPVLLADQAFPLQCHMMKPFPQPGSVGSPTRAFNYRL----SSAR 332
Query: 232 RLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTI-DRDNENQNVEIHDD 289
R++EN+FG LK +F L+ + L + +V A LHNI + DR +N + D
Sbjct: 333 RVVENAFGRLKARFRILHKGLELDIESVNTIVRACCVLHNICEQLNDRCEKNWLDTVQSD 392
Query: 290 E 290
+
Sbjct: 393 D 393
>gi|307166817|gb|EFN60761.1| Putative nuclease HARBI1 [Camponotus floridanus]
Length = 163
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 83/159 (52%), Gaps = 5/159 (3%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DG+ + ID E D ++NR +L++ KF V +PGSVHD RV +N
Sbjct: 1 GAIDGSHIRIDKLTEDLDSYINRKQYFSLHMQGTVDHKMKFIDVFIGYPGSVHDTRVFKN 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
S + E + ++LGDSAYP LKE ++P R A + F + R +I
Sbjct: 61 SPLRNDLHEFCGNNY---LLLGDSAYPCLKELIVPYRDNGHLTHAQRSFNQKLSSCRVII 117
Query: 235 ENSFGILKEKFPCLNYMRLSPQFAGMVVL-ASTTLHNIA 272
EN+FG LK++FP L + +L + V+ A LHNIA
Sbjct: 118 ENAFGYLKQRFPQLYHFKLRDIVRMVYVIHACCVLHNIA 156
>gi|301607557|ref|XP_002933374.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 346
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 122/272 (44%), Gaps = 14/272 (5%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + +L+ + P + T ++SA++++++ L ++ G + + + G S +
Sbjct: 1 MSESCFRSLLQMVGPLIQKEDTVMRKSISAEQRLIATLRFLATGRSFEDLKFSTGISAQA 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ I E IV+T +K+PE E I F FPN G +DG + + P
Sbjct: 61 LGHIIPETCNTIVETLKGEYLKFPETSEEWQVIAQQFNDYWNFPNCGGVIDGKHIRRNPP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE---- 184
F N G ++ ++AI NY+F V+ N G + D V+ + +K
Sbjct: 121 PNSGSYFFNYKGFFSIVLLAIVNANYEFIMVDKN--GRLSDGGVIEQTHFNQKLKSKQLN 178
Query: 185 ----EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
E + + V + D A+ L E L+ P + + F R RR++EN+FGI
Sbjct: 179 LPTNEETKEGLNFVFVADEAFGLHENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGI 238
Query: 241 LKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
L +F + + L P+ MVVL+ LHN
Sbjct: 239 LANRFRIFHTSINLCPEKIDMVVLSCCVLHNF 270
>gi|449691603|ref|XP_004212734.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 170
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 63/113 (55%), Gaps = 4/113 (3%)
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
LN+ + F P + G +DGT V I P EHE +VNR+ +H++NV A C N KF
Sbjct: 26 LNESRVNFYNLAESPRITGLIDGTHVCIQNPREHEYVYVNRSSNHSINVQAGCDYNGKFI 85
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
+ + WPGS H AR++R S + +K + ++LGDS YP WL+ P
Sbjct: 86 DIVSKWPGSTHKARILRESKLCKKMIHGTLK----GILLGDSGYPCFCWLLTP 134
>gi|357125791|ref|XP_003564573.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 137/308 (44%), Gaps = 27/308 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RN------VALSAKEKVLSCLHWMGVGSPY 57
F+ F++S +I I+ T T+ RN L +++V L + G P
Sbjct: 54 FESVFKMSRGTFNYICSLISGDFTRKTQSFRNFRFGDKTILGVEDQVAVALLRLTTGEPL 113
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + D R +KWP E + I F + G PN G
Sbjct: 114 LSIGNRFGMNHSAISNITWKFIESLEDHGI-RHLKWPGPEEMATIKAKFEKLQGLPNCCG 172
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + + +++ +++ + AI P+ +F + + WPGS+ D+ ++R
Sbjct: 173 AIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDPDMRFRDIVSGWPGSLDDSCILRT 232
Query: 176 SMVYRKFDEEGWR----------PFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAADQR 222
S Y K E+G R P V I+GD++YPL WL+ P + + A
Sbjct: 233 SGFY-KLCEKGARLEEQMELPGEPAGSVVREYIIGDASYPLLPWLMTPYQEHDPSPAQVE 291
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
F + H R +++ + LKE++ L P + ++ A L NI + ++
Sbjct: 292 FNKRHTAARMVVQGAMARLKERWQVLKGELWRPDKHRLPRIIYACCLLTNIMIDLEDTQR 351
Query: 281 NQNVEIHD 288
++ H+
Sbjct: 352 DRTPASHN 359
>gi|326528107|dbj|BAJ89105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 130/333 (39%), Gaps = 24/333 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F FR+ + + + A+ T A+ +++V C+ + G P ++
Sbjct: 132 FLREFRMGRATFGMVCDALGAAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRLVSKRF 191
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN---LNDIPMTFLRKGGFPNVAGCVDGT 121
G S+ + + EV I R ++WP+ + +F + G P+V G + T
Sbjct: 192 GIGISTCHKLVLEVCAAIRSILMPRFLRWPDRGSAAAAAVKASFEARSGVPDVVGAMYTT 251
Query: 122 MVLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
+ I AP H + NR +++ + + GP+ F V WPGS+ D +V+
Sbjct: 252 HIPIIAPKVHVASYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSLSDDQVLD 311
Query: 175 NSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
S + ++ + W ++G ++YPL W + P T F
Sbjct: 312 KSALQQRAAAGMMDGSW-------VVGGASYPLTGWCLVPYTHQNLTWTQHAFNEKVGEL 364
Query: 231 RRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHD 288
R++ ++F LK ++ CL + Q +V+ A LHNI + VE+ D
Sbjct: 365 RQVAVDAFSRLKSRWACLQKRTEVKLQDLPVVLGACCVLHNICEARGEAMTPELRVEVQD 424
Query: 289 DEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGT 321
D D + D D+ R GT
Sbjct: 425 DLPVLDNPVRSADAAKARDKMAHNLLHRGLAGT 457
>gi|326514010|dbj|BAJ92155.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 130/333 (39%), Gaps = 24/333 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F FR+ + + + A+ T A+ +++V C+ + G P ++
Sbjct: 132 FLREFRMGRATFGMVCDALGAAVAKEDTALRAAIPVRQRVAVCVWRLATGEPLRLVSKRF 191
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN---LNDIPMTFLRKGGFPNVAGCVDGT 121
G S+ + + EV I R ++WP+ + +F + G P+V G + T
Sbjct: 192 GIGISTCHKLVLEVCAAIRSILMPRFLRWPDRGSAAAAAVKASFEARSGVPDVVGAMYTT 251
Query: 122 MVLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
+ I AP H + NR +++ + + GP+ F V WPGS+ D +V+
Sbjct: 252 HIPIIAPKVHVASYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGWPGSLSDDQVLD 311
Query: 175 NSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
S + ++ + W ++G ++YPL W + P T F
Sbjct: 312 KSALQQRAAAGMMDGSW-------VVGGASYPLTGWCLVPYTHQNLTWTQHAFNEKVGEL 364
Query: 231 RRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-LTIDRDNENQNVEIHD 288
R++ ++F LK ++ CL + Q +V+ A LHNI + VE+ D
Sbjct: 365 RQVAVDAFSRLKSRWACLQKRTEVKLQDLPVVLGACCVLHNICEARGEAMTPELRVEVQD 424
Query: 289 DEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGT 321
D D + D D+ R GT
Sbjct: 425 DLPVLDNPVRSADAAKARDKMAHNLLHRGLAGT 457
>gi|62122827|ref|NP_001014341.1| uncharacterized protein LOC541506 [Danio rerio]
gi|61403162|gb|AAH91804.1| Zgc:113227 [Danio rerio]
Length = 415
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/324 (23%), Positives = 135/324 (41%), Gaps = 27/324 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHI 59
TP F + FR+S E+I + L T +++ K++V L + GS Y
Sbjct: 76 FTPEEFIQNFRVSRESFEYICRRLRHMLERKDTNFRLSVPVKKRVAIALCKLATGSEYRY 135
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCV 118
+ G S++ + + ++ +K+P E L ++ F P G +
Sbjct: 136 VSQLFGVGVSTVFNCVQDFCSAVIKILVPVHMKFPSPEKLKEMADVFENCWNVPQCIGSI 195
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
D + I AP ++ ++NR G H++ + A+ N F+ + + G++ DARV+R S +
Sbjct: 196 DAHHIPIIAPEKNPRGYLNRKGWHSVVLQAVVDGNGLFWDLCVGFSGNLSDARVLRQSYL 255
Query: 179 YRKFDEEGWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAA----DQRFLR 225
+ E ++GDSAYPL+ WL+ P PD + F
Sbjct: 256 WSLLSERDLLNHNKVDISGCDVGYYLIGDSAYPLQNWLMKP---FPDIGGLTPQQESFNS 312
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMR--LSPQFAGMVVLASTTLHNIA------LTIDR 277
R + + SF LK ++ CL + R + + L LHNI + D
Sbjct: 313 RLSSARSVSDLSFKKLKARWQCL-FRRNDCKVELVKKMALTCCVLHNICEEKGTQFSEDH 371
Query: 278 DNENQNVEIHDDEAEDDGEEEAVD 301
++ N++ E ++G+ E D
Sbjct: 372 STDHLNLQPPVQEFIENGQPEGTD 395
>gi|340381732|ref|XP_003389375.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 399
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 110/241 (45%), Gaps = 29/241 (12%)
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F K G G +DG+ + + P + + NR G +++ + A+ NY F +N WP
Sbjct: 172 FKIKFGMIQCLGSIDGSHIPVMPPALNHTDYYNRKGYYSMILQAVVDHNYVFRDINIRWP 231
Query: 165 GSVHDARVMRNSMVY-----RKFDEEGWRPFPDAVI----LGDSAYPLKEWLIPPRTRNP 215
GSVHDARV NS +Y R+ D V+ +GDSAYPL WL+ P N
Sbjct: 232 GSVHDARVFVNSSLYHKAINREILTGNELKIDDKVVPLFLIGDSAYPLSSWLMKPFAHNT 291
Query: 216 DAAADQRFLRCH-KRTRRLIENSFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNI 271
+ +R + ++R ++EN+FG LK ++ L N M + VV A LHN+
Sbjct: 292 ELNDSERKYNYYLSKSRIVVENAFGRLKARWRRLLKRNDMMIDN--VPYVVSACCILHNV 349
Query: 272 ALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYF 331
E+H EA + + VD ++ Q R NN + L++YF
Sbjct: 350 C------------EVH-GEAFIESWMQGVDTSNQ-PTTSQSRTVLNNNAKEIRNTLVDYF 395
Query: 332 R 332
+
Sbjct: 396 K 396
>gi|301632578|ref|XP_002945359.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 399
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 128/288 (44%), Gaps = 21/288 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTPTERNVA---LSAKEKVLSCLHWMGVGSPYHI 59
P F FR+S T + +L + P L+ R++ +S +E++ L ++ G +
Sbjct: 60 PEKFVTFFRMSITSFDELLTILKPGLSR--ARSLMRDPISPEERLCLTLRFLATGQSFSS 117
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCV 118
+ +++I R + E +I + ++ P+EN +I F +K FPN G +
Sbjct: 118 LYFQFLIGRTTIGRIVRETCLLIWSELQRIVMPSPDENTWVEIAEDFHKKTNFPNCLGAL 177
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG + + P ++ N ++ ++A+ NY F ++ GS DA RNS +
Sbjct: 178 DGKHIRVTMPFNSGSKYFNYKKYFSVVLLAVVDANYCFTIIDVGAYGSTGDASAFRNSAL 237
Query: 179 YRKFDE-------------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
R+ E P P V +GD A+ L E ++ P + + + F
Sbjct: 238 GRQLTEGTLRLPLPKPLPGTAAPPMP-YVFVGDEAFGLTENIMRPYPGSQRSVQKRLFNY 296
Query: 226 CHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
R RR++E +FGIL K+ + ++L P+F ++ A LHN
Sbjct: 297 RLSRARRMVECAFGILANKWRVFHTALQLEPEFVDKIIKACCVLHNFV 344
>gi|328701313|ref|XP_001947463.2| PREDICTED: hypothetical protein LOC100164328 [Acyrthosiphon pisum]
Length = 383
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 126/300 (42%), Gaps = 46/300 (15%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR++ + E++ +T T + +KE++ CL ++ G
Sbjct: 65 FYEYFRMTQYTFHVLHEKLENLITKKETHWRKPIPSKERLAVCLRYLATGDS-------- 116
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
RSI + M EE +I F FPN G +DG V
Sbjct: 117 -------MRSISFI-----------MAAPNEEKWKEIASEFWTSWNFPNCLGAIDGKHVT 158
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR---- 180
I AP Q+ ++ ++A+ +Y F AV+ G D ++ +SM+ +
Sbjct: 159 IQAPPNSGSQYFCYKKTFSVVLLALVDAHYNFIAVDVGSYGKNSDGGILNHSMLGKALEQ 218
Query: 181 -KFDEEGWRPFPDA------VILGDSAYPLKEWLIPPR-TRNPDAAADQRFLRCHKRTRR 232
K D P VI+GD A+PLK +L+ P +N D + + R RR
Sbjct: 219 NKLDIPEKAALPGTTNEVPFVIIGDEAFPLKTYLLRPYPGKNLDCNEKRIYNYRICRARR 278
Query: 233 LIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNI------ALTIDRDNENQNVE 285
++EN+FGIL +KF N ++ P+ ++++ + LHN LT RDN+ N +
Sbjct: 279 VVENAFGILSQKFRIYNRRIQAKPENVDVIIITTCILHNFIKKYDGKLTYIRDNQQFNAD 338
>gi|270011972|gb|EFA08420.1| hypothetical protein TcasGA2_TC006067 [Tribolium castaneum]
Length = 384
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 135/301 (44%), Gaps = 21/301 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FRL+ +Q IL I+ +TT T +S +K+ L ++ G + +
Sbjct: 60 FKEYFRLTKSQFFSILSYISEDITTMPSTFVKYPISPCQKLAITLRFLATGESFRSMAFN 119
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTM 122
+ I + + EV+ + + + P + L I F + FPN G +DG
Sbjct: 120 FRIFHNHISKLVREVLKELRNKLLPIYLPKPTTSELKQIAEDFNARWNFPNCCGAIDGKH 179
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ I P + N ++ ++A+ YKF V+ G D+ + + S +Y+
Sbjct: 180 IRIVCPKGSGSLYFNYKSYFSIVLLALIDARYKFTWVDVGAYGKEGDSGIFQRSSIYQLI 239
Query: 183 DEEGWRPFPDA----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK--RT 230
+ P P VI+GD A+ L E ++ P R+ A ++ + ++ R
Sbjct: 240 HNNNFLPEPTLLPRTDIILPHVIVGDEAFKLTENVLRPYPRDQAKADREKKIFNYRLSRA 299
Query: 231 RRLIENSFGILKEKFPCLNYMRLS--PQFAGMVVLASTTLHNI---ALTIDRDNENQNVE 285
RR EN+FGIL + F C+ Y ++ P +V+ S LHN+ +LT + + + ++E
Sbjct: 300 RRTSENAFGILCQTF-CIFYTPIATHPTLVDDIVMVSCCLHNLLRGSLTANHEEGDIDIE 358
Query: 286 I 286
+
Sbjct: 359 M 359
>gi|328724827|ref|XP_003248262.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 394
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 124/303 (40%), Gaps = 14/303 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FR+S + +L ++ + ++S E++ L ++ G+ + + ++
Sbjct: 57 FFNYFRMSIKSFDELLSKLESGIKVSNSGRPSISPTERLCVTLRYLASGNTFTDLQYSYR 116
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S+I + V I + + EE +I + F + FPN G +DG + +
Sbjct: 117 MGISTISGIVEYVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFPNCIGALDGKHIRV 176
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
P + F N +++ +MAI NY F V+ G D+ V +N + K + E
Sbjct: 177 IKPIKSGSLFYNYKHYYSIVLMAISDANYCFTFVDVGAYGKFSDSSVFKNGKFFEKLENE 236
Query: 186 GW-----RPFPD--------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
+P P VIL D A+ + + ++ P R+ + F R RR
Sbjct: 237 TLSIPQPKPLPGDNENGPLPYVILADEAFGVSKTILRPYARSNLNYKKKNFNYRLSRARR 296
Query: 233 LIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
IE+SFGIL K+ + M S + + +V A LHN D + I E
Sbjct: 297 YIESSFGILSNKWRIFHRPMNTSVELSTKIVKACCVLHNFVRVRDGYKFAHTLSIQGFET 356
Query: 292 EDD 294
+D
Sbjct: 357 SND 359
>gi|328697775|ref|XP_001946487.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 412
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/309 (23%), Positives = 136/309 (44%), Gaps = 23/309 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F + FR+S Q ++ I + T+ + +++++ CL ++ G Y +
Sbjct: 61 PEKFHQFFRMSYDQFSYLHNLIKDDIKKQNTQFRAPVPSEQRLAVCLRFLATGESYRSLA 120
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQ-RMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
+ S+ + EV VI T M K +++ +I + FPN G +DG
Sbjct: 121 FSFRLGFSTTREIVEEVCEVIWKTLRPIYMPKPTKDDWQNISREYKEIWNFPNCIGSLDG 180
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ I P + N G +++ ++A+ +Y+F ++ G D V S + +
Sbjct: 181 KHINIQCPINGGSAYFNYKGVNSIVLLALVDAHYRFITIDVGSYGRNSDGNVFAKSALGK 240
Query: 181 KFD------------EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA--ADQRFLRC 226
+ EE P P VI+ D A+PLK +L+ P +R ++ F
Sbjct: 241 ALENDTLDVPPDTPIEENGDPMP-YVIVADEAFPLKPYLMRPYSRVTLGGNEGNKIFNYR 299
Query: 227 HKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA-----LTIDRDNE 280
R RR++EN+FGIL ++ +++ P+ +VLA+ LHN+ +D N
Sbjct: 300 LSRARRVVENAFGILSNRWRVFRTNIQVQPKSVDNIVLAACCLHNMLCQSHDFQLDETNN 359
Query: 281 NQNVEIHDD 289
++ E+ ++
Sbjct: 360 YESTEVLEN 368
>gi|289742715|gb|ADD20105.1| putative transposase [Glossina morsitans morsitans]
Length = 361
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/285 (25%), Positives = 124/285 (43%), Gaps = 33/285 (11%)
Query: 5 AFKEAFRLSATQVEFILEEIAP-----ALTTPTERNVALSAKEKVLSCLHWMGVGSPYHI 59
F + FR+S E+IL +I TP +AL+ L +G S +
Sbjct: 43 VFLQKFRVSKEAFEYILGKIGANDGIRRTYTPPVIRLALT--------LELLGSRSHGRL 94
Query: 60 IGATHGPSKSSICRSIHEVV-PVIVDTFFQRM-----VKWPEENLN-DIPMTFLRKGGFP 112
G S +C ++ + V+V Q + +W + ++N + F ++ G P
Sbjct: 95 AG-------SDLCSTVAQATFSVVVSEMIQELEDKLCTEWIQLDINANTKRWFYKEFGIP 147
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
V GCV GT + ++ E+ F+N +G ++N M +C + K VN +PGS HD+ V
Sbjct: 148 GVIGCVGGTHIYFRKTSQSEETFINPSGKASVNGMVVCDHDMKIIGVNFGFPGSTHDSHV 207
Query: 173 MRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+S R + ++ W R + +LG S YPL+ WL+ P D +++ H + R
Sbjct: 208 WNHSN-ERTYLKDHWNRNESNGWLLGGSGYPLEPWLMTPYKSAVD-PTKRKYNDVHSQAR 265
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFA---GMVVLASTTLHNIAL 273
++ + K ++ R S A G LHNI +
Sbjct: 266 SIVNRCINLYKGRWRIFMEDRKSRYNALKMGKFATVCAALHNICI 310
>gi|322797948|gb|EFZ19802.1| hypothetical protein SINV_05984 [Solenopsis invicta]
Length = 334
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 130/291 (44%), Gaps = 22/291 (7%)
Query: 12 LSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSI 71
+SATQ+E +L I L + + S E++ L ++ G + + ++
Sbjct: 1 MSATQLEELLGLIGCKLQK--QDFIRQSPPERLALTLRYLASGDSMTSMSYQYLVGVTTA 58
Query: 72 CRSIHEVVPVIVDTFFQRMV--KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN 129
IHE VI + F ++ + E++ DI F K F + +DG V+I PN
Sbjct: 59 STIIHETCKVIWEDLFPLVLPNQLQEKDWLDIANDFNEKWNFIHCIEAIDGKHVIIQCPN 118
Query: 130 EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE----- 184
+ N H++ +MAIC NY V+ G +D + ++S + + FDE
Sbjct: 119 NAGSAYFNYKNSHSIVLMAICDANYIIRFVDIGAYGRRNDGGIFKDSAIGKAFDEGRMNI 178
Query: 185 -------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIEN 236
EG P ++GD A+PLK +L+ P +Q R RR+IEN
Sbjct: 179 PQSAAIREG-GPILPYCLVGDEAFPLKSFLLRPYPGRGGLTPEQNIYNYRLSRARRIIEN 237
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
+FGI+ ++ + S + A ++V A+ LHN + R + ++N+ +
Sbjct: 238 TFGIITSQWRIYRKPIIASTENAKLMVQATIYLHN---WLRRQDNDENIYV 285
>gi|390360259|ref|XP_003729666.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 369
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 114/258 (44%), Gaps = 25/258 (9%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRK 108
+G G+ Y + + +SI + EV I + ++M P ++ + F +
Sbjct: 57 LGDGNSYRSLAYDFRVAHNSISTFVPEVCTAIYKEYKEKMFNMPSTQDEWKAVARQFGTR 116
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
F + G +DG + I PN+ + N ++ ++A+ NY F GS
Sbjct: 117 WNFHHCCGAIDGKHIAIKKPNKSGSLYYNYKRFCSVVLLAVVNANYSFLWCKVGANGSSS 176
Query: 169 DARVMRNSMVYRKFDEE--GWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPD 216
DA V S + ++ G+ P PD + I+GD A+PL++WL+ +R
Sbjct: 177 DAGVFNESTLRGALEDNTIGF-PAPDPLPGVNRNFPYFIIGDDAFPLRKWLLKTYSRRAM 235
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA--- 272
++ R RR++EN FGIL ++ CL ++L P VVL TLHN
Sbjct: 236 THKERVMNYRTSRARRVVENGFGILAHRWRCLLITLQLEPGRVIYVVLGCLTLHNRLRKR 295
Query: 273 ------LTIDRDNENQNV 284
L +DR++E+ NV
Sbjct: 296 RPGLQDLNLDREDEDGNV 313
>gi|218188534|gb|EEC70961.1| hypothetical protein OsI_02577 [Oryza sativa Indica Group]
Length = 299
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 124/302 (41%), Gaps = 27/302 (8%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
A+ +++V C+ + G P ++ G S+ + + EV I R ++WP+E
Sbjct: 5 AIPVRQRVAVCIPRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDE 64
Query: 97 NLND-IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNR-------NGDHALNVMA 148
F G P V G + T + I AP + NR +++ +
Sbjct: 65 AAAAAFKERFQAAYGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQG 124
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+ GP+ F V WPGS+ D +V+ SM++++ A ++G ++YPL +W++
Sbjct: 125 VVGPDGAFTDVCIGWPGSMPDDQVLEKSMLHQR---AAAGMMHSACLVGGASYPLMDWVL 181
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTT 267
P T F RR+ ++F LK ++ CL + Q +V+ A
Sbjct: 182 VPYTHQNLTWTQHAFNEKVGDLRRVAVDAFARLKARWACLQKRTEVKLQDLPVVLGACCV 241
Query: 268 LHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDD---PDEIDEEEQPRQDRNNPGTQRL 324
LHNI E +E E + E VDD P+ E ++ R+N L
Sbjct: 242 LHNIC------------ETRGEELEPELRFELVDDETSPETPVRSEAAKRARDNIAHNLL 289
Query: 325 AR 326
R
Sbjct: 290 HR 291
>gi|449459932|ref|XP_004147700.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449513511|ref|XP_004164345.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 392
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 144/332 (43%), Gaps = 33/332 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAK-----EKVLSCLHWMGVGSPYHII 60
F+ F++S +I + + T L+ K ++V L + G I
Sbjct: 56 FESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNI 115
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + G ++SS+ + V + + + WP EE+++ I F + G PN G V
Sbjct: 116 GDSFGLNQSSVSQITWRFVEAMEEKGLHHL-SWPSTEEDMDKIKSKFKKIRGLPNCCGVV 174
Query: 119 DGTMVLIDAPNEHEDQ--FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+ T +++ P +++R + ++ + I P +F + WPGS+ DA V+++S
Sbjct: 175 ETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSS 234
Query: 177 MVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLIPPRTRN--PDAAADQRF 223
+ K ++G R + I+GDS +PL WL+ P PD A+ F
Sbjct: 235 GFF-KLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAE--F 291
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
+ H TR + + + LKE + + + P + ++L LHNI ID ++E
Sbjct: 292 NKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIV--IDMEDEV 349
Query: 282 QN---VEIHDDEAEDDGEEEAVDDPDEIDEEE 310
Q+ + H D + E VD+ I E+
Sbjct: 350 QDEMPLSHHHDPSYRQQSCEFVDNTASISREK 381
>gi|449692104|ref|XP_004212903.1| PREDICTED: uncharacterized protein LOC101239628, partial [Hydra
magnipapillata]
Length = 307
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 122/288 (42%), Gaps = 34/288 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 1 MKECDFKEHFRVNRNPFNFLVNELQPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYR 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCV 118
++ + G KS+ ++E + QR F F G V
Sbjct: 60 VVSSLFGIGKSTANLIVYEFINA------QR---------------FRSYSWFSQCVGAV 98
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM- 177
DG + I AP + + N G +++ A+ Y+F + PG +D +++NS
Sbjct: 99 DGCHIPISAPKDQAISYYNYKGWYSIVRFAVVDCRYRFIFTSVGSPGRNNDCYILQNSSL 158
Query: 178 -------VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKR 229
++ K +E ++GDSA+PL L+ P N + Q+ F +
Sbjct: 159 KAILESNLFDKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPENLVLSEIQKNFNKILCG 218
Query: 230 TRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
RR++EN+FG +K +F + M FA +V A LHNI D
Sbjct: 219 ARRVVENAFGRVKAQFRVICKRMECDINFATRIVNACVILHNICENYD 266
>gi|356547899|ref|XP_003542342.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 432
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 126/297 (42%), Gaps = 16/297 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FR+S + I + + A+T T A+ +++V C+ + G P +
Sbjct: 106 FRRYFRMSKATFDMICQHLDSAVTKKNTMLRQAIPVRQRVAVCIWRLATGDPLREVSKRF 165
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G S+ + + EV I + + WP E + I F G P V G + T +
Sbjct: 166 GLGISTCHKLVLEVCSAIKTVLMPKFLHWPNEAEMKPITQKFESLSGIPYVGGSMYTTHI 225
Query: 124 LIDAPNEHEDQFVNRN-------GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + + N++ +++ V + F V WPGS+ D +V+ S
Sbjct: 226 PIIAPKSNVNAYFNKHHTERNQKTSYSITVQGVVDSKGVFSDVCIGWPGSLSDDQVLEKS 285
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+Y++ D ++G+S +PL + ++ P T F + + + ++
Sbjct: 286 ALYQR---ATMGTLKDVWVVGNSGHPLMDGVLVPYTHQNLTWTQHAFNQKVGEIQSIAKD 342
Query: 237 SFGILKEKFPCLNYMRLSPQFAGMVVL--ASTTLHNIALTIDRDNE-NQNVEIHDDE 290
+F LK ++ CL R + + VL A LHNI D + + + EI DDE
Sbjct: 343 AFARLKGRWSCLQ-KRTEVKLEDLPVLLGACCVLHNICEMRDEEMDPHWKFEIFDDE 398
>gi|219990729|gb|ACL68738.1| MIP02663p [Drosophila melanogaster]
Length = 344
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 104/236 (44%), Gaps = 17/236 (7%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMV--KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
S + + ++ +P++ Q + + + + F K P V C+ GT + I
Sbjct: 112 SMLAKILNRFIPLLQKLLCQATISIQMSRQQMQISSKNFWEKYKLPKVVACLVGTHIGIK 171
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P + F+N+ G ++LNVM +C N + A +A +PGS D+ + S + E
Sbjct: 172 KPAKDCSDFLNKKGYYSLNVMLVCNDNMEIIASDATFPGSCRDSVIWNRS----RARELL 227
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
IL +S Y + +++ P F H + R ++E + +LK +F
Sbjct: 228 SVTLNGHFILANSKYSQESFVLTPYKNAEIGTYQHTFNLRHAQARNMVEQTIEVLKNRFL 287
Query: 247 CLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD---EAEDDGEEE 298
CL ++ P F M+V LHN+ +QN+ I D+ E +D+G+ E
Sbjct: 288 CLQRGLKYEPSFCCMIVNVCCALHNLC-------GSQNLTITDEFQFEGDDNGQRE 336
>gi|115642579|ref|XP_001196714.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 253
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 112/242 (46%), Gaps = 19/242 (7%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
++I + EVV I++ + ++ P E+ D+ + ++ FP+ G +DG V
Sbjct: 10 NTISVVVREVVKAIIEEYTDELLFCPTTEQGWRDLADQWYQRWNFPHTVGAIDGKHVACK 69
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP +F N G +++ + A+ +YKF ++ + GS DA + S ++R D+
Sbjct: 70 APPNSGSEFYNYEGFYSVILFAMVDADYKFTYIDVSGNGSSSDAHIYNESDLHRGLDQNR 129
Query: 187 WRPFPD------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
FP I+GD A+ L+ +L+ P + ++ F R RR++
Sbjct: 130 IHAFPQPDPLPNDNQDVPYFIIGDDAFSLRTYLMKPYSTRNLTHEERIFNYRLSRARRVV 189
Query: 235 ENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
EN+FGIL +F L M+ + +V A LHN+ T +N+ V D A+
Sbjct: 190 ENTFGILANRFQILLTTMQHHHETVRTIVEACCILHNLMRTRYPVLQNRLV----DRAQP 245
Query: 294 DG 295
DG
Sbjct: 246 DG 247
>gi|301118887|ref|XP_002907171.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105683|gb|EEY63735.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 329
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 122/270 (45%), Gaps = 28/270 (10%)
Query: 39 SAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--E 96
S +KV + L+++G Y GA G S+S + EVV V+ +++ +P E
Sbjct: 45 SYNKKVAASLYFLGSSGGYRETGAAMGMSRSYVMEITSEVVRVL-KIVAPQVISFPSNRE 103
Query: 97 NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF 156
N + F K G+P +AG +DG+++ I+ P+ D F R ALN+ AI + F
Sbjct: 104 EWNAVEAGFASKHGYPGIAGAIDGSLIEIERPDNF-DGFYCRKAYPALNMQAIVTSDGFF 162
Query: 157 YAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNP 215
+V+ PGS D++ + S + R P +GD+ Y L WLI P
Sbjct: 163 LSVDVR-PGSWSDSKCWQYSTIGRSVGN----VLPAGKHFIGDAGYALLPWLIVPYCERE 217
Query: 216 DAA----ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA---GMVVLASTTL 268
+ ++F H TR +E +FG K +F L LS + A V+A+ L
Sbjct: 218 EGGRLSQQQKQFNFLHSSTRMAVECTFGRWKGRFRMLQ-CALSQETARRSANFVVATVVL 276
Query: 269 HNIALTIDRDNE---------NQNVEIHDD 289
HN+ + I RD+ N+N DD
Sbjct: 277 HNL-MKIYRDSAQFPLFRERGNENAAADDD 305
>gi|386764417|ref|NP_001245670.1| CG43088 [Drosophila melanogaster]
gi|383293392|gb|AFH07384.1| CG43088 [Drosophila melanogaster]
Length = 362
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 104/236 (44%), Gaps = 17/236 (7%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMV--KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
S + + ++ +P++ Q + + + + F K P V C+ GT + I
Sbjct: 130 SMLAKILNRFIPLLQKLLCQATISIQMSRQQMQISSKNFWEKYKLPKVVACLVGTHIGIK 189
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P + F+N+ G ++LNVM +C N + A +A +PGS D+ + S + E
Sbjct: 190 KPAKDCSDFLNKKGYYSLNVMLVCNDNMEIIASDATFPGSCRDSVIWNRS----RARELL 245
Query: 187 WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
IL +S Y + +++ P F H + R ++E + +LK +F
Sbjct: 246 SVTLNGHFILANSKYSQESFVLTPYKNAEIGTYQHTFNLRHAQARNMVEQTIEVLKNRFL 305
Query: 247 CLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD---EAEDDGEEE 298
CL ++ P F M+V LHN+ +QN+ I D+ E +D+G+ E
Sbjct: 306 CLQRGLKYEPSFCCMIVNVCCALHNLC-------GSQNLTITDEFQFEGDDNGQRE 354
>gi|390343679|ref|XP_787690.2| PREDICTED: uncharacterized protein LOC582653 [Strongylocentrotus
purpuratus]
Length = 341
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 113/268 (42%), Gaps = 14/268 (5%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE- 95
AL KV + ++ G YH + ++I + +V I + + + P
Sbjct: 19 ALPVGLKVAITMRYLASGDSYHSLMYLFYVPHNTISLLVLDVCQAIYAEYGEETISNPST 78
Query: 96 -ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
E DI +F + F + G +DG + I AP QF N G +++ ++A+ NY
Sbjct: 79 PEGWKDIAQSFSDRWNFHHCLGALDGKHIRIKAPANCGSQFYNYKGYNSIVLLALVDGNY 138
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD-----------AVILGDSAYPL 203
KF V G+ A++ + D+E PD ++ D+A+ L
Sbjct: 139 KFRWVEVGAGGASSGAQIWNTCSLKEAIDDENIGIPPDEPLPHDDRDIPYFVIADAAFAL 198
Query: 204 KEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVV 262
+ L+ P P ++ F R RR +EN+FGIL +F C L+ M P +V
Sbjct: 199 RTTLMKPFGAKPLTMEERIFNYRLSRARRCVENAFGILANRFRCLLSAMAQVPSTVETIV 258
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDE 290
LA +HN+ +N ++ DD+
Sbjct: 259 LACLCIHNLLRDQASAEQNGMMDREDDQ 286
>gi|328708835|ref|XP_003243811.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 181
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 65/126 (51%), Gaps = 10/126 (7%)
Query: 104 TFLRKGGFPNVAGCVDGTMVLIDAP--------NEH-EDQFVNRNGDHALNVMAICGPNY 154
TF GFP V GC+D T V I P N H E ++NR H++NV IC
Sbjct: 13 TFYGTTGFPGVIGCIDCTHVAIVPPSTNLNLVENHHPEYLYINRKNYHSINVQLICDSKL 72
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
K VNA +PGS HD + NS V E R D +LGDS YPL++WL+ P N
Sbjct: 73 KILNVNALFPGSTHDNHIWNNSNVLPVVQELHERNLNDYYLLGDSGYPLRQWLLTP-ILN 131
Query: 215 PDAAAD 220
P +AA+
Sbjct: 132 PSSAAE 137
>gi|340382769|ref|XP_003389890.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 421
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 118/279 (42%), Gaps = 14/279 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
+K R+ + E +L ++P + T ++S E++ L ++ G + + +
Sbjct: 62 YKNFLRMDVSSFESLLNLVSPIIRKQDTVMRQSISPAERLAVTLRYLATGETFQSLQYLY 121
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRM-VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+I R + E IV + V +I F FP+ G +DG +
Sbjct: 122 RIPAQTIGRIVPETCGAIVKCLHSFIEVPTTTSKWKEIATGFETYWQFPHCIGALDGKHI 181
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+I P + N ++ ++A+ +YKF VN G + D V NS + +
Sbjct: 182 VIRPPPNSGSYYFNYKHTFSIVLLALVDADYKFTYVNIGCNGRISDGGVNGNSSLCAALE 241
Query: 184 EEGWR---PFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
PFP +I+ D A+PLK ++ P + + F R RR
Sbjct: 242 TNSLNVPLPFPICEDGIPLPYMIVADEAFPLKTYIQKPYAQIGLTKEKRIFNYRLSRARR 301
Query: 233 LIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHN 270
++EN+FGIL +F + +RLSP A +VLA +LHN
Sbjct: 302 IVENAFGILANRFQVFMTPIRLSPDKAETIVLACCSLHN 340
>gi|449687494|ref|XP_002158425.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 128
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 71/119 (59%), Gaps = 3/119 (2%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
LH++G +H++ HG S +++ R IH V I + Q ++KWP + + +P F+
Sbjct: 4 LHFIGTNVFFHVLRDCHGISTNTVYRIIHSVGEAIFN-IRQEIIKWPND-CSTLPQKFME 61
Query: 108 KGGFPNVAGCVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
GGFP+V G +DG+ +LI + PN ED +NR+ H++N MA+CGP+ + + N P
Sbjct: 62 IGGFPSVCGVLDGSHILISNPPNADEDSLINRHHVHSINAMAVCGPDTSIFYASTNSPA 120
>gi|328698759|ref|XP_003240725.1| PREDICTED: hypothetical protein LOC100568546 [Acyrthosiphon pisum]
Length = 1190
Score = 88.2 bits (217), Expect = 4e-15, Method: Composition-based stats.
Identities = 80/305 (26%), Positives = 136/305 (44%), Gaps = 25/305 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F + FR+S Q + +I ++ T+ ++ AKEK+ CL ++ G Y I +
Sbjct: 852 FFKYFRMSRFQFNTLYLKIKNEISKQNTQFRESIPAKEKLGVCLRFLATGDSYQTIAFSF 911
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S++ + EV I+ + +K P+ E+ I F FPN G +DG V
Sbjct: 912 RLGHSTVQGIVIEVCNAIILKLKEECIKTPQKEDWERIANEFWEIWNFPNCIGALDGKHV 971
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+I+AP + N ++ ++A+ YKF V+ G D ++ +S + +
Sbjct: 972 VIEAPPNTGSLYYNYKKTFSIVLLALVDAQYKFTVVDIGAFGKNSDGGILSHSNFGKALE 1031
Query: 184 EEGW-----RPFPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQR---FLRCHKR 229
+ R P VI+GD A+PLK +L+ P P +D + F R
Sbjct: 1032 KNKLHIPNNRALPGTNEKLPYVIIGDEAFPLKNYLLRPYP-GPQMYSDVKKKIFNERLSR 1090
Query: 230 TRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
R+++E++FG L KF ++ P+ A +V+ + LHN I +D+ EIH
Sbjct: 1091 ARKVVEDAFGQLTAKFRIYCRRLKSLPENADKIVMTTCILHNY---IKQDSS----EIHT 1143
Query: 289 DEAED 293
E D
Sbjct: 1144 SEVAD 1148
>gi|270010647|gb|EFA07095.1| hypothetical protein TcasGA2_TC010085 [Tribolium castaneum]
Length = 1007
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 88/175 (50%), Gaps = 4/175 (2%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
AFK+ FRL+ V +++ I + T+RN +LSA+ ++L L + G ++G
Sbjct: 41 AFKKRFRLNKATVMHLVDIIGDRVALKTQRNKSLSAQTQMLIALRFYATGGFLELLGDWI 100
Query: 65 GPSKSSICRSIHEVVPVI--VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTM 122
KS+ICR I V I + + +M + EE L F R GF V G +D T
Sbjct: 101 HVHKSNICRVIQRVTHDIARLSPHYIKMPRMTEE-LMATKRKFFRICGFSRVVGAIDCTH 159
Query: 123 VLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
V I +P + + + NR G ++NV A C + K + + WPG VHD+ ++S
Sbjct: 160 VGIQSPGGANGELYRNRKGYFSINVQATCDADLKLLHIISRWPGLVHDSTKTQSS 214
>gi|331219511|ref|XP_003322432.1| hypothetical protein PGTG_03969 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 387
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 123/281 (43%), Gaps = 15/281 (5%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F E FR+S ++ + + L P R LS + +V L+ + GS Y IG
Sbjct: 53 PARFLEFFRMSVEDFRWLSDSLRDLLQLDPLRRGDPLSVEAQVAVGLYRLAHGSSYLTIG 112
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGGFPNVAGC 117
K + ++ V ++ F + + +P +E + I +F K G PNV G
Sbjct: 113 HVFNIGKETADKAAGRFVIAVLQQFRRVAISYPALANQEQWDKISASFEAKHGIPNVVGA 172
Query: 118 VDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DGT + + P E ++NR ++ + + F V+ PGS+HD R+ R S
Sbjct: 173 IDGTHIPLAVPAEDRWKGYINRKSWASIVFQCVVDGDGNFRNVSGGAPGSMHDGRLFRRS 232
Query: 177 MVYRKFDEEGWRP--FP-DAVILGDSAYP--LKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ P P ++GD+ YP ++ L P P+ ++ F TR
Sbjct: 233 ELGHSITPGSMAPAMIPMGTYLIGDAGYPSNVRVLLPYPSIATPE---NKHFNFIQSLTR 289
Query: 232 RLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNI 271
++E +FG LK +F L + +++ P A A LHN+
Sbjct: 290 IIVEQAFGRLKNQFQILLHPQMARPLRARNNAFACMILHNL 330
>gi|301619233|ref|XP_002939009.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 396
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 122/284 (42%), Gaps = 16/284 (5%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F++ FR+S T + +L + P L+ T T +S +E++ L ++ G + +
Sbjct: 60 PKKFQKHFRMSTTSFDELLTILKPGLSRTQTLMRDPISPEERLCLTLRFLATGQSFSSLY 119
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDG 120
+++I + + E ++ + ++ P+EN I F K FPN AG + G
Sbjct: 120 IQFHIGRTTIGKIVRETCLLLWSELQRIVMPSPDENKWMQIAEDFHEKVNFPNCAGALGG 179
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + P ++ +N L +A NY F ++ S +A +NS + R
Sbjct: 180 RHIRVRMPANRGSKYYKKNSSVVL--LAAADVNYCFSVIDVGSYHSTGNASAFQNSELGR 237
Query: 181 KFDEEGWR------------PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
+ E P V LGD A+ L E +I P + + + F
Sbjct: 238 QLSERTLHLPLPKPLPGTVVPNMPYVFLGDEAFGLAENVIQPYPGSQKSVQKRVFNYRFS 297
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
R +R++E +FGIL K+ L+ +L P F V+ A LHN
Sbjct: 298 RAQRILECAFGILSNKWHVLHTAKLKPDFVSTVIKACCVLHNFV 341
>gi|449666953|ref|XP_004206453.1| PREDICTED: uncharacterized protein LOC101234489 [Hydra
magnipapillata]
Length = 296
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 103/225 (45%), Gaps = 16/225 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ FI+ E+ P L TT T R +S ++V LH++ Y ++ +
Sbjct: 74 FKEHFRVNRNTFSFIVNELHPHLCKTTTTMRE-PISVVKRVAVALHYLASHEEYRVVSSL 132
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ ++E + + D + VK+P EENLN F GF C DG
Sbjct: 133 FGIGKSTTNLIVYEFINAVNDILLPKYVKFPLSEENLNKCSRDFKTILGFTQ---CADGC 189
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM---- 177
+ I AP + + N G +++ + A+ Y+F + P +D+ +++NS
Sbjct: 190 HIPISAPKDQAISYYNYKGWYSIVLFAVVECRYRFIYTSVGLPSKNNDSYILQNSFLKAI 249
Query: 178 ----VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA 218
++ K +E ++GDSA PL L+ +N + +
Sbjct: 250 LESNLFDKCYKELGSSLVPLCLMGDSAVPLTRHLLKTYPKNLELS 294
>gi|242059175|ref|XP_002458733.1| hypothetical protein SORBIDRAFT_03g039260 [Sorghum bicolor]
gi|241930708|gb|EES03853.1| hypothetical protein SORBIDRAFT_03g039260 [Sorghum bicolor]
Length = 400
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 139/321 (43%), Gaps = 28/321 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RN------VALSAKEKVLSCLHWMGVGSPY 57
F+ A ++S +++ I T T+ RN V L +++V L + G
Sbjct: 58 FETALKMSRKTFDYLCCLIKGDCTRKTQSYRNFRFGDKVVLGVEDQVAIALLRLTTGESL 117
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + KWP+ E + I F + G PN G
Sbjct: 118 LGIGTRFGMNHSAISNITWRFIECLEERAASHL-KWPDPEEMAAIKAKFEKIQGLPNCCG 176
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T VL+ + + + +++ +++ + AI P+ +F V + WPGS+ D+ ++R
Sbjct: 177 AIDTTHVLMCSSAQPNSNVWLDGENRNSMVLQAIVDPDMRFRDVVSGWPGSLDDSCILRT 236
Query: 176 SMVYRKFDE----EGWRPFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
S YR + G P I+GD +YPL WL+ P + +A F
Sbjct: 237 SGFYRLCQKGSRLSGQMELPGESAGSIVREYIVGDPSYPLLPWLVTPYQEHGLSAEKVEF 296
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
+CH TR +++ + LK+++ L P + ++ A L NI + ++ +
Sbjct: 297 NKCHTATRMVVQGALANLKDRWRVLKGELWRPDKHRLPRIIYACCLLTNIMIDLEDAVRD 356
Query: 282 QNVEIHDDEAEDDGEEEAVDD 302
H + DDG + V +
Sbjct: 357 GMPASH---SHDDGYRQQVSN 374
>gi|346468107|gb|AEO33898.1| hypothetical protein [Amblyomma maculatum]
Length = 414
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 128/279 (45%), Gaps = 12/279 (4%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
+FK++FR+S T ++I++ P + T A+S ++V L+ + + I
Sbjct: 84 SFKQSFRVSETTFKYIVDSCRPLMERQATNMREAVSIVKRVAVGLYKLCSSAEDRTIAHL 143
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTM 122
+S++ E +V+ ++ VK ++ D F GFP G +DG
Sbjct: 144 FDLGRSTVNTIYREFCETVVEVLEEQWVKMMSAGDIADHIREFQAVCGFPQAVGALDGCH 203
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P H + N G +++ ++A+ Y+F VN G HD+ V+++S + R
Sbjct: 204 LPVSPPKNHASDYYNYKGWYSVILLAVVDHKYRFRYVNVGSXGRCHDSHVLQSSTLPRII 263
Query: 183 DEEGW-RPFPDA-------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
+ + RP +IL D A+PL L+ P R + A+ F ++RR++
Sbjct: 264 EGPSFMRPSLTVGGTDVPPLILCDQAFPLTPNLMKPFPRG-NTEAELTFNYQLSKSRRIV 322
Query: 235 ENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIA 272
EN+FG LK +F + M + ++V A L+NI
Sbjct: 323 ENAFGRLKARFRYVMKRMECKVKNVPVLVRACCVLNNIC 361
>gi|30688449|ref|NP_567834.2| uncharacterized protein [Arabidopsis thaliana]
gi|27754582|gb|AAO22738.1| unknown protein [Arabidopsis thaliana]
gi|28973634|gb|AAO64139.1| unknown protein [Arabidopsis thaliana]
gi|332660275|gb|AEE85675.1| uncharacterized protein [Arabidopsis thaliana]
Length = 540
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 120/280 (42%), Gaps = 18/280 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV----ALSAKEKVLSCLHWMGVGSPYHIIG 61
F+ FR+S + I EE+ TT T++N A+ A ++V C+ + G+P +
Sbjct: 213 FRREFRMSKSTFNLICEELD---TTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVS 269
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDG 120
G S+ + + EV I D + + WP ++ +N F PNV G +
Sbjct: 270 ERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYT 329
Query: 121 TMVLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
T + I AP H + N+ +++ V + + F V PGS+ D +++
Sbjct: 330 THIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQIL 389
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S + R+ G D+ I+G+S +PL ++L+ P TR F + +
Sbjct: 390 EKSSLSRQRAARGM--LRDSWIVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGI 447
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
+F LK ++ CL + Q V+ A LHNI
Sbjct: 448 ATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNIC 487
>gi|188501616|gb|ACD54739.1| PIF/Harbinger-like protein [Adineta vaga]
Length = 411
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 131/294 (44%), Gaps = 16/294 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHI 59
M+ FKE FR+ T ++++I P L T + +++ L+ +G S
Sbjct: 81 MSDHQFKENFRVERTTYACLIDKIGPYLEKLETNYRATIPVDKRIAIALYTLGSSSELRK 140
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGC 117
I G +S+ + E +V+T F + +K+P+ + + + F K +P G
Sbjct: 141 IAHLFGIGRSTTGEILREFCSTLVETLFYQFIKFPKSPDEIKETINGFYDKFNYPMCIGS 200
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
VDGT + I P +E + N H++ + AI + F VN G +D+ + S
Sbjct: 201 VDGTHIAIKPPKGYETDYYNYKKHHSIIMRAIVNSDLLFTYVNIGASGRCNDSSIYNRSS 260
Query: 178 VYRKFDEEGWRPFPDAV--------ILGDSAYPLKEWLIPPRTRNPDAAAD-QRFLRCHK 228
+ + ++ + + + DSA+ L + L+ P P+ + F
Sbjct: 261 LSQVIEDPIYDNHYMMINQIKVRCHFIADSAFSLSKTLMKPFPERPNMQKEYSTFNYRLS 320
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
R R +E +FG LK +F L + ++ + + +V A+T LHN+ + ++ DN+
Sbjct: 321 RARCSVERTFGALKNRFRLL-HKKIEYNLSNITNMVKAATILHNLCI-LNGDND 372
>gi|221122677|ref|XP_002156551.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 119/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQDWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G +DA V+ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAVLNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFNLPKISEFDPKVPRVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|4914402|emb|CAB43653.1| hypothetical protein [Arabidopsis thaliana]
gi|7269877|emb|CAB79736.1| hypothetical protein [Arabidopsis thaliana]
Length = 515
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 120/279 (43%), Gaps = 18/279 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV----ALSAKEKVLSCLHWMGVGSPYHIIG 61
F+ FR+S + I EE+ TT T++N A+ A ++V C+ + G+P +
Sbjct: 188 FRREFRMSKSTFNLICEELD---TTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVS 244
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDG 120
G S+ + + EV I D + + WP ++ +N F PNV G +
Sbjct: 245 ERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYT 304
Query: 121 TMVLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
T + I AP H + N+ +++ V + + F V PGS+ D +++
Sbjct: 305 THIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQIL 364
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S + R+ G D+ I+G+S +PL ++L+ P TR F + +
Sbjct: 365 EKSSLSRQRAARGM--LRDSWIVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGI 422
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
+F LK ++ CL + Q V+ A LHNI
Sbjct: 423 ATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNI 461
>gi|331224673|ref|XP_003325008.1| hypothetical protein PGTG_06545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 385
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 15/281 (5%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F E FR+S ++ + L P R LS + +V L+ +G G Y +G
Sbjct: 51 PTRFIELFRMSIEDFRWLSNNLRDLLQLDPLRRGDPLSVEAQVAVGLYRLGHGCSYVTLG 110
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGGFPNVAGC 117
K + ++ V ++ F + + +P + ++I +F K G PNV G
Sbjct: 111 HVFNIGKETADKAAGRFVIAVLARFRRVAISYPPLDRADQWSEISASFEAKHGIPNVVGA 170
Query: 118 VDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DGT + + P E ++NR ++ + F V+ PGS+HD R+ R S
Sbjct: 171 IDGTHIPVATPAEDRWKGYINRKSWASIVFQCVVDGEGNFRNVSGGAPGSMHDGRLFRRS 230
Query: 177 MVYRKFDEEG-WRPF-PDAV-ILGDSAYP--LKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ RP PD ++GD+ YP ++ L P T P+ ++ F TR
Sbjct: 231 ELGHSITTGTRARPMIPDGTYLIGDAGYPSNVRILLPYPSTATPE---NEHFNYIQSSTR 287
Query: 232 RLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNI 271
++E FG LK +F L + +++ P A A LHN+
Sbjct: 288 IIVEQVFGRLKNRFRILLHAQMARPLRARNNAFACMILHNL 328
>gi|301617821|ref|XP_002938332.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 361
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 124/281 (44%), Gaps = 17/281 (6%)
Query: 8 EAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
EA + S T + +L + P L+ + +S +E++ L ++ G + +
Sbjct: 27 EATQESITSFDELLTILKPGLSRAHSLMRDPISPEERLCLTLRFLATGQSFSSLYFQFLI 86
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDGTMVLI 125
+++I R + E +I + ++ P+EN DI F +K FPN G +DG + +
Sbjct: 87 GRTTIGRIVRETCLLIWSELQRIVMPSPDENTWVDIAEDFHKKTNFPNCLGALDGKHIRV 146
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE- 184
P ++ N ++ ++A+ NY F ++ GS DA RNS + R+ E
Sbjct: 147 TMPFNSGSKYFNYKKYFSVVLLAVVDANYCFTIIDVGAYGSTGDASAFRNSALGRQLTEG 206
Query: 185 ------------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
P P V +GD A+ L E ++ P + + + F R RR
Sbjct: 207 TLRLPLPKPLPGTAAPPMP-YVFVGDEAFGLAENIMRPYPGSQRSVQKRLFNYRLSRARR 265
Query: 233 LIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
++E +FGIL K+ + ++L P+F ++ A LHN
Sbjct: 266 MVECAFGILANKWRVFHTALQLEPEFVDKIIKACCVLHNFV 306
>gi|432891478|ref|XP_004075569.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 303
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 101/225 (44%), Gaps = 9/225 (4%)
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVA 115
Y +I G K+++ + ++ +V + ++K P ++ I F K P V
Sbjct: 19 YKVIANKFGVHKTTVKKFVYSFCKWMVSSVINHLIKVPTAKDARAIARRFEEKFCIPQVV 78
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
GC+D T + + P++ FVNR + + A+ +F+ +N PGS DA V+R
Sbjct: 79 GCIDRTHIPVLPPSDGFKHFVNRKRWPSYVLQAVADDMCRFWNINCQMPGSTADADVLRQ 138
Query: 176 SMVYRKFDEEGWRP------FPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKR 229
S +Y + P + +LGD+AYPL +WLI T +P A+Q + R
Sbjct: 139 SALYNQAHMLPQEPKEISGTSVNLFLLGDAAYPLLDWLISDYTPSPHLTAEQESFNAYLR 198
Query: 230 T-RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIA 272
+ R ++ +FG LK ++ L F V+ LHN
Sbjct: 199 SARTTVDIAFGKLKSRWRVLLRKCDFHYTFIPYVIATCCALHNFC 243
>gi|345497320|ref|XP_001599679.2| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 224
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 85/169 (50%), Gaps = 9/169 (5%)
Query: 110 GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
GFP V G +D T+ I P+EHE+ +VN N DH+LNV + +Y + GS +D
Sbjct: 10 GFPGVIGAIDCTLPKIHTPHEHEEAYVNHNQDHSLNVQVVSDMDYNILNIRIT-NGSSND 68
Query: 170 ARVMRNSMVYRKFDEEGWRPFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQRFL 224
V R+S + + P P + ++GD Y L+ P + + R+
Sbjct: 69 KFVWRHSQMRETMYQLRNNPDPAEINNEYYLIGDGGYTPSPVLLTPCQEAAPGSPEFRYS 128
Query: 225 RCHKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHN 270
+RTR +E +FGI K+ F C+N + +P+F+ +VLA+ L+N
Sbjct: 129 DRVRRTRCRVEQTFGIWKQVFRCINGDRSLHYTPEFSSQIVLATAVLYN 177
>gi|331215241|ref|XP_003320301.1| hypothetical protein PGTG_01213 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 352
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 123/287 (42%), Gaps = 23/287 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHI 59
+ P F E FR+S ++ + + L P R LS + +V L+ +G GS Y
Sbjct: 49 IPPTRFLELFRMSVEDFRWLSDSLRDLLQLDPLRRGDPLSVEAQVAVGLYRLGHGSSYLT 108
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGGFPNVA 115
IG K + ++ V ++ F + +P ++ ++I +F K G PNV
Sbjct: 109 IGHVFNIGKETADKAAGRFVIAVLHRFRLAAISYPPLGRQDQWDEISASFEAKHGIPNVV 168
Query: 116 GCVDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DGT + + P E ++NR ++ + + F V+ PGS+HD R+ R
Sbjct: 169 GAIDGTHIPLAVPAEDRWKGYINRKSWVSIVFQCVVDGDGNFRNVSGGAPGSMHDGRLFR 228
Query: 175 NSMVYRKFDEEGWRPFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKR 229
+ P V ++GD+ YP ++ P P AA ++ F
Sbjct: 229 RPELGHSITAGTMVPAMIPVGTYLIGDAGYPSNVRVLLP---YPSIAAPENEHFNFIQSS 285
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQFAGMV-----VLASTTLHNI 271
TR ++E +FG LK +F L L PQ A V A LHN+
Sbjct: 286 TRIIVEQAFGRLKNRFRIL----LHPQMARPVRARNNAFACMILHNL 328
>gi|21592524|gb|AAM64474.1| unknown [Arabidopsis thaliana]
Length = 515
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 120/279 (43%), Gaps = 18/279 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV----ALSAKEKVLSCLHWMGVGSPYHIIG 61
F+ FR+S + I EE+ TT T++N A+ A ++V C+ + G+P +
Sbjct: 188 FRREFRMSKSTFNLICEELD---TTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVS 244
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDG 120
G S+ + + EV I D + + WP ++ +N F PNV G +
Sbjct: 245 ERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYT 304
Query: 121 TMVLIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
T + I AP H + N+ +++ V + + F V PGS+ D +++
Sbjct: 305 THIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQIL 364
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S + R+ G D+ I+G+S +PL ++L+ P TR F + +
Sbjct: 365 EKSSLSRQRAARGM--LRDSWIVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGI 422
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
+F LK ++ CL + Q V+ A LHNI
Sbjct: 423 ATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNI 461
>gi|331226603|ref|XP_003325971.1| hypothetical protein PGTG_07801 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 413
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 15/281 (5%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F E FR+S ++ + L P R LS + +V L+ +G G Y +G
Sbjct: 51 PTRFIELFRMSIEDFRWLSNNLRDLLQLDPLRRGDPLSVEAQVAVGLYRLGHGCSYVTLG 110
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGGFPNVAGC 117
K + ++ V ++ F + + +P + ++I +F K G PNV G
Sbjct: 111 HVFNIGKETADKAAGRFVIAVLARFRRVAISYPPLDRADQWSEISASFEAKHGIPNVVGA 170
Query: 118 VDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DGT + + P E ++NR ++ + F V+ PGS+HD R+ R S
Sbjct: 171 IDGTHIPVATPAEDRWKGYINRKSWASIVFQCVVDGEGNFRNVSGGAPGSMHDGRLFRRS 230
Query: 177 MVYRKFDEEG-WRP-FPDAV-ILGDSAYP--LKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ RP PD ++GD+ YP ++ L P T P+ ++ F TR
Sbjct: 231 ELGHSITTGTRARPMIPDGTYLIGDAGYPSNVRILLPYPSTATPE---NEHFNYIQSSTR 287
Query: 232 RLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNI 271
++E FG LK +F L + +++ P A A LHN+
Sbjct: 288 IIVEQVFGRLKNRFRILLHAQMARPLRARNNAFACMILHNL 328
>gi|348510655|ref|XP_003442860.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 438
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 145/336 (43%), Gaps = 27/336 (8%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P + + FR++ ++ +++ P L T +AL +++V L + Y I
Sbjct: 92 PSDWLDKFRMNRETFFYLCDKLRPRLARQNTSFRLALPVEKRVAVALWRLASNIEYRTIS 151
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDG 120
A G KS++CR + ++ IV ++ P E+ L D FL GFP+ +
Sbjct: 152 ALFGVGKSTVCRCVRDMCHAIVALLSSIYLRSPGEQELEDSAQLFLSHWGFPHCVAAIAT 211
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVM---AICGPNYKFYAVNANWPGSVHDARVMRNSM 177
I P+ + + N G L+VM A+ G + F+ V A++PG A +++NS
Sbjct: 212 LHTAIITPSNNASDYANPAG--WLSVMSQVAVSGQGH-FWDVCASFPGGTDPAEILQNSS 268
Query: 178 VYRKFDEEGWRPFP---------DAVILGDSAYPLKEWLIPP------RTRNPDAAADQR 222
++ E G P P V+LG++ YPL+ WL+ RT + A +Q+
Sbjct: 269 LWATAAEGGLSPAPPPTFMGKSLRYVLLGEACYPLQSWLMKAYPEEQGRTASRTALTEQQ 328
Query: 223 FLRCHK--RTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI-DRD 278
L + R R+ + + L+ ++ CL+ ++LA LHN+ + D
Sbjct: 329 QLFNGRLARALRVSQEALLRLRARWQCLSKRNDCGLDVVPTMILACCILHNMCESHGDAF 388
Query: 279 NENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQ 314
VE+ + E+ ++ + E+ RQ
Sbjct: 389 KAEWQVEVAEAESPQPSHKQLLSTSMNQSNAEEVRQ 424
>gi|357454103|ref|XP_003597332.1| hypothetical protein MTR_2g096450 [Medicago truncatula]
gi|355486380|gb|AES67583.1| hypothetical protein MTR_2g096450 [Medicago truncatula]
Length = 388
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/319 (22%), Positives = 139/319 (43%), Gaps = 26/319 (8%)
Query: 12 LSATQVEFILEEIAPALTTPTERNVALSAK-----EKVLSCLHWMGVGSPYHIIGATHGP 66
+S E+I + +TT + + K ++V L +G G +G + G
Sbjct: 54 ISRKTFEYICSLVKDDMTTKSAHLTFTNGKPMSLFDQVAVALRRLGSGDSLVTVGDSFGL 113
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+ S++ + V + D + +WP +E +N I F + GFPN G VD T +
Sbjct: 114 TYSTVSQVTWRFVESMEDRGLHHL-QWPSTQEEMNAIKSKFEKIQGFPNCCGAVDATHIT 172
Query: 125 IDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ P + D +++ +H++ + AI P+ KF + WPG + D + +S + K
Sbjct: 173 MCLPATDHSSDVWLDHRNNHSMVLQAIVDPDMKFRDIVTGWPGKLEDWSIFESSN-FNKL 231
Query: 183 DEEGWRPFPDAV-----------ILGDSAYPLKEWLIPPRTRNP--DAAADQRFLRCHKR 229
++G R + + I+GDS YPL +L+ P ++ A F + H
Sbjct: 232 CDKGERLNGNKLKLSEGSEIREYIIGDSGYPLLPYLVVPYKEKELLESEAKTHFNKLHLA 291
Query: 230 TRRLIENSFGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
TR + + + LKE + + N R ++L LHNI + + + +++ + ++
Sbjct: 292 TRMVAQRALTRLKEMWRIIRGNMWRPDKHRLPRIILVCCLLHNIVIDMQDEVKDELLYLY 351
Query: 288 DDEAEDDGEEEAVDDPDEI 306
+ + + + DE+
Sbjct: 352 HNNHDSGYHQLICEGVDEM 370
>gi|328723564|ref|XP_003247874.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 194
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 76/149 (51%), Gaps = 3/149 (2%)
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD 192
+ F NR ++ + I + KF V + WPGS HDARV R S++ +K P+
Sbjct: 7 NMFYNRKQVPSVLLQGIVDSDLKFIDVFSGWPGSSHDARVFRRSLIGQKLLSNDLSILPE 66
Query: 193 AV-ILGDSAYPLKE-WLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY 250
ILGD AYPL E +IP R A + F RC +R ++E +FG L +F L +
Sbjct: 67 NCHILGDGAYPLSENVMIPYRDNGNLTLAQKHFNRCLSSSRVVVEQAFGKLYCRFRKLKH 126
Query: 251 MRL-SPQFAGMVVLASTTLHNIALTIDRD 278
M + G+++ A+ LHNI + + D
Sbjct: 127 MDVYHKSLCGLIITAACCLHNICIDMQDD 155
>gi|221105453|ref|XP_002166626.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 370
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 120/280 (42%), Gaps = 18/280 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT PT ++ ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPTRDSIC--RLQRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDKNWTFPQCIGAIDGKHVHIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N +++ ++AIC Y F V+ G +DA ++ S R F+ +G
Sbjct: 119 APAKSGFSFYNYKEFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFSLPKISEFDPKVPLVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTID 276
FGIL ++ Y ++ P ++ A+ LHN + D
Sbjct: 238 FGILAARWRIYRYPIKAKPFKVEHIIKATVCLHNYSRLTD 277
>gi|449674891|ref|XP_004208281.1| PREDICTED: uncharacterized protein LOC101238149, partial [Hydra
magnipapillata]
Length = 354
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 114/263 (43%), Gaps = 35/263 (13%)
Query: 23 EIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVI 82
+ A TT T R +S ++V LH++ Y ++ + G K++ +HE + +
Sbjct: 70 QAAKGKTTTTMRE-PISEVKRVTVALHYLASWEEYRVVSSLFGIGKTTANIIVHEFINAV 128
Query: 83 VDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNG 140
+ + VK+P ENLN F GFP VDG + I AP + + N G
Sbjct: 129 NNILLPKYVKFPLSVENLNKHSRGFEAILGFPQCVEAVDGCHIPISAPKDQAISYYNYKG 188
Query: 141 DHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM--------VYRKFDEEGWRPFPD 192
PG +D+ +++NS ++ K +E
Sbjct: 189 --------------------VGSPGRNNDSYILQNSSLKAILELNLFDKCCKELGDSLVP 228
Query: 193 AVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCLNYM 251
++GDSA+PL L+ P N + + Q+ F + RR++EN+FG +K +F C+
Sbjct: 229 LCLIGDSAFPLTHHLLKPYPENLELSEIQKNFNKILCGARRVVENAFGRVKARF-CVTCK 287
Query: 252 RLSP--QFAGMVVLASTTLHNIA 272
R+ FA +V A TLHNI
Sbjct: 288 RMECDINFATRIVNACVTLHNIC 310
>gi|221114951|ref|XP_002160191.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 319
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 119/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLASGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P+ V +GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFNLPNISEFDPKVPPVFVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|291230838|ref|XP_002735372.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 426
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 119/266 (44%), Gaps = 24/266 (9%)
Query: 43 KVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLND 100
K+ L ++ G+ Y + + ++I I EV I+ + + P + +
Sbjct: 117 KLAITLRYLATGNSYKTLQYAFRVAHNTISLFIPEVCQTIISEYQDEVFSCPITPDEWRE 176
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+ T+ + F +V G +DG V I P + N G ++L ++A+ NYKF +
Sbjct: 177 VARTYADRWNFHHVCGALDGKHVAIRNPPGSGTIYYNYKGFYSLILLALVDGNYKFLWAD 236
Query: 161 ANWPGSVHDARVMRNSMVYRKFDE------------EGWRPFPDAVILGDSAYPLKEWLI 208
PGS D +V +S + R + + R P ++GD A+P + W+
Sbjct: 237 VGNPGSSLDVQVFNHSPLRRGLENGTLGLPDPEPLPDDDRDIP-YFLIGDDAFPFRTWMQ 295
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTL 268
P + ++ F R R++ENSFGIL ++ CL + A V++A L
Sbjct: 296 KPYSNREQTDEERIFNYRLSRACRVVENSFGILAHRWRCL------LKKARTVIMACMCL 349
Query: 269 HNIALTIDRDNENQNVEI-HDDEAED 293
HN L DR QN++I H+DE ++
Sbjct: 350 HN--LMRDRFPGIQNIDIDHEDELDN 373
>gi|224098298|ref|XP_002311148.1| predicted protein [Populus trichocarpa]
gi|222850968|gb|EEE88515.1| predicted protein [Populus trichocarpa]
Length = 407
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 122/265 (46%), Gaps = 24/265 (9%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
LS ++V L + G IG G ++S++ + V + + + WP E
Sbjct: 104 LSVTDQVAVALRRLSSGESLSNIGDLLGINQSTVSQITWRFVEAMEERGLHHLC-WPSTE 162
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPN 153
+ +I F + G PN G +D T +++ P + D +++R +H++ + AI P+
Sbjct: 163 AEMEEIKSNFEKICGLPNCCGSIDTTHIVMTLPTVDRSNDVWIDREKNHSMLLQAIVDPD 222
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYP 202
+F V +PGS+ DA V++NS Y+ +EE R + I+GDS +P
Sbjct: 223 MRFRDVIVGYPGSLSDALVLQNSSFYKLCEEEK-RLNGKKIELQEGMELGEYIIGDSGFP 281
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-- 260
L WL+ P +N + F + H T+ + + + LKE + ++ + P +
Sbjct: 282 LLSWLLTP-YQNALSDHQAEFNKRHSETQVVAQIALARLKEMWRIIHGVMWLPDKNRLPR 340
Query: 261 VVLASTTLHNIALTIDRDNENQNVE 285
++ LHNI + D E++ VE
Sbjct: 341 IIFVCCLLHNIVI----DMEDKGVE 361
>gi|294461638|gb|ADE76379.1| unknown [Picea sitchensis]
Length = 404
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 137/298 (45%), Gaps = 33/298 (11%)
Query: 6 FKEAFRLSATQVEFI-------LEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
FK F +S ++I LE P+ +E + L+ +++V L + G
Sbjct: 66 FKSTFGVSRKTFDYICFLVRQDLESKPPSGLINSEGRL-LTVEKQVGIALRRLASGESQV 124
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKWPEEN-LNDIPMTFLRKGGFPNV 114
+G + G +S++ +V +++ +R +KWP+ N + +I + F G PN
Sbjct: 125 SVGESFGVGQSTVS----QVTWRFIESMEERARHHLKWPDANQMEEIKVKFETIQGLPNC 180
Query: 115 AGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G +D T +++ P+ + ++ + ++++ + I + +F + PGS++D+R++
Sbjct: 181 CGAIDATHIIMTLPSIDSSMEWHDCESNYSMVLQGIVDADMRFRDIVTGLPGSMNDSRLL 240
Query: 174 RNSMVYRKFDEEGWR------PFPDA-----VILGDSAYPLKEWLIPPRTRNPDAAADQR 222
R+S +R E G R P I+GD+ YPL WLI P A++
Sbjct: 241 RSSGFFR-LCEHGERLNGPVKELPQGSQIREYIVGDTGYPLLSWLITPYQGKDLASSRVE 299
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRD 278
F H T ++ E +F LK + LN + P + ++L LHNI IDRD
Sbjct: 300 FNSKHHATSQIAECAFSRLKGTWRFLNNVIWRPDKHKLPIIILVCCLLHNI--IIDRD 355
>gi|224062361|ref|XP_002300822.1| predicted protein [Populus trichocarpa]
gi|222842548|gb|EEE80095.1| predicted protein [Populus trichocarpa]
Length = 397
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 141/336 (41%), Gaps = 24/336 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
FK FR+S E+I + L + + LS +++V L + G
Sbjct: 63 GFKYFFRISKKTFEYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 122
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+GA+ G +S++ + + + + +R +KWP+ + + +I F G PN G
Sbjct: 123 SVGASFGVGQSTVSQVTWRFIEALEERA-KRHLKWPDSDRMEEIKCKFETLFGLPNCCGA 181
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E D + +R ++++ + I +F + WPG + R+++ S
Sbjct: 182 IDATHIIMTLPAVETSDDWCDREKNYSMFLQGIVDHEMRFLDIVTGWPGGMTVPRLLKCS 241
Query: 177 MVYRKFDEEGWR-------PFPDA----VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E G R F A I+G YPL WLI P + + F
Sbjct: 242 GFF-KLCENGERLNGNSRNLFQGAEIREYIVGGVGYPLLPWLITPEVSEGISGSMSTFNA 300
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTI-DRDNENQ 282
H+ R L SF LK + L+ + P + ++L LHNI + D+ + +
Sbjct: 301 RHEAARLLAVRSFMHLKGSWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDCGDQLHPDV 360
Query: 283 NVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNN 318
+ H D + + VD +I E + ++N
Sbjct: 361 ALSGHHDSGYGEQCCKQVDPSGKITRENLAKHLQHN 396
>gi|221122055|ref|XP_002167152.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 118/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICSS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQYWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G DA V+ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDDDAAVLNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR I+NS
Sbjct: 178 YFNLPKISEFDPKVPPVLVGDDVFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIKNS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|115953196|ref|XP_785740.2| PREDICTED: uncharacterized protein LOC580596 [Strongylocentrotus
purpuratus]
Length = 397
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 116/263 (44%), Gaps = 19/263 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P AF+ R++ + ILE +AP + T LSA K+ L + G Y +
Sbjct: 74 PVAFRNFTRVTPEVFDEILERVAPVIQKQETNYRHPLSAGLKLAITLRHLATGDNYRSLA 133
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVD 119
S+I I EV IV+ + + P E + + F ++ P+ G +D
Sbjct: 134 YGFRCGISTISEMIPEVYRAIVEGYKDEVFNIPTTPEAWSTLAQQFEQRWNVPHAIGALD 193
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
G ++I P + N G ++ ++A+ YKF + G + D+++ +S ++
Sbjct: 194 GKHIVIKKPANTGSLYYNYKGFFSIPLLALVDAEYKFIWIELGGKGHMSDSQIFTDSELF 253
Query: 180 RKFDEEGW------------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
++ +P ILGD A+ LK +++ P +R D+ + +
Sbjct: 254 ECLEDGSIGLPPPCHLPGENQPDIPYFILGDDAFALKSYMMKPYSRR--GMTDEHRICNY 311
Query: 228 K--RTRRLIENSFGILKEKFPCL 248
+ R RR++EN+FGIL +F CL
Sbjct: 312 RISRGRRVVENAFGILANRFRCL 334
>gi|356550247|ref|XP_003543499.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 393
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 143/333 (42%), Gaps = 29/333 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-----ALSAKEKVLSCLHWMGVGSPYHII 60
F+ F++S E+I + + T V +S +E+V L +G G I
Sbjct: 57 FESVFKISRKTFEYICSLVKDDMMTKAAHFVFTNGKPISLREQVAVALRRLGSGESLVTI 116
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + G + S++ + V + + + +WP E +N I F + G PN G +
Sbjct: 117 GDSFGLNHSTVSQVTWRFVESMEERGLHHL-RWPSTEVEMNAIKSKFEKIRGLPNCCGVI 175
Query: 119 DGTMVLIDAPNE--HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T + + P + + +++ +H++ + AI P+ +F + WPG + D + +S
Sbjct: 176 DATNITMCLPASEPYCNVWLDHKKNHSMVLQAIVDPDMRFRDIVTGWPGKMEDWLIFESS 235
Query: 177 MVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E+G R + ILGDS Y L +L+ P + +F +
Sbjct: 236 N-FNKLCEKGERLNGSKLQLSEGSEIREYILGDSGYRLLPYLVVPYEGKELSEPKAQFNK 294
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
H TR + + + LKE + ++ M P + ++L LHNI ID ++E Q+
Sbjct: 295 QHLATRMVAQRALARLKEMWRIIHGMMWRPDKHRLPRIILVCCVLHNI--VIDLNDEVQD 352
Query: 284 VEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDR 316
E+ D G V +DE+ +DR
Sbjct: 353 -ELSLSLDHDSGYHRLV--CGAVDEKGVALRDR 382
>gi|390359402|ref|XP_003729473.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 336
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 123/288 (42%), Gaps = 34/288 (11%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
+L + P +T L K+ L ++ G+ Y + + +SI + EV
Sbjct: 9 LLLRVTPRITKCPRSRRPLEPGMKLAITLRFLVRGNSYRSLAYDFRVAHNSISTFVPEVC 68
Query: 80 PVIVDTFFQRM--VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVN 137
I + ++M + + ++ + F + F + G +DG + I N +F
Sbjct: 69 TAIYKEYKEKMFNMAYTQDEWKAVARQFGTRWKFHHCCGAIDGKHIAIKKTNYK--RFC- 125
Query: 138 RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF-DEEGWRPFPDAV-- 194
++ ++AI NY F GS DA V S + D P PD +
Sbjct: 126 -----SVVLLAIVDANYSFLWCKVGANGSSSDAGVFNKSTLRGALEDNTTGFPVPDPLPG 180
Query: 195 --------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP 246
I+GD A+PL++WL+ P +R A Q+ + RT R++EN FGIL ++
Sbjct: 181 DDRDFPYFIVGDDAFPLRKWLLKPYSRR---AMTQKERVMNYRTHRVVENGFGILAHRWR 237
Query: 247 C-LNYMRLSPQFAGMVVLASTTLHNIA---------LTIDRDNENQNV 284
C L ++L P VVL TLHN L +DR++E+ NV
Sbjct: 238 CLLTTLQLEPGRVIYVVLGCLTLHNRLRKRRPGLQDLNLDREDEDGNV 285
>gi|449669963|ref|XP_004207158.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 315
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 119/273 (43%), Gaps = 16/273 (5%)
Query: 12 LSATQVEFILEEIAPALTTP--TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+S ++E +L I +LT + R+ ++ ++++ + ++ G ++
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCSSRD-SICPSQRLIITIRYLATGESQQTQSFYFRVGRA 59
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
++C I E I + ++ P + +I F + FP G +DG V I+A
Sbjct: 60 TVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQDWNFPQCIGAIDGKHVRIEA 119
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P + F N G +++ ++AIC Y F V+ G +DA V+ S R F+ +G+
Sbjct: 120 PAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAVLNASTFGRAFN-KGY 178
Query: 188 RPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
P V++GD + LK WL+ P + F R RR IENSF
Sbjct: 179 FNLPKISEFDPKVPPVLVGDYIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSF 238
Query: 239 GILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
GIL ++ ++ P ++ A+ LHN
Sbjct: 239 GILAVRWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|115692227|ref|XP_001195901.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 239
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 95/215 (44%), Gaps = 13/215 (6%)
Query: 43 KVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLND 100
K+ L ++ G+ Y + + KS+I R I EV +++ + + ++ P E+ D
Sbjct: 25 KLAVTLRFLATGNSYTTLQYSFRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTEDGWKD 84
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+ F + + N G VDG V I P + + N G H++ +MA+ YKF V+
Sbjct: 85 VAKKFSSRWNYHNCLGAVDGKHVAIRKPPKSGSLYFNYKGFHSIVLMAVADAEYKFLYVD 144
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGW-----RPFPD------AVILGDSAYPLKEWLIP 209
G D +N +Y + E +P P+ + D A+ +K WL+
Sbjct: 145 VGAEGGSSDGGTWKNCNLYDAIEGERAGVPPPKPMPNDDMPIPFQFVADDAFAIKPWLMK 204
Query: 210 PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
P + + F R RR++ENSFGIL +
Sbjct: 205 PFSHRSQVHKEIIFSYRLSRARRVVENSFGILAHR 239
>gi|403171723|ref|XP_003889412.1| hypothetical protein PGTG_21889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169310|gb|EHS63851.1| hypothetical protein PGTG_21889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 117/267 (43%), Gaps = 30/267 (11%)
Query: 6 FKEAFRLSATQVEFILEEI---------APALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+A R + T ++LE+I +P P +AL+ L L G G+
Sbjct: 105 FKQAVRTTKTGFTWLLEQIYFNPIFYSNSPRPQLPVPHQLALT-----LERLGSNGNGAS 159
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+ ++ ++ V+ I D QR + WP+ + D ++ GF G
Sbjct: 160 VGRFSRNLSVGRGTVIKASRRVIRAIND-LSQRYLLWPDTDRRDEISKVMKAEGFEGCIG 218
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P+ + F +R +++N IC + A WPGS D+ V +
Sbjct: 219 FVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDQFITAYMTGWPGSCGDSMVFKR 278
Query: 176 SMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKR 229
MV+++ FD P ++ DSAY L IP + P A + F C R
Sbjct: 279 MMVHKEPALFFD-------PGQYLIADSAYELGLHCIPA-YKAPAAYIQENTEFNYCLAR 330
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQ 256
+R E++ GILK ++ L ++RL+ Q
Sbjct: 331 SRVRNEHTIGILKGRWASLQHLRLAIQ 357
>gi|77552491|gb|ABA95288.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 916
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 104/232 (44%), Gaps = 29/232 (12%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 689 NCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 748
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--------RTRNPDA---AADQ 221
++ + +F + +P P L DS YP + + P + N +
Sbjct: 749 FNDA--HTRFSAKFPKPPPGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGKRE 806
Query: 222 RFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALTIDRDNE 280
F CH R +IE SFG+LK K+ L + S + ++ A LHN RD++
Sbjct: 807 HFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI----RDSQ 862
Query: 281 NQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
+ E D + D ++ D + P + P + +R++N FR
Sbjct: 863 MADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 903
>gi|449677010|ref|XP_004208757.1| PREDICTED: uncharacterized protein LOC101236952 [Hydra
magnipapillata]
Length = 180
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 87/181 (48%), Gaps = 5/181 (2%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
M FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y
Sbjct: 1 MEECVFKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYR 59
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++ + G KS+ +HE + + D + VK+P ENLN F GFP G
Sbjct: 60 VVSSLFGIGKSTANLIVHEFINAVNDILLSKYVKFPLPVENLNKHSRDFEAILGFPQCVG 119
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I AP + + N G +++ + ++ Y+F + PG +D+ +++N
Sbjct: 120 AVDGCHIPILAPKDQAISYYNYKGWYSIVLFSVVDCQYRFIYTSVGLPGRNNDSYILQNL 179
Query: 177 M 177
+
Sbjct: 180 L 180
>gi|449675115|ref|XP_004208330.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 141
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 76/140 (54%), Gaps = 10/140 (7%)
Query: 43 KVLSCLHWMGVGSPYHIIG----ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EE 96
++L L + GS ++G A P+ S I R + + ++ + +K+P +
Sbjct: 6 QLLCALRFYATGSFQIVVGDSTAALSQPTISGIIRRVSLSLAKRINEY----IKYPTNQH 61
Query: 97 NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF 156
LN+ + F + FP V G ++ T + I P+EHE +V+R+ +H++NV A+C KF
Sbjct: 62 VLNESRVKFYKIAEFPKVTGVINCTHICIQKPHEHEYAYVDRSSNHSINVQAVCDNKGKF 121
Query: 157 YAVNANWPGSVHDARVMRNS 176
V A WPGS HDAR++R S
Sbjct: 122 IDVVAKWPGSTHDARILRES 141
>gi|328717980|ref|XP_001947852.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 362
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 118/281 (41%), Gaps = 22/281 (7%)
Query: 49 HWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLR 107
+ G + IG + S++ + EV I+ + P N+ + F
Sbjct: 29 RYFATGDSFSTIGHSFRVGFSTVSNIVTEVSESIIKVMGPMFMPEPTTNIWEESARGFYE 88
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
K FPN G +DG V I +PN + ++ +MAI P+YKF V+ G
Sbjct: 89 KWQFPNCIGSIDGKHVTIKSPNNSGSRNFCYLKKFSIVLMAIVDPDYKFICVDVGGYGRN 148
Query: 168 HDARVMRNSMVYRKFDEEGWR-----PFPDA------VILGDSAYPLKEWLIPP---RTR 213
D ++ S + ++ + P P V++GD A+ LK +L+ P R
Sbjct: 149 SDGGILEESTMGKRLEAGTLNVPQNAPLPGQGEDTPMVLIGDEAFALKPYLMKPFPRRLS 208
Query: 214 NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIA 272
D D R R RR++EN+FGIL +K+ + + + +VLA+ LHN
Sbjct: 209 RSDTRLDNYNYRLC-RARRVVENAFGILTKKWRVYKGPIEVKEETTKKIVLATCILHNY- 266
Query: 273 LTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPR 313
L ++ N +V+I DE E ++D + E R
Sbjct: 267 LRVN----NSDVDITSDEDESSSGFSQIEDNNRRGTNESLR 303
>gi|72106786|ref|XP_796408.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 483
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 126/287 (43%), Gaps = 20/287 (6%)
Query: 2 TPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
+P FK R+ +++ + P + + L K+ L ++ G+ + +
Sbjct: 124 SPADFKSFLRVEPDLFLELVQRVGPRIEKSKKGRPPLQPGLKLAITLRFLATGNSFRSLE 183
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVD 119
+ + ++I I V IVD + Q++ K P + + F + FP+V G VD
Sbjct: 184 FSFRVAHNTISIFIPVVCQAIVDEYRQKVFKTPSTPDEWRRVAQVFQDRWNFPHVCGAVD 243
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
G V P + G +++ ++ + YK + GS D + S +
Sbjct: 244 GKHVATRKPAHSGSTYYTYKGYYSIVILVLADGEYKALWADVGSQGSDSDCGIFNRSGLL 303
Query: 180 RKF--DEEGW---RPFPD------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH- 227
R G+ P P+ +LGD+A+PL+E+++ P ++ ++ + C+
Sbjct: 304 RSLITGTIGFPPPEPLPNDDRDSGFFLLGDNAFPLREFMLKPFSKR---YLNREEMVCNY 360
Query: 228 --KRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNI 271
R RR++EN FGI+ ++F C L + + P+ A + A TLHN+
Sbjct: 361 RLSRARRVVENLFGIMAKRFRCLLTTLDVEPERAMTISNACITLHNL 407
>gi|215740538|dbj|BAG97194.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 114/281 (40%), Gaps = 22/281 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S E + +E+A A+ T A+ +++V C+ + G P ++
Sbjct: 125 FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAAIPVRKRVAVCVWRLATGEPLRLVSKRF 184
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT-FLRKGGFPNVAGCVDGTMV 123
G S+ + + EV + ++V+WPE F G V G + T +
Sbjct: 185 GLGISTCHKLVLEVCAALKAMVMPKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHI 244
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + NR +++ V + F V WPGS D V+ S
Sbjct: 245 PIIAPKSNVASYYNRRHTERNQKTSYSMTVQCVVDSTGAFTDVCIGWPGSNSDEEVLEKS 304
Query: 177 MVYRKFDEEGWRPFPDAV----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
+Y R P + ++G ++PL +W++ P T A R
Sbjct: 305 ALYLH------RGVPGLIQGQWVVGGGSFPLMDWMLVPYTHQNLTWAQHMLNEKVAAVRG 358
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNI 271
+ ++F LK ++ CL R + + V+ A LHNI
Sbjct: 359 VARDAFERLKRRWGCLQ-KRTEVKLLDLPTVLGACCVLHNI 398
>gi|31430659|gb|AAP52540.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|62733440|gb|AAX95557.1| Transposase family IS4, putative [Oryza sativa Japonica Group]
gi|62733452|gb|AAX95569.1| Transposase family IS4, putative [Oryza sativa Japonica Group]
Length = 916
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 104/232 (44%), Gaps = 29/232 (12%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 689 NCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQNVMCVCDFDMRFSFVLAGWPGSVHDMRV 748
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--------RTRNPDA---AADQ 221
++ + +F + +P P L DS YP + + P + N +
Sbjct: 749 FNDA--HTRFSAKFPKPPPGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGKRE 806
Query: 222 RFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALTIDRDNE 280
F CH R +IE SFG+LK K+ L + S + ++ A LHN RD++
Sbjct: 807 HFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI----RDSQ 862
Query: 281 NQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
+ E D + D ++ D + P + P + +R++N FR
Sbjct: 863 MADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 903
>gi|55771325|dbj|BAD72250.1| unknown protein [Oryza sativa Japonica Group]
gi|55771334|dbj|BAD72259.1| unknown protein [Oryza sativa Japonica Group]
Length = 439
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 114/281 (40%), Gaps = 22/281 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++AFR+S E + +E+A A+ T A+ +++V C+ + G P ++
Sbjct: 111 FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAAIPVRKRVAVCVWRLATGEPLRLVSKRF 170
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT-FLRKGGFPNVAGCVDGTMV 123
G S+ + + EV + ++V+WPE F G V G + T +
Sbjct: 171 GLGISTCHKLVLEVCAALKAMVMPKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHI 230
Query: 124 LIDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
I AP + + NR +++ V + F V WPGS D V+ S
Sbjct: 231 PIIAPKSNVASYYNRRHTERNQKTSYSMTVQCVVDSTGAFTDVCIGWPGSNSDEEVLEKS 290
Query: 177 MVYRKFDEEGWRPFPDAV----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
+Y R P + ++G ++PL +W++ P T A R
Sbjct: 291 ALYLH------RGVPGLIQGQWVVGGGSFPLMDWMLVPYTHQNLTWAQHMLNEKVAAVRG 344
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNI 271
+ ++F LK ++ CL R + + V+ A LHNI
Sbjct: 345 VARDAFERLKRRWGCLQ-KRTEVKLLDLPTVLGACCVLHNI 384
>gi|357612248|gb|EHJ67878.1| hypothetical protein KGM_13821 [Danaus plexippus]
Length = 357
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 101/214 (47%), Gaps = 10/214 (4%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT---FLRKGGFPNVAGCVDGTMV 123
S SS R + VV ++ T +KWP++ ND+ + K G N+ G +D T +
Sbjct: 138 SISSTFRVLRRVVAWLL-TKLDDAIKWPQD-FNDVETICEQYHFKTGISNILGVIDCTHI 195
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I+ P + ++ N G ++ + A N +F + PGS + +RV+R S +Y+
Sbjct: 196 KIEKP-RNAREYCNPKGYFSIVLQATIDANLRFTNIYCGEPGSSNCSRVLRKSPLYQTAT 254
Query: 184 EEGWRPFP-DAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGIL 241
+ FP + ++G S YP WL+PP N + QR F H R+L + +F +L
Sbjct: 255 QNRDTLFPHNTFLVGHSGYPSLSWLVPPFRENKRLTSQQREFNSLHASARKLSDKAFNLL 314
Query: 242 KEKFPCLNYMRLSPQ--FAGMVVLASTTLHNIAL 273
K KF + + F ++A+ LHN L
Sbjct: 315 KTKFRRIKLFTVYRNIPFITDTIVAACILHNYCL 348
>gi|449678540|ref|XP_002170519.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 119/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRTFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFNLPKISEFDPKVPPVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ + ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKANSLKVEHIIKATVCLHN 271
>gi|328708299|ref|XP_003243649.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 133
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 68/126 (53%), Gaps = 12/126 (9%)
Query: 65 GPSKSSICRSIHEVVPVI-VDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGT 121
G S+ S+ R I EVV + + F VK+P+ LNDI F R GFP V GC+D T
Sbjct: 2 GLSQPSVSRCISEVVTALNLPEIFNAWVKFPKNINELNDIRNGFYRNTGFPGVVGCIDCT 61
Query: 122 MVLIDAP----NEHEDQ-----FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
V I P N +E+Q +VNR G H++NV IC N K +NA +PGS HD V
Sbjct: 62 HVAITPPSTNLNLNENQHPEYIYVNRKGYHSINVQLICDSNLKILNINALFPGSTHDTHV 121
Query: 173 MRNSMV 178
NS V
Sbjct: 122 WNNSKV 127
>gi|401888633|gb|EJT52586.1| hypothetical protein A1Q1_03388 [Trichosporon asahii var. asahii
CBS 2479]
Length = 419
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 153/344 (44%), Gaps = 31/344 (9%)
Query: 7 KEAFRLSATQVEFILEEIAP-ALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
KE FR+S ++E ++E I A+ P ++ L+ +G G+ + T G
Sbjct: 80 KECFRMSLPELERLVEHIGDHAIFHPGRGRPQKPVSFQIAVFLYRLGRGATISDVCRTMG 139
Query: 66 P-SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG-GFPNVAGCVDGTMV 123
+ S+ + I+ TF ++KWP+ + T LR+ G P G +DG+ +
Sbjct: 140 RIGRGSVTTYCMRTIVAILTTF-NNVIKWPDPRRREAISTRLRRDYGIPGCVGFIDGSHI 198
Query: 124 -LIDAPN---EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
L P+ E F +R + L ++A+C +F + S D R R+S ++
Sbjct: 199 PLHKCPSFSIEKNASFFSRKHRYGLLILAVCDEAKRFTYLQTGHYASASDFRAQRSSALH 258
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD-----QRFLRCHKRTRRLI 234
R+ E R D +LGDS + +IP R AA D Q+F + R I
Sbjct: 259 RRPRELFSR---DQFVLGDSGFYCSLNVIPMYRRR--AAQDLTREQQKFNDRVAKARVKI 313
Query: 235 ENSFGILKEKFPCLNYMRLSPQ------FAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
E++FG+LK+++ LN + L+ + A + + LHN L I N++ ++E +
Sbjct: 314 EDAFGVLKQRWLMLNDINLTMKTDKDINTAFAYIRTAVVLHN--LYIHTANQHWSLEDFE 371
Query: 289 DEAEDDGEEEAVDDPDE-----IDEEEQPRQDRNNPGTQRLARL 327
EE+ DP I+ E+ R DR + +R+A +
Sbjct: 372 RAKAKAEEEDRKADPLPEGELLIETVEEDRMDRRDRLARRVAEI 415
>gi|225435066|ref|XP_002281425.1| PREDICTED: putative nuclease HARBI1 [Vitis vinifera]
gi|297746131|emb|CBI16187.3| unnamed protein product [Vitis vinifera]
Length = 384
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/259 (24%), Positives = 115/259 (44%), Gaps = 21/259 (8%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLN 99
++V L + G IG G + S++ + V ++ + + +WP E +
Sbjct: 89 DQVAVALRRLSSGDSLLTIGDAFGLNHSTVSQVTWRFVEIMEERALHHL-QWPSTEPEIT 147
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFY 157
+I F + G PN G +D T +++ P + +++ H++ + AI P +F
Sbjct: 148 EITSKFEKIRGLPNCCGAIDTTHIMMCLPSADSANSVWLDSENHHSMILQAIVDPEMRFR 207
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEW 206
+ WPG + D+ V+++S + K E+G R + I+GDS YPL W
Sbjct: 208 DIVTGWPGKMKDSSVLQSSNFF-KLCEKGQRLNGKKIELAEGSEISEYIVGDSGYPLLPW 266
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLA 264
L+ P + + F R H TR + + + LKE + + + P + ++L
Sbjct: 267 LVTPYQGKELSESKAEFNRRHFATRMVAQRALARLKEMWKVIQGVMWRPDKNRLPRIILV 326
Query: 265 STTLHNIALTIDRDNENQN 283
LHNI ID ++E Q+
Sbjct: 327 CCLLHNI--VIDLEDEVQD 343
>gi|118120491|ref|XP_424152.2| PREDICTED: putative nuclease HARBI1 [Gallus gallus]
Length = 421
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 142/333 (42%), Gaps = 47/333 (14%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPAL---------TTPTERNVALSAKEKVLSCLHWMGV 53
P + + FR+S +I + P L T P E+ VA++ L +
Sbjct: 86 PQDWLDKFRMSRETFFYICNRLRPGLAPHSAHFHPTLPLEKRVAVA--------LWHLAT 137
Query: 54 GSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFP 112
Y + G S++ + EV +V ++ P+E L ++ F + GFP
Sbjct: 138 NVEYQTLSPLFGVGPSTVQTCVREVSYAVVLLLKPLYLRLPDEKELENMVRIFRTRWGFP 197
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
+ G +D + I+ P + N G H++ A +F+ V+ +PGS+ ++ V
Sbjct: 198 HCIGALDSLHIPINPPLRLSADYCNGQGWHSILTQATVDGLGQFWDVSTAFPGSMENSAV 257
Query: 173 MRNSMVYRKFDEEGWRPFP---------DAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
+ +S ++ E P P V+LGD+ YPL++W++ P D + QR
Sbjct: 258 LESSSLWVLAKEGRLCPNPPKDFMGKAQKYVLLGDATYPLQDWILKPYQE--DESLTQRQ 315
Query: 224 LRCHKRTRR---LIENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIALTIDRDN 279
L+ + R +R +IEN+F LK ++ L S + +VLA LHN+
Sbjct: 316 LQFNYRLKRAHSVIENAFLRLKARWQILLKCDDCSLELLPTLVLACCILHNVC------- 368
Query: 280 ENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQP 312
E HD+ ++ E +P E+ + QP
Sbjct: 369 -----EAHDNPFNEEWLEGT--EPTELPKPCQP 394
>gi|346473924|gb|AEO36806.1| hypothetical protein [Amblyomma maculatum]
Length = 330
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 109/253 (43%), Gaps = 10/253 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+ FR++ + +IL T A+ ++V L+ + + I G
Sbjct: 76 FRAHFRVTRSIFRYILNLCECMQRQDTAMRRAVPLDKRVAVSLYRLASSAEERTIANAFG 135
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+S++ E V+V ++V+ P+ + L + F GFP G +DG +
Sbjct: 136 LGRSTVNTIFREFCAVVVRQLGTKLVRLPKRSELAEHLRQFTAVTGFPQGVGALDGCHIE 195
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS-----MVY 179
+ P EH ++N G ++ ++AI Y+F N PG HDA V +S +
Sbjct: 196 VCPPEEHASDYINNKGWYSTILLAIADHKYRFLYTNVGSPGRNHDAGVFESSSLPGILAS 255
Query: 180 RKFDEEG---WRPFPDAVILGDSAYPLKEWLIPPRTR-NPDAAADQRFLRCHKRTRRLIE 235
F EG + ++L D A+PL+ ++ P P + F R RR++E
Sbjct: 256 HLFRAEGKVVAGVTVEPLLLADQAFPLQNHIMKPFPNPGPPGSPTGVFNYHLCRARRIVE 315
Query: 236 NSFGILKEKFPCL 248
N+FG LK +F L
Sbjct: 316 NAFGRLKARFRIL 328
>gi|307136287|gb|ADN34114.1| retrotransposon protein [Cucumis melo subsp. melo]
Length = 657
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 128/294 (43%), Gaps = 23/294 (7%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+ +E V LH + +I S +I R + V+ ++ + ++K P+
Sbjct: 67 VDVEEMVAMFLHILAHDVKSRVIKREFMRSGETISRHFNMVLLAVI-RLHEELLKKPQPV 125
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
N+ T R F N G +DGT + ++ P ++ R G+ A NV+ +C F
Sbjct: 126 PNEC--TDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRKGEVATNVLGVCDTKGDFV 183
Query: 158 AVNANWPGSVHDARVMRNSMV----------YRKFDEEGWRPFPDAVI--LGDSAYPLKE 205
V A W GS D+R++R+++ Y + G+ P + + Y L+E
Sbjct: 184 YVLAGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDVGY-PNAEGFLAPYRGQRYHLQE 242
Query: 206 WLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLA 264
W P N + + + F H R +IE +FG+LK ++ L P + +LA
Sbjct: 243 WRGP---ENAPSTSKEFFNMKHYSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILA 299
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNN 318
LHN+ I+R+ N ++E + DE + A DD I+ + Q R+N
Sbjct: 300 CCLLHNL---INREMTNFDIEDNIDEVDSTHATTAADDIHYIETSNEWSQWRDN 350
>gi|255567510|ref|XP_002524734.1| conserved hypothetical protein [Ricinus communis]
gi|223535918|gb|EEF37577.1| conserved hypothetical protein [Ricinus communis]
Length = 363
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 130/307 (42%), Gaps = 28/307 (9%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AFK FR+S E+I + L + + LS +++V L + G
Sbjct: 23 AFKYFFRVSKKTFEYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 82
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + + +KWP+ N + DI F G PN G
Sbjct: 83 SVGAAFGVGQSTVSQVTWRFIEAL-EERARHHLKWPDSNRMEDIKSRFEALFGLPNCCGA 141
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E D + ++ ++++ + + +F + WPG + +R+++ S
Sbjct: 142 IDATHIIMTLPAVETSDHWCDQENNYSMFLQGVVDHEMRFLDIVTGWPGGMTVSRLLKCS 201
Query: 177 MVYRKFDEE-----GWRPFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
+++ + R + + I+G YPL WLI P + + F
Sbjct: 202 GFFKRCESGECLNGKLRKLSEGMEIREYIVGGVCYPLLPWLITPDGSDEHSDPLSNFNTM 261
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQNV 284
H R L SF LK + LN + P + ++L LHNI + DN +Q
Sbjct: 262 HAVARMLAVKSFLQLKGSWRILNKVMWRPDKRKLPSIILVCCLLHNIII----DNGDQ-- 315
Query: 285 EIHDDEA 291
+H D A
Sbjct: 316 -LHPDVA 321
>gi|221128761|ref|XP_002160578.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 119/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFCFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N +++ ++AIC Y F V+ G +DA ++ S R F+ +G
Sbjct: 119 APAKSGSYFYNYKRFYSMVLLAICDAKYCFTMVHIGAYGRDNDAAILNASTFGRAFN-KG 177
Query: 187 WRPFPD---------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P AV++GD + LK WL+ P + F R RR I+NS
Sbjct: 178 YFNLPKISEFDPKVPAVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIKNS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|225449613|ref|XP_002279396.1| PREDICTED: putative nuclease HARBI1 [Vitis vinifera]
Length = 398
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 127/296 (42%), Gaps = 26/296 (8%)
Query: 4 FAFKEAFRLSATQVEFILEEIA------PALTTPTERNVALSAKEKVLSCLHWMGVGSPY 57
F+ F++S +I + PA + + N LS ++V L + G +
Sbjct: 60 LTFESVFKISRKTFNYICSLVKEDMMAKPANFSASNGN-PLSLNDRVAIALRRLSSGESF 118
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVA 115
I+G + G ++S++ + + V + + +WP E ++ + +F + G PN
Sbjct: 119 SIVGDSFGVNQSTVAQITWKFVESMEVRGLHHL-QWPSTEADMEETKSSFEKIRGLPNCC 177
Query: 116 GCVDGTMVL--IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VD T + + +++ H++ + AI P +F V WPGS+ ++ V+
Sbjct: 178 GAVDTTHISMSLSTMGTSNKVWIDHQKKHSMILQAIVDPEMRFRDVITGWPGSLSESCVL 237
Query: 174 RNSMVYRKFDEEGWR-----------PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR 222
+NS + K EEG R + I+GD+ +PL WL+ P +
Sbjct: 238 QNSGFF-KLSEEGKRLNGKMIELSEGTYLREYIVGDAGFPLLPWLLTPYQGKGLSDFQSE 296
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTID 276
F + H TR + + LKE + + + P + ++L LHNI + ++
Sbjct: 297 FNKRHFATRMVARRALARLKEMWRIIQGVMWRPDTNKLPRIILVCCLLHNIVIDLE 352
>gi|449681091|ref|XP_002162517.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 118/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G + G V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIGGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIAAYGRDNDAVILNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFNLPKTSEFDPKVPPVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|354542525|ref|NP_001121305.2| uncharacterized protein LOC100158389 [Xenopus laevis]
Length = 416
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 125/302 (41%), Gaps = 25/302 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK R+S + E +L+ +AP + T A+ A++++++ L ++ G +
Sbjct: 59 FKNRLRMSDSSFETLLQAVAPLIAKQDTCMRQAIPAEQRLIATLQFLATGRSLEDLKIPI 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
G S S+ I E I + +K+P E F FPN G + G
Sbjct: 119 GISAQSLGHIIPETCNAIFEALKTAYLKFPSTEAEWKATARHFEEFWNFPNCGGAIVGKH 178
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I P+ F N G +++ ++AI Y+F V+ G V M + Y++
Sbjct: 179 VRIKPPSRSGSYFTNYKGYNSIVLLAIVNAKYEFLMVDVGKNGRVSSVESMEQTYFYQRL 238
Query: 183 DEEGWRPFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRL 233
+ ++ V + + L E ++ P + D + ++R R RR+
Sbjct: 239 QNHQLQLPSNSDTSEGMNYVFVTGEEFALHEHILTPFPQK-DMSFERRIFNYRLSRARRV 297
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI-----------ALTIDRDNEN 281
+N+FGIL +F + + LSP VVLA LHN A +DR++
Sbjct: 298 ADNAFGILSNRFRIFHTAINLSPSKTDSVVLACCVLHNFLCRTSGASYMPATMLDREDTE 357
Query: 282 QN 283
Q+
Sbjct: 358 QS 359
>gi|340376510|ref|XP_003386775.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 378
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 117/282 (41%), Gaps = 18/282 (6%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
+++ R+ E++L +AP + T ++S E++ L ++ G Y +
Sbjct: 22 SYRHFLRMDINSFEYLLTLVAPKIRRRDTHMRKSISPGERLALTLRFLATGESYSSLQYL 81
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRM-VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTM 122
+ +I + + E IV+ M V + +I + F +K F + G +DG
Sbjct: 82 YRVPCQTIGKIVIETCEAIVEVLADYMKVPKTQSEWKEIAIEFDQKWNFLHCLGALDGKH 141
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ I P + N ++ +A+ NYKF V+ G V D V R S + F
Sbjct: 142 INIRPPPSSGSYYYNYKQRFSIVFLALVDANYKFLYVDIGCNGRVSDGGVFRESSLSSAF 201
Query: 183 DEEGW------------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KR 229
P P +I+ D A+PLK+++ P R ++R R
Sbjct: 202 QNNTLDVPPLEVLPGCTTPIP-YIIVADEAFPLKDYIQKP-YRQTGLTTERRIYNYRLSR 259
Query: 230 TRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHN 270
RR++EN+FGIL F L + L+P+ + L LHN
Sbjct: 260 ARRVVENAFGILANHFRVLMTAINLAPEKVETLTLTCCLLHN 301
>gi|255582388|ref|XP_002531983.1| RNA binding protein, putative [Ricinus communis]
gi|223528380|gb|EEF30419.1| RNA binding protein, putative [Ricinus communis]
Length = 591
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/311 (24%), Positives = 133/311 (42%), Gaps = 35/311 (11%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
LS ++V L + G IG + G ++S++ V + + + +WP E
Sbjct: 95 LSPNDQVAIALRRLSSGESLSSIGDSFGVNQSTVSHVTWRFVEAMEERGLHHL-RWPSTE 153
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPN 153
+ I F + PN G +D T V++ P + D +++R +H++ + AI P+
Sbjct: 154 TEMAAIKSKFEKIHSLPNCCGVIDTTHVVMTLPAVDHSNDVWIDREKNHSMVLQAIVDPD 213
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYP 202
+F V +PGS+ DA V++NS + K EEG R + I+GD+ +P
Sbjct: 214 MRFRDVIVGYPGSLSDALVLQNSSFF-KLSEEGKRLNGKKMELMEGTELGEYIIGDAGFP 272
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-- 260
L WL P ++P F H T L + + LKE + ++ + P +
Sbjct: 273 LMPWLFTP-YQHPHQEHQIEFNNRHSATMLLAQIALTRLKEIWRIIHGVMWLPDKNRLPR 331
Query: 261 VVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPG 320
++ LHNI + + +D+ D E V D + D +Q + +N G
Sbjct: 332 IIFVCCLLHNIVIDM------------EDKVLD---EMPVSDSHDKDYRQQICESASNSG 376
Query: 321 TQRLARLLNYF 331
T+ +L Y
Sbjct: 377 TEMREKLSLYL 387
>gi|449680215|ref|XP_004209527.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 382
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 117/291 (40%), Gaps = 32/291 (10%)
Query: 5 AFKEAFRLSATQ----VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
A+K+ FRL V + I+P +P R A++ +KV L+++ ++
Sbjct: 63 AYKKNFRLDKNSFFELVSLLEPYISPDSYSPNIR--AITPDKKVAITLYYLKDSGTLNMT 120
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCV 118
T G + + I EV IV R V P+ + + + F K G GCV
Sbjct: 121 ANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCV 180
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DGT + I P H + ++L V A+C F V WPGSVHDA+V NS +
Sbjct: 181 DGTHISIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSI 240
Query: 179 YRKFDEEGWRPFPDAV----------ILGDSAYPL-----KEWLIPPRT-RNPDAAADQR 222
+ ++GD AYPL KE+ RT + D
Sbjct: 241 NTNLRSSRLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCMKEY----RTCKKNDEVIFNS 296
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIA 272
LR R IE +FG LK ++ L M L + V+ A LHN
Sbjct: 297 MLR---TARNPIECAFGRLKARWKILTKKMDLKLEKIPTVIYACFILHNFC 344
>gi|77551559|gb|ABA94356.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 916
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/238 (28%), Positives = 104/238 (43%), Gaps = 41/238 (17%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR + + NVM +C + +F V A WPGSVHD RV
Sbjct: 689 NCIGAIDGTHIQVVVPNSAAVQHRNRYKEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 748
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
++ +F + +P P L DS YP + +L P PR R
Sbjct: 749 FNDAQT--RFSAKFPKPPPGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGRR- 805
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F CH R +IE SFG+LK K+ L + S + ++ A LHN
Sbjct: 806 -----EHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI-- 858
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
RD++ + E D + D ++ D + P + P + +R++N FR
Sbjct: 859 --RDSQMADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 903
>gi|356536488|ref|XP_003536769.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 382
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/328 (21%), Positives = 138/328 (42%), Gaps = 25/328 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-----ALSAKEKVLSCLHWMGVGSPYHII 60
F+ F++S +I + + V LS ++V L + G I
Sbjct: 46 FESVFKISRKTFNYICSLVEEDMVARASNFVDLNGNRLSLNDQVAVALRRLSSGESLSTI 105
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + ++S++ + + V + + + WP E + +I F G N G +
Sbjct: 106 GDSFRMNQSTVSQVTWKFVETMEERGLHHL-SWPSTEMEMEEIKSKFENIRGLSNCCGAI 164
Query: 119 DGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T +++ P + +++R + ++ + A+ P+ +F+ + WPGS+ D +V+R+S
Sbjct: 165 DSTHIMMTLPSVDALNSVWIDREKNCSMVLQAVVDPDLRFHDIVTGWPGSMSDEQVLRSS 224
Query: 177 MVYRKFDEEGWR------PFPDAVIL-----GDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
++ K EEG R PD +L GD+ +PL WL+ P + F +
Sbjct: 225 SLF-KLAEEGKRLNGGKKTLPDGSVLREYIIGDTGFPLLPWLLTPFEGKGLSDIQVEFNK 283
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
T+ + + + LK+ + + + P + +VL LHNI + ++ + N
Sbjct: 284 RIVETQMVAKKALARLKDMWKIIQGVMWKPDKHKLPRIVLVCCILHNIVMDMEDEVHNDM 343
Query: 284 VEIHD-DEAEDDGEEEAVDDPDEIDEEE 310
+ H D D E D+ I E+
Sbjct: 344 LSCHQHDSRYQDQTSEFADNTANIMREK 371
>gi|432936771|ref|XP_004082271.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 439
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 129/293 (44%), Gaps = 26/293 (8%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P + + FR+S ++ E++ P L T +AL +++V L + Y I
Sbjct: 92 PSDWLDKFRMSRETFFYLCEKLRPRLARQDTSFRLALPVEKRVAVALWRLASNIEYRTIS 151
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDG 120
+ G KS++CR + ++ IV ++ P E+ L D F GFP+ +
Sbjct: 152 SLFGVGKSTVCRCVRDMCHAIVALLSCTYLQPPNEQELEDSARLFESYWGFPHCVAAIAT 211
Query: 121 TMVLIDAPNEHEDQFVNRNG-DHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
I P+ + + N G L+ +A+ G + F+ V A++PG A +++NS ++
Sbjct: 212 LHTAIITPSNNALDYANPAGWLSVLSQVAVSGRGH-FWDVCASFPGGTDPAEILQNSSLW 270
Query: 180 RKFDEEGWRPFP---------DAVILGDSAYPLKEWLIPP----RTRNPDAAA----DQR 222
E G P V+LG++ YPL+ WL+ R+R + AA Q
Sbjct: 271 ATASEGGLSPAQPPSFMGRAMRYVMLGEACYPLQSWLMKAFPEERSRRKNHAALTQRQQV 330
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNIA 272
F + R R+ E + L+ ++ CL N RL ++LA LHN+
Sbjct: 331 FNQQLNRALRVSEEALLRLRARWQCLSKRNDCRLD--VVPTMILACCILHNMC 381
>gi|340385015|ref|XP_003391006.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 419
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 122/288 (42%), Gaps = 31/288 (10%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPY------ 57
++K+ R+ EF L ++P + T A+ A E++ L ++ G Y
Sbjct: 59 SYKQFLRMDHNSFEF-LSNVSPIIARQDTIMRRAIPAGERLALTLRFLATGESYSSLQYL 117
Query: 58 -HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNV 114
I T G C+ I+EVV Q +K P + I F + FP+
Sbjct: 118 YRIPSQTIGKIVIETCQGINEVV--------QDYLKVPKTQSEWRQIAEQFDSRWNFPHC 169
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DG V I + N ++ +MA+ +YKF V+ G V D + R
Sbjct: 170 CGALDGKHVNIIPRANSGSYYYNYKQRFSIVLMALVDADYKFIFVDIGCNGRVSDGGIFR 229
Query: 175 NSMVYRKFDEEGW-----RPFPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
S + + P P +++ D+A+PLK ++ P ++N + +
Sbjct: 230 ESTLSAAQESNALDFPPPEPLPGCTLPIPYMVVADAAFPLKHYIQKPYSQNGLTHNKRIY 289
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
R RR++EN+FGIL +F L + L P+ ++VLA TLHN
Sbjct: 290 NYRLSRARRVVENAFGILGNRFRVLMTPIALVPEKVEVIVLACCTLHN 337
>gi|449663129|ref|XP_004205687.1| PREDICTED: uncharacterized protein LOC101241637 [Hydra
magnipapillata]
Length = 231
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 13/221 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y + +
Sbjct: 6 FKEHFRVNRNTFNFLVYELHPLLGKTTTTFRE-PISVVKRVAVALHYLASCEEYRGVSSL 64
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS +HE + + D + VK+P ENL F GF G VDG
Sbjct: 65 FGIGKSIANLIVHEFINAVNDILLPKYVKFPLSVENLIKHSRDFEAILGFQKCIGAVDGC 124
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN------ 175
+ I AP + + N G +++ + ++ Y+F + PG +D+ + +N
Sbjct: 125 HIPIFAPKDQAISYYNYKGWYSIVLFSVVDCRYRFIYTSVGSPGRNNDSYIFQNSSLKAI 184
Query: 176 --SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
S ++ K +E ++GDSA+PL L+ P +N
Sbjct: 185 LESSLFDKCCKELGDSLVPLCLIGDSAFPLTRHLLKPYPKN 225
>gi|307172839|gb|EFN64056.1| Putative nuclease HARBI1 [Camponotus floridanus]
Length = 143
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 71/149 (47%), Gaps = 16/149 (10%)
Query: 110 GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM----AICGPNYKFYAVNANWPG 165
GF V G +DGT + I AP + +VNR G H++ + A+C F A PG
Sbjct: 1 GFSKVIGAIDGTHINIPAPRIYPASYVNRKGHHSIQLFSDFQAVCNHKTHFLHCFAGHPG 60
Query: 166 SVHDARVMRNSMVYRKFDEEGW----RPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAAD 220
SVHD RV R S + + W FPD ILGD+AY L + LI P N
Sbjct: 61 SVHDQRVFRLSEL------QEWLGNPEKFPDDYHILGDAAYKLHQNLIVPYRDNGHLTPR 114
Query: 221 QR-FLRCHKRTRRLIENSFGILKEKFPCL 248
Q+ F CH R IE +FG +K F L
Sbjct: 115 QKNFNFCHSSARVAIERAFGFMKTSFRSL 143
>gi|449678194|ref|XP_004209026.1| PREDICTED: uncharacterized protein LOC100199174 [Hydra
magnipapillata]
Length = 449
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 115/277 (41%), Gaps = 48/277 (17%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ ++ P L TT T R +S +++ LH++ Y ++
Sbjct: 168 FKEHFRVNRNTFNFLVNQLHPHLGKTTTTMRE-PISVVKRIAFALHYLASCEEYRVVSCL 226
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ +HE + + D + K+P EN N F GFP G VDG
Sbjct: 227 FGIGKSTENLIVHEFINAVNDILLPKYAKFPLSVENSNKHSRDFEAILGFPQCVGAVDGC 286
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ I AP + + N G C + + +S+V
Sbjct: 287 HIPISAPKDQAISYYNYKG---------C-------------------CKELGDSLV--- 315
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGI 240
P +GDSA+PL L+ P N + + Q+ F RR++EN+FG
Sbjct: 316 -------PLCS---IGDSAFPLTRHLLKPYPENLELSEIQKNFNEILCGARRVVENAFGR 365
Query: 241 LKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
+K +F + M + FA +V A TLHNI D
Sbjct: 366 VKARFRVICKRMECNINFATRIVNACITLHNICEYYD 402
>gi|357454107|ref|XP_003597334.1| hypothetical protein MTR_2g096480 [Medicago truncatula]
gi|355486382|gb|AES67585.1| hypothetical protein MTR_2g096480 [Medicago truncatula]
Length = 390
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 114/257 (44%), Gaps = 21/257 (8%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
+S ++V L +G G + IG + G S S++ + V + + + +WP +
Sbjct: 87 VSLTDQVAVALRRLGSGDSFVTIGDSFGLSHSTVSQVTWRFVESMEERGLHHL-QWPSTQ 145
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
E +N I F + GFPN G VD T + L+ A + D +++ +H++ + AI P+
Sbjct: 146 EEMNSIKSKFEKIQGFPNCCGAVDVTHITMLLPATEQSSDVWLDHKNNHSMVLQAIVDPD 205
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYP 202
KF + WPG + D + +S + K + G R + I+GDS YP
Sbjct: 206 MKFRDIVTGWPGKMEDWSIFESSN-FNKLCDNGERLNGKKLKLSKGSEIREYIIGDSGYP 264
Query: 203 LKEWLIPPRTRNP--DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM 260
+L+ P ++ +F + H TR + + + LKE + + P +
Sbjct: 265 QLPYLVVPYEEKEILESEPKAKFNKLHLETRMVAQRALTRLKEMWKIIRGKMWRPDKHRL 324
Query: 261 --VVLASTTLHNIALTI 275
++L LHNI + +
Sbjct: 325 PRIILVCCILHNIVIDM 341
>gi|124359622|gb|ABN06006.1| Trp repressor/replication initiator [Medicago truncatula]
Length = 342
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 113/255 (44%), Gaps = 21/255 (8%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
+S ++V L +G G + IG + G S S++ + V + + + +WP +
Sbjct: 39 VSLTDQVAVALRRLGSGDSFVTIGDSFGLSHSTVSQVTWRFVESMEERGLHHL-QWPSTQ 97
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
E +N I F + GFPN G VD T + L+ A + D +++ +H++ + AI P+
Sbjct: 98 EEMNSIKSKFEKIQGFPNCCGAVDVTHITMLLPATEQSSDVWLDHKNNHSMVLQAIVDPD 157
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYP 202
KF + WPG + D + +S + K + G R + I+GDS YP
Sbjct: 158 MKFRDIVTGWPGKMEDWSIFESSN-FNKLCDNGERLNGKKLKLSKGSEIREYIIGDSGYP 216
Query: 203 LKEWLIPPRTRNP--DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM 260
+L+ P ++ +F + H TR + + + LKE + + P +
Sbjct: 217 QLPYLVVPYEEKEILESEPKAKFNKLHLETRMVAQRALTRLKEMWKIIRGKMWRPDKHRL 276
Query: 261 --VVLASTTLHNIAL 273
++L LHNI +
Sbjct: 277 PRIILVCCILHNIVI 291
>gi|340383778|ref|XP_003390393.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 386
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 107/253 (42%), Gaps = 26/253 (10%)
Query: 10 FRLSATQVEFILEEIAPAL---TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
R++ + + +L+E+ P L T + +S E+++ L ++ G+ +
Sbjct: 60 IRMTRERFDLLLQEVGPYLLHRTYNSSLRPQVSPAERLILTLRYLCTGNSQISLSFNFRL 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+S++C + E I + +K P E+ I F FPN G +DG ++
Sbjct: 120 GRSTVCEILKETCDAIWNALHLAYLKAPSTEQEWIRISRNFETMWNFPNCIGAIDGKHIV 179
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I AP F N G H++ ++A+C +A G + D+ + N +E
Sbjct: 180 IQAPMNAGSTFYNYKGKHSIVLLAVC---------DAGEAGRLSDSGIFTNCQFGTALEE 230
Query: 185 EGWRPFPDA------------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
PD VI+GD +P+K++L+ P + F R RR
Sbjct: 231 GLLNIPPDKALTGTSEPSLPYVIIGDEGFPIKKYLLRPYPGRYLSKEKSIFNYRLSRARR 290
Query: 233 LIENSFGILKEKF 245
+IENSFGIL ++
Sbjct: 291 IIENSFGILAARW 303
>gi|295829883|gb|ADG38610.1| AT3G55350-like protein [Capsella grandiflora]
gi|295829885|gb|ADG38611.1| AT3G55350-like protein [Capsella grandiflora]
gi|295829887|gb|ADG38612.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 97/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 3 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 60
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 61 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 120
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + + I+GDS +PL WL+
Sbjct: 121 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKLHLSERTELREYIIGDSGFPLLPWLL 179
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 180 TPYQGKPLSLPQTEFNKRHSEARK 203
>gi|345291409|gb|AEN82196.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291411|gb|AEN82197.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291415|gb|AEN82199.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291417|gb|AEN82200.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291419|gb|AEN82201.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291423|gb|AEN82203.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 97/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 2 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 59
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 60 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 119
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + + I+GDS +PL WL+
Sbjct: 120 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKLHLSERTELREYIIGDSGFPLLPWLL 178
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 179 TPYQGKPLSLPQTEFNKRHSEARK 202
>gi|147771935|emb|CAN75696.1| hypothetical protein VITISV_035983 [Vitis vinifera]
Length = 596
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 110/245 (44%), Gaps = 24/245 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S +I + HEV+ +V+ F + M+ P N N I LR+ F +V G +DGT++
Sbjct: 326 SSQTIHKYFHEVLVAMVN-FSKEMITPPSFNDSSNGISNRRLRQI-FKDVVGAIDGTLIH 383
Query: 125 IDAPNEHEDQFVNRN-GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
P + + R G+ NVMA+C + F V W G+ HD+RV+ ++ R
Sbjct: 384 ACIPTNQQVPYRGRGRGECFQNVMALCDFDMIFRFVVVGWEGTTHDSRVLTETI--RNLQ 441
Query: 184 EEGWRPFPDAVILGDSAY-----PLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
P DA + ++ WL R+ ++ F +CH R R +IE +F
Sbjct: 442 HNFPMPPSDAAYTHSRGFMAPYRNVRYWLSDFRSGGKAVGKEEIFNQCHARLRNVIERAF 501
Query: 239 GILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI--------DRDNENQNVEIHDD 289
G++K +FP L M S +V+ ++HN I + DNE VE+ D
Sbjct: 502 GVVKARFPILKRMTPYSFTTQTKIVMTCFSIHNFLRQISVADKLFSEYDNE---VELESD 558
Query: 290 EAEDD 294
A +
Sbjct: 559 NANQN 563
>gi|307135889|gb|ADN33754.1| retrotransposon protein [Cucumis melo subsp. melo]
Length = 623
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 119/286 (41%), Gaps = 40/286 (13%)
Query: 34 RNVA-------LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTF 86
RNVA + +E V LH + +I S ++ R + +V + V
Sbjct: 28 RNVAGLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQQEFVRSGETVSRHFN-IVLLAVLRL 86
Query: 87 FQRMVKWP---EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHA 143
++ ++K P N ND R F N G +DGT + ++ P F R G+ A
Sbjct: 87 YEELIKRPVPVTSNCND-----QRWKCFENCLGALDGTYIKVNVPAGDRPTFRTRKGEIA 141
Query: 144 LNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPL 203
NV+ +C F V A W GS D+R++R+++ E G + L D+ YP
Sbjct: 142 TNVLGVCDMKGDFVYVLAGWEGSAADSRILRDAIS----QENGLQVPKGYYYLCDAGYPN 197
Query: 204 KEWLIPPR------------TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM 251
E + P N A + F H R +IE +FG+LK ++ L
Sbjct: 198 AEGFLAPYKGQRYHLQEWRGAANAPTNAKEYFNMKHSSARNVIERAFGVLKGRWTILRGK 257
Query: 252 RLSP-QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGE 296
P Q +LA T LHN+ I+R+ N D E ED+G+
Sbjct: 258 SYYPLQVQCRTILACTLLHNL---INREMTYCN----DVEDEDEGD 296
>gi|255638892|gb|ACU19748.1| unknown [Glycine max]
Length = 382
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/328 (21%), Positives = 138/328 (42%), Gaps = 25/328 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-----ALSAKEKVLSCLHWMGVGSPYHII 60
F+ F++S +I + + V LS ++V L + G I
Sbjct: 46 FESVFKISRKTFNYICSLVEEDMVARASNFVDLNGNRLSLNDQVAVALRRLSSGESLSTI 105
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + ++S++ + + V + + + WP E + +I F G N G +
Sbjct: 106 GDSFRMNQSTVSQVTWKFVETMEERGLHHL-SWPSTEMEMEEIKSKFENIRGLSNCCGAI 164
Query: 119 DGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T +++ P + +++R + ++ + A+ P+ +F+ + WPGS+ D +V+R+S
Sbjct: 165 DSTHIMMTLPSVDALNSVWIDREKNCSMVLQAVVDPDLRFHDIVTGWPGSMSDEQVLRSS 224
Query: 177 MVYRKFDEEGWR------PFPDAVIL-----GDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
++ K EEG R PD +L GD+ +PL WL+ P + F +
Sbjct: 225 SLF-KLAEEGKRLNGGKKTLPDGSVLREYIIGDTGFPLLPWLLTPFEGKGLSDIQVEFNK 283
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
T+ + + + LK+ + + + P + +VL LHN+ + ++ + N
Sbjct: 284 RIVETQMVAKKALARLKDMWKIIQGVMWKPDKHKLPRIVLVCCILHNVVMDMEDEVHNDM 343
Query: 284 VEIHD-DEAEDDGEEEAVDDPDEIDEEE 310
+ H D D E D+ I E+
Sbjct: 344 LSCHQHDSRYQDQTSEFADNTANIMREK 371
>gi|449691859|ref|XP_004212822.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 131
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 71/124 (57%), Gaps = 6/124 (4%)
Query: 65 GPSKSSICR-SIHEVVPVIVDTFFQRM---VKWPE--ENLNDIPMTFLRKGGFPNVAGCV 118
G SK+++ + +I ++ + + +R+ +K+P LN+ + F FP +
Sbjct: 4 GDSKAALSQPTISRIIRRVSLSLAKRVNEYIKYPAVPHVLNESRVNFYNVAEFPRITRLT 63
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DGT V I P EHE +VNR+ +H++NV A+C N KF + A WPGS HDA+++R S +
Sbjct: 64 DGTHVCIQKPREHEYVYVNRSSNHSINVQAVCDCNGKFIDIVAKWPGSTHDAKILRESKL 123
Query: 179 YRKF 182
+K
Sbjct: 124 GKKM 127
>gi|406701986|gb|EKD05058.1| hypothetical protein A1Q2_00665 [Trichosporon asahii var. asahii
CBS 8904]
Length = 419
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 152/344 (44%), Gaps = 31/344 (9%)
Query: 7 KEAFRLSATQVEFILEEIAP-ALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
KE FR+S ++E ++E I A+ P ++ L+ +G G+ + T G
Sbjct: 80 KECFRMSLPELERLVEHIGDHAIFHPGRGRPQKPVSFQIAVFLYRLGRGATISDVCRTMG 139
Query: 66 P-SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG-GFPNVAGCVDGTMV 123
+ S+ + I+ TF ++KWP+ + T LR+ G P G +DG+ +
Sbjct: 140 RIGRGSVTTYCMRTIVAILTTF-NNVIKWPDPRRREAISTRLRRDYGIPGCVGFIDGSHI 198
Query: 124 -LIDAPN---EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
L P+ E F +R + L ++A+C +F + S D R R+S ++
Sbjct: 199 PLHKCPSFSIEKNASFFSRKHRYGLLILAVCDEAKRFTYLQTGHYASASDFRAQRSSALH 258
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD-----QRFLRCHKRTRRLI 234
R+ E R D +LGDS + +IP R AA D Q+F + R I
Sbjct: 259 RRPRELFSR---DQFVLGDSGFYCSLNVIPMYRRR--AAQDLTREQQKFNDRVAKARVKI 313
Query: 235 ENSFGILKEKFPCLNYMRLSPQ------FAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
E++FG+LK ++ LN + L+ + A + + LHN L I N++ ++E +
Sbjct: 314 EHAFGVLKLRWLMLNDINLTMKTDKDINTAFAYIRTAVVLHN--LYIHTANQHWSLEDFE 371
Query: 289 DEAEDDGEEEAVDDPDE-----IDEEEQPRQDRNNPGTQRLARL 327
EE+ DP I+ E+ R DR + +R+A +
Sbjct: 372 RAKAKAEEEDRKADPLPEGELLIETVEEDRMDRRDRLARRVAEI 415
>gi|449686684|ref|XP_004211227.1| PREDICTED: uncharacterized protein LOC101238273 [Hydra
magnipapillata]
Length = 286
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 112/252 (44%), Gaps = 17/252 (6%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
FR+S T E +L +AP + T +S E++ L ++ G I A++ S
Sbjct: 36 FRMSPTTYEELLLFVAPIIVKQRTTMRDPVSLSERLAVTLRFLVTGDAQCTIAASYRISA 95
Query: 69 SSICRSIHEVVPVIVDTFFQR-MVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S+I R I E I + +R + P E I F F + G +DGT +++
Sbjct: 96 STISRIISETCAAIWTSLKERNFLHVPSEKQEWKTIAKEFENMWNFLHAIGAIDGTHIVM 155
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN-SMVYRKFDE 184
AP+ ++ N H++ ++A+C YKF V G D V N S+ Y +
Sbjct: 156 QAPHNGGSEYFNYKKTHSIVLLAVCNAKYKFTTVELGDSGRQSDGSVFNNCSLGYGIENN 215
Query: 185 EGWRPFPD----------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRL 233
+ P P+ V++ D+A+ LK ++ P N + DQ+ R RR+
Sbjct: 216 KLNIPDPEYIGNSEKVLPYVLVADNAFGLKRHMMKPYP-NQNIPLDQKIFNYRLSRARRV 274
Query: 234 IENSFGILKEKF 245
IEN+FGI +F
Sbjct: 275 IENTFGIATTRF 286
>gi|449677576|ref|XP_002162156.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 329
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 95/215 (44%), Gaps = 13/215 (6%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++++C I E I + ++ P + +I F + FP G +DG V I
Sbjct: 26 RATVCHIIKETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRI 85
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+AP + F N G +++ ++AIC Y F V+ G +DA V+ S R F+ +
Sbjct: 86 EAPAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAVLNASTFGRAFN-K 144
Query: 186 GWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
G+ P V++GD + LK WL+ P + F R RR IEN
Sbjct: 145 GYFNLPKISEIDPKVPPVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIEN 204
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
SFGIL ++ ++ P ++ A+ LHN
Sbjct: 205 SFGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 239
>gi|328714341|ref|XP_001946988.2| PREDICTED: hypothetical protein LOC100165998 [Acyrthosiphon pisum]
Length = 407
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 119/278 (42%), Gaps = 36/278 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ R+SAT E+++++I P + T T + +E++ L +
Sbjct: 73 FQNFCRMSATDFEYLIQKIGPTICKTDTNMRKCIPVQERLAVTLRCI------------- 119
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+C+++++ + + M+ E +I F K FPN G +DG ++
Sbjct: 120 ----QDVCKALNQELKNEI------MLPNNESGWKNIAADFYDKWNFPNCLGAIDGKHIV 169
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD- 183
I P ++ N G ++ +MA+ +Y F V+ G + D V+RN+ ++K +
Sbjct: 170 IQCPANSGSEYYNYKGTFSIVLMAVVNSDYSFTYVDIGCQGRISDGGVLRNTSFFKKLET 229
Query: 184 -----EEGWRPFPDA----VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRL 233
+EG P + V + D A+ L + ++ P + + +R R RR+
Sbjct: 230 DELIPQEGILPNTNVSLPYVFVTDEAFALSKNIMKPYSGVYENGNCKRIFNYRLSRARRV 289
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
+EN FGI+ F + L P+ + + LHN
Sbjct: 290 VENVFGIMASVFRVFRGPILLEPEKVRDITMTCVLLHN 327
>gi|295829889|gb|ADG38613.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 96/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 3 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 60
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 61 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 120
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + I+GDS +PL WL+
Sbjct: 121 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKXHLXERTELREYIIGDSGFPLLPWLL 179
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 180 TPYQGKPLSLPQTEFNKRHSEARK 203
>gi|345291421|gb|AEN82202.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 96/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 2 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 59
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 60 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 119
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + I+GDS +PL WL+
Sbjct: 120 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKFHLSERTELREYIIGDSGFPLLPWLL 178
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 179 TPYQGKPLSLPQTEFNKRHSEARK 202
>gi|115629193|ref|XP_001201331.1| PREDICTED: uncharacterized protein LOC764811 [Strongylocentrotus
purpuratus]
Length = 342
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 114/265 (43%), Gaps = 21/265 (7%)
Query: 43 KVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--D 100
K+ L + G+ Y + K +I + + +V IVD + ++ P N +
Sbjct: 54 KLSVFLRHLSTGATYAELSYNFRVGKETIQKFVPDVARAIVDEYAAEVISLPTTNEGWLE 113
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+ F + P+ G DG + + PN+ + N ++ +MA+ Y+F ++
Sbjct: 114 VAGDFEARWNLPHCLGAYDGKHIRLQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFLWID 173
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV--------------ILGDSAYPLKEW 206
G DA++ NS + ++ E G PD +GD A+ ++ +
Sbjct: 174 VGGVGHQSDAQIYNNSEL-KECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTY 232
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLAS 265
++ P R + F R RR++EN+FGIL +F C L MR P +++ A+
Sbjct: 233 MMKPYGRRNMDQQQKIFNYRLSRARRVVENAFGILALRFQCFLGQMRQEPDTVRLLIEAA 292
Query: 266 TTLHNIALTIDRDNENQNVEIHDDE 290
LHN+ I + + +V + D E
Sbjct: 293 VMLHNL---IRKRYQALDVRMLDQE 314
>gi|296086277|emb|CBI31718.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 113/256 (44%), Gaps = 19/256 (7%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
LS ++V L + G + I+G + G ++S++ + + V + + +WP E
Sbjct: 16 LSLNDRVAIALRRLSSGESFSIVGDSFGVNQSTVAQITWKFVESMEVRGLHHL-QWPSTE 74
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVL--IDAPNEHEDQFVNRNGDHALNVMAICGPN 153
++ + +F + G PN G VD T + + +++ H++ + AI P
Sbjct: 75 ADMEETKSSFEKIRGLPNCCGAVDTTHISMSLSTMGTSNKVWIDHQKKHSMILQAIVDPE 134
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-----------PFPDAVILGDSAYP 202
+F V WPGS+ ++ V++NS + K EEG R + I+GD+ +P
Sbjct: 135 MRFRDVITGWPGSLSESCVLQNSGFF-KLSEEGKRLNGKMIELSEGTYLREYIVGDAGFP 193
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-- 260
L WL+ P + F + H TR + + LKE + + + P +
Sbjct: 194 LLPWLLTPYQGKGLSDFQSEFNKRHFATRMVARRALARLKEMWRIIQGVMWRPDTNKLPR 253
Query: 261 VVLASTTLHNIALTID 276
++L LHNI + ++
Sbjct: 254 IILVCCLLHNIVIDLE 269
>gi|331213751|ref|XP_003319557.1| hypothetical protein PGTG_01731 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309298547|gb|EFP75138.1| hypothetical protein PGTG_01731 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 374
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 116/267 (43%), Gaps = 30/267 (11%)
Query: 6 FKEAFRLSATQVEFILEEI---------APALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+A R + T ++LE+I +P P +AL+ L L G G+
Sbjct: 47 FKQAVRTTKTGFTWLLEQIYFNAIFYSNSPRPQLPVPHQLALT-----LERLGSNGNGAS 101
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+ ++ ++ V+ I D R + WP++ D ++ GF G
Sbjct: 102 VGRFSRNLSVGRGTVIKASRRVIRAIND-LSPRYLLWPDKERRDEISKVMKAEGFEGCIG 160
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P+ + F +R +++N IC + A WPGS D+ V +
Sbjct: 161 FVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITAYMTGWPGSCGDSMVFKR 220
Query: 176 SMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKR 229
MV+++ FD P ++ DSAY L IP + P A + F C R
Sbjct: 221 MMVHKEPALFFD-------PGQYLIADSAYELGLHCIPA-YKAPAAYIQENTEFNYCLAR 272
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQ 256
+R E++ GILK ++ L ++RL+ Q
Sbjct: 273 SRVRNEHTIGILKGRWASLQHLRLAIQ 299
>gi|346467053|gb|AEO33371.1| hypothetical protein [Amblyomma maculatum]
Length = 326
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 129/262 (49%), Gaps = 20/262 (7%)
Query: 74 SIHEVVPVIVD---TFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNE 130
++H +V ++D + R + +PE +L+ + F + GFP V GC+ GT + +P
Sbjct: 72 TVHGIVEKLLDYLCSLLPRKICFPE-DLDLLADDFEQLSGFPGVVGCIGGTCINTRSP-A 129
Query: 131 HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPF 190
H + N+ ++ + A+C +F V PG + V S + + W P
Sbjct: 130 HVLKSSNQQSFASVRLQAVCDNKCRFMDVFVGPPGDLESESVFLASPLAEELPY--WCPD 187
Query: 191 PDAVILGDSAYPLKEWLIPPRTRNPDAA--ADQR---FLRCHKRTRRLIENSFGILKEKF 245
+ +LGD YPL+E+L+ P + + A+Q F H++TR I+N+F +L+++F
Sbjct: 188 KNH-LLGDEGYPLREYLLTPYSSGQGSKDLAEQYELVFNERHEKTRAKIDNAFRLLRQRF 246
Query: 246 PCLNYMR-LSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
L+++ ++ ++A +HN+ + E +VE+ + E +G+ V +
Sbjct: 247 KQLHFLEFVTASKMRRFIMACCVVHNMCV------EAGDVELELEWTEANGDHVTVCNES 300
Query: 305 EIDEEEQPRQDRNNPGTQRLAR 326
+ + EE+ ++ + +RLA+
Sbjct: 301 DAEAEEKDERELGSLKRERLAK 322
>gi|345291413|gb|AEN82198.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 96/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 2 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 59
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 60 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNRVWLDGEKNFSMILQAVVDPDMRFLDV 119
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + I+GDS +PL WL+
Sbjct: 120 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKFHLSERTELREYIIGDSGFPLLPWLL 178
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 179 TPYQGKPLSLPQTEFNKRHSEARK 202
>gi|331211585|ref|XP_003307062.1| hypothetical protein PGTG_00012 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 120/281 (42%), Gaps = 11/281 (3%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHI 59
+ P F E FR+S ++ EE+ L R L+ + +V L+ +G G+ Y
Sbjct: 54 LPPVRFLEYFRMSRADFAWLCEELRGTLDQDHLRRGAPLTVEAQVAVGLYRLGHGATYVT 113
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND----IPMTFLRKGGFPNVA 115
I K + ++ V ++ R + +P + +D I +F + G P+V
Sbjct: 114 ISHVFNIGKETADKATGRFVKAVLKILRLRTISFPALDAHDEWDEIKESFESRHGIPDVV 173
Query: 116 GCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DGT + L P + ++NR +L + + F V GS+HD RV R
Sbjct: 174 GAIDGTHIPLSMPPGDRWKGYINRKNWPSLVFQCVVDGDGNFRDVFGGGAGSIHDTRVFR 233
Query: 175 NSMVYRKFDEEGWRPF---PDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
S + + +P A ++GD+ YP ++ P + A ++ F TR
Sbjct: 234 RSDLGVSLNRALGQPSMIPTGAHLIGDAGYPSDVNVLVPYP-SILAPENEYFNYIQSATR 292
Query: 232 RLIENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNI 271
++E +FG LK +F L R +P A A LHN+
Sbjct: 293 IVVEQAFGRLKNRFRILLTAQRANPIRARNTTFACMILHNL 333
>gi|328699971|ref|XP_001952082.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 415
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 135/320 (42%), Gaps = 23/320 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F E +R+S + + +LE++ P +T T+ + +E++ + ++ VG+ + +
Sbjct: 76 PSKFSEYYRMSISSFDELLEKLRPHITKKITKFRRPVCPEERLTITIRYLSVGTNFVALQ 135
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN---LNDIPMTFLRKGGFPNVAGCV 118
+S+I + E V+ +T + + PE N +I F K FPN G V
Sbjct: 136 YEFLLGRSTIGNIVRETCQVLWNTL--QPEEMPEPNPDQWTEIANKFYLKTNFPNCVGAV 193
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG + P F N ++ +MA+ Y F +++ G D+ + +N
Sbjct: 194 DGKHIRCIKPINSGSMFYNYKKYFSIVLMAVVDAEYSFISIDVGAYGKEGDSTIFKNCPF 253
Query: 179 YRKFDEE-----------GWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
+K E FP V++GD A+ L + L+ P + F
Sbjct: 254 GKKLYSELLNLPAPVVLPNTDNFPQPFVLIGDEAFGLHKNLLRPYPGRGLTQKRKIFNYR 313
Query: 227 HKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID----RDNEN 281
R RR +E +FGIL K+ L + + + P FA V+ A LHN D D +
Sbjct: 314 LSRARRYVECAFGILANKWRVLHSAILVEPDFADDVIKACCILHNYVRRRDGYNFEDTLS 373
Query: 282 QNVEIHDDEAEDDGEEEAVD 301
++E +E ++ +D
Sbjct: 374 NSLEDIQNENNSGARQQGLD 393
>gi|449690159|ref|XP_004212260.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 416
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 130/316 (41%), Gaps = 31/316 (9%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
++P P R L+ +K+ ++++ + G + ++ +++ V I
Sbjct: 107 VSPQSNCPNYR--FLTTHKKLAITIYYLKDTGSLWMTANVFGIHQCTVSKTVQVVCDAIN 164
Query: 84 DTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
+ + P+ E++ + F K G GC+DGT V I P ++ +
Sbjct: 165 NIVGPIYLHLPKNKEDMTKLASQFEVKFGMIQAFGCIDGTHVQIKRPIKNGQDYFCYKQY 224
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRPFP 191
+LNV A+C F V WPGSVHDA++ NS + +K + + P
Sbjct: 225 FSLNVQAVCDSKGYFIDVECKWPGSVHDAKMFTNSTINKKLIKGTLPQTLYSLPNYHSIP 284
Query: 192 DAVILGDSAYPLKEWLIP--PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL- 248
+ +I GD AYPL + I N + LR R IE +FG LK ++ L
Sbjct: 285 NYLI-GDPAYPLTNFCIKEFQSCSNNEEVIFNSMLR---SARNQIECAFGRLKARWGFLR 340
Query: 249 NYMRLSPQFAGMVVLASTTLHNIA---LTIDRDNE--NQNVEIHDDEAEDDGEEEAVDDP 303
+ + + +V+ LHN T D + E NQ +E H +A ++ + P
Sbjct: 341 KIIDIKIETVPIVIYTCFVLHNFCEKNKTYDLNEEEVNQQIERHRSDA-----LKSPNLP 395
Query: 304 DEIDEEEQPRQDRNNP 319
D I P + +P
Sbjct: 396 DSIYSANNPEAIKTSP 411
>gi|328702684|ref|XP_003241982.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 246
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 91/201 (45%), Gaps = 19/201 (9%)
Query: 88 QRMVKWPEEN-----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP------NEHEDQ-- 134
+ +KWP N L D LR FP +DG + I P H DQ
Sbjct: 6 HKFIKWPRGNAIHQTLQD--FKHLRPKAFPGAFMALDGCHIKIPTPWNKRTRMRHVDQRC 63
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP-DA 193
++NR ++ + IC N KF + A WPG+ HDAR+ R S + + P +
Sbjct: 64 YINRKHVASVVLQGICDSNQKFTNIFAGWPGASHDARIFRKSNIGVALNGPLQNLVPENC 123
Query: 194 VILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCLNYMR 252
IL DSAYPL + ++ P N ++ F R +R +I+ +FG L E+F L
Sbjct: 124 YILADSAYPLSDNVMVPYHDNGALTPEKHNFNRILNSSRVVIKLTFGKLLERFRYLEVY- 182
Query: 253 LSPQFAGMVVLASTTLHNIAL 273
+ F G + A+ LHNI +
Sbjct: 183 -NKDFCGAFISAACCLHNICM 202
>gi|328699817|ref|XP_003241056.1| PREDICTED: hypothetical protein LOC100574409 [Acyrthosiphon pisum]
Length = 416
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 122/284 (42%), Gaps = 29/284 (10%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+R+S +++ I+PA+ T ++A+E++L L ++G G + I G +
Sbjct: 71 YRMSKESYLYLVNLISPAINKQNTNMRECVNAEERILITLRYLGTGGTFASIAVYFGRGE 130
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S++ + E V VI + + + P E I F PN G ++G V I+
Sbjct: 131 STVGGIVTETVKVIWEVLEKTFMPIPSTEQWKGIANRFETLWNLPNCIGAINGKHVRIEK 190
Query: 128 -PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
PN F N H++ +MA C + F + +PG D + S + G
Sbjct: 191 FPNSGSSNF-NYKSYHSMVLMACCDADGLFTIIETGFPGRNSDGGIFSASAMKYWIQNAG 249
Query: 187 W--------------RPFPDAVILGDSAYPLKEWLIPPRTR----NPDAAADQRFLRCHK 228
PFP + D A+PL ++++ P + N + RF
Sbjct: 250 LNIPPPTPLTYDENDSPFP-YYFVADEAFPLLKYVMRPYPKRILDNVKRICNYRF----S 304
Query: 229 RTRRLIENSFGILKEKFPCLNY-MRL-SPQFAGMVVLASTTLHN 270
R R+ IE +FG+ EKF LN +R+ P+ +V+ A+ LHN
Sbjct: 305 RGRKTIECTFGMACEKFAVLNGPIRIRDPENVNLVIKAACILHN 348
>gi|449691762|ref|XP_004212788.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 229
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 75/145 (51%), Gaps = 4/145 (2%)
Query: 43 KVLSCLHWMGVGSPYHIIG-ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLN 99
+VL L GS +IG +T S+ +I R I V + +K+P + LN
Sbjct: 83 QVLCALRIYATGSFQIVIGDSTAALSQPTISRIIRRVSLSLAKRV-NEYIKYPTDLHVLN 141
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
+ + F FP + G ++GT V I P +H+ VNR +H++NV A+ N KF +
Sbjct: 142 EKRVNFYNVAEFPRITGLIEGTHVCIQKPRKHKYVSVNRLSNHSINVQAVSDYNGKFIDI 201
Query: 160 NANWPGSVHDARVMRNSMVYRKFDE 184
ANW GS H+AR++R S + +K D
Sbjct: 202 FANWAGSTHNARILRESKLSKKNDR 226
>gi|295829881|gb|ADG38609.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 96/204 (47%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + +WP + L++I
Sbjct: 3 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-QWPSK-LDEI 60
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F + G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 61 KSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 120
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + I+GDS +PL WL+
Sbjct: 121 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGXKLHLSERTELREYIIGDSGFPLLPWLL 179
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 180 TPYQGKPLSLPQTEFNKRHSEARK 203
>gi|221110109|ref|XP_002160688.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 118/274 (43%), Gaps = 18/274 (6%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I ++T P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSVTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I + + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEYDQNWNFPQCIGAIDGKHVHIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N +++ ++AIC Y F V+ G +D ++ S R F+ +G
Sbjct: 119 APAKSGFSFYNYKEFYSMVLLAICDAKYCFTMVDIGAYGRDNDVAILNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
+ P V++GD + LK WL+ P + F R RR IENS
Sbjct: 178 YFSLPKISEFDPKVPLVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENS 237
Query: 238 FGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
FGIL ++ Y ++ P ++ A+ LHN
Sbjct: 238 FGILAARWRIYRYPIKAKPFKVEHIIKATVCLHN 271
>gi|295829891|gb|ADG38614.1| AT3G55350-like protein [Neslia paniculata]
Length = 210
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 95/204 (46%), Gaps = 16/204 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
++V L +G G +IG T G ++S++ + V + + + WP + L++I
Sbjct: 3 DRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHL-SWPLK-LDEI 60
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVM--AICGPNYKFYAV 159
F R G PN G +D T ++++ P V +G+ +++ A+ P+ +F V
Sbjct: 61 KSKFERISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDV 120
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVIL-----------GDSAYPLKEWLI 208
A WPGS+ D V++NS Y K E+G R + +L GDS +PL WL+
Sbjct: 121 IAGWPGSLSDDVVLKNSGFY-KLVEKGKRLNGEKFLLSERGELREYIVGDSGFPLLPWLL 179
Query: 209 PPRTRNPDAAADQRFLRCHKRTRR 232
P P + F + H R+
Sbjct: 180 TPYQGKPLSLPQTEFNKRHSEARK 203
>gi|328712569|ref|XP_003244845.1| PREDICTED: hypothetical protein LOC100571710 [Acyrthosiphon pisum]
Length = 415
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 122/284 (42%), Gaps = 29/284 (10%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+R+S +++ I+PA+ T ++A+E++L L ++G G + I G +
Sbjct: 71 YRMSKESYLYLVNLISPAINKQNTNMRECVNAEERILITLRYLGTGGTFASIAVYFGRGE 130
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S++ + E V VI + + + P E I F PN G +DG V I+
Sbjct: 131 STVGGIVTETVKVIWEVLEKTFMPIPSTEQWKGIANRFETLWNLPNCIGAIDGKHVRIEK 190
Query: 128 -PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
PN F N H++ +MA C + F + +PG D + S + G
Sbjct: 191 FPNSGSSNF-NYKSYHSMVLMACCDADGLFTIIETGFPGRNSDGGIFSASAMKYWIQNAG 249
Query: 187 W--------------RPFPDAVILGDSAYPLKEWLI---PPRTR-NPDAAADQRFLRCHK 228
PFP + D A+PL ++++ P R N + RF
Sbjct: 250 LNIPPPTPLTYDENDSPFP-YYFVADEAFPLLKYVMRAYPKRILDNVKRICNYRF----S 304
Query: 229 RTRRLIENSFGILKEKFPCLNY-MRL-SPQFAGMVVLASTTLHN 270
R R+ IE +FG+ EKF LN +R+ P+ +V+ A+ LHN
Sbjct: 305 RGRKTIECTFGMACEKFAVLNGPIRIRDPENVNLVIKAACILHN 348
>gi|331221008|ref|XP_003323179.1| hypothetical protein PGTG_04716 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 471
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 103/217 (47%), Gaps = 14/217 (6%)
Query: 45 LSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPM 103
LS L G G H++G + S S+ + + I++T ++ V WP + ++
Sbjct: 186 LSNLGLYGNGGSPHVLGQYYNISPGSVENYTNRCMFAIIETLEKKHVYWPNTQQRANVTA 245
Query: 104 TFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF-YAVNA 161
T K F G DGT+ L AP +H++ + R +A+N + +C + YAV+
Sbjct: 246 TLAGKTVFDGCIGFADGTIFPLASAPTKHKEDYWMRKMVYAVNSLIVCNWQRRIIYAVHG 305
Query: 162 NWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA--VILGDSAYPLKEWLIPPRTRNPDAAA 219
W GS HD RV +N +YR R F A +L DSAY + +IP R+P +
Sbjct: 306 -WCGSAHDQRVYKNCQLYRHP-----RGFFSAGEYLLADSAYTSTDTIIPAFKRSPGRSL 359
Query: 220 ---DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
Q+F R ++E++ G+LK ++ L + +
Sbjct: 360 PPEKQQFNYELSHNRVVVEHTIGMLKNRWQSLKSLSI 396
>gi|62734261|gb|AAX96370.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77549127|gb|ABA91924.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 676
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 104/238 (43%), Gaps = 41/238 (17%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 449 NCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 508
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
++ +F + +P P L DS YP + +L P PR R
Sbjct: 509 FNDAQT--RFSAKFPKPPPGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGRR- 565
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F CH R +IE SFG+LK K+ L + S + ++ A LH
Sbjct: 566 -----EHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHYFI-- 618
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
RD++ + E D + D ++ D + P + P + +R++N FR
Sbjct: 619 --RDSQMADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 663
>gi|290562609|gb|ADD38700.1| nuclease HARBI1 [Lepeophtheirus salmonis]
Length = 439
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 107/249 (42%), Gaps = 17/249 (6%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
FR++ + +E+ P L + E + +++ L+ + G G S++
Sbjct: 108 FRMTQMTFRILTDELRPYLISSDETAIIKDPAKQMALTLYTLTQNISMMESGLLFGVSQN 167
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S+ + +V I + +KWP + + ++ F R+ GFP G +D +
Sbjct: 168 SVVNIVKKVCKAISLHLGPKYLKWPSDQNEVEELVRCFEREHGFPQCIGAIDNFFIQTRR 227
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE--- 184
N +F+N G ++LN+ AIC Y F V A+ PGSVH A + NS + + F
Sbjct: 228 SN---SKFLNSLGFYSLNIQAICDYKYCFMDVMADSPGSVHVAEIFFNSSINKSFMSGNV 284
Query: 185 --------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
RP P +++GDS YPL L+ ++ F ++ + +I
Sbjct: 285 SSFSRKLIPNTRPVP-ILLIGDSCYPLMPHLMIEYPLGGSNEKEKHFGSILRKAKEVIRC 343
Query: 237 SFGILKEKF 245
+ LK +F
Sbjct: 344 ALYRLKARF 352
>gi|432877870|ref|XP_004073237.1| PREDICTED: uncharacterized protein LOC101165567 [Oryzias latipes]
Length = 399
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 118/282 (41%), Gaps = 16/282 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FRLS Q + +L I +T T ++SA E++ CL ++ G Y I ++
Sbjct: 59 FQWYFRLSTAQFDDLLSRIGARITHQDTNYRRSISAAERLSVCLRYLATGDSYRTIATSY 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S++ I EV+ I D + + P E+ I F ++ FPN G VDG V
Sbjct: 119 RVGVSTVAIIIPEVLTAIWDGLVEDFMAVPSAEDWRSIAEEFHQRWNFPNCCGAVDGKHV 178
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
++ P Q N G ++ ++A+ +F ++ G D ++ NS + +
Sbjct: 179 ILKVPPNLGSQLHNCKGPFSVVLLAVVDALGRFRVIDVGDYGRTSDGGILANSTLGQALR 238
Query: 184 EEGW-----RPFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
P P A V + D A+PL+ ++ P + F R
Sbjct: 239 CGTLNLPPDHPLPGAEQRGPQPHVFVADEAFPLRRNMMRPFPGRALTPERRIFNYRLSRA 298
Query: 231 RRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNI 271
R ++E++FG+L ++ + + P+ V + LHN
Sbjct: 299 RLVVEDTFGLLFAQWRLFRRLVDVRPEVLEKFVKTTCLLHNF 340
>gi|410898682|ref|XP_003962826.1| PREDICTED: putative nuclease HARBI1-like [Takifugu rubripes]
Length = 435
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 125/295 (42%), Gaps = 23/295 (7%)
Query: 1 MTPFA---FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSP 56
MT F + + FR+S ++ +++ P LT T +AL +++V L +
Sbjct: 87 MTEFQPPDWLDKFRMSRETFFYLCDKLRPRLTRQDTTFRLALPVEKRVAVALWRLASNVE 146
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVA 115
Y I A G KS++CR + ++ IV ++ P + L+D L GFP+
Sbjct: 147 YRTISALFGVGKSTVCRCVRDMCHAIVALLSSIYLRPPSGQELHDSAQHCLSSWGFPHCV 206
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+ I P+ + + N G ++ + + +F+ V A++PG A +++N
Sbjct: 207 AAIATLHTAIITPSNNASDYANPAGWLSVLSQVVVNGSGQFWDVCASFPGGTDPADILQN 266
Query: 176 SMVYRKFDEEGWRPFP---------DAVILGDSAYPLKEWLIPP-------RTRNPDAAA 219
S ++ E G P P VILG++ YPL+ WL+ R R
Sbjct: 267 SSLWATAAEGGLSPSPLPMFTGRPLRYVILGEACYPLQSWLMKAYPEEGGRRGRKATLTE 326
Query: 220 DQ-RFLRCHKRTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIA 272
Q F + +R R E + L+ ++ CL+ ++LA LHN+
Sbjct: 327 PQCLFNQQLRRALRAPEETLLRLRARWQCLSKRNDCGLDVVPTMILACCILHNMC 381
>gi|356569920|ref|XP_003553142.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 379
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 109/257 (42%), Gaps = 15/257 (5%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG----GFPNVAGCVDGTM 122
S +I R H V+ + F + ++K + + D L+ F + G +DGT
Sbjct: 125 SGKTISRHFHNVLEAVC-MFAKDIIKPVDPSFRDTLHEILKDARYRPYFRDCIGAIDGTH 183
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P+ + ++ R G NVMA+C + F V W GS HD ++ ++
Sbjct: 184 IRVCVPSHLQGVYIGRKGYTTTNVMAVCDFSMCFTFVWTGWEGSAHDTKIFMEALRKYYL 243
Query: 183 DEEGWRPFPDAVILG---DSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
+ G+ F LG + Y L ++ I PR R + F H R +IE +FG
Sbjct: 244 VDSGYPTFMG--FLGPYKKTRYHLSQFRIGPRIR----GRVEVFNYYHSSLRSIIERAFG 297
Query: 240 ILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
+ K ++ L M + + +++A +HN D+ +E + D+E D E+E
Sbjct: 298 LCKARWKILGNMSPFALKIQNQIIVACMAIHNFIQRNDKSDEEFDSLDEDNEYIDSDEDE 357
Query: 299 AVDDPDEIDEEEQPRQD 315
+ P EE Q
Sbjct: 358 SEVGPSTTTWEESYAQS 374
>gi|301631363|ref|XP_002944767.1| PREDICTED: putative nuclease HARBI1-like, partial [Xenopus
(Silurana) tropicalis]
Length = 274
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 90/198 (45%), Gaps = 11/198 (5%)
Query: 85 TFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDH 142
TFF+ +++PE E I F FPN G +DG + I+ P F N G
Sbjct: 1 TFFKFFLQFPETSEEWQVIAQQFNDYWNFPNCGGAIDGKHIRINPPPNSGSYFFNYKGFF 60
Query: 143 ALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA--------V 194
++ ++AI NY+F V+ G + D V+ + +K + +A V
Sbjct: 61 SIVLLAIVNANYEFIMVDIGKNGRLSDGGVIEQTHFNQKLKSKQLNLPTNAETKEGLNFV 120
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRL 253
+GD A+ L E L+ P + + F R RR++EN+FGIL +F + + L
Sbjct: 121 FVGDEAFGLHENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGILANRFRIFHTSINL 180
Query: 254 SPQFAGMVVLASTTLHNI 271
P+ MVVL+ LHN
Sbjct: 181 CPEKIDMVVLSCCVLHNF 198
>gi|116317822|emb|CAH65857.1| OSIGBa0126J24.2 [Oryza sativa Indica Group]
Length = 836
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 66/238 (27%), Positives = 106/238 (44%), Gaps = 41/238 (17%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 609 NCIGAIDGTHIQVVVPNSAAVQHRNRHQEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 668
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
+ ++ +F + +P L DS YP + +L P PR R
Sbjct: 669 LNDAQT--RFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYKGITYQFQEYNESTLPRGRR- 725
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F CH R +IE SFG+LK K+ L + S + ++ A LHN
Sbjct: 726 -----EHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI-- 778
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
RD++ + E ++ + + ++ D + P + P + +R++N FR
Sbjct: 779 --RDSQMADTEFNNCDHD--------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 823
>gi|449671109|ref|XP_004207429.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 382
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 114/288 (39%), Gaps = 26/288 (9%)
Query: 5 AFKEAFRLSATQ----VEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
A+K+ FRL V + I+P +P R A++ +KV L+++ ++
Sbjct: 63 AYKKNFRLDKNSFFELVSLLEPYISPDSYSPNIR--AITPDKKVAITLYYLKDSGTLNMT 120
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCV 118
T G + + I EV IV R V P+ + + + F K G GCV
Sbjct: 121 ANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCV 180
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DGT + I P H + + L V A+C V WPGSVHDA+V NS +
Sbjct: 181 DGTHIPIVCPTNHSQDYFCYKQYYLLQVQAVCDYKGSVLDVECMWPGSVHDAKVFSNSSI 240
Query: 179 YRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRC 226
+ ++GD AYP L+P T+ D+
Sbjct: 241 NTNLRSSRLPGTFQTITKNKIKVPCYLIGDPAYP----LLPHCTKEYSTCKKNDEVIFNS 296
Query: 227 HKRT-RRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIA 272
RT R IE +FG LK ++ L M L + V+ A LHN
Sbjct: 297 MLRTARNPIECAFGRLKARWKILTKKMDLKLEKIPTVIYACFILHNFC 344
>gi|38569143|emb|CAE05672.3| OSJNBb0033P05.11 [Oryza sativa Japonica Group]
Length = 877
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 104/238 (43%), Gaps = 41/238 (17%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 650 NCIGAIDGTHIQVVVPNSAAVQHRNRHQEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 709
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
++ +F + +P L DS YP + +L P PR R
Sbjct: 710 FNDAQT--RFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYKGITYHFHEYNESTLPRGRR- 766
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F CH R +IE SFG+LK K+ L + S + ++ A LHN
Sbjct: 767 -----EHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI-- 819
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
RD++ + E D + D ++ D + P + P + +R++N FR
Sbjct: 820 --RDSQMADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNELDSRVMNQFR 864
>gi|389609307|dbj|BAM18265.1| unknown unsecreted protein [Papilio xuthus]
Length = 384
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 116/283 (40%), Gaps = 14/283 (4%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
F+ ++R+S + I P + T ++S++E++ CL ++ G + +G
Sbjct: 73 TFENSYRVSRCTFYELHSLIEPYIRKQDTNYRNSISSRERLAVCLKYLATGQSFTTMGEN 132
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTM 122
S+ R + EV + + ++ P EN +I F F N G +DG
Sbjct: 133 FRIGLKSVSRIVEEVCDALWNILQPLVMSQPTENDWKEIAKDFDELWQFKNCIGALDGKH 192
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I AP++ F N ++ +M + K + G DA + NS+ +
Sbjct: 193 VYIKAPSKTGSSFFNYKKRFSVVLMCLADAKRKIIMADVGSMGRFSDAGIFDNSIFGKSL 252
Query: 183 DEEGWR-----PFPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
E+ PF V +GD A+PL E + P R+ A + F R R
Sbjct: 253 KEKRLNLPQPVPFYQGGAKMPFVFIGDEAFPLMENFMRPYPRDGLNAEKKIFNYRLSRAR 312
Query: 232 RLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
R++E +FG+L K+ + + V+ A+ LHN +
Sbjct: 313 RIVEATFGVLTRKWYVYHKDFECKIETVDKVIKATCVLHNYLI 355
>gi|331232448|ref|XP_003328886.1| hypothetical protein PGTG_10187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 377
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 106/221 (47%), Gaps = 37/221 (16%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESEQIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD++ + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTND 274
Query: 205 EWLIPPRTRNPDAAADQRFLRCHK--------RTRRLIENSFGILKEKFPCL----NYMR 252
+++P A ++ L+C ++R IE++ GILK +F L +R
Sbjct: 275 CYVVP-------AFKGKQLLKCCNINFNYHLAQSRVRIEHAIGILKGRFASLREIQTQIR 327
Query: 253 LSPQFAGMV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ + G V +L LHN+ D ++Q ++++DEA
Sbjct: 328 NTEEMKGAVKWILTCIVLHNLL----ADLKDQWNDLYEDEA 364
>gi|290462281|gb|ADD24188.1| nuclease HARBI1 [Lepeophtheirus salmonis]
Length = 439
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 106/249 (42%), Gaps = 17/249 (6%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
FR++ + +E+ P L + E + +++ L+ + G G S++
Sbjct: 108 FRMTQMTFRILTDELRPYLISSNETAIIKDPAKQMALTLYTLTQNISMMESGLLFGVSQN 167
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S+ + +V I +KWP + + ++ F R+ GFP G +D +
Sbjct: 168 SVVNIVKKVCKAISLHLGPNYLKWPSDQNEVEELVRCFEREHGFPQCIGAIDNFFIQTRR 227
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE--- 184
N +F+N G ++LN+ AIC Y F V A+ PGSVH A + NS + + F
Sbjct: 228 SN---SKFLNSLGFYSLNIQAICDYKYCFMDVMADSPGSVHVAEIFFNSSINKSFMSGNV 284
Query: 185 --------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
RP P +++GDS YPL L+ ++ F ++ + +I
Sbjct: 285 SSFSRKLIPNTRPVP-ILLIGDSCYPLMPHLMIEYPLGGSNEKEKHFGSILRKAKEVIRC 343
Query: 237 SFGILKEKF 245
+ LK +F
Sbjct: 344 ALYRLKARF 352
>gi|301627131|ref|XP_002942732.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 352
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 83/177 (46%), Gaps = 7/177 (3%)
Query: 103 MTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNAN 162
+ FL G F G +D T V + P H+++++NR H+++V +C + + +V
Sbjct: 100 LHFLGSGSFQQCLGAIDCTHVPLTPPRAHQERYLNRKRSHSISVQVVCDSHLRIMSVXXX 159
Query: 163 WPGSVHDARVMRNSMVYRKFDEEGWRPFPD--AVILGDSAYPLKEWLIPPRTRNPDAAAD 220
+++ + +LGD+ Y + WL+ P R P A
Sbjct: 160 XXXXXXXXXXXXXXYWCNRWELLNGSASTHNFSFLLGDAGYGVLPWLMTP-VRFPRTPAQ 218
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLNY----MRLSPQFAGMVVLASTTLHNIAL 273
+R+ R H++TR +IE FG+LK +F CL+ + SP +++ LHN+A+
Sbjct: 219 RRYNRAHRKTRNVIERLFGVLKLRFRCLSVTGGALLYSPIKVSEIIVVCAMLHNVAM 275
>gi|449517792|ref|XP_004165928.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Cucumis sativus]
Length = 388
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 127/301 (42%), Gaps = 24/301 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAK-----EKVLSCLHWMGVGSPYHII 60
FK F++S ++I + +T + L+ + ++V L +G G I
Sbjct: 56 FKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTI 115
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + G + S++ + V + + + WP E + + F + G PN G +
Sbjct: 116 GDSLGLNHSTVSQVTWRFVESMEERGLHHL-HWPSNEVEMAQVKSKFEKIQGLPNCCGSI 174
Query: 119 DGTMVLIDAPNEHEDQFV--NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T + + P +V + +H++ + I +F + PG + D V ++S
Sbjct: 175 DTTHITMCLPASDPTSYVWLDXQKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSS 234
Query: 177 MVYRKFDEEGWR------PFPD-----AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K ++G R PD I+GDS YPL +L+ P + + F +
Sbjct: 235 N-FHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNK 293
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
HK TR +++ + +LKE++ + + P + ++L LHNI + I + E
Sbjct: 294 RHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGE 353
Query: 284 V 284
V
Sbjct: 354 V 354
>gi|322785758|gb|EFZ12385.1| hypothetical protein SINV_02385 [Solenopsis invicta]
Length = 301
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 109/258 (42%), Gaps = 14/258 (5%)
Query: 50 WMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRK 108
++ G I H +S+ + I E V V +K P+ E+ +I + F
Sbjct: 1 FLATGDQVSSIAFAHRIGESTAYKVIKETCVVTVRILSSIYLKPPKKEDWKNIAIGFWNH 60
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
FPN G +DG LI AP+ + N + ++ ++A C YKF V+ GS
Sbjct: 61 WNFPNCLGAIDGKHFLIKAPSNSGTLYFNYKKNFSIVLLAACDYQYKFTIVDCGAYGSSS 120
Query: 169 DARVMRNSMVYRKFD--------EEGWRPFPDA----VILGDSAYPLKEWLIPPRTRNPD 216
D + S + + E P D + D A+PL + ++ P
Sbjct: 121 DGGIFAQSEFGKCLNSDNLDIPVENCKLPLTDVEMPYYFVADEAFPLSKRIMRPYPGQFL 180
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIALTI 275
F R RR+IEN+FGIL ++ + L P+ A ++++A+ LHN +T
Sbjct: 181 TDKKSIFNYRLSRARRIIENTFGILVSRWRLFQRCICLDPRHADVIIMAAINLHNYLMTE 240
Query: 276 DRDNENQNVEIHDDEAED 293
+ + + ++ ++ED
Sbjct: 241 NNTSTSSYCPVNYVDSED 258
>gi|189442523|gb|AAI67676.1| LOC100170595 protein [Xenopus (Silurana) tropicalis]
Length = 328
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 116/275 (42%), Gaps = 16/275 (5%)
Query: 12 LSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S T + +L + P L+ T T +S +E++ L ++ G + + +++
Sbjct: 1 MSTTSFDELLTILKPGLSRTQTLMRDPISPEERLCLTLRFLATGQSFSSLYIQFHIGRTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN 129
I + + E ++ + ++ P+EN I F K FPN AG + G + + P
Sbjct: 61 IGKIVRETCLLLWSELQRIVMPSPDENKWMQIAEDFHEKVNFPNCAGALGGRHIRVRMPA 120
Query: 130 EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR- 188
++ +N L +A NY F ++ S +A +NS + R+ E
Sbjct: 121 NRGSKYYKKNSSVVL--LAAADVNYCFSVIDVGSYHSTGNASAFQNSELGRQLSERTLHL 178
Query: 189 -----------PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
P V LGD A+ L E +I P + + + F R +R++E +
Sbjct: 179 PLPKPLPGTVVPNMPYVFLGDEAFGLAENVIQPYPGSQKSVQKRVFNYRFSRAQRILECA 238
Query: 238 FGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
FGIL K+ L+ +L P F V+ A LHN
Sbjct: 239 FGILSNKWHVLHTAKLKPDFVSTVIKACCVLHNFV 273
>gi|449666033|ref|XP_004206266.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 168
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 77/148 (52%), Gaps = 7/148 (4%)
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWL 207
A+C N KF V A PGS HDAR++R S EE ++ G+S YP WL
Sbjct: 6 AVCDYNEKFIDVAAKGPGSTHDARILRES----NLGEEIIDGTLKGLLFGNSRYPCFRWL 61
Query: 208 IPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLAST 266
+ P +P A R+ KRTR LIE FG K +F L+ +R++P+ ++ A
Sbjct: 62 LTPYL-SPTTTAQCRYNISLKRTRVLIEQVFGCWKRRFHLLHGKVRMTPERTCTIIAACA 120
Query: 267 TLHNIALTI-DRDNENQNVEIHDDEAED 293
L N+A+ + D D ++ +E +++ ED
Sbjct: 121 VLQNLAIELNDMDIDDNPIENYENNNED 148
>gi|449462621|ref|XP_004149039.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 388
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 127/301 (42%), Gaps = 24/301 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAK-----EKVLSCLHWMGVGSPYHII 60
FK F++S ++I + +T + L+ + ++V L +G G I
Sbjct: 56 FKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTI 115
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + G + S++ + V + + + WP E + + F + G PN G +
Sbjct: 116 GDSLGLNHSTVSQVTWRFVESMEERGLHHL-HWPSNEVEMAQVKSKFEKIQGLPNCCGSI 174
Query: 119 DGTMVLIDAPNEHEDQFV--NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T + + P +V + +H++ + I +F + PG + D V ++S
Sbjct: 175 DTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSS 234
Query: 177 MVYRKFDEEGWR------PFPD-----AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K ++G R PD I+GDS YPL +L+ P + + F +
Sbjct: 235 N-FHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNK 293
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
HK TR +++ + +LKE++ + + P + ++L LHNI + I + E
Sbjct: 294 RHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGE 353
Query: 284 V 284
V
Sbjct: 354 V 354
>gi|449686978|ref|XP_004211313.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 388
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 116/268 (43%), Gaps = 15/268 (5%)
Query: 16 QVEFILEEIAPALTTPTERNVALSAKEKVLSCLHW-MGVGSPYHIIGAT-HGPSKSSICR 73
QV F L+EI A+ + + S K + L W + P H G T ++++C
Sbjct: 33 QVLF-LDEIPEAMFIDSLKPFNTSFKNGAIP-LDWKVASVIPIHKKGPTVDRVGRATVCH 90
Query: 74 SIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEH 131
I E I + ++ P + +I F + FP G +DG V I+AP +
Sbjct: 91 IIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPAKS 150
Query: 132 EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE------- 184
F N G +++ ++AIC Y F V+ G +DA ++ S R F++
Sbjct: 151 GSSFYNYKGFNSMVLLAICDAKYCFTMVDIRAYGRDNDAAILNASTFGRTFNKGYFNLLK 210
Query: 185 -EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
+ P +V++GD + LK WL+ P + F R R+ IENSFGIL
Sbjct: 211 ISEFDPKVPSVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARKTIENSFGILAA 270
Query: 244 KFPCLNY-MRLSPQFAGMVVLASTTLHN 270
++ ++ P ++ A+ LHN
Sbjct: 271 RWRIYRSPIKAKPLKVEHIIKATVCLHN 298
>gi|241640721|ref|XP_002410899.1| transposase, putative [Ixodes scapularis]
gi|215503597|gb|EEC13091.1| transposase, putative [Ixodes scapularis]
Length = 459
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 105/225 (46%), Gaps = 26/225 (11%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G + ++ R + E +V R V+ P +E+L DI F + FP AG + V
Sbjct: 190 GVTGDTVARIVREFCAAVVKVLMPRFVQVPYDEDLEDIAEQFRDRTKFPRCAGALCFAHV 249
Query: 124 LIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P + E + + N G +++ V A+ G + +F+ +N WPGS A+V+R+S +++K
Sbjct: 250 PILPPQDDEGNDYTNPEGWNSIVVQAVVGADRRFWDLNIGWPGSTQVAQVLRSSSLWKKG 309
Query: 183 DEE-----------GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+E G R V+ + YPL+ W++ P + RF R
Sbjct: 310 EEGTLFAAKVEEICGIR--VKQVLAAGTGYPLRRWIVTPYLMPNE--CQLRFNRSLSEVL 365
Query: 232 RLIENSFGILKEKFPCL-----NYMRLSPQFAGMVVLASTTLHNI 271
+ E +F L+ +FP L + + L P +V A +TLHN+
Sbjct: 366 SVGEAAFRRLEARFPFLLRHNGSGIDLMPS----IVAACSTLHNM 406
>gi|331248209|ref|XP_003336729.1| hypothetical protein PGTG_17984 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/233 (28%), Positives = 113/233 (48%), Gaps = 36/233 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD++ + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTND 274
Query: 205 EWLIPP-------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---- 253
++IP + RN D F ++R IE++ GILK +F L ++
Sbjct: 275 CYVIPAFKGKHLLKHRNID------FNYHLAQSRVRIEHAIGILKGRFASLREIQTQIHN 328
Query: 254 SPQFAGMV--VLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
+ + G V ++ S LHN+ + +D N ++++DE +D + ++ DD D
Sbjct: 329 AEEMKGAVKWIVTSIVLHNLLADL-KDKWN---DLYEDEVPEDNKSDS-DDED 376
>gi|147861273|emb|CAN81903.1| hypothetical protein VITISV_041568 [Vitis vinifera]
Length = 599
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 112/251 (44%), Gaps = 30/251 (11%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S +I + HEV+ +V+ F + M+ P N N I LR+ F +V G +DGT++
Sbjct: 323 SSQTIHKYFHEVLVAMVN-FSKEMITPPSFNDSSNGISNRRLRQI-FKDVVGAIDGTLIH 380
Query: 125 IDAPNEHEDQFVNRN-GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
P + + R G+ NVMA+C + F V W G+ HD+RV+ ++ R
Sbjct: 381 ACIPTNQQVPYRGRGRGECFQNVMAVCDFDMIFRFVVVGWEGTAHDSRVLTETI--RNPQ 438
Query: 184 EEGWRPFPDAVILGDSAYP-----------LKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
P + L D+AY + WL R+ ++ F +CH R R
Sbjct: 439 HNFPMPPSEKYYLVDAAYTHTQGFMTPYRNVHYWLSDFRSGGKAVGKEEIFNQCHARLRN 498
Query: 233 LIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI---DR-----DNENQN 283
+IE +FG++K FP L M S +V+ ++HN I DR DNE
Sbjct: 499 VIERAFGVVKACFPILKRMAPYSFTTQTKIVMTCFSIHNFLQQISVADRLFSEYDNE--- 555
Query: 284 VEIHDDEAEDD 294
VE+ D A +
Sbjct: 556 VELESDNANQN 566
>gi|331251807|ref|XP_003338490.1| hypothetical protein PGTG_19974 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 387
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 108/249 (43%), Gaps = 22/249 (8%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--- 94
LS + +V L+ + GS Y IG K + ++ V ++ F + +++P
Sbjct: 89 LSVEAQVAVGLYQLAHGSSYLTIGHVFNIGKETADKAAGRFVIAVLQQFRRVAIRYPALA 148
Query: 95 -EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGP 152
+E ++I +F K G PNV G +DGT + + P E ++NR ++ +
Sbjct: 149 NQEQWDEISASFEAKHGIPNVVGTIDGTHIPLAVPAEDRWKDYINRKSWASIVFQCVVDG 208
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRP--FP-DAVILGDSAYP--LKEWL 207
N F V+ PGS+HD R+ S + P P ++GD+ YP ++ L
Sbjct: 209 NGNFRNVSGGAPGSMHDGRLFWRSELGHSITPGSMAPAMIPMGTYLIGDAGYPSNVRVLL 268
Query: 208 IPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV-----V 262
P P+ ++ F TR ++E +FG LK +F L L PQ A V
Sbjct: 269 PYPSIATPE---NEHFNFIQSSTRIIVEQAFGRLKNRFRIL----LHPQMARPVRARNNA 321
Query: 263 LASTTLHNI 271
LHN+
Sbjct: 322 FVCMILHNL 330
>gi|449686160|ref|XP_004211088.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 381
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 117/290 (40%), Gaps = 30/290 (10%)
Query: 5 AFKEAFRL---SATQVEFILE-EIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
A+K+ FRL S ++ ++LE I+P +P R A++ + V L+++ ++
Sbjct: 63 AYKKNFRLNKNSFFELVYLLEPYISPDSYSPNIR--AITPDKNVAITLYYLKDSGTLNMT 120
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCV 118
T G + + I EV IV R V P+ + + + F K G GCV
Sbjct: 121 ANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCV 180
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DGT + I P H + ++L V A+C F V WPG VHDA+V NS +
Sbjct: 181 DGTHIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVKCMWPGGVHDAKVFSNSSI 240
Query: 179 YRKFDEEGWRPFPDAV----------ILGDSAYPL-----KEWLIPPRTRNPDAAADQRF 223
+ ++GD AYPL KE+ + D
Sbjct: 241 NTNLRSSRLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCMKEY---STCKKNDEVIFNSM 297
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIA 272
LR R IE +FG L ++ L M L + V+ A LHN
Sbjct: 298 LR---TARNPIECAFGRLNARWKILTKKMDLKLEKIPTVIYACFILHNFC 344
>gi|357162176|ref|XP_003579328.1| PREDICTED: uncharacterized protein LOC100834753 [Brachypodium
distachyon]
Length = 611
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 75/172 (43%), Gaps = 16/172 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + P E + NR G + NV+A+C + +F WPGSVHD
Sbjct: 391 FKDAIGAIDGTHIPVIVPAELKFIHTNRKGYTSQNVLAMCDFDMRFIFAVPGWPGSVHDT 450
Query: 171 RVMRNSMV-YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN----------PDAAA 219
RV ++ Y F P L DS YP + + P P
Sbjct: 451 RVWSDARAEYDTFPH----PPQGKYYLVDSGYPNRVGYLAPYKGQRYHVLEFENAPPVGM 506
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA-GMVVLASTTLHN 270
+ F CH R +IE +FG+LK K+P L + P M+V A LHN
Sbjct: 507 QEMFNHCHSSLRNVIERAFGVLKRKWPILQGIPAYPVLKQKMIVSACMCLHN 558
>gi|77556298|gb|ABA99094.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 869
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 103/238 (43%), Gaps = 41/238 (17%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 648 NCIGAIDGTHIQVVVPNSAAVQHRNRHEEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 707
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
++ +F + +P L DS YP + +L P PR R
Sbjct: 708 FNDAQT--RFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGRR- 764
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F CH R +IE SFG+LK K+ L + S + ++ A LHN
Sbjct: 765 -----EHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSYSQEKQSRIIHACIALHNFI-- 817
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYFR 332
RD++ + E D + D ++ D + P + P +R++N FR
Sbjct: 818 --RDSQMADTEF--DNCDHD------ENYDPLGGTSSPSSE---PTNDLDSRVMNQFR 862
>gi|449679552|ref|XP_002158813.2| PREDICTED: uncharacterized protein LOC100207042 [Hydra
magnipapillata]
Length = 381
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 124/300 (41%), Gaps = 35/300 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M F ++FR+S +E +L +AP ++ + R +A++++ L ++ G I
Sbjct: 69 MNSVYFFQSFRMSPKTLEVLLSWVAPFISKSSLRRAVATAEKRLCIALRYLVTGDAQITI 128
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
T +++ R +I++T+ K P ++ +I F + FP G +
Sbjct: 129 AITFRMRPTTVVR-------IIIETY-----KTPSSQKEWLEISSEFYERWNFPYCLGAI 176
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG ++I +P + N ++ ++A C Y+F V+ G D + NS V
Sbjct: 177 DGKHLIIQSPARSGSMYFNYKKTFSIVLLATCNAKYEFTLVDVGGSGRQSDGGIYNNSKV 236
Query: 179 YRKFDEEGWRPFPD--------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFL 224
E FPD L D + K ++ P R + D+ +
Sbjct: 237 GSAI-ENNLLNFPDPSFISGYSKSITISYTFLADETFATKPHMMRPYPRRTN--LDKTEI 293
Query: 225 RCHKRT---RRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIALTIDRDNE 280
+ R RR+IENSFG+L +F +S + +VV + LHN + I + +
Sbjct: 294 VFNYRLSGGRRVIENSFGVLASRFRIFRRPIVSKVENVKIVVKVTVALHNFLMKIQKKTD 353
>gi|331225553|ref|XP_003325447.1| hypothetical protein PGTG_07280 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 119/281 (42%), Gaps = 11/281 (3%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHI 59
+ P F E FR+S ++ EE+ L R L+ + +V L+ +G G+ Y
Sbjct: 54 LPPVRFLEYFRMSRADFAWLCEELRGTLDQDHLRRGAPLTVEAQVAVGLYRLGHGATYVT 113
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND----IPMTFLRKGGFPNVA 115
I K + ++ V ++ R + +P + +D I +F + G P+V
Sbjct: 114 ISHVFNIGKETADKATGRFVKAVLKILRLRTISFPALDAHDEWDEIKESFESRHGIPDVV 173
Query: 116 GCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DGT + L P + ++NR +L + + F V GS+HD RV R
Sbjct: 174 GAIDGTHIPLSMPPGDRWKGYINRKNWPSLVFQCVVDGDGNFRDVFGGGAGSIHDTRVFR 233
Query: 175 NSMVYRKFDEEGWRP--FPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
S + + +P P ++GD+ YP ++ P + A ++ F TR
Sbjct: 234 RSDLGVSLNRALGQPSMIPTGSHLIGDAGYPSDVNVLVPYP-SILAPENEYFNYIQSATR 292
Query: 232 RLIENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNI 271
++E +FG LK +F L R +P A LHN+
Sbjct: 293 IVVEQAFGRLKNRFRILLTAQRANPIRARNTTFVCMILHNL 333
>gi|356558280|ref|XP_003547435.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 395
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 140/333 (42%), Gaps = 29/333 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-----ALSAKEKVLSCLHWMGVGSPYHII 60
F+ F++S E+I + + V +S E+V L +G G I
Sbjct: 59 FESVFKISRKTFEYICSLVKDDMMMKAAHFVFTNRKPISLCEQVAVALRRLGSGESLVTI 118
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + G + S++ + V + + + +WP E +N I F + G PN G +
Sbjct: 119 GDSFGLNHSTVSQVTWRFVESMEERGLHHL-RWPSTEVEMNAIKSKFEKIHGLPNCCGVI 177
Query: 119 DGTMVLIDAPNEHE--DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T + + P + +++ +H++ + AI P+ +F + WPG + D + +S
Sbjct: 178 DATHITMCLPASEPSCNVWLDHKKNHSMVLQAIVDPDMRFRDIVTGWPGKMEDWMIFESS 237
Query: 177 MVYRKFDEEGWRPFPDAV-----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E+G R + I+GDS Y L +L+ P + +F
Sbjct: 238 N-FNKLCEKGERLNGSILQLSEGSEIREYIIGDSGYRLLPYLVVPYEGKELSEPKAQFNN 296
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQN 283
H TR + + + LK+ + ++ M P + ++L LHNI ID ++E Q+
Sbjct: 297 QHLATRMVAQRALARLKDMWRIIHGMMWRPDKHRLPRIILVCCVLHNI--VIDLNDEVQD 354
Query: 284 VEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDR 316
E+ D G + V +DE+ +DR
Sbjct: 355 -ELSLSLDHDSGYHQLV--CGAVDEKGVALRDR 384
>gi|251823947|ref|NP_001156537.1| uncharacterized protein LOC100302406 [Acyrthosiphon pisum]
gi|239791514|dbj|BAH72212.1| hypothetical protein [Acyrthosiphon pisum]
Length = 385
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 123/290 (42%), Gaps = 6/290 (2%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
+++ FR+ + ++L +I A+ T +A+ A+ K+ L ++ G + +
Sbjct: 37 SYRNFFRMDPSLHLYLLSKIKVAIQKEDTNMRLAIPAECKLNVTLRFLATGDSFSSLQYL 96
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGT 121
K++I I EV+ I +K P E +I F K F G +DG
Sbjct: 97 FRIPKNTISTFIPEVLDAIYSALLD-FLKVPNSVEEWKEIEKGFNEKWNFSGCIGAIDGK 155
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V I AP ++ N ++ +MA+ NY F ++ G D V + +
Sbjct: 156 HVAIRAPMFSGSEYYNYKNSFSIILMAVVDANYCFRYIDIGPQGRHSDGGVFDHCSLRHM 215
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
++ + V +GD A+PLK +L+ P R + F R RR +EN+FGIL
Sbjct: 216 IEQNKLHIPENFVFVGDEAFPLKSYLMRPYPRRELNTDCKIFNYRLSRARRTVENAFGIL 275
Query: 242 KEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
+F + S A +V + LHN L RD+ ++ + D E
Sbjct: 276 VSRFRVFEKPIATSVPTAVKIVKTACALHN-WLQTRRDSNYFSLGLIDKE 324
>gi|156553248|ref|XP_001599841.1| PREDICTED: hypothetical protein LOC100115003 [Nasonia vitripennis]
Length = 319
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 136/313 (43%), Gaps = 45/313 (14%)
Query: 12 LSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSI 71
+++ Q + ILE + P LT ++R AL+ + ++ + L+++ G+ +S++
Sbjct: 1 MTSEQFDEILELVRPQLTKRSKRR-ALTPEMRLAAVLNFLAHGNSIQKSAWMFLIGRSTM 59
Query: 72 CRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEH 131
R + EV I + +++ W PN G +DG + + AP
Sbjct: 60 YRLVIEVCTAICNVLEEKI--W----------------HMPNCFGAIDGKHIRVKAPPNS 101
Query: 132 EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE------ 185
F N ++ +M + +F VN GS +DA + + S + + D E
Sbjct: 102 GSYFFNYKKHFSIVLMGLTDAFCRFIWVNIGDFGSSNDAGIFQRSDLRQALDNEEIDIPA 161
Query: 186 -GWRPFPDAV----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSFG 239
+ P D + I+GD A+PLK +L+ P TR + +++ R R IE +FG
Sbjct: 162 PTYLPRTDVLCPYFIIGDGAFPLKNYLMKPYTRANNLTHEEKIFNYRLSRARLTIERAFG 221
Query: 240 ILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
IL +K+ L + + V++A LHN +T ++ ++D+ E +
Sbjct: 222 ILTKKWRILESPVDWKLKNIETVIMALICLHNFLIT-------------EEMSKDEAERK 268
Query: 299 AVDDPDEIDEEEQ 311
V P I+ E +
Sbjct: 269 YVFHPYNIENEAE 281
>gi|331232917|ref|XP_003329120.1| hypothetical protein PGTG_10860 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 360
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 110/254 (43%), Gaps = 14/254 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR+S +++ +E+ L P R LS + +V L+ +G G+ Y I
Sbjct: 59 FVEYFRMSRADFQWLCDELRETLVQDPLRRGAPLSVEAQVGVGLYRVGHGATYVTISNVF 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE----ENLNDIPMTFLRKGGFPNVAGCVDG 120
K + ++ V ++ R + +P+ E N+I +F G P+V G +DG
Sbjct: 119 NIGKETADKAFSRFVNAVLKVLRLRTISFPDLDAAEEWNEIKASFESCHGIPDVVGAIDG 178
Query: 121 TMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T + L P++ ++NR +L + + F V GS+HD RV + S +
Sbjct: 179 THIPLAMPPSDEWKGYINRKNWASLVFQCVVDGHGNFRDVFGGGAGSIHDGRVFQRSPIG 238
Query: 180 RKFDEEGWRPF---PDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKRTRRLI 234
+ P P ++GD+ YP ++ P P AA + F TR ++
Sbjct: 239 NSLNRALGLPLMIPPRTHLIGDAGYPSDVNILVP---YPSIAAPENDYFNYIQSATRIVV 295
Query: 235 ENSFGILKEKFPCL 248
E +FG LK F L
Sbjct: 296 EQAFGRLKNCFRIL 309
>gi|449679701|ref|XP_004209397.1| PREDICTED: uncharacterized protein LOC101238054 [Hydra
magnipapillata]
Length = 198
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 92/196 (46%), Gaps = 10/196 (5%)
Query: 36 VALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP- 94
V +S ++V LH++ Y ++ + G KS++ +HE + + D + VK+P
Sbjct: 3 VPISVVKRVAVGLHYLASCEEYRVVSSLFGIGKSTVNLIVHEFINAVNDILLPKYVKFPL 62
Query: 95 -EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
ENLN F GFP G VDG + I P + + N +++ + A+
Sbjct: 63 SVENLNKHSRDFKAILGFPQCVGAVDGCHIPISTPKDQAISYYNYKRWYSIVLFAVVDCR 122
Query: 154 YKFYAVNANWPGSVHDARVMRNS----MVYRKFDEEGWRPFPDAVI----LGDSAYPLKE 205
Y+F + PG +D+ +++NS ++ ++ + D+++ +GDSA+PL
Sbjct: 123 YRFIYTSVGSPGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLCFIGDSAFPLTR 182
Query: 206 WLIPPRTRNPDAAADQ 221
L+ P N + + Q
Sbjct: 183 HLLKPYPENLELSEIQ 198
>gi|331227606|ref|XP_003326471.1| hypothetical protein PGTG_07449 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 409
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 136/332 (40%), Gaps = 36/332 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALS-AKEKVLSCLHWMGV---GSPYHIIG 61
FK+ FR+S ++ EI NV E+++ L MG G+ ++
Sbjct: 84 FKQEFRMSQQSFCRLILEIQDHPVFHNHSNVPQQPVPEQLMVALKRMGTHGNGASVGMLA 143
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDG 120
S+ ++ V+ I+ + V WP E +I GF N G +DG
Sbjct: 144 RFFRISEGTVILYCSRVIEAIL-SLESTYVSWPSTEGRQEIAWQISSHTGFRNCVGFIDG 202
Query: 121 TMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T+ L + P+ + +R G + L + +C + WPG HD R+ N+ +
Sbjct: 203 TLFPLSEKPSIDPQDYYSRKGQYGLAALIVCNEEKRIMYYVTGWPGCCHDTRLWENTELK 262
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA---ADQRFLRCHKRTRRLIEN 236
+ D+ P ++ DS +P + ++P R P A A ++F + R E+
Sbjct: 263 LQKDQ---LFSPGQYLIADSGFPPETNIVPAFKRPPHGAMPRARKQFNQHLSSLRVCNEH 319
Query: 237 SFGILKEKFPCLNYMRLSPQFAGMV------VLASTTLHNIALTIDRDNENQNVEIHDDE 290
G+LK +F L +R AG + + A LHN L+ +Q+ ++ D
Sbjct: 320 CIGLLKGRFQSLRGLRKDLNSAGTMEKITHWISACVILHNFLLS------DQSPDVFTD- 372
Query: 291 AEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQ 322
VDD D I + ++ RN+ GTQ
Sbjct: 373 ---------VDDID-IHGHDGAQEPRNDLGTQ 394
>gi|115758132|ref|XP_001199795.1| PREDICTED: uncharacterized protein LOC763733 [Strongylocentrotus
purpuratus]
Length = 331
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 112/260 (43%), Gaps = 21/260 (8%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTF 105
L + G+ Y + K +I + + +V +VD + ++ P N ++ F
Sbjct: 48 LRHLANGATYAELSFNFRMGKETIQKFVPDVARAVVDEYAAEVISLPTTNECWLEVAGDF 107
Query: 106 LRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
+ P+ G DG + + PN+ + N ++ +MA+ Y+F ++ G
Sbjct: 108 EARWNLPHCLGGYDGKHIRLQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFLWIDVGGVG 167
Query: 166 SVHDARVMRNSMVYRKFDEEGWRPFPD--------------AVILGDSAYPLKEWLIPPR 211
DA++ NS + ++ E G PD +GD A+ ++ +++ P
Sbjct: 168 HQSDAQIYNNSEL-KECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDYAFAMRTYMMKPY 226
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHN 270
R + F R RR++EN+FGIL +F C L MR P +++ A+ LHN
Sbjct: 227 GRRNMDQQQKIFNYRLSRARRVVENAFGILVLRFQCFLGQMRQEPDTVRLLIEAAVMLHN 286
Query: 271 IALTIDRDNENQNVEIHDDE 290
+ I + + +V I D E
Sbjct: 287 L---IRKHYQAWDVRILDQE 303
>gi|115617904|ref|XP_001201283.1| PREDICTED: uncharacterized protein LOC764781 [Strongylocentrotus
purpuratus]
Length = 391
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 127/303 (41%), Gaps = 22/303 (7%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK+ R+S + +L + P L T A+ K+ L + G+ Y +
Sbjct: 37 FKKYTRVSPEMFDDLLVRLTPHLQKKDTIFRKAIPPGLKLSVFLRHLATGATYAELSYNF 96
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTM 122
K +I + + +V IV + ++ P N ++ F + P+ G DG
Sbjct: 97 RVGKETIQKFVPDVARAIVVEYAAEVISLPTTNEGWLEVAGDFEARWNLPHCLGAYDGKH 156
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + PN+ + N ++ +MA+ Y+F ++ G DA++ NS + ++
Sbjct: 157 IRLKKPNKSGSLYFNYKQFFSVVLMALVDSKYQFLWIDVGRVGHQSDAQIYNNSEL-KEC 215
Query: 183 DEEGWRPFPDAV--------------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
E G PD +GD A+ ++ +++ P R + F
Sbjct: 216 IEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTYMMKPYGRRNMDQQQKVFNYRLS 275
Query: 229 RTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R RR++EN+FGIL F C L MR P ++V A+ LHN+ I + + +V +
Sbjct: 276 RARRVVENAFGILALWFQCFLGQMRQEPGTVRLLVEAAVMLHNL---IRKRYQALDVRML 332
Query: 288 DDE 290
D E
Sbjct: 333 DQE 335
>gi|52353755|gb|AAU44321.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125551096|gb|EAY96805.1| hypothetical protein OsI_18732 [Oryza sativa Indica Group]
Length = 483
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 117/284 (41%), Gaps = 23/284 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + + +++A A+ T A+ ++V CL + G P +
Sbjct: 144 FRRAFRMPRAVFDKLCDDLAAAVAKVDTTLRSAIPVPQRVAVCLWRLATGDPLREVSRRF 203
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G S+ I +V I R+V+WP+ + F P V G V
Sbjct: 204 GLGISTCHNIIVQVCAAITAVLLTRVVRWPDSHAA-AASRFQALSWIPGVVGAVHTEHFR 262
Query: 125 IDAPNEHEDQFV-------NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
I AP EH ++ N +++ + A+ + F V PGS+ DA ++ S
Sbjct: 263 IVAPREHAGKYYDHRLTGRNNKATYSVAMQAVVDADGAFTDVCIGHPGSLSDAAILAKSA 322
Query: 178 VYRK--------FDEEGWR-PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
+Y + D+ GW+ P ++G ++YPL W++ P T+ A R
Sbjct: 323 LYARCEAGLLLGHDKLGWQQPL---WLVGGASYPLTSWMLVPYTQPNLTWAQDRLNARVA 379
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNI 271
R +F L+ ++ CL + P+ A M+ A LHN+
Sbjct: 380 DARAAAVGAFRRLRARWQCLRRAEVKLPELANMLG-ACCVLHNL 422
>gi|332021416|gb|EGI61784.1| hypothetical protein G5I_09901 [Acromyrmex echinatior]
Length = 419
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 140/330 (42%), Gaps = 46/330 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL-------TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
F A+R+ Q + ++ + P + PTE +A++ L ++ G
Sbjct: 47 FFGAYRMWPEQFDLLVNLLHPHFKKNILKPSLPTELRLAVT--------LLYLSQGDNAK 98
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+ A G KS++ + ++E I T ++K P +E+ I F++K FPN G
Sbjct: 99 LKHAEFGIGKSTVHKIVNETCWAIWTTLQPIVLKPPSKEDWIAISEDFMKKWQFPNCLGA 158
Query: 118 VDGTMVLIDAPNEHEDQFVN-----RNGDHA----LNVMAICGPNYKFYAVNANWPGSVH 168
+DG + I AP F N R+ + + ++A+C YKF VN GS+
Sbjct: 159 LDGRYMTIQAPPNSGSAFYNYKQWLRSAEKPQFFNMILLAVCDATYKFTWVNIEQRGSIS 218
Query: 169 DARVMRNSMVYRKFDE------------EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPD 216
D V N+ + E PFP V +GD A+PL ++ P R
Sbjct: 219 DDGVWANTKLASSLAAGDLLLPDPTLFPETNTPFP-FVFIGDEAFPLSTHMMRPYPREKL 277
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTI 275
+ F R RR IEN+FGIL ++ L+ + +S V+ A LHN + +
Sbjct: 278 TDDMRIFNYRLLRVRRTIENAFGILTARWRILHKPLYMSITNCENVLKALVCLHNFIM-L 336
Query: 276 DRDNENQN------VEIHDDEAEDDGEEEA 299
++E+ N ++ D E +D G E
Sbjct: 337 GEEHESINNRQYCTTDLIDIEKQDGGIREG 366
>gi|357622688|gb|EHJ74112.1| hypothetical protein KGM_12729 [Danaus plexippus]
Length = 342
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 118/276 (42%), Gaps = 23/276 (8%)
Query: 12 LSATQVEFILEEIAPALTTPTER-NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S +F+L ++ P +T R V + AK ++ L ++ G Y + S ++
Sbjct: 1 MSPNDFDFLLSKVEPLITKQKTRLRVPIPAKVRLALTLRFLATGDSYRSLHHLFKISSAA 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
I I EV I +T + +K P +I F RK +P+ GC+DG V+I P
Sbjct: 61 ITFIIQEVCTAI-NTVLKDQIKMPRTTTEWLNIESGFSRK--YPHCVGCIDGKHVVIQCP 117
Query: 129 -NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
N +++ G ++ ++A+ NY F + + D + +NS+++ K
Sbjct: 118 INSGTEKY---KGTYSFVLLALVDSNYCFIFADIGAQDRISDGGIFQNSVLWEKISTGTI 174
Query: 188 RPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR-LIE 235
PD+ V LGD A+ L + ++ P N + QR + R ++E
Sbjct: 175 NLPPDSPLSDGQCNMPHVFLGDGAFALSKHVMIPFPGNHVMGSLQRTFNMRLSSARVVVE 234
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
N FG+L F M + A ++ + LHN
Sbjct: 235 NVFGLLTTVFRIFKKPMEIKKDKAKLITMTCILLHN 270
>gi|108707827|gb|ABF95622.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
Length = 747
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 132/307 (42%), Gaps = 36/307 (11%)
Query: 43 KVLSCLHWMGVGSPYHI--IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND 100
+ L W+ VG+P I + S ++I EV+ V++ +V+ + +
Sbjct: 453 EALGLFLWI-VGAPQSIRQVEDRFTRSLATIVSKFDEVLDVLIQ-LATDIVRPKDPHFTT 510
Query: 101 IPMTFLRKGGFP---NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
I L P N G +DGT + + PN Q N + + + NV+ IC + +F
Sbjct: 511 IHKKLLSPKYTPYLDNCIGAIDGTHIQVVVPNATAVQHRNSHKERSQNVLCICDFDMRFT 570
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------- 210
V A WPGSV D RV ++ +F + P + DS YP + + P
Sbjct: 571 FVLAGWPGSVDDMRVFSDAQ--NRFGHKFPWPPEEMFYFMDSGYPNRPGYLAPYKGITYH 628
Query: 211 -RTRNPDAAADQR---FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLAS 265
+ N R F CH R +IE SFG+LK K+ L ++ P Q M++ A
Sbjct: 629 FQEYNEGIMPQNRKEYFNYCHSSCRNVIERSFGVLKNKWRILFHLPSYPQQKQSMIICAC 688
Query: 266 TTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLA 325
LHN RD++ + E D + D E V P E+ P+ N+P + +
Sbjct: 689 LALHNFI----RDSKLADCEF--DMCDQD--ESYVPLP-----EQSPQS--NDPTSSLDS 733
Query: 326 RLLNYFR 332
+++N FR
Sbjct: 734 QVMNQFR 740
>gi|345494085|ref|XP_003427214.1| PREDICTED: hypothetical protein LOC100678860 [Nasonia vitripennis]
Length = 954
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 93/193 (48%), Gaps = 15/193 (7%)
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-----EENLNDIPMTFLRKGGFPNV 114
I + G +KS+ + + VV + + + + WP +E + I F GFP V
Sbjct: 720 ITSKFGVAKSTAWCATYRVVRALCN-YRNYYISWPSPAEAQETADRIEQRF----GFPGV 774
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +D T + I A + ++NR G H++ + IC +F A PGSVHD RV R
Sbjct: 775 IGALDRTDIYIAALRRDANAYINRKGRHSIQLQVICNDKLEFINCYAGMPGSVHDMRVYR 834
Query: 175 NSMVYRKFDEEGWRPFP-DAVILGDSAYPL-KEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
S V + +++ FP + +L D+AY L + ++P R + F R+R
Sbjct: 835 YSGVQNQCNDQF---FPNNTYLLADAAYTLQRHIMVPYRDHGHLTLEEICFNTMLSRSRM 891
Query: 233 LIENSFGILKEKF 245
++E + G+LK ++
Sbjct: 892 MVERAIGLLKMRW 904
>gi|357627760|gb|EHJ77342.1| hypothetical protein KGM_10683 [Danaus plexippus]
Length = 462
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 119/279 (42%), Gaps = 16/279 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
AF + +RLS + EE+ P + + +++ S + KVL+ L + G IG
Sbjct: 66 AFLDMYRLSKDLARNLCEELKPVMPD-SIKSIEFSVESKVLAALSFYATGKYQKSIGGRS 124
Query: 65 GPSKSS--ICRSIHEVVPVIVD-TFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVD 119
PS + + ++ +V + D + ++ + +P I F K G PNV GCVD
Sbjct: 125 DPSITQYFVATAVMQVTEAMNDPSIIKKYIHFPHLRNEREVIKNGFYMKYGIPNVVGCVD 184
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV-HDARVMRNSMV 178
V I P+E + + N++ H+ V I + +V + GS HDA + R+++
Sbjct: 185 CVHVPIARPDEDQKKHFNKSY-HSKKVQIISDSRQRIMSVCSEGGGSYSHDALLARHAVT 243
Query: 179 YRKFDEEGWRPFPDAVILGDSAYPLKEWL---IPPRTRNPDAAADQRFLRCHKRTRRLIE 235
R +LG Y K +L +P T+ + ++ + H + +
Sbjct: 244 VDLVSLNNSRDL--CWLLGGPHYSQKPYLMAPVPKMTKKSSMSPEKYYTNLHAQAHSAVT 301
Query: 236 NSFGILKEKFPCLNYM---RLSPQFAGMVVLASTTLHNI 271
+ LK ++ CL + P +VLA LHNI
Sbjct: 302 ETIKQLKARWKCLQATSNKQFDPPTVAKMVLACCVLHNI 340
>gi|449684574|ref|XP_004210663.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 335
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 120/288 (41%), Gaps = 26/288 (9%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
++P P R L+ +K+ ++++ + G + ++ +++ V I
Sbjct: 33 VSPQSNCPNYR--FLTTLKKLAITIYYLKDTGSLWMTANVFGIHQCTVSKTVKVVCDAIN 90
Query: 84 DTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
+ + P+ E++ + F K G GC+DGT V I P ++ +
Sbjct: 91 NIVGPIYLHLPKNKEDMTKLASQFEVKFGMIQAFGCIDGTHVQIKHPIKNGQDYFCFKQY 150
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRPFP 191
+LNV A+C F V WPGSVHDA++ NS + +K + + P
Sbjct: 151 FSLNVQAVCDSKGYFIDVECKWPGSVHDAKMFTNSTINKKLIKGTLPQTLYSLPNYHSIP 210
Query: 192 DAVILGDSAYPLKEWLIP--PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL- 248
+ +I GD AYPL + I N + LR R IE +FG LK ++ L
Sbjct: 211 NYLI-GDPAYPLTNFCIKDFQSCSNNEEVIFNSMLR---SARNQIECAFGRLKARWGFLR 266
Query: 249 NYMRLSPQFAGMVVLASTTLHNIA---LTIDRDNE--NQNVEIHDDEA 291
+ + + +V+ LHN T D + E NQ +E H +A
Sbjct: 267 KIIDIKIETVPIVIYTCFVLHNFCEKNKTYDLNEEEVNQQIERHRSDA 314
>gi|218189338|gb|EEC71765.1| hypothetical protein OsI_04365 [Oryza sativa Indica Group]
Length = 396
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 139/326 (42%), Gaps = 30/326 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RN------VALSAKEKVLSCLHWMGVGSPY 57
F+ F++S +++ I+ T T+ RN L +++V L + G
Sbjct: 54 FESVFKMSRKTFDYLCSLISGDFTRKTQSFRNFRFGDKAILGVEDQVGVALLRLTTGESL 113
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + + KWP E + + F + G PN G
Sbjct: 114 LSIGNRFGMNHSAISNITWKFIEALEERAANHL-KWPTPEEMATVKAKFEKIQGLPNCCG 172
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + + +++ +++ + AI + +F V + WPGS++D+ ++R
Sbjct: 173 AIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDADMRFRDVVSGWPGSLNDSCILRT 232
Query: 176 SMVYRKFDEEGWR----------PFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAADQR 222
S YR E+G R P V ILGD++YPL WL+ P + A
Sbjct: 233 SGFYR-LCEKGARLDGQTELPGEPAGSVVRDYILGDASYPLLPWLMTPYREKDLSPAKAD 291
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
F + H T +++ + LKE++ L P + ++ L NI + ++
Sbjct: 292 FNKRHAATIMVVQGALAKLKERWQVLKGELWRPDKHRLPRIIYVCCLLTNIMIDLEDAAR 351
Query: 281 NQNVEIHDDEAEDDGEEEAVDDPDEI 306
H+ DDG + D ++
Sbjct: 352 GGMPPSHN---HDDGYRQQFSDVADV 374
>gi|115440939|ref|NP_001044749.1| Os01g0838900 [Oryza sativa Japonica Group]
gi|15623822|dbj|BAB67881.1| unknown protein [Oryza sativa Japonica Group]
gi|113534280|dbj|BAF06663.1| Os01g0838900 [Oryza sativa Japonica Group]
gi|215707022|dbj|BAG93482.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619512|gb|EEE55644.1| hypothetical protein OsJ_04014 [Oryza sativa Japonica Group]
Length = 396
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 139/326 (42%), Gaps = 30/326 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RN------VALSAKEKVLSCLHWMGVGSPY 57
F+ F++S +++ I+ T T+ RN L +++V L + G
Sbjct: 54 FESVFKMSRKTFDYLCSLISGDFTRKTQSFRNFRFGDKAILGVEDQVGVALLRLTTGESL 113
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + + KWP E + + F + G PN G
Sbjct: 114 LSIGNRFGMNHSAISNITWKFIEALEERAANHL-KWPTPEEMATVKSKFEKIQGLPNCCG 172
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + + +++ +++ + AI + +F V + WPGS++D+ ++R
Sbjct: 173 AIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDADMRFRDVVSGWPGSLNDSCILRT 232
Query: 176 SMVYRKFDEEGWR----------PFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAADQR 222
S YR E+G R P V ILGD++YPL WL+ P + A
Sbjct: 233 SGFYR-LCEKGARLDGQTELPGEPAGSVVRDYILGDASYPLLPWLMTPYREKDLSPAKAD 291
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
F + H T +++ + LKE++ L P + ++ L NI + ++
Sbjct: 292 FNKRHAATIMVVQGALAKLKERWQVLKGELWRPDKHRLPRIIYVCCLLTNIMIDLEDAAR 351
Query: 281 NQNVEIHDDEAEDDGEEEAVDDPDEI 306
H+ DDG + D ++
Sbjct: 352 GGMPPSHN---HDDGYRQQFSDVADV 374
>gi|449680967|ref|XP_002158505.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 285
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 117/278 (42%), Gaps = 21/278 (7%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++CR E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCRIFKETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G D + N+ + + +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYG--RDNTAILNASTFGRAFNKG 176
Query: 187 WRPFP---------DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIEN 236
+ P V++ D + LK WL+ + QR H R RR IEN
Sbjct: 177 YFNLPKFFEFDPKVSPVLVKDDIFALKPWLMKSYP-GKNLTVQQRVFNYHLSRARRTIEN 235
Query: 237 SFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
SFGIL ++ ++ P ++ A+ LHN L
Sbjct: 236 SFGILAARWRIYRSPIKAKPLIVEHIIKATVCLHNYIL 273
>gi|307205232|gb|EFN83615.1| Putative nuclease HARBI1 [Harpegnathos saltator]
Length = 181
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 74/151 (49%), Gaps = 2/151 (1%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G K++ R++ V+ +V Q ++WP I F + G V G +DG ++
Sbjct: 10 GIGKATCFRALRRVIFSLVQIVHQ-FIQWPTGRTAVKIMKDFYKISGLRGVIGAIDGILI 68
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I+ P E+ D ++ R A+++ C F +V +P S+HDARV+RNS ++
Sbjct: 69 KINKPTENSDDYICRKRYPAIHLQVTCTQQLLFTSVLVGYPRSIHDARVLRNSRIHEYLA 128
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
++ I+GD+AYP+ L+ P N
Sbjct: 129 HPDIYFSDNSYIVGDAAYPIHSNLMVPFKNN 159
>gi|449686912|ref|XP_004211294.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 153
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 76/150 (50%), Gaps = 11/150 (7%)
Query: 104 TFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
+F + GFP V G VDG + A N++ + ++NR H++ + + F + W
Sbjct: 3 SFEKLYGFPQVVGAVDGIHIRTKASNKNSEDYINRKDYHSIILQGLADSKCLFRDIFVRW 62
Query: 164 PGSVHDARVMRNSMVYRKFDEEGWRPFP----------DAVILGDSAYPLKEWLIPPRTR 213
G HD+RV +NS +Y++ + P +ILGDSA+ L+ WL+ P +
Sbjct: 63 TGKFHDSRVFKNSPLYKECLARTFLPNNLNKLIANIEIGPLILGDSAFSLENWLMKPYSD 122
Query: 214 NPDAAADQ-RFLRCHKRTRRLIENSFGILK 242
+ + ++ +F + R +IEN+FG LK
Sbjct: 123 RENLSIEEAKFNTSLSKFRVVIENAFGRLK 152
>gi|328713765|ref|XP_003245173.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 397
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 134/315 (42%), Gaps = 24/315 (7%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPY 57
+ P F +R+S + +L+ + ++ TP + LSA+E++ L ++ G +
Sbjct: 56 LYPEKFFIYYRMSIKSFDELLQNVRDHISKRYTPWRNH--LSAEERLTITLRYLSTGMTF 113
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
+ KS++ + E I +T + + P E+ +I TF K FPN G
Sbjct: 114 TALHFEFKIGKSTVGEIVRETCVAIWNTLQKEEMPEPTTEHWLEISNTFYTKTNFPNCLG 173
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDG + I P F N ++ +MA+ N F ++ G D+ + + S
Sbjct: 174 AVDGKHIRIQNPKNTGSLFFNYKKYFSIILMAVVDANLSFIYIDVGSYGKESDSNIFKES 233
Query: 177 MVYRKFDEE-----GWRPFPDA-------VILGDSAYPLKEWLIPPRTRNPDAAADQRFL 224
+K G P + V +GD A+ L L+ P + D+R +
Sbjct: 234 SFGKKLYSNALNIPGPIRLPSSQGSPQPYVFVGDEAFALHTNLLRPYPQR--GLNDERRI 291
Query: 225 RCHK--RTRRLIENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTTLHNIALTIDRDN-E 280
++ R RR +E +FG+L K+ L+ L SP+F +V + LHN D N E
Sbjct: 292 FNYRLSRARRTVECAFGVLANKWRVLHTTILVSPEFVDDIVKSCCVLHNFVRKRDGYNFE 351
Query: 281 NQNVEIHDDEAEDDG 295
+ E + D+ E G
Sbjct: 352 DMESECNLDDIEVRG 366
>gi|242083890|ref|XP_002442370.1| hypothetical protein SORBIDRAFT_08g019015 [Sorghum bicolor]
gi|241943063|gb|EES16208.1| hypothetical protein SORBIDRAFT_08g019015 [Sorghum bicolor]
Length = 723
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 113/267 (42%), Gaps = 39/267 (14%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
++ +EK+ L+ + + Y + G S + I+ V++ T +R ++ P+ N
Sbjct: 74 ITVEEKLGFFLYMLSHNASYEDLQVFFGHSNDTFHHHINHFFKVVIPTLSRRFLQAPDPN 133
Query: 98 -----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
+ D P + F N G +DGT + I +E F NR G ++NVM C
Sbjct: 134 QVHQKIQDNPRFY---PFFKNSLGAIDGTHIPISIASEKASPFRNRKGTLSINVMVACDF 190
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-- 210
+ +++ W GS D+RV+R++M +G++ P L D Y + P
Sbjct: 191 DLNITFISSGWEGSATDSRVLRSAM------SKGFQVPPGKFYLVDGGYANTPSFLAPYR 244
Query: 211 --------------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN---YMRL 253
R +NP + R H R +E + G+LK++FP L + L
Sbjct: 245 GVRYHLKEFGSGRRRPQNPMELFNHR----HALLRNHVERALGVLKKRFPILKVATFHML 300
Query: 254 SPQFAGMVVLASTTLHNIALTIDRDNE 280
Q + +A+ +HN+ + D E
Sbjct: 301 ENQVK--IPIAAAIIHNLIRLLHGDEE 325
>gi|356554812|ref|XP_003545736.1| PREDICTED: uncharacterized protein LOC100806549 [Glycine max]
Length = 418
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 118/269 (43%), Gaps = 35/269 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 95 ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG + + E ++ NR
Sbjct: 150 GAIMKVS-KEYLKFHEYNLEGSVENKWR--WFKNSIGALDGIHIPVTVSAEDRPRYRNRK 206
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK-----------FDEEGWR 188
GD + NV+ +CGP+ +F V W GS D+RV+R+++ +R+ + G+
Sbjct: 207 GDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDAL-HRQNCLHIPNGKYFLVDAGYT 265
Query: 189 PFPDAVI-LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
P + + Y L EW+ RN + F H R +IE SFG+LK+++
Sbjct: 266 NGPGFLAPYRGTRYHLNEWI-----RNTPQNYKELFNLRHASARNVIERSFGVLKKRWSI 320
Query: 248 LNYMRLSPQFAGM-----VVLASTTLHNI 271
L +P F + ++ A LHN
Sbjct: 321 LR----TPSFFDIKTQIRIINACFMLHNF 345
>gi|357119278|ref|XP_003561370.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 385
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 123/297 (41%), Gaps = 25/297 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
F+ FR S + ++I + LT + LS +++V + + G
Sbjct: 52 GFRYFFRTSRSTFDYICSIVRDDLTWRLPSGLIRIDGRLLSVEKQVAIAMRRLASGDSQV 111
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G S++ + + + + Q + WP +E ++DI F G PN G
Sbjct: 112 SVGAAFGVGPSTVSQVTWRFIESM-EGRAQHHLAWPDQETMDDIKANFEVVSGLPNCCGA 170
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +L+ P E + + + G++++ + I +F + WPG + ++++ S
Sbjct: 171 IDATHILMTLPAVESSENWRDHAGNYSMILQGIFDHEMRFIDICTGWPGGMTVSQLLEFS 230
Query: 177 MVYRKFDEEGWR-PFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
Y KF E G R P V I+GD YPL WL+ P A F
Sbjct: 231 GFY-KFCEAGERLDGPSQVSREGAQIREFIVGDMCYPLLPWLMTPYEGESLAVPMVDFNA 289
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
K R L + LK + LN + P + ++L LHNI IDR ++
Sbjct: 290 RQKAARMLGTRALARLKGSWRILNKVMWRPDKHKLPSIILVCCLLHNI--MIDRQDQ 344
>gi|357139565|ref|XP_003571351.1| PREDICTED: uncharacterized protein LOC100834412 [Brachypodium
distachyon]
Length = 329
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 100/232 (43%), Gaps = 31/232 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + + P + + R+G + NV+A+C + +F WPGSVHD
Sbjct: 88 FRGAIGAIDGTHIPVIVPTDETVNYTCRHGYTSQNVLAMCDFDMRFIFGVDGWPGSVHDT 147
Query: 171 RVMRNSMV-YRKFDEEGWRPFP--DAVILGDSAYPLKE-WLIP--------PRTRNPDA- 217
RV+ +++ + F P P D L DS YP + +L P P R P
Sbjct: 148 RVLNHALANFPCF------PVPPKDKYYLVDSGYPNRTGYLAPIKGSTYHLPEFRLPRGR 201
Query: 218 ---AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIAL 273
D+ F H R +IE SF +LK+K+ L M P+ ++LA LHN
Sbjct: 202 VLQGRDEIFNFLHSSLRNVIERSFAVLKQKWRILKSMPSFKPRAQTRIILACIALHNFI- 260
Query: 274 TIDRDNENQNVEIH----DDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGT 321
RD++ +VE D+E +G E D+ E E PG
Sbjct: 261 ---RDSKLPDVEFDSCDADEEYMPEGAEVGTTQGDDGPEGEDGYVVPEGPGV 309
>gi|449673717|ref|XP_002156809.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 120/277 (43%), Gaps = 18/277 (6%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S T E +L +AP + T +S E++ L ++ I A + S S+
Sbjct: 1 MSLTTYEELLSFVAPIIVKQRTTMRDPVSPSERLAVTLRFLVTEDAQCTIAAGYRISAST 60
Query: 71 ICRSIHEVVPVIVDTFFQR---MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
I R I E I + +R V ++ + F FP+ G +DG +++ A
Sbjct: 61 ISRIISETCAAIWTSLKERNFLHVPSEKQEWKTVAKEFENMWNFPHAIGAIDGKHIVMQA 120
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN-SMVYRKFDEEG 186
P+ ++ N H++ ++A+C Y+F V+ G D RV N S+ Y + +
Sbjct: 121 PHNGGSEYFNYKKTHSIVLLAVCYAKYEFTMVDIGESGKQSDGRVFNNCSLGYAIENNKL 180
Query: 187 WRPFPD----------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIE 235
P P+ V++ D+A+ LK ++ P N + DQ+ R RR+IE
Sbjct: 181 NIPDPEYIGNSEKVLPYVLVADNAFGLKRHIMKPYP-NQNIPLDQKIFNYRLSRARRVIE 239
Query: 236 NSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNI 271
N+FGI +F + ++ + ++ A LHN
Sbjct: 240 NTFGIATTRFRIFRRLIIANLEKVILITQAIVALHNF 276
>gi|125534959|gb|EAY81507.1| hypothetical protein OsI_36676 [Oryza sativa Indica Group]
Length = 370
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 145/356 (40%), Gaps = 54/356 (15%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGA 62
P F + FR+S + + + + NV S+ E + L +G + + +
Sbjct: 33 PRYFYKMFRMSPEIFHALHDLLVSTYGLSSSNNV--SSIESLTMFLWIVGGPQSFSQVES 90
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG----------FP 112
S ++ HEV+ + K + N+ TF + G F
Sbjct: 91 HFTRSLWTVHTKFHEVLKC--------LRKLAKNNITPRDTTFSTEHGRLREDRFWPYFK 142
Query: 113 NVAGCVDGT----MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
+ G +DG+ +VL+D H R+G + NV+AIC + +F A WPGS H
Sbjct: 143 DAIGAIDGSHISVVVLLDETISH----TCRHGYTSQNVLAICDFDMRFIFAVAGWPGSAH 198
Query: 169 DARVMRNSMV-YRKFDEEGWRPFP--DAVILGDSAYPLKEWLIPP-----------RTRN 214
D+R++ +++ + F P P L DS YP + + P R R+
Sbjct: 199 DSRILSHALANFPSF------PMPPTGKYYLVDSGYPNRIGYLAPFKGTTYHIPEFRHRS 252
Query: 215 -PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIA 272
P + F H R +IE SFG+LK+K+ L + SP +++A LHN
Sbjct: 253 GPPQGKYEVFNFLHSSLRNVIERSFGVLKQKWRILKGIPSFSPATQKHIIMACLALHNFV 312
Query: 273 LTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLL 328
RD+ ++ E +A++D E D + + E+ D N R+A L
Sbjct: 313 ----RDSNLRDKEFERCDADEDYLLEDTSDTSDDESEDAENDDTMNTIRTRIADAL 364
>gi|391330334|ref|XP_003739618.1| PREDICTED: putative nuclease HARBI1-like [Metaseiulus occidentalis]
Length = 254
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 105/254 (41%), Gaps = 19/254 (7%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYH 58
+TP + RLS +Q ++LE I P + T RN AL + K+ L ++ G
Sbjct: 15 VTPSDLQNHTRLSESQFSYLLERIRPRIEKRDTTFRN-ALRPESKLHMTLRYLATGDSIG 73
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
+ A + +S+ V I D + ++ P EE + F + FPN G
Sbjct: 74 SLSALYRIPRSTFSSFFPVVCKAIYDAL-EEFIQMPSSEEEWKVVMQEFALRWQFPNACG 132
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DG V I P + +F N ++ +A+ Y F ++ G V+DA V S
Sbjct: 133 AIDGKHVTIRCPPKSGSEFFNYKKSFSIISLAVVDARYNFLYIDVGTNGRVNDAGVFEKS 192
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC-HKRTRRLIE 235
R +E+ + V + D A+PL+ ++ P F RC H R+LI
Sbjct: 193 SFNRALEEQVLDVPIEGVFVADDAFPLRSNILKP------------FSRCGHLDERQLIF 240
Query: 236 NSFGILKEKFPCLN 249
N K+K N
Sbjct: 241 NYTARQKKKIRITN 254
>gi|331241303|ref|XP_003333300.1| hypothetical protein PGTG_14220 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 104/214 (48%), Gaps = 23/214 (10%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD++ + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTNN 274
Query: 205 EWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAG 259
+++P + + + F ++R IE++ GILK +F L +R + + G
Sbjct: 275 RYVVPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKG 334
Query: 260 MV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 335 AVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|331225010|ref|XP_003325176.1| hypothetical protein PGTG_06713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|331246035|ref|XP_003335652.1| hypothetical protein PGTG_16424 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 104/214 (48%), Gaps = 23/214 (10%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 274
Query: 205 EWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAG 259
++IP + ++ + F ++R IE++ GILK +F L +R + + G
Sbjct: 275 RYVIPAFKGKHLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKG 334
Query: 260 MV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 335 AVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|115528684|gb|AAI24960.1| LOC100158389 protein [Xenopus laevis]
Length = 352
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 122/296 (41%), Gaps = 25/296 (8%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + E +L+ +AP + T A+ A++++++ L ++ G + G S S
Sbjct: 1 MSDSSFETLLQAVAPLIAKQDTCMRQAIPAEQRLIATLQFLATGRSLEDLKIPIGISAQS 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ I E I + +K+P E F FPN G + G V I P
Sbjct: 61 LGHIIPETCNAIFEALKTAYLKFPSTEAEWKATARHFEEFWNFPNCGGAIVGKHVRIKPP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR 188
+ F N G +++ ++AI Y+F V+ G V M + Y++ +
Sbjct: 121 SRSGSYFTNYKGYNSIVLLAIVNAKYEFLMVDVGKNGRVSSVESMEQTYFYQRLQNHQLQ 180
Query: 189 PFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSFG 239
++ V + + L E ++ P + D + ++R R RR+ +N+FG
Sbjct: 181 LPSNSDTSEGMNYVFVTGEEFALHEHILTPFPQK-DMSFERRIFNYRLSRARRVADNAFG 239
Query: 240 ILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI-----------ALTIDRDNENQN 283
IL +F + + LSP VVLA LHN A +DR++ Q+
Sbjct: 240 ILSNRFRIFHTAINLSPSKTDSVVLACCVLHNFLCRTSGASYMPATMLDREDTEQS 295
>gi|291230836|ref|XP_002735371.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 445
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 123/305 (40%), Gaps = 24/305 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P AF+ RL+ + ILE +AP + T LSA K+ L + G Y +
Sbjct: 94 PAAFRNFTRLTPEVFDDILERVAPVIQKQQTNYRHPLSAGLKLAITLRHLATGDNYRSLA 153
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVD 119
S+I I EV IV + + P E ++ F ++ P+ G +D
Sbjct: 154 YGFRCGISTISELIPEVCTAIVQAYKDEVFNPPTTPEAWRNLAQQFEQRWNVPHAVGALD 213
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
G + I P + N G ++ ++A YKF + G + D+++ +S ++
Sbjct: 214 GKHIAIKKPANTGSLYHNYKGFFSITMLAFVDAEYKFIWIELGGKGHMSDSQIFTDSELF 273
Query: 180 RKFDEEGWR-----PFPDAV-------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
++ + P P ILGD A+ LK +++ P +R + +
Sbjct: 274 ECLEDGSFGLPPSCPLPGETEPDIPYFILGDDAFALKSYMMKPYSRRGMTDEQRIYNYRI 333
Query: 228 KRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNI--------ALTIDRD 278
R RR++EN L +F C L + + +V + LHN+ A +D +
Sbjct: 334 SRGRRVVENRQTGLANRFRCLLGTLEQKVENVRNLVETAVLLHNLLRQRVVMAANEVDHE 393
Query: 279 NENQN 283
+E N
Sbjct: 394 DEEHN 398
>gi|242063682|ref|XP_002453130.1| hypothetical protein SORBIDRAFT_04g000610 [Sorghum bicolor]
gi|241932961|gb|EES06106.1| hypothetical protein SORBIDRAFT_04g000610 [Sorghum bicolor]
Length = 317
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 113/267 (42%), Gaps = 39/267 (14%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
++ +EK+ L+ + + Y + G S + I+ V++ T +R ++ P+ N
Sbjct: 41 ITVEEKLGFFLYMLSHNASYEDLQVFFGHSNDTFHHHINHFFKVVIPTLSRRFLQAPDPN 100
Query: 98 -----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
+ D P + F N G +DGT + I +E F NR G ++NVM C
Sbjct: 101 QVHQKIQDNPRFY---PFFKNCLGAIDGTHIPISIASEKASPFRNRKGTLSINVMVACDF 157
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-- 210
+ +++ W GS D+RV+R++M +G++ P L D Y + P
Sbjct: 158 DLNITFISSGWEGSATDSRVLRSAM------SKGFQVPPGKFYLVDGGYANTPSFLAPYR 211
Query: 211 --------------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL---NYMRL 253
R +NP + R H R +E + G+LK++FP L + L
Sbjct: 212 GVRYHLKEFGSGRRRPQNPMELFNHR----HALLRNHVERALGVLKKRFPILKVATFHML 267
Query: 254 SPQFAGMVVLASTTLHNIALTIDRDNE 280
Q + +A+ +HN+ + D E
Sbjct: 268 ENQVK--IPIAAAIIHNLIRLLHGDEE 292
>gi|347966157|ref|XP_003435876.1| AGAP013250-PB [Anopheles gambiae str. PEST]
gi|333470184|gb|EGK97538.1| AGAP013250-PB [Anopheles gambiae str. PEST]
Length = 358
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 124/286 (43%), Gaps = 26/286 (9%)
Query: 7 KEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
+E +S ++L EI+ ++ T A +AKE+++ L ++ G + + + +
Sbjct: 22 RELLGMSEEDFNYLLNEISGKISRRDTFMRKAFTAKERLIVTLRFLATGESFMALASLYD 81
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
S SSI I EV ++ +R V++P E + F + FP+ G +D V
Sbjct: 82 ISASSIRTIIPEVCECLIKAL-KRYVQFPPSEAGWLRVSEAFQDRWQFPHAIGVIDARHV 140
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I P + ++N G +++ ++AI + F V GS+ D ++RN+ + KF+
Sbjct: 141 KIRKPLHTDKDYLNYKGFYSIVLLAIVDASANFMYVCVGGKGSIADGGMLRNASYHSKFE 200
Query: 184 EEGWRPFPDAV------------ILGDSAYPLKEWLIPP-----RTRNPDAAADQRFLRC 226
P AV +LGD ++ E+ I P + +P++ + R
Sbjct: 201 HHELNVPPPAVLDERHAVKIPYMLLGDKSFLFTEYCIRPFGGHLKPDSPESTFNYRM--- 257
Query: 227 HKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNI 271
+ R +F L +F + L P AG VV+A+ L N
Sbjct: 258 -SQARTPAAVAFDGLCSRFKIFGTIINLQPDKAGKVVMAAVYLFNF 302
>gi|331227576|ref|XP_003326456.1| hypothetical protein PGTG_07434 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 277
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 104/214 (48%), Gaps = 23/214 (10%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 44 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 103
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 104 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 156
Query: 205 EWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAG 259
++IP + ++ + F ++R IE++ GILK +F L +R + + G
Sbjct: 157 RYVIPAFKGKHLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKG 216
Query: 260 MV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 217 AVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 246
>gi|357118418|ref|XP_003560952.1| PREDICTED: uncharacterized protein LOC100834066 [Brachypodium
distachyon]
Length = 320
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 90/217 (41%), Gaps = 29/217 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + P E + NR G + NV+A+C + +F W GSVHD
Sbjct: 100 FKDAIGAIDGTHIPVIVPAELKVIHTNRKGYTSQNVLAMCDYDMRFIFAVPGWLGSVHDT 159
Query: 171 RVMRNSMV-YRKFDEEGWRPFPDAVILGDSAYPLK----------EWLIPPRTRNPDAAA 219
RV ++ Y F P L DS YP + + +P P
Sbjct: 160 RVWSDARAEYDTFPH----PPQGKYYLVDSGYPNRVGYLAPYKGQRYHVPEFENAPPVGM 215
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA-GMVVLASTTLHN-IALTIDR 277
+ F CH R +IE +FG+LK K+P L + P M+V A LHN I + R
Sbjct: 216 QEMFNYCHSSLRNVIERAFGVLKRKWPILQGIPAYPVLKQKMIVSACMCLHNYIRDSKLR 275
Query: 278 DNENQNVE----IHDDE--------AEDDGEEEAVDD 302
D E +H+D DDG A+ D
Sbjct: 276 DEHFDRFERGAYVHEDGPTGVNALVGTDDGTMSAIRD 312
>gi|432951260|ref|XP_004084775.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 273
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 77/162 (47%), Gaps = 14/162 (8%)
Query: 1 MTPFAF-----KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGS 55
+ P AF E +R SA + ++ + P + T R+ LS ++ V LH+ G+
Sbjct: 31 LDPLAFPDDHLYERYRFSADGIRYLCRLLGPRIKHRTARSHTLSVEQMVCMALHFFASGA 90
Query: 56 PYHIIGATHGPSKSSICRSIHEV---VPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGF 111
+ +G +K++ICR+I V + + D F + P L D+ F GF
Sbjct: 91 FLYSVGDAEQLNKATICRTIRSVCLAIKALADVF----ISLPGHRRLCDMKEEFNWIAGF 146
Query: 112 PNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGP 152
P V G VD T + I AP+ HE FVNR H++NV A P
Sbjct: 147 PKVIGAVDCTHIRIKAPSGAHEADFVNRKSCHSINVQADLSP 188
>gi|115770499|ref|XP_001184795.1| PREDICTED: uncharacterized protein LOC754204 [Strongylocentrotus
purpuratus]
Length = 342
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/265 (23%), Positives = 113/265 (42%), Gaps = 21/265 (7%)
Query: 43 KVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--D 100
K+ LH + G+ Y + K +I + + +V +VD + ++ P N +
Sbjct: 54 KLSVFLHHLATGATYAELSYNFRVGKETIQKFVPDVARAVVDEYAAEVISLPTTNEGCLE 113
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+ F + P+ G DG + + PN+ + N ++ +MA+ Y+F ++
Sbjct: 114 VAGYFEARWNLPHCLGAYDGKHIRLQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFPWID 173
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVIL--------------GDSAYPLKEW 206
G DA++ NS + ++ E G PD L GD A+ ++ +
Sbjct: 174 VGGVGHQSDAQIYNNSEL-KECIEAGTLGIPDPAPLPQDDEEHPMPYFSVGDDAFAMRTY 232
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLAS 265
++ P + F R RR++EN+FGIL +F C L MR P +++ A+
Sbjct: 233 MMKPYGLCNMDKQQKIFNYRLSRARRVVENAFGILALRFQCFLGQMRQEPDTVRLLIEAA 292
Query: 266 TTLHNIALTIDRDNENQNVEIHDDE 290
HN+ I + + +V + D E
Sbjct: 293 VMPHNL---IRKHYQALDVRMLDQE 314
>gi|38346294|emb|CAE04176.2| OSJNBa0029C04.7 [Oryza sativa Japonica Group]
Length = 758
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 111/241 (46%), Gaps = 23/241 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFL---RKGGFPNVAGCVDGTMV 123
S S++ V+ +V +++K + N DIP L R G F G VDGT +
Sbjct: 484 SNSTVSIYFRRVLSRMV-MLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHI 542
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ + + +NR+ + NV+ I G + + +A WPGSVHD RV+ ++ +
Sbjct: 543 PVTVGVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVLNEAI--DSYP 600
Query: 184 EEGWR-PFPDAVILGDSAYPLKEWLIPPRTR-------------NPDAAADQRFLRCHKR 229
EE R PF +L DS YP + + P R P +++F H +
Sbjct: 601 EEFPRLPF-RKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPHAPPPEGREEKFNYIHAK 659
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R +IE FGI+K+++ L + +P ++LA+ LHN + +++ N ++
Sbjct: 660 LRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDSKQNDFQANNPLY 719
Query: 288 D 288
+
Sbjct: 720 N 720
>gi|77549235|gb|ABA92032.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 743
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 103/219 (47%), Gaps = 22/219 (10%)
Query: 89 RMVKWPEENLNDIPMTFL---RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALN 145
+++K + N DIP L R G F G +DGT + + + + +NR+ + N
Sbjct: 490 KILKPIDLNFTDIPRRLLQDSRFGPFQFAVGAIDGTHIPVTVGVDSAIEHMNRHDETTKN 549
Query: 146 VMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-PFPDAVILGDSAYPLK 204
V+ I G + + +A WPGSVHD RV+ ++ + EE R PF +L DS YP +
Sbjct: 550 VLTIVGFDGRVIFADAGWPGSVHDNRVLNEAI--DSYPEEFPRLPF-RKYLLVDSGYPSR 606
Query: 205 EWLIPPRTR-------------NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM 251
+ P R P +++F H + R +IE FGI+K+++ L +
Sbjct: 607 MGFLAPYPRVRYHKDQFKGPRAPPPEGREEKFNYIHAKLRNIIERQFGIVKKQWKILKGI 666
Query: 252 RLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
+P ++LA+ LHN + +++ N +++
Sbjct: 667 PYNPYKNVQSNIILAAFCLHNFRIDSKQNDFQANNPLYN 705
>gi|6979320|gb|AAF34413.1|AF172282_2 hypothetical protein [Oryza sativa]
Length = 809
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 103/219 (47%), Gaps = 22/219 (10%)
Query: 89 RMVKWPEENLNDIPMTFL---RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALN 145
+++K + N DIP L R G F G +DGT + + + + +NR+ + N
Sbjct: 556 KILKPIDLNFTDIPRRLLQDSRFGPFQFAVGAIDGTHIPVTVGVDSAIEHMNRHDETTKN 615
Query: 146 VMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-PFPDAVILGDSAYPLK 204
V+ I G + + +A WPGSVHD RV+ ++ + EE R PF +L DS YP +
Sbjct: 616 VLTIVGFDGRVIFADAGWPGSVHDNRVLNEAI--DSYPEEFPRLPF-RKYLLVDSGYPSR 672
Query: 205 EWLIPPRTR-------------NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM 251
+ P R P +++F H + R +IE FGI+K+++ L +
Sbjct: 673 MGFLAPYPRVRYHKDQFKGPRAPPPEGREEKFNYIHAKLRNIIERQFGIVKKQWKILKGI 732
Query: 252 RLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
+P ++LA+ LHN + +++ N +++
Sbjct: 733 PYNPYKNVQSNIILAAFCLHNFRIDSKQNDFQANNPLYN 771
>gi|331247932|ref|XP_003336592.1| hypothetical protein PGTG_17903 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 508
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 106/225 (47%), Gaps = 35/225 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP ++ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 122 LVTWPTKSERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 181
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 182 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 234
Query: 205 EWLIPP-------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF 257
+++P RN D F ++R IE++ GILK +F L +R +
Sbjct: 235 CYVVPAFKGKQLLNRRNID------FNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRN 288
Query: 258 A----GMV--VLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGE 296
A G V ++ LHN+ D ++Q ++++DEA +D E
Sbjct: 289 AEAMKGAVKWIVTCIVLHNLL----ADLKDQWNDLYEDEAPEDSE 329
>gi|391327964|ref|XP_003738464.1| PREDICTED: putative nuclease HARBI1-like, partial [Metaseiulus
occidentalis]
Length = 229
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 94/217 (43%), Gaps = 6/217 (2%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYH 58
+TP + RLS +Q ++LE I P + T RN AL + K+ L ++ G
Sbjct: 15 VTPSDLQNHTRLSESQFSYLLERIRPRIEKRDTTFRN-ALRPESKLHMTLRYLATGDSIG 73
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
+ A + +S+ V I D + ++ P EE + F + FPN G
Sbjct: 74 SLSALYRIPRSTFSSFFPVVCKAIYDAL-EEFIQMPSSEEEWKVVMQEFALRWQFPNACG 132
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DG V I P + +F N ++ +A+ Y F ++ G V+DA V S
Sbjct: 133 AIDGKHVTIRCPPKSGSEFFNYKKSFSIISLAVVDARYNFLYIDVGTNGRVNDAGVFEKS 192
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTR 213
R +E+ + V + D A+PL+ ++ P +R
Sbjct: 193 SFNRALEEQVLDVPIEGVFVADDAFPLRSNILKPFSR 229
>gi|357161564|ref|XP_003579131.1| PREDICTED: uncharacterized protein LOC100839156 [Brachypodium
distachyon]
Length = 481
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 87/200 (43%), Gaps = 35/200 (17%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + + P E V R G NVMA+C + +F ++ A WPGS HD
Sbjct: 248 FNGCIGAIDGTHIRVVVPAEDIANHVGRYGYPTQNVMAVCDFDMRFISIVAGWPGSAHDT 307
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----------------RTRN 214
R+ +++++ + E P L DS YP + + P R+
Sbjct: 308 RIFKDTLI--TYSENFPHPPSGRYYLVDSGYPNQTGYLAPYKGQKYHLPEFRRGIEEGRH 365
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA----GMVVLASTTLHN 270
P + F H + R +IE SFG+LK K+ L ++ P +A +++A LHN
Sbjct: 366 P-TGKKEIFNHAHSQLRNVIERSFGVLKMKWRILLHV---PSYAIEKQTRIIVACMALHN 421
Query: 271 IALTIDRDNENQNVEIHDDE 290
+ +HDDE
Sbjct: 422 WI---------RESALHDDE 432
>gi|331247668|ref|XP_003336461.1| hypothetical protein PGTG_18132 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 103/220 (46%), Gaps = 35/220 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + +R +++++
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPSDGQHYFDRKKRYSISLTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 274
Query: 205 EWLIPP-------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---- 253
++IP + RN D F ++R IE++ GILK +F L +R
Sbjct: 275 RYVIPAFKGKHLLKRRNID------FNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRN 328
Query: 254 SPQFAGMV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ + G V ++ LHN+ D ++Q ++++DEA
Sbjct: 329 AEEMKGAVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|328706661|ref|XP_003243166.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 428
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 109/256 (42%), Gaps = 17/256 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ R+SA+ E ++ + P + R LS E + + L ++ G I +
Sbjct: 71 YQNFLRMSASTFEELVCLVGPKIMRFPSRPDILSVGEVLTATLRYLASGESMMSIMYSFR 130
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+++ + I + V+ DT +++ P+ + + + F K PN G +DG ++
Sbjct: 131 IGKATVSKLIFQCCEVLWDTLNTKVLIVPDTKKWAQLGVEFENKWQVPNCIGSIDGKHIV 190
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
A + N G H++ ++A+C +Y F V+ G D V NS + + F
Sbjct: 191 HQAFANSGSENYNYKGSHSIILLAMCDASYNFTIVDIGADGRCSDGGVFSNSEMGKGFMA 250
Query: 185 EGWRPFPDA------------VILGDSAYPLKEWLI---PPRTRNPDAAADQRFLRCHKR 229
FP A LGD A+PL L+ P R + + F R
Sbjct: 251 NNLN-FPTAKDIDSNSGPIPYYALGDEAFPLLTNLMRPYPGRGKRKLPLNESIFNYRLSR 309
Query: 230 TRRLIENSFGILKEKF 245
RR IEN+FGI+ K+
Sbjct: 310 GRRTIENTFGIMASKW 325
>gi|331213343|ref|XP_003319353.1| hypothetical protein PGTG_01527 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 106/221 (47%), Gaps = 37/221 (16%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ F+++ + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFNQKQF-------LLADSAYTND 274
Query: 205 EWLIPPRTRNPDAAADQRFLRCHK--------RTRRLIENSFGILKEKFPCLNYMRL--- 253
+++P A ++FL+ ++R IE++ GILK +F L +R
Sbjct: 275 RYVVP-------AFKGKQFLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIR 327
Query: 254 -SPQFAGMV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ + G V ++ LHN+ D ++Q ++++DEA
Sbjct: 328 NAEEMKGAVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|147852015|emb|CAN82284.1| hypothetical protein VITISV_039752 [Vitis vinifera]
Length = 592
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 108/251 (43%), Gaps = 42/251 (16%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S +I + HEV+ +V+ F + M+ P N N I LR+ F + G +DGT++
Sbjct: 328 SSQTIHKYFHEVLVAMVN-FSKEMITPPSFNDSSNGISNRRLRQI-FKDAVGAIDGTLIH 385
Query: 125 IDAPNEHEDQFVNRN-GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
P + + G+ NVMA+C + F V W G+ HD+RV+ ++
Sbjct: 386 ACIPTNQQVPYRGXGRGECFQNVMAVCDFDMIFRFVVVGWEGTAHDSRVLTETI------ 439
Query: 184 EEGWRPFPDAVILGDSAYP-----------LKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
L D+AY ++ WL R+ ++ F +CH R R
Sbjct: 440 --------QKYYLVDAAYTHTRGFMAPYRNVRYWLSDFRSGGKXVGKEEIFNQCHARLRN 491
Query: 233 LIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI---DR-----DNENQN 283
+IE FG++K +FP L M S +V+ ++HN I DR DNE
Sbjct: 492 VIERVFGVVKARFPILKRMTPYSFTTQTKIVMTCFSIHNFLRQISVADRLFSEYDNE--- 548
Query: 284 VEIHDDEAEDD 294
VE+ D A +
Sbjct: 549 VELESDNANQN 559
>gi|357140134|ref|XP_003571625.1| PREDICTED: uncharacterized protein LOC100822756 [Brachypodium
distachyon]
Length = 449
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 41/315 (13%)
Query: 34 RNVALSAKEKVLSCLHWMGVGSPYHIIGA----THGPSKSSICRSIHEVVPVIVDTFFQR 89
R+ +A + L WM +G P I A H S +I R V+ + ++
Sbjct: 138 RSTRETASIECLGMCLWM-LGGPQTFIQAEDRFVH--STETIHRKFKHVLECL-NSLGGD 193
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCV---DGTMVLIDAPNEHEDQFVNRNGDHALNV 146
++K + + K +P+ GC+ DG + + P + R+G + NV
Sbjct: 194 IIKPTDPTFTSMHPKIRDKRFWPHFKGCIGAIDGPHIPVIVPASQTVNYTGRHGYTSQNV 253
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEW 206
+A+C + +F V A W GS HD R+ SM K+ P L DS YP ++
Sbjct: 254 LAVCDFDMRFTFVVAGWAGSAHDTRIFNYSM--DKYANTYHSPPEGTYYLVDSGYPNRKG 311
Query: 207 LIPP-----------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+ P + R P + F H R +IE SFG+LK+K +R P
Sbjct: 312 YLAPFKGQTYHLLEFQNRRPPTGKLEVFNHAHSSLRNVIERSFGVLKQK---CRILRDVP 368
Query: 256 QFA----GMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD---GEEEAVDDPDEIDE 308
Q+ M++ + LHN RD++ ++ E +A+++ G E A + E
Sbjct: 369 QYKIASQTMIINSCMALHNFI----RDSKLRDKEFDKCDADENYMPGGERAT---PLLGE 421
Query: 309 EEQPRQDRNNPGTQR 323
+ D +N T R
Sbjct: 422 DVSISHDESNMNTTR 436
>gi|449673827|ref|XP_004208041.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 398
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 106/250 (42%), Gaps = 23/250 (9%)
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVD 119
+ H + S + + + + IV + + K +E++ + F K GC+D
Sbjct: 134 VFGIHQCTVSKTVKVVCDAINNIVGPIYLHLPK-NKEDMTKLASQFEVKFDMIQAFGCID 192
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
GT V I P ++ + +LNV A+C F V WPGSVHDA++ NS +
Sbjct: 193 GTHVQIKRPIKNGQDYFCYKQYFSLNVQAVCDIKGYFIDVECKWPGSVHDAKMFTNSTIN 252
Query: 180 RKFDEE----------GWRPFPDAVILGDSAYPLKEWLIP--PRTRNPDAAADQRFLRCH 227
+KF + + P+ +I GD AYPL + I N + LR
Sbjct: 253 KKFIKGTLPQTLYSLPNYHSIPNYLI-GDPAYPLTNFCIKEFQSCSNNEEVIFNSMLR-- 309
Query: 228 KRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA---LTIDRDNE--N 281
R IE +FG LK ++ L + + + +V+ LHN T D + E N
Sbjct: 310 -SARNQIECAFGRLKARWGFLRKIIDIKIETVPIVIYTCFVLHNFCEKNKTYDLNEEEVN 368
Query: 282 QNVEIHDDEA 291
Q +E H +A
Sbjct: 369 QQIERHRSDA 378
>gi|221121048|ref|XP_002156289.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 328
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 84/189 (44%), Gaps = 12/189 (6%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++++C I E I + ++ P + +I F + FP G +DG V I
Sbjct: 34 RTTVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRI 93
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+AP + F N G + + ++AIC Y F V+ G +DA ++ S R F+ +
Sbjct: 94 EAPTKSGFSFYNYKGFYCMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN-K 152
Query: 186 GWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
G+ P V++GD + LK WL+ P + F R RR I+N
Sbjct: 153 GYFNLPKISELDSKVPPVLVGDDIFALKPWLMKPYPGKNFTVQQRVFNYRLSRARRTIKN 212
Query: 237 SFGILKEKF 245
SFGIL ++
Sbjct: 213 SFGILAARW 221
>gi|356507066|ref|XP_003522292.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 401
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 106/264 (40%), Gaps = 27/264 (10%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG----GFPNVAGCVDGTM 122
S +I R H V+ + F + ++K + + D P L+ F + G +DGT
Sbjct: 132 SGETISRHFHNVLEAVC-MFAKDIIKHVDPSFRDTPDEILKDARYCPYFRDCIGAIDGTH 190
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P + ++ R G NVM +C + F V A W GS HD ++ ++ RK
Sbjct: 191 IRVCVPYHLQGVYIGRKGYTTTNVMVVCDFSMCFTFVWAGWEGSAHDTKIFMEAL--RKP 248
Query: 183 DEEGWRPFPDAVILGDSAYP---------------LKEWLIPPRTRNPDAAADQRFLRCH 227
P L DS YP L ++ I PR R + F H
Sbjct: 249 ALHFPHPSQGKYYLVDSGYPTFMGFLGPYKKTRYHLPQFRIRPRIR----GRAEVFNYYH 304
Query: 228 KRTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
R IE +FG+ KE++ L M + + +++ +HN D+ + +
Sbjct: 305 SSLRSTIERAFGLCKERWKILGNMSPFALKTQNQIIVVCMAIHNFIQRNDKSDGEFDSLY 364
Query: 287 HDDEAEDDGEEEAVDDPDEIDEEE 310
D+E D E+E+ P EE
Sbjct: 365 EDNEDIDSDEDESEVGPSTTTWEE 388
>gi|449685623|ref|XP_002157724.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 368
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 116/273 (42%), Gaps = 16/273 (5%)
Query: 12 LSATQVEFILEEIAPALTTP--TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+S ++E +L I +LT + R+ ++ ++++ + ++ G ++
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCSSRD-SICPSQRLIITIRYLARGESQQTQSFYFRVGRA 59
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
++C I E I + ++ P + +I F + FP G +DG V I+A
Sbjct: 60 TVCHIIEETCCAIWKVLKKVFLRAPNDVKEWKNIIKEFDQNWNFPQCIGAIDGKHVRIEA 119
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P + F N G +++ ++ IC Y F V+ G +DA ++ S R F+ +G+
Sbjct: 120 PAKSGSFFYNYKGFYSMVLLEICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN-KGY 178
Query: 188 RPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSF 238
P V++ D + LK W + P + F R RR IENSF
Sbjct: 179 FNLPKISEFDPKVPPVLVEDDIFALKPWSMKPYPGKNLTVQQRVFNYRLSRARRTIENSF 238
Query: 239 GILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
GIL ++ ++ P ++ A+ LHN
Sbjct: 239 GILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 271
>gi|57834109|emb|CAE04774.3| OSJNBa0079C19.15 [Oryza sativa Japonica Group]
Length = 728
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 72/154 (46%), Gaps = 13/154 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT +L+ P Q V RN NV+A+C + +F + A WPGS HD
Sbjct: 569 FNNCIGAIDGTHILVVVPASKVVQHVGRNKYPTQNVLAVCDFDMRFTFIVAGWPGSAHDM 628
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPDAAADQR---- 222
RV +++ K+ P P L DS YP + + P + P+ A R
Sbjct: 629 RVFNDALC--KYATIFPHPPPGKFYLVDSGYPNRMGFLAPYKGTKYHLPEFRAGPRPSGK 686
Query: 223 ---FLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
F H R +IE SFG+LK K+ L +R+
Sbjct: 687 KEVFNHLHSSLRNVIERSFGVLKMKWRILLDLRV 720
>gi|357130831|ref|XP_003567049.1| PREDICTED: uncharacterized protein LOC100824798 [Brachypodium
distachyon]
Length = 225
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 75/149 (50%), Gaps = 15/149 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT V + P + + ++ +R + + NV+ +CGP+ K+ + W GS HD
Sbjct: 75 FKNCLGALDGTHVKVIVPVDIKRRYRSRKAEISTNVLGVCGPDMKYIYMLPGWEGSAHDG 134
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR----------TRNPDAAA 219
RV+R+++ G R D L D+ Y K +L P R +NP +A
Sbjct: 135 RVLRDAIS----RPNGLRVPADQYYLVDARYTNGKGFLAPYRGQRYHIGGWTAQNPPNSA 190
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCL 248
++ F CH + R ++E SF +K K+ L
Sbjct: 191 EEYFNMCHAKARNIVERSFARIKNKWAIL 219
>gi|331239088|ref|XP_003332198.1| hypothetical protein PGTG_13565 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 393
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 93/200 (46%), Gaps = 31/200 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 274
Query: 205 EWLIPP-------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---- 253
+++P + RN D F ++R IE++ GILK +F L +R
Sbjct: 275 RYVVPAFKGKQLLKRRNID------FNYHLAQSRVRIEHAIGILKGRFASLQEIRTQIRN 328
Query: 254 SPQFAGMV--VLASTTLHNI 271
+ + G V ++ LHN+
Sbjct: 329 AEEMKGAVKWIVTCIVLHNL 348
>gi|356575367|ref|XP_003555813.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 402
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 132/320 (41%), Gaps = 30/320 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-----ALSAKEKVLSCLHWMGVGSPYHII 60
F+ F++S +I + + V LS ++V L + G I
Sbjct: 66 FESVFKISRKTFNYICSLVEEDMLARASNFVDLNGNRLSLNDQVAVALRRLSSGESLSTI 125
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCV 118
G + ++S++ + + V + + + W E + +I F G N G V
Sbjct: 126 GESFRMNQSTVSQVTWKFVETMEERGLHHL-SWASTEMEMEEIKSKFENIRGLSNCCGAV 184
Query: 119 DGTMVLIDAP--NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
D T +++ P + +++R + ++ + AI P+ +F + WPGS+ D +V+R+S
Sbjct: 185 DSTHIMMTLPSVDALNSVWLDREKNCSMVLQAIVDPDLRFRDIVTGWPGSMSDEQVLRSS 244
Query: 177 MVYRKFDEEGWR------PFPDAV-----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K EEG R PD I+GD+ +PL WL+ P + F +
Sbjct: 245 SFF-KLAEEGKRLNGGKKTLPDGTLFREYIIGDTGFPLFSWLLTPYEGKGFSNVQVEFNK 303
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQ- 282
T+ + + + LK+ + + + P + ++L LHNI ID ++E
Sbjct: 304 RVVETQMVAKKALARLKDMWKIIQGVMWKPDKHKLPRIILVCCILHNIV--IDMEDEVLI 361
Query: 283 ---NVEIHDDEAEDDGEEEA 299
+ HD +D E A
Sbjct: 362 DMPSCHQHDSRYQDQTSEFA 381
>gi|449668824|ref|XP_004206878.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 338
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 98/219 (44%), Gaps = 16/219 (7%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR---MVKWPEENL 98
E++ L ++ G I A++ S S+I R I E I + +R V ++
Sbjct: 9 ERLAVTLRFLVTGDAQCTIAASYRISASTISRIISETCAAIWTSLKERNFLHVPSEKQEW 68
Query: 99 NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYA 158
I F FP+ G +DG +++ AP+ ++ N H++ ++A+C Y+F
Sbjct: 69 KTIAKEFENMWNFPHAIGAIDGKHIVMQAPHNGGSEYFNYKKTHSIVLLAVCNAKYEFTM 128
Query: 159 VNANWPGSVHDARVMRN-SMVYRKFDEEGWRPFPD----------AVILGDSAYPLKEWL 207
V+ G D V N S+ Y + + P P+ V++ D A+ LK +
Sbjct: 129 VDIGDSGRQSDGSVFNNCSLGYAIENNKLNIPDPEFIGNSEKVLPYVLVADDAFGLKRHM 188
Query: 208 IPPRTRNPDAAADQRFLRCH-KRTRRLIENSFGILKEKF 245
+ P N + + DQ+ R RR+IEN+FGI +F
Sbjct: 189 MKPYP-NQNISLDQKIFNYRLSRARRVIENTFGIATTRF 226
>gi|301108774|ref|XP_002903468.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262097192|gb|EEY55244.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 255
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 98/210 (46%), Gaps = 12/210 (5%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLN 99
+KV + L++MG Y +G G +S + EVV V+ Q +V +P + +
Sbjct: 48 KKVAAALYFMGSTGGYREVGGAMGMCRSYVSEITAEVVRVLRAAAPQ-IVAFPRDQGGWD 106
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I F + G+P V G +DG+++ ++ P E D F R ALNV AI + F +
Sbjct: 107 AIESEFAARHGYPGVVGAIDGSLIEVERPYEF-DGFYCRKCYPALNVQAIVTSDNVFLSA 165
Query: 160 NANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---RTRNPD 216
PGS D + + S + R P P A +GD+ Y L LI P R +
Sbjct: 166 EVR-PGSWSDRKCWQYSKIGRTVYST--IP-PGAHFIGDAGYALLPGLIVPYSDREEGGE 221
Query: 217 AAADQR-FLRCHKRTRRLIENSFGILKEKF 245
+ QR F H TR +E++FGI K +F
Sbjct: 222 LSPRQRQFNFLHSSTRMAVESTFGIWKGRF 251
>gi|357622030|gb|EHJ73650.1| putative transposase [Danaus plexippus]
Length = 298
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 106/219 (48%), Gaps = 17/219 (7%)
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP---EENLNDIPMTFLRKGGFPNVAG 116
+G G KSS+ HEVV ++ + +Q + WP E+++ + + K GFPN G
Sbjct: 46 VGQIFGLHKSSVSYIFHEVVMLLAEQRYQ-FINWPSLEEQHITRVKVN--SKYGFPNCVG 102
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D + + + + R + + + A+C F ++ G+ +V R +
Sbjct: 103 FLDACRLKVGSKRK------KRQVSNFILLQAVCDETLMFIDIHIGEIGNTVKGKVFRET 156
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ-RFLRCHKRTRRLIE 235
+ ++ + + F D ILGD Y L++ L+ P +R + ++ +F H + I
Sbjct: 157 QLSQEL--KNFIDF-DNHILGDCEYKLRKNLLIPFSREELVSNEEIKFNEIHWKAHSYIG 213
Query: 236 NSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNIAL 273
++FG+LKE+F LN++ ++ P ++ A+ LHN L
Sbjct: 214 SAFGLLKERFQKLNHIDINRPDAVQALICAACVLHNFVL 252
>gi|403179756|ref|XP_003338057.2| hypothetical protein PGTG_19587 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165340|gb|EFP93638.2| hypothetical protein PGTG_19587 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 116/266 (43%), Gaps = 28/266 (10%)
Query: 6 FKEAFRLSATQVEFILEEIA-----------PALTTPTERNVALSAKEKVLSCLHWMGVG 54
FK+A R + + ++L +IA P L P + + L E++ S G G
Sbjct: 105 FKQAVRTTKSGFTWLLRQIAFNPIFHSGSFRPQLPVPHQLALTL---ERLGS----NGNG 157
Query: 55 SPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+ I + +I ++ V+ I D V WP+++ ++ GF
Sbjct: 158 ASVGRISRNLSVGRGTIIKASRRVIRAIND-LGSTYVLWPDKDRRKEISDVMKAEGFEGC 216
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + L P+ + F +R +++N IC + WPGS D+ V
Sbjct: 217 IGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQIICDCDRFITGYMTGWPGSCGDSMVF 276
Query: 174 RNSMVYRKFDEEGWRPF-PDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKRT 230
+ MV++ E R F P ++ +SAY L IP + P A + F C R+
Sbjct: 277 KKMMVHK----EPERFFDPGQYLIANSAYELGVHCIPA-YKAPAAYIRENTEFNYCLARS 331
Query: 231 RRLIENSFGILKEKFPCLNYMRLSPQ 256
R E++ GILK ++ L ++RLS Q
Sbjct: 332 RVRNEHTIGILKGRWSSLQHLRLSIQ 357
>gi|331221192|ref|XP_003323271.1| hypothetical protein PGTG_04808 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 103/214 (48%), Gaps = 23/214 (10%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 274
Query: 205 EWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAG 259
+++P + + + F ++R IE++ GILK +F L +R + + G
Sbjct: 275 CYVVPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKG 334
Query: 260 MV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 335 AVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|449692247|ref|XP_004212957.1| PREDICTED: uncharacterized protein LOC101240156, partial [Hydra
magnipapillata]
Length = 230
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 73/148 (49%), Gaps = 5/148 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y ++ +
Sbjct: 80 FKEHFRVNRNTFNFLVNELYPHLGKTTTTMRE-PISVVKRVAVALHYLASREEYRVVSSL 138
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ +HE + + D + VK+P ENLN F GFP G VDG
Sbjct: 139 FGIGKSTANLIVHEFINAVYDILLPKYVKFPLSVENLNKRSKDFEAILGFPQCIGAVDGR 198
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAI 149
++ I AP + + N G +++ + A+
Sbjct: 199 LIPISAPRDQTISYYNYKGWYSIVLFAV 226
>gi|340382280|ref|XP_003389648.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 241
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 84/191 (43%), Gaps = 25/191 (13%)
Query: 92 KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
KW E I F FP+ G +DG ++I P + N ++ ++A+
Sbjct: 11 KWKE-----IATGFETYWQFPHCIGALDGKHIVIRPPPNSGSYYFNYKHTFSIVLLALVD 65
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR---PFPDA--------VILGDSA 200
+YKF VN G RNS + + PFP +I+ D A
Sbjct: 66 ADYKFTYVNIGCNG--------RNSSLCAALETNSLNVPLPFPICEDGIPLPYMIVADEA 117
Query: 201 YPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAG 259
+PLK ++ P + + F C R RR++EN+FGIL +F + +RLSP A
Sbjct: 118 FPLKTYIQKPYAQIGLTKEKRIFNYCLSRARRIVENAFGILANRFQVFMTPIRLSPDKAE 177
Query: 260 MVVLASTTLHN 270
+VLA +LHN
Sbjct: 178 TIVLACCSLHN 188
>gi|108707497|gb|ABF95292.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 311
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 77/173 (44%), Gaps = 14/173 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT + + P Q V RN NV+AIC + +F + A WPGS HD
Sbjct: 86 FNNCIGAIDGTHIPVVVPASKVVQHVGRNKYPTQNVLAICDFDMRFTFIVAGWPGSAHDM 145
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK----------EWLIPPRTRNPDAAAD 220
RV +++ RK+ P P L DS YP + ++ +P P +
Sbjct: 146 RVFNDAL--RKYAAIFPHPPPGKFYLVDSGYPNQLGYLSPYKGTKYHLPEFREGPRPSGK 203
Query: 221 QR-FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNI 271
+ F H R +IE SFG+LK K+ L + P ++ A LHN
Sbjct: 204 KEVFNHLHSSLRNVIERSFGVLKMKWRILLDLPSYPMLKQTKIIHACMALHNF 256
>gi|328698243|ref|XP_001948773.2| PREDICTED: hypothetical protein LOC100166743 [Acyrthosiphon pisum]
Length = 302
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 95/214 (44%), Gaps = 20/214 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP G +DG +++ AP ++ N G ++ ++A +Y F + G + D
Sbjct: 56 FPQCVGPIDGKHIVMQAPFNSGTEYFNYKGTFSVVLLAAVDAHYCFIFASVGCQGRISDG 115
Query: 171 RVMRNSMVYRK-FDEEGWRPFPDA----------VILGDSAYPLKEWLIPPRTRNPDAAA 219
V NS++ +K +DE P P A V + D+A+PLKE L+ P +
Sbjct: 116 GVFNNSILAKKMYDESLNLPSPKALPGQKEKFPYVFVCDNAFPLKENLMKPFPGTHLGGS 175
Query: 220 DQR-FLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDR 277
QR F R RR++EN+FGI+ F L + L P+ +VLAS T + +
Sbjct: 176 PQRAFNYRLSRARRVVENTFGIMASVFRVLRKPLLLQPEKINTIVLASLTGALVGAGVAH 235
Query: 278 DNENQNVEIHDDEAE-------DDGEEEAVDDPD 304
E ++ + +E GE D+PD
Sbjct: 236 SAEVLDMPVDPNEPTYCLCNQVSYGEMIGCDNPD 269
>gi|242088183|ref|XP_002439924.1| hypothetical protein SORBIDRAFT_09g022640 [Sorghum bicolor]
gi|241945209|gb|EES18354.1| hypothetical protein SORBIDRAFT_09g022640 [Sorghum bicolor]
Length = 390
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 65/293 (22%), Positives = 126/293 (43%), Gaps = 24/293 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RN------VALSAKEKVLSCLHWMGVGSPY 57
+ ++S +++ + LTT T RN L +++V L + G
Sbjct: 56 MESVLKMSRKTFDYVCSLVKKDLTTKTYGFRNFRFGDKTILGVEDQVAVALMRLTTGESL 115
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + KWP E + I F + G PN G
Sbjct: 116 QNIGMWFGMNHSAISNITWRFIESVEERAICHL-KWPSHEEMATIKARFDKIYGLPNCCG 174
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + +++ +++ + A+ + +F + + WPGS+ D+ ++R
Sbjct: 175 AIDTTHILMCSSAQPNSKVWLDNENKNSMVLQAVVDADMRFRDIVSGWPGSMDDSCILRT 234
Query: 176 SMVYRKFDEEGWR-------PFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
S +YR E+G R P AV I+GD++YPL WL+ P + AA F +
Sbjct: 235 SGLYR-LCEKGLRLEGQMELPGGSAVREYIVGDASYPLLPWLMTPYRGHGLPAAKVEFNK 293
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTID 276
H +++ + LK ++ + P + ++ + NI + +D
Sbjct: 294 RHTAATAVVQTALATLKGRWRVIQGELWRPDKHRLPRIIFVCCLITNIIIDMD 346
>gi|397634896|gb|EJK71629.1| hypothetical protein THAOC_06907 [Thalassiosira oceanica]
Length = 485
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 120/285 (42%), Gaps = 22/285 (7%)
Query: 19 FILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEV 78
FI EI ALT + LS L G + Y I H + IC +
Sbjct: 152 FISGEIKLALTL----RLLAGGSYLDLSLLFECGSSTAYEIF---HKVIREWICSKDKPL 204
Query: 79 VPVIVDTFFQRMVKWPEENLNDIPMTFLRK--GGFPNVAGCVDGTMVLIDAPNEHED--- 133
V + F EE + + + F R G F G +DG +V I P++ ++
Sbjct: 205 VNINGKDFID-----DEERMAAVALEFARSSAGLFSGCIGAIDGWVVKIKKPSQRDNVGN 259
Query: 134 --QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP 191
F +R G + LNV+ IC + N G+ HD+ +NS +YRK ++ R
Sbjct: 260 VASFYSRKGFYGLNVVVICDRKKRILYRVINSRGAEHDSTAFKNSSLYRKLMDDCDRLLE 319
Query: 192 DAV-ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-N 249
+GDSAY ++ +L+ P + F H +R IE +FG + ++ L +
Sbjct: 320 KGFHFIGDSAYAIRSFLLTPFDNAVHGTPEDNFNFFHSSSRICIECTFGEVDLRWGILWS 379
Query: 250 YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
++ S + V+ A LHN + R++ +NV + + DD
Sbjct: 380 PLKFSLRNNIKVIDACLMLHNF-IVDHRESLGENVSSMERDVFDD 423
>gi|356572216|ref|XP_003554266.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 392
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 135/322 (41%), Gaps = 28/322 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
FK FR+S T E+I + L + + LS +++V L + G
Sbjct: 60 GFKYFFRVSKTTFEYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 119
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + + + WP+ N + +I + F G PN G
Sbjct: 120 SVGAAFGVGQSTVSQVTWRFIEAL-EERAKHHLNWPDFNRMQEIKLGFEVSYGLPNCCGA 178
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P + D + ++ ++++ + I +F + PG + +R+++ S
Sbjct: 179 IDATHIMMTLPAVQTSDDWCDQEKNYSMLLQGIVDHEMRFIDIMTGLPGGMSVSRLLKCS 238
Query: 177 MVYRKFDEEGWR------PFPDAVI----LGDSAYPLKEWLIPPRTRNPD---AAADQRF 223
+ K E G R F VI +G +YPL WL+ P N + + F
Sbjct: 239 AFF-KLSENGERLNGNVKAFGGDVIREYVVGGCSYPLLPWLMTPYETNGANGISVSQSIF 297
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
H + L +F +LK + L+ + P + ++L LHNI + D
Sbjct: 298 NHKHGAAKLLAVRAFSLLKGSWRILSKVMWRPDKRKLPSIILTCCLLHNIIIDCG-DTLQ 356
Query: 282 QNVEI--HDDEAEDDGEEEAVD 301
Q+V + H D + + VD
Sbjct: 357 QDVALSGHHDSGYQEQCCKQVD 378
>gi|403171294|ref|XP_003330542.2| hypothetical protein PGTG_12079 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169116|gb|EFP86123.2| hypothetical protein PGTG_12079 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 132/312 (42%), Gaps = 40/312 (12%)
Query: 6 FKEAFRLSATQVEFILEEI-----------APALTTPTERNVALSAKEKVLSCLHWMGVG 54
FK+A R + ++L ++ P L P + + L E++ S + VG
Sbjct: 105 FKQAVRTTKAGFTWLLSQVNLNPIFHSNSFRPQLPIPHQLALTL---ERLGSNGNGASVG 161
Query: 55 SPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+G G ++ ++ V+ I D ++ + WP+E ++ GF
Sbjct: 162 RFSRNLGVGRG----TVVKASRRVIRAIND-LSEKYLTWPDEVRRKEISDVMKCEGFEGC 216
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + L P+ + F +R +++N +C + A WPGS D+ V
Sbjct: 217 VGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVVCDCDRFITAFMTGWPGSCGDSMVF 276
Query: 174 RNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR--FLRCH 227
+ M++++ FD + ++ DSAY L IP + P A + F C
Sbjct: 277 KRMMLHKEPTLFFDRGQY-------LIADSAYELGVHCIPA-YKAPAAYIKENSDFNYCL 328
Query: 228 KRTRRLIENSFGILKEKFPCLNYMRLSPQFAG--MVVL----ASTTLHNIALTIDRDNEN 281
R+R E++ GILK ++ L ++RL+ Q M V+ TLHNI +
Sbjct: 329 ARSRVRNEHTIGILKGRWASLQHLRLAIQKPSDMMEVIRWVNCCVTLHNILAHLGDAWAE 388
Query: 282 QNVEIHDDEAED 293
+V I+D D
Sbjct: 389 LDVSINDHPGSD 400
>gi|449687295|ref|XP_002156763.2| PREDICTED: uncharacterized protein LOC100211227 [Hydra
magnipapillata]
Length = 306
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 4/113 (3%)
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
V+RNS ++ KF E G PF AVIL DSAYP +EWLIP +PD +RF +++TR
Sbjct: 115 VIRNSNLWNKF-ENGEFPFNGAVILADSAYPCREWLIPHFPGDPD-GTQKRFNIAYRKTR 172
Query: 232 RLIENSFGILKEKFPCLNY-MRLSP-QFAGMVVLASTTLHNIALTIDRDNENQ 282
IE FGI+K +F L +RL + A +++ +HN+ L + +Q
Sbjct: 173 NTIERCFGIVKNRFYVLKTGIRLHKVEDASKLIMCGFVIHNLCLRYGDNGNDQ 225
>gi|449688289|ref|XP_004211707.1| PREDICTED: uncharacterized protein LOC100198190 [Hydra
magnipapillata]
Length = 347
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 124/308 (40%), Gaps = 26/308 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F FR+S T +E +L + P + T +SA E+ + ++ G I +
Sbjct: 47 FFRYFRMSPTILEELLSWVCPLIRKKSTHMRDCISAPERFCVTMRFLVTGDAQKTIAMNY 106
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S S + R I E I W + +I K FP+ G +D V+
Sbjct: 107 RMSPSVVGRIISETCRAI----------WLLTS--EIDYKIRNKWNFPHCLGAIDVKHVV 154
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ AP F N H++ ++A+C Y+F ++ G +D V NS + +
Sbjct: 155 MQAPACSGSSFYNYKKTHSIVLLAVCNAKYQFSLLDIGNSGRQNDGSVYANSQLGYAIEN 214
Query: 185 E-----------GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
+ G V +GD+++ LK ++ P + + F R RR+
Sbjct: 215 DLLDIPQACKVNGTETILLHVFVGDNSFGLKPQMMKPYPFCNLSYKKRIFNYGLSRVRRI 274
Query: 234 IENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAE 292
+EN+FGI +F + S + ++ A LHN ++ ++N N + + +
Sbjct: 275 VENAFGIAASRFRIFRRPIIASTKKVVLITKAVVALHNFLMSKRKENLNYDY-CPNTYID 333
Query: 293 DDGEEEAV 300
DG+ E +
Sbjct: 334 QDGQNELL 341
>gi|359479713|ref|XP_003632343.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 400
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 122/290 (42%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
FK FR+S ++I + L + + LS +++V L + G
Sbjct: 60 GFKFFFRVSKKTFDYICSLVRQDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 119
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + V+ + ++WP+ N + +I F G N G
Sbjct: 120 SVGAAFGVGQSTVSQVTWRFIEA-VEERAKHHLRWPDFNRMEEIKSKFETSYGLSNCCGA 178
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P + D + ++ ++++ + + +F + WPG + +R+++ +
Sbjct: 179 IDATHIIMTLPAVQTSDDWCDQENNYSMFLQGVVDDEMRFLDIVTGWPGGMTVSRLLKCT 238
Query: 177 MVYRKFDEEGWRPFPDAVILGDSA-----------YPLKEWLIPPRTRNPDAAADQRFLR 225
YR E G R + IL + A YPL WLI P N ++ F
Sbjct: 239 GFYRNC-EAGERLNGNVRILSEGAEIREYVVGGVGYPLLPWLITPCEYNDIPSSMSGFKT 297
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
H+ R L +F LK + L+ + P + ++L LHNI +
Sbjct: 298 NHEAARSLAVRAFSQLKGTWRILSKVMWRPDKRKLPSIILVCCLLHNIII 347
>gi|328699615|ref|XP_003240987.1| PREDICTED: hypothetical protein LOC100571096 [Acyrthosiphon pisum]
Length = 375
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 99/238 (41%), Gaps = 17/238 (7%)
Query: 50 WMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRK 108
++ G Y + S S I I E++ I + + P E +L + F
Sbjct: 99 FLATGESYRSLSFQFRISHSWISTIIKELLVAICNRLKNITMPEPTEHSLKKVSNDFYEM 158
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
FPN G +DG + I P+ + N ++ ++A+C NYKF V+ G
Sbjct: 159 WNFPNCCGAIDGKHIRIVCPDSSGSLYFNFKSFFSVVLLALCDANYKFLVVDIGSYGKEG 218
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDA------------VILGDSAYPLKEWLIPPRTRNPD 216
DA + S + K G FPD V +GD + L E ++ P RN
Sbjct: 219 DAGIFPKSNL-GKLISTGKFKFPDPECLPNTDIVVPHVFVGDEGFKLTETMMRPYPRNQS 277
Query: 217 AAADQRFLRCHK--RTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNI 271
+ + ++ R RR EN+FGI+ + F + + P+ +++AS +HN+
Sbjct: 278 KTDQTKAIFNYRLSRARRTTENTFGIMCQNFRVFFTPINILPETVDNLIMASCIIHNL 335
>gi|345484037|ref|XP_003424931.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 271
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 97/218 (44%), Gaps = 16/218 (7%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S++ + I EV V+ + + +P + I F+ FPN G +DG I
Sbjct: 51 STVKQVIPEVCTVLCEVLMPLFLSFPSRQQFQVIANEFMEDLHFPNCIGALDGKHCRIRK 110
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR-----NSMVYRKF 182
P F N H++ +MA C +F N GS +DA V N ++ +
Sbjct: 111 PGGSGSLFFNFKNFHSIVLMACCDSKKRFIWANIGDYGSCNDASVFAESDFGNVLLNNQI 170
Query: 183 DEEGWRPFPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIE 235
+P P+ V++GD +PLK++L+ P RN + R RR+IE
Sbjct: 171 QLPPSQPLPNTHIQSPYVLIGDGGFPLKDYLMTPFLRNENITIPHRVFNYRLSHARRIIE 230
Query: 236 NSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNI 271
++FG + E++ +N L + A ++++S LHN+
Sbjct: 231 SAFGEVTERW-LVNESSLKWKLATSERIIISSLCLHNV 267
>gi|331228853|ref|XP_003327093.1| hypothetical protein PGTG_08870 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 103/220 (46%), Gaps = 35/220 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R + ++V
Sbjct: 162 LVTWPTESERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYCISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS H+A V + V ++ FD + + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHNAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 274
Query: 205 EWLIPP-------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---- 253
+++P + RN D F ++R IE++ GILK +F L +R
Sbjct: 275 RYVVPAFKGKQLLKRRNID------FNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRN 328
Query: 254 SPQFAGMV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ + G V ++ LHN+ D ++Q ++++DEA
Sbjct: 329 AEEMKGAVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 364
>gi|81051977|gb|ABB55331.1| Zinc transporter, putative [Asparagus officinalis]
Length = 662
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 135/325 (41%), Gaps = 37/325 (11%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGA 62
P AFR+ + + EE+ RN ++ EKV ++ + +G +
Sbjct: 168 PIRCVNAFRMESRVFLQLCEELQSKYGLTPSRN--MTVVEKVGIFVYTIALGLSNRDVCE 225
Query: 63 THGPSKSSICRSIHEVVPVIVDT------FFQRMVKWPEENLNDIPMTFLRKGG----FP 112
S +I R+I +V+ I + +++ + N IP F
Sbjct: 226 RFQRSGETISRTITQVLEAICGRNKGYMGLARDVIQPKDRNFQFIPPQIANDTRYMPYFK 285
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
+ GC+DGT V P + ++ R NVMA+C + F ++ W GS HD RV
Sbjct: 286 DCIGCIDGTHVAACIPEADQLRYRGRKDIPTFNVMAVCDFDICFTFLSVAWEGSAHDTRV 345
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP-PRTRNPDAA--------ADQR 222
+++ + P L D YP + +L+P P+ R + A +
Sbjct: 346 FLHAINTPTMNFP--HPRQGQYYLVDKGYPDRLGYLVPYPKIRYHQSQFQREAPTNAKET 403
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHNIALTIDRD 278
F R H R IE SFG+LK+++ L+ M PQ++ +++A+ LHN D
Sbjct: 404 FNRAHSSLRSCIERSFGVLKKRWKILHQM---PQYSVKTQIDIIMAAFALHNYIRINSVD 460
Query: 279 N------ENQNVEIHDDEAEDDGEE 297
+ E Q + +DE +D+ E+
Sbjct: 461 DPLFTVLEQQPNYVPEDETQDELED 485
>gi|328699471|ref|XP_003240943.1| PREDICTED: hypothetical protein LOC100570252 [Acyrthosiphon pisum]
Length = 348
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 123/286 (43%), Gaps = 21/286 (7%)
Query: 12 LSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSC-LHWMGVGSPYHIIGATHGPSKS 69
+S+T E ++ + P ++ P R L E +LSC L ++ G + + S
Sbjct: 1 MSSTTFEELVCLVGPYVSRFPNFRKDTLVVGE-ILSCTLRYLASGDSMMSLVYSFRMGHS 59
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+I + I E V+ ++++ P N F K P+ +DG ++ A
Sbjct: 60 TISKLICECCLVLWQVLEKKVLLTPSIFNWKKTAREFEDKWNLPHCVAAIDGKHIVHQAF 119
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF------ 182
+ N G H+ ++A+C NY F V+ G D V R+S + + F
Sbjct: 120 SNDGSTHFNYKGTHSTVLLAMCDANYNFLLVDIGALGRCSDGGVFRSSNIGKAFANKTIH 179
Query: 183 -----DEEGWR-PFPDAVILGDSAYPLKEWLI---PPRTRNPDAAADQRFLRCHKRTRRL 233
D +G P P I+GD A+PL ++L+ P R R+ ++ F R RR
Sbjct: 180 LPEPVDIDGVNGPIP-YFIVGDEAFPLTKYLMRPYPGRGRSTMPKDEEIFNYRLSRARRT 238
Query: 234 IENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNIALTIDRD 278
IEN+FGIL +F ++ + + + A+ LHN ++ D
Sbjct: 239 IENAFGILASRFRIFRKPIIASEKTIINITKATIVLHNFIKKLETD 284
>gi|58263791|ref|XP_569175.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134108270|ref|XP_777086.1| hypothetical protein CNBB3180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259771|gb|EAL22439.1| hypothetical protein CNBB3180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223825|gb|AAW41868.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 427
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 109/262 (41%), Gaps = 35/262 (13%)
Query: 39 SAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVD-TFFQRMVKWP--- 94
+A E + L ++ + + IG S I IH+VV + + R VK+P
Sbjct: 113 TAAEHLAIALAYLRKKTSVNWIGEEFKKSNEFISFCIHDVVNALTSKVIYDRWVKYPTGR 172
Query: 95 ---EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
+ D P+ R F G +DGT + + + + NR G ++NV+A C
Sbjct: 173 EPPNAKVADDPLNRYRH--FHGAVGSIDGTHIAVKVSPRMQPAYRNRKGVVSINVLAACD 230
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP----DAVILGDSAYPLKEWL 207
+ KF + A W GS +D+ N V+R FP + L D+ +P+ + L
Sbjct: 231 FDLKFTYIIAGWEGSANDSFTF-NQAVHRY-------NFPQLPSNRYFLADAGFPICDQL 282
Query: 208 IPPR------------TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN--YMRL 253
+ P +R + + F H + R +E FGI K+++ L+ +
Sbjct: 283 LTPYRGTRYHLADFHPSRGAPQSEKEVFNLAHAQLRNSVERIFGITKQRWGVLSGGLEKF 342
Query: 254 SPQFAGMVVLASTTLHNIALTI 275
++LA LHN+ ++I
Sbjct: 343 KGGMQSQIILACCVLHNMCISI 364
>gi|58266772|ref|XP_570542.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134110346|ref|XP_776000.1| hypothetical protein CNBD0500 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258668|gb|EAL21353.1| hypothetical protein CNBD0500 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226775|gb|AAW43235.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 427
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 109/262 (41%), Gaps = 35/262 (13%)
Query: 39 SAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVD-TFFQRMVKWP--- 94
+A E + L ++ + + IG S I IH+VV + + R VK+P
Sbjct: 113 TAAEHLAIALAYLRKKTSVNWIGEEFKKSNEFISFCIHDVVNALTSKVIYDRWVKYPTGR 172
Query: 95 ---EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
+ D P+ R F G +DGT + + + + NR G ++NV+A C
Sbjct: 173 EPPNAKVADDPLNRYRH--FHGAVGSIDGTHIAVKVSPRMQPAYRNRKGVVSINVLAACD 230
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP----DAVILGDSAYPLKEWL 207
+ KF + A W GS +D+ N V+R FP + L D+ +P+ + L
Sbjct: 231 FDLKFTYIIAGWEGSANDSFTF-NQAVHRY-------NFPQLPSNRYFLADAGFPICDQL 282
Query: 208 IPPR------------TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN--YMRL 253
+ P +R + + F H + R +E FGI K+++ L+ +
Sbjct: 283 LTPYRGTRYHLADFHPSRGAPQSEKEVFNLAHAQLRNSVERIFGITKQRWGVLSGGLEKF 342
Query: 254 SPQFAGMVVLASTTLHNIALTI 275
++LA LHN+ ++I
Sbjct: 343 KGGMQSQIILACCVLHNMCISI 364
>gi|242047560|ref|XP_002461526.1| hypothetical protein SORBIDRAFT_02g004140 [Sorghum bicolor]
gi|241924903|gb|EER98047.1| hypothetical protein SORBIDRAFT_02g004140 [Sorghum bicolor]
Length = 436
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 120/290 (41%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S + ++I + L + + LS +++V + + G
Sbjct: 95 AFRHFFRTSRSTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 154
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+G G +S++ + + + D + + WP+++ L +I G PN G
Sbjct: 155 SVGIAFGVGQSTVSQVTWRFIESMEDRA-RHHLSWPDQDRLEEIKAVLEEVYGLPNCCGG 213
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDGT V + P E + + + ++++ + + +F + WPGS+ +R+M+ S
Sbjct: 214 VDGTHVTMTLPAVESSEDWCDHAKNYSMFLQGVVDDQMRFIDIVTGWPGSMTFSRLMKYS 273
Query: 177 MVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E G R + GDS YPL WL+ P +A Q F
Sbjct: 274 GFF-KLCEAGERLNGSVKVSAEGAEIREYVAGDSCYPLLPWLMTPYEGKNLSAPLQNFNV 332
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
K R L N+ LK + L+ + P + ++L LHNI +
Sbjct: 333 RQKAARLLGTNALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNILI 382
>gi|338808398|gb|AEJ07903.1| putative PIFa transposase [Zea mays subsp. mexicana]
Length = 558
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 104/225 (46%), Gaps = 23/225 (10%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
+S +EK+ L +G + +I +T+G S I R +EV+ I+ + +K P E
Sbjct: 75 VSVEEKLAIFLLIVGHNTKMRLIRSTYGWSLEPISRHFNEVLRGIL-SLSHEFIKLPNPE 133
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
L + P + F + G +DGT + ++ P + ++ NR NV+ +C K
Sbjct: 134 TTLPEDP----KWKWFEDCLGALDGTHIDVNVPLTDQGRYRNRKQRITTNVLGVCDRQMK 189
Query: 156 FYAVNANWPGSVHDARVMRNSMV-----------YRKFDEEGWRPFPDAVI-LGDSAYPL 203
F V A W GS D+R++R++M Y D G+ P + + Y L
Sbjct: 190 FLYVLAGWEGSASDSRILRDAMSREDSFVVPSGKYYLVD-AGYTNGPGFLAPYRSTRYHL 248
Query: 204 KEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
EW I + NP A + LR H R +IE +FG+LK ++ L
Sbjct: 249 NEWAI--QGNNPSTARELFNLR-HATARNVIERTFGLLKMRWAIL 290
>gi|449683473|ref|XP_004210370.1| PREDICTED: uncharacterized protein LOC101240658 [Hydra
magnipapillata]
Length = 223
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 103/242 (42%), Gaps = 33/242 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL--TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR++ F++ E+ P L TT T R +S ++V LH++ Y ++ +
Sbjct: 6 FKEHFRVNRNTFNFLVNELHPHLGKTTTTMRE-PISVVKRVAVALHYLASCEEYCVVSSL 64
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
G KS+ +HE + + D + VK+P ENLN F GFP G VD
Sbjct: 65 FGIGKSTANLFVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAILGFPQCVGAVDEC 124
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ I AP + + N +++ + ++ Y+F + PG +D R
Sbjct: 125 HIPILAPKDQAISYYNYKW-YSIVLFSVVDCRYRFIYTSVGSPGRNNDGR---------- 173
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGI 240
+SA+PL L+ P N + + Q+ F + RR+++ G
Sbjct: 174 ----------------NSAFPLTRHLLKPYPENLELSEIQKDFNKILCGARRVLKMLLGA 217
Query: 241 LK 242
LK
Sbjct: 218 LK 219
>gi|449671240|ref|XP_004207455.1| PREDICTED: uncharacterized protein LOC101234440 [Hydra
magnipapillata]
Length = 588
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 88/209 (42%), Gaps = 13/209 (6%)
Query: 12 LSATQVEFILEEIAPALTTPTERN-VALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + E +L +AP++T + R+ +S E++ L + G I +++
Sbjct: 1 MSPERYEHLLSMVAPSITKKSCRSRQTISPSERLTVTLRYFATGDSQQIQSFYFRLGRTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+C +E+ I D +K PE + I F + FPN G +DG V I+AP
Sbjct: 61 VCNITNEITKAIWDVLQPSYLKAPESSDEWEKIANEFENEWNFPNCIGAIDGKHVCIEAP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW- 187
+ N H++ ++AIC Y F V+ G +DA + S + F +
Sbjct: 121 VSSGSAYYNYKNYHSMVLLAICDAKYCFTLVDIGSYGRDNDASIFNESKMGNAFKNNSFK 180
Query: 188 ----RPFPDA-----VILGDSAYPLKEWL 207
R P V++GD + LK W
Sbjct: 181 LPKNRQLPIGTQVPPVLVGDDIFALKSWF 209
>gi|331224990|ref|XP_003325166.1| hypothetical protein PGTG_06703 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 402
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 135/331 (40%), Gaps = 41/331 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALS-AKEKVLSCLHWMGV---GSPYHIIG 61
FK+ FR+S ++ EI + NV E+++ L MG G+ ++
Sbjct: 84 FKQEFRMSQQSFCRLILEIQDHPVFHNQSNVPQRPVPEQLMVTLKRMGTHGNGASVGMLA 143
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGT 121
S+ ++ V+ I+ ++ EE +I GF N G +DGT
Sbjct: 144 RFFRISEGTVILYCSRVIEAILS------LESTEER-QEIAWQISSHTGFRNCVGFIDGT 196
Query: 122 MV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ L + P+ + +R G + L + +C + WPG HD R+ N+ +
Sbjct: 197 LFPLSEKPSIDPQDYYSRKGHYGLAALIVCNEAKRIIYYVTGWPGCCHDTRLWENTELKL 256
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA---ADQRFLRCHKRTRRLIENS 237
+ D+ P ++ DS +P + ++P R P A A +RF + R E+
Sbjct: 257 QKDQ---LFSPGQYLIADSGFPPETNIVPAFKRPPHGAMERARKRFNQHLSSLRVCNEHC 313
Query: 238 FGILKEKFPCLNYMRLSPQFAGMV------VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
G+LK +F L +R AG + + A LHN L+ +Q+ ++ D
Sbjct: 314 IGLLKGRFQSLRGLRKDLNSAGTMEKITHWISACVILHNFLLS------DQSPDVFTD-- 365
Query: 292 EDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQ 322
VDD D I + R+ RN GTQ
Sbjct: 366 --------VDDND-IHGHDGAREPRNELGTQ 387
>gi|356504953|ref|XP_003521257.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 394
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 134/322 (41%), Gaps = 28/322 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
FK FR+S T E+I + L + + LS +++V L + G
Sbjct: 60 GFKYFFRVSKTTFEYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 119
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + + + WP+ N + +I + F G PN G
Sbjct: 120 SVGAAFGVGQSTVSQVTWRFIEAL-EERAKHHLNWPDFNQMQEIKLGFEASYGLPNCCGA 178
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P + D + ++ ++++ + I +F + PG + +R+++ S
Sbjct: 179 IDATHIMMTLPAVQTSDDWCDQEKNYSMLLQGIVDHEMRFVDIVTGLPGGMSVSRLLKCS 238
Query: 177 MVYRKFDEEGWR------PFPDAVI----LGDSAYPLKEWLIPPRTRNPD---AAADQRF 223
+ K E G R F VI +G +YPL WL+ P N + F
Sbjct: 239 AFF-KLSENGERLNGNVKAFGGDVIREYVVGGCSYPLLPWLMTPYETNGANGVSVPQSTF 297
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
H + L +F +LK + L+ + P + ++L LHNI + D
Sbjct: 298 NHKHGAAKLLAVRAFSLLKGSWRILSKVMWRPDKRKLPSIILTCCLLHNIIIDCG-DTLQ 356
Query: 282 QNVEI--HDDEAEDDGEEEAVD 301
Q+V + H D + + VD
Sbjct: 357 QDVALSGHHDSGYQEQCCKQVD 378
>gi|322782997|gb|EFZ10714.1| hypothetical protein SINV_05513 [Solenopsis invicta]
Length = 195
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 77/161 (47%), Gaps = 13/161 (8%)
Query: 118 VDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T + ++ E + + NR G +LNV I + V A WPGS HD+ + +S
Sbjct: 1 IDCTHIKVLSYGGEDSELYRNRKGYFSLNVQVITNAKLEIIDVVARWPGSTHDSTIFNHS 60
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
+ F+ F + ++LGDS YP +L+ P NP A+ + +T IE
Sbjct: 61 RIKSLFEAN---RFNNGLLLGDSGYPNISYLMTP-LLNPSTPAEHLYNEAQIQTHSTIER 116
Query: 237 SFGILKEKFPCLNYMRLSPQFAGM-----VVLASTTLHNIA 272
FGI K +F L+ + +F + ++ A+ LHNIA
Sbjct: 117 CFGIWKRRFAVLS---IGSRFQTVEKILPIITATAILHNIA 154
>gi|82570160|gb|ABB83644.1| PIF-like transposase [Daucus carota]
Length = 425
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 104/241 (43%), Gaps = 26/241 (10%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG----GFPNVAGCVDGTM 122
S +I R HEV+ + +F + ++K + I L F + G +DGT
Sbjct: 149 SGETISRVFHEVLRSVC-SFAKELIKPDDPEFKKIASHILNDQRYMPHFKDCIGAIDGTH 207
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V P + +F+ R NVMA+C + +F V A W G+ HD R+ ++
Sbjct: 208 VHACVPVNDQVRFIGRKNLPTQNVMAVCSFDMRFTFVLAGWEGTAHDTRIFNAAVNTPHL 267
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPDAAADQR-------FLRCHKRTR 231
+ P + L D+ YP + P R PD Q F + H R
Sbjct: 268 NFPS--PPQNKYYLVDAGYPQTLGYLGPYKGVRYHLPDFRRGQAPEGYQEIFNKAHSSLR 325
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHN-IALTIDRDNENQNVEI 286
IE +FG+ K+++ L YM PQ++ +V+AS LHN I L+ +D V++
Sbjct: 326 SCIERTFGVWKKRWKILAYM---PQYSFRSQRDIVVASMALHNYIRLSSIKDTIFHEVDM 382
Query: 287 H 287
H
Sbjct: 383 H 383
>gi|331251333|ref|XP_003338265.1| hypothetical protein PGTG_19700 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 443
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 104/214 (48%), Gaps = 23/214 (10%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP ++ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 210 LVTWPTKSERIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 269
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HD+ V + V ++ FD++ + +L DSAY
Sbjct: 270 ICDINKKFISYLAGFPGSSHDSYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTND 322
Query: 205 EWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL----NYMRLSPQFAG 259
+++P + + + F ++R IE++ GILK +F L +R + + G
Sbjct: 323 RYVVPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLQEIGTRIRNAEEMKG 382
Query: 260 MV--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 383 AVKWIVTCIVLHNLLA----DLKDQWNDLYEDEA 412
>gi|388490968|gb|AFK33550.1| unknown [Lotus japonicus]
Length = 386
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 63/259 (24%), Positives = 115/259 (44%), Gaps = 25/259 (9%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR---MVKWP 94
LS ++V L + G IG + ++S++ +V + V+ +R + WP
Sbjct: 87 LSLNDQVAVALRRLSSGESLSTIGDSFLMNQSAV----SQVTWLFVEAMEERGLHHLSWP 142
Query: 95 --EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ--FVNRNGDHALNVMAIC 150
E + +I F G N G VD T +L+ P+ + +++R + ++ + AI
Sbjct: 143 STETAMEEIKFKFENIRGLSNCCGAVDSTHILMTLPSGDTENSVWLDRKKNCSMILQAIV 202
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR------PFPDAV-----ILGDS 199
P+ +F V WPGS+ D V+R+S + K EEG R P+ I+GD+
Sbjct: 203 DPDLRFRDVVGGWPGSLSDEYVLRHSEFF-KLAEEGKRLNGAEKMLPEGTALREYIIGDT 261
Query: 200 AYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG 259
+PL WL+ P + + F + T + + + LK+ + + + P
Sbjct: 262 GFPLLPWLLTPYECKDLSDVEVEFNKRVVATHMVAKRALARLKQMWKIIQGVMWKPDKHK 321
Query: 260 M--VVLASTTLHNIALTID 276
+ +VL LHNI + ++
Sbjct: 322 LPRIVLVCCILHNIVIDME 340
>gi|322781647|gb|EFZ10267.1| hypothetical protein SINV_15374 [Solenopsis invicta]
Length = 207
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 83/175 (47%), Gaps = 16/175 (9%)
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
++G S+S++ R + V ++ + R +K P +E+ ++ F G P G
Sbjct: 34 VLGDIITISQSTVSRIVFRV-STLLASHINRYIKMPSTQESRSENKRLFKELGYGPGAIG 92
Query: 117 --CVDGTMVLIDAPNEHE------DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
C+DG + H + F NR G +LNV A+ GP +F + WPGS H
Sbjct: 93 LPCIDGAIDCCHIRLVHSRFQAIGETFRNRKGYFSLNVQAVVGPRTEFLDIVPEWPGSEH 152
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
D+R+ +NS +Y ++ + D +++GD+ YP +L+ P NP RF
Sbjct: 153 DSRIFQNSRIYMRYSQHEL----DGMLVGDAGYPALPFLLTP-VANPATDEQIRF 202
>gi|449690872|ref|XP_004212488.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 293
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 11/170 (6%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP G +DG V I+AP + F N G +++ ++AIC Y F V+ G +DA
Sbjct: 35 FPQCIGAIDGKHVRIEAPAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDA 94
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQ 221
V+ S R F+ +G+ P V++GD + L+ WL+ P +
Sbjct: 95 AVLNASTFGRAFN-KGYFNLPKISEFDPKVPPVLVGDDIFALRPWLMKPYPGKNLTVQQR 153
Query: 222 RFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
F R RR I+NSFGIL ++ ++ P ++ A+ LHN
Sbjct: 154 VFNYRLSRARRTIKNSFGILAVRWRIYRSPIKAKPLKVEHIIKATVCLHN 203
>gi|357117801|ref|XP_003560650.1| PREDICTED: uncharacterized protein LOC100830885 [Brachypodium
distachyon]
Length = 579
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 86/197 (43%), Gaps = 29/197 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + + P E V R G NVMA+C + +F ++ A WPGS HD
Sbjct: 346 FNGCIGAIDGTHIRVVVPAEDIANHVGRYGYPIQNVMAVCDFDMRFISIVAGWPGSAHDT 405
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----------------RTRN 214
R+ +++++ + E P L DS YP + + P R+
Sbjct: 406 RIFKDTLI--TYSENFPHPPSGRYYLVDSGYPNQTGYLAPYKGQKYHLPEFRRGIEEGRH 463
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIAL 273
P + F H + + +IE SFG+LK K+ L ++ + + +++A LHN
Sbjct: 464 P-TGKKEIFNHAHSQLQNVIERSFGVLKMKWRILLHVSSYAIEKQTRIIVACMALHNWI- 521
Query: 274 TIDRDNENQNVEIHDDE 290
+ +HDDE
Sbjct: 522 --------RESALHDDE 530
>gi|449689778|ref|XP_004212142.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 289
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 88/193 (45%), Gaps = 16/193 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND---IPMTFLRKGGFPNVAGCVDGTMV 123
++++C I+E I + +K P LN+ + F + FPN G +DG V
Sbjct: 2 GRTTVCNIINETTKAIWNVLKPCYLKAPST-LNEWEELANQFENEWNFPNCIGAIDGKHV 60
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I+AP+ + N H++ ++AIC Y F V+ G +DA + S + + F
Sbjct: 61 CIEAPSLSGLAYYNYKIFHSMVLLAICDAKYCFTLVDIGSYGRDNDASIFNESKMGKAFK 120
Query: 184 EEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
++ P V++GD + LK WL+ P + ++ F RTRR
Sbjct: 121 NNLFK-LPKNRMLSNGKQVPPVLVGDDTFALKSWLMKPFSGKNLTIKERIFNYRLSRTRR 179
Query: 233 LIENSFGILKEKF 245
IEN+FGI+ K+
Sbjct: 180 TIENTFGIMVTKW 192
>gi|359476086|ref|XP_003631787.1| PREDICTED: uncharacterized protein LOC100853519 [Vitis vinifera]
Length = 414
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 84/204 (41%), Gaps = 18/204 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + P + + F R NVM C + F V A W G+ +DA
Sbjct: 173 FKDCIGAIDGTHISAWVPADRQTSFRGRKTVITQNVMCACNFDMMFTFVYAGWEGTANDA 232
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
RV +++ R W P + DS YP +PP N
Sbjct: 233 RVFLDALT-RPEVNFPW-PSEGKYYVVDSGYPCISGFLPPYRGERYHLQEYWGRHNQPIR 290
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHN-IALTID 276
+ F H R +IE FG+LK +FP L M P +V+A TLHN I L+
Sbjct: 291 YKELFNYRHSSLRNIIERCFGVLKTRFPILRMMPCYKPSRQPSIVVACCTLHNWIRLSTR 350
Query: 277 RDNENQNVEIHDDEAEDDGEEEAV 300
D + E+ D E GEEE+
Sbjct: 351 NDQLFREYEVEDLSIE--GEEEST 372
>gi|115470825|ref|NP_001059011.1| Os07g0175100 [Oryza sativa Japonica Group]
gi|113610547|dbj|BAF20925.1| Os07g0175100, partial [Oryza sativa Japonica Group]
Length = 436
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/319 (21%), Positives = 132/319 (41%), Gaps = 24/319 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 104 AFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 163
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + +V WP +E + I F + G PN G
Sbjct: 164 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLV-WPGQERMEQIKARFEAESGLPNCCGA 222
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F + WPGS+ +R+++ S
Sbjct: 223 IDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCS 282
Query: 177 MVYRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
++ D P V I+G++ YPL WL+ P +A F
Sbjct: 283 GFFKHCDAGTRLDGPVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNAR 342
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNENQNV 284
K R L + LK + LN + P + ++L LHNI + + D +V
Sbjct: 343 QKAARTLGPRALSRLKGSWRILNKVMWRPDKNKLPSIILVCCLLHNIIIDCE-DELLPDV 401
Query: 285 EI--HDDEAEDDGEEEAVD 301
++ H D + + E VD
Sbjct: 402 QLPDHHDTGYSEEKCEQVD 420
>gi|25553597|dbj|BAC24862.1| unknown protein [Oryza sativa Japonica Group]
Length = 441
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 132/320 (41%), Gaps = 26/320 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 109 AFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 168
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + +V WP +E + I F + G PN G
Sbjct: 169 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLV-WPGQERMEQIKARFEAESGLPNCCGA 227
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F + WPGS+ +R+++ S
Sbjct: 228 IDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCS 287
Query: 177 MVYRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
++ D P V I+G++ YPL WL+ P +A F
Sbjct: 288 GFFKHCDAGTRLDGPVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNAR 347
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE---N 281
K R L + LK + LN + P + ++L LHNI ID ++E +
Sbjct: 348 QKAARTLGPRALSRLKGSWRILNKVMWRPDKNKLPSIILVCCLLHNI--IIDCEDELLPD 405
Query: 282 QNVEIHDDEAEDDGEEEAVD 301
+ H D + + E VD
Sbjct: 406 VQLPDHHDTGYSEEKCEQVD 425
>gi|390359099|ref|XP_003729409.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 287
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 105/239 (43%), Gaps = 37/239 (15%)
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+SI + + EVV IV+ + +++ P + G+ + G V AP
Sbjct: 10 NSISKVVREVVEAIVEEYVDELLRCPTN-----------EQGWRQLIG----KHVACKAP 54
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE--- 185
+ N G +++ + A+ +YKF ++A+ GS DA++ S + +
Sbjct: 55 ANSGSTYYNYKGFYSILIFAMVDADYKFIYIDASSKGSASDAQIYNASDLKDGLERNLIM 114
Query: 186 ---GWRPFPDAV------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
G P P+ I+GD A+ L+ +L+ P + A ++ F R RR++EN
Sbjct: 115 GFPGPDPLPNDTQDVPYFIVGDDAFSLRAYLMKPYSSRYLAREERIFNYRLSRARRVVEN 174
Query: 237 SFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIALT---------IDRDNENQNVE 285
+FGIL +F L M+ PQ +V A LHN+ T +DR N ++E
Sbjct: 175 AFGILANRFQILLTTMQHDPQTVKSIVEACCILHNLMRTRYPVLQNRLVDRQRGNGDME 233
>gi|198476040|ref|XP_002132244.1| GA25311 [Drosophila pseudoobscura pseudoobscura]
gi|198137519|gb|EDY69646.1| GA25311 [Drosophila pseudoobscura pseudoobscura]
Length = 412
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/247 (22%), Positives = 104/247 (42%), Gaps = 17/247 (6%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
L A+ ++ L ++ G + + + KSSI + EV+ IV ++ P E
Sbjct: 89 LPAETRLQMALRYLTTGDSFSTLAGIYRVGKSSIKYIVEEVIKAIVKELKGVCLQTPQTE 148
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E D+ F FP+ G +DG + + +D + N ++ ++A+ +++
Sbjct: 149 EEWLDVAKQFEELWNFPHCLGSLDGHHIAFRSKTVKDDSYTNYRQFQSIIMLALVDAHHR 208
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEE-----------GWRPFPDAVILGDSAYPLK 204
F V+A+ PG DA +S +Y + G V+L D + L+
Sbjct: 209 FLYVDASSPGGATDA--FTDSTLYNGLESNLLNIPHEKPLLGLNEELPHVVLADKGFELQ 266
Query: 205 EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVL 263
WL+ + P + F R R++ N+ GI+ F L ++L +V+
Sbjct: 267 PWLM-KQHEIPSTIEKKIFNYRLNRAHRVVVNALGIMSNSFRALQTEIKLEASMVEKLVI 325
Query: 264 ASTTLHN 270
A++ LHN
Sbjct: 326 ATSILHN 332
>gi|331226553|ref|XP_003325946.1| hypothetical protein PGTG_07776 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 336
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 100/203 (49%), Gaps = 23/203 (11%)
Query: 101 IPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + +R+ GFP G VDGT + L P + + +R ++++V IC N KF +
Sbjct: 114 IAVAVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISY 173
Query: 160 NANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRN 214
A +PGS HDA V + V ++ FD + + +L DSAY ++IP + ++
Sbjct: 174 LAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTNDRYVIPAFKGKH 226
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAGMV--VLASTTL 268
+ F ++R IE++ GILK +F L +R + + G V ++ L
Sbjct: 227 LLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKGAVKWIVTCIVL 286
Query: 269 HNIALTIDRDNENQNVEIHDDEA 291
HN+ D ++Q ++++DEA
Sbjct: 287 HNLLA----DLKDQWNDLYEDEA 305
>gi|317420132|emb|CBN82168.1| Putative nuclease HARBI1 [Dicentrarchus labrax]
Length = 319
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 83/180 (46%), Gaps = 12/180 (6%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
E +R S + ++ + P + T R+ +L+ + V PY G +
Sbjct: 43 ERYRFSGDGLRYLCRLLGPKMKHQTARSHSLTVPQMVCYIDMVCKWELPYS--GDAENLN 100
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTF----LRKGGFPNVAGCVDGTMV 123
K +IC +I V + + +F + +P P+ F + FPN+ G VD T +
Sbjct: 101 KGTICCTIRRVC-LALKSFSNIFITFPGHRR---PLYFKEEFYKIAAFPNIIGAVDCTHI 156
Query: 124 LIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P+ EHE ++NR H++NV I + + A WPGSVHD+RV R S +Y +
Sbjct: 157 RIKRPSGEHEGDYINRKSFHSINVQ-ISDADCLVSNLEAKWPGSVHDSRVFRASPIYERL 215
>gi|222636526|gb|EEE66658.1| hypothetical protein OsJ_23285 [Oryza sativa Japonica Group]
Length = 397
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 132/320 (41%), Gaps = 26/320 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 65 AFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 124
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + +V WP +E + I F + G PN G
Sbjct: 125 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLV-WPGQERMEQIKARFEAESGLPNCCGA 183
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F + WPGS+ +R+++ S
Sbjct: 184 IDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCS 243
Query: 177 MVYRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
++ D P V I+G++ YPL WL+ P +A F
Sbjct: 244 GFFKHCDAGTRLDGPVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNAR 303
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE---N 281
K R L + LK + LN + P + ++L LHNI ID ++E +
Sbjct: 304 QKAARTLGPRALSRLKGSWRILNKVMWRPDKNKLPSIILVCCLLHNI--IIDCEDELLPD 361
Query: 282 QNVEIHDDEAEDDGEEEAVD 301
+ H D + + E VD
Sbjct: 362 VQLPDHHDTGYSEEKCEQVD 381
>gi|40737037|gb|AAR89050.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 693
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 110/241 (45%), Gaps = 23/241 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFL---RKGGFPNVAGCVDGTMV 123
S S++ V+ +V +++K + N DIP L R G F G VDGT +
Sbjct: 419 SNSTVSIYFRRVLSRMV-MLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHI 477
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ + + +NR+ + NV+ I G + + +A W GSVHD RV+ ++ +
Sbjct: 478 PVTVGVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWLGSVHDNRVLNEAI--DSYP 535
Query: 184 EEGWR-PFPDAVILGDSAYPLKEWLIPPRTR-------------NPDAAADQRFLRCHKR 229
EE R PF +L DS YP + + P R P +++F H +
Sbjct: 536 EEFPRLPF-RKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGREEKFNYIHAK 594
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R +IE FGI+K+++ L + +P ++LA+ LHN + +++ N ++
Sbjct: 595 LRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDSKQNDFQANNPLY 654
Query: 288 D 288
+
Sbjct: 655 N 655
>gi|449677574|ref|XP_004208878.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 264
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 89/231 (38%), Gaps = 30/231 (12%)
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
T G + + I EV IV R V P+ + + + F K G GCVDG
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
T + I P H + ++L V A+C F V WPGSVHDA+V NS +
Sbjct: 65 THIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSINT 124
Query: 181 KFDEEGWRPFPDAV-------------ILGDSAYPL-----KEWLIPPRTRNPDAAADQR 222
P + ++GD AYPL KE+ + D
Sbjct: 125 NLRSSR---LPGTIQTITKNKIKVPCYLIGDPAYPLLPHCMKEY---STCKKNDEVIFNS 178
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIA 272
LR R IE +FG LK ++ L M L + V+ A LHN
Sbjct: 179 MLR---TARNPIECAFGRLKTRWKILTKKMDLKLEKIPTVIYACFILHNFC 226
>gi|218199172|gb|EEC81599.1| hypothetical protein OsI_25082 [Oryza sativa Indica Group]
Length = 397
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 132/320 (41%), Gaps = 26/320 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 65 AFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 124
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + +V WP +E + I F + G PN G
Sbjct: 125 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLV-WPGQERMEQIKARFEAESGLPNCCGA 183
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F + WPGS+ +R+++ S
Sbjct: 184 IDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCS 243
Query: 177 MVYRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
++ D P V I+G++ YPL WL+ P +A F
Sbjct: 244 GFFKHCDAGTRLDGPVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNAR 303
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE---N 281
K R L + LK + LN + P + ++L LHNI ID ++E +
Sbjct: 304 QKAARTLGPRALSRLKGSWRILNKVMWRPDKNKLPSIILVCCLLHNI--IIDCEDELLPD 361
Query: 282 QNVEIHDDEAEDDGEEEAVD 301
+ H D + + E VD
Sbjct: 362 VQLPDHHDTGYSEEKCEQVD 381
>gi|340383345|ref|XP_003390178.1| PREDICTED: hypothetical protein LOC100633099 [Amphimedon
queenslandica]
Length = 332
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 109/241 (45%), Gaps = 19/241 (7%)
Query: 12 LSATQVEFILEEIAPALT-TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S+++ + +L + P + T+ +SA+E++ L ++ G I ++ S+
Sbjct: 1 MSSSRFDNLLRRVKPFIVRKTTQLREPVSAEERLSVTLRYLVTGDSMQTISFSYRLGHST 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ I E + ++ P+ + I F FP+ G +DG +L+ AP
Sbjct: 61 VSYIIEETCQALWRALSVEFLQPPKSSDEWKKISEGFADIWNFPHCIGAMDGKHILMQAP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR 188
Q+ N G H++ +MA+C NY F ++ G D + N + +
Sbjct: 121 PNVASQYFNYKGTHSIVLMAVCYYNYCFLLLDIGDYGKKIDGGIF-NVIS----GQSAKL 175
Query: 189 PFPDAVILGDSAYPLKEWLIPPR--TRNPD--AAADQRFLRCHKRTRRLIENSFGILKEK 244
P+ +GDSA+PL ++ P T P+ ++ R L R RR+IEN+FGIL K
Sbjct: 176 PY---FFIGDSAFPLMNSMLKPYPGTYLPENKHISNYRLL----RARRVIENAFGILASK 228
Query: 245 F 245
F
Sbjct: 229 F 229
>gi|221116902|ref|XP_002160102.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 351
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 91/216 (42%), Gaps = 13/216 (6%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
++++C I E I + ++ P + +I F + FP G +D V
Sbjct: 47 GRATVCHIIKETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDEKHVR 106
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I+AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F+
Sbjct: 107 IEAPAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN- 165
Query: 185 EGWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
+G+ P V +GD + LK WL+ P + F R RR IE
Sbjct: 166 KGYFNLPKISEFDPKVPPVFVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIE 225
Query: 236 NSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
NSF IL + ++ P ++ A+ LHN
Sbjct: 226 NSFEILAAGWRIYRSPIKAKPLKVEHIIKATVCLHN 261
>gi|357119271|ref|XP_003561367.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 471
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 125/297 (42%), Gaps = 25/297 (8%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S + +++ + L + + LS +++V + + G
Sbjct: 128 AFRYFFRTSRSTFDYVCSIVRDDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 187
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+GA G +S++ + + + + +V WP +E ++DI F G PN G
Sbjct: 188 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLV-WPDQERMDDIKANFEVVSGLPNCCGA 246
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F V WPGS +++++ S
Sbjct: 247 IDATHIVMMLPAVESSEYWYDHANNYSMFLQGIVDHEMRFIDVVTGWPGSTTFSQLLKLS 306
Query: 177 MVYRKFDEEGWR-PFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
Y K E G R P V I+GD +YPL WL+ P +A F
Sbjct: 307 EFY-KLCEAGKRLDGPAQVSREDLEIREFIVGDVSYPLLPWLMTPYQGESLSALMVDFNA 365
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
K R L + LK + L + P + ++L LHNI IDR ++
Sbjct: 366 RQKAARMLGTRALARLKGSWRILQKVMWRPDKNKLPSIILVCCLLHNI--MIDRQDQ 420
>gi|108709891|gb|ABF97686.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 701
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 110/241 (45%), Gaps = 23/241 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFL---RKGGFPNVAGCVDGTMV 123
S S++ V+ +V +++K + N DIP L R G F G VDGT +
Sbjct: 427 SNSTVSIYFRRVLSRMV-MLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHI 485
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ + + +NR+ + NV+ I G + + +A W GSVHD RV+ ++ +
Sbjct: 486 PVTVGVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWLGSVHDNRVLNEAI--DSYP 543
Query: 184 EEGWR-PFPDAVILGDSAYPLKEWLIPPRTR-------------NPDAAADQRFLRCHKR 229
EE R PF +L DS YP + + P R P +++F H +
Sbjct: 544 EEFPRLPF-RKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGREEKFNYIHAK 602
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R +IE FGI+K+++ L + +P ++LA+ LHN + +++ N ++
Sbjct: 603 LRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDSKQNDFQANNPLY 662
Query: 288 D 288
+
Sbjct: 663 N 663
>gi|449665294|ref|XP_004206114.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 264
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 91/230 (39%), Gaps = 28/230 (12%)
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
T G + + I EV IV R V P+ + + + F K G GCVDG
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV-- 178
T + I P H + ++L V A+C F V WPGSVHDA+V NS +
Sbjct: 65 THIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSINT 124
Query: 179 ----------YRKFDEEGWRPFPDAVILGDSAYPL-----KEWLIPPRTRNPDAAADQRF 223
++ + + ++GD AYPL KE+ + D
Sbjct: 125 NLRSSRLPGTFQTITKNKIK--VPCYLIGDPAYPLLPHCMKEY---STCKKNDEVIFNSM 179
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNIA 272
LR R IE +FG LK ++ L M L + V+ A LHN
Sbjct: 180 LR---TARNPIECAFGRLKARWKILTKKMDLKLEKIPTVIYACFILHNFC 226
>gi|195156327|ref|XP_002019052.1| GL26155 [Drosophila persimilis]
gi|194115205|gb|EDW37248.1| GL26155 [Drosophila persimilis]
Length = 412
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/247 (22%), Positives = 103/247 (41%), Gaps = 17/247 (6%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--E 95
L A+ ++ L ++ G + + + KSSI + EV+ IV ++ P E
Sbjct: 89 LPAETRLQMALRYLTTGDSFSTLAGIYRVGKSSIKYIVEEVIKAIVKELKGVCLQTPQTE 148
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E D+ F FP+ G +DG + + +D + N ++ ++A+ +++
Sbjct: 149 EEWLDVAKQFEELWNFPHCLGSLDGHHIAFRSKTVKDDSYTNYRQFQSIIMLALVDAHHR 208
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEE-----------GWRPFPDAVILGDSAYPLK 204
F V+A+ PG DA S +Y + G V+L D + L+
Sbjct: 209 FLYVDASSPGGATDA--FTESTLYNGLESNLLNIPHEKPLLGLNEELPHVVLADKGFELQ 266
Query: 205 EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVL 263
WL+ + P + F R R++ N+ GI+ F L ++L +V+
Sbjct: 267 PWLM-KQHEIPSTIEKKIFNYRLNRAHRVVVNALGIMSSSFRALQTEIKLEASMVEKLVI 325
Query: 264 ASTTLHN 270
A++ LHN
Sbjct: 326 ATSILHN 332
>gi|357128294|ref|XP_003565809.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
gi|255316766|gb|ACU01765.1| hypothetical protein [Brachypodium distachyon]
Length = 463
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 111/278 (39%), Gaps = 13/278 (4%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
AF+ FR+S + + + A+ T A+ ++V C+ + G P +
Sbjct: 127 AFRRDFRMSRATFAMVCDALGAAVAKEDTALRAAIPVPQRVAVCVWRLATGEPLREVSKR 186
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGT 121
G S+ + + EV I R ++W E F G P V G +
Sbjct: 187 FGIGISTCHKLVLEVSAAIRSILMPRFLQWASTAEAQARQKARFEACSGVPGVVGAMLTA 246
Query: 122 MVLIDAPNEHEDQFVNR-----NG--DHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
V I AP + NR NG +++ + + GP+ F V WPGS+ D +V+
Sbjct: 247 HVPIIAPKLSVAAYFNRRHTERNGKTSYSVTLQGVVGPDGAFTDVCIGWPGSMPDDQVLA 306
Query: 175 NSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI 234
S ++ + G P V+ G S +PL +W + P T + F R +
Sbjct: 307 RSALHNRAANGG-MPAGSWVVAGAS-FPLTDWTLVPYTHHNLTWTQHAFNDKVAGIRLVA 364
Query: 235 ENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
+F LK ++ CL + Q +V+ A LHNI
Sbjct: 365 VRAFRRLKARWVCLQKRTEVKLQDLPVVLGACCVLHNI 402
>gi|147779792|emb|CAN77802.1| hypothetical protein VITISV_004741 [Vitis vinifera]
Length = 473
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 112/264 (42%), Gaps = 49/264 (18%)
Query: 62 ATHGPSKSSIC--------RSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGF 111
+T PS++SI R+ ++ V + F + M+ P N N I LR+ F
Sbjct: 195 STSMPSETSISLEEDEDLPRNTNDEAEVAMVNFSKEMITPPSFNDSSNGISNRRLRQI-F 253
Query: 112 PNVAGCVDGTMVLIDAPNEHEDQFVNRN-GDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
+ G +DGT++ P + + R G+ NVMA+C + F V W G+ HD+
Sbjct: 254 KDAVGAIDGTLIHACIPTNQQVPYRGRGRGECFQNVMAVCDFDMIFRFVVVGWEGTAHDS 313
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-----------LKEWLIPPRTRNPDAAA 219
RV+ ++ L D+AY ++ WL R+
Sbjct: 314 RVLTETI--------------QKYYLVDAAYTHTRGFMAPYRNVRYWLSDFRSSGKAVGK 359
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI--- 275
++ F +CH R R +IE +F ++K +FP L M S +V+ ++HN I
Sbjct: 360 EEIFNQCHARLRNVIERAFXVVKARFPXLKRMAPYSFTTQTKIVMTCFSIHNFLRQISXA 419
Query: 276 DR-----DNENQNVEIHDDEAEDD 294
DR DNE VE+ D A +
Sbjct: 420 DRLFSEYDNE---VELESDNANQN 440
>gi|222625529|gb|EEE59661.1| hypothetical protein OsJ_12058 [Oryza sativa Japonica Group]
Length = 704
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 115/257 (44%), Gaps = 27/257 (10%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
V + E + L+ +G GS S +I R I EV+ +V+ + +V+
Sbjct: 392 TVHMHTMEALAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVE-LGRDIVRPK 450
Query: 95 EENLNDIPMTFLRKG-----GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
+ N +++ +RK F + G VDGT +L P++ + +++ R+ NVMAI
Sbjct: 451 DPNFSNVHER-IRKDRRMWPHFKDCIGAVDGTHILAVVPDDDKIRYIGRSKSTTQNVMAI 509
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLI 208
C + +F + PGS+HD V+ N++ + D + + P L D+ YP + +
Sbjct: 510 CDHDMRFTYASIGQPGSMHDTTVLFNAL---RTDIDIFPHPPQGKYYLVDAGYPNRPGYL 566
Query: 209 PP------------RTRNPDAAADQ-RFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
P R P ++ FL C RT +IE FG+ K K+ L M P
Sbjct: 567 APYKGQRYHVPEFRRGSAPSGVKEKFNFLHCSVRT--IIERCFGVWKMKWRVLLKMPSFP 624
Query: 256 QFA-GMVVLASTTLHNI 271
+ MVV A+ LHN
Sbjct: 625 LWKQKMVVAATMALHNF 641
>gi|390344081|ref|XP_003726039.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 178
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 46/208 (22%)
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E +N + F GFP V G VDGT + ++ A GP
Sbjct: 2 EEVNRAQVDFFNISGFPQVIGVVDGTHIRLNG--------------------APLGP--- 38
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNP 215
H++RV + + G ++LGDS Y L+ +L+ P NP
Sbjct: 39 ----------GEHNSRV---GQTFEDLQQHG-------LLLGDSGYALRPYLMTP-VLNP 77
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALT 274
A+Q + R H TR IE G LK KF C LN M+++P+ A ++ A LHN+A
Sbjct: 78 RTPAEQAYNRAHATTRVRIEQVNGQLKHKFRCLLNGMQMAPRRACKIITACAVLHNVAKD 137
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDD 302
+ + +E + I E E D ++ VD+
Sbjct: 138 LKQPDE-VDFGIEQHEHEPDRDDHIVDN 164
>gi|322790399|gb|EFZ15333.1| hypothetical protein SINV_05465 [Solenopsis invicta]
Length = 291
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 106/270 (39%), Gaps = 53/270 (19%)
Query: 6 FKEAFRLSATQV-EFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++ +R S V E +L + L R + +S ++L L + S + G
Sbjct: 31 FRKRYRFSKEAVIEILLPLVNAQLRRLNNRGLPVSPLMQLLITLRFYATSSFQIVNGDLR 90
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G S++++ R + V ++ + + EE + F FP+V G
Sbjct: 91 GYSQATVSRIVVRVSKILASHLNEFINFSTEEKCRNNKNKFYEVANFPSVIG-------- 142
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
PGS HDA + S + +F+
Sbjct: 143 --------------------------------------HPGSTHDAVIFDRSGLRCRFEL 164
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
D ++LGD+ Y + L+ P R P+ A+ R+ HK+TR ++E FG+ K
Sbjct: 165 GHL----DGILLGDNGYACRRCLLTPVLR-PETDAELRYNTAHKKTRVIVEQMFGVWKRC 219
Query: 245 FPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
FPCL Y +R + +V A+ LHNI +
Sbjct: 220 FPCLYYGIRTKLSTSVAIVCATAVLHNICI 249
>gi|322780715|gb|EFZ09990.1| hypothetical protein SINV_08352 [Solenopsis invicta]
Length = 308
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 100/237 (42%), Gaps = 14/237 (5%)
Query: 50 WMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRK 108
++ G + I + +S+I I E I + + P +N +I F
Sbjct: 1 YLASGDGLNSIALNYRIGRSTITLIIKETCEAIWTNLNKEALFIPTQNGWKEIAQEFQEI 60
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
FPN G +DG V I P + Q+ N H++ +MA+ +YKF V+ G
Sbjct: 61 WNFPNCVGALDGKHVSIICPPKAGSQYYNYKNFHSVVLMALVSASYKFLIVDIGAQGRHS 120
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDA 217
D + +NS + ++F + PD+ VI+ D A+ L E+ + P
Sbjct: 121 DGGIFKNSAMGQRFYNKTMN-LPDSSDISERHTVPYVIVADEAFQLTEFTMRPYPSKNLT 179
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
+ F R R ++EN+FGIL ++ + + S + ++ A+ LHN +
Sbjct: 180 KQQKIFNYRLSRARHVVENTFGILASRWRIYHKPINTSLKTVDAIIKATVCLHNFLM 236
>gi|322800660|gb|EFZ21605.1| hypothetical protein SINV_10073 [Solenopsis invicta]
Length = 136
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 63/133 (47%), Gaps = 5/133 (3%)
Query: 137 NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD-AVI 195
NR ++ + + N KF + PGS+HDARV+R S +Y+ FP+ I
Sbjct: 1 NRKKYFSITLQGVVDANMKFTNIYYGEPGSLHDARVLRRSPLYQTAVHNKETLFPENTFI 60
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKF---PCLNYM 251
LGDSAY WL+PP N Q+ F H TR +IE +FG LK +F N
Sbjct: 61 LGDSAYASLSWLVPPFRDNGHLTPQQKEFNFLHSSTRMVIERAFGYLKGRFRRIKFFNEY 120
Query: 252 RLSPQFAGMVVLA 264
R P VV A
Sbjct: 121 RHMPFITNTVVCA 133
>gi|357119235|ref|XP_003561351.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 267
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 87/179 (48%), Gaps = 19/179 (10%)
Query: 108 KGGFPNVAGCVD---GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
K +P++ C+D GT + P + + +++ R+G + NV+AIC + +F + P
Sbjct: 24 KRAYPHLKDCIDALDGTHIRASLPADQQVRYIGRSGTASQNVLAICDFDMRFIYASIGQP 83
Query: 165 GSVHDARVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLIPP----RTRNPD--- 216
G++HD V+ ++ ++ D + + P L D+ YP + + P R PD
Sbjct: 84 GAMHDTSVLYHA---KRVDHDIFPHPPKGKYYLVDAGYPNRPGYLAPYKGERYHVPDFHR 140
Query: 217 ----AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHN 270
+ ++F R H R +IE SFG+LK K+ L M S ++V AS LHN
Sbjct: 141 GAEPSTPKEKFNRVHSAVRNVIERSFGVLKMKWRILLKMPNYSMDKQKLIVAASMVLHN 199
>gi|29788843|gb|AAP03389.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 411
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 114/255 (44%), Gaps = 23/255 (9%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
V + E + L+ +G GS S +I R I EV+ +V+ + +V+
Sbjct: 99 TVHMHTMEALAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVE-LGRDIVRPK 157
Query: 95 EENLNDIPMTFLRKG-----GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
+ N +++ +RK F + G VDGT +L P++ + +++ R+ NVMAI
Sbjct: 158 DPNFSNVHER-IRKDRRMWPHFKDCIGAVDGTHILAVVPDDDKIRYIGRSKSTTQNVMAI 216
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLI 208
C + +F + PGS+HD V+ N++ + D + + P L D+ YP + +
Sbjct: 217 CDHDMRFTYASIGQPGSMHDTTVLFNAL---RTDIDIFPHPPQGKYYLVDAGYPNRPGYL 273
Query: 209 PP-----------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF 257
P R + + ++F H R +IE FG+ K K+ L M P +
Sbjct: 274 APYKGQRYHVPEFRRGSAPSGVKEKFNFLHCSVRTIIERCFGVWKMKWRVLLKMPSFPLW 333
Query: 258 A-GMVVLASTTLHNI 271
MVV A+ LHN
Sbjct: 334 KQKMVVAATMALHNF 348
>gi|125537378|gb|EAY83866.1| hypothetical protein OsI_39087 [Oryza sativa Indica Group]
Length = 717
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 16/221 (7%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI-PMTFLRKGGFPNVAGCVDGTMV 123
G S ++ R EV+ ++ +F +R+++ + + + P F G +DGT +
Sbjct: 445 GRSLDTVSRKFDEVLDAML-SFARRIIRPRDPSFRYVHPNLHQFSPYFDGCIGAIDGTYI 503
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ + D F+NR G + NV+AIC +F V G+ HD V R ++
Sbjct: 504 PVTVLDPTHDDFINRKGFTSQNVLAICDMGMRFTFVATGKRGAAHDMAVFREAV---NTA 560
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPP--RTRNPDAAADQR--------FLRCHKRTRRL 233
+ P P L DS YPL+E + P +TR + + F H R +
Sbjct: 561 DHFPHPPPGKYYLVDSGYPLREGYMAPYRKTRYHLKQFNGKGPENLCEIFNYHHSCLRNV 620
Query: 234 IENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIAL 273
+E SFG+LK K+ L + L P + +++A LHN A+
Sbjct: 621 VERSFGVLKNKWKILKGIPLYPMEKQSKIIVACFALHNFAM 661
>gi|322803340|gb|EFZ23265.1| hypothetical protein SINV_11036 [Solenopsis invicta]
Length = 153
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 63/131 (48%), Gaps = 15/131 (11%)
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-----PFPDAVILGDSAYP 202
A+ GP K + PGS HDA + FD G R D ++LGD+ Y
Sbjct: 1 AVAGPQRKILDLVVRHPGSTHDAVI---------FDRSGLRCRFKLGHLDGILLGDNGYA 51
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVV 262
+ +L+ P R P+ A+ R+ HK+TR ++E FG+ K FP L Y + + +
Sbjct: 52 CRRYLLTPVLR-PETDAELRYNTAHKKTRVIVEQMFGVWKRCFPRLYYGIRTKLSTSVAI 110
Query: 263 LASTTLHNIAL 273
+ + LHNI +
Sbjct: 111 VCAVVLHNICI 121
>gi|72112371|ref|XP_790108.1| PREDICTED: uncharacterized protein LOC585178 [Strongylocentrotus
purpuratus]
Length = 429
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/337 (21%), Positives = 138/337 (40%), Gaps = 28/337 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK R+ I++ + P ++ + L A ++ L ++ G YH +G +
Sbjct: 81 FKAYLRIEPPMFREIIDRLTPRISKHQDCRPGLPAGLRLAITLRFLATGDSYHSLGFSFR 140
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ +I + EV IV + + ++ P + ++ F R+ + + G +DG +
Sbjct: 141 VACCTISVLVPEVCHEIVAEYKEEVLAIPTTPDGWWEVASAFSRRWNYHHCLGVMDGKHI 200
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I P + + N+ G ++ ++ + +Y F VN GS+ DA + RK D
Sbjct: 201 RIKKPRKSGSDYYNK-GFFSIILLGVVDADYTFMWVNVGARGSMSDAGGFNGCSMKRKID 259
Query: 184 EEGWRPFPD------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
G PD + +G+ A+ L+ ++ P + ++ F R R
Sbjct: 260 A-GMLGMPDPDPLSHDDQDTQYLSVGNDAFALRPSMMKPYSHRYLKNDERIFNYRTSRAR 318
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
R++EN+FG+L + + P A + A LH I R QN ++ +++
Sbjct: 319 RVVENAFGLLTT-------LAIMPNNAISITRACVILHQIMRM--RYPALQNADLDEEDT 369
Query: 292 EDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLL 328
+ A D +DE E R+ P R ++L
Sbjct: 370 DSGIVPGAWRDGAMMDEVEIARR---GPRKNRAGKML 403
>gi|221123821|ref|XP_002154997.1| PREDICTED: uncharacterized protein LOC100209168 [Hydra
magnipapillata]
Length = 338
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 96/224 (42%), Gaps = 16/224 (7%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR---MVKW 93
++S E++ L ++ G I A++ S S+ R I E I + +R V
Sbjct: 4 SVSPSERLAVTLRFLVTGDAQCTIAASYRISASTTSRIISETCAAIWTSLKERNFLHVPS 63
Query: 94 PEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
++ I F FP+ G +DG +++ AP+ ++ N H++ ++A+C
Sbjct: 64 EKQEWKTIAKEFENMWNFPHAIGAIDGKHIVMQAPHNGGSEYFNYKKTHSIVLLAVCNAK 123
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-PFPD----------AVILGDSAYP 202
Y+F V+ G D V N + + P P+ V++ D A+
Sbjct: 124 YEFTMVDIGDSGRQSDGSVFNNCSLGCAIENNKLNIPDPEYIGNSEKVLPYVLVADDAFG 183
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSFGILKEKF 245
LK ++ P N + DQ+ R RR+IEN+FGI +F
Sbjct: 184 LKRHMMKPYP-NQNIPLDQKIFNYRLSRARRVIENAFGIATTRF 226
>gi|449686546|ref|XP_004211198.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 192
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 89/187 (47%), Gaps = 12/187 (6%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPM-TFLRKGGFPNVAGCVDGTMVLI 125
++S+ CR + + I +Q + +P EN I + F GF +D T + I
Sbjct: 12 NQSTGCRVVKNISIRIAQKRYQ-YIMFPLENDALIILHRFYELRGFSGCIVAIDATHIPI 70
Query: 126 DAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+P E+ + + NR G LN AI GP+ + + WP SVHD+++ NS + + +
Sbjct: 71 ISPGRENAESYRNRKGYFLLNCQAIAGPSNEILSACVQWPESVHDSQIFSNSAIKQVLEA 130
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
++G+S Y L P P+ +++++ TR +E FGILK++
Sbjct: 131 R-----QHVHVIGNSGYLCN--LTP--FDYPENISERKYSYSLSSTRVTVECCFGILKKR 181
Query: 245 FPCLNYM 251
F CL ++
Sbjct: 182 FGCLKFL 188
>gi|357620279|gb|EHJ72527.1| hypothetical protein KGM_19675 [Danaus plexippus]
Length = 272
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 74/164 (45%)
Query: 82 IVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
I F + +V E+ DI F K FP G +DG + + AP +F N
Sbjct: 7 IQGIFSRSLVPNTEQKWMDIKTRFQLKWNFPGCIGAIDGKHINLRAPACSGSEFYNNKKT 66
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAY 201
++ ++A +Y F ++ G D V N + R D+ ++VI+ D+A+
Sbjct: 67 ISIVLLACVDDDYSFTYIDIGAKGRHSDVGVFSNCSLKRAIDDGSLNIPAESVIVADAAF 126
Query: 202 PLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
PL+ ++ P N + F R RR++EN+FGIL +F
Sbjct: 127 PLQSNIMKPYPGNNLTTRQKIFNYRVSRARRIVENAFGILASRF 170
>gi|38344718|emb|CAE05260.2| OSJNBb0115I09.22 [Oryza sativa Japonica Group]
gi|38347026|emb|CAE02554.2| OSJNBb0067G11.1 [Oryza sativa Japonica Group]
Length = 804
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 70/153 (45%), Gaps = 25/153 (16%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 643 NCIGAIDGTHIQVVVPNSAAVQHRNRHEEKSQNVMCVCDFDMRFTFVLAGWPGSVHDMRV 702
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK-EWLIP----------------PRTRNP 215
++ +F + + L DS YP + +L P PR R
Sbjct: 703 FNDAQT--RFSAKFPKLPLGKFYLVDSGYPNRPGYLAPYKGITYHFQEYNESTLPRGRR- 759
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
+ F CH R +IE SFG+LK K+ L
Sbjct: 760 -----EHFNYCHSSCRNVIERSFGVLKNKWRIL 787
>gi|331237971|ref|XP_003331641.1| hypothetical protein PGTG_12806 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 336
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 100/209 (47%), Gaps = 35/209 (16%)
Query: 101 IPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + +R+ GFP G VDGT + L P + + +R ++++V IC N KF +
Sbjct: 114 IAVAVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISY 173
Query: 160 NANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP----- 210
A +PGS HDA V + V ++ FD + + +L DSAY +++P
Sbjct: 174 LAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTNDCYVVPAFKGKQ 226
Query: 211 --RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAGMV--V 262
+ RN D F ++R IE++ GILK +F L +R + + G V +
Sbjct: 227 LLKRRNID------FNYHLAQSRVRIEHAIGILKGRFASLREIRTQIRNAEEMKGAVKWI 280
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEA 291
+ LHN+ D ++Q ++++DEA
Sbjct: 281 VTCIVLHNLLA----DLKDQWNDLYEDEA 305
>gi|331245395|ref|XP_003335334.1| hypothetical protein PGTG_17187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309314324|gb|EFP90915.1| hypothetical protein PGTG_17187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 337
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 113/252 (44%), Gaps = 23/252 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
++ ++ + V+ ++ + + V+WP+++ +R GF G VDGT + +
Sbjct: 71 ARGTVVKVTRRVIEALI-SMGRVYVQWPDKDRRAEISEVMRMEGFSGCVGFVDGTTIPIF 129
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR---KF 182
P + F +R +++N +C + + + WPGS D++V + +++ +F
Sbjct: 130 QRPGFDGETFFDRKKRYSMNAQIVCDCDRFITSFISGWPGSCGDSKVYQRMQLHQNPSQF 189
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
++G +L DSAY L IP + + + F C + R E++ GIL
Sbjct: 190 FDQG------QYLLADSAYDLTRTTIPAYKAPATEVLINTDFNYCLAKARVRNEHTIGIL 243
Query: 242 KEKFPCLNYMRLSPQFAGMV------VLASTTLHNIALTI-----DRDNENQNVEIHDDE 290
K ++ L MRL G + + LHN+ T+ + E+QN++ +
Sbjct: 244 KSRWASLREMRLHLYNRGHMRQYLAWFYSCIILHNLLATLGDQWTELAPEDQNLDPSAES 303
Query: 291 AEDDGEEEAVDD 302
++DD A D+
Sbjct: 304 SDDDQIPAATDE 315
>gi|57900671|gb|AAW57796.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 748
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 86/178 (48%), Gaps = 21/178 (11%)
Query: 110 GFPNVAGCV---DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
FP+ GC+ DGT + + + + +++ + G NV+A+C + +F V PG+
Sbjct: 508 AFPHFKGCIGALDGTHIRVSLSPDEQVRYIGKTGIPTQNVLAVCDFDMRFTYVATGQPGA 567
Query: 167 VHDARVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLIPP------------RTR 213
+HD V+ N++ + DE+ + P + D+ +P + + P R
Sbjct: 568 MHDTSVLYNAL---RVDEKFFPHPPQGKYYVVDAGHPNRPGYLAPYKGERYHLPEWHRGM 624
Query: 214 NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA-GMVVLASTTLHN 270
P+ ++F R H R +IE SFG+LK K+ L M P + M+V+A+ LHN
Sbjct: 625 EPNTPM-EKFNRVHSSIRNVIERSFGLLKMKWQILYKMPCYPIYKQKMIVVAAMVLHN 681
>gi|341895010|gb|EGT50945.1| hypothetical protein CAEBREN_29807 [Caenorhabditis brenneri]
Length = 237
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 87/188 (46%), Gaps = 22/188 (11%)
Query: 94 PEENLND----IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
P E LND + F R+G F + D +R H+LNV
Sbjct: 26 PFETLNDEQFRLRYRFSRRGFFTLLESLEDSITT------------SDRKSYHSLNVGLA 73
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP 209
C + F ++ + GS HD+RV R S ++ + G + ++LGDSAY + +L
Sbjct: 74 CDYDLSFIWISCRFGGSAHDSRVFRKSALHADL-QSGEKK---GILLGDSAYRAETYLYK 129
Query: 210 PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTL 268
P N A++++ R R IEN+ G LK +F L+ +R +P A +++ A+ L
Sbjct: 130 P-ILNSQNEAEKKYTDAVCRGRVRIENAIGALKRQFHSLHSEVRYAPDMASLIITAAVCL 188
Query: 269 HNIALTID 276
N ++ ++
Sbjct: 189 RNASILLN 196
>gi|328701841|ref|XP_003241725.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 180
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 67/122 (54%), Gaps = 11/122 (9%)
Query: 159 VNANWPGSVHDARVMRNSM---VYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRN 214
V+A +PGSV+DA + NS + R+ + +P+ +LGDS YPL+ WL+ P T+
Sbjct: 6 VSALFPGSVNDAYIWNNSQLEPILREINNH----YPEGFYLLGDSGYPLRPWLMTPLTQY 61
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---SPQFAGMVVLASTTLHNI 271
++R+ K R LIE G+LK +F CL R+ SP A +V A T LHNI
Sbjct: 62 QPNTPEERYNHKFKYVRSLIERCIGLLKMRFRCLLKHRILHYSPVKACKIVNACTILHNI 121
Query: 272 AL 273
+
Sbjct: 122 CI 123
>gi|346471011|gb|AEO35350.1| hypothetical protein [Amblyomma maculatum]
Length = 445
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 125/296 (42%), Gaps = 28/296 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F++ R+S + + +L + +L+ T + +SA +++ + ++ G + +
Sbjct: 66 FRDYLRMSPSTFDTLLGFVKHSLSRQVTPFSDPISAHDRLAITVRFLANGDTFRSLSYNF 125
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL--NDIPMTFLRKGGFPNVAGCVDGTM 122
+S+ I E I + + +P+ + I + FPN G +DG
Sbjct: 126 LIGRSTASVLIRETTAAIWTNLWDEYIPFPQTEVEWRKIALDMENYWNFPNCIGSLDGKH 185
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I+ PN + +N ++ ++A C +Y+F V+ G D+ + S + +
Sbjct: 186 VNIECPNNSGSRNMNYKKTFSVVLLACCDAHYRFTYVDLCHYGGEGDSGIFLRSDLLKDL 245
Query: 183 DEEGWRPFPD-----------AVILGDSAYPLKEWLIPPRTR--------NPDAAAD--Q 221
W P VI+GD A+PLK +L+ P R +P + Q
Sbjct: 246 -TNNWCGVPAPTTVGSAGDIPYVIVGDEAFPLKTFLMRPYARRDLQTHRLSPSGREEYQQ 304
Query: 222 RFLRCHK--RTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALT 274
R ++ R RR+IENSFGI+ ++ L R S + + A LHN ++
Sbjct: 305 RATFNYRLSRARRVIENSFGIMAARWRILRRPFRASEETTENICKACVVLHNFMMS 360
>gi|449664839|ref|XP_004206009.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 233
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 91/206 (44%), Gaps = 26/206 (12%)
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
K P G VDGT + I P E + N G H+++V AIC F V+ WPGS+
Sbjct: 4 KFNMPQAFGFVDGTHIPIRRPLESSQDYFNYKGFHSMSVQAICDSKGIFMNVDCKWPGSL 63
Query: 168 HDARVMRN------------SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--RTR 213
HDA+V N + YR+ G+ P VI GD AYPL + + +
Sbjct: 64 HDAKVFSNLTFNLRMISNSIPITYREL-LPGFCKVPCYVI-GDPAYPLSTFCLKEYLHCQ 121
Query: 214 NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNI- 271
N + LR R IE +FG LK ++ L + L + V+ A LHN
Sbjct: 122 NNNQVVFNTSLR---TARNQIECAFGRLKARWEILKKKIDLKLENIPTVIYACFVLHNFC 178
Query: 272 ---ALTIDRDNENQNVE--IHDDEAE 292
+TID + N++ I+ D+ +
Sbjct: 179 KIKCITIDEEVFINNIKENINSDKVD 204
>gi|449685416|ref|XP_004210889.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 216
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 68/125 (54%), Gaps = 12/125 (9%)
Query: 156 FYAVNANWPGSVHDARVMRNSMVY---RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRT 212
F +V+ +WP SVH +R+ +NS +Y + G+ +LGD+ Y + +L+
Sbjct: 55 FTSVDVSWPRSVHGSRIFKNSNIYPLLIGLENRGY-------LLGDAGYGITPFLMT-LF 106
Query: 213 RNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
N ++ F + H + R +IE++FG LK +FP L Y +R+ P+ + A LHNI
Sbjct: 107 ENAQTEIEKNFNKKHAKNRVIIEHAFGQLKRRFPILRYGVRIKPEMIATRITACFILHNI 166
Query: 272 ALTID 276
A T++
Sbjct: 167 AKTLN 171
>gi|357111421|ref|XP_003557512.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 406
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 127/300 (42%), Gaps = 31/300 (10%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S + ++I + L + + LS +++V + + G
Sbjct: 63 AFRYFFRTSRSTFDYICSIVRDDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 122
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKWP-EENLNDIPMTFLRKGGFPNV 114
+GA G +S++ +V +++ +R + WP +E ++DI F G PN
Sbjct: 123 SVGAAFGVGQSTV----SQVTWRFIESMEERARYHLAWPDQERMDDIKANFEVVSGLPNC 178
Query: 115 AGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G +D T +++ P + + + + ++++ + + +F + WPGS+ R++
Sbjct: 179 CGAIDATHIVMTLPAVDSSEDWCDHAKNYSMFLQGVFDHEMRFIDIVTGWPGSMTFPRLL 238
Query: 174 RNSMVYRKFDEEGWR-PFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQR 222
+ S Y K E G R P V I+GD YPL WL+ P +A
Sbjct: 239 KFSGFY-KLCEAGKRLDGPAQVSRDDAQIREFIVGDVCYPLLPWLMTPYEGENLSAQMVE 297
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNE 280
F K R L + LK + L+ + P + ++L LHNI IDR ++
Sbjct: 298 FNARQKAARMLGTRALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNI--LIDRQDQ 355
>gi|331218562|ref|XP_003321958.1| hypothetical protein PGTG_03495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309300948|gb|EFP77539.1| hypothetical protein PGTG_03495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 433
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 96/226 (42%), Gaps = 27/226 (11%)
Query: 89 RMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID-APNEHEDQFVNRNGDHALNVM 147
+++ WPE + + + GF G +DGT+V++ P + + NR G +++ +
Sbjct: 189 QLLTWPEPAAREEIKSEFEEVGFDGCVGLIDGTLVILSTCPEKDGPDYYNRKGFYSVVTL 248
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK---FDEEGWRPFPDAVILGDSAYPLK 204
+C + WPG HD R+M N + + F G +L DSA+
Sbjct: 249 LVCNNKKNITYLYTGWPGCSHDMRLMSNCALTKHPGDFFSNG------EYLLADSAFTPT 302
Query: 205 EWLIPP--RTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCLNYMRLS-PQFAGM 260
+P R +N + +Q F R R IEN G+LK +F L +RL M
Sbjct: 303 RTTVPAFKRKKNQNLTDEQHDFNRHLSGVRVGIENCIGLLKNRFKSLKGLRLRVSSKEDM 362
Query: 261 V-----VLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVD 301
V ++A LHN NQ + + D E+D +E+ +
Sbjct: 363 VRVTAWIMACAVLHNFL--------NQGGDFNFDITEEDNDEDGAE 400
>gi|301114939|ref|XP_002999239.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111333|gb|EEY69385.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 401
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 93/209 (44%), Gaps = 22/209 (10%)
Query: 111 FPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
PN +DGT+ L P H + F +R +A+N + +C + N WPGS HD
Sbjct: 189 LPNYIVLIDGTLFPLSQRPENHGEDFFSRKSSYAINGLIVCDGRERIRYENVGWPGSSHD 248
Query: 170 ARVMRNSMVYRKFDEEGWRPFPD-AVILGDSAYPLKEWLIPP--RTRNPDAAADQRFLRC 226
RV RN + + + F D ILGDSAY LIP + R + +
Sbjct: 249 NRVWRNC----RLAQHQYDHFADNEYILGDSAYQASNVLIPAFKKCRGSEIPLKENTFNT 304
Query: 227 H-KRTRRLIENSFGILKEKFPCLNYMRLSPQFAG------MVVLASTTLHNIALTIDRDN 279
+ R +E+ G LK FP L +R + + G ++ + LHN+ + ++
Sbjct: 305 QLAKIRIRVEHCIGTLKGIFPLLKRLRATIRNVGELNSTISLIRPAIILHNLTV----ND 360
Query: 280 ENQNVEIHDDEAEDDGEEEAVDDPDEIDE 308
+ + I DEAE++ E ++ P +E
Sbjct: 361 KLPDAWIDCDEAENN---EPIELPSGTNE 386
>gi|449686016|ref|XP_004211043.1| PREDICTED: uncharacterized protein LOC101236526 [Hydra
magnipapillata]
Length = 313
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 121/289 (41%), Gaps = 36/289 (12%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHI 59
M KE FR+++ F++ E+ P L T T +S ++V LH++ Y +
Sbjct: 1 MEECELKEHFRVNSNTFNFLVNELHPHLGKTTTTTQEPISVVKRVAVALHYLASFEEYRV 60
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGC 117
+ + +G K + +HE + + D K+P ENLN F GF
Sbjct: 61 VSSLYGIGKLAN-LIVHEFINAVNDILLPIFFKFPLSVENLNKHNRDFEAILGF------ 113
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ + P+ + R Y F + WPG +D+ +++NS
Sbjct: 114 -RWSCSWMPHPSFSPLRSNCR---------------YGFIYTSVGWPGRNNDSCILQNSS 157
Query: 178 V--------YRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHK 228
+ + K +E F +GDSA+PL L+ P +N + + Q+ F +
Sbjct: 158 LKAILESNLFDKCCKELGDSFVSLCFIGDSAFPLTRHLLKPYPKNLELSEIQKNFNKILC 217
Query: 229 RTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
+++++N+FG +K +F + M FA +V A TLHNI D
Sbjct: 218 GAKKVVKNAFGRVKARFRVICKRMECDINFATRIVNACVTLHNICEYYD 266
>gi|403178386|ref|XP_003336827.2| hypothetical protein PGTG_18233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375164133|gb|EFP92408.2| hypothetical protein PGTG_18233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 121/280 (43%), Gaps = 27/280 (9%)
Query: 26 PALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDT 85
P L P + + L E++ S + VG +G HG ++ ++ V+ I D
Sbjct: 136 PQLPIPHQLALTL---ERLGSNGNGASVGRFSRNLGVGHG----TVVKASRRVIRAIND- 187
Query: 86 FFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHAL 144
++ + WP+E ++ F G VDGT + L P+ + F +R +++
Sbjct: 188 LSEKYLTWPDEVRRKEISDVMKCEVFEGCVGFVDGTTIPLYQRPSIDGEVFFDRKKRYSI 247
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK---FDEEGWRPFPDAVILGDSAY 201
N +C + A WPGS D+ V + M++++ F + G ++ DSAY
Sbjct: 248 NCQVVCNCDRFITAFMTGWPGSCGDSMVFKRMMLHKEPTLFFDRG------QYLIADSAY 301
Query: 202 PLKEWLIPPRTRNPDAAADQR--FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG 259
L IP + P A + F C R+R E++ GILK ++ L ++RL+ Q
Sbjct: 302 ELGVHCIPA-YKAPAAYIKENSDFNYCLARSRVWNEHTIGILKGQWASLQHLRLAIQKPS 360
Query: 260 --MVVL----ASTTLHNIALTIDRDNENQNVEIHDDEAED 293
M V+ TLHNI + +V I+D D
Sbjct: 361 DMMEVIRWVNCCVTLHNILAHLGDAWAELDVSINDHPGSD 400
>gi|346466591|gb|AEO33140.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 113/278 (40%), Gaps = 11/278 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+ FR++ + ++L T +A+ ++V L+ + I G
Sbjct: 91 FRTHFRVNRSTFRYLLSACECMRRQDTNMRMAVPLHKRVAIGLYRLATSGEDRSISNAFG 150
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+S++ E V+V + V +P +L + F GFP G +DG +
Sbjct: 151 VGRSTVNEIFWEFCHVVVRLLEPKFVGFPTIRDLAEHMRQFAAVTGFPQGVGALDGCHIE 210
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ P E+ + N G ++ ++A+ YK PG HD+ V + S + +
Sbjct: 211 VCPPEENAVDYYNYKGWYSTILLAVADHKYKLLYCCVGAPGRNHDSGVFQASRLPKLLQS 270
Query: 185 EGWR-------PFPDA-VILGDSAYPLKEWLIPPRTR-NPDAAADQRFLRCHKRTRRLIE 235
E + P V+L D A+PL+ ++ P ++ + Q F RR++E
Sbjct: 271 ELFSREVKILNSVPVGPVLLADQAFPLQTHIMKPYSQPGAQGSPTQLFNFRLSSARRVVE 330
Query: 236 NSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIA 272
N FG LK +F L + + + A+ LHN+
Sbjct: 331 NVFGRLKARFRMLLKGLECNIHNVNTAIRAACVLHNVC 368
>gi|242038949|ref|XP_002466869.1| hypothetical protein SORBIDRAFT_01g015480 [Sorghum bicolor]
gi|241920723|gb|EER93867.1| hypothetical protein SORBIDRAFT_01g015480 [Sorghum bicolor]
Length = 515
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 92/215 (42%), Gaps = 22/215 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V P E Q + R G NVMA C + +F V + WPGS HD
Sbjct: 155 FKDCIGAIDGTHVPCVVPQEKFVQHLCRKGMTTQNVMAACDFDMRFTFVMSGWPGSAHDM 214
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-----------RTRNPDAAA 219
V ++++ +F + P L DS Y + + P R
Sbjct: 215 TVFKDAI--SRFGDLFSHPPTGKYYLVDSGYANRLGYLAPYKGTKYHLQEYREGPQPEGK 272
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT---- 274
++ F H R +IE +FG+LK K+ L + S + +++A LHN T
Sbjct: 273 EETFNYAHSSLRNVIERAFGVLKMKWRMLREIPSYSTEKQSRIIVACCALHNFIRTSGIQ 332
Query: 275 ---IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
R + ++N + ++ ED E E V+D ++
Sbjct: 333 DRHFARCDRDENF-VPEEAFEDQPEPEVVEDDSQL 366
>gi|223943391|gb|ACN25779.1| unknown [Zea mays]
Length = 405
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 115/290 (39%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 64 AFRHFFRTSRKTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 123
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+G G +S++ + + + D + + WP +E L I G PN G
Sbjct: 124 SVGTAFGVGQSTVSQVTWRFIESMEDRA-RYHLAWPSQERLEQIKAVLEDAYGLPNCCGG 182
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VD T V++ P E + + ++++ + I +F + WPGS+ +R+M+ S
Sbjct: 183 VDATHVIMTLPAVESSADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCS 242
Query: 177 MVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E G R I GDS YPL WL+ P A Q F
Sbjct: 243 GFF-KLCEAGERLNGSVKVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNA 301
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
K R L N+ LK + L+ + P + ++L LHNI +
Sbjct: 302 RQKAARLLGTNALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNILI 351
>gi|331226844|ref|XP_003326091.1| hypothetical protein PGTG_07921 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 304
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 106/248 (42%), Gaps = 14/248 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR+S +I +E+ L R L+ + +V L+ +G G+ Y I
Sbjct: 59 FIEYFRMSRADFHWICDELRDTLAQDQLRRGAPLTVEAQVAVGLYRLGHGATYVTISHVF 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND----IPMTFLRKGGFPNVAGCVDG 120
K + ++ V ++ R + +P+ + +D I +F + G P V G +DG
Sbjct: 119 NIGKETADKASGRFVNAVLKVLRLRAISFPDLDAHDEWVEIEDSFASRHGIPGVVGAIDG 178
Query: 121 TMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T V L P + ++NR +L + F V GS+HD+RV R S +
Sbjct: 179 THVPLAMPPLDKWKGYINRKNWPSLVFQCVVDGEGNFRDVFGGGAGSIHDSRVFRRSPLG 238
Query: 180 RKFDEEGWRPF---PDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKRTRRLI 234
++ P ++GD+ YP ++ P P AA ++ F TR ++
Sbjct: 239 CSLNQALDLPLMIPRSKYLIGDAGYPSNVNILVPY---PSIAAPENEWFNYVQSATRIVV 295
Query: 235 ENSFGILK 242
E SFG LK
Sbjct: 296 EQSFGRLK 303
>gi|18412695|ref|NP_567144.1| uncharacterized protein [Arabidopsis thaliana]
gi|13878075|gb|AAK44115.1|AF370300_1 unknown protein [Arabidopsis thaliana]
gi|17104745|gb|AAL34261.1| unknown protein [Arabidopsis thaliana]
gi|332646938|gb|AEE80459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 396
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 63/314 (20%), Positives = 132/314 (42%), Gaps = 26/314 (8%)
Query: 4 FAFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPY 57
+AFK FR S T +I + L + + LS +++V L + G
Sbjct: 63 YAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQ 122
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAG 116
+GA G +S++ + + + + + ++WP+ + + +I F G PN G
Sbjct: 123 VSVGAAFGVGQSTVSQVTWRFIEALEERA-KHHLRWPDSDRIEEIKSKFEEMYGLPNCCG 181
Query: 117 CVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +++ P + D + ++ ++++ + + +F + WPG + +++++
Sbjct: 182 AIDTTHIIMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKF 241
Query: 176 SMVYR------------KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
S ++ K +G + ++G +YPL WLI P + + + F
Sbjct: 242 SGFFKLCENAQILDGNPKTLSQGAQI--REYVVGGISYPLLPWLITPHDSDHPSDSMVAF 299
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTI-DRDNE 280
H++ R + +F LK + L+ + P + ++L LHNI + D E
Sbjct: 300 NERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQE 359
Query: 281 NQNVEIHDDEAEDD 294
+ + H D D
Sbjct: 360 DVPLSGHHDSGYAD 373
>gi|449443949|ref|XP_004139738.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449528120|ref|XP_004171054.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 396
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 66/301 (21%), Positives = 123/301 (40%), Gaps = 29/301 (9%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
FK FR S ++I + L + + LS +++V + + G
Sbjct: 56 GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQV 115
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKWPEEN-LNDIPMTFLRKGGFPNV 114
+GA G +S++ +V V+ QR ++WP + L +I F G PN
Sbjct: 116 SVGAAFGVGQSTV----SQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNC 171
Query: 115 AGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G +D T +++ P + D + + N ++++ + I +F + WPG++ +R++
Sbjct: 172 CGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLL 231
Query: 174 RNSMVYRKFD-----EEGWRPFPDA-----VILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
+ S +++ D + F ++G YPL WLI P + + + F
Sbjct: 232 KCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNF 291
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDRDNEN 281
+ L +F LK + LN + P + ++L L NI ID +E
Sbjct: 292 NAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI--IIDNGDEL 349
Query: 282 Q 282
Q
Sbjct: 350 Q 350
>gi|212722242|ref|NP_001132690.1| Transposon protein CACTA, En/Spm sub-class [Zea mays]
gi|194695102|gb|ACF81635.1| unknown [Zea mays]
gi|195627614|gb|ACG35637.1| transposon protein CACTA, En/Spm sub-class [Zea mays]
gi|413945623|gb|AFW78272.1| Transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 404
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 106/242 (43%), Gaps = 22/242 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RNV------ALSAKEKVLSCLHWMGVGSPY 57
+ ++S ++I + LTT T RN L +++V L + G
Sbjct: 56 MESVLKMSRRTFDYICSLVKKDLTTKTYGFRNFRFGDKKVLGVEDQVAVALMRLTTGESL 115
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + KWP E + I F + G PN G
Sbjct: 116 QNIGMWFGMNHSAISNITWRFIESVEERAICHL-KWPSPEEMATIKTRFDKVYGLPNCCG 174
Query: 117 CVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + +++ +++ + A+ + +F V + WPGS+ D ++R
Sbjct: 175 AIDTTHILMCSSAQPNSKVWLDNENKNSMVLQAVVDADLRFRDVVSGWPGSMDDTCILRT 234
Query: 176 SMVYRKFDEEGWR-------PFPDAV---ILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
S +YR E+G R P AV ++GD++YPL WL+ P + AA F +
Sbjct: 235 SGLYR-LCEKGLRLGGQMELPGGSAVREYLVGDASYPLLPWLMTPYREHGLQAAKVEFNK 293
Query: 226 CH 227
H
Sbjct: 294 RH 295
>gi|403158998|ref|XP_003319668.2| hypothetical protein PGTG_01842 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166548|gb|EFP75249.2| hypothetical protein PGTG_01842 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 435
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 105/237 (44%), Gaps = 25/237 (10%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
+ ++ + V+ +V + + V+WP+ + +R GF G VDGT + +
Sbjct: 170 GRGTVIKVSRRVIEALV-SIGRTHVQWPDRHRRAEISEVMRMEGFGGCVGFVDGTTLPMF 228
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV----YRK 181
P + F +R ++LN+ +C + + WPG+V D++V + + Y
Sbjct: 229 QRPGYDGEVFFDRKKRYSLNLQILCDCDKFITSFITGWPGTVGDSKVYKRMQLNLNPYDF 288
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
FDE + ++ DSAY L +IP + + + + F C ++R E++ GI
Sbjct: 289 FDEGQY-------LIADSAYDLSSTVIPAYKGKAAEITINTDFNYCMAKSRVRNEHTIGI 341
Query: 241 LKEKFPCLNYMRLSPQFAGMV------VLASTTLHNIALT-----IDRDNENQNVEI 286
LK ++ L MRL G + + + LHN+ ++ E+QN+ +
Sbjct: 342 LKARWSSLKEMRLHLYNRGHMRQYTAWIYSCIVLHNLLAKLGDQWLELSEEDQNLTL 398
>gi|427780731|gb|JAA55817.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 199
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 76/155 (49%), Gaps = 15/155 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F++ FRLS V + +E+ P + +R LS + KVL L + GS +G
Sbjct: 39 FRQCFRLSKRTVRSLCDELDPII--GCQRASGLSTERKVLCALRFYATGSFQRSVGREEH 96
Query: 65 -GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-----NDIPMTFLRKGGFPNVAGCV 118
G +++++ +IHEV I+ +R K + +L ++ F R+G P V CV
Sbjct: 97 IGMAQAAVSNTIHEVTEAIITVSARR--KLVDFSLTPAAKDEAKAAFARRGVIPGVLACV 154
Query: 119 DGTMVLIDAPNEHE----DQFVNRNGDHALNVMAI 149
DGT+V I P F++R G +ALNVM +
Sbjct: 155 DGTLVAIRKPEGLSLVDTASFMSRKGYYALNVMIV 189
>gi|328700501|ref|XP_003241280.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 302
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 105/259 (40%), Gaps = 33/259 (12%)
Query: 50 WMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND---IPMTFL 106
++ G+ + + S+I + + E V+ + +K E D I +
Sbjct: 4 YLATGTNFTALHFEFLAGVSTIAKIVQETCDVLWKVL--QPLKMAEPTTKDWLGIANGYY 61
Query: 107 RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
K FPN G VDG + ++ P + N +L +MAIC NY F ++ G
Sbjct: 62 EKTQFPNTVGAVDGKHIRLECPKNSGSLYYNYKNFFSLILMAICDSNYCFRIIDVGSYGK 121
Query: 167 VHDARVMRNSMVYRKF---------------DEEGW-RPFPDAVILGDSAYPLKEWLI-- 208
D V + S +K DE G +PF V++ D A+ L + L+
Sbjct: 122 ESDCNVFKTSTFGKKLYSDKVNFPSDRCLPGDENGVPQPF---VLVADEAFALHKHLLRP 178
Query: 209 -PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLAST 266
P RT N + L R R+ IE +FGIL +K+ + M + P A + A
Sbjct: 179 FPGRTLNNNRRIFNYRL---SRARQYIECTFGILSKKWRVFQSSMLVDPNVAVTITKACC 235
Query: 267 TLHNIALTIDRDNENQNVE 285
LHN RD E+ N E
Sbjct: 236 VLHNFVRR--RDGEDNNYE 252
>gi|226501860|ref|NP_001141503.1| uncharacterized protein LOC100273615 [Zea mays]
gi|194704850|gb|ACF86509.1| unknown [Zea mays]
Length = 405
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 115/290 (39%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 64 AFRHFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 123
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+G G +S++ + + + D + + WP +E L I G PN G
Sbjct: 124 SVGTAFGVGQSTVSQVTWRFIESMEDRA-RYHLAWPSQERLEQIKAVLEDAYGLPNCCGG 182
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VD T V++ P E + + ++++ + I +F + WPGS+ +R+M+ S
Sbjct: 183 VDATHVIMTLPAVESSADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCS 242
Query: 177 MVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E G R I GDS YPL WL+ P A Q F
Sbjct: 243 GFF-KLCEAGERLNGSVKVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNA 301
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
K R L N+ LK + L+ + P + ++L LHNI +
Sbjct: 302 RQKAARLLGTNALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNILI 351
>gi|297723773|ref|NP_001174250.1| Os05g0184901 [Oryza sativa Japonica Group]
gi|53749284|gb|AAU90143.1| hypothetical protein [Oryza sativa Japonica Group]
gi|255676090|dbj|BAH92978.1| Os05g0184901 [Oryza sativa Japonica Group]
Length = 468
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 94/220 (42%), Gaps = 24/220 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+ + + +++A A+ T A+ ++V CL + G P +
Sbjct: 131 FRRAFRMPRAVFDKLCDDLAAAVAKEDTTLRAAIPVPQRVAVCLWRLATGDPLREVSRRF 190
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G S+ C SI V + R+V+WP+ + F G P V G V +
Sbjct: 191 GLGIST-CHSIILQVCAAITAVLTRVVRWPDSHAA-AASRFQALSGIPGVVGAVHTEHIP 248
Query: 125 IDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
I AP E+ ++ +R +++ + A+ + F V PGS+ DA V+ S
Sbjct: 249 IVAPRENAGEYYDRRLTDRNNKATYSVAMQAVVDADGAFTDVCIGHPGSLSDAAVLTKSA 308
Query: 178 VYRKF-------DEEGWRPFPDAVILGDSAYPLKEWLIPP 210
+Y + D+ W ++G ++YPL W++ P
Sbjct: 309 LYARCEAGLLLGDDPQW-------LVGGASYPLTSWMLVP 341
>gi|449674741|ref|XP_004208247.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 190
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 79/188 (42%), Gaps = 17/188 (9%)
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
+ + F K G GC+DGT V I P ++ + +LNV A+C F
Sbjct: 1 MTKLASQFEVKFGMIQAFGCIDGTHVQIKRPIKNGQDYFCYKQYFSLNVQAVCDSKGYFI 60
Query: 158 AVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRPFPDAVILGDSAYPLKEWL 207
V WPGSVHDA++ NS + +K + + P+ +I GD AYPL +
Sbjct: 61 DVECKWPGSVHDAKMFINSTINKKLIKGTLPQTLYSLPNYHSIPNYLI-GDPAYPLTNFC 119
Query: 208 IP--PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLA 264
I N + LR R IE +FG LK ++ L + + + +V+
Sbjct: 120 IKEFQSCSNNEEVIFNSMLR---SARNQIECAFGRLKARWGFLRKIIDIKIETVPIVIYT 176
Query: 265 STTLHNIA 272
LHN
Sbjct: 177 CFVLHNFC 184
>gi|414883714|tpg|DAA59728.1| TPA: hypothetical protein ZEAMMB73_550258 [Zea mays]
Length = 446
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 115/290 (39%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 105 AFRHFFRTSRKTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 164
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+G G +S++ + + + D + + WP +E L I G PN G
Sbjct: 165 SVGTAFGVGQSTVSQVTWRFIESMEDRA-RYHLAWPSQERLEQIKAVLEDAYGLPNCCGG 223
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VD T V++ P E + + ++++ + I +F + WPGS+ +R+M+ S
Sbjct: 224 VDATHVIMTLPAVESSADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCS 283
Query: 177 MVYRKFDEEGWRPFPDA-----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
+ K E G R I GDS YPL WL+ P A Q F
Sbjct: 284 GFF-KLCEAGERLNGSVKVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNA 342
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
K R L N+ LK + L+ + P + ++L LHNI +
Sbjct: 343 RQKAARLLGTNALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNILI 392
>gi|147772236|emb|CAN71557.1| hypothetical protein VITISV_020377 [Vitis vinifera]
Length = 354
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 108/243 (44%), Gaps = 28/243 (11%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S +I + HEV+ +V+ F + ++ P N N I LR+ F +V G +DGT +
Sbjct: 92 SSQTIHKYFHEVLVAMVN-FSKEIITLPSFNNSTNGISNHRLRQI-FKDVVGAIDGTFIH 149
Query: 125 IDAPNEHEDQFVNRNGDHAL-NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
P + + NVM +C + F V W G+ HD+RV+ ++ +
Sbjct: 150 ACIPTNQQVPYRGCGRRECFHNVMVVCDFDMIFRFVIVGWEGTTHDSRVLTETI---RNP 206
Query: 184 EEGWRPFPDAVILGDSAYP-LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+ + P + + Y ++ WL R+ ++ F +CH R R +IE FG++K
Sbjct: 207 QHNFPMLPSGFM---ALYRNVRYWLSDFRSGGKVVGKEEIFNQCHARLRNVIERVFGVVK 263
Query: 243 EKFPCLNYMRLSP---QFAGMVVLASTTLHNI--------ALTIDRDNENQNVEIHDDEA 291
+FP L R+SP +V+ ++HN L + DNE VE+ D A
Sbjct: 264 ARFPILK--RMSPYTFTTQTKIVMTCFSIHNFLRQISVVDGLFSEYDNE---VELESDNA 318
Query: 292 EDD 294
+
Sbjct: 319 NQN 321
>gi|449676887|ref|XP_004208727.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 267
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 69/154 (44%), Gaps = 3/154 (1%)
Query: 28 LTTPTERNV-ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTF 86
L + E N+ A++ +KV L+++ ++ T G + + I EV IV
Sbjct: 42 LVSLLEPNIRAITPDKKVAITLYYLKDSGTLNMTANTFGIAVCTTSAVIFEVCNAIVKYI 101
Query: 87 FQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHAL 144
R V P+ + + + F K G GCVDGT + I P H + ++L
Sbjct: 102 GPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDGTHIPIVCPTNHSQDYFCYKQHYSL 161
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
V A+C + V WPGSVHDA+V NS +
Sbjct: 162 QVQAVCDYKGSYLDVECMWPGSVHDAKVFSNSSI 195
>gi|328699180|ref|XP_003240854.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 322
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 106/253 (41%), Gaps = 17/253 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ R+SA+ E ++ + P + R LS E + + L ++ G I +
Sbjct: 71 YQNFLRMSASTFEELVCLVGPKIMRFPTRPDILSVGEVLTATLRYLASGESMMSIMYSFR 130
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+++ + I + V+ DT ++ P+ + + + F K PN G +DG ++
Sbjct: 131 IGKATVSKLIFQCCEVLWDTLNTNVLIVPDTKKWAQLGVEFENKWQVPNCIGSIDGKHIV 190
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
A + N H++ ++A+C +Y F V+ G D V NS + + F
Sbjct: 191 HQAFANSGSENYNYKRSHSIILLAMCDASYNFTIVDIGADGRCSDGGVFSNSEMGKGFMA 250
Query: 185 EGWRPFPDA------------VILGDSAYPLKEWLI---PPRTRNPDAAADQRFLRCHKR 229
FP A LGD A+PL L+ P R + + F R
Sbjct: 251 NNLN-FPTAKDIDSNSGPIPYYALGDEAFPLLTNLMRPYPGRGKRKLPLNESIFNYRLSR 309
Query: 230 TRRLIENSFGILK 242
RR IEN+FG++K
Sbjct: 310 GRRTIENTFGLMK 322
>gi|403173013|ref|XP_003332118.2| hypothetical protein PGTG_13485 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170092|gb|EFP87699.2| hypothetical protein PGTG_13485 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 113/281 (40%), Gaps = 31/281 (11%)
Query: 31 PTERNVALSAKEKVLSCL-HWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR 89
P E + ++ K L C + VG G G + R I ++ + +
Sbjct: 138 PIEEQLMVALKR--LGCFGNGASVGMLARFFGVGEGTVELYTNRCIMAILRIKT-----Q 190
Query: 90 MVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVM 147
+++WP E+ +I + + GF G +DG ++ L + P+++ F +R G + + +
Sbjct: 191 IIQWPSPEDRKEIKADYA-EVGFDGCVGLIDGVLIPLAECPSKNGSDFYSRKGSYGITTL 249
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPF-PDAVILGDSAYPLKEW 206
C N + WPG HD RVM NS + E + F P +L DSA+
Sbjct: 250 IACDSNRNINFLYTGWPGCSHDQRVMGNS----RLALEPKQFFSPGEYLLADSAFVTTPT 305
Query: 207 LIPPRTRNPDAAA--DQ-RFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF------ 257
++ R P DQ F R R L+E+ G LK +F L +RL
Sbjct: 306 IVAAFKRPPHGKLTDDQVSFNYYLARHRVLVEHCIGALKARFQSLKGLRLRIDGRNDQIR 365
Query: 258 AGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
+ A LHN + D E D E+D +E+
Sbjct: 366 VNAWIQACAVLHNFLIQGD------EFEFDDAGTENDQDED 400
>gi|16519473|gb|AAL25182.1|AC079852_15 Putative transposase [Oryza sativa]
gi|31430128|gb|AAP52086.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 435
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 118/285 (41%), Gaps = 36/285 (12%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+ +EK+ L+ + + + + G S S R + +V + +R +K P N
Sbjct: 93 IKVEEKLAFFLYMLSHNASFEDLQEKFGHSGDSFHRHVKHFFNSVVPSLSKRFLKPPNPN 152
Query: 98 -----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
+ P + F N G +DGT + I ++ F NR + NVM C
Sbjct: 153 EVHWKIEKDPRFY---PYFKNCLGAIDGTHIPISISSDKAAPFRNRKNTLSQNVMIACDF 209
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-- 210
+ K ++ W GS DARV+ +++ +G++ P L D Y + P
Sbjct: 210 DLKITFMSTGWEGSATDARVLTSAV------NKGFQVPPGKFYLVDGGYANTNSFLAPYR 263
Query: 211 --RTRNPDAAADQR--------FLRCHKRTRRLIENSFGILKEKFPCL---NYMRLSPQF 257
R + A +R F H R +E + G++K++FP L + ++ Q
Sbjct: 264 KVRYHLKEYGAGRRRPQNYKELFNHRHAVLRNHVERTLGVVKKRFPILKVATFHKIKNQV 323
Query: 258 AGMVVLASTTLHNIALTIDRDNE---NQNVEIHDDEAED--DGEE 297
+ +A+ HNI L+++ D + NQ IH D DG+E
Sbjct: 324 K--IPVAAAVFHNIILSLNGDEQWLNNQPHNIHPSNYVDLPDGDE 366
>gi|449690868|ref|XP_004212486.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 262
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 10/145 (6%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FPN G +DG V I+AP+ + N H++ ++AIC Y F V+ G ++A
Sbjct: 21 FPNCIGAIDGKHVCIEAPSLSGSAYYNYKNFHSMVLLAICDAKYCFTLVDIGSYGRDNNA 80
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA----------VILGDSAYPLKEWLIPPRTRNPDAAAD 220
+ S + + F ++ + V++GD + LK WL+ P + +
Sbjct: 81 SIFNESKMGKAFKNNLFKLSKNRMLSNGTQVPPVLVGDDIFALKSWLMKPFSGKNLTIKE 140
Query: 221 QRFLRCHKRTRRLIENSFGILKEKF 245
+ F RTRR IEN+FGI+ K+
Sbjct: 141 RIFNYRLSRTRRTIENTFGIMVAKW 165
>gi|403162731|ref|XP_003322904.2| hypothetical protein PGTG_04441 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173057|gb|EFP78485.2| hypothetical protein PGTG_04441 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 575
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 112/265 (42%), Gaps = 26/265 (9%)
Query: 6 FKEAFRLSATQVEFILEEIA-----------PALTTPTERNVALSAKEKVLSCLHWMGVG 54
FK+A R + ++L I P L P + + L E++ S + VG
Sbjct: 249 FKQAVRTTKAGFMWLLGRIMFNSVFHSASFRPQLPVPHQLALTL---ERLGSNGNGASVG 305
Query: 55 SPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+G G ++ ++ V+ I + + WP+E+ ++ GF
Sbjct: 306 RFSRNLGVGRG----TVIKASRRVIQAI-NELSHTYLLWPDEDRRKEISEVMKAEGFEGC 360
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + L P+ + F +R +++N IC + WPGS D+ V
Sbjct: 361 IGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITGYMTGWPGSCGDSMVF 420
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA--AADQRFLRCHKRTR 231
+ V+R + G P ++ DSAY L IP + P A + F C R+R
Sbjct: 421 KKMAVHR---DPGRFFDPGQYLIADSAYELGLHCIPA-YKAPAAYILENTEFNYCLARSR 476
Query: 232 RLIENSFGILKEKFPCLNYMRLSPQ 256
E++ GILK ++ L ++RL+ Q
Sbjct: 477 VRNEHTIGILKGRWASLQHLRLAIQ 501
>gi|331222274|ref|XP_003323811.1| hypothetical protein PGTG_05713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309302801|gb|EFP79392.1| hypothetical protein PGTG_05713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 113/259 (43%), Gaps = 14/259 (5%)
Query: 6 FKEAFRLSATQVEFILEEIA--PALTTPTERNVALSAKEKVLSC--LHWMGVGSPYHIIG 61
FK+A R + ++L EI P + + R A + L+ L G G+
Sbjct: 105 FKQAVRTTKPGFIWLLNEIQLNPVFHSGSFRPQLPIAHQLALTLERLGSNGNGASVGRFS 164
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGT 121
+ ++ ++ V+ I D R + WP+ + +++ GF G VDGT
Sbjct: 165 RNLSVGRGTVVKASRRVIQAIND-LSGRYLLWPDTDRRREISEVMKEEGFEGCVGFVDGT 223
Query: 122 MV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ L P+ + F +R +++N IC + A WPGS D+ V + +++
Sbjct: 224 TIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDQFITAYMTGWPGSCGDSMVFKRMLLH- 282
Query: 181 KFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDA--AADQRFLRCHKRTRRLIENS 237
+E + F D ++ DSAY L IP + P A + F C R+R E++
Sbjct: 283 ---QEPEKFFGDGQYLIADSAYELGVHCIPA-YKAPAAFILENTEFNYCLARSRVRNEHT 338
Query: 238 FGILKEKFPCLNYMRLSPQ 256
GILK ++ L +RL+ Q
Sbjct: 339 IGILKGRWASLQDLRLAIQ 357
>gi|326528045|dbj|BAJ89074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/274 (20%), Positives = 118/274 (43%), Gaps = 19/274 (6%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
LS +++V L + G +G + G + S++ V V+ ++WP
Sbjct: 90 GTVLSLEDRVAVALRRLNSGGSLVTVGTSVGVNHSTVSLITWRFVEA-VEARAGHHLRWP 148
Query: 95 EEN-LNDIPMTFLRKGGFPNVAGCVDGTMVL--IDAPNEHEDQFVNRNGDHALNVMAICG 151
+ + + I F + G PN G VD T ++ + + + +++ ++++ + A+
Sbjct: 149 DSDEMAMIKSKFEKIHGLPNCCGVVDTTHIIMCLSSAEPNCKVWLDHEKNYSMVLQAVID 208
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSA 200
P+ +F + WPGS+ ++ ++ +S ++R E G R + ++GD+
Sbjct: 209 PDMRFTDIVTGWPGSMKESSILHSSGLFR-LCENGVRLNGGKLMVSDGSEVGEYVIGDAG 267
Query: 201 YPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM 260
YPL WL+ P N + F + H R + + K+ + L P +
Sbjct: 268 YPLLPWLLTPYQENDLSDLKVEFNKRHSAARTVALKALARFKDTWKFLQGEMWRPDKHKL 327
Query: 261 --VVLASTTLHNIALTIDRDNENQNVEI-HDDEA 291
++ LHNI + ++ D + +I HD +A
Sbjct: 328 PRIIHVCCLLHNIVIDMEEDAAMDDAQISHDHDA 361
>gi|405975517|gb|EKC40076.1| Putative nuclease HARBI1 [Crassostrea gigas]
Length = 695
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 1/119 (0%)
Query: 32 TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMV 91
T R+ A+ A +VL+CL ++ G HG SKSS C ++H VV +
Sbjct: 499 TLRSHAIPAHIQVLTCLRFLAKGDYLSETTDIHGISKSSGCLAVHRVVDAVCKALQNIEF 558
Query: 92 KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA-PNEHEDQFVNRNGDHALNVMAI 149
E+++ I +F + GFPNV G +DGT + I + E +V R G H++NV A+
Sbjct: 559 PTRREHISRIKASFYKIAGFPNVIGAIDGTQIPIQGMGTDDEHLYVCRKGFHSINVQAV 617
>gi|9279561|dbj|BAB01019.1| unnamed protein product [Arabidopsis thaliana]
Length = 355
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 35/193 (18%)
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNA 161
PM + GF G +DG V + E + + NR+ + +LN+MAIC N F +
Sbjct: 99 PMQVITVIGF---VGAMDGKHVCVKVKPELQGMYWNRHDNASLNIMAICDLNMLFTYIWN 155
Query: 162 NWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI------------- 208
P S HD V++ + ++ D E P + L DS YP K+ +
Sbjct: 156 GAPDSCHDTVVLQ---IAQQSDSEFHLPPSEKYYLVDSGYPNKQGFLALYRSSQNRVVRY 212
Query: 209 ---------PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFA 258
PPR ++ + F +CH R +IE +FG+ K+K+ L ++ R +
Sbjct: 213 HMSQFYFGPPPRNKH------ELFNQCHASLRSVIERTFGVWKKKWRILSDFPRYNVHVQ 266
Query: 259 GMVVLASTTLHNI 271
VV+A+ L+NI
Sbjct: 267 KRVVMATVGLYNI 279
>gi|147798741|emb|CAN63216.1| hypothetical protein VITISV_033852 [Vitis vinifera]
Length = 660
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 114/254 (44%), Gaps = 42/254 (16%)
Query: 67 SKSSICRSIHEVVPVIV---DTFFQR--MVKWPEE---NLNDIPMTFLRKGGFPNVAGCV 118
S +I R H+V+ I+ D F ++ +K P+E N P F + G +
Sbjct: 213 SSETISRHFHQVLRAIISLDDVFLKQPDGLKCPQEIKDNTKFWPY-------FKDCIGAI 265
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG+ + N+ ++ RN NV+A C + KF V W GS D+R++ N++V
Sbjct: 266 DGSHFRVKVSNDVVHRYRGRNYYPTQNVLAACSFDLKFTYVLPGWEGSASDSRILDNALV 325
Query: 179 YRKFDE----EGWRPFPDA---VILG------DSAYPLKEWLIPPRTRNPDAAADQRFLR 225
R FD+ +G DA + +G + Y LKE+ + P+ + LR
Sbjct: 326 -RDFDKLIVPQGKYYLADAGFQLKIGFLTPYRSTRYHLKEYSV----HQPENDREVFNLR 380
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIALTIDRDNE- 280
H R IE +FG+LK++FP + P + ++LA LHN + +D D
Sbjct: 381 -HSSLRNAIERAFGVLKKRFPIIA-SGTEPHYPVDTQSDIILACCILHNYLMGVDPDERL 438
Query: 281 --NQNVEIHDDEAE 292
+ E+ +EAE
Sbjct: 439 IAEVDRELFSEEAE 452
>gi|58378661|ref|XP_308930.2| AGAP006816-PA [Anopheles gambiae str. PEST]
Length = 325
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/277 (21%), Positives = 120/277 (43%), Gaps = 17/277 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPY--HIIGAT 63
F + F +S + + + IAP L + E V +++ + L ++ G+ +G
Sbjct: 49 FLKCFHVSRDIFKSLYDGIAPTLLSQYEATV----YDRIAATLRYLATGTCTSDRGVGGY 104
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI--PMTFLRK--GGFPNVAGCVD 119
++ + + I + + +N ++ ++ R+ G P VA C
Sbjct: 105 KAMPRAEFQAMFPQTLNAIRQLICNQHISLKPQNRDEQWEATSYFRRILGPTPGVAFCAV 164
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
GT + I P+E + F + G ++LN + I + VNA++ G++HD+ + +S V
Sbjct: 165 GTHIPIAEPSEQKHLFYYKYGTYSLNALMIFDHRKRIRYVNASFCGAMHDSHLWNSSGVD 224
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
F ++ + +L +S +P + W+I P+ D + R H+R + +
Sbjct: 225 SHFAQQHAKGNEKCNLLANSMFPSQPWIIKPKPGIVDPTFNAR----HERALATADTAVH 280
Query: 240 ILKEKFPCLNY---MRLSPQFAGMVVLASTTLHNIAL 273
+LK +F CL + +P V+ LHN+ L
Sbjct: 281 LLKNRFKCLLGKPPIPYTPPECVAVIDVCCALHNMCL 317
>gi|292624800|ref|XP_002665789.1| PREDICTED: hypothetical protein LOC566826 [Danio rerio]
Length = 287
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 91/208 (43%), Gaps = 5/208 (2%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK F+L+ TQ E +++ +AP R + VLS L + Y +
Sbjct: 58 FKSHFQLTPTQTEELVQLLAPCKWAAI-RQEDWTVWHAVLSSLWTLSTQESYQSVANRFH 116
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT-FLRKGGFPNVAGCVDGTMVL 124
++S IC + E +V T + WP+ D+ + F G P+ V ++
Sbjct: 117 VAESLICDQMDEFC-TLVTTNLANHIHWPQAEEADMCVKGFFSAVGLPDTLCVVGTRLIP 175
Query: 125 IDAPNEHEDQFVNRN--GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I P + D V R+ G + +MA C +F V+A P + H++RV+ + V +
Sbjct: 176 IMKPTDVPDSEVYRDTEGAYFAKLMAFCDHKGRFTYVSAEHPINWHNSRVLSATEVGKAL 235
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP 210
E+ ILGDS +PL E ++ P
Sbjct: 236 QEDPVALLHGKHILGDSTFPLTEHVLTP 263
>gi|357478393|ref|XP_003609482.1| hypothetical protein MTR_4g116180 [Medicago truncatula]
gi|355510537|gb|AES91679.1| hypothetical protein MTR_4g116180 [Medicago truncatula]
Length = 494
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 135/327 (41%), Gaps = 45/327 (13%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTER---NVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
+AFR+S +L+ ++P LT+ + + AL+A + + G+ Y+ +
Sbjct: 101 DAFRMSNPSFFHLLDLLSPTLTSSIPQITPDCALAAA------IFRLAHGASYNSVARRF 154
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G S S CR+ V + D + + + F PN G +L
Sbjct: 155 GISPSDACRAFFTVCKAVNDNLGNLFELRTDSDRVVVGFGF---SSLPNCFG------IL 205
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY----- 179
A E +++ + +NG L V A+ +F V++ WP S+ ++ S +Y
Sbjct: 206 GLAGFEIDEEILGKNG--FLLVQALVDSEGRFLDVSSGWPNSMKPETILHESKLYHGVVE 263
Query: 180 -RKFDEEGWRPFPDA-----VILGDSAYPLKEWLIPPRTRNPD----AAADQRFLRCHKR 229
R+ + D +LGDS +PL WL+ P +R + ++A+ F H R
Sbjct: 264 SRELLQGPSYKLSDGSLIPQYVLGDSCFPLLPWLLTPYSRGNEEDGFSSAEIAFNSTHSR 323
Query: 230 TRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTID----RDNEN 281
L ++FG L+ ++ L+ + R ++ VV+ LHN + + RD
Sbjct: 324 AMGLFGDAFGRLRTRWQLLSDSRKWKRGCVEYLPFVVVTGCLLHNFLIKCNDPLLRDKGV 383
Query: 282 QNVEIHDDEAEDDG--EEEAVDDPDEI 306
VE D DG +E AV D +
Sbjct: 384 SCVEKEGDVLASDGVVDESAVRIRDAL 410
>gi|270016707|gb|EFA13153.1| hypothetical protein TcasGA2_TC006950 [Tribolium castaneum]
Length = 305
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 29/238 (12%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVV--PVIVDTFFQRMVKWPEENLNDIPMTFLRK 108
MG+ S ++II T C++I V+ P+ ++ + KW + N D F
Sbjct: 1 MGLTSIHNIIYNT--------CKTIWAVLAAPIYLNKITES--KW-KVNAKD----FFTL 45
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
F N G +DG + + AP F N + ++A C NY F ++ GS
Sbjct: 46 WNFSNCIGALDGKHITVQAPPNSGSLFYNYKNSFNIVLLAACDANYIFTLIDIGAYGSQS 105
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVI-----------LGDSAYPLKEWLIPPRTRNPDA 217
D + + S+ R + PD I + D A+PLK++++ P
Sbjct: 106 DGGIFKESIFGRAMESNNLNIPPDDYIPATNVKIPYAVVADEAFPLKKFIMRPYPGRQLT 165
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALT 274
+ F R RR+IEN+FGIL ++ L + + P+ ++V A LHN T
Sbjct: 166 HKKRIFNYRLSRARRVIENAFGILTARWRILKHNINAHPKNVDVMVQAIVVLHNFCKT 223
>gi|31432223|gb|AAP53885.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 777
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 90/189 (47%), Gaps = 19/189 (10%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G VDGT + + + + +NR+ + NV+ I G + + +A WPGSVHD RV+
Sbjct: 554 GAVDGTHIPVTVGVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVLNE 613
Query: 176 SMVYRKFDEEGWR-PFPDAVILGDSAYPLKEWLIPPRTR-------------NPDAAADQ 221
++ + EE R PF +L DS YP + + P R P ++
Sbjct: 614 AI--DSYPEEFPRLPF-RKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGREE 670
Query: 222 RFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP--QFAGMVVLASTTLHNIALTIDRDN 279
+F H + R +IE FGI+K+++ L + +P ++LA+ LHN + +++
Sbjct: 671 KFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDSKQND 730
Query: 280 ENQNVEIHD 288
N +++
Sbjct: 731 YQANNPLYN 739
>gi|357128444|ref|XP_003565883.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 325
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 76/179 (42%), Gaps = 25/179 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V + P+ D+F R NV+A+ + F V+A W GS HD+
Sbjct: 98 FKDCIGAIDGTHVPCNVPSRMVDRFRGRKPFPTQNVLAVVDFDLIFTYVSAGWEGSAHDS 157
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--------------RTRNPD 216
V+R+S+ + G R L D+ Y + +PP R R P+
Sbjct: 158 TVLRHSLEH----PNGLRVPEGKYYLADTGYAARRGFVPPFRQTRYHLREWRGNRARTPN 213
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV--VLASTTLHNIAL 273
+ R H R +E +FG LK +F L P F V V+A LHN L
Sbjct: 214 ELFNLR----HSSLRTTVERAFGTLKNRFKVLTTRPYYP-FPSQVRLVIACCILHNCIL 267
>gi|449674857|ref|XP_004208274.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 262
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 84/198 (42%), Gaps = 16/198 (8%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
IAP +P R ALS ++K+ L+++ + G + +++ I +V IV
Sbjct: 11 IAPNPKSPNYR--ALSTEKKLAITLYYLKDTGSLLLTANCFGIAVNTVSLIITQVCKNIV 68
Query: 84 DTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
+ P+ + F K G GC+DGT V I P E+ +
Sbjct: 69 YHLGPIYISLPKTKTEMRQKVAEFESKFGMIQAFGCIDGTHVPIACPVENSQDYFCYKQY 128
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE-----------EGWRPF 190
+++NV A+C F V WPGS+H+++V NS + RK G+
Sbjct: 129 YSMNVQAVCDYKGMFMDVECVWPGSIHNSKVFANSSINRKLRNGSLPGIFQSILHGYEKI 188
Query: 191 PDAVILGDSAYPLKEWLI 208
I+GD AYPL + I
Sbjct: 189 -GTYIIGDPAYPLTPFCI 205
>gi|357141517|ref|XP_003572253.1| PREDICTED: uncharacterized protein LOC100846826 [Brachypodium
distachyon]
Length = 416
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 100/220 (45%), Gaps = 19/220 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI-PMTFLRKGGFPNVAGCV---DGTM 122
S +I HEV+ ++ +K + N + + P + FP++ C+ DGT
Sbjct: 130 SGDTIHHKFHEVLLCVIK-MAAHYLKPKDPNFHSVHPTIRNDRMAFPHLKDCIGAIDGTH 188
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ P + + ++V R+ NV+AIC + F + PGS+HD V+ +++ K
Sbjct: 189 IRASIPADKKIRYVGRSDTTTQNVLAICDFDMHFTYASIGQPGSMHDTSVLYHAL--EKD 246
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPD---AAA----DQRFLRCHKRTR 231
+ P L D+ YP + + P R PD AA ++F + H R
Sbjct: 247 KDTFSHPPKGKYYLVDADYPNRPGYLAPYKGERYHVPDFYRGAAPNTPKEKFNKIHSSKR 306
Query: 232 RLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
IE +FG+LK K+ L M P + M+V A+ TLHN
Sbjct: 307 NAIERAFGVLKNKWQILLKMPNYPIEGQKMIVAAAMTLHN 346
>gi|328702145|ref|XP_001949529.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 233
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 82/189 (43%), Gaps = 19/189 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F FR+S + EI L ER+ L + +VL +++ G G
Sbjct: 43 FLNMFRVSPELAMDLTNEIRDQLQR--ERSTGLPEEIQVLVVINFYAKGGYQRATGDNFV 100
Query: 65 -GPSKSSICRSIHEVVPVIVDTFFQRMVKWP---------EENLNDIPMTFLRKGGFPNV 114
S+ S+ R IH V I +R V++P E ++ P F
Sbjct: 101 VNVSQPSVSRCIHSVTDAINTKLLRRWVRFPMTAIERDKAREKFSNAPQ------AFEGA 154
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G VD T + I AP HE+ +VN +G+H+LNV A+ P +NA +PG+ +D+ +
Sbjct: 155 IGAVDCTHINILAPKNHEEAYVNHHGNHSLNVQAVVDPESNILNINARFPGARNDSYIWS 214
Query: 175 NSMVYRKFD 183
S + R +
Sbjct: 215 VSPIRRAME 223
>gi|390342959|ref|XP_003725766.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 288
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 89/209 (42%), Gaps = 4/209 (1%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTPTE-RNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P ++K R++ E ++ ++ P L T L K+ L ++ G+ Y +
Sbjct: 75 PKSYKNFLRITPDLFEEMVGKLTPHLEKKTTFMREPLEVGLKLAVTLRFLATGNSYTTLQ 134
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVD 119
+ KS+I R I EV +++ + + ++ P E+ D+ F + + N G VD
Sbjct: 135 YSFRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTEDGWKDVAKKFSSRWNYHNCLGAVD 194
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
G V I P + + N G H++ +MA+ YKF V+ G D +N +Y
Sbjct: 195 GKHVAIRKPPKSGSLYFNYKGFHSIVLMAVADAEYKFLYVDVGAEGGSSDGGTWKNCNLY 254
Query: 180 RKFD-EEGWRPFPDAVILGDSAYPLKEWL 207
+ E P P+ + D P WL
Sbjct: 255 DAIEGERAGVPPPEPMPNDDMPIPFHLWL 283
>gi|328699625|ref|XP_003240991.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 190
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 72/155 (46%), Gaps = 19/155 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F AFRLS V I++ + +L +R LS + +VL CLH+ GS +G
Sbjct: 43 FMGAFRLSQQLVINIIDVLRTSLQH--KRTSGLSPELQVLVCLHFYAQGSYQKGLGGNSI 100
Query: 65 -GPSKSSICRSIHEVVPVIVDTFFQRMVKWP---------EENLNDIPMTFLRKGGFPNV 114
S+ S+ R IH V I ++ +K+P EN P F
Sbjct: 101 LNISQPSVSRCIHYVTEAINQRLLRKWIKFPMTSAERQRARENFAHAPQPF------EGT 154
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
G +D T + I AP +HE+ +VN +G H+LNV A+
Sbjct: 155 IGAIDCTYIHILAPRQHEEAYVNHHGKHSLNVQAV 189
>gi|170068163|ref|XP_001868759.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864268|gb|EDS27651.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 302
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 91/206 (44%), Gaps = 13/206 (6%)
Query: 84 DTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHA 143
DTFF++ ++ P + L FPN G +DG + I AP F N +
Sbjct: 85 DTFFRKAIE-PSQRLALCLSEINTIWNFPNCLGALDGKHITIQAPPNSGSTFFNYKKTFS 143
Query: 144 LNVMAICGPNYKFYAVNA-----NWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA----- 193
+ ++A+ N F AV+ N G + + V+ ++ + + P P
Sbjct: 144 VVLLALVDANCNFIAVDVGAYGRNSDGGIFASSVLGKALAGNRLNVPEAAPLPGTNTLTP 203
Query: 194 -VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-M 251
VI+GD A+PLK +L+ P + A + F R RR EN+FGI + F N +
Sbjct: 204 HVIVGDEAFPLKPYLMRPYPGDNLDNAKRIFNYRLSRIRRASENTFGIYMQMFRIFNRPI 263
Query: 252 RLSPQFAGMVVLASTTLHNIALTIDR 277
+ P+ +++A+ L+ ++ R
Sbjct: 264 QAKPENVDKIIMAAIVLYQFIPSLQR 289
>gi|427780733|gb|JAA55818.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 214
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 76/165 (46%), Gaps = 25/165 (15%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T F++ FRLS V + +E+ P + +R LS + KVL L + GS +
Sbjct: 49 LTEEEFRQCFRLSKRTVRSLCDELEPII--GCQRASGLSTERKVLCALRFFATGSFQRSV 106
Query: 61 GATH--GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT----------FLRK 108
G G ++S++ +IHEV I+ +R L D +T F R+
Sbjct: 107 GREEHIGMAQSAVSNTIHEVTEAIITVSARR-------KLVDFSLTPAAKEEAKAAFARR 159
Query: 109 GGFPNVAGCVDGTMVLIDAPNEH----EDQFVNRNGDHALNVMAI 149
G P V CVDGT+V I P F++R G +ALNVM +
Sbjct: 160 GDIPGVLACVDGTLVAIRKPEGFNLPDTASFMSRKGYYALNVMIV 204
>gi|331225110|ref|XP_003325226.1| hypothetical protein PGTG_06763 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 336
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 99/203 (48%), Gaps = 23/203 (11%)
Query: 101 IPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + +R+ GFP G VDGT + L P + + +R ++++V IC N KF +
Sbjct: 114 IAVAVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISY 173
Query: 160 NANWPGSVHDARVMRNSMVY----RKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRN 214
A +PGS HD+ V + V + FD++ + +L DSAY +++P + ++
Sbjct: 174 LAGFPGSSHDSYVFSHMQVAQYPEKYFDQKQF-------LLADSAYTNNCYVVPAFKGKH 226
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAGMV--VLASTTL 268
+ F ++R IE++ GILK F L +R + + G V ++ L
Sbjct: 227 LLKRCNINFNYHLHKSRVRIEHAIGILKGCFASLREIRTQIRNAEEMKGAVKWIITCIVL 286
Query: 269 HNIALTIDRDNENQNVEIHDDEA 291
HN+ + +D N ++++DEA
Sbjct: 287 HNLLADL-KDKWN---DLYEDEA 305
>gi|393235223|gb|EJD42779.1| hypothetical protein AURDEDRAFT_67179 [Auricularia delicata
TFB-10046 SS5]
Length = 467
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 80/195 (41%), Gaps = 25/195 (12%)
Query: 118 VDGTM-VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDGT+ V P + +R +++ NV I P+ K PGSVHDA + +
Sbjct: 226 VDGTLAVFYSKPGHFGVSWYDRKSNYSTNVQIISTPDCKIRDFGVGLPGSVHDATAWKAT 285
Query: 177 MVYRKFDEEGWRPF-PDAVILGDSAYPLKEWLIPPRTRNP--------DAAADQRFLRCH 227
Y E F P I DSAYPL+ WL P +N D + RF
Sbjct: 286 WTY----ENSASIFQPGEFIWADSAYPLQSWLCAPYKKNSSDTPRPERDEPENARFNYHV 341
Query: 228 KRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV------VLASTTLHNIALTIDRDNEN 281
R R E+ G LK +F L +RL V ++A +HN ALT
Sbjct: 342 SRVRVRSEHCIGYLKGRFQSLRGLRLRVDKQADVLRLDSWIIACICVHNFALT-----RE 396
Query: 282 QNVEIHDDEAEDDGE 296
+ + H D+ + GE
Sbjct: 397 HSEDFHLDKFFEKGE 411
>gi|357153511|ref|XP_003576474.1| PREDICTED: uncharacterized protein LOC100832068 [Brachypodium
distachyon]
Length = 528
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 67/161 (41%), Gaps = 24/161 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + P E + NR G + NVMA+C + +F V WPGSVHD
Sbjct: 338 FKDAIGAIDGTHIPVIVPAELKVIHTNRKGYTSQNVMAMCDFDMRFIFVILGWPGSVHDT 397
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
R S + R + +P P + F CH
Sbjct: 398 RSWVLSSIQR-----------------------ATYHVPEFEDAPPVGIQETFNHCHSSL 434
Query: 231 RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHN 270
R +IE +FG+LK K+ L + P +++A LH+
Sbjct: 435 RNVIERTFGVLKMKWRILLGILAYPPLKQKKIIIACMCLHD 475
>gi|346467333|gb|AEO33511.1| hypothetical protein [Amblyomma maculatum]
Length = 340
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 118/298 (39%), Gaps = 15/298 (5%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+ P F++ FR +++ + T + +N+ + +E + L + + + +
Sbjct: 31 IEPSLFRQQFRFDKADFDYLFGALQVPETVKSAQNITVPGREALCITLRRLAYPNRWCDL 90
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRM-----VKW--PEENLNDIPMTFLRKGGFPN 113
A G S + +VV I +F + W P R PN
Sbjct: 91 EAIFGRHSSVMSSVATQVVHHIAHSFKNLLNDSNNHDWLSPAHLAKFASAVQNRGAALPN 150
Query: 114 VAGCVDGTMVLIDAPNEHE-DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
G +DGT I P ++ + F H + ++ PN ++ +PG HDA +
Sbjct: 151 CWGFIDGTARPICRPTRNQREYFSGHKRMHVVKYQSVMCPNGIVCQLDGPYPGRRHDAGI 210
Query: 173 MRNSMVYRKFDE-EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
++ S +Y K ++ G F V+ GD AYPLK L+ P + A F R R
Sbjct: 211 LKTSGLYTKLEKLVGHHQF---VLYGDPAYPLKPLLMKPYGGSSLTATQLAFNYSMSRVR 267
Query: 232 RLIENSFGILKEKFPCLNY---MRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
+ +E FG + +F L++ +L Q + A+T L N + +Q I
Sbjct: 268 QSVEWGFGKVVAEFAFLDFKKNQKLLRQQVAQMYKAATILTNCHTCLYGSQVSQYFGI 325
>gi|359486961|ref|XP_003633498.1| PREDICTED: uncharacterized protein LOC100855100 [Vitis vinifera]
Length = 409
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 83/204 (40%), Gaps = 18/204 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + ++GT + P + + F R NVM C + F V A W G+ +DA
Sbjct: 173 FKDCISAINGTHISAWVPADRQTSFRGRKTVITQNVMCACNFDMMFTFVYAGWEGTTNDA 232
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
RV +++ R W P + DS YP +PP RN
Sbjct: 233 RVFLDALT-RPEVNFPW-PSEGKYYVVDSGYPCISGFLPPYRGERYHLQEYRGRRNQPIR 290
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHN-IALTID 276
+ F H R +IE FG+LK +FP M P +V+A TLHN I L+
Sbjct: 291 YKELFNYRHSSLRNIIERCFGVLKTRFPISRMMPCYKPSRQPSIVVACCTLHNWIRLSTR 350
Query: 277 RDNENQNVEIHDDEAEDDGEEEAV 300
D + E+ D E GEEE+
Sbjct: 351 NDQLFREYEVEDLSIE--GEEEST 372
>gi|299749160|ref|XP_001838556.2| hypothetical protein CC1G_11885 [Coprinopsis cinerea okayama7#130]
gi|298408310|gb|EAU83249.2| hypothetical protein CC1G_11885 [Coprinopsis cinerea okayama7#130]
Length = 453
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 92/201 (45%), Gaps = 16/201 (7%)
Query: 108 KGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
+ GFP+ G +D T++ L D P+ + + R A+NV +C +F + WPGS
Sbjct: 209 RTGFPDCIGMIDCTLIRLTDVPSMWGEVYYCRKKYPAVNVQGVCDHKLRFISFEMGWPGS 268
Query: 167 VHDARVMRNSMVYRKFDEEGWRPFP-DAVILGDSAYPLKEWLIPPRTR-------NPDAA 218
D V++NS ++R + F D + D Y +L+ P T P+
Sbjct: 269 TPDVTVLKNSDLWRNRT----KYFTGDQYLFADRGYQSSPYLLRPFTEPEVDAFAGPERR 324
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTI- 275
F R TR IE++FG+LK +F L + + V+ A LHN+ +
Sbjct: 325 RRLDFNRTLSGTRIYIEHAFGLLKGRFHSLKDLGRHRNVNEIYQVIHALMVLHNLCIDWE 384
Query: 276 DRDNENQNVEIHDDEAEDDGE 296
D ++ + ++ ++ ED+GE
Sbjct: 385 DSPHDFIDPDVLEEMTEDEGE 405
>gi|359475599|ref|XP_003631712.1| PREDICTED: uncharacterized protein LOC100853260 [Vitis vinifera]
Length = 506
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 98/210 (46%), Gaps = 25/210 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG+ + N+ ++ RN NV+A C + KF V W GS D+
Sbjct: 112 FKDCIGAIDGSHFRVKVSNDVVHRYRGRNYYPTQNVLAACSFDLKFTYVLPGWEGSASDS 171
Query: 171 RVMRNSMVYRKFDE----EGWRPFPDA---VILG------DSAYPLKEWLIPPRTRNPDA 217
R++ N++V R FD+ +G DA + +G + Y LKE+ + P+
Sbjct: 172 RILDNALV-RDFDKLIVPQGKYYLADAGFQLKIGFLTPYRSTRYHLKEYSV----HQPEN 226
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIAL 273
A + LR H R IE +FG+LK++FP + P + ++LA LHN +
Sbjct: 227 AREVFNLR-HSSLRNAIERAFGVLKKRFPIIA-SGTEPHYPVDTQSDIILACCILHNYLM 284
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDP 303
+D D E E+ + ++ E E++ P
Sbjct: 285 GVDPD-ERLIAEVDRELFSEEAEFESMAHP 313
>gi|390353552|ref|XP_001188248.2| PREDICTED: uncharacterized protein LOC755075 [Strongylocentrotus
purpuratus]
Length = 357
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 87/201 (43%), Gaps = 46/201 (22%)
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E +N + F GFP V G VDGT + ++ A GP
Sbjct: 2 EEVNRAQVDFFNISGFPQVIGVVDGTHIRLNG--------------------APLGPGE- 40
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNP 215
H++RV + + G ++LGDS Y L+ +L+ P NP
Sbjct: 41 ------------HNSRV---GQTFEDLQQHG-------LLLGDSGYALRPYLMTP-VLNP 77
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALT 274
A+Q + R H TR IE G LK F C LN M+++P+ A ++ A LHN+A
Sbjct: 78 RTPAEQAYNRAHATTRVRIEQVNGQLKHNFRCLLNGMQMAPRRACKIITACAVLHNVAKD 137
Query: 275 IDRDNE-NQNVEIHDDEAEDD 294
+ + +E + +E H+ E + D
Sbjct: 138 LKQPDEVDFGIEQHEHEPDMD 158
>gi|357116426|ref|XP_003559982.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 265
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 85/179 (47%), Gaps = 19/179 (10%)
Query: 108 KGGFPNVAGCV---DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
K +P++ C+ DGT + P + + +++ R+G + NV+AIC + F + P
Sbjct: 24 KRAYPHLKDCIGALDGTHIRASLPADQQVRYIGRSGTASQNVLAICDFDMCFIYASIGQP 83
Query: 165 GSVHDARVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLKEWLIPP----RTRNPD--- 216
G++HD V+ ++ + D + + P L D+ YP + + P R PD
Sbjct: 84 GAMHDTSVLYPAI---RVDHDIFPHPSKGKYYLVDAGYPNRPGYLAPYKGERYHVPDFHR 140
Query: 217 ----AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHN 270
+ ++F R H R +IE SFG+LK K+ L M S ++V AS LHN
Sbjct: 141 GAEPSTPKKKFNRVHPAVRNVIERSFGVLKMKWRILLTMPNYSMDKQKLIVAASMVLHN 199
>gi|292616811|ref|XP_001344486.2| PREDICTED: putative nuclease HARBI1 [Danio rerio]
Length = 457
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 64/313 (20%), Positives = 134/313 (42%), Gaps = 20/313 (6%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+E F L +T+++ LE++ T T ++ ++ + L + + Y + G
Sbjct: 105 REIFLLLSTKLKPQLEQVYIQQTQQT-----ITLEKCIAMALMRLSTSTEYCFLSELFGV 159
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++C+ + EV I+ ++ P ++ L D F GFP+ G +D + +
Sbjct: 160 PIPAVCQCVREVCEAIILLLKPIYMQLPAQHELEDNVEQFRSLWGFPHCIGVIDSLHIPV 219
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+ P N +G H++ + + + F+ V + + GS D ++++S ++ ++
Sbjct: 220 NLPAAESQDCWNPSGWHSVVLQGVVNAHGNFWDVCSGFTGSTDDMTILQSSELWTMAEKG 279
Query: 186 GWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT-RRLIE 235
G+ P ++LGD + L+ WL+ + + Q+ ++IE
Sbjct: 280 GFCSQPPKELMGRPLGFLLLGDVGFSLQNWLLKCYPSSSNLTPQQQTFNVKLNLGLKVIE 339
Query: 236 NSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVE-IHDDEAED 293
+F LK ++ CL N + + +A LHNI D +++ VE + +E
Sbjct: 340 QAFQRLKARWQCLHNRNDNNVDLVSKMAVACCILHNICSVHDSPFKDEWVEALSQNECPQ 399
Query: 294 --DGEEEAVDDPD 304
D +DDP+
Sbjct: 400 PKDVSTVYIDDPN 412
>gi|223944065|gb|ACN26116.1| unknown [Zea mays]
Length = 298
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 119/300 (39%), Gaps = 25/300 (8%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+ +E+V L+ +G ++G +G S ++ R ++V+ I + + + E
Sbjct: 1 MCVEEQVAMFLNTVGHNLRNRLVGTNYGRSGETVSRYFNKVLRAIGELRAELIRPPSLET 60
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
+ I F + G +DGT + + E F R + NVMA + +F
Sbjct: 61 PSKIAGNPRWDPYFKDCIGAIDGTHIRASVSIDMEPSFRGRKSYASQNVMAAVDFDLRFT 120
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------- 210
V A W GS HDA V+R+++ E G R L D+ Y K +PP
Sbjct: 121 YVLAGWEGSTHDALVLRDALE----RENGLRVPQGKFYLVDAGYGAKPGFLPPFRAVRYH 176
Query: 211 ---RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVL 263
NP + F H R +E +FG LK +F L+ + S Q +V+
Sbjct: 177 LNEWGNNPVQNEKELFNLRHSSLRVTVERAFGSLKRRFKILDDATPFFPFSTQVD--IVV 234
Query: 264 ASTTLHNIALTIDRDN---ENQNVEIHDDEAEDDGE--EEAVDDPDEIDEEEQPRQDRNN 318
A +HN + D E N IH+ G+ E A + E Q +DR N
Sbjct: 235 ACCIIHNWVIQDGGDELIIEESNWPIHNHATTSSGQASEHAFMVNLRHEIENQMWEDRQN 294
>gi|353731061|ref|NP_001090585.2| uncharacterized protein LOC100036828 [Xenopus laevis]
Length = 393
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/285 (22%), Positives = 118/285 (41%), Gaps = 20/285 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P F+ FR+S + + +L + P L T +S +E++ L ++ G + +
Sbjct: 59 PKKFQRFFRMSISSFDELLTVLKPGLCRAHTLMMDPISPEERLCLTLRFLATGQSFSSLY 118
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDG 120
+++I + + E +I + ++ P+EN I F + FPN G + G
Sbjct: 119 FRFPIGRTTIGKIVRETCLLIWSELQRLVMPTPDENAWIHIAEDFYKTTNFPNCLGAMGG 178
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ I P + + + ++ ++A+ NY F ++ GS DA +S +
Sbjct: 179 KRIQIKMPFKSGSEKYS-----SVVLLAVVDANYCFSIIDVGAYGSTGDASAFWSSAMGH 233
Query: 181 KFDEEGWR------------PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
+ E P V +GD A+ L E ++ P + + F
Sbjct: 234 QLSEGALHLPLPKPLPGTAAPSMPYVFVGDEAFGLTENIMRPYPGSQMDIQKRLFNYRLL 293
Query: 229 RTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
R RR++E +FGI K+ N ++L P F +++ A LHN
Sbjct: 294 RARRMVECAFGIFSNKWRIFHNAIQLEPDFVDIIIKACCVLHNFV 338
>gi|449483945|ref|XP_004156740.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 215
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 83/196 (42%), Gaps = 17/196 (8%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + ++ ++ R G+ A NV+ +C F V A W GS D+ +
Sbjct: 3 NCLGALDGTYIKVNVSQTDRPRYRTRKGEVATNVLGVCDTKGDFVFVLAGWEGSAADSCI 62
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILG---DSAYPLKEWLIPPRTRNPDAAADQRFLRCHKR 229
+R+++ W P L Y L+EW RN AAA + F H
Sbjct: 63 LRDAIA-----RPNWLHVPQEGFLAPYRGQRYHLQEW---QGARNAPAAAKEYFNMKHFA 114
Query: 230 TRRLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
R +IE +F +LK+++ L Q +L LHN+ I+R E NV+ D
Sbjct: 115 ARNVIERAFDLLKDRWTILRGKSYYLVQIQCRTILVCCLLHNL---INR--EMTNVDFVD 169
Query: 289 DEAEDDGEEEAVDDPD 304
D E D + D
Sbjct: 170 DVDEGDSTYATIGGDD 185
>gi|322785023|gb|EFZ11784.1| hypothetical protein SINV_07926 [Solenopsis invicta]
Length = 168
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 67/141 (47%), Gaps = 7/141 (4%)
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNP 215
F A GSVHD RV R S V + P D+ ++GD+AY L E L+ P N
Sbjct: 1 FIHCYAGHVGSVHDQRVFRQSEVQHYLGDVTKFP-QDSHLVGDAAYKLHENLLVPYRDNG 59
Query: 216 DAAADQR-FLRCHKRTRRLIENSFGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIA 272
QR + CH R +IE +FG+LKE+F CL M +L+ LHNI
Sbjct: 60 HLTERQRNYNFCHSSARIVIERAFGLLKERFRCLLTTLAMDRTDLIPMHILSCCVLHNIC 119
Query: 273 LTIDRDNENQNVEIHDDEAED 293
L R +E N E +D E D
Sbjct: 120 LM--RGDE-LNFETNDVELTD 137
>gi|357119276|ref|XP_003561369.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 433
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 128/310 (41%), Gaps = 33/310 (10%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S + +I + LT+ + LS +++V + + G
Sbjct: 113 AFRYFFRTSRSTFHYICSIVRDDLTSRPPSGLIRIEGRLLSMEKQVAIAMRRLASGDSRV 172
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKWP-EENLNDIPMTFLRKGGFPNV 114
+GA G + ++ +V ++++ +R + WP +E ++DI F G PN
Sbjct: 173 SVGAAFGVGQPTVS----QVTWRLIESMEERARHHLAWPDQERMDDIKANFEVVSGLPNC 228
Query: 115 AGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G +D T ++I P E + + ++++ + I +F + WPG + ++++
Sbjct: 229 CGAIDATHIVITLPAVESSRNWRDHAWNYSMILQGIFDHEMRFIDICTGWPGGMTVSQLL 288
Query: 174 RNSMVYRKFDEEGWRPFPDAV----------ILGDSAYPLKEWLIPPRTRNPDAAADQR- 222
S Y + P V I+GD YPL WL+ P + AD
Sbjct: 289 EFSRFYELCEAGECLDGPAQVSREGAQIREYIVGDMCYPLLSWLMTP-YEGERSLADPMV 347
Query: 223 -FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTIDR-D 278
F K R L + LK + LN + P + ++L LHNI IDR D
Sbjct: 348 DFNARQKAARMLGTRALARLKGSWRILNKVMWRPDKHKLPSIILVCCLLHNI--MIDRQD 405
Query: 279 NENQNVEIHD 288
Q++E+ D
Sbjct: 406 QLLQSLELRD 415
>gi|37991863|gb|AAR06309.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 767
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 86/217 (39%), Gaps = 29/217 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V P+ Q + G NVM +C + +F V A WPGSVHD
Sbjct: 540 FKDCIGAIDGTHVPCVVPSNKLVQHLCCKGMTTQNVMVVCDFDMRFTFVLAGWPGSVHDM 599
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA----VILGDSAYPLKEWLIPP-----------RTRNP 215
RV ++ FP + L DS YP + + P R
Sbjct: 600 RVFDDAHTTYT------HVFPHSPTGKYYLVDSGYPNRPGYLAPYKGTKYHLQEYRDAPE 653
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
++ F H R +IE SFG+LK K+ L + P+ +++A LHN
Sbjct: 654 PQGKEENFNYAHSSLRNVIERSFGVLKMKWRMLEKIPSYDPRKQTQIIIACCALHNFIRK 713
Query: 275 -------IDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
DR + ++N A+ EE DD D
Sbjct: 714 SGIRDKHFDRCDRDENYVPPQASADQPKTEEVFDDCD 750
>gi|449663409|ref|XP_002161554.2| PREDICTED: uncharacterized protein LOC100214881 [Hydra
magnipapillata]
Length = 686
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 54/94 (57%), Gaps = 5/94 (5%)
Query: 114 VAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
V G +D T + I P++H+ +V+R+ +H++NV A+C KF V A WPGS H+ARV+
Sbjct: 7 VTGVIDCTHICIQKPHKHKYAYVDRSSNHSINVQAVCDNKGKFIDVVAKWPGSTHNARVL 66
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWL 207
R + E+ + P P + S P+ +W+
Sbjct: 67 RVKTC-GQLTEDIFVPTP----VTHSMIPINKWI 95
>gi|357143017|ref|XP_003572773.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 94/221 (42%), Gaps = 28/221 (12%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G VDGT +L+ P + ++ + NR + N M C + KF
Sbjct: 147 PHSILRKPKFYPFFKNCIGAVDGTHILMKLPLDQQEPYKNRKQTISQNAMVACDFDLKFV 206
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR-TR-- 213
+N W GS DARV+++++ G+ L D Y ++L P R TR
Sbjct: 207 HINPGWEGSASDARVLQDAL------NHGFEVLDGKFYLVDVGYANTPQFLAPYRGTRYH 260
Query: 214 -NPDAAADQR-------FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLA 264
N A Q+ F H + R IE GI K +FP L + P+ + + +A
Sbjct: 261 LNEQGRARQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFPILKAVSHFPKEKQIDISVA 320
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE 305
LHN N ++ D D ++ VD P+E
Sbjct: 321 CAVLHNFIRI-----HNGDMTWPSDATMDIDPDQIVDVPNE 356
>gi|331229811|ref|XP_003327571.1| hypothetical protein PGTG_09105 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 446
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 110/256 (42%), Gaps = 23/256 (8%)
Query: 7 KEAFRLSATQVE----FILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGA 62
KE F QV F + P L P + + L E++ S + VG +
Sbjct: 113 KEGFIWLINQVSHHTVFHGQSFRPQLPIPHQMALTL---ERLGSNGNGASVGRFARNLSV 169
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTM 122
HG ++ ++ V+ I ++ ++ V WP+ ++ G G VDGT
Sbjct: 170 GHG----TVIKASRRVIQAI-NSLSEKYVVWPDSTRRAKISQVMKNEGLSGCIGFVDGTT 224
Query: 123 V-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ L P + + +R +++N +C + A WPGS D+ V +NS + +
Sbjct: 225 IPLHQRPGLDGEVYFDRKKRYSINAQIVCDCDKFITAFTTGWPGSCGDSWVFQNSKIESE 284
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIP----PRTRNPDAAADQRFLRCHKRTRRLIENS 237
D+ R ++ DSAY L IP P ++ PD + F C ++R E++
Sbjct: 285 PDKYFDR---GQYLIADSAYGLSLTCIPAYKSPASKKPD---NTDFNYCIAKSRVRNEHT 338
Query: 238 FGILKEKFPCLNYMRL 253
GILK ++ L+ +RL
Sbjct: 339 IGILKGRWASLHELRL 354
>gi|331239185|ref|XP_003332246.1| hypothetical protein PGTG_14542 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 447
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 99/214 (46%), Gaps = 37/214 (17%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 120 LVTWPTESEQIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 179
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 180 ICDINKKFISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDRKQF-------LLADSAYTND 232
Query: 205 EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV--V 262
+++P A ++ L KR R F L+E +R + + G V +
Sbjct: 233 CYVVP-------AFKGKQLL---KRRR------FASLRE---IRTQIRNAEEMKGAVKWI 273
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEAEDDGE 296
+ LHN+ D ++Q ++++DEA +D E
Sbjct: 274 VTCIVLHNLLA----DLKDQWNDLYEDEAPEDSE 303
>gi|108708757|gb|ABF96552.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 798
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 86/217 (39%), Gaps = 29/217 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V P+ Q + G NVM +C + +F V A WPGSVHD
Sbjct: 571 FKDCIGAIDGTHVPCVVPSNKLVQHLCCKGMTTQNVMVVCDFDMRFTFVLAGWPGSVHDM 630
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA----VILGDSAYPLKEWLIPP-----------RTRNP 215
RV ++ FP + L DS YP + + P R
Sbjct: 631 RVFDDAHTTYT------HVFPHSPTGKYYLVDSGYPNRPGYLAPYKGTKYHLQEYRDAPE 684
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
++ F H R +IE SFG+LK K+ L + P+ +++A LHN
Sbjct: 685 PQGKEENFNYAHSSLRNVIERSFGVLKMKWRMLEKIPSYDPRKQTQIIIACCALHNFIRK 744
Query: 275 -------IDRDNENQNVEIHDDEAEDDGEEEAVDDPD 304
DR + ++N A+ EE DD D
Sbjct: 745 SGIRDKHFDRCDRDENYVPPQASADQPKTEEVFDDCD 781
>gi|357129640|ref|XP_003566469.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 486
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 90/216 (41%), Gaps = 20/216 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S + + E ++ A+ T A+ ++V CL + G P +
Sbjct: 161 FRAAFRMSRATFDLLCESLSGAVAKEDTALRAAIPVHQRVAVCLWRLATGEPLREVSRRF 220
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
G S+ + +V + V+WP E+ F G P V G V V
Sbjct: 221 GLGISTCHNIVLQVCAALTAVLLPTAVRWPSED----NAGFQAVSGIPGVVGAVCTGHVP 276
Query: 125 IDAPNEHEDQFVNR-------NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
I P E+ +++NR +++ + A+ + F V PGS+ DA V+ S
Sbjct: 277 IGPPKENPGEYLNRRLTEKNKKASYSVALQAVVDADGAFTDVCIGLPGSLSDAAVLERSA 336
Query: 178 VYRKFDE---EGWRPFPDAVILGDSAYPLKEWLIPP 210
+ + + G R ++G ++ PL +W++ P
Sbjct: 337 LRARCEAGLLGGGR-----CLVGGASLPLADWMLVP 367
>gi|397602758|gb|EJK58257.1| hypothetical protein THAOC_21635 [Thalassiosira oceanica]
Length = 442
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 115/280 (41%), Gaps = 26/280 (9%)
Query: 19 FILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEV 78
FI EI ALT + LS L G + Y I H + IC +
Sbjct: 152 FISGEIKLALTL----RLLAGGSYLDLSLLFECGSSTAYEIF---HKVIREWICSKDKPL 204
Query: 79 VPVIVDTFFQRMVKWPEENLNDIPMTFLRK--GGFPNVAGCVDGTMVLIDAPNEHEDQFV 136
V + F EE + + + F R G F G +DG +V I P++
Sbjct: 205 VNINGKDFID-----DEERMAAVALEFARSSAGLFSGCIGAIDGWVVKIKKPSQ------ 253
Query: 137 NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-I 195
G + LNV+ IC + N G+ HD+ +NS +YRK ++ R
Sbjct: 254 ---GFYGLNVVVICDRKKRILYRVINSRGAEHDSTAFKNSSLYRKLMDDCDRLLEKGFHF 310
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLS 254
+GDSAY ++ +L+ P + F H +R IE +FG + ++ L + ++ S
Sbjct: 311 IGDSAYAIRSFLLTPFDNAVHGTPEDNFNFFHSSSRICIECTFGEVDLRWGILWSPLKFS 370
Query: 255 PQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD 294
+ V+ A LHN + R++ +NV + + DD
Sbjct: 371 LRNNIKVIDACLMLHNF-IVDHRESLGENVSSMERDVFDD 409
>gi|403158120|ref|XP_003307458.2| hypothetical protein PGTG_00408 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163687|gb|EFP74452.2| hypothetical protein PGTG_00408 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 438
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 103/260 (39%), Gaps = 22/260 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F++ R+S T +I ++ + LS +++ L +G + G
Sbjct: 113 FRQTTRMSKTSFLWIFNQVKDNPIFRKDGGPQLSIAQQLALTLELLGCSGQ----DGSAG 168
Query: 66 PSKSSICRSIHEVVPVI------VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVD 119
P + S EV+ VI ++ + V WP N + +++ GF G VD
Sbjct: 169 PLARRLGLSRREVIKVICAPMQAINALAPKYVTWPSPNRRQLISHDMKEEGFDGCVGSVD 228
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
+ + P E ++ G +NV IC N + A A PG + S ++
Sbjct: 229 DLTIPLSYPLGEESFQNSQTGRCTMNVQIICDSNKRITAFFAGTPGEWRGSLSYEKSSLH 288
Query: 180 RK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPR-TRNPDAAADQRFLRCHKRTRRLI 234
+ FD P ++ DSAY ++PP + P A D+ F C ++
Sbjct: 289 KNPKKFFD-------PGQYLIADSAYRPSSTIVPPSNSPGPKAKTDEEFDFCLAKSHVRS 341
Query: 235 ENSFGILKEKFPCLNYMRLS 254
+ + G LK ++ LN +R S
Sbjct: 342 QKTIGALKARWASLNEIRFS 361
>gi|154281875|ref|XP_001541750.1| hypothetical protein HCAG_03848 [Ajellomyces capsulatus NAm1]
gi|150411929|gb|EDN07317.1| hypothetical protein HCAG_03848 [Ajellomyces capsulatus NAm1]
Length = 414
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 119/308 (38%), Gaps = 44/308 (14%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
A+ ++KVL L+ M G Y S +I + HEV+ + ++ VK PE
Sbjct: 102 AIKIEQKVLIFLYIMTQGVSYRNTAEIFHHSLDTISKVFHEVLESLC-VLYEHYVKMPES 160
Query: 97 NLNDIPMTFLRKG------GFPNVAGCV---DGTMVLIDAPNEHEDQFVNRNGDHALNVM 147
PM + +K +P GC+ DGT + I P + + NR G + NV+
Sbjct: 161 -----PMEYSKKALGSNPRYWPFFKGCIGALDGTHLPISIPLNQQSVWRNRKGWISQNVL 215
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP----- 202
A + F + GS HD+RV+ + +F D L D+ Y
Sbjct: 216 AAYDHDMNFVYIYPGMEGSSHDSRVLSQAQERDQFGT----TLSDCYFLADAGYSQKSSL 271
Query: 203 -----------LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFP----C 247
LKEW P A++ LR H R +IE FG+ K +F C
Sbjct: 272 MLVPYSGVRYHLKEW--EKANSQPVNASELYNLR-HASLRGVIECVFGVFKRRFQIYDRC 328
Query: 248 LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEID 307
+ ++ Q +V A +HN +E + + D E+ E D + D
Sbjct: 329 RDGFSITTQID--LVFALAAVHNFMNHYKEMDELTQYDENSDIEEERRETVPADVQENGD 386
Query: 308 EEEQPRQD 315
E + R D
Sbjct: 387 EIMKKRDD 394
>gi|328696603|ref|XP_003240077.1| PREDICTED: hypothetical protein LOC100569927 [Acyrthosiphon pisum]
Length = 448
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 72/155 (46%), Gaps = 19/155 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F AFRLS V I++ + +L +R LS + +VL CLH+ GS +G
Sbjct: 301 FMGAFRLSQQLVINIIDVLRTSL--QHKRTSGLSPELQVLVCLHFYAQGSYQKGLGGNSI 358
Query: 65 -GPSKSSICRSIHEVVPVIVDTFFQRMVKWP---------EENLNDIPMTFLRKGGFPNV 114
S+ S+ R IH V I ++ +K+P EN P F
Sbjct: 359 LNISQPSVSRCIHYVTEAINQRLLRKWIKFPMTSAERQRARENFAHAPQPF------EGT 412
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
G +D T + I AP +HE+ +VN +G H+LNV A+
Sbjct: 413 IGAIDCTYIHILAPRQHEEAYVNHHGKHSLNVQAV 447
>gi|331243398|ref|XP_003334342.1| hypothetical protein PGTG_16211 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309313332|gb|EFP89923.1| hypothetical protein PGTG_16211 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 123/288 (42%), Gaps = 36/288 (12%)
Query: 6 FKEAFRLSATQVEFILEEIA-----------PALTTPTERNVALSAKEKVLSCLHWMGVG 54
FK+A R + + ++L +IA P L P + +AL+ L L G G
Sbjct: 105 FKQAVRTTKSGFTWLLSQIAFNPIFHSGSFRPQLPVPHQ--LALT-----LERLGSNGNG 157
Query: 55 SPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+ I + ++ ++ V+ I D ++WP+++ ++ GF
Sbjct: 158 ASVGRISRNLCVGRGTVIKASRRVIRAIND-MGPTYLRWPDKDRRKEISEVMKAEGFEGC 216
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + L P+ + F +R +++N IC + WPGS ++ V
Sbjct: 217 VGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITGYMTGWPGSCGNSMVF 276
Query: 174 RNSMVYRKFDEEGWRPF--PDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKR 229
+ V+++ D+ F P ++ DSAY L IP + P A + F C R
Sbjct: 277 KRMKVHKEPDD-----FFDPGQYLIADSAYELGVHCIPA-YKAPAAYIRENTEFNYCLAR 330
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQFAGMV------VLASTTLHNI 271
++ E++ GILK ++ L +RL+ Q + V TLHN+
Sbjct: 331 SQVRNEHTIGILKGRWASLQQLRLAIQKPSDMMEIIRWVNCCVTLHNM 378
>gi|449688772|ref|XP_004211844.1| PREDICTED: uncharacterized protein LOC101238960, partial [Hydra
magnipapillata]
Length = 269
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 15/253 (5%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + E +L+ +A +T T+ + + A+E++ + ++ G + I A + S ++
Sbjct: 1 MSPSTFEKLLQIVALDITKANTKMRIPICAQERLAITVRYLATGDAHTTIAANYRMSPTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ R ++E I + VK P E I F + F + G +DG V + AP
Sbjct: 61 VGRIVYETCNAIWNNLLGEYVKAPNSETEWEKIAKEFETRWHFSHCVGAIDGKHVQMFAP 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR 188
+ N H++ +MA+ Y+F VN G D V NS + + +
Sbjct: 121 ARSGSSYFNYKKTHSIVLMAVSDAKYRFILVNIGDSGRQSDGSVYNNSQLGFAIENNLLK 180
Query: 189 PFPDAVI------------LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
D+ I + D A+ LK ++ P F R RR++EN
Sbjct: 181 IPKDSKISNNSDKIMPYAFVADDAFGLKRHMMKPYAFKNLLTDKLIFNYRLSRARRVVEN 240
Query: 237 SFGILKEKFPCLN 249
+ I +F L+
Sbjct: 241 ACDIASSRFRVLH 253
>gi|242078913|ref|XP_002444225.1| hypothetical protein SORBIDRAFT_07g015460 [Sorghum bicolor]
gi|241940575|gb|EES13720.1| hypothetical protein SORBIDRAFT_07g015460 [Sorghum bicolor]
Length = 694
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 101/234 (43%), Gaps = 29/234 (12%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT----FLRKGGFPNVAGCVDGTM 122
S ++ R I+EV +I QR ++ P + + +T F+ F N G +DGT
Sbjct: 122 SGETVHRKINEVFDII-PALIQRFIRLPNPSNTHVKITCDPRFMP--FFQNCIGAIDGTH 178
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I + + NR G + NVM C + KF +++ W GS DA V+R+++
Sbjct: 179 VPITIGQDKASPYRNRKGTLSQNVMFACDFDLKFTLISSGWEGSASDAGVLRSAL----- 233
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRCHKR 229
+G+ L D Y + P R + +A A+ + L H+
Sbjct: 234 -GKGFTVPDGKFYLVDGGYANTPSFLAPYRGVKYHLSEFRRRGQRGNAYANYKELFNHRH 292
Query: 230 T--RRLIENSFGILKEKFPCLNYMRLSPQFAGMVV-LASTTLHNIALTIDRDNE 280
R IE +FG+LK++FP L P + + + A+ HNI ++ E
Sbjct: 293 AILRNHIERAFGVLKKRFPILKVGTHYPIESQVKIPAAAAVFHNIIRGLNGSEE 346
>gi|226492180|ref|NP_001149744.1| retrotransposon protein [Zea mays]
gi|195630667|gb|ACG36640.1| retrotransposon protein [Zea mays]
Length = 401
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 112/281 (39%), Gaps = 23/281 (8%)
Query: 33 ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVK 92
E + + +E+V L+ +G ++G +G S ++ R ++V+ I + + +
Sbjct: 91 EDTIHMCVEEQVAMFLNTVGHNLRNRLVGTNYGRSGETVSRYFNKVLRAIGELRAELIRP 150
Query: 93 WPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
E + I F + G +DGT + + E F R + NVMA
Sbjct: 151 PSLETPSKIAGNPRWDPYFKDCIGAIDGTHIRTLVSIDMEPSFRGRKSYASQNVMAAVDF 210
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-- 210
+ +F V A W GS HDA V+R+++ E G R L D+ Y K +PP
Sbjct: 211 DLRFTYVLAGWEGSAHDALVLRDALE----RENGLRVPQGKFYLVDAGYGAKPGFLPPFR 266
Query: 211 --------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFA 258
NP + F H R +E +FG LK +F L+ + S Q
Sbjct: 267 AVRCHLNEWGNNPVQNEKELFNLRHSSLRVTVERAFGSLKRRFKILDDATPFFPFSTQVD 326
Query: 259 GMVVLASTTLHNIALTIDRDN---ENQNVEIHDDEAEDDGE 296
+V+A +HN + D E N IH+ G+
Sbjct: 327 --IVVACCIIHNWVIQDGGDELIIEESNWPIHNHATTSSGQ 365
>gi|357155711|ref|XP_003577212.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 399
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/255 (21%), Positives = 112/255 (43%), Gaps = 18/255 (7%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+S +++V L + G +G + G + S++ V V+ ++WP+ +
Sbjct: 97 MSLEDRVAVALRRLNSGGSLVTVGGSVGVNHSTVSLITWRFVEA-VEARAGHHLRWPDSD 155
Query: 98 -LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFV--NRNGDHALNVMAICGPNY 154
+ I F G PN G VD T +++ + + V + ++++ + A+ P+
Sbjct: 156 EMAMIKSKFEEIHGLPNCCGVVDTTHIVMCLSSAEPNCKVWQDHQKNYSMVLQAVIDPDM 215
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------ILGDSAYPL 203
+F + WPGS+ ++ ++ +S ++ K E G+R D + +LGD+ YPL
Sbjct: 216 RFTDIVTGWPGSMKESGILHSSGLF-KLCENGYRLNGDKLTVSDGSEIGEYLLGDAGYPL 274
Query: 204 KEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--V 261
WL+ P N + + F + H R + + K+ + L+ P + +
Sbjct: 275 IPWLLTPYQENDLSDSKAEFNKRHLAARTVAMKALAKFKDTWKFLHGEMWRPDKHKLPRI 334
Query: 262 VLASTTLHNIALTID 276
+ LHNI + ++
Sbjct: 335 IHVCCLLHNIVIDME 349
>gi|449693237|ref|XP_004213365.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 287
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 16/247 (6%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
S++S R IH+ + + +E + + + G G +DGTMV I
Sbjct: 6 SQASTNRIIHQFCESFMQHYITLSWYSSQEEITKVKQCYFEAFGVKGRLGLIDGTMVPIK 65
Query: 127 APN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
E F+ R G A+N + + +F V GS HDA V NS + + + +
Sbjct: 66 GVTVADEPAFICRKGYSAINCQFVVDYDGQFRDVVIKCSGSCHDAFVYSNSTLKQTMEAD 125
Query: 186 GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFL-RCHKRTRRLIENSFGILKEK 244
F ++GDS Y L +I P + P ++ F + H R R IE LK +
Sbjct: 126 PVAGF----LIGDSGYGLSLVMITPFS--PAITPEEMFFNKTHSRVRSRIERCISSLKNR 179
Query: 245 FPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
+ CL+ ++ SP ++ L N+ ++ + E NV DD G E +
Sbjct: 180 WRCLHKSGRSLQYSPTKCCSIIYTCVLLENMCNSLGLE-EPGNVVASDDAV---GNTEMI 235
Query: 301 DDPDEID 307
D + +
Sbjct: 236 TDNNSTE 242
>gi|390339191|ref|XP_003724948.1| PREDICTED: putative nuclease HARBI1-like isoform 1
[Strongylocentrotus purpuratus]
gi|390339193|ref|XP_795639.3| PREDICTED: putative nuclease HARBI1-like isoform 2
[Strongylocentrotus purpuratus]
Length = 413
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 112/282 (39%), Gaps = 25/282 (8%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHW-MGVGSPYHIIGATHGP 66
E FR++ ++L+E+ P ++ + ++ L+ W + Y +
Sbjct: 87 ENFRMTKATFRYLLKELKPHISKKDTNFRKATPPDQRLAITLWRLAANISYSTLADLFEV 146
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
SS+C V I F +R+ E L + F KG FP + + +
Sbjct: 147 GISSVCLITLTVTAAINSVFRKRLTLPTGERLQECIQRFEDKG-FPQAVAAIGCCHIPVK 205
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P + E ++N G ++ + A+ +F V+ PGS+ +A V RNS + R ++
Sbjct: 206 TPRKQE--YINIKGFSSMILQAVVDSFGQFIDVSVGCPGSMDEAAVFRNSEMSRMLKDDS 263
Query: 187 WRPFPDAVI---------LGDSAYPLKEWLIPPRTRNPDAAADQ-----RFLRCHKRTRR 232
P VI LG AYPL L+ P + N Q R CH R
Sbjct: 264 LFNLPPKVIEATSIPVLVLGGEAYPLMPTLMKPYSVNGTLTGAQKVFNRRLSSCHSTVNR 323
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIA 272
E+ LK ++ L+ M+ Q + +VLA L N+
Sbjct: 324 AFEH----LKGRWNILHNMQ-DNQVTKVPSIVLACCILQNLV 360
>gi|242043358|ref|XP_002459550.1| hypothetical protein SORBIDRAFT_02g006490 [Sorghum bicolor]
gi|241922927|gb|EER96071.1| hypothetical protein SORBIDRAFT_02g006490 [Sorghum bicolor]
Length = 396
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 101/234 (43%), Gaps = 29/234 (12%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT----FLRKGGFPNVAGCVDGTM 122
S ++ R I+EV +I QR ++ P + + +T F+ F N G +DGT
Sbjct: 122 SGETVHRKINEVFDII-PALIQRFIRLPNPSNTHVKITCDPRFMPF--FQNCIGAIDGTH 178
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
V I + + NR G + NVM C + KF +++ W GS DA V+R+++
Sbjct: 179 VPITIGQDKASPYRNRKGTLSQNVMFACDFDLKFTFISSGWEGSTSDAGVLRSAL----- 233
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRCHKR 229
+G+ L D Y + P R + +A A+ + L H+
Sbjct: 234 -GKGFTVPDGKFYLVDGGYANTPSFLAPYRGVKYHLSEFRRRGQRGNAYANYKELFNHRH 292
Query: 230 T--RRLIENSFGILKEKFPCLNYMRLSPQFAGMVV-LASTTLHNIALTIDRDNE 280
R IE +FG+LK++FP L P + + + A+ HNI ++ E
Sbjct: 293 AILRNHIERAFGVLKKRFPILKVGTHYPIESQVKIPAAAAVFHNIIRGLNGSEE 346
>gi|359477891|ref|XP_002265982.2| PREDICTED: uncharacterized protein LOC100253062 [Vitis vinifera]
Length = 463
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 25/211 (11%)
Query: 110 GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
GF + G +DG+ + N+ ++ R NV+A C + KF V W GS D
Sbjct: 71 GFQDCIGAIDGSHFRVKVSNDVVQRYRGRKYYPTQNVLAACSFDLKFTYVLPGWEGSASD 130
Query: 170 ARVMRNSMVYRKFDE----EGWRPFPDA---VILG------DSAYPLKEWLIPPRTRNPD 216
+R++ N+++ R FD+ +G DA + +G ++ Y LKE+ + P+
Sbjct: 131 SRILDNALM-RDFDKLIVPQGKYYLADADFQLKIGFLTPYRNTRYHLKEYSV----HQPE 185
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIA 272
A + LR H R IE +FG+LK++FP + P + ++LA LHN
Sbjct: 186 NAREVFNLR-HSSLRNAIERAFGVLKKRFPIIA-SGTEPHYPVDTQSDIILACCILHNYL 243
Query: 273 LTIDRDNENQNVEIHDDEAEDDGEEEAVDDP 303
+ +D NE E+ + ++ E E++ P
Sbjct: 244 MGVD-PNERLIAEVDRELFSEEAEFESMAHP 273
>gi|326489887|dbj|BAJ94017.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/290 (21%), Positives = 118/290 (40%), Gaps = 23/290 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPYH 58
AF+ FR S ++I + L + + LS +++V + + G
Sbjct: 110 AFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 169
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGC 117
+G+ G +S++ + + + + +V WP ++ + +I G PN G
Sbjct: 170 SVGSAFGVGQSTVSQVTWRFIESMEERARHHLV-WPNQQRMEEIKARLEAVSGLPNCCGA 228
Query: 118 VDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D T +++ P E + + + ++++ + I +F + WPGS+ +R+++ S
Sbjct: 229 IDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMTFSRLLKCS 288
Query: 177 MVYRKFDEEGWR---PFPDA--------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLR 225
Y K E G R P + I+GD YPL WL+ P + F
Sbjct: 289 GFY-KLCEAGKRLNGPVRTSGEDSEIREFIVGDMCYPLLPWLMTPYQGESLSTPMVNFNT 347
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIAL 273
K R L + LK + L+ + P + ++L LHNI L
Sbjct: 348 QQKTARMLGTRALARLKGSWKILHKVMWRPDKNKLPSIILVCCLLHNIIL 397
>gi|357151821|ref|XP_003575915.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 393
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 122/281 (43%), Gaps = 32/281 (11%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+S +E + +G G Y +I +T+ + +I R +EV+ I+ + + +K P+
Sbjct: 101 VSVEEALTIFFLVVGHGMKYRLIRSTYRWTLETISRHFNEVLGAIL-SLSREFIKLPDPA 159
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
++P + K FP+ G +DGT V + + ++ N NV+ +C + KF
Sbjct: 160 -TELPQDNIWKW-FPDGLGTLDGTHVRVRVAARKQGRYRNMKHQITTNVLGVCDRSMKFV 217
Query: 158 AVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRPFPDAVILGDSA-YPLKEW 206
V A + GS D+RV+R++M+ + + G+ P + S Y LKEW
Sbjct: 218 YVLAGYEGSASDSRVLRDAMLRQDAFKVPSGKYYLVDAGYTNVPGFLAPYRSVRYHLKEW 277
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLAST 266
NP + LR H R ++E F +K ++ + V+ A
Sbjct: 278 --AANGNNPQTPRELYNLR-HASARNVVERIFAFIKMRWAFIR-----------VINACC 323
Query: 267 TLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEID 307
LHN R+ + ++ D+E + A++D DE D
Sbjct: 324 ILHNFLAGRQREMDYIMLQQVDNEINN----AAIEDQDEHD 360
>gi|301106486|ref|XP_002902326.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262098946|gb|EEY56998.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 364
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 80/167 (47%), Gaps = 10/167 (5%)
Query: 93 WP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ-FVNRNGDHALNVMAIC 150
WP EE I + +K GF N G DGT+ + A +H+ + + +R ++++ + C
Sbjct: 177 WPNEEERVMISLRMQQKYGFVNCVGITDGTLFPLAAKPQHQGEDYFSRKASYSVHGLVTC 236
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNS-MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP 209
+ ++ WPGS HD R NS ++ ++ D + +LGDSA+ +IP
Sbjct: 237 DDVGRVRSLVVGWPGSTHDNRAWMNSPLILKRADHFKHNEY----VLGDSAFQASSVMIP 292
Query: 210 PRTRNPDAAADQRFLRCHK---RTRRLIENSFGILKEKFPCLNYMRL 253
P A + R +K + R E+ G+LK +FP L +R+
Sbjct: 293 AFKNPPKAQMNPRHTYFNKQLAKARIKSEHCIGLLKMRFPYLREIRV 339
>gi|449692140|ref|XP_004212915.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 319
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 84/197 (42%), Gaps = 15/197 (7%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
++P P R L+ +K+ ++++ + G + ++ +++ V I
Sbjct: 100 VSPQSNCPNYR--FLTTHKKLAITIYYLKDTGSLWMTANVFGIHQCTVSKTVKVVCDAIN 157
Query: 84 DTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGD 141
+ + P+ E++ + F K G GC+D T V I P ++ +
Sbjct: 158 NIVGPIYLHLPKNKEDMTKLASQFEVKFGMVQAFGCIDSTHVQIKRPIKNSQDYFCYKQY 217
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRPFP 191
+LNV A+C F V WPGSVHDA++ NS + +K + + P
Sbjct: 218 FSLNVQAVCDCKGYFIDVECKWPGSVHDAKMFTNSTINKKLIKGTLPQTLYSLPNYHSIP 277
Query: 192 DAVILGDSAYPLKEWLI 208
+ +I GD AYPL + I
Sbjct: 278 NYLI-GDPAYPLTNFCI 293
>gi|359494637|ref|XP_002263526.2| PREDICTED: uncharacterized protein LOC100243362 [Vitis vinifera]
Length = 447
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 98/224 (43%), Gaps = 29/224 (12%)
Query: 99 NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYA 158
N +PM + G +DG+ + N+ ++ R NV+A C + KF
Sbjct: 44 NQVPMDDGSMSSSQDCIGAIDGSHFRVKVSNDVVQRYRGRKYYPTQNVLAACSFDLKFTY 103
Query: 159 VNANWPGSVHDARVMRNSMVYRKFDE----EGWRPFPDAVILGDSAYPLKEWLIPP--RT 212
V W GS D+R++ N++V R FD+ +G L D+ + LK + P T
Sbjct: 104 VLPGWEGSASDSRILDNALV-RDFDKLIVPQG------KYYLADAGFQLKTGFLTPYRST 156
Query: 213 R---------NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AG 259
R P+ A + LR H R IE +FG+LK++FP + P +
Sbjct: 157 RYHLKEYSVHQPENAREVFNLR-HSSLRNAIERAFGVLKKRFPIIA-SGTEPHYPVDTQS 214
Query: 260 MVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDP 303
++LA LHN + +D D E E+ + ++ E E++ P
Sbjct: 215 DIILACCILHNYLMGVDPD-ERLIAEVDKELFSEEAEFESMAHP 257
>gi|218199708|gb|EEC82135.1| hypothetical protein OsI_26179 [Oryza sativa Indica Group]
Length = 325
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 92/214 (42%), Gaps = 27/214 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + P +D+F R + NV+A + +F V A W GS HD+
Sbjct: 99 FQGCIGALDGTHIPACVPMHMQDRFRGRKKFPSQNVLAAVDFDLRFLYVLAGWEGSAHDS 158
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
V+++++ G + L D+ Y + ++PP TR P+
Sbjct: 159 YVLQDALS----RTNGLKIPEGKYFLADAGYAARPGVLPPYRGTRYHLKEYKGTREPENP 214
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEK---FPCLNYMRLSPQFAGMVVLASTTLHNIALTI 275
+ LR H R IE +FG LK + F C + L Q +V+A LHN +
Sbjct: 215 KELFNLR-HSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQVK--IVMACCALHNWII-- 269
Query: 276 DRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEE 309
+ NV ++DD A + +V + ++ +E
Sbjct: 270 ---EDGPNVYVYDDAAWYEALPRSVRNHTDMHQE 300
>gi|390331933|ref|XP_003723382.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 200
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 32/173 (18%)
Query: 104 TFLRKGGFPNVAG---CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+R G+ +V G V T+ ++ A + E V R A+C V
Sbjct: 14 VIIRIDGWASVIGFRETVGNTLKMVKADDNDEIDDVVRQ--------AVCTHEGLITDVV 65
Query: 161 ANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD 220
A WPG HD+++ R S + R IL ++++ K WL+ D+ +
Sbjct: 66 ARWPGGSHDSQIFRESWLKR--------------ILQEASHEAK-WLLG------DSGSQ 104
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIAL 273
QR+ R R R +IE +FG++K +FPCLN + L + ++A L N+++
Sbjct: 105 QRYNRAQIRGRNIIERTFGMMKRRFPCLNQLGLKLETTLTTIVAVVVLWNLSI 157
>gi|195114180|ref|XP_002001645.1| GI15789 [Drosophila mojavensis]
gi|193912220|gb|EDW11087.1| GI15789 [Drosophila mojavensis]
Length = 375
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 117/281 (41%), Gaps = 31/281 (11%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERN--VALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR+ + E +L+ + A+ +S EK+L L + + +G +
Sbjct: 60 FKEQFRMKRSSFELLLQAVGKAIAGAEHHQPIARVSLPEKLLYTLTLLSGNKSFREVGES 119
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRM---VKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
SKS S HE+ + F M VKWP++N I + L P V G +D
Sbjct: 120 FAISKS----SGHEIFKWVTSGFAALMPCYVKWPKDNACGIKCSQL-----PGVVGVIDE 170
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + P E + +L + A+C +F V+ + P + + ++++ + R
Sbjct: 171 CRIPLKLPVREEQGHMQYA---SLALQAVCDERSRFLDVHIDVPDN-QCSVLLKSELFER 226
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI------ 234
D E D ++G+ YPL L+ P N + CH R + I
Sbjct: 227 LIDMEEPLMPADKHLVGEMTYPLLLNLMTPYADN-----NGELTPCHIRYNQAIHLWNAP 281
Query: 235 -ENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTTLHNIAL 273
E +F L +F L + + + + A +VV A+ LHN L
Sbjct: 282 AERAFTALMSRFRRLKSLDVGTMEVASIVVAAACMLHNFIL 322
>gi|449680023|ref|XP_004209474.1| PREDICTED: uncharacterized protein LOC101241378 [Hydra
magnipapillata]
Length = 200
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 82/190 (43%), Gaps = 13/190 (6%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHII---GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
+S + V LH++ Y ++ + G KS+ +HE + + D + VK+P
Sbjct: 5 ISVVKCVAVALHYLASCEEYRVVRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFP 64
Query: 95 --EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
E LN F GFP G VDG + I AP + + N G +++ + A+
Sbjct: 65 LSVEYLNKHSRDFEAILGFPQCIGAVDGRHIPISAPKDQAISYYNYKGWYSIALFAVVDC 124
Query: 153 NYKFYAVNANWPGSVHDARVMRNSM--------VYRKFDEEGWRPFPDAVILGDSAYPLK 204
Y F + G +D+ +++NS ++ K +E ++GDSA+PL
Sbjct: 125 RYCFIYTSVGSSGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLCLIGDSAFPLT 184
Query: 205 EWLIPPRTRN 214
L+ P N
Sbjct: 185 RHLLKPYLEN 194
>gi|449689832|ref|XP_004212160.1| PREDICTED: uncharacterized protein LOC101241210, partial [Hydra
magnipapillata]
Length = 164
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 70/154 (45%), Gaps = 25/154 (16%)
Query: 112 PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
P G +DGT + I P E + N G H+++V AIC F V+ WPGS+HDA+
Sbjct: 2 PQALGFIDGTHIPIRRPLESSQDYFNYKGFHSISVQAICDSKGIFMDVDCKWPGSLHDAK 61
Query: 172 VMRNS------------MVYRKFDEEGWRPFPDAVILGDSAYP-----LKEWLIPPRTRN 214
V NS + YR+ G+ P VI D AYP LKE+L +N
Sbjct: 62 VFSNSTFNFKIISNSIPITYREL-LPGFCKVPCYVI-DDPAYPLSTFCLKEYL---HCQN 116
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
+ LR R IE +FG LK ++ L
Sbjct: 117 KNQVVFNTLLR---TARNQIECAFGRLKARWAIL 147
>gi|225463187|ref|XP_002267524.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 391
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 83/180 (46%), Gaps = 29/180 (16%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG + P + + +F NR G + NV+A C + +F V W GSV D+
Sbjct: 156 FKDCIGVIDGMHIPAHVPAKDQSRFRNRKGILSQNVLAACTFDLQFIFVYPGWEGSVADS 215
Query: 171 RVMRNSMVYRKFDEEGWRPFPD----AVILGDSAYPLKEWLIPP---------RTRNPDA 217
RV+R + ++ + FP+ L DS Y E I P R +
Sbjct: 216 RVLRAVL------DDPDQNFPNIPEGKYYLVDSGYSNIEGFIAPYQGVRYHLHEYRGANQ 269
Query: 218 ---AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHN 270
+A + F H R I+ SF +LK +FP L +L+PQ+A +V+A+ +HN
Sbjct: 270 LPRSAKELFNHRHAFLRNAIQRSFDVLKARFPIL---KLAPQYAFHIQRDIVIAACVIHN 326
>gi|403168385|ref|XP_003328028.2| hypothetical protein PGTG_09322 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167479|gb|EFP83609.2| hypothetical protein PGTG_09322 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 89/204 (43%), Gaps = 17/204 (8%)
Query: 61 GATHGPSKSSICRSIHEVVPV---IVDTFF---QRMVKWPEENLNDIPMTFLRKGGFPNV 114
GA+ G ++ VV V +++ F +R V WP+ + + GF
Sbjct: 157 GASVGRFSRNLSVGRGTVVKVSRRVIEAFISLGRRYVVWPDSARRAEISEVMSREGFRGC 216
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + + P + F +R +++N IC + + WPGS D+RV
Sbjct: 217 VGFVDGTTIPMFQRPGYDGEVFFDRKRCYSINAQIICDCDKYITSFITGWPGSCGDSRVY 276
Query: 174 RNSMVY---RKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAADQRFLRCHKR 229
+ ++ + EEG +L DSAY L +IP + + + +F C +
Sbjct: 277 KRMQLHLNPSNYFEEG------QYLLADSAYELSHTVIPAYKVPAANITINSQFNYCLAK 330
Query: 230 TRRLIENSFGILKEKFPCLNYMRL 253
R E++ G+LK ++ L MRL
Sbjct: 331 ARVRNEHTIGVLKSRWSSLREMRL 354
>gi|221106927|ref|XP_002156477.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 239
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 93/213 (43%), Gaps = 17/213 (7%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKELDQDWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
AP + F N G +++ ++AIC Y F V+ G +DA V+ S R F+ +G
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAVLNASTFGRAFN-KG 177
Query: 187 WRPFPDA---------VILGDSAYPLKEWLIPP 210
+ P V++GD + LK WL+ P
Sbjct: 178 YFNLPKISEFDPKVPPVLVGDDIFALKPWLMKP 210
>gi|331229801|ref|XP_003327566.1| hypothetical protein PGTG_09100 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 517
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 88/191 (46%), Gaps = 12/191 (6%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LID 126
+ ++ ++ V+ I ++ ++ V WP+ ++ G G VDGT + L
Sbjct: 242 RGTVIKASRRVIQAI-NSLSEKYVVWPDSTRRAEISQVMKNEGLSGCVGFVDGTTIPLHQ 300
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P + + +R +++N +C + A WPGS D+ V +NS + + D+
Sbjct: 301 RPGLDGEVYFDRKKRYSINAQIVCDCDKFITAFTTGWPGSCGDSWVFQNSKIESEPDKYF 360
Query: 187 WRPFPDAVILGDSAYPLKEWLIP----PRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
R ++ DSAY L IP P ++ PD + F C ++R E++ GILK
Sbjct: 361 DR---GQYLIADSAYGLSLTCIPAYKSPASKKPD---NTDFNYCIAKSRVRNEHTIGILK 414
Query: 243 EKFPCLNYMRL 253
++ L+ +RL
Sbjct: 415 GRWASLHELRL 425
>gi|356498377|ref|XP_003518029.1| PREDICTED: uncharacterized protein LOC100803756 [Glycine max]
Length = 409
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 96/233 (41%), Gaps = 27/233 (11%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR----KGGFPNVAGCVDGTM 122
S +I R H V+ + F + ++K + + D P L+ + F + G +DGT
Sbjct: 132 SGETISRHFHNVLEAVC-MFAKDIIKPIDPSFRDAPDEILKDARYRPYFRDCIGAIDGTH 190
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P+ + ++ R G NVM +C + F V A W GS HD ++ ++ RK
Sbjct: 191 IRVCVPSHLQGVYIGRKGYTTTNVMVVCDFSMCFTFVWAGWEGSAHDTKIFMEAL--RKP 248
Query: 183 DEEGWRPFPDAVILGDSAYP---------------LKEWLIPPRTRNPDAAADQRFLRCH 227
P L DS YP L ++ I PR R + F H
Sbjct: 249 ALHFSHPPQGKYYLVDSGYPTFMGFLGPYKKTRYHLPQFRIGPRIR----GRVEVFNYYH 304
Query: 228 KRTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTIDRDN 279
R IE +FG+ K ++ L M + + +++A +HN D+ +
Sbjct: 305 SSLRSTIERAFGLCKARWKILGNMSPFALKTQNQIIVACMAIHNFIQRNDKSD 357
>gi|331249539|ref|XP_003337386.1| hypothetical protein PGTG_18980 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 170
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 79/171 (46%), Gaps = 27/171 (15%)
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAI 149
+ WP E+ ++ GFP G VDGT + L P + + +R ++++V I
Sbjct: 11 ITWPTESEQIESSQVRQEDGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVI 70
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKE 205
C N F + A +PGS HDA V + V ++ FD + + +L DSAY
Sbjct: 71 CDINKNFISYLAAFPGSSHDAYVFLHMQVAQQPEKYFDRKQF-------LLADSAYTNDC 123
Query: 206 WLIPPRTRNPDAAADQRFLRCHK--------RTRRLIENSFGILKEKFPCL 248
+++P A ++ L+C ++R IE++ GILK +F L
Sbjct: 124 YVVP-------AFKGKQLLKCRNINFNYHLAQSRVRIEHAIGILKGRFASL 167
>gi|390342156|ref|XP_003725600.1| PREDICTED: uncharacterized protein LOC100893530, partial
[Strongylocentrotus purpuratus]
Length = 251
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 91/212 (42%), Gaps = 19/212 (8%)
Query: 75 IHEVVPVI-VDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEH 131
IH I +T FQ ++ P N ++ F + P+ G DG + + PN+
Sbjct: 41 IHTSTQTIKFNTRFQEVISLPTTNEGWLEVAGDFEARWNLPHCLGGYDGKHIRLQKPNKS 100
Query: 132 EDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFP 191
+ N ++ +MA+ Y++ ++ G DA++ NS + ++ E G P
Sbjct: 101 GSLYFNYKQFFSVVLMALVDSKYQYLWIDVGGVGHQSDAQIYNNSEL-KECIEAGTLGIP 159
Query: 192 D--------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
D +GD A+ ++ +++ P R + F R RR++EN+
Sbjct: 160 DPAPLPHDDEEHPMPYFFVGDYAFAMRTYMMKPYGRRNMDQQQKIFNYRLSRARRVVENA 219
Query: 238 FGILKEKFPC-LNYMRLSPQFAGMVVLASTTL 268
FGIL +F C L MR P +++ A+ L
Sbjct: 220 FGILALRFQCFLGQMRQEPDTIRLLIEAAVML 251
>gi|403176854|ref|XP_003335463.2| hypothetical protein PGTG_17316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172428|gb|EFP91044.2| hypothetical protein PGTG_17316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 86/195 (44%), Gaps = 8/195 (4%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV- 123
G + ++ ++ V+ I + + WP+ + ++ GF G VDGT +
Sbjct: 168 GVGRGTVIKASRRVIQAI-NHLSHTYLLWPDADRRKEISNVMKAEGFEGCIGFVDGTTIP 226
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
L P+ + F +R +++N IC + WPGS D+ V + V+R D
Sbjct: 227 LYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITGYMTGWPGSCGDSMVFKKMAVHR--D 284
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCHKRTRRLIENSFGIL 241
G+ P ++ DSAY L IP + P A + F C R+R E++ GIL
Sbjct: 285 PGGFFD-PGQYLIADSAYELGLHCIPA-YKAPAAYVLENTEFNYCLARSRVRNEHTIGIL 342
Query: 242 KEKFPCLNYMRLSPQ 256
K + L +RL+ Q
Sbjct: 343 KGCWASLQQLRLAIQ 357
>gi|403160739|ref|XP_003321192.2| hypothetical protein PGTG_02234 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170375|gb|EFP76773.2| hypothetical protein PGTG_02234 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 377
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 111/259 (42%), Gaps = 14/259 (5%)
Query: 6 FKEAFRLSATQVEFILEEIA--PALTTPTERNVALSAKEKVLSC--LHWMGVGSPYHIIG 61
FK+A R + ++L EI P + + R A + L+ L G G+
Sbjct: 40 FKQAVRTTKPGFIWLLNEIQLNPVFHSESFRPQLPIAHQLALTLERLGSNGNGASVGRFS 99
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGT 121
+ ++ ++ V+ I D R + WP+ N +++ GF G VDGT
Sbjct: 100 RNLSVGRGTVVKASRRVIRAIND-LSGRYLLWPDTNRRREISEVMKEEGFEGCVGFVDGT 158
Query: 122 MV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ L P+ + F + +++N IC N A WPGS D+ V + +++
Sbjct: 159 TIPLYQQPSIDGEVFFDCKKRYSINCQVICNCNRFITAYMTGWPGSCGDSMVFKRMLLH- 217
Query: 181 KFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDA--AADQRFLRCHKRTRRLIENS 237
+E + F D ++ DSAY L IP + P A + F C R+ E++
Sbjct: 218 ---QEPEKFFGDGQYLIADSAYELGVHCIPA-YKAPAAFILENTEFNYCLARSCVRNEHT 273
Query: 238 FGILKEKFPCLNYMRLSPQ 256
GILK ++ L +RL+ Q
Sbjct: 274 IGILKGRWASLQDLRLAIQ 292
>gi|2739387|gb|AAC14510.1| En/Spm-like transposon protein [Arabidopsis thaliana]
Length = 292
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 85/207 (41%), Gaps = 27/207 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT V + P++ ++ R + +NV+AIC + KF PG HD
Sbjct: 41 FLYCVGALDGTHVPVRPPSQTAKKYKGRKLEPTMNVLAICNFDMKFIYAYVGVPGRAHDT 100
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTR-----------NPDAAA 219
+V+ + P L DS YP + + P R P A
Sbjct: 101 KVLNYCATNEPYFSH---PPNGKYYLVDSGYPTRTGYLGPHRRMRYHLGQFGRGGPPVTA 157
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHNIALTI 275
+ F R H R +IE +FG+ K K+ ++ R P++ +V A+ LHN
Sbjct: 158 RELFNRKHSGLRSVIERTFGVWKAKWRIVD--RKHPKYGLAKWIKIVTATMALHNFI--- 212
Query: 276 DRDNENQNVEIHDDEAEDDGEEEAVDD 302
RD+ ++ HD EE VDD
Sbjct: 213 -RDSHRED---HDFLQWQSIEEYHVDD 235
>gi|356570115|ref|XP_003553236.1| PREDICTED: uncharacterized protein LOC100810207 [Glycine max]
Length = 372
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 80/165 (48%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
ILEE + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILEEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +DGT + + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDGTHIPVTVSPDEK 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|356527700|ref|XP_003532446.1| PREDICTED: uncharacterized protein LOC100787308 [Glycine max]
Length = 380
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 95 ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG + + E ++ NR
Sbjct: 150 RAIMKVS-KEYLKFYEYNLEGSVENKWR--WFKNSIGALDGIHIPVTVSAEDRPRYCNRK 206
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
GD + NV+ +CGP+ +F V W GS D+RV+R+++
Sbjct: 207 GDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDAL 244
>gi|331211775|ref|XP_003307157.1| hypothetical protein PGTG_00107 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309297560|gb|EFP74151.1| hypothetical protein PGTG_00107 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 427
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/350 (22%), Positives = 131/350 (37%), Gaps = 55/350 (15%)
Query: 6 FKEAFRLSATQVEFILEEIA--PAL-------TTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+ FR+S + + +A P P E + ++ K L C G GS
Sbjct: 98 FKQEFRMSRDSFIMLCKRVADDPVFHNNSNNPQRPIEEQMMVTLKR--LGCF---GNGSS 152
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
++ + ++ + + I+ Q ++KWP + GF G
Sbjct: 153 VGMLARFFRVGEGTVELYTNRCIMAILRIQSQ-LLKWPSAEERVARAIEYGEDGFDGCIG 211
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+DG+++ L D P+ H + +R G + ++ + +C + WPG HD+RVM N
Sbjct: 212 VIDGSLIPLSDCPSRHGSDYYSRKGFYCISTLIVCDSQRNIQYMYTGWPGCSHDSRVMGN 271
Query: 176 SMVYR---KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA---DQRFLRCHKR 229
S + K+ + G +L DSAY ++ + + + F +
Sbjct: 272 SPITLNPDKYFKSG------EFLLADSAYTTTMNVVAAFKKPSHGSLTEDEHSFNYYLAQ 325
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQF------AGMVVLASTTLHNIALTIDRDNENQN 283
R +IE G LK +F L +RL A ++A LHN
Sbjct: 326 KRVVIEQCIGGLKGQFQSLKGLRLRIGGKRDRVRANAWIIACGVLHNFL----------- 374
Query: 284 VEIHDDEAEDDGEEEAVDDPD-EIDEEEQPRQDRNNPGTQRLARLLNYFR 332
+ G+E DD D + + P+Q P TQR FR
Sbjct: 375 ---------NQGDEYDFDDVDPGLTDAPAPQQMEVEPPTQRSTAAARAFR 415
>gi|294462202|gb|ADE76652.1| unknown [Picea sitchensis]
Length = 427
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/280 (21%), Positives = 113/280 (40%), Gaps = 18/280 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS +L+ + P + T +++ + LH + G + + +
Sbjct: 98 WRTTYGLSYPLFTKLLDHLRPHIPPHT---ISMPLHSALAMVLHRLSQGHSARAVASQYK 154
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
+ + + V VI + ++ P +NL I F G PN+ G +DG +
Sbjct: 155 ADPWMVAKITNTVTRVISTKVYPYYIEIPNRQNLLQIIQGFKDLTGLPNMCGAIDGNHIK 214
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ +E + R+ +A+ + A+ F+ V PG D+ R S ++ K
Sbjct: 215 LHKKPNNEFMYKCRHNFYAVVLQAVSDHRKIFWDVCVRAPGGTDDSNHFRESSLFNKLTS 274
Query: 185 EGWRPFPDAVIL-----------GDSAYPLKEWLIPPRTRNPDAA-ADQRFLRCHKRTRR 232
E + D+VI GD YPL +L+ P + N A F + R
Sbjct: 275 E--QVLMDSVITIRGNQLRPYLAGDWCYPLLSFLLVPFSPNGSGTPAQNMFDAALMKGRS 332
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
++E + G+LK ++ L + + A ++A LHN+
Sbjct: 333 VVEQATGLLKGRWKMLQDLNVGLNHAPQTIVACCVLHNLC 372
>gi|242083356|ref|XP_002442103.1| hypothetical protein SORBIDRAFT_08g011993 [Sorghum bicolor]
gi|241942796|gb|EES15941.1| hypothetical protein SORBIDRAFT_08g011993 [Sorghum bicolor]
Length = 625
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 93/219 (42%), Gaps = 34/219 (15%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT +L+ + +NR G NV+A+C N F + A GS HD
Sbjct: 405 FDGCIGAMDGTHILVSVDDGVRLDHINRYGVTTQNVVAVCDFNMMFTYIGAGTEGSAHDM 464
Query: 171 RVMRNSMVYRKFDEEGWRPFP--DAVILGDSAYPLKEWLIPP----RTRNPDAA------ 218
RV +K +E+ P P L DS Y L +PP R R D
Sbjct: 465 RVK------KKVEEDPAFPIPPDGRYYLVDSGYALGPGYMPPYPQKRFRAKDFKNLGPQD 518
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG----MVVLASTTLHNIALT 274
A++ F R H R +IE SFG+ K K+ L + P + G ++L LHN
Sbjct: 519 AEELFNRHHAGLRSVIERSFGVAKSKWRMLKSI---PHYPGTKQTQIILVLFALHNYV-- 573
Query: 275 IDRDNENQNVEIHDDEAEDDGEEE-----AVDDPDEIDE 308
R E ++ +A D G A+ DP+++++
Sbjct: 574 --RGLEGKHRVGRRRQAPDLGPSSKVTAMALSDPNDMEQ 610
>gi|331242287|ref|XP_003333790.1| hypothetical protein PGTG_15550 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 263
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 77/160 (48%), Gaps = 25/160 (15%)
Query: 101 IPMTFLRKGGFPNVAGCVDG-TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + +R+ GFP G VDG T+ L P + + +R +++++ IC N KF +
Sbjct: 114 IAVAVMREEGFPGCVGFVDGATIPLSQKPPSNGQHYFDRKKRYSISLTVICDINKKFISY 173
Query: 160 NANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP----- 210
A +PGS HDA V + V +K FD + + +L DSAY ++IP
Sbjct: 174 LAGFPGSSHDAYVFSHMQVAQKPEKYFDRKQF-------LLADSAYTNDCYVIPAFKGKH 226
Query: 211 --RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
+ RN D F ++R IE++ GILK +F L
Sbjct: 227 LLKRRNID------FNYHLAQSRVRIEHAIGILKGRFASL 260
>gi|388507696|gb|AFK41914.1| unknown [Lotus japonicus]
Length = 170
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 64/130 (49%), Gaps = 2/130 (1%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
LH + Y ++ T+ SK +I R + V+ ++ + + NL R
Sbjct: 3 LHILAHNLKYRVVHFTYYRSKETISRQFNNVLRAVMKVSGEYLKFQDHNNLEGSEAYKWR 62
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
F N G +DGT + + E + ++ NR GD + NV+ +CGP+ KF V W GS
Sbjct: 63 --WFQNSIGALDGTHIPVTVAAEDKPRYRNRKGDISTNVLGVCGPDLKFIYVLPGWEGSA 120
Query: 168 HDARVMRNSM 177
D+RV+R+++
Sbjct: 121 GDSRVLRDAL 130
>gi|390360559|ref|XP_003729721.1| PREDICTED: uncharacterized protein LOC752912 [Strongylocentrotus
purpuratus]
Length = 325
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 120/301 (39%), Gaps = 47/301 (15%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK R+S + +L + P L T A+S K+ L + G+ Y +
Sbjct: 28 FKNYTRVSPEMFDDLLVRLTPHLQKKDTHFRKAISPGLKLSLFLRHLATGATYAELSYNF 87
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K +I + + +V IVD + ++ P N G+ VAG +
Sbjct: 88 RVGKETIQKFVPDVARAIVDEYAAEVISLPTTNY-----------GWLEVAGSL------ 130
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ N ++ +MA+ Y+F ++ G DA++ NS + ++ E
Sbjct: 131 ----------YFNYKQFFSVVLMALVDSKYQFLWIDVGGVGHQSDAQIYNNSEL-KECIE 179
Query: 185 EGWRPFPD--------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
G PD +GD+A+ ++ +++ P R + F R
Sbjct: 180 AGTLGIPDPAPLPHDDEEHPMPYFFVGDNAFVMRTYMMKPYGRRNMDQQQKIFNYRLSRA 239
Query: 231 RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
RR+ EN+FGIL +F C L M P +++ A+ LHN+ I + + +V + D
Sbjct: 240 RRVFENAFGILALRFQCFLGQMGQEPDTVRLLIEAAVMLHNL---IRKRYQALDVRMLDQ 296
Query: 290 E 290
E
Sbjct: 297 E 297
>gi|403172081|ref|XP_003331227.2| hypothetical protein PGTG_13190 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169703|gb|EFP86808.2| hypothetical protein PGTG_13190 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 457
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 87/178 (48%), Gaps = 10/178 (5%)
Query: 82 IVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRN 139
I++T ++ V WP + ++ K F G DGT+ L AP +H++ + R
Sbjct: 209 IIETLEKKQVYWPNTQQRANLTANLAGKTVFDGCIGFADGTIFPLASAPTKHKEDYWMRK 268
Query: 140 GDHALNVMAICGPNYKF-YAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGD 198
+ +N + +C + YAV+ W GS HD RV +N +YR + G+ + ++ G
Sbjct: 269 MVYTVNSLIVCNWQRRIIYAVHG-WCGSAHDQRVYKNCQLYR--NPRGFFSAGEYLLAG- 324
Query: 199 SAYPLKEWLIPPRTRNPDAAA---DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
SAY + +IP R+P + Q+F R ++E++ G+LK ++ L + +
Sbjct: 325 SAYTSTDTIIPAFKRSPGRSLPPEKQQFNYELSHNRVVVEHTIGMLKNRWQSLKSLSI 382
>gi|390366216|ref|XP_003730990.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 288
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 80/184 (43%), Gaps = 3/184 (1%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTE-RNVALSAKEKVLSCLHWMGVGSPYHIIGAT 63
++K R++ E ++ ++ P L T L K+ L ++ G+ Y + +
Sbjct: 77 SYKNFLRITPDLFEEMVGKLTPHLEKKTTFMREPLEVGLKLAVTLRFLATGNSYATLQYS 136
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
KS+I R I EV +++ + + ++ P E+ D+ F + + N G VDG
Sbjct: 137 FRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTEDGWKDVAKKFSSRWNYHNCLGAVDGK 196
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
V I P + + N G H++ +MA+ YKF V+ G D +N +Y
Sbjct: 197 HVAIRKPPKSGSLYFNYKGFHSIVLMAVADAEYKFLYVDVGAEGGSSDGGTWKNCNLYDA 256
Query: 182 FDEE 185
+ E
Sbjct: 257 IEGE 260
>gi|356532810|ref|XP_003534963.1| PREDICTED: uncharacterized protein LOC100782190 [Glycine max]
Length = 343
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 58 ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 112
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG + + E ++ NR
Sbjct: 113 RAIMKVS-KEYLKFYEYNLEGSVENKWR--WFKNSIGALDGIHIPVTVSAEDRPRYRNRK 169
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
GD + NV+ +CGP+ +F V W GS D+RV+R+++
Sbjct: 170 GDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDAL 207
>gi|390358527|ref|XP_001194958.2| PREDICTED: uncharacterized protein LOC756412 [Strongylocentrotus
purpuratus]
Length = 300
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/277 (22%), Positives = 112/277 (40%), Gaps = 24/277 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGA 62
FK R+S + +L + P L T +++ K V C + G+ Y +
Sbjct: 28 FKNYTRVSPEMFDDLLVRLTPHLQKKDTHFRKSIPPGLKLSVFLC--HLATGATYAELSY 85
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDG 120
K +I + + +V +VD + ++ P N ++ F + P+ G DG
Sbjct: 86 NFRVGKETIQKFVPDVARAVVDEYDAEVISLPTTNEGWLEVAGDFEARWNLPHCLGAYDG 145
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + PN+ + N ++ +MA+ Y+F ++ G DA++ NS + +
Sbjct: 146 KHIRLQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFLWIDGG-VGHQSDAQIYNNSEL-K 203
Query: 181 KFDEEGWRPFPDAV--------------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
+ E G PD +GD A+ ++ ++ P R + F
Sbjct: 204 ECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTDMMKPYGRRNMVQKQKIFNYR 263
Query: 227 HKRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVV 262
R R ++EN+FGIL F C L MR P ++V
Sbjct: 264 LSRARHVVENAFGILALWFQCFLGQMRQEPDTVRLLV 300
>gi|357138912|ref|XP_003571030.1| PREDICTED: uncharacterized protein LOC100837391 [Brachypodium
distachyon]
Length = 664
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 100/252 (39%), Gaps = 20/252 (7%)
Query: 36 VALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE 95
+ + +++V L+ +G ++ S ++ R + V+ I + + + K
Sbjct: 92 IHMCVEQQVAMFLNTVGHNLRNRLVATNFNRSGETVSRYFNLVLHAIGELRNELIRKPSM 151
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E I + R F + G +DGT + E F R + NVMA + +
Sbjct: 152 ETPTKIQGNYRRDPYFQDCVGAIDGTHIRASVTKAMEPAFCGRKSHASQNVMAAVDFDLR 211
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----- 210
F V A W G+ HDA V+R+++ G R L D+ Y K +PP
Sbjct: 212 FTYVLAGWEGAAHDALVLRDALE----RPNGLRVPQGKYYLVDAGYGAKPGFLPPFRGVR 267
Query: 211 -----RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMV 261
NP ++ F H R +E +FG LK +F L+ + R Q +
Sbjct: 268 YHLNEWGNNPVQNENELFNLRHSSLRVTVERAFGSLKRRFKILDDATPFFRFPAQVD--I 325
Query: 262 VLASTTLHNIAL 273
V+A +HN L
Sbjct: 326 VVACCIIHNWVL 337
>gi|67083861|gb|AAY66865.1| possible transposase [Ixodes scapularis]
Length = 254
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 92/197 (46%), Gaps = 11/197 (5%)
Query: 85 TFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHAL 144
+ R++ +P +L+ + F + GFP V G + GT V + P H + V+ + + +L
Sbjct: 10 SLSSRLIAFPS-DLDLLAGEFQKVAGFPGVVGSMGGTHVNVRCP-AHRQRPVSTDRELSL 67
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLK 204
V A+ GP +F V PG + ++ S + ++ + R +L D YP +
Sbjct: 68 AVQAVSGPWLRFLDVFVGPPGDKRNMALLSLSPLGKRLESFDHRYH----LLTDVVYPPR 123
Query: 205 EWLIPPR----TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMR-LSPQFAG 259
E L+PP + +P+ A Q F H T ++ ++ ++K +F L+ + + +
Sbjct: 124 ECLVPPYGEKPSDDPETLAKQEFDALHGVTHAVVNDALALVKRRFRQLSRLEFFTLEKMS 183
Query: 260 MVVLASTTLHNIALTID 276
VLA LHN + D
Sbjct: 184 DFVLACCVLHNFLVDAD 200
>gi|449683680|ref|XP_002154386.2| PREDICTED: uncharacterized protein LOC100207161 [Hydra
magnipapillata]
Length = 219
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 31/212 (14%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR-MVKWPEE 96
+S E++ L ++ G I A++ S S+I R I E I + +R + P E
Sbjct: 5 VSPSERLAVTLRFLVTGDAQCTIAASYRISTSTISRIISETCAAIWTSLKERNFLHVPSE 64
Query: 97 N--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
I F FP+ G +DG +++ AP+ +F N H++ ++A+C Y
Sbjct: 65 KQEWKTIAKEFENMWNFPHAIGAIDGKHIVMQAPHNGGSEFFNYKKTHSIVLLAVCNAKY 124
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRN 214
+F V D E P+ V++ D A+ LK + + N
Sbjct: 125 EFTMV-----------------------DIEKVLPY---VLVADDAFGLKRHTM-KLSPN 157
Query: 215 PDAAADQRFLRCH-KRTRRLIENSFGILKEKF 245
+ DQ+ R RR+IEN+FGI +F
Sbjct: 158 QNILLDQKIFNYRLSRARRVIENTFGIATTRF 189
>gi|449688588|ref|XP_004211784.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 231
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 82/208 (39%), Gaps = 29/208 (13%)
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
T G + + I EV IV R V P+ + + + F K G GCVDG
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV-- 178
T + I P H + ++L V A+C F V WPGSVHDA+V NS +
Sbjct: 65 THIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSINT 124
Query: 179 ----------YRKFDEEGWRPFPDAVILGDSAYPL-----KEWLIPPRTRNPDAAADQRF 223
++ + + ++G AYPL KE+ + D
Sbjct: 125 NLRSSRLPGTFQTITKNKIK--VPCYLIGGPAYPLLPHCMKEY---STCKKNDEVIFNSM 179
Query: 224 LRCHKRTRRLIENSFGILKEK--FPCLN 249
LR R IE +FG LK F C++
Sbjct: 180 LR---TARNPIECAFGRLKAHPIFSCID 204
>gi|242048510|ref|XP_002462001.1| hypothetical protein SORBIDRAFT_02g012350 [Sorghum bicolor]
gi|241925378|gb|EER98522.1| hypothetical protein SORBIDRAFT_02g012350 [Sorghum bicolor]
Length = 276
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 21/216 (9%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
S ++ R I+EV +I QR ++ P + N G +DGT V I
Sbjct: 30 SGETVHRKINEVFDII-PALIQRFIRLPNPSNTH------------NCIGAIDGTHVPIT 76
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
+ + NR G + NVM C + KF +++ W GS DA V+R+++ +G
Sbjct: 77 IGQDKASPYRNRKGTLSQNVMFACDFDLKFTFISSGWEGSASDAGVLRSAL------SKG 130
Query: 187 WR-PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKF 245
+ P ++ Y L E+ + N A + F H R IE +FG+LK+ F
Sbjct: 131 FTVPAGKFYLVDGVKYHLSEFRRRGQRGNAYANYKELFNHRHAILRNHIERAFGVLKKWF 190
Query: 246 PCLNYMRLSPQFAGMVV-LASTTLHNIALTIDRDNE 280
P L P + + + A+ HNI ++ E
Sbjct: 191 PILKVGTHYPIESQVKIPAAAAMFHNIIRGLNGSEE 226
>gi|449665569|ref|XP_002165138.2| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 193
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 75/165 (45%), Gaps = 11/165 (6%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DG V I+AP + F N G +++ ++AI + F V+ G +DA ++
Sbjct: 1 GAIDGKHVRIEAPAKSGSSFYNYKGFYSMVLLAIFDAKHCFTMVDIGAYGRDNDAAILNA 60
Query: 176 SMVYRKFDEEGWRPFPDA---------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
S R F+ +G+ P V++GD + LK WL+ P + F
Sbjct: 61 STFGRAFN-KGYFNLPKISEFDHKVPPVLVGDDMFALKPWLMKPYPGKNLTVQQRVFNYR 119
Query: 227 HKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN 270
R RR IENSFGIL ++ ++ P ++ A+ LHN
Sbjct: 120 LSRARRTIENSFGILAARWRIYRSPIKAKPLKVEHIIKATVCLHN 164
>gi|328702158|ref|XP_003241823.1| PREDICTED: hypothetical protein LOC100573170 [Acyrthosiphon pisum]
Length = 428
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 80/200 (40%), Gaps = 15/200 (7%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
M K +E DI TF K FPN G +DG V F N ++ +MA+
Sbjct: 1 MPKPNKEKWIDISETFYSKTNFPNCLGAIDGKHVRCRNSENSGSVFYNNKKYFSIVLMAV 60
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP 209
N F ++ G D+ V R S V+ K PD V A PL E I
Sbjct: 61 VDANLNFIYIDVGAYGREADSTVFRQS-VFGKMLYSQQLQIPDPV-----ALPLTENNIQ 114
Query: 210 PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL-SPQFAGMVVLASTTL 268
P D A R RR +E +FG+L K+ L+ L P F ++ A L
Sbjct: 115 PFVFVADEAL--------SRARRTVECAFGVLANKWRVLHTTILVEPNFCDDIIKACCVL 166
Query: 269 HNIALTIDRDNENQNVEIHD 288
HN D N ++ ++H+
Sbjct: 167 HNFVRKRDGYNYDEETDVHN 186
>gi|449673172|ref|XP_004207882.1| PREDICTED: uncharacterized protein LOC101241684 [Hydra
magnipapillata]
Length = 215
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 93/212 (43%), Gaps = 16/212 (7%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQR-MVKWPEE 96
+S E++ L + G I A++ S S+I R I E I + +R + P E
Sbjct: 5 VSPSERLAVTLRFFVTGDAQCTIAASYRISASTISRIISETCAAIWTSLKERNFLHVPSE 64
Query: 97 NL--NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
N I F FP+V G +DG +++ AP+ ++ N H++ ++A+C Y
Sbjct: 65 NQERKTIAKEFENMWNFPHVKGAIDGKHIVMQAPHIGGSEYFNYKKTHSIVLLAVCNATY 124
Query: 155 KFYAVNANWPGSVHDARVMRN-SMVYRKFDEEGWRPFPD----------AVILGDSAYPL 203
+F V+ G D V N S+ Y + + P P+ V++ D A+ L
Sbjct: 125 QFTMVDIGDSGRQSDGSVFNNCSLGYAIENNKLNIPDPEYIGNSEKVLLYVLVADDAFGL 184
Query: 204 KEWLIPPRTRNPDAAADQRFLRCH-KRTRRLI 234
K ++ P + N + DQ+ R RR+I
Sbjct: 185 KRHMMKPYS-NQNIPLDQKIFNYRLSRARRVI 215
>gi|403164564|ref|XP_003324654.2| hypothetical protein PGTG_06191 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165217|gb|EFP80235.2| hypothetical protein PGTG_06191 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 404
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 88/196 (44%), Gaps = 14/196 (7%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
+ ++ ++ V+ I D ++ + WP+E ++ GF G VDGT + L
Sbjct: 137 GRGTVVKASRRVIRAIND-LSEKYLMWPDEVRRKEVSDVMKCEGFEGCVGFVDGTTIPLY 195
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK---- 181
P+ + F ++ +++N +C + A WPGS ++ V + M++++
Sbjct: 196 QRPSIDGEVFFDQKKHYSINCQVVCDCDRFITAYMTGWPGSCGNSMVFKRMMLHKEPTLF 255
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLIENSFGI 240
FD + ++ DSAY L IP D F C R+R E++ GI
Sbjct: 256 FDRGQY-------LIADSAYELGVHCIPAYKAPAAYIKDNSDFNYCLARSRVRNEHTIGI 308
Query: 241 LKEKFPCLNYMRLSPQ 256
LK ++ L ++RL+ Q
Sbjct: 309 LKGRWASLQHLRLAIQ 324
>gi|241594080|ref|XP_002404243.1| transposase, putative [Ixodes scapularis]
gi|215502315|gb|EEC11809.1| transposase, putative [Ixodes scapularis]
Length = 275
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 106/233 (45%), Gaps = 24/233 (10%)
Query: 90 MVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+KWP+ E I F + GFP V G + + + I P + ++F NR H++ + A
Sbjct: 43 FIKWPDPEAAKSIIAGFEARSGFPGVIGAIGRSHIPIRCPVDDSERFRNRLDFHSVVLQA 102
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+C F +A PGS D V R S+ + + + PF D+ ++GD+ +P+ L+
Sbjct: 103 VCDHRMVFLDCSAGHPGSTSDMLVFRRSLFIQSLEASKF-PF-DSHMVGDATFPIGPHLM 160
Query: 209 PP-----RTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ-FAGMV 261
P +A+ +++ F C IE + +LK +F L + + +
Sbjct: 161 VPFEDDGELGEEEASFNEKIFDGCET-----IERALALLKGRFRRLQGLETGRRDLVVTI 215
Query: 262 VLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQ 314
++ + LHN + D+ + +++ ++ DD D++ E E+ RQ
Sbjct: 216 IIMACVLHNACIMW--DDLLEEFKVYRRSLDN-------DDGDDLVEVEEIRQ 259
>gi|195032221|ref|XP_001988458.1| GH11177 [Drosophila grimshawi]
gi|193904458|gb|EDW03325.1| GH11177 [Drosophila grimshawi]
Length = 383
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 115/278 (41%), Gaps = 24/278 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVA--LSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR+ + E +L+ I A+ + +S EK+L L + + +G
Sbjct: 67 FKEQFRMQRSSFETLLQVIGNAIAGGEQHQPIGRVSLPEKLLYTLTLLSGEKSFREVGIN 126
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
SKSS I V T VKWP +N+ + + K P V G +D +
Sbjct: 127 FAISKSS-GHEIFRWVTSAFATLLPHYVKWPADNV--CSGSGISK--LPGVVGVIDECRI 181
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ P E+ ++ + L + A+C +F V+ + PG+ ++++ + R D
Sbjct: 182 QLKLPVREENGHLHYSW---LALQAVCDERSRFLDVHIDVPGN-QQCVLLKSDLFERLID 237
Query: 184 EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI-------EN 236
EE P ++G+ YPL L+ P N + CH R + + E
Sbjct: 238 EESPLMPPHKHLVGEMMYPLLLNLMTPCADN-----NGELTPCHIRYNQAVHLWNAPAER 292
Query: 237 SFGILKEKFPCLNYMRLSPQFAG-MVVLASTTLHNIAL 273
+F L +F L + + G MV+ A+ LHN L
Sbjct: 293 AFAALLSRFRRLASLDVGTMEVGSMVISATCMLHNFIL 330
>gi|356532966|ref|XP_003535040.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 385
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 105/253 (41%), Gaps = 27/253 (10%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR----KGGFPNVAGCVDGTM 122
S +I R + V+ V+ F + ++K + + D P L+ + F + G VDGT
Sbjct: 132 SGETISRHFYSVLEVVC-MFAKDIIKPVDPSFRDTPDEILKDSRYRPYFRDCIGAVDGTH 190
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P+ + ++ G NVM +C + F V A W GS HD ++ ++ RK
Sbjct: 191 IRVCVPSHLQGVYIGWKGYTTTNVMVVCDFSMCFTFVWAGWEGSAHDTKIFMEAL--RKP 248
Query: 183 DEEGWRPFPDAVILGDSAYP---------------LKEWLIPPRTRNPDAAADQRFLRCH 227
P L DS YP L ++ I PR R + F H
Sbjct: 249 ALHFPHPPQGKYYLVDSGYPTFMGFLGPYKKIMYHLPQFRIGPRIR----GRVEVFNYYH 304
Query: 228 KRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
+ IE +FG+ K ++ L M + + +++A +HN D+ + +
Sbjct: 305 SSLQSAIECAFGLCKARWKILGNMPPFTLKTQNQIIVACMAIHNFIQRNDKSDGEFDSLD 364
Query: 287 HDDEAEDDGEEEA 299
D+E D E+E+
Sbjct: 365 EDNEYIDSDEDES 377
>gi|322782708|gb|EFZ10552.1| hypothetical protein SINV_02440 [Solenopsis invicta]
Length = 241
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 64/145 (44%), Gaps = 20/145 (13%)
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAIC 150
++WP+++ + F K G NV G +DGT + I+ P+ +E F+ G ++
Sbjct: 5 IRWPQKDYIAVREDFRAKKGINNVIGAIDGTTIRIEKPSINEKDFI-ATGKRVVDA---- 59
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
N KF + PGS+HDAR F ILGDSAY WL+PP
Sbjct: 60 --NIKFTNIYCGEPGSLHDARYKETLFPENTF------------ILGDSAYASLSWLVPP 105
Query: 211 RTRNPDAAADQR-FLRCHKRTRRLI 234
N Q+ F H TR ++
Sbjct: 106 FRDNGHLTLQQKEFNFLHSSTRMVV 130
>gi|443686502|gb|ELT89758.1| hypothetical protein CAPTEDRAFT_216726, partial [Capitella teleta]
Length = 193
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 66/140 (47%), Gaps = 4/140 (2%)
Query: 10 FRLSATQVEFILEEIA-PALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+RL ++ + EE + TER A+ +VL L + G G + A HG S+
Sbjct: 54 YRLPRAMIKSLSEEYGRSSFNNVTERGHAIPPDTQVLVALRFYGKGGFQSELAAIHGVSR 113
Query: 69 SSICRSIHEVVPVIVDTFFQRMV--KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
SS+ R I V + + M K + + + F K P + GC+DGT + I
Sbjct: 114 SSVSRIIRHVSEFLCSKAAESMTLPKSRQRQMQSM-QKFHGKRNIPGILGCIDGTHIGIL 172
Query: 127 APNEHEDQFVNRNGDHALNV 146
AP + E +VNR G H++NV
Sbjct: 173 APAKDEWAYVNRKGGHSINV 192
>gi|242067219|ref|XP_002448886.1| hypothetical protein SORBIDRAFT_05g000945 [Sorghum bicolor]
gi|241934729|gb|EES07874.1| hypothetical protein SORBIDRAFT_05g000945 [Sorghum bicolor]
Length = 649
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 93/219 (42%), Gaps = 34/219 (15%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT +L+ + +NR G NV+A+C N F + A GS HD
Sbjct: 429 FDGCIGAMDGTHILVSVDDGVRLDHINRYGVTTQNVVAVCDFNMMFTYIGAGTEGSAHDM 488
Query: 171 RVMRNSMVYRKFDEEGWRPFP--DAVILGDSAYPLKEWLIPP----RTRNPDAA------ 218
RV +K +E+ P P L DS Y L +PP R R D
Sbjct: 489 RVK------KKAEEDPAFPIPPDGRYYLVDSGYALGPGYMPPYPQKRFRAKDFKNLGPQD 542
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG----MVVLASTTLHNIALT 274
A++ F R H R +IE SFG+ K K+ L + P + G ++L LHN
Sbjct: 543 AEELFNRHHAGLRSVIERSFGVAKSKWRMLKSI---PHYPGTKQTQIILVLFALHNYV-- 597
Query: 275 IDRDNENQNVEIHDDEAEDDGEEE-----AVDDPDEIDE 308
R E ++ +A D G A+ DP+++++
Sbjct: 598 --RGLEGKHRVGRRRQAPDLGPSSKVTAMALSDPNDMEQ 634
>gi|4585884|gb|AAD25557.1|AC005850_14 Hypothetical protein [Arabidopsis thaliana]
Length = 404
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 96/222 (43%), Gaps = 33/222 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V + P+ +++ R + +N++A+C + KF PG HD
Sbjct: 174 FIDCIGALDGTHVSVRPPSGDVERYRGRKSEATMNILAVCNFSMKFNIAYVGVPGRAHDT 233
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA--VILGDSAYPLKEWLIPP--RTR---------NPDA 217
+V+ + E P P A L DS YP + + P RTR P
Sbjct: 234 KVLTYCATH-----EASFPHPPAGKYYLVDSGYPTRSGYLGPHRRTRYHLELFNRGGPPT 288
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG----MVVLASTTLHNIAL 273
+ + F R H R +IE +FG+ K K+ L+ R P++ +V ++ LHN
Sbjct: 289 NSRELFNRRHSSLRSVIERTFGVWKAKWRILD--RKHPKYEVKKWIKIVTSTMALHNYIR 346
Query: 274 TIDRDNEN-------QNVEIHDDEAEDDGEEEAVDDPDEIDE 308
+++ + ++ E H D E+DG V D + E
Sbjct: 347 DSQQEDSDFRHWEIVESYEQHGD--ENDGHVPYVPTGDRVME 386
>gi|331224048|ref|XP_003324696.1| hypothetical protein PGTG_06233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 363
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 119/305 (39%), Gaps = 51/305 (16%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR++ ++ ++++P L P R LS + +V L+ + G Y IG
Sbjct: 58 FVEFFRMTVEDFRWLSDQLSPELQQDPLRRGDPLSVEAQVAVGLYRLAHGVCYVTIGHVF 117
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN----LNDIPMTFLRKGGFPNVAGCVDG 120
K + ++ V ++ R V +P N ++I +F ++ G PN+ G +DG
Sbjct: 118 NIGKETAEKASGRFVNAVLRILRDRTVGYPPLNRPDLWDEIKASFEQRHGIPNIVGAIDG 177
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
T + + P NW GS+HD RV R S + +
Sbjct: 178 THIPVAMPPAD------------------------------NWKGSMHDTRVFRRSRLGQ 207
Query: 181 KF--DEEGWRPFP-DAVILGDSAYPLKEWLIPPRTRNPDAAAD--QRFLRCHKRTRRLIE 235
R P + ++GD+ YP ++ P P A + + F TR +E
Sbjct: 208 SMTPGTGAARIIPAGSYLIGDAGYPSNVDILVP---YPLVATEPNEWFNFIQSSTRICVE 264
Query: 236 NSFGILKEKFPCLNYMRLSPQF-AGMVVLASTTLHNIALTIDRDNENQNVEIHD-DEAED 293
+FG LK +F L + + F A LHNI N + +H DE +D
Sbjct: 265 QAFGRLKNQFRILRSAQNARPFRARNNTFVCMILHNIL------NRRGTLYLHVWDEGQD 318
Query: 294 DGEEE 298
E
Sbjct: 319 AHERR 323
>gi|403168073|ref|XP_003327777.2| hypothetical protein PGTG_08544 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167338|gb|EFP83358.2| hypothetical protein PGTG_08544 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 374
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 109/257 (42%), Gaps = 29/257 (11%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK+A R + + ++L + L TPT + + + VG
Sbjct: 88 FKQAIRTTKSGFTWLLSQ--QQLLTPTANPSPIGTDTRTTWIKRNLCVG----------- 134
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-L 124
+ ++ ++ V I D ++ + WP+++ ++ GF G VDGT + L
Sbjct: 135 --RGTVVKASCWVTRAIKD-LSEKYLIWPDKDRQKEVSDVMKGEGFEGCVGFVDGTTIPL 191
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK--- 181
P+ + F +R +++N IC + A WPG+ D+ V + M+ ++
Sbjct: 192 YQRPSIDGEVFFDRKKHYSINCQVICDCDRFITAYMTGWPGTCGDSMVFKRMMLQKEPAL 251
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKRTRRLIENSFG 239
F + G ++ DSAY L IP + P A + F C R+R E++ G
Sbjct: 252 FFDRG------KYLIADSAYELGVHCIPA-YKAPAAYIKENTEFNYCLARSRVQNEHTIG 304
Query: 240 ILKEKFPCLNYMRLSPQ 256
ILK ++ L ++L+ Q
Sbjct: 305 ILKGRWASLQQLQLAIQ 321
>gi|301116483|ref|XP_002905970.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109270|gb|EEY67322.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 326
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 111/252 (44%), Gaps = 28/252 (11%)
Query: 93 WP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ-FVNRNGDHALNVMAIC 150
WP EE I + +K GF N G DGT+ + A +H+ + + +R ++++ + C
Sbjct: 75 WPNEEERVMISLRMQQKYGFVNCVGITDGTLFPLAAKPQHQGEGYSSRKASYSVHGLVTC 134
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNS-MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP 209
++ WPGS HD R NS ++ ++ D + +LGDSA+ +IP
Sbjct: 135 DDVGSVRSLVVGWPGSTHDNRAWMNSPLILKRADHFKHNEY----VLGDSAFQASSVMIP 190
Query: 210 PRTRNPDAAADQRFLRCHK---RTRRLIENSFGILKEKFPCLNYMRLS-----PQFAGMV 261
P A + R +K + R E+ +LK +FP L +R+ ++
Sbjct: 191 AFKNPPKAQMNPRHTYFNKQLAKARIKSEHCIRLLKMRFPYLREIRVKLGKKRKHIRRLI 250
Query: 262 --VLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNP 319
V ++ LHN+ + ++ QN EI E + D+ DE +P + N
Sbjct: 251 RHVTCASILHNLLIAEPIPHQWQN-EI---------ERQITSRLDDDDELNEPIPEGEN- 299
Query: 320 GTQRLARLLNYF 331
G +R +LL Y
Sbjct: 300 GDKRRNQLLAYM 311
>gi|301098673|ref|XP_002898429.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105200|gb|EEY63252.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 302
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 86/174 (49%), Gaps = 18/174 (10%)
Query: 44 VLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPM 103
+++ L ++ GS Y I T G S SS R I+E + I ++V E+ ++ M
Sbjct: 2 LMTILRYLAGGS-YLDIRRTVGISGSSYYRVINETMFAIFGLRELKIVFPDSESDREVVM 60
Query: 104 ----TFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVN----RNGDHA---LNVMAICGP 152
G GCVDG + +I AP+ + VN +G +A +NV A+C
Sbjct: 61 NDFEAISSSGIISGCVGCVDGWLCVIKAPSAEDAGDVNIGRYYSGHYACPGINVQAVCDA 120
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKE 205
+ +F +V+A++PGS +DAR R + V R FP + I+GD+AYP +
Sbjct: 121 HCRFISVDASYPGSTNDARDFRGTGVARNISS-----FPTGMYIIGDNAYPYHD 169
>gi|356545235|ref|XP_003541050.1| PREDICTED: uncharacterized protein LOC100782976 [Glycine max]
Length = 380
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 80/171 (46%), Gaps = 13/171 (7%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
KE F+L IL+E + T +NV + E V LH + Y ++ ++
Sbjct: 87 KEFFKLCR-----ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCR 136
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
S +I R V+ I+ + +K+ E N+ R F N G +DG + +
Sbjct: 137 SMETISRQFKNVLRAIMKVS-KEYLKFHEYNIEGSVENKWR--WFKNSIGALDGIHIPVT 193
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
E ++ NR GD + NV+ +CGP+ +F V W GS D+RV+R+++
Sbjct: 194 VSAEDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDAL 244
>gi|301113192|ref|XP_002998366.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111667|gb|EEY69719.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 306
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 9/141 (6%)
Query: 107 RKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
+ FP+ G +DGT+ L+ P +H + + +R +A+N + IC + N W G
Sbjct: 121 KTSSFPHCIGFIDGTLFPLLTRPQDHGEDYFSRKASYAINGLVICDDQSRICYENVGWSG 180
Query: 166 SVHDARVMRNSMVYRKFDEEGWRPFP-DAVILGDSAYPLKEWLIPPRTRNPD---AAADQ 221
S +D RV RN V D R F D +LGDSAY L+P + D +
Sbjct: 181 SSYDNRVWRNCKVALNKD----RYFGHDQYLLGDSAYQTPNVLVPAFKTSADVQLSVKKI 236
Query: 222 RFLRCHKRTRRLIENSFGILK 242
+F +C + R I+ + +++
Sbjct: 237 QFTKCLAKVRIPIKATIKLIR 257
>gi|47229913|emb|CAG10327.1| unnamed protein product [Tetraodon nigroviridis]
Length = 585
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 113/300 (37%), Gaps = 39/300 (13%)
Query: 10 FRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
FR+S ++ +++ P LT T +AL +++V L + Y I A G K
Sbjct: 226 FRMSRETFFYLCDKLRPRLTRQDTTFRLALPVEKRVAVALWRLASNVEYRTISALFGVGK 285
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
S++CR + ++ IV ++ P E+ L D L GFP+ + I
Sbjct: 286 STVCRCVRDMCHAIVALLSSVYLRAPDEQELQDSAQCCLSSWGFPHCVAAMTTLHTAIIT 345
Query: 128 PNEHEDQFVNRNG-----------DHALNVMAICGPNYK----FYAVNANWPGSVHDARV 172
P+ + + N G M C P+ F + S V
Sbjct: 346 PSNNASDYANPAGWLSVVSQVAQLQQTCAQMLACRPSLTSRRGFRWLLTAVGSSGMCVPV 405
Query: 173 MRNSMVYRKFDEEGWRPFPDA---------VILGDSAYPLKEWLIP-------PRTRNPD 216
+NS ++ E G P P VILG++ YPL+ WL+ R R
Sbjct: 406 SQNSSLWATAAEGGLSPSPPPVFMGRPLRYVILGEACYPLQSWLMKVYSEEEGKRGRKTA 465
Query: 217 AAADQRFLRCHKRTRRLI---ENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIA 272
QR + R RR + E + L+ ++ CL+ ++LA LHN+
Sbjct: 466 MTEPQRLF--NHRLRRALRAPEETLLRLRARWQCLSKRNDCGLDLVPTMILACCILHNLC 523
>gi|403177022|ref|XP_003888818.1| hypothetical protein PGTG_22337 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172694|gb|EHS64701.1| hypothetical protein PGTG_22337 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 437
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 87/201 (43%), Gaps = 19/201 (9%)
Query: 83 VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGD 141
+++ + V WP+ ++ GFP G VDGT + L P + + +R
Sbjct: 185 INSLSSKHVVWPDRYRRAEISEVMKDEGFPGCVGFVDGTTIPLHQRPGLDGEVYWDRKKK 244
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA--VILGDS 199
+++N +C + + WPGS D+ V RN+ ++ + + + DA ++ DS
Sbjct: 245 YSVNCQIVCDCDRYITSFTVGWPGSCGDSWVFRNTKLHTQAE-----AYFDAGQFLIADS 299
Query: 200 AYPLKEWLIPPR---TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ 256
AY L IP N AD F C ++R E++ GILK ++ L +RL
Sbjct: 300 AYGLSCTTIPAYKAPASNKRENAD--FNYCLAKSRVRNEHTIGILKGRWASLQQLRLHLH 357
Query: 257 FAGMV------VLASTTLHNI 271
+ V TLHN+
Sbjct: 358 KKSHMKEIIRWVSCCVTLHNM 378
>gi|388491000|gb|AFK33566.1| unknown [Medicago truncatula]
Length = 245
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 101/213 (47%), Gaps = 15/213 (7%)
Query: 89 RMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNR-------NG 140
+ + WP+EN +N I F G PNV G + T + I AP + + + N+
Sbjct: 3 KFITWPDENKMNLIKQEFEGLFGMPNVGGSMYTTHIPIIAPKNNVNSYFNKRQTQRNQKT 62
Query: 141 DHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSA 200
+++ V + F V +PGS +D +V+ S++Y++ + D+ ++G+S
Sbjct: 63 AYSVTVQGMVDAKGVFTDVFLGYPGSYNDDQVLEKSVMYQRAMTGNLK---DSWVVGNSG 119
Query: 201 YPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM 260
+PL + ++ P T F + ++L +++F +K ++ CL R+ + +
Sbjct: 120 FPLMDGILVPYTHQNLTWTQHAFNEKVEDIQKLSKDAFAKVKGRWSCLQ-KRIEVKIEEL 178
Query: 261 --VVLASTTLHNIA-LTIDRDNENQNVEIHDDE 290
V+ A LHNI + ++ + N E+ DDE
Sbjct: 179 PGVLGACCVLHNICEMRNEKMDPAWNFELFDDE 211
>gi|356510414|ref|XP_003523933.1| PREDICTED: uncharacterized protein LOC100814513 [Glycine max]
Length = 543
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 96/236 (40%), Gaps = 33/236 (13%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKG----GFPNVAGCVDGTM 122
S +I R H V+ + F + ++K + + D P L+ F + G +DGT
Sbjct: 77 SGETISRHFHNVLEAVC-MFAKDIIKPVDPSFRDTPDEILKDARYRPYFRDCIGAIDGTH 135
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ + P+ + ++ R G NVMA+C + F V A W GS HD ++ ++ K
Sbjct: 136 IRVCVPSHLQGVYIGRKGYTTTNVMAVCDFSMCFTFVWAGWEGSEHDTKIFMEAL--HKP 193
Query: 183 DEEGWRPFPDAVILGDSAYP---------------LKEWLIPPRTRNPDAAADQRFLRCH 227
P L DS YP L ++ I PR R + F H
Sbjct: 194 ALHFPHPPQGKYYLVDSGYPTFMSFLGPYMKTRYHLPQFRIGPRIR----GRVEVFNYYH 249
Query: 228 KRTRRLIENSFGILKEKFPCLNYMRLSPQFA----GMVVLASTTLHNIALTIDRDN 279
+ IE +FG+ K ++ L M P FA +++A +HN D+ +
Sbjct: 250 SSLQSTIERAFGLCKTRWKILGNM---PPFALKTQNQIIVACMAIHNFIQRNDKSD 302
>gi|357116794|ref|XP_003560162.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 406
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 121/277 (43%), Gaps = 32/277 (11%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
++ ++ ++KV L +G G ++ T+ S +I R V+ I+ + +K P
Sbjct: 103 SIYVTVEKKVAMFLLVVGHGLKMRLLCVTYKRSLETISRHFSTVLSAIL-SLTNEFIKLP 161
Query: 95 EENL---NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
+ ++ ND + F N G +DG V + ++ ++ NR N++ +
Sbjct: 162 DPSIPPPNDYKWKW-----FGNALGALDGCHVDVMVDVANQGRYRNRKQSITTNMLGVVD 216
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFD------------EEGWRPFPDAVI-LGD 198
N K V W GS D+RV+R++M + D + G+ P +
Sbjct: 217 WNMKILYVLPGWEGSTLDSRVLRDAMRPNRQDTFVVPKGKFYLVDAGYTNGPGFLSPFQS 276
Query: 199 SAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN---YMRLSP 255
+ Y LKEW+ + P A + LR H R R ++E +FG+ K+K+ L + +
Sbjct: 277 TRYHLKEWV--SSQQQPHTAKELYNLR-HSRARNVVERTFGLWKKKWAILRTQFFFDIKD 333
Query: 256 QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAE 292
Q ++ A LHN DR + N+ + + +AE
Sbjct: 334 QIR--IINACCVLHN--FIRDRKHVRDNLLLPEVDAE 366
>gi|331228536|ref|XP_003326935.1| hypothetical protein PGTG_08472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305925|gb|EFP82516.1| hypothetical protein PGTG_08472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 423
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 13/172 (7%)
Query: 88 QRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNV 146
+R V WP+ + + GF G VDGT + + P + F +R +++N
Sbjct: 190 RRYVVWPDAARRAEISEVMSREGFRGCVGFVDGTTIPMFQRPGYDGEVFFDRKRRYSINA 249
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVY----RKFDEEGWRPFPDAVILGDSAYP 202
IC + + WPGS D+RV + ++ FDE + +L DSAY
Sbjct: 250 QIICDCDKYITSFITGWPGSCGDSRVYKRMQLHLNPSNYFDEGQY-------LLADSAYE 302
Query: 203 LKEWLIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
L +IP + + + +F C + R E++ G+LK ++ L MRL
Sbjct: 303 LSHTVIPAYKVPAANITINSQFNFCLAKARVRNEHTIGVLKSRWSSLREMRL 354
>gi|357140604|ref|XP_003571855.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 211
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 72/175 (41%), Gaps = 16/175 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V P +F R H NV+ + F V+A W GS HD+
Sbjct: 23 FKDCIGAIDGTHVPCHVPARIVHRFHGRKPFHTQNVLVAVDFDLLFTYVSAGWEGSAHDS 82
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-----------RTRNPDAAA 219
V+R+S+ + G R L D+ Y + +PP R N
Sbjct: 83 TVLRHSLEH----PNGLRVLEGKYYLADAGYVARRGFLPPFRQTQYHLREWRGNNKPRTP 138
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNIAL 273
++ F H R +E +FG LK +F L P + + VV+A LHN L
Sbjct: 139 NELFNLRHSSFRTTVERAFGTLKNRFKFLTTRPYYPFPSQVRVVIACCILHNXIL 193
>gi|328700507|ref|XP_003241283.1| PREDICTED: hypothetical protein LOC100159175 [Acyrthosiphon pisum]
Length = 730
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 80/174 (45%), Gaps = 26/174 (14%)
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE-----------GWRPFPDAVI 195
MA+ +Y V+ G D + +S + + D + G VI
Sbjct: 1 MALVDAHYNCIVVDVGAYGRNSDGGIFMHSKLGKGLDRKQLNVPPCSALPGTTNIAPFVI 60
Query: 196 LGDSAYPLKEWLIPPR-TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRL 253
LGD A+PLKE+L+ P + D + + + H R RR++EN+FGIL +KF N ++
Sbjct: 61 LGDEAFPLKEYLMRPYPGKQLDDDSKRIYNYHHCRGRRVVENAFGILSQKFRIYNRRIQA 120
Query: 254 SPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD-DEAEDDGEEEAVDDPDEI 306
P+ ++LA+ LHN ++IHD + + GE ++DP +
Sbjct: 121 KPENVDNIILATCILHNF------------IKIHDGKQTYNRGESTNLEDPSTL 162
>gi|356529493|ref|XP_003533325.1| PREDICTED: uncharacterized protein LOC100779133 [Glycine max]
Length = 392
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 77/174 (44%), Gaps = 20/174 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG V + + + F N+NG + N++A C + KF+ V A W GS D
Sbjct: 145 FKDCVGVIDGIHVPVTVGVDEQGPFRNKNGLLSQNILAACSFDLKFHYVLAGWEGSATDL 204
Query: 171 RVMRNSMVYRKFDE--EGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA---------- 218
V +++ R + EG + DS YP I P + P +
Sbjct: 205 LVFNSAITRRNKLQVPEG------KYYIVDSKYPNVPGFIAPYSSTPYYSKEFLSDYHPQ 258
Query: 219 -ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
A + F + H R + + +FGILK +FP L P Q +V+A+ LHN
Sbjct: 259 DAGELFNQRHSLLRHVTDRTFGILKARFPILMSAPSYPLQTQVKLVVAACALHN 312
>gi|242061736|ref|XP_002452157.1| hypothetical protein SORBIDRAFT_04g020772 [Sorghum bicolor]
gi|241931988|gb|EES05133.1| hypothetical protein SORBIDRAFT_04g020772 [Sorghum bicolor]
Length = 306
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 22/145 (15%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT + I + + + NR G + N+M C + F +++ W GS DA
Sbjct: 111 FKNCLGAIDGTHIPITISQDKQAPYRNRKGTLSQNIMLACDFDLNFTFISSGWEGSATDA 170
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
RV+R+++++ G+R L D Y P A R +R H
Sbjct: 171 RVLRSTLLH------GFRVPEGKFFLVDGGY----------ANTPSFIAPYRGVRNH--- 211
Query: 231 RRLIENSFGILKEKFPCLNYMRLSP 255
IE + G+LK++FP L L P
Sbjct: 212 ---IERAIGVLKKRFPILKVGTLHP 233
>gi|403160884|ref|XP_003890524.1| hypothetical protein PGTG_20815 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170436|gb|EHS64115.1| hypothetical protein PGTG_20815 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 421
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/313 (22%), Positives = 120/313 (38%), Gaps = 52/313 (16%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR+S ++ +E+ L P R LS + +V L+ +G G+ Y +I
Sbjct: 59 FIEYFRMSRADFAWLSDELRETLQQDPLRRGAPLSVEAQVAVGLYRLGHGATYVMISHVF 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGGFPNVAGCVDG 120
K + ++ V ++ + + +P + ++I F + G P++ G +DG
Sbjct: 119 NIGKETADKATGHFVQAVLKVLRLQTISFPGLDAHDKWDEIIELFEWRHGIPDIVGAIDG 178
Query: 121 TMV-LIDAPNEHEDQFVNR------------NGDHALNVMAICGPN-------------- 153
T + L P++ ++NR NGD N +C P
Sbjct: 179 THIPLAIPPSDEWKGYINRKNWASLVFQCVVNGDG--NFRDVCDPPLHNSYIQSTSSQLL 236
Query: 154 ---------YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPF---PDAVILGDSAY 201
+ V GS+HD RV R S + + R P ++GD+ Y
Sbjct: 237 HALMSDERVWTVGQVFGGGAGSIHDTRVFRRSDLGISLNNALGREMRIPPGTHLIGDAGY 296
Query: 202 PLKEWLIPPRTRNPDAAADQ--RFLRCHKRTRRLIENSFGILKEKFP-CLNYMRLSPQFA 258
P ++ P P A + F TR ++E +FG LK +F L + +P A
Sbjct: 297 PSDVNVLVP---YPSVVAPENVHFNHIQSATRIIVEQTFGRLKNRFRILLTAQKANPVRA 353
Query: 259 GMVVLASTTLHNI 271
LHN+
Sbjct: 354 RNTAFVCMILHNL 366
>gi|18958690|gb|AAL82673.1|AC092387_21 transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|20270066|gb|AAM18154.1|AC092172_14 Putative En/Spm-like transposon protein [Oryza sativa Japonica
Group]
gi|31430897|gb|AAP52749.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 563
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 51/98 (52%), Gaps = 2/98 (2%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARV 172
N G +DGT + + PN Q NR+ + + NVM +C + +F V A WPGSVHD RV
Sbjct: 434 NCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQNVMFVCDFDMRFTFVLAGWPGSVHDMRV 493
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ +F + +P P L DS YP + + P
Sbjct: 494 FNDAQT--RFSAKFPKPPPGKFYLVDSGYPNRLGYLAP 529
>gi|218198426|gb|EEC80853.1| hypothetical protein OsI_23470 [Oryza sativa Indica Group]
Length = 465
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 101/234 (43%), Gaps = 26/234 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG+ + + P + R+G + NV+AIC + +F A+WPG HD+
Sbjct: 236 FKDAIGAIDGSHIPVVVPVDETISHTCRHGYTSQNVLAICVFDMRFTFAVASWPGFAHDS 295
Query: 171 RVMRNSMV-YRKFDEEGWRPFP--DAVILGDSAYPLK-EWLIP--------PRTRNPDAA 218
R++ +++ + F P P L D YP + +L P P R+ +
Sbjct: 296 RILSHALANFPSF------PMPPTGKYYLVDLGYPNRIGYLAPFKGATYHIPEFRHHSGS 349
Query: 219 ADQR---FLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT 274
+ F H R +I+ SFG+LK+K+ L + P +++A LHN
Sbjct: 350 PQGKYEVFNFLHSSLRNVIKRSFGVLKQKWRILKGIPSFPPTTQKHIIMACLALHNFV-- 407
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLL 328
D+ + E +A++D E D + + E+ D N R+A L
Sbjct: 408 --HDSNLRYKEFERCDADEDYLLEDTSDTPDDESEDAKNDDTMNTIRTRIADAL 459
>gi|356577175|ref|XP_003556703.1| PREDICTED: uncharacterized protein LOC100780651 [Glycine max]
Length = 507
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 100/233 (42%), Gaps = 24/233 (10%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 95 ILQEKGQLVKT---KNVPID--EVVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG + + E ++ N+
Sbjct: 150 RAIMKVS-KEYLKFHEYNLEGSVENKWR--WFKNSIGALDGMHIPVTVSAEDRPRYRNKK 206
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----------FDEEGWRP 189
GD + NV+ +CGP+ +F V W G D+RV+R+++ + + G+
Sbjct: 207 GDISTNVLGVCGPDLRFIYVLPGWEGLAGDSRVLRDALRRQNCLHIPNGKYFLVDVGYTN 266
Query: 190 FPDAVI-LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
P + + Y L EW+ RN + F H R +IE SFG++
Sbjct: 267 GPGFLAPYRGTRYHLNEWI-----RNTPQNYKELFNLRHASARNVIERSFGLI 314
>gi|356507017|ref|XP_003522268.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 79/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ S+ +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSEETISRKFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +DGT +L+
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDGTHILVTVSPYER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 RRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|302802405|ref|XP_002982957.1| hypothetical protein SELMODRAFT_34337 [Selaginella moellendorffii]
gi|300149547|gb|EFJ16202.1| hypothetical protein SELMODRAFT_34337 [Selaginella moellendorffii]
Length = 217
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/200 (24%), Positives = 88/200 (44%), Gaps = 26/200 (13%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALT--------------TPTERNVALSAKEKVLSCL 48
P F+E FR+S + F+ +E+AP L P + VA++ K ++ S
Sbjct: 2 PQRFREVFRVSVSIFTFLCDELAPKLVRRPPPGLAEIPRRHLPISKQVAIALK-RLASGD 60
Query: 49 HWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLR 107
W +G + G +++ +CR + ++ M+ WP EE + ++ F R
Sbjct: 61 MWRTIGDAF---GVASCIAQACLCRFRYALL-----EHEGLMIHWPDEEGMKEVITGFQR 112
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHE--DQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
GFPN G +D T + I+ P + D + +++ V A+ F + G
Sbjct: 113 LRGFPNCCGAMDCTHIAIELPGSEDATDWYAGAKKYYSMVVQAVVDSKTSFLDITIGIAG 172
Query: 166 SVHDARVMRNSMVYRKFDEE 185
SV D RV +S + + + E+
Sbjct: 173 SVPDRRVWNSSGLKKAWIEK 192
>gi|391335787|ref|XP_003742270.1| PREDICTED: putative nuclease HARBI1-like [Metaseiulus occidentalis]
Length = 447
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 121/280 (43%), Gaps = 20/280 (7%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F + FR++ ++L+EI+P + E + KV L W+ GS
Sbjct: 119 VFFDLFRMTRETFSYVLDEISPFMMADDE-----EVEVKVAVAL-WILGGSEAVYPPLLK 172
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
P + I + + E + + + + + L+ F K G N AG ++ T +
Sbjct: 173 VPEEQRI-QYLLEFCRQVTKSLMPKEISL-HDRLDSSMWNFQCKFGLRNCAGALNFTHIP 230
Query: 125 IDAPNEHED--QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
I + +D +++N G ++ + A+ +YKF ++ +PG V+ NS V+RK
Sbjct: 231 IYLGDMKDDACEYLNNAGFRSIIMQAVVSSDYKFCDLSIGYPGRTPVDHVLYNSAVWRKA 290
Query: 183 D---------EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
+ E G P ++ + YPL E ++ P + ++RF +
Sbjct: 291 ESGVLFPAEGEPGPNQGPHPYLIAGTCYPLSERIVTPFQAPKFSDMEKRFNNSIYNALSV 350
Query: 234 IENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHNIA 272
+ +F +L+ +FP L++ +++ + VV A LHNI
Sbjct: 351 TQRAFKLLRSRFPFLLDFDKVTVEDTPAVVAACCVLHNIC 390
>gi|357155317|ref|XP_003577080.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 329
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 82/209 (39%), Gaps = 31/209 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G VDGT + + P + ++ + NR + N M C + KF +N W GS DA
Sbjct: 97 FKNCIGAVDGTHIPMKLPLDQQEPYRNRKQTISQNAMVACDFDLKFVHINPGWEGSASDA 156
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------------RTRNPDAA 218
RV+++++ G+ D L D+ Y + P R R
Sbjct: 157 RVLQDAL------NHGFEVPDDKFYLVDAGYANTPQFLAPYRGTRYHLNEQGRARQKPQN 210
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNIALTIDR 277
+ F H + R IE GI K +FP L P+ + V +A LHN
Sbjct: 211 HKELFNLRHAQLRNHIERIIGIWKMRFPILKVASHFPKEKQIDVSVACAVLHNF------ 264
Query: 278 DNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
+ IH+ + + DPD+I
Sbjct: 265 ------IRIHNGDMTWPSDATMDIDPDQI 287
>gi|331221924|ref|XP_003323636.1| hypothetical protein PGTG_05538 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309302626|gb|EFP79217.1| hypothetical protein PGTG_05538 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 367
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 91/207 (43%), Gaps = 9/207 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F E FR+ ++ +E++ L P + LS + +V+ L+ +G + Y +I
Sbjct: 59 FIEYFRMLRADFAWLSDELSKKLQQDPLRQGAPLSVEAQVVVGLYRLGHDATYVMISHVF 118
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND----IPMTFLRKGGFPNVAGCVDG 120
K + ++ V ++ R + +P + +D I +F + G P+ G ++G
Sbjct: 119 NIGKETADKATGRFVQAVLKVLRLRTISFPGLDAHDKWDKIIKSFEWRHGIPDSVGAING 178
Query: 121 TMVLIDAPNEHEDQ-FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T +L P E + ++NR +L + + F V GS+HD RV R S ++
Sbjct: 179 THILFAIPPLDEWKGYINRKNWVSLVFQCVVDGDGNFRDVFGGGAGSIHDTRVFRRSNLW 238
Query: 180 RKFDEEGWRPF---PDAVILGDSAYPL 203
+ R P ++GD+ YPL
Sbjct: 239 ISLNNALGRQMRIPPGTHLIGDAGYPL 265
>gi|359497175|ref|XP_003635445.1| PREDICTED: uncharacterized protein LOC100853349 [Vitis vinifera]
Length = 568
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 92/209 (44%), Gaps = 29/209 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG+ + N+ ++ R NV+A C + KF V W GS D+
Sbjct: 151 FKDCIGAIDGSHFRVKVSNDVVQRYRGRKYYPTQNVLAACSFDLKFTYVLPGWEGSASDS 210
Query: 171 RVMRNSMVYRKFDE----EGWRPFPDAVILGDSAYPLKEWLIPP-----------RTRNP 215
R++ N+++ R FD+ +G L D+ + LK + P P
Sbjct: 211 RILDNALM-RDFDKLIVPQG------KYYLADAGFQLKTGFLTPYRSTRYHLKEYSVHQP 263
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ A + LR H R IE +FG+LK++FP + P + ++LA LHN
Sbjct: 264 ENAREVFNLR-HSSLRNAIERAFGVLKKRFPIIA-SGTEPHYPVDTQSDIILACCILHNY 321
Query: 272 ALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
+ +D NE E+ + ++ E E++
Sbjct: 322 LMGVD-PNERLIAEVDRELFSEEAEFESM 349
>gi|20042909|gb|AAM08737.1|AC025098_4 Putative transposase [Oryza sativa Japonica Group]
gi|31429980|gb|AAP51957.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 1003
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 105/255 (41%), Gaps = 38/255 (14%)
Query: 73 RSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHE 132
+++ +V+P + F ++ + E I L F N G +DGT V I + +
Sbjct: 392 KAVFKVIPSLTYRFIKQTTR--VETHWKISTDQLFFPYFQNCLGAIDGTHVPITISQDLQ 449
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD 192
+ NR G + NVM +C + F + + W GS DARV+R++M+ +G+
Sbjct: 450 APYRNRKGTLSQNVMLVCDFDLNFLFIPSGWEGSATDARVLRSAML------KGFNVPQG 503
Query: 193 AVILGDSAYPLKEWLIPP----------------RTRNPDAAADQRFLRCHKRTRRLIEN 236
L D Y + P R RN + R H R IE
Sbjct: 504 KYYLVDGGYANTPSFLAPYRGVRYHLKEFGRGQQRPRNYKELFNHR----HAILRNHIER 559
Query: 237 SFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
+ G+LK++FP L + R+ Q + +A+ HN+ ++ D N + +
Sbjct: 560 AIGVLKKRFPILKVGTHHRIKNQVK--IPVATVVFHNLIRMLNGDEGWLN-----HQGSN 612
Query: 294 DGEEEAVDDPDEIDE 308
E+ +D P+ DE
Sbjct: 613 ISPEQFIDVPEGDDE 627
>gi|331227083|ref|XP_003326210.1| hypothetical protein PGTG_08040 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305200|gb|EFP81791.1| hypothetical protein PGTG_08040 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 74/169 (43%), Gaps = 9/169 (5%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLID-APNEHEDQFVNRNGDHALNVMA 148
+++WP + GF G +DG++V++ P + + +R G + + +
Sbjct: 190 LLEWPTAAAREETKMHFGDVGFEGCVGLIDGSLVVLSTCPEKDGPDYYSRKGFYCIATLL 249
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWL 207
+C + V WPG HD R+M N + + + R F D +L DSA+
Sbjct: 250 VCDQHKNIIYVFTGWPGCSHDMRLMTNCALSKSPN----RYFSDGEYLLADSAFVPTLTT 305
Query: 208 IPP--RTRNPDAAADQR-FLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
+P R RN +Q F R R +EN G+LK +F L +RL
Sbjct: 306 VPAFKRKRNKPLTDEQTDFNRHLSGVRVAVENCIGLLKNRFQSLRGLRL 354
>gi|222625210|gb|EEE59342.1| hypothetical protein OsJ_11425 [Oryza sativa Japonica Group]
Length = 786
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 81/177 (45%), Gaps = 22/177 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DG + ++P E +++ R+ NVMA+ + +F + PGS+HD
Sbjct: 551 FKKCIGAIDGIHITSNSPREDYVRYIGRSKSPTQNVMAVVDFDMRFTYSSIGQPGSMHDT 610
Query: 171 RVMRNSMVYRKFDEEGW-RPFPDAVILGDSAYPLK----------EWLIPPRTRNPDAAA 219
V+ + + D++ + P L D+ YP + + IP R P +
Sbjct: 611 SVLYHVL---DVDKDIFPHPPQGKYYLVDAGYPNRPGYMAPYKGQRYHIPKWRRGPPPSG 667
Query: 220 DQRFL-RCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA----GMVVLASTTLHNI 271
+Q + +CH R +IE +FG+ K K+ L M P +A M++ A+ LHN
Sbjct: 668 EQEYFNQCHSSARNVIERTFGVWKMKWRILLKM---PTYAMDKQEMIIAATMCLHNF 721
>gi|115468766|ref|NP_001057982.1| Os06g0595700 [Oryza sativa Japonica Group]
gi|113596022|dbj|BAF19896.1| Os06g0595700 [Oryza sativa Japonica Group]
Length = 817
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 112/270 (41%), Gaps = 29/270 (10%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+ +EK+ L+ + + + + G S S R + +V + +R +K P N
Sbjct: 177 IKVEEKLAFFLYMLSHNASFEDLQEKFGHSGDSFHRHVKHFFNSVVPSLSKRFLKPP--N 234
Query: 98 LNDIPMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
N++ + F N G +DGT + I ++ F NR + NVM C +
Sbjct: 235 PNEVHWKIEKDPRFYPYFKNCLGAIDGTHIPISISSDKAAPFRNRKNTLSQNVMIACDFD 294
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPRT 212
K ++ W GS DARV+ +++ +G++ P L D Y +L P R
Sbjct: 295 LKITFMSTGWEGSATDARVLTSAV------NKGFQVPPGKFYLVDGGYANTNSFLAPYRK 348
Query: 213 -----------RNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL---NYMRLSPQFA 258
R + F H R +E + G++K++FP L + +++ Q
Sbjct: 349 VRYHLKEYGVGRRRPQNYKELFNHRHAVLRNHVERTLGVVKKRFPILKVATFHKINNQVK 408
Query: 259 GMVVLASTTLHNIALTIDRDNENQNVEIHD 288
VV A HNI +++ D + N + H+
Sbjct: 409 IPVVAA--VFHNIIRSLNGDEQWLNNQPHN 436
>gi|331226986|ref|XP_003326162.1| hypothetical protein PGTG_07992 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305152|gb|EFP81743.1| hypothetical protein PGTG_07992 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 419
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 76/330 (23%), Positives = 135/330 (40%), Gaps = 37/330 (11%)
Query: 6 FKEAFRLSATQVEFILEEI---------APALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+A R + ++L++I +P P +AL+ L L G G+
Sbjct: 105 FKQAVRTTKDGFLWLLDQIRHHPIFYSNSPRPQLPIPHQLALT-----LERLGSNGNGAS 159
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+ ++ + V+ I + +V WP + + + GF G
Sbjct: 160 VGRFSRNLSVGRGTVIKVSRRVIQAINEVSTNYIV-WPNKERRAEISEVMTEEGFKGCIG 218
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P + + +R +++N IC + WPG+ D+ V +N
Sbjct: 219 FVDGTTIPLHQQPGLDGEVYWDRKKQYSINCQVICDCDRFITLFMTGWPGTCGDSLVFKN 278
Query: 176 SMVY----RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKR 229
V+ R FD P ++ DSAY L IP ++P + ++ F C +
Sbjct: 279 MKVHLEPDRFFD-------PGQYLIADSAYALSMTTIPA-YKSPLSKINRNTEFNYCLAK 330
Query: 230 TRRLIENSFGILKEKFPCLNYMRLS---PQFAGMVVL---ASTTLHNIALTIDRDNENQN 283
+R E++ GILK ++ L +RLS PQ ++ LHN+ + D+ ++
Sbjct: 331 SRVRNEHTIGILKGRWASLQQLRLSLYTPQHMVEIICWINCCVALHNMLAHLG-DSWDEL 389
Query: 284 VEIHDDEAEDDGEEEAVDDPDEIDEEEQPR 313
+D++ D E D+ + EE Q +
Sbjct: 390 ATPNDNDEPGDSEGNHEDNTEGHREEIQEK 419
>gi|356498470|ref|XP_003518075.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 79/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +DGT + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDGTHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|193625027|ref|XP_001951483.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 224
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 78/180 (43%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F FR+S + +L ++ + ++S E++ L ++ G+ + + ++
Sbjct: 12 FFNYFRMSIKSFDELLSKLESGIKVSNSGRPSISPTERLCVTLRYLASGNTFTDLQYSYR 71
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S+I + +V I + + EE +I + F + FPN G +DG + +
Sbjct: 72 MGISTISGIVEDVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFPNCIGALDGKHIRV 131
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
P + F N +++ +MAIC NY F V+ G D+ V +N + K + E
Sbjct: 132 IKPIKSGSLFYNYKHYYSIVLMAICDANYCFTFVDVGAYGKFSDSSVFKNGKFFEKLENE 191
>gi|194696338|gb|ACF82253.1| unknown [Zea mays]
gi|413945624|gb|AFW78273.1| hypothetical protein ZEAMMB73_884484 [Zea mays]
Length = 285
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 74/149 (49%), Gaps = 13/149 (8%)
Query: 91 VKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLI-DAPNEHEDQFVNRNGDHALNVMA 148
+KWP E + I F + G PN G +D T +L+ + + +++ +++ + A
Sbjct: 29 LKWPSPEEMATIKTRFDKVYGLPNCCGAIDTTHILMCSSAQPNSKVWLDNENKNSMVLQA 88
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR-------PFPDAV---ILGD 198
+ + +F V + WPGS+ D ++R S +YR E+G R P AV ++GD
Sbjct: 89 VVDADLRFRDVVSGWPGSMDDTCILRTSGLYR-LCEKGLRLGGQMELPGGSAVREYLVGD 147
Query: 199 SAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
++YPL WL+ P + AA F + H
Sbjct: 148 ASYPLLPWLMTPYREHGLQAAKVEFNKRH 176
>gi|356577624|ref|XP_003556924.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 79/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVTMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +DGT + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDGTHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|331231555|ref|XP_003328441.1| hypothetical protein PGTG_09735 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 435
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 87/194 (44%), Gaps = 16/194 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
+ ++ ++ V+ +V + ++ V WP ++ GF G VDGT + L
Sbjct: 171 GRGTVIKATRRVIEALV-SLGRQYVMWPNSTRRAEISEVMKNEGFVGCVGFVDGTTIPLF 229
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY----RK 181
P + F +R +++N IC + + WPG+ D+ V + ++ +
Sbjct: 230 QRPGYDGEVFYDRKRRYSINAQIICDCDKYITSFITGWPGTCGDSLVYKRMQLHLNPSQY 289
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA--ADQRFLRCHKRTRRLIENSFG 239
FD+ + +L DSAY + + ++P +NP A + F C + R E++ G
Sbjct: 290 FDDGQY-------LLADSAYEVSKTVVPA-YKNPSAKIPINADFNYCLAKARVRNEHTIG 341
Query: 240 ILKEKFPCLNYMRL 253
+LK ++ L MRL
Sbjct: 342 VLKSRWSSLREMRL 355
>gi|331237376|ref|XP_003331345.1| hypothetical protein PGTG_12667 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309310335|gb|EFP86926.1| hypothetical protein PGTG_12667 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 436
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 117/285 (41%), Gaps = 30/285 (10%)
Query: 6 FKEAFRLSATQVEFILEEIA--PALTTPTER-------NVALSAKEKVLSCLHWMGVGSP 56
FK++ R + ++L+ I+ P + + R +AL+ L L G G+
Sbjct: 105 FKQSVRTTKEAFLWLLDRISLHPVFHSQSHRPQLPIPHQLALT-----LERLGSNGNGAL 159
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
I + +I + V+ I ++ + V WP+ +++ GF G
Sbjct: 160 VGIFSRNLTVGRGTIIKVSRRVIEAI-NSLSSKHVVWPDRYRRAEISDVMKEEGFGGCVG 218
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P + + +R +++N +C + + WPGS D+ V RN
Sbjct: 219 FVDGTTIPLHQRPGLEGEAYWDRKKKYSVNCQIVCDCDRYITSFTVGWPGSCGDSWVFRN 278
Query: 176 SMVYRKFDEEGWRPFPDA--VILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRR 232
+ ++ + + + DA ++ DSAY L IP D F C ++R
Sbjct: 279 TTLHMQAGD-----YFDAGQYLIADSAYGLSCTTIPAYKAPASNKRDNTDFNYCLAKSRV 333
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGMV------VLASTTLHNI 271
E++ GILK ++ L +RL + V TLHN+
Sbjct: 334 RNEHTIGILKGRWASLQQLRLHLHKKSHMKEIIRWVSCCITLHNM 378
>gi|402080063|gb|EJT75208.1| hypothetical protein GGTG_05145 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 413
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 37/288 (12%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
AF E FR++ + + + + ++V S+ EKV S L+ G +
Sbjct: 67 AFPELFRVNQSTFRALYDLLITHTQLAGSKHV--SSHEKVASFLYICGQDASLRAAAHLF 124
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRM-------VKWPEENLNDIPMTFLRKGGFPNVAGC 117
G S + R+ HEV+ ++ + V WP+ ++ + F G
Sbjct: 125 GRSPDTTRRNFHEVLRAMMRLHVAFVTLPDVSDVVWPDCRRQEV------QEAFAGCLGA 178
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
VDGT++ E + +R G A NVMA +F +V A G VHD++ R ++
Sbjct: 179 VDGTLINAKVAAEEGVVWRSRKGPTAQNVMAAVDHRGRFVSVTAGIEGLVHDSQAFREAL 238
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLR----------CH 227
R F G +GD+ + L + +I P +Q F R H
Sbjct: 239 A-RGFVVPG-----GYYYVGDAGFGLGQGVITPYGATKYHLQEQGFARPETPEELFNLHH 292
Query: 228 KRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNI 271
+ R +E SFG LK ++ + + Q + +VLA T L N
Sbjct: 293 AKIRVRVERSFGELKNRWKIVRGPGPSYSIRTQIS--IVLAVTALQNF 338
>gi|326431558|gb|EGD77128.1| hypothetical protein PTSG_07463 [Salpingoeca sp. ATCC 50818]
Length = 498
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 118/302 (39%), Gaps = 26/302 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWM-GVGSPYHIIGATH 64
F++ FR++ Q +L+ I+P L +V VL L WM G + +
Sbjct: 52 FRQNFRVTKDQFWELLDIISPHLPKRPGTDVP---DRVVLVGLLWMFAHGCSLQAVQFSI 108
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
G SKS I+D + +P E L + + + DG +
Sbjct: 109 GISKSMCSTHFPAFAKAIIDGLDN--ISFPTGEELKEEQRRWAQDDFLEGCVTAADGVHI 166
Query: 124 -LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
N HE+++ NR G + NV+ + + V GS HDA V+ + + K
Sbjct: 167 RYTPTTNGHEERWRNRKGFKSQNVLVCASFDRQIQHVVVGCEGSCHDASVVATADLTSKL 226
Query: 183 DEEGWRPFPDAVILGDSAYPLKEWLIPPR----------TRNPDAAADQRFLRCHKRTRR 232
E + F A+ L K WL P R +R + ++ F H R R
Sbjct: 227 PEGSFGLFDAAMRLTH-----KRWLTPIRGVTYHLDTFKSRGGPQSDEEIFNYMHARRRS 281
Query: 233 LIENSFGILKEKFPCLNYMRLSP--QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDE 290
+E FGILK +F L Y S A + ++A LHN L D + +++ DE
Sbjct: 282 RVEACFGILKSRFRIL-YTGFSSCEDTARLYIVACCALHNFLLRQKDDGADFVEQLYQDE 340
Query: 291 AE 292
+
Sbjct: 341 TQ 342
>gi|389609757|dbj|BAM18490.1| unknown unsecreted protein [Papilio xuthus]
Length = 259
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 16/190 (8%)
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
E D+ F K P+ G +DG + + P D ++N + ++ ++ + NY
Sbjct: 6 EGWEDVAQNFKLKWNLPHCIGHLDGRHIEMIRPKNGND-YLNYRKNFSIVLLGLVDANYN 64
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDEEGW------------RPFPDAVILGDSAYPL 203
F V+ PG + D+ V ++ + R +P P VI+GD A+ L
Sbjct: 65 FLYVDVGTPGKISDSEVFNSTNLCRSLQNNTLNLPPPKPLPGRTQPSP-YVIVGDDAFAL 123
Query: 204 KEWLIPPRT-RNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMV 261
+ L+ P + + ++ F R R ++EN+FGIL +F L + +SP A +
Sbjct: 124 SKHLLKPYSYYQINGHPEKMFNYRLNRGRVVVENAFGILSARFRVLRQSILVSPPKASTI 183
Query: 262 VLASTTLHNI 271
LA LHN
Sbjct: 184 ALACVYLHNF 193
>gi|397614781|gb|EJK63010.1| hypothetical protein THAOC_16359 [Thalassiosira oceanica]
Length = 492
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 135/345 (39%), Gaps = 31/345 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTER-----NVALSAKEKVLSCLHWMGVGSPYHII 60
F+ +R+S Q + + + I + R + L + ++ L + GS +I
Sbjct: 95 FRRKYRMSKGQFQLLCQRIREKVGDEDFRPGKNQSNCLCGEIRLAIGLRLLAGGSYLDLI 154
Query: 61 GATHG-PSKSSICRSIHEVVPVIVDTF----------FQRMVKWPEENLNDIPMTFLRKG 109
G +G S SI H+V+ I +TF +R + L +I F
Sbjct: 155 GRAYGVESPQSIYNYFHKVIEWINNTFDFPLVGLLQGLRRKDRRAIARLKEISSEFASDS 214
Query: 110 GF--PNVAGCVDGTMVLIDAPNEHE----DQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
F G +DG V I P++ E F R ALNV AIC + ++
Sbjct: 215 DFCFTGCIGAMDGLAVRIGCPSQAEIADPTCFFCRKNFFALNVQAICDRKKRILWISPGH 274
Query: 164 PGSVHDARVMRNSMVYRKFDEEGWRPFPDAVIL-GDSAYPLKEWLIPPRTRNPDAAADQR 222
GS HD+ +++ ++ +E L GDSAYPL +L P A+
Sbjct: 275 AGSTHDSTAWQDTKLFDLLEEMEAELEKAGFFLVGDSAYPLSAYLQVPYPDAKPVTAEDA 334
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIAL-----TID 276
F +R IE +FG + +F +R +V A+ LHN + T D
Sbjct: 335 FNFWLSNSRIHIECAFGEIIMRFGLFWRALRFPLAKCLDIVKAAALLHNFLVDCREDTAD 394
Query: 277 RDNENQNVEIHDDEAED--DGEEEAVDDPDEIDEEEQPRQDRNNP 319
++ +N+ + +A +E +D D+ D+ D N P
Sbjct: 395 DESYFRNMTLSSVQAMPVVQDQEPNSEDSDQNDQAFPLVSDNNEP 439
>gi|357129612|ref|XP_003566455.1| PREDICTED: uncharacterized protein LOC100843647 [Brachypodium
distachyon]
Length = 499
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 91/222 (40%), Gaps = 35/222 (15%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + +RK F N G VDGT + + P + ++ + NR + N M C + KF
Sbjct: 250 PHSIMRKPKFYPFFKNCIGAVDGTHIPMKLPLDQQEPYRNRKQTISQNAMVACDFDLKFV 309
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPRTR--- 213
+N W GS DARV+++++ G+ L D+ Y ++L P R+
Sbjct: 310 HINPGWEGSASDARVLQDAL------NHGFEVLDGKFYLVDAGYANTPQFLAPYRSTRYH 363
Query: 214 -NPDAAADQR-------FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLA 264
N A Q+ F H + R IE GI K +FP L P+ + + +A
Sbjct: 364 LNEQGRARQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFPILKVASHFPKEKQIDISVA 423
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
LHN + IH+ + + DPD+I
Sbjct: 424 CAVLHNF------------IRIHNLDMTWPSDATMDIDPDQI 453
>gi|7523409|emb|CAB86428.1| putative protein [Arabidopsis thaliana]
Length = 463
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/342 (19%), Positives = 136/342 (39%), Gaps = 53/342 (15%)
Query: 4 FAFKEAFRLSATQVEFILEEIAPALTTPTERNVA------LSAKEKVLSCLHWMGVGSPY 57
+AFK FR S T +I + L + + LS +++V L + G
Sbjct: 63 YAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQ 122
Query: 58 HIIGATHGPSKSSICR--------------SIHEVVPVIVDTFFQ--------------R 89
+GA G +S++ + +I+ + P + F Q
Sbjct: 123 VSVGAAFGVGQSTVSQVPINLYSAPYFPDFNIYMLYPDTIHLFPQVTWRFIEALEERAKH 182
Query: 90 MVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVM 147
++WP+ + + +I F G PN G +D T +++ P + D + ++ ++++ +
Sbjct: 183 HLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDWCDQEKNYSMFLQ 242
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYR------------KFDEEGWRPFPDAVI 195
+ +F + WPG + +++++ S ++ K +G + +
Sbjct: 243 GVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQI--REYV 300
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+G +YPL WLI P + + + F H++ R + +F LK + L+ + P
Sbjct: 301 VGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRP 360
Query: 256 QFAGM--VVLASTTLHNIALTI-DRDNENQNVEIHDDEAEDD 294
+ ++L LHNI + D E+ + H D D
Sbjct: 361 DRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYAD 402
>gi|356560995|ref|XP_003548771.1| PREDICTED: uncharacterized protein LOC100781080 [Glycine max]
Length = 454
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 78/163 (47%), Gaps = 18/163 (11%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFL-----RKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
I+ K ++ LN P T + F G +DGT + + + +
Sbjct: 146 RAIM--------KVSKDYLNFQPCTLECAEANKWRWFERCIGALDGTHIPVIVSPDERSR 197
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 198 YRNRKGDVSKNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|356566583|ref|XP_003551510.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 78/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R + V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EVVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNYVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K +E LN P T +G F G +DGT + + +
Sbjct: 146 RAVM--------KVSKEYLNFQPCTL--EGAEANKWRWFERCIGALDGTHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|301113624|ref|XP_002998582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111883|gb|EEY69935.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 335
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 113/250 (45%), Gaps = 41/250 (16%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKV------LSCLHWMGVGSPYHI 59
F+ FR++A ++L + P + +++ S +E + ++ L ++ GS Y
Sbjct: 8 FERTFRMTADSFSYLLGLLTPNIEVNRQQSANSSGEEPISPPLMLMTTLRYLAGGS-YLD 66
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVD 119
I T G S SS R I+E + I+ ++V E+ ++ M + A
Sbjct: 67 IRRTVGISGSSYYRVINETMFAILGLRELKIVFPDSESDREVVMNDFK-------AISSS 119
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHA---LNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
G + DA + ++ + G +A +NV A+C + +F +V+A++PGS +DAR R +
Sbjct: 120 GIISAEDAGDVGTGRYYS--GHYACPGINVQAVCDAHCRFISVDASYPGSTNDARAFRGT 177
Query: 177 MVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
V R + FP + I+GD+AYP + F R R IE
Sbjct: 178 GVARSL-----KSFPTGMYIIGDNAYPYHDSF--------------NFHASQIRIR--IE 216
Query: 236 NSFGILKEKF 245
+FG+L K+
Sbjct: 217 MAFGLLTTKW 226
>gi|322780068|gb|EFZ09807.1| hypothetical protein SINV_11159 [Solenopsis invicta]
Length = 255
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 77/150 (51%), Gaps = 5/150 (3%)
Query: 163 WPGSVHDA-RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDAAAD 220
W S D+ R+ RNS + ++ + FP+ I+GD AYP+ W IPP N +
Sbjct: 91 WLLSTPDSFRIFRNSDICKRINRNPPAFFPNGEYIIGDKAYPVLTWCIPPFRDNGRLTEE 150
Query: 221 Q-RFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIALTIDRD 278
Q RF + + R+ IE +F +LK +F L ++ +S +LA+ LHNI L + D
Sbjct: 151 QKRFNKILSQKRQTIERAFALLKGRFRRLKFLDMSRLDLIPFFILAACVLHNICLQSNDD 210
Query: 279 NENQNVEIHDDEAEDDGEEEAVDDPDEIDE 308
E+ E + E + + + E ++D DE E
Sbjct: 211 VEDFIGEGREPEEQGNHDRE-IEDNDEWQE 239
>gi|357127555|ref|XP_003565445.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 470
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/314 (21%), Positives = 122/314 (38%), Gaps = 40/314 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK F +S ++ +E+ ++ T + A+ ++V +CL + G P +
Sbjct: 132 FKREFHVSRRTFHYLCDELGASVGKEDTWLHAAVPVPKRVAACLWRLAHGHPCLTVAERF 191
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT---FLRKGGFPNVAGCVDGT 121
G S S++ + + + R + WPE N + F G V G V T
Sbjct: 192 GVSVSTVHKLRVDFCAAVNSVLLDRFIAWPETNPQAAAVAADKFEEVAGIHGVTGAVYTT 251
Query: 122 MVLIDAPNEHEDQF-----------VNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
V I AP +H + NR+ + +++ + F V PGS+ D
Sbjct: 252 HVPIVAPRQHVADYHNPRLQASRNDYNRSACYTVSIQGTVDAHGAFINVCVGNPGSLSDE 311
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
V+ + + R P V LG ++YPL ++L+ P + + +A D +
Sbjct: 312 EVL---LKLSDLNNGARRQQPFRV-LGGASYPLTDYLLVPYSSS--SATDNCLTPAQEEM 365
Query: 231 RRLIENSFGILKEKFPCLNYMRLSPQFAGM-------------VVLASTTLHNIALTI-D 276
+ +E + K+ F MRL ++ + V+ A LHNI + D
Sbjct: 366 NKAVEAGTAVAKDAF-----MRLKARWGFLRKRSEVKIPELSNVIQACCMLHNICVRFGD 420
Query: 277 RDNENQNVEIHDDE 290
+ E+ DD+
Sbjct: 421 ELDPELAFELEDDD 434
>gi|356566108|ref|XP_003551277.1| PREDICTED: uncharacterized protein LOC100797937 [Glycine max]
Length = 604
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 65/136 (47%), Gaps = 3/136 (2%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDI 101
E V LH + Y ++ ++ S +I R V+ I+ + +K+ E N+
Sbjct: 112 EVVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVLRAIMKVS-KEYLKFHEYNIEGS 170
Query: 102 PMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNA 161
R F N G +DG + + E ++ NR GD + NV+ +CGP+ +F V
Sbjct: 171 VENKWR--WFKNSIGALDGIHIPVTVSAEDRPRYRNRKGDISTNVLGVCGPDLRFIYVLP 228
Query: 162 NWPGSVHDARVMRNSM 177
W GS D+RV+R+++
Sbjct: 229 GWEGSAGDSRVLRDAL 244
>gi|222635238|gb|EEE65370.1| hypothetical protein OsJ_20664 [Oryza sativa Japonica Group]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 90/214 (42%), Gaps = 27/214 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + P +D+F R + NV+A + +F V A W GS HD+
Sbjct: 146 FQGCIGALDGTHIPACVPMHMQDRFRGRKKFQSQNVLAAVDFDLRFLYVLAGWEGSAHDS 205
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
V+++++ G + L D+ Y + ++PP R P+
Sbjct: 206 YVLQDALS----RTNGLKIPEGKYFLADAGYAARPGVLPPYRGTRYHLKEYKGAREPENP 261
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEK---FPCLNYMRLSPQFAGMVVLASTTLHNIALTI 275
+ LR H R IE +FG LK + F C + L Q +V+A LHN +
Sbjct: 262 KELFNLR-HSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQVK--IVMACCALHNWII-- 316
Query: 276 DRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEE 309
+ +V ++DD A +V + ++ +E
Sbjct: 317 ---EDGPDVYVYDDAAWYAALPRSVRNRTDMHQE 347
>gi|356520675|ref|XP_003528986.1| PREDICTED: uncharacterized protein LOC100778909 [Glycine max]
Length = 387
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 77/173 (44%), Gaps = 18/173 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG V + + + F N+NG + N++A C + KF V A W GS D
Sbjct: 145 FKDCVGVIDGIHVPVTVGVDEQGPFRNKNGLLSQNILAACSFDLKFQYVLAGWEGSATDL 204
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDAA----------- 218
+V NS + R + P+ + DS Y + P + P +
Sbjct: 205 QVF-NSAITR----QNKLQVPEGKYYIVDSKYQNVPGFVAPYSSTPYYSKEFLSDYHPQD 259
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
A + F + H R +I+ +FGILK +FP L P Q +V+A+ LHN
Sbjct: 260 ASELFNQRHSLLRHVIDRTFGILKARFPILMSAPSYPLQTQVKLVVAACALHN 312
>gi|356510804|ref|XP_003524124.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 77/163 (47%), Gaps = 18/163 (11%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYMVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-----FPNVAGCVDGTMVLIDAPNEHEDQ 134
++ K ++ LN P T F G +DGT + + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTLEGAKANKWRWFERCIGALDGTHIPVTVSPDERPR 197
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 198 YRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|222616945|gb|EEE53077.1| hypothetical protein OsJ_35831 [Oryza sativa Japonica Group]
Length = 339
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 103/263 (39%), Gaps = 40/263 (15%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-----LNDIPMTFLRKGGFPNVAGCVD 119
G + +SI +++P + R +K P N + P F F N G +D
Sbjct: 63 GSTLHRHIKSIFKIIPALT----YRFLKLPHANQTHWKIRTNPRFFPY---FKNCIGAID 115
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
GT + I E + NR G + NVM C + F ++ W GS DARV+R++M
Sbjct: 116 GTHIPITIDGEKAAPYRNRKGTLSQNVMVACDFDLNFTFISCGWEGSATDARVLRSAM-- 173
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------------RTRNPDAAADQRFLRCH 227
G++ L D Y I P R + F H
Sbjct: 174 ----NSGFQVPNGKFFLVDGGYANTPQFIAPYRGVRYHLKEFGRGHRRPRDYKELFNHRH 229
Query: 228 KRTRRLIENSFGILKEKFPCL---NYMRLSPQFAGMVVLASTTLHNIALTID-----RDN 279
R +E + G+LK++FP L + R+ Q + A+ HN+ ++ DN
Sbjct: 230 AILRNHVERALGVLKKRFPILKVGTFHRIKNQVR--IPAAAAVFHNMIRLLNGDEGWLDN 287
Query: 280 ENQNVEIHDDEAEDDGEEEAVDD 302
+ N+E + +G+ E +D
Sbjct: 288 QPDNIEPTNFVDLPEGDSEYQND 310
>gi|53370693|gb|AAU89188.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|108710091|gb|ABF97886.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 641
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 6/105 (5%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V + P ++NR+ + + NV+A+C + +F V + WPGS HD
Sbjct: 475 FNDCIGAIDGTHVNVVVPKSKRVPYLNRHNETSQNVLAVCDFDMRFTFVLSGWPGSAHDM 534
Query: 171 RVMRN--SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTR 213
RV +N S + KF P P + D+ YP + + P TR
Sbjct: 535 RVFKNAVSTYHHKFPH----PPPGKYYVVDAGYPNRPGYLSPYTR 575
>gi|22165102|gb|AAM93719.1| hypothetical protein [Oryza sativa Japonica Group]
gi|31432511|gb|AAP54133.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
Length = 481
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 78/185 (42%), Gaps = 27/185 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT V I + + + NR G + NVM +C + F +++ W GS DA
Sbjct: 37 FQNCLGAIDGTHVPITISQDLQAPYRNRKGTLSQNVMLVCDFDLNFSFISSGWEGSATDA 96
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----------------RTRN 214
RV+R++M+ +G+ L D Y + P R RN
Sbjct: 97 RVLRSAML------KGFNVPQGKYYLVDGGYANTPSFLAPYRGVRYHLKEFGCGQQRPRN 150
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
+ R H R IE + G+LK++FP L S + + +A+ HN+
Sbjct: 151 YKELFNHR----HAILRNHIERTIGVLKKRFPILKVGTHHSIENQVKIPVATVVFHNLIR 206
Query: 274 TIDRD 278
++ D
Sbjct: 207 MLNGD 211
>gi|326489332|dbj|BAK01649.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 312
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 118/289 (40%), Gaps = 47/289 (16%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
+V LSA +SC +G+ + + I+ +V I D + ++WP
Sbjct: 39 SVLLSASGASISCRFGLGISTCHSIV---------------LQVCAAITDILMPKAIRWP 83
Query: 95 EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVN-----RN--GDHALNVM 147
++L ++ F G P V G V + I P E+ D++ N RN +++ +
Sbjct: 84 LDSLPEVAAGFQAMSGIPGVVGTVCTDHIPIGLPKENVDEYYNHRLSVRNNKASYSVALQ 143
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK-----FDEEGWRPFPDAVILGDSAYP 202
+ F V P ++ +A V++ S +Y + ++ +R +LG +YP
Sbjct: 144 VVVDAGGAFTDVCIGIPSALSNAAVLKRSALYIRCVTGLLGDDQFR------LLGGVSYP 197
Query: 203 LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGI---LKEKFPCLNY---MRLSPQ 256
L +W+I P A F ++R S G LK ++ CL ++L P
Sbjct: 198 LTDWMIVPYKHLNLTWAQHVF---NERVAAANAASHGALHRLKARWRCLQRRTELKL-PD 253
Query: 257 FAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE 305
M+ A L N +R E ++++ D+ + AV D ++
Sbjct: 254 LHNMIG-ACCVLRNFC---ERSGEELDIDLESQLCCDEDDVVAVADAEK 298
>gi|356524593|ref|XP_003530913.1| PREDICTED: uncharacterized protein LOC100806372 [Glycine max]
Length = 343
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 74/158 (46%), Gaps = 8/158 (5%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 58 ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 112
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG + + E ++ NR
Sbjct: 113 RAIMKVS-KEYLKFYEYNLEGSVENKWR--WFKNSIGALDGIHIPVTVSAEDRPRYRNRK 169
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
GD + NV+ +CG + +F V W GS D+RV+R+++
Sbjct: 170 GDISTNVLGVCGSDLRFIYVLPGWEGSAGDSRVLRDAL 207
>gi|356565147|ref|XP_003550806.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 406
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 104/274 (37%), Gaps = 34/274 (12%)
Query: 46 SCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTF 105
+ L + +PY + S + CR+ V + D + + F
Sbjct: 125 AALFRLAHAAPYPAVSRRFAISPADACRAFFAVCKALADNLGHLFELRTDSQRVVVGFGF 184
Query: 106 LRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPG 165
PN G + ID D + NG AL V A+ +F V+A WP
Sbjct: 185 ---SSLPNCFGVLGFARFKID------DSLLGENG--ALMVQALVDSEGRFLDVSAGWPS 233
Query: 166 SVHDARVMRNSMVYRKFDE-------------EGWRPFPDAVILGDSAYPLKEWLIPPRT 212
++ ++R + +YR+ +E EG ILGDS +PL WL+ P
Sbjct: 234 TMKPETILRETKLYREVEESKELLNGPSYNLSEG--CLIPQYILGDSCFPLLPWLLTPYN 291
Query: 213 R----NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL----NYMRLSPQFAGMVVLA 264
R + +A++ F H L+ ++F L+ + L + R + V++A
Sbjct: 292 RVNEEDSFGSAERAFNTAHDNAMGLLGDAFARLRASWRLLAASRKWKRDCVEHLPFVLVA 351
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
LHN L + ++ VE + DG E+
Sbjct: 352 GCLLHNFVLKFNEAMPDEGVEKEEGLPSFDGVED 385
>gi|359488044|ref|XP_003633693.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 271
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 104/232 (44%), Gaps = 41/232 (17%)
Query: 99 NDIPMTFLRKGGFPNVAG----------CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
N +PM + NV G ++G+ + N+ ++ R NV+A
Sbjct: 41 NQVPMDEMDSSNLVNVFGSMSSDQDCIRVINGSHFSVKVSNDVVQRYRGRKYYPTQNVLA 100
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE----EGWRPFPDAVILGDSAYPLK 204
C + KF V W GSV ++R++ N+++ R FD+ +G L ++ + LK
Sbjct: 101 SCSFDLKFTYVLLGWEGSVSNSRILDNALM-RDFDKLIVPQG------KYYLANAGFQLK 153
Query: 205 -EWLIPPRT----------RNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN---- 249
E+L P R+ P+ A + LR H R IE +FG+LK++FP +
Sbjct: 154 TEFLTPYRSTRYHLKEYSVHQPENAREVFNLR-HSSLRNAIERAFGVLKKRFPIIARGTE 212
Query: 250 -YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
+ + Q+ ++LA LHN + +D NE VE+ + + E E++
Sbjct: 213 PHYPVDTQYD--IILACCILHNYLMGVD-PNERLIVEVDRELFSKEAEFESM 261
>gi|241566933|ref|XP_002402225.1| transposase, putative [Ixodes scapularis]
gi|215500005|gb|EEC09499.1| transposase, putative [Ixodes scapularis]
Length = 283
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/277 (22%), Positives = 113/277 (40%), Gaps = 13/277 (4%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ FRLS+T ++++ ++ T ++ +++V L+ + + IG
Sbjct: 8 FRACFRLSSTTFRYLVDVCRSSMQRQDTAIKTTITVEKRVAISLYRLCSTAEERTIGHLF 67
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+S + S E V+++ V E+L F GFPN G DG +
Sbjct: 68 AVGQSVVNESYREFCDVVIEELEATTVSMIRNEDLEHHMREFQAVLGFPNDIGAPDGCHL 127
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ + + N G +++ ++A+ F V+ PG D+ V + R +
Sbjct: 128 PVSPLKDSAVDYRNYKGCYSVILLALVDRRCLFRYVSVGSPGKCQDSNVYGRCPLGRLLE 187
Query: 184 EE--------GWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKRTRRLI 234
+ G P ++L D A+PL LI P + + D+ + + RR++
Sbjct: 188 DYQVAVPGSIGGTKIP-PIVLCDQAFPLTRNLIKPLPHSLNHPQDEGDYSYALSKARRVV 246
Query: 235 ENSFGILKEKFP-CLNYMRLSPQFAGMVVLASTTLHN 270
EN+FG LK +F L M + V A LHN
Sbjct: 247 ENAFGRLKARFRIVLKRMEVRIDNVYTAVRACCILHN 283
>gi|58262144|ref|XP_568482.1| hypothetical protein CNM02140 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230655|gb|AAW46965.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 445
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 110/271 (40%), Gaps = 32/271 (11%)
Query: 52 GVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGG 110
G + IG G S+ + V+ I+ + + V WP+EN I F +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAIL-SLERNNVYWPDENERKAIDRHFEEEED 183
Query: 111 FPN-VAGCVDGTMV-LIDAPNEHED-QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
P+ G +DG V P H+ F + G + N++ IC + +P S
Sbjct: 184 IPDGCVGIIDGFHVPFAYKPARHDAVNFFSHKGRYGFNILGICDHLKRIRYFQYGYPASA 243
Query: 168 HDARVMRNSMVYRKFDEEGW---------RPFPDAVI------LGDSAYPLKEWLIP--- 209
HDAR+ +N ++ + + + R +I L DSA+P +W +P
Sbjct: 244 HDARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFK 303
Query: 210 -PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLS------PQFAGMVV 262
R +N + +F + R IE+++GILK ++ L +R+ A +
Sbjct: 304 RRRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDESVATCWI 363
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEAED 293
A LHN L ID + ++ DE +D
Sbjct: 364 RACVVLHN--LLIDTGDWYNQLDGDADEPDD 392
>gi|77551917|gb|ABA94714.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 645
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 101/235 (42%), Gaps = 36/235 (15%)
Query: 111 FPNVAGCVDGT----MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
F + G +DG+ +VL+D H +G + NV+AI + +F A WPGS
Sbjct: 424 FKDAIGAIDGSHISVVVLLDETISH----TCHHGYTSQNVLAIYNFDMRFIFAVAGWPGS 479
Query: 167 VHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-----------RTRN- 214
HD+R++ +++ + FP + + YP + + P R R+
Sbjct: 480 AHDSRILSHALA-------NFPSFP----MPPTGYPNRIGYLAPFKGTTYHIPEFRHRSG 528
Query: 215 PDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNIAL 273
P + F H R +IE SFG+LK+K+ L + S +++A LHN
Sbjct: 529 PPQGKYEVFNFLHSSLRNVIERSFGVLKQKWRILKGIPSFSHTTQKHIIMACLALHNFV- 587
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLL 328
RD+ ++ E +A++D E D + + E+ D N R+A L
Sbjct: 588 ---RDSNLRDKEFERCDADEDYLLEDTSDTSDDESEDAENDDTMNTIRTRIADAL 639
>gi|147799971|emb|CAN65950.1| hypothetical protein VITISV_029437 [Vitis vinifera]
Length = 235
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 20/197 (10%)
Query: 113 NVAGCVDGTMVLIDAPNEHEDQFVNRN-GDHALNVMAICGPNYKFYAVNANWPGSVHDAR 171
+ G +DG ++ P + + R G+ NVMA+C + F V W G+ HD+R
Sbjct: 11 DAVGAIDGNLIHACIPTNQQVPYRGRGRGECFQNVMAVCDFDMIFKFVVVGWEGTTHDSR 70
Query: 172 VMRNSMVYRKFDEEGWRPFPDAVILGDSAY-----PLKEWLIPPRTRNPDAAADQRFLRC 226
V+ ++ R P DA + ++ WL R+ ++ F +C
Sbjct: 71 VLTETI--RNPQHNFPMPPSDAAYTHTRGFMAPYRNVRYWLSDFRSGGKVVGKEEIFXQC 128
Query: 227 HKRTRRLIENSFGILKEKFPCLNYMR-LSPQFAGMVVLASTTLHNIALTI---DR----- 277
H R R +IE +FG++K P L M S +V+ ++HN I DR
Sbjct: 129 HARLRNVIERAFGVVKVCXPILKRMTPYSFTTQTKIVMTCFSIHNFLRQISVADRLFSEY 188
Query: 278 DNENQNVEIHDDEAEDD 294
DNE VE+ D A +
Sbjct: 189 DNE---VELESDNANQN 202
>gi|134118537|ref|XP_772042.1| hypothetical protein CNBM2000 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254648|gb|EAL17395.1| hypothetical protein CNBM2000 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 445
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 128/321 (39%), Gaps = 36/321 (11%)
Query: 6 FKEAFRLSATQVEFILEEIA--PALTTPTERNVA--LSAKEKVLSCLHWMGVGSPYHIIG 61
F+ R++ T+ +L I P + R A L L L G + IG
Sbjct: 75 FRRKLRVNHTEFRKLLCLIKDHPVFVSHGPRKQANPLLQLTVALYRLGHCGCAASTFEIG 134
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPN-VAGCVD 119
G S+ + V+ I+ + + V WP+EN I F + P+ G +D
Sbjct: 135 EQFGVSEGTSAIWTTRVIKAIL-SLERNNVYWPDENERKAIDRHFEEEEDIPDGCVGIID 193
Query: 120 GTMV-LIDAPNEHED-QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
G V P H+ F + G + N++ IC + +P S HDAR+ +N
Sbjct: 194 GFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASAHDARIFKNCS 253
Query: 178 VYRKFDEEGW---------RPFPDAVI------LGDSAYPLKEWLIP----PRTRNPDAA 218
++ + + + R +I L DSA+P +W +P R +N
Sbjct: 254 LFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKRRRGQNDLDR 313
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ------FAGMVVLASTTLHNIA 272
+ +F + R IE+++GILK ++ L +R+ + A + A LHN
Sbjct: 314 PEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDEGVATCWIRACVVLHN-- 371
Query: 273 LTIDRDNENQNVEIHDDEAED 293
L ID + ++ DE +D
Sbjct: 372 LLIDTGDWYNQLDGDADEPDD 392
>gi|449688941|ref|XP_004211892.1| PREDICTED: uncharacterized protein LOC101241270 [Hydra
magnipapillata]
Length = 186
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 75/168 (44%), Gaps = 4/168 (2%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S T E +L +AP + T +S E++ L ++ G + I A++ S S+
Sbjct: 1 MSPTTYEELLSFVAPIIVKQRTTMRDPVSPSERLTVTLRFLVTGDAHCNIAASYKISTST 60
Query: 71 ICRSIHEVVPVIVDTFFQR---MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
I R I E I +F +R V ++ I F FP+ G +DG +++ A
Sbjct: 61 ISRIISETCAAIWTSFKERNFLHVPSEKQKWKTIAKEFENMWNFPHAIGAIDGKHIVMQA 120
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
P+ ++ N H++ ++A+C Y+F V+ G D V N
Sbjct: 121 PHNGGSEYFNYKKTHSIVLLAVCNAKYEFIMVDIGDSGRQSDGSVFNN 168
>gi|356533420|ref|XP_003535262.1| PREDICTED: uncharacterized protein LOC100791415 [Glycine max]
Length = 488
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 116/289 (40%), Gaps = 27/289 (9%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
L +G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 192 LEIVGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRF 251
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
F + G +DGT + + +R+G+ + NV+A C + +F V + W GS
Sbjct: 252 YPYFKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGST 311
Query: 168 HDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPDAAA---- 219
HD++V+ +++ + G + L D +P + + P R D A
Sbjct: 312 HDSKVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGND 367
Query: 220 ----DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ F H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 368 PENEKELFNLRHASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 424
Query: 272 ALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D I +EQ R+D N
Sbjct: 425 LRKECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIIQTQEQEREDAN 472
>gi|223997854|ref|XP_002288600.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975708|gb|EED94036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 499
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/336 (24%), Positives = 136/336 (40%), Gaps = 37/336 (11%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEI----------APALTTPTERNVALSAKEKVLSCLHW 50
+T FK+ +R+S + I E I P+ PT + + L L L
Sbjct: 110 LTADEFKQKYRMSREALLVIAERIKDDDVFKNKRGPSQMNPTHQLMVL------LDYLGT 163
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKG 109
G G+ A S+ S ++ + + WP+EN +I + +
Sbjct: 164 AGSGANNPKQRAYFHVGNGSVNNSRKRARDAVIHSLGKDFYHWPDENERKNISNCYKMEF 223
Query: 110 GFPNVAGCVDGTMVLIDAPNEHED--QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
PN G +DGT+ + E ED + R +L + + + +PGS
Sbjct: 224 NLPNCVGVMDGTLFPLAFQPETEDAADYHGRKFQWSLTCLVVSDQKRRIRWYITGYPGSA 283
Query: 168 HDARVMRNS-MVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA----DQR 222
HD R++R S + RK EE + + I+GD+A+ E ++P NP+ A D+R
Sbjct: 284 HDNRMLRRSPLKVRK--EEYFTVY--QYIIGDTAFDPSENVVPAYKANPNKAEPDDPDER 339
Query: 223 FL-RCHKRTRRLIENSFGILKEKFPCL----NYMRLSPQFAGMV--VLASTTLHNIALTI 275
L + + R E+ G+ K +FP L N +R ++ + + LHN +
Sbjct: 340 LLNKVISKPRVSSEHVNGMWKGRFPWLRLIPNRVRDKKSLTEVMKYIHCTVILHNFLIEF 399
Query: 276 DRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQ 311
D ++ E +D D E A D D++ EE
Sbjct: 400 G-DQHIKSWESEEDRLSDIA-EPANDPTDDMPGEEH 433
>gi|58258447|ref|XP_566636.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|58261124|ref|XP_567972.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|58269606|ref|XP_571959.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134111525|ref|XP_775298.1| hypothetical protein CNBE0170 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|134116925|ref|XP_772689.1| hypothetical protein CNBK0630 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255307|gb|EAL18042.1| hypothetical protein CNBK0630 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257957|gb|EAL20651.1| hypothetical protein CNBE0170 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57222773|gb|AAW40817.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|57228195|gb|AAW44652.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230054|gb|AAW46455.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 445
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 128/321 (39%), Gaps = 36/321 (11%)
Query: 6 FKEAFRLSATQVEFILEEIA--PALTTPTERNVA--LSAKEKVLSCLHWMGVGSPYHIIG 61
F+ R++ T+ +L I P + R A L L L G + IG
Sbjct: 75 FRRKLRVNHTEFRKLLCLIKDHPVFVSHGPRKQANPLLQLTVALYRLGHCGCAASTFEIG 134
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPN-VAGCVD 119
G S+ + V+ I+ + + V WP+EN I F + P+ G +D
Sbjct: 135 EQFGVSEGTSAIWTTRVIKAIL-SLERNNVYWPDENERKAIDRHFEEEEDIPDGCVGIID 193
Query: 120 GTMV-LIDAPNEHED-QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
G V P H+ F + G + N++ IC + +P S HDAR+ +N
Sbjct: 194 GFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASAHDARIFKNCS 253
Query: 178 VYRKFDEEGW---------RPFPDAVI------LGDSAYPLKEWLIP----PRTRNPDAA 218
++ + + + R +I L DSA+P +W +P R +N
Sbjct: 254 LFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKRRRGQNDLDR 313
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ------FAGMVVLASTTLHNIA 272
+ +F + R IE+++GILK ++ L +R+ + A + A LHN
Sbjct: 314 PEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDEGVATCWIRACVVLHN-- 371
Query: 273 LTIDRDNENQNVEIHDDEAED 293
L ID + ++ DE +D
Sbjct: 372 LLIDTGDWYNRLDGDADEPDD 392
>gi|356561027|ref|XP_003548787.1| PREDICTED: uncharacterized protein LOC100797534 [Glycine max]
Length = 309
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 95 ILQEKGQLVKT---KNVPID--EVVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ E NL R F N G +DG +L+ E ++ NR
Sbjct: 150 RAIMKVS-KEYLKFHEYNLEGSVENKWR--WFKNSIGALDGIHILVTVSAEDRPRYRNRK 206
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
D + NV+ +CGP+ +F V W GS D++V+R+++
Sbjct: 207 CDISTNVLGVCGPDLRFIYVLPGWEGSAGDSQVLRDAL 244
>gi|253761486|ref|XP_002489122.1| hypothetical protein SORBIDRAFT_0020s002020 [Sorghum bicolor]
gi|241947263|gb|EES20408.1| hypothetical protein SORBIDRAFT_0020s002020 [Sorghum bicolor]
Length = 351
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 71/177 (40%), Gaps = 20/177 (11%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DG VL P + F+ R D NVM + KF V A W GS HDA ++ +
Sbjct: 36 GAIDGNHVLARVPRHMQQAFMGRKKDPTQNVMVAVDFDLKFTYVLAGWEGSAHDAHILSD 95
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR-----------TRNPDAAADQRFL 224
++ E+G+ L D+ Y + +PP N A + F
Sbjct: 96 AIER----EDGFTVPQGKFYLVDAGYACRNGFLPPYRGVRYHLTKFGGTNRSTNARELFN 151
Query: 225 RCHKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIALTIDRD 278
H R +E + G LK +F L + R Q +VLA LHN ++ D
Sbjct: 152 LRHSSLRVTVERAIGALKNRFRILYNKPFHRYKTQV--RLVLACAILHNWIISFGTD 206
>gi|331252652|ref|XP_003338861.1| hypothetical protein PGTG_20398 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 237
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 57/114 (50%), Gaps = 18/114 (15%)
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + F+R+ GFP G VDGT + H Q ++G H ++ IC N KF +
Sbjct: 113 QIAVAFMREEGFPGCVGFVDGTTI-------HLSQKPPKDGQHYFDLTVICDINKKFISY 165
Query: 160 NANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIP 209
A +PGS HDA V + V ++ FD++ + +L DSAY +++P
Sbjct: 166 LAGFPGSSHDAYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTNNCYVVP 212
>gi|21397269|gb|AAM51833.1|AC105730_7 Unknown protein [Oryza sativa Japonica Group]
Length = 1202
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 82/204 (40%), Gaps = 28/204 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + P + D+F R NV+A + +F + A W GS HD+
Sbjct: 696 FKDCIGALDGTHIRASVPAKKVDRFRGRKPYPTQNVLAAVDFDLRFIYILAGWEGSAHDS 755
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---------RTRNPDAAADQ 221
V+++++ G + L D+ Y + ++P R P
Sbjct: 756 LVLQDALS----RPNGLKILEGKFFLADAGYAARPGILPSYRRVRYHLKEFRGPQGPHGP 811
Query: 222 RFLRC--------HKRTRRLIENSFGILKEKFPCLNYMRLSPQFA-GMVVLASTTLHNIA 272
+ +C H R IE +FG LK +F L P A MVV+A LHN
Sbjct: 812 QGPKCPKELFNYRHSSLRTTIERAFGALKNRFKILMNKPFIPLKAQSMVVIACCALHNWI 871
Query: 273 LTIDRDNENQNVEIHDDEAEDDGE 296
L EN + E +DE G+
Sbjct: 872 L------ENGSDEFVNDEKTCMGK 889
>gi|357141107|ref|XP_003572087.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 377
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 82/209 (39%), Gaps = 31/209 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G VDGT + + P + ++ + NR + N M C + KF +N +W GS DA
Sbjct: 160 FKNCIGAVDGTDIPMKLPLDQQESYRNRKQTISQNAMVACDFDLKFVHINPDWEGSASDA 219
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------------RTRNPDAA 218
RV+++++ G+ L D+ Y + P R R
Sbjct: 220 RVLQDAL------NHGFEVPDGKFYLVDAGYANTPQFLAPYRGTRYHLNEQGRARQKPQN 273
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNIALTIDR 277
+ F H + R IE GI K +FP L P+ + + +A LHN
Sbjct: 274 HKELFNLRHAQLRNHIERIIGIWKMRFPILKVASHFPKEKQIDISVACAVLHNF------ 327
Query: 278 DNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
+ IH+ + + DPD+I
Sbjct: 328 ------IRIHNGDMTWPSDATMDIDPDQI 350
>gi|297729097|ref|NP_001176912.1| Os12g0299600 [Oryza sativa Japonica Group]
gi|77554714|gb|ABA97510.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|255670242|dbj|BAH95640.1| Os12g0299600 [Oryza sativa Japonica Group]
Length = 754
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 101/262 (38%), Gaps = 32/262 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
L +EK+ + + + + + S S++ R I + +I + R +K P
Sbjct: 88 GLRIEEKLGIFMFMLAHNASFQDLQYEFKHSGSTLHRHIKSIFKIIPALTY-RFLKLPHA 146
Query: 97 N-----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICG 151
N + P F F N G +DGT + I E + NR G + NVM C
Sbjct: 147 NQTHWKIRTNPRFF---PYFKNCIGAIDGTHIPITIDGEKAAPYRNRKGTLSQNVMVACD 203
Query: 152 PNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP- 210
+ F ++ W GS DARV+R++M G++ L D Y I P
Sbjct: 204 FDLNFTFISCGWEGSATDARVLRSAM------NSGFQVPNGKFFLVDGGYANTPQFIAPY 257
Query: 211 -----------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN---YMRLSPQ 256
R + F H R +E + G+LK++FP L + R+ Q
Sbjct: 258 RGVRYHLKEFGRGHRRPRDYKELFNHRHAILRNHVERALGVLKKRFPILKVGTFHRIKNQ 317
Query: 257 FAGMVVLASTTLHNIALTIDRD 278
+ A+ HN+ ++ D
Sbjct: 318 VR--IPAAAAVFHNMIRLLNGD 337
>gi|449671111|ref|XP_004207430.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 192
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 77/196 (39%), Gaps = 27/196 (13%)
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
T G + + I EV IV R V P+ + + + F K GCVDG
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFRMIQAFGCVDG 64
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV-- 178
T + I P H + ++L V A+C F V WPGSVHDA+V NS +
Sbjct: 65 THIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSINT 124
Query: 179 ----------YRKFDEEGWRPFPDAVILGDSAYPL-----KEWLIPPRTRNPDAAADQRF 223
++ + + P ++GD AYPL KE+ + D
Sbjct: 125 NLRSSRLPGTFQTITKNKIK-IP-CYLIGDPAYPLLPHCMKEY---STCKKKDEVIFNSM 179
Query: 224 LRCHKRTRRLIENSFG 239
LR R IE +FG
Sbjct: 180 LR---TARNPIECAFG 192
>gi|115467204|ref|NP_001057201.1| Os06g0226000 [Oryza sativa Japonica Group]
gi|113595241|dbj|BAF19115.1| Os06g0226000, partial [Oryza sativa Japonica Group]
Length = 225
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 82/191 (42%), Gaps = 27/191 (14%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT + P +D+F R + NV+A + +F V A W GS HD+ V+++
Sbjct: 4 GALDGTHIPACVPMHMQDRFRGRKKFQSQNVLAAVDFDLRFLYVLAGWEGSAHDSYVLQD 63
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAAADQRF 223
++ G + L D+ Y + ++PP R P+ +
Sbjct: 64 ALS----RTNGLKIPEGKYFLADAGYAARPGVLPPYRGTRYHLKEYKGAREPENPKELFN 119
Query: 224 LRCHKRTRRLIENSFGILKEK---FPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNE 280
LR H R IE +FG LK + F C + L Q +V+A LHN + +
Sbjct: 120 LR-HSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQVK--IVMACCALHNWII-----ED 171
Query: 281 NQNVEIHDDEA 291
+V ++DD A
Sbjct: 172 GPDVYVYDDAA 182
>gi|389751484|gb|EIM92557.1| hypothetical protein STEHIDRAFT_107036 [Stereum hirsutum FP-91666
SS1]
Length = 362
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 65/145 (44%), Gaps = 12/145 (8%)
Query: 110 GFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
GFPN G VDGT++ L AP F R A+ V A+ +F WPGSV
Sbjct: 188 GFPNCIGVVDGTLIRLTGAPQNTGGIFYCRKKFPAITVQAVVDEFCRFLNFELGWPGSVP 247
Query: 169 DARVMRNSMVY-RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
D+ + + S V+ + D G D ++GD P E I + D A +R +
Sbjct: 248 DSSMWKQSWVWLNRVDLFG----EDQYVMGDKVRPFNENEI---SHAADGAERKRMRSFN 300
Query: 228 KR---TRRLIENSFGILKEKFPCLN 249
K R +E +FG LK +FP L
Sbjct: 301 KELSGVRAWVEQAFGRLKARFPSLK 325
>gi|356507176|ref|XP_003522346.1| PREDICTED: uncharacterized protein LOC100809302 [Glycine max]
Length = 531
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 117/287 (40%), Gaps = 29/287 (10%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 238 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPY 297
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 298 FKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 357
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P + +P+
Sbjct: 358 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHSNDPEN 413
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIAL 273
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 414 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNFLR 469
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D + +EQ R+D N
Sbjct: 470 KECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 515
>gi|403159653|ref|XP_003890645.1| hypothetical protein PGTG_20678 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168189|gb|EHS63584.1| hypothetical protein PGTG_20678 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 310
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 63/289 (21%), Positives = 115/289 (39%), Gaps = 26/289 (8%)
Query: 31 PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM 90
P R LS + +V L+ +G GS Y IG K + ++ V ++ F +
Sbjct: 21 PLRRGDPLSVEAQVAVGLYRLGHGSSYVTIGHVFSIGKETADKATGRFVNAVLARFRRVA 80
Query: 91 VKWP----EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP-NEHEDQFVNRNGDHALN 145
+ +P + + I +F K G PN+ G +DGT + + P ++ ++NR ++
Sbjct: 81 ICYPPLARGDQWDKISASFEAKHGIPNIVGAIDGTHIPLATPADDRWKGYINRKSWASIV 140
Query: 146 VMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKE 205
+ + W + N G ++GD+ YP
Sbjct: 141 FQCVVDG-------DGEWGSRSELGHSITNGTAAEPMIPHG------TYLIGDAGYPSNV 187
Query: 206 WLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLA 264
++ P A ++ F TR ++E SFG LK +F L + +++ P A A
Sbjct: 188 RVLLPYLSTA-TAENEEFNFIQSSTRIIVEQSFGSLKNRFRILLHAQMARPLRARNNAFA 246
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPR 313
LHN+ N+ ++ + +A + E+ + PDE E + R
Sbjct: 247 CMILHNLL------NKRGSLYLQAWDARNPAEQVFNELPDERGEGGRRR 289
>gi|58270796|ref|XP_572554.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57228812|gb|AAW45247.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 428
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 110/271 (40%), Gaps = 32/271 (11%)
Query: 52 GVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGG 110
G + IG G S+ + V+ I+ + + V WP+EN I F +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAIL-SLERNNVYWPDENERKAIDRHFEEEED 183
Query: 111 FPN-VAGCVDGTMV-LIDAPNEHED-QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
P+ G +DG V P H+ F + G + N++ IC + +P S
Sbjct: 184 IPDGCVGIIDGFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASA 243
Query: 168 HDARVMRNSMVYRKFDEEGW---------RPFPDAVI------LGDSAYPLKEWLIP--- 209
HDAR+ +N ++ + + + R +I L DSA+P +W +P
Sbjct: 244 HDARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFK 303
Query: 210 -PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLS------PQFAGMVV 262
R +N + +F + R IE+++GILK ++ L +R+ A +
Sbjct: 304 RRRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKICNVRDEGVATCWI 363
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEAED 293
A LHN L ID + ++ DE +D
Sbjct: 364 QACVVLHN--LLIDTGDWYNQLDGDADEPDD 392
>gi|328726675|ref|XP_003248996.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 337
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/275 (21%), Positives = 111/275 (40%), Gaps = 19/275 (6%)
Query: 21 LEEIAPALTTPTERN-----VALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSI 75
+E+A ++T + +++ E + L ++G GS + T+ ++I + +
Sbjct: 6 FDELAVKISTKIKSQDTCMRLSIPPLEMLAVTLRYLGSGSDQIDLHLTYRLGHTTIGKIL 65
Query: 76 HEVVPVIVDTFFQRMV-KWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
+V + D + ++ E +I F + FPN G +DG V I P
Sbjct: 66 RKVCNALWDCLREESFPEFTENRWREIAEGFEKYCQFPNCLGAIDGKHVRIRKPRISGSL 125
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW-----RP 189
F N ++ ++AI Y F ++ G D+ V S +Y + + +P
Sbjct: 126 FYNYKNFFSIVLLAIVDAKYNFIYIDVGAFGKESDSTVFERSNLYEQLENNELHIPRGKP 185
Query: 190 FPDAV-------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
P V +GD A+ L + ++ P + + F R RR +E++FGIL
Sbjct: 186 LPGTVSPNMPYTFVGDEAFSLSKNIMRPYSGKYLVDKKRIFNYRLSRARRNVESAFGILS 245
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIALTID 276
K+ + + + + ++V LHN D
Sbjct: 246 NKWKIFHKPINANLDLSILIVKTCCALHNFVRARD 280
>gi|403167044|ref|XP_003326870.2| hypothetical protein PGTG_08407 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166834|gb|EFP82451.2| hypothetical protein PGTG_08407 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 109/265 (41%), Gaps = 30/265 (11%)
Query: 6 FKEAFRLSATQVEFILEEI---------APALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+A R + ++L+ I +P P +AL+ L L G G+
Sbjct: 88 FKQAVRTTKDGFLWLLDRIRHHPIFYSNSPRPQLPIPHQLALT-----LERLGSNGNGAS 142
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+ ++ + V+ I + +V WP + + + GF G
Sbjct: 143 VGRFSRNLSVGRGTVIKVSRRVIQAINEVSTNYIV-WPNKERRAEISEVMTEEGFEGCIG 201
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P + + +R +++N IC + + WPG+ D+ V +N
Sbjct: 202 FVDGTTIPLHQRPGLDGEVYWDRKKQYSINCQVICDCDRFITSFMTGWPGTCGDSLVFKN 261
Query: 176 SMVY----RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKR 229
V+ R FD P ++ DSAY L IP ++P + ++ F C +
Sbjct: 262 MKVHLEPDRFFD-------PGQYLIADSAYALSMTTIPA-YKSPLSKINRNTEFNYCLAK 313
Query: 230 TRRLIENSFGILKEKFPCLNYMRLS 254
+R E++ GILK ++ L +RLS
Sbjct: 314 SRVRNEHTIGILKGRWASLQQLRLS 338
>gi|222616440|gb|EEE52572.1| hypothetical protein OsJ_34844 [Oryza sativa Japonica Group]
Length = 552
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 126/306 (41%), Gaps = 38/306 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------ 90
LS +++V L + G +G+ G + H V +I F + M
Sbjct: 95 VLSLEDRVAVALIRLNSGGSLVTVGSAVGVN--------HSTVSLITWRFVEAMEERASH 146
Query: 91 -VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNV 146
++WP+ + + I F + G PN G VD T + + + + +++ ++++ +
Sbjct: 147 HLRWPDSSEVEKIKSMFEKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVL 206
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------I 195
A+ P+ +F + WPGS+ ++ ++ +S ++ K E+G R + I
Sbjct: 207 QAVISPDMRFMDIVTGWPGSMKESSILHSSGLF-KMCEKGARLNGSKMVVSDGSEIGEYI 265
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+GD+ YPL WL+ P + + F + H + ++ K+ + L+ P
Sbjct: 266 IGDAGYPLLPWLLTPYQEKDLSDSKLEFNKRHAAAITVAPSTLANFKDTWKFLHGEMWRP 325
Query: 256 QFAGM--VVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD------GEEEAVDDPDEID 307
+ ++ LHNI + + ++ +D +A +E AV D++
Sbjct: 326 DKHRLPRIIHVCCMLHNIIICLQDATIDEAAMSNDHDANYKQQVCQLADENAVRVRDKLS 385
Query: 308 EEEQPR 313
E R
Sbjct: 386 EHLVSR 391
>gi|356502894|ref|XP_003520249.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK + R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETTSRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +DGT + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDGTHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYHNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|403167155|ref|XP_003326965.2| hypothetical protein PGTG_08502 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166880|gb|EFP82546.2| hypothetical protein PGTG_08502 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 222
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 76/166 (45%), Gaps = 11/166 (6%)
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAI 149
V+WP+++ +R GF G VDGT + + P + F +R +++N +
Sbjct: 63 VQWPDKDRRAEISEAMRMEGFSGCVGFVDGTTIPIFQRPGFDGETFFDRKKRYSMNAQIV 122
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSMVYR---KFDEEGWRPFPDAVILGDSAYPLKEW 206
C + + + WPGS D++V + +++ +F ++G +L DSAY L
Sbjct: 123 CDCDRFITSFISGWPGSCGDSKVYQRMQLHQNPSQFFDQG------QYLLADSAYDLTRT 176
Query: 207 LIPP-RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM 251
IP + + + F C + R E++ GILK ++ L M
Sbjct: 177 TIPAYKAPATEVLINTDFNYCLAKARVRNEHTIGILKSRWASLREM 222
>gi|331227257|ref|XP_003326297.1| hypothetical protein PGTG_08127 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 937
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 62/125 (49%), Gaps = 12/125 (9%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP ++ +R+ GFP G VDGT + L P + + +R ++++V
Sbjct: 144 LVTWPTKSEQIESSQVMREEGFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 203
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLK 204
IC N K + A +PGS HDA V + V ++ FD++ + +L DSAY
Sbjct: 204 ICDINKKIISYLAGFPGSSHDAYVFSHMQVAQQPEKYFDQKQF-------LLADSAYTND 256
Query: 205 EWLIP 209
+++P
Sbjct: 257 CYVVP 261
>gi|77556528|gb|ABA99324.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77556531|gb|ABA99327.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 549
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 83/200 (41%), Gaps = 26/200 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G VDGT +L P++ + +++ R+ NVMAIC + +F PGS+HD
Sbjct: 316 FKDCIGAVDGTHILAVVPDDEKIRYIGRSKSTTQNVMAICDHDMRFTYAFIGQPGSMHDI 375
Query: 171 RVMRNSMVYRKFDEEGWRPFP----DAVILGDSAYPLKEWLIPP-----------RTRNP 215
V+ ++ + FP L D+ YP + P R
Sbjct: 376 TVLFTAL------QTDLTIFPHPPQGKYYLVDAGYPNCPGYLAPYKGQRYHVPEFRRGTA 429
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFA-GMVVLASTTLHNIALT 274
+ ++F H R +IE FG+ K K+ L + P + MVV A+ LHN
Sbjct: 430 PSGEKEKFNFLHSSLRTIIERCFGVWKMKWWILLKIPSFPMWKQKMVVAATMALHNFI-- 487
Query: 275 IDRDNENQNVEIHDDEAEDD 294
RD++ + H E D
Sbjct: 488 --RDHDAPDRHFHRFERNPD 505
>gi|356560691|ref|XP_003548623.1| PREDICTED: uncharacterized protein LOC100801431 [Glycine max]
Length = 469
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 8/158 (5%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV ++ E V H + Y ++ ++ S I R V+
Sbjct: 95 ILQEKGKLVKT---RNVPIA--EVVAMFFHILAHNLKYRVMHFSYCRSMEIISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ + NL R F N G +DG +L+ E ++ N+
Sbjct: 150 RAIMK-ISKEYLKFHDYNLEGSVENKWR--WFKNSIGALDGIHILVTVATEDRPRYRNKK 206
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
D + NV+ +CGP+ +F V W GS D+RV+R+++
Sbjct: 207 CDISTNVLGVCGPDLRFIYVLPGWQGSTGDSRVLRDAL 244
>gi|356533889|ref|XP_003535490.1| PREDICTED: uncharacterized protein LOC100780221 [Glycine max]
Length = 531
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 68/287 (23%), Positives = 120/287 (41%), Gaps = 29/287 (10%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 238 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSIVPAKIRESTRFYPY 297
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 298 FKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 357
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR------------TRNPDA 217
+V+ +++ + G + L D +P +++L P R +P++
Sbjct: 358 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPES 413
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIAL 273
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 414 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNFLR 469
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D + +EQ R+D N
Sbjct: 470 KECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQKREDAN 515
>gi|147790109|emb|CAN65312.1| hypothetical protein VITISV_002752 [Vitis vinifera]
Length = 1057
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 129/310 (41%), Gaps = 46/310 (14%)
Query: 34 RNVALS-AKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV---DTFFQR 89
RN A S KE+V LH +G + S ++ H+V+ I+ D F ++
Sbjct: 140 RNSAHSNVKEQVAKFLHIVGHNLRNRTMKFYFKHSSETVSHHFHQVLRTIISLNDVFLKQ 199
Query: 90 --MVKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNV 146
+K P+E N T+ F + G +DG+ + N+ ++ R NV
Sbjct: 200 PDGLKCPQEIKDNSKFWTY-----FKDYIGAIDGSHFRVKVSNDVVQRYRGRKYYPTQNV 254
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEW 206
+A C + KF V +W GS D+R++ N+++ R FD+
Sbjct: 255 LAACSFDLKFTYVLPSWEGSALDSRILDNALM-RDFDK---------------------- 291
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVV 262
LI P+ A + F H R IE FG+LK++FP + P + ++
Sbjct: 292 LIVPQEN-----AREVFNIRHLSLRNAIEKXFGVLKKRFPIIA-SGTEPHYPVDTQSDII 345
Query: 263 LASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQ 322
LA LHN + +D NE E+ + ++ E E++ + E E + + +
Sbjct: 346 LACCILHNYLMGVD-PNERLIAEVDRELFSEETEFESMMEHSEGFEVHKSLRYKPIRNYD 404
Query: 323 RLARLLNYFR 332
+L LL +R
Sbjct: 405 KLXILLGKYR 414
>gi|357168015|ref|XP_003581441.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 286
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 73/179 (40%), Gaps = 24/179 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + + P D+F R NV+A + F V+A W GS HD+
Sbjct: 98 FKECIGAIDGTHIPCNVPARIVDRFRGRKPFRTQNVLAAVDFDLLFTYVSAGWEGSTHDS 157
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---------------RTRNP 215
V+R+S+ + G R L D+ Y + +PP + R P
Sbjct: 158 TVLRHSLEH----PNGLRVPEGKYYLADAGYAARRGFLPPFHQTRYHLREWRGNYKPRTP 213
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLASTTLHNIAL 273
+ + R H R +E +FG LK F L P + + VV+A LHN L
Sbjct: 214 NELFNLR----HSSLRTTVERAFGTLKNWFKVLTTRPYYPFPSQVRVVIACCILHNWIL 268
>gi|356506799|ref|XP_003522163.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 297
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 67/137 (48%), Gaps = 17/137 (12%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
LH + Y ++ ++ SK +I R ++V+ ++ K ++ LN P T
Sbjct: 102 LHILAHNLKYRVVQFSYCTSKETISRQFNDVLRAVM--------KVSKDYLNFQPCTL-- 151
Query: 108 KGG-------FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVN 160
+G F G +DGT + + + ++ NR GD + NV+A CGP+ KF V
Sbjct: 152 EGAEANKWRWFERCIGALDGTHISVTVSPDERPRYRNRKGDVSTNVLAACGPDLKFIHVL 211
Query: 161 ANWPGSVHDARVMRNSM 177
W GS D++V+R+++
Sbjct: 212 PGWEGSTGDSQVLRDAL 228
>gi|322789477|gb|EFZ14761.1| hypothetical protein SINV_04990 [Solenopsis invicta]
Length = 185
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 84/209 (40%), Gaps = 36/209 (17%)
Query: 107 RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
R GF V G +DG + P + + NR G H++ + +C KF PG
Sbjct: 5 RSRGFQGVIGAIDGCHIPCKQPIRNPHDYYNRKGFHSIILQGVCNHRGKFIDCFIGLPG- 63
Query: 167 VHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ-RFLR 225
DSAYPL L+ P N Q ++
Sbjct: 64 -------------------------------DSAYPLMVNLMTPYRENGHLIGPQIQYNT 92
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIALTIDRDNENQNV 284
R R +IE +FG+LK KF L Y+ ++ ++ A+ TLHN ID DN N
Sbjct: 93 KLSRIRSIIERAFGLLKTKFRRLKYLDIADFDLGNKMIAAACTLHN--FIIDGDNLNVED 150
Query: 285 EIHDDEAEDDGEEEAVDDPDEIDEEEQPR 313
E + ++ + + E+A + ID E+ R
Sbjct: 151 EDYLEDEDVNDIEQARRQEEVIDAVEKRR 179
>gi|115486779|ref|NP_001068533.1| Os11g0702700 [Oryza sativa Japonica Group]
gi|62733225|gb|AAX95342.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108864702|gb|ABG22606.1| expressed protein [Oryza sativa Japonica Group]
gi|113645755|dbj|BAF28896.1| Os11g0702700 [Oryza sativa Japonica Group]
gi|215693890|dbj|BAG89089.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 391
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 126/306 (41%), Gaps = 38/306 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------ 90
LS +++V L + G +G+ G + H V +I F + M
Sbjct: 95 VLSLEDRVAVALIRLNSGGSLVTVGSAVGVN--------HSTVSLITWRFVEAMEERASH 146
Query: 91 -VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNV 146
++WP+ + + I F + G PN G VD T + + + + +++ ++++ +
Sbjct: 147 HLRWPDSSEVEKIKSMFEKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVL 206
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------I 195
A+ P+ +F + WPGS+ ++ ++ +S ++ K E+G R + I
Sbjct: 207 QAVISPDMRFMDIVTGWPGSMKESSILHSSGLF-KMCEKGARLNGSKMVVSDGSEIGEYI 265
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+GD+ YPL WL+ P + + F + H + ++ K+ + L+ P
Sbjct: 266 IGDAGYPLLPWLLTPYQEKDLSDSKLEFNKRHAAAITVAPSTLANFKDTWKFLHGEMWRP 325
Query: 256 QFAGM--VVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD------GEEEAVDDPDEID 307
+ ++ LHNI + + ++ +D +A +E AV D++
Sbjct: 326 DKHRLPRIIHVCCMLHNIIICLQDATIDEAAMSNDHDANYKQQVCQLADENAVRVRDKLS 385
Query: 308 EEEQPR 313
E R
Sbjct: 386 EHLVSR 391
>gi|108864703|gb|ABA95488.2| expressed protein [Oryza sativa Japonica Group]
Length = 398
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 126/306 (41%), Gaps = 38/306 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------ 90
LS +++V L + G +G+ G + H V +I F + M
Sbjct: 102 VLSLEDRVAVALIRLNSGGSLVTVGSAVGVN--------HSTVSLITWRFVEAMEERASH 153
Query: 91 -VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNV 146
++WP+ + + I F + G PN G VD T + + + + +++ ++++ +
Sbjct: 154 HLRWPDSSEVEKIKSMFEKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVL 213
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------I 195
A+ P+ +F + WPGS+ ++ ++ +S ++ K E+G R + I
Sbjct: 214 QAVISPDMRFMDIVTGWPGSMKESSILHSSGLF-KMCEKGARLNGSKMVVSDGSEIGEYI 272
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+GD+ YPL WL+ P + + F + H + ++ K+ + L+ P
Sbjct: 273 IGDAGYPLLPWLLTPYQEKDLSDSKLEFNKRHAAAITVAPSTLANFKDTWKFLHGEMWRP 332
Query: 256 QFAGM--VVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD------GEEEAVDDPDEID 307
+ ++ LHNI + + ++ +D +A +E AV D++
Sbjct: 333 DKHRLPRIIHVCCMLHNIIICLQDATIDEAAMSNDHDANYKQQVCQLADENAVRVRDKLS 392
Query: 308 EEEQPR 313
E R
Sbjct: 393 EHLVSR 398
>gi|449685004|ref|XP_004210777.1| PREDICTED: uncharacterized protein LOC101237542 [Hydra
magnipapillata]
Length = 149
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 69/147 (46%), Gaps = 13/147 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP+ G +DG +++ AP+ ++ N H++ ++A+C Y+F V+ G D
Sbjct: 4 FPHAIGAIDGKHIVMQAPHNGGSEYFNYKKTHSIALLAVCNAKYEFTMVDIGDSGRQSDG 63
Query: 171 RVMRN-SMVYRKFDEEGWRPFPD----------AVILGDSAYPLKEWLIPPRTRNPDAAA 219
V N S+ Y + + P P+ V++ D + LK ++ P N +
Sbjct: 64 SVFNNCSLGYAIENNKLNIPDPEYIGNSEKVLPYVLVADDTFGLKRHMMKPYP-NQNIPL 122
Query: 220 DQRFLRCH-KRTRRLIENSFGILKEKF 245
DQ+ R RR+IEN+FGI +F
Sbjct: 123 DQKIFNYRLSRARRVIENTFGIATTRF 149
>gi|242070509|ref|XP_002450531.1| hypothetical protein SORBIDRAFT_05g006635 [Sorghum bicolor]
gi|241936374|gb|EES09519.1| hypothetical protein SORBIDRAFT_05g006635 [Sorghum bicolor]
Length = 250
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 76/176 (43%), Gaps = 25/176 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT +L+ + +NR G NV+A+C N F + A GS +D
Sbjct: 30 FDGYIGAMDGTHILVSVDDGVRLDHINRYGVTTQNVVAVCDFNMMFTYIGAGTEGSANDM 89
Query: 171 RVMRNSMVYRKFDEEGWRPFP--DAVILGDSAYPLKEWLIPP----RTRNPDAA------ 218
R+ +K +E+ P P L DS Y L +PP R R D
Sbjct: 90 RMK------KKAEEDPAFPIPPDRRYYLVDSGYALGPGYMPPYPQKRFRAKDFKNLGPQD 143
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAG----MVVLASTTLHN 270
A++ F R H R +IE SFG+ K K+ L + P + G ++L LHN
Sbjct: 144 AEELFNRHHAGLRSVIERSFGVAKSKWRMLKSI---PHYPGTKQTQIILVLFALHN 196
>gi|242067475|ref|XP_002449014.1| hypothetical protein SORBIDRAFT_05g003395 [Sorghum bicolor]
gi|241934857|gb|EES08002.1| hypothetical protein SORBIDRAFT_05g003395 [Sorghum bicolor]
Length = 725
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 111/255 (43%), Gaps = 18/255 (7%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
++ +EK+ L+ + + Y + S +I R I +V VI + R VK P
Sbjct: 73 GITVEEKLGHFLYMISHNASYEDLQHEFHHSGETIHRHIKDVFNVIPSLTY-RFVKSPTG 131
Query: 97 NLNDIPMTFLRKG--GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
++ ++ F N G +DGT V I + + + NR + NVM +C +
Sbjct: 132 TETHWKISTDQRFFPYFKNCLGFIDGTHVPITISQDLQAPYRNRKNTLSQNVMLVCDLDL 191
Query: 155 KFYAVNANWPGSVHDARVMRNSMV------YRKFD--EEGWRPFPDAVI-LGDSAYPLKE 205
F +++ W GS DARV+ ++M+ + K+ + G+ P + Y LKE
Sbjct: 192 NFSFISSGWEGSATDARVLSSAMLGGFNVPHGKYYLVDGGYANTPSFIAPYRGVRYHLKE 251
Query: 206 W-LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVL 263
+ R RN + R H R IE + G+LK++FP L P + + +
Sbjct: 252 FGRGQQRPRNYKELFNHR----HAILRNHIERAIGVLKKRFPILKVGTHHPIENQVKIPV 307
Query: 264 ASTTLHNIALTIDRD 278
A+ HN+ ++ D
Sbjct: 308 AAVVFHNLIRMLNGD 322
>gi|347970693|ref|XP_559600.5| AGAP003812-PA [Anopheles gambiae str. PEST]
gi|333466789|gb|EAL41345.3| AGAP003812-PA [Anopheles gambiae str. PEST]
Length = 445
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 114/277 (41%), Gaps = 18/277 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F+ + T V + +++AP P + SA++KV L+ + G+ Y IG G
Sbjct: 102 FRMGRAMYDTLVAMLRQDLAPH---PLLAAQSCSAEKKVGIGLYKLTTGADYTTIGNMFG 158
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
K+++ +H+ +V + + P +E + +I +F K P V G + V
Sbjct: 159 VHKATVKNCVHQFCKSLVKNYMDSEIYLPDQEEMAEISKSFEDKSDIPLVIGAMGRLHVP 218
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I ++N +L + A+ N+ F + G+ D+ V+ +S +Y+ FD
Sbjct: 219 ITPSLADSKNYINGRKWPSLILQAVVDNNHCFRHITCGHVGATEDSVVLGDSGLYQHFDG 278
Query: 185 EGWRPFPD---------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
P + I+ D+ YPL WL P ++ F + R ++
Sbjct: 279 A---DLPSQNINGNTVKSFIVSDTVYPLLPWL-QHGYMTPTTTEEETFNEHLNKARVAVD 334
Query: 236 NSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNI 271
+F L+ +F L + + F ++L LHN
Sbjct: 335 EAFDRLRARFRILQRKIDIDINFVPQILLTCCILHNF 371
>gi|317419308|emb|CBN81345.1| Putative nuclease HARBI1 [Dicentrarchus labrax]
Length = 371
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 115/277 (41%), Gaps = 14/277 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
A + F L+ + FI + + + + ALS V+ L++ G+ I
Sbjct: 36 ALFQMFHLTRPCIAFITDAVRIRMKKVALKKPALSVDSMVMVTLNYYAHGASSAAILKGV 95
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
G ++ C+S+ V ++ + + +P E ++ R G PNV G +
Sbjct: 96 GLTQKD-CQSLVGTVSGVIAGMSDQFISFPLIREAKANVAFKVKRFCGIPNVLGVLAPAH 154
Query: 123 VLIDAPNEHED---QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
I A +D FVN G ++ IC + +V GS + + +S
Sbjct: 155 FKIRASPYEKDTFRSFVNTLGYTSVVSQIICDSDGNILSVEKCCVGSTFEQEMWESSFKG 214
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
R+ +EE P+ ++G Y L + ++ P ++ P + RF H + ++ + G
Sbjct: 215 REMEEELHGPY---WVIGGKGYHLSKHVLTPVSQ-PANDNEVRFNEAHAKILNVMRTTLG 270
Query: 240 ILKEKFPCLNYMRLSPQFA----GMVVLASTTLHNIA 272
+K +F CL + + + + ++ A + LHNI+
Sbjct: 271 SMKRRFRCLMQLGFAQEGSLDKKSNIIKACSVLHNIS 307
>gi|212721120|ref|NP_001132644.1| hypothetical protein [Zea mays]
gi|194694982|gb|ACF81575.1| unknown [Zea mays]
gi|413920177|gb|AFW60109.1| hypothetical protein ZEAMMB73_335492 [Zea mays]
Length = 390
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 53/264 (20%), Positives = 107/264 (40%), Gaps = 32/264 (12%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---- 90
L +++V L + G +G++ G + H V +I F + M
Sbjct: 91 GTVLCLEDRVAIALRRLNSGGSLVTVGSSVGVN--------HSTVSLITWRFIEAMEERA 142
Query: 91 ---VKWPEE-NLNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHAL 144
++WP+ + I F + G PN G VD T + + + + ++++ ++++
Sbjct: 143 SHHLRWPDSIEMEKIKSKFEKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDQEKNYSM 202
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV---------- 194
+ A+ + KF + WPGS+ ++ ++ +S ++ K E+G R +
Sbjct: 203 VLQAVVDLDTKFTDIVTGWPGSMKESSILHSSGLF-KLCEKGERLNGSKLKVSDGSEIGE 261
Query: 195 -ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL 253
++GDS YPL WL+ P + F H R + + K+ + L
Sbjct: 262 YLIGDSGYPLLPWLLTPYQEKDLTESSAEFNIRHSSARTVAPRTLAKFKDTWKFLQGEMW 321
Query: 254 SPQFAGM--VVLASTTLHNIALTI 275
P + ++ LHNI + +
Sbjct: 322 RPDKHKLPRIIHVCCLLHNIIIDL 345
>gi|328699639|ref|XP_001946822.2| PREDICTED: hypothetical protein LOC100159851 [Acyrthosiphon pisum]
Length = 401
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 71/346 (20%), Positives = 135/346 (39%), Gaps = 28/346 (8%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIG 61
P FK +R+S + + ++ + P + T +A+ +E++L L ++ G + +
Sbjct: 51 PDKFKTFYRMSKSNFDNLVHIVGPKIFKKDTNFRIAVPVEERILITLRFLATGCNFRALA 110
Query: 62 ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDG 120
++++ + I E I + + P L +I + P+ G +DG
Sbjct: 111 QHFMRGETTVGKIIAETTEAIWECLQPTYLPVPSLELWKNIAARYNLLWQLPHCLGSIDG 170
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ I N + N G ++ ++A + F V+ G D V R S + R
Sbjct: 171 KHIRIKKFNNTGSRNFNYKGFFSIQLLACADADGCFITVDIGDLGRNSDGGVFRLSRLGR 230
Query: 181 KFDEEGW-----RPFPD--------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
+ G P P GD A+PLK +L+ P + + F
Sbjct: 231 WLEIGGMNVPQSEPLPHDNEGPNFPYYFCGDEAFPLKSYLLRPYPQKTLNDQKRIFNYRL 290
Query: 228 KRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHN-IALTIDRDNENQNVE 285
R R+ +E +FG++ KF + S +V + LHN I T + E+QN+
Sbjct: 291 SRGRKSVECAFGMMVSKFRVFETPIACSESTVISIVKCACALHNYIRKTEGKLYESQNIN 350
Query: 286 IHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLNYF 331
D+ ++ P + + +Q + + + R L +YF
Sbjct: 351 SQDE----------INIPHHLTQHQQAVHNLSTASSVR-EYLSSYF 385
>gi|147792468|emb|CAN74706.1| hypothetical protein VITISV_036360 [Vitis vinifera]
Length = 352
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 80/167 (47%), Gaps = 19/167 (11%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVL 124
S +I + HEV+ +V+ F + M+ P ++ N I LR+ F NV G +DGT++
Sbjct: 156 SSQTIHKYFHEVLVAMVN-FSKEMITLPLFNDSSNGISNRRLRQI-FKNVVGAIDGTLIH 213
Query: 125 IDAPNEHEDQFVNR-NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
P + + R G+ NVMA+C + F V W G+ HD+RV+ ++ Y + +
Sbjct: 214 ACIPTNQQVPYRGRGKGECFQNVMAVCDFDIIFRFVVVGWEGTTHDSRVLTETIHYPQHN 273
Query: 184 EEGWRPFPDA----VILGDSAYPLKEWLIPP----RTRNPDAAADQR 222
FP A L D+AY + P +TR+P A + R
Sbjct: 274 ------FPMAPSEKYYLVDAAYTHTRGFMAPYRNVKTRSPMAEFETR 314
>gi|449691505|ref|XP_004212695.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 223
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 79/187 (42%), Gaps = 28/187 (14%)
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAP-NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
F K G GCVDG+ + I P N +D F + +++ V A+C F V W
Sbjct: 8 FESKFGMIQAHGCVDGSHIPIVCPANPSQDYFCYKQY-YSMQVQAVCDYKGSFLDVECKW 66
Query: 164 PGSVHDARVMRNSMV------------YRKFDEEGWRPFPDAVILGDSAYP-----LKEW 206
PGSVHDA+V NS + ++ + + ++GD AYP +KE+
Sbjct: 67 PGSVHDAKVFSNSSINTNLRSSRLPGTFQTITKNKIK--APCYLIGDPAYPILPHCMKEY 124
Query: 207 LIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YMRLSPQFAGMVVLAS 265
+ D LR R IE +FG LK ++ L M L + V+ A
Sbjct: 125 ---STCKKNDEFIFNSMLR---TARNPIECAFGRLKVRWKILTKKMDLKLEKIPTVIYAC 178
Query: 266 TTLHNIA 272
LHN+
Sbjct: 179 FILHNLC 185
>gi|331217820|ref|XP_003321588.1| hypothetical protein PGTG_03125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 118/289 (40%), Gaps = 38/289 (13%)
Query: 6 FKEAFRLSATQVEFILEEIA-----------PALTTPTERNVALSAKEKVLSCLHWMGVG 54
FK+A R + T ++L +I P L P + + L E++ S G G
Sbjct: 88 FKQAVRTTKTGFLWLLNQIRIHPIFYNNSTRPQLPIPHQLALTL---ERLGS----NGNG 140
Query: 55 SPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+ + ++ + V+ I + + + WP+++ + K GF
Sbjct: 141 ASVGRFSRNLSVGRGTVVKVSRRVIQAI-NEISSKYIVWPDKDRRKEISGVMEKEGFKGC 199
Query: 115 AGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G VDGT + L P + + +R +++N IC + + WPGS D+ V
Sbjct: 200 IGFVDGTTIPLHQRPGIDGEVYWDRKKRYSINCQVICDCDKFITSFMTGWPGSCGDSLVF 259
Query: 174 RNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAADQRFLRCHK 228
N V+++ FD + ++ D+AY L IP + + F C
Sbjct: 260 TNMKVHKEPENYFDSGQY-------LIADTAYGLSMTTIPAYKAPLSKVHRNTEFNYCLA 312
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLS--PQFAGMVVL----ASTTLHNI 271
++R E++ GILK ++ L ++LS Q M +L A LHN+
Sbjct: 313 KSRVRNEHTIGILKGRWASLQQLQLSLNEQKDMMEILRWINACVALHNM 361
>gi|62733091|gb|AAX95208.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77549101|gb|ABA91898.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77553958|gb|ABA96754.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 113/290 (38%), Gaps = 33/290 (11%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ FRL+A ++ + + + + V + EK+ L+ MG P
Sbjct: 62 RAMFRLTAEEIHNLHDLLVTSFGLVGTAGVC--SMEKLGIFLYTMGGNRPIRDANNRWVR 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFP---NVAGCVDGTMV 123
S S++ H V+ + D +++K + N D L++ F G VDGT +
Sbjct: 120 SNSTVSVHFHHVLQAMND-LAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHI 178
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ Q NR+ NV+ IC + + +A WPGSVHD R++ ++ +D
Sbjct: 179 PVLPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILNEAVQAYPYD 238
Query: 184 EEGWRPFPDA----VILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRC 226
FP +L DS +P + + P P ++ F
Sbjct: 239 ------FPRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREETFNHR 292
Query: 227 HKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIAL 273
H R ++E FGI K+ + L Y R ++ A+ LHN L
Sbjct: 293 HSTLRGIVERQFGIAKKMWKILKEIPYYR-DEDIPARIIHAAFALHNFRL 341
>gi|218186214|gb|EEC68641.1| hypothetical protein OsI_37051 [Oryza sativa Indica Group]
Length = 391
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 125/306 (40%), Gaps = 38/306 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------ 90
LS +++V L + G +G+ G + H V +I F + M
Sbjct: 95 VLSLEDRVAVALIRLNSGGSLVTVGSAVGVN--------HSTVSLITWRFVEAMEERASH 146
Query: 91 -VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNV 146
++WP+ + + I F + G PN G VD T + + + + +++ ++++ +
Sbjct: 147 HLRWPDSSEVEKIKSMFEKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVL 206
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------I 195
A+ P+ +F + WPGS+ ++ ++ +S ++ K E+G R + I
Sbjct: 207 QAVISPDMRFMDIVTGWPGSMKESSILHSSGLF-KMCEKGARLNGSKMVVSDGSEIGEYI 265
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP 255
+GD+ YPL WL+ P + + F + H + + K+ + L+ P
Sbjct: 266 IGDAGYPLLPWLLTPYQEKDLSDSKLEFNKRHAAAITVAPRTLANFKDTWKFLHGEMWRP 325
Query: 256 QFAGM--VVLASTTLHNIALTIDRDNENQNVEIHDDEAEDD------GEEEAVDDPDEID 307
+ ++ LHNI + + ++ +D +A +E AV D++
Sbjct: 326 DKHRLPRIIHVCCMLHNIIICLQDATIDEAAMSNDHDANYKQQVCQLADENAVRVRDKLS 385
Query: 308 EEEQPR 313
E R
Sbjct: 386 EHLVSR 391
>gi|294461349|gb|ADE76236.1| unknown [Picea sitchensis]
Length = 368
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 70/317 (22%), Positives = 124/317 (39%), Gaps = 22/317 (6%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
M +FK FR+S T E++ ++ P +T + V E+V L + GS Y +
Sbjct: 27 MESASFKANFRMSPTTFEWLCKQFEPLVTKTNDVGVF----ERVGMGLFRLATGSTYQAV 82
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVD 119
++ + E+ VI T + V +P +L ++ F G PN G +D
Sbjct: 83 SQRFVSTEPTARLCTKELCRVIC-TNLKFWVAYPSPCDLPNVSDEFQTLSGLPNCCGAID 141
Query: 120 GTMVLIDAPNEHEDQFVNRNGD---HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
T N + N D ++ + + + ++ + G+ D+RV+R+S
Sbjct: 142 CTR--FKFTNTNYPYIYNEEDDDLQQSVVTQIVVDCSSRILSIITGFKGNKGDSRVLRSS 199
Query: 177 MVYRKFDEEGWRPFP---------DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH 227
+Y +E P ++GDS YPL WL+ P T + + F H
Sbjct: 200 TLYADIEEGRLLNSPPFHLKGVPVRQYLVGDSGYPLLPWLMVPYTDPVVTSCQEDFNSKH 259
Query: 228 KRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R+ + L++ + + A + A LHNI L R+ + +E
Sbjct: 260 YVMRQAALRTLSCLRKWGILSKPIEEDVRVAVGCIGACAILHNILLA--REGGSTILEGT 317
Query: 288 DDEAEDDGEEEAVDDPD 304
+D A D + D +
Sbjct: 318 EDYALHDQSSQYYRDAN 334
>gi|156540836|ref|XP_001601490.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 325
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 103/245 (42%), Gaps = 26/245 (10%)
Query: 44 VLSCLHWMGVGSPYHIIGA--THGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLN 99
VLS L + GS + H +S+ R + + + +++ R +++P +E
Sbjct: 82 VLSMLRFFAEGSFQKGFASDYRHPMGQSTASRYLTQFLQAVLE-LAPRFIRFPTSQEERR 140
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYA- 158
I F R P + G VD +V + P+E+E+ F N A+NV + NY +
Sbjct: 141 QISTQFQRTIRIPGIIGLVDAFIVCMKRPSENEEAFYNYRHGPAMNVQIVVDCNYIIRSI 200
Query: 159 ----------VNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
N N+ G+ ++R + + EG +LGDS Y L+
Sbjct: 201 RIISGSNNDQFNWNFCGARDYLELLRRNP--EIIENEG-----HYYMLGDSGYARSSVLL 253
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMR---LSPQFAGMVVLAS 265
P P+ + + H TR ++E + G+L + ++ R SPQ M++ A+
Sbjct: 254 TPILDAPEGSPASVYTYDHVHTRYMVEQTIGMLSGTWKVISRCRKLYYSPQKVSMIINAA 313
Query: 266 TTLHN 270
L+N
Sbjct: 314 AILYN 318
>gi|449680707|ref|XP_004209655.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 199
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 52/118 (44%), Gaps = 2/118 (1%)
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDG 120
T G + + I EV IV R V P+ + + + F K G GCVDG
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
T + I P H + ++L V A+C F V WPGSV+DA+V NS +
Sbjct: 65 THIPIVCPTNHSQDYFCYKQYYSLQVQAVCDYKGSFLDVEWLWPGSVYDAKVFSNSSI 122
>gi|28411822|dbj|BAC57297.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 396
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 113/290 (38%), Gaps = 33/290 (11%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ FRL+A ++ + + + + V + EK+ L+ MG P
Sbjct: 62 RAMFRLTAEEIHNLHDLLVTSFGLVGTAGVC--SMEKLGIFLYTMGGNRPIRDANNRWVR 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFP---NVAGCVDGTMV 123
S S++ H V+ + D +++K + N D L++ F G VDGT +
Sbjct: 120 SNSTVSVHFHHVLQAMND-LAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHI 178
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ Q NR+ NV+ IC + + +A WPGSVHD R++ ++ +D
Sbjct: 179 PVLPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILNEAVQAYPYD 238
Query: 184 EEGWRPFPDA----VILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRC 226
FP +L DS +P + + P P ++ F
Sbjct: 239 ------FPRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREETFNHR 292
Query: 227 HKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIAL 273
H R ++E FGI K+ + L Y R ++ A+ LHN L
Sbjct: 293 HSTLRGIVERQFGIAKKMWKILKEIPYYR-DDDIPARIIHAAFALHNFRL 341
>gi|22330012|ref|NP_683376.1| uncharacterized protein [Arabidopsis thaliana]
gi|332193870|gb|AEE31991.1| uncharacterized protein [Arabidopsis thaliana]
Length = 324
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 93/232 (40%), Gaps = 33/232 (14%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EE 96
+S +E V L G Y +G G ++ ++ R EV+ + ++ P +
Sbjct: 96 ISIEESVAMFLRICGHNEVYRDVGLRFGRNQETVQRKFREVLTA-TELLACDYIRTPTRQ 154
Query: 97 NLNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
L IP F G +DGT V + + + + NR+ + +LN+MAIC
Sbjct: 155 ELYRIPERLQVDQRYWPYFSGFVGAMDGTHVCVKVKPDLQGMYWNRHDNASLNIMAICDL 214
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIP--- 209
F + PGS +D V++ + ++ D E P + L DS YP K+ L+
Sbjct: 215 KMLFTYIWNGAPGSCYDTAVLQ---IAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYR 271
Query: 210 -----------------PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
PR RN +Q CH R +IE +F I K K
Sbjct: 272 SSRNRVVRYHMSQFYYGPRPRNKHELFNQ----CHTSLRSVIERTFRIWKNK 319
>gi|356566437|ref|XP_003551438.1| PREDICTED: uncharacterized protein LOC100812675 [Glycine max]
Length = 531
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 54/238 (22%), Positives = 97/238 (40%), Gaps = 25/238 (10%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 238 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESIRFYPY 297
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 298 FKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 357
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P +P+
Sbjct: 358 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 413
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 414 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 467
>gi|308464339|ref|XP_003094437.1| hypothetical protein CRE_06085 [Caenorhabditis remanei]
gi|308247756|gb|EFO91708.1| hypothetical protein CRE_06085 [Caenorhabditis remanei]
Length = 248
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 103/236 (43%), Gaps = 29/236 (12%)
Query: 91 VKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+++PE E+ + MT+ R+ P + G +DG V I AP + F N G + ++A
Sbjct: 4 IRFPENFEDWRSVEMTYRRRNMIPTL-GSLDGKHVRIRAPPQSGSLFFNYKGFFSFVLLA 62
Query: 149 ICGPNYKFYAVN----------------ANWPGSVHDARVMRNSMVYRKFDEEGWRP--- 189
+ + KF V+ PGS +DA + NS + ++ P
Sbjct: 63 LVDGDGKFVWVDLGKLRKKDVHDFISDFKGSPGSTNDATIYNNSRLKTVLEDGATLPKCT 122
Query: 190 FPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
F D+ I+ D +PL + L+ P R + + + R +E++FG+L +
Sbjct: 123 FWDSDIVMPSFIIADGIFPLSKSLMKPFGRRSLTPEESVYNKKLSNARVRVEHTFGMLAK 182
Query: 244 KFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
+F L+ + S + A +V A LHN+ + + N +VE D D +E+
Sbjct: 183 RFRILDRSIECSYECAIEIVTAMCHLHNLLVPPPQSNTIHSVEECDVYPYKDAKEQ 238
>gi|359481091|ref|XP_003632566.1| PREDICTED: uncharacterized protein LOC100853389 [Vitis vinifera]
Length = 311
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 69/177 (38%), Gaps = 17/177 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + P + + F R NVM C + F V A W G +DA
Sbjct: 124 FKDCIGAIDGTHISAWVPADRQTSFRGRKTVITQNVMCACNFDMMFTFVYAGWEGIANDA 183
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
RV +++ R W P + DS YP +PP RN
Sbjct: 184 RVFLDALT-RPEVNFPW-PSEGKYYVVDSGYPCISRFLPPYRGERYHLQEYRGRRNQPIR 241
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTI 275
+ F H R +IE FG+LK +FP L R+ P + V LH + I
Sbjct: 242 YKELFNYRHSFLRNIIERCFGVLKTQFPIL---RMMPCYKRQVGNHQLLLHVVPFII 295
>gi|308446430|ref|XP_003087180.1| hypothetical protein CRE_23869 [Caenorhabditis remanei]
gi|308448813|ref|XP_003087761.1| hypothetical protein CRE_15157 [Caenorhabditis remanei]
gi|308253051|gb|EFO97003.1| hypothetical protein CRE_15157 [Caenorhabditis remanei]
gi|308260373|gb|EFP04326.1| hypothetical protein CRE_23869 [Caenorhabditis remanei]
Length = 248
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 103/236 (43%), Gaps = 29/236 (12%)
Query: 91 VKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
+++PE E+ + MT+ R+ P + G +DG V I AP + F N G + ++A
Sbjct: 4 IRFPENVEDWRSVEMTYRRRNMIPTL-GSLDGKHVRIRAPPQSGSLFFNYKGFFSFVLLA 62
Query: 149 ICGPNYKFYAVN----------------ANWPGSVHDARVMRNSMVYRKFDEEGWRP--- 189
+ + KF V+ PGS +DA + NS + ++ P
Sbjct: 63 LVDGDGKFVWVDLGKLRKKDVHDFISDFKGSPGSTNDATIYNNSRLKTVLEDGATLPKCT 122
Query: 190 FPDA------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKE 243
F D+ I+ D +PL + L+ P R + + + R +E++FG+L +
Sbjct: 123 FWDSDIVMPSFIIADGIFPLSKSLMKPFGRRSLTPEESIYNKKLSNARVRVEHTFGMLAK 182
Query: 244 KFPCLN-YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEE 298
+F L+ + S + A +V A LHN+ + + N +VE D D +E+
Sbjct: 183 RFRILDRSIECSYECAIEIVTAMCHLHNLLVPPTQSNSIDSVEECDVYPYKDAKEQ 238
>gi|356519216|ref|XP_003528269.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVTMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +D T + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDETHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|357127579|ref|XP_003565457.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 89/222 (40%), Gaps = 35/222 (15%)
Query: 102 PMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G VDGT + + P + + + NR + N M C + KF
Sbjct: 147 PHSILRKPKIYPFFKNCIGAVDGTHIPMKLPLDQQKPYRNRKQTISQNAMVACDFDLKFV 206
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR-TR-- 213
+N W GS DARV+++++ G+ L D+ Y ++L P R TR
Sbjct: 207 HINPGWEGSASDARVLQDAL------NHGFEVPDGKFYLVDAGYANTPQFLAPYRDTRYH 260
Query: 214 -NPDAAADQR-------FLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLA 264
N A Q+ F H + R IE GI K +FP L S + +A
Sbjct: 261 LNEQGRARQKPQNHKELFNLGHAQLRNHIERIIGIWKMRFPILKVASHFSKEKQIDTSVA 320
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
LHN + IH+ + + DPD+I
Sbjct: 321 CAVLHNF------------IRIHNGDMTWPSDATMDIDPDQI 350
>gi|357499113|ref|XP_003619845.1| hypothetical protein MTR_6g069810 [Medicago truncatula]
gi|355494860|gb|AES76063.1| hypothetical protein MTR_6g069810 [Medicago truncatula]
Length = 390
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 77/173 (44%), Gaps = 18/173 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG V + + + F N++G + NV+A C + KF V A W GS +
Sbjct: 142 FKDCVGAIDGIYVPVTVGVDEQGPFRNKDGLLSQNVLAACSFDLKFCYVLAGWEGSASNL 201
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNPDAA----------- 218
+V NS + RK + P+ L D+ +P I P R P +
Sbjct: 202 QVF-NSAITRKNKLQ----VPEGKYYLVDNKFPNVPGFIAPYPRTPYHSKEFPTGYQPQN 256
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
A + F + H R + +FG LKE+FP L P Q +V+A+ LHN
Sbjct: 257 ACELFNQRHSLLRSVTARTFGALKERFPILMAAPSYPLQTQVKLVVAACALHN 309
>gi|301611284|ref|XP_002935170.1| PREDICTED: putative nuclease HARBI1-like isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 109
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 190 FPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
P +L D+ YP WLI P R P A+ F H RTR +IE +FG+LK +F CL+
Sbjct: 1 MPHGWLLRDAGYPCSRWLITPIHR-PRTRAECAFNEAHVRTRSMIERTFGVLKSRFHCLD 59
Query: 250 ----YMRLSPQFAGMVVLASTTLHNIA 272
+ SP +V LHN+A
Sbjct: 60 KSGGSLMYSPYKVSQIVAVCAVLHNLA 86
>gi|328697810|ref|XP_003240445.1| PREDICTED: hypothetical protein LOC100569619 [Acyrthosiphon pisum]
Length = 461
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 79/191 (41%), Gaps = 21/191 (10%)
Query: 101 IPMTFLRKGGFPNVAGCVDGTMVL--IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYA 158
+ F + PN G +DG V+ + N F N G H+ + A+C NY F
Sbjct: 54 VAKQFYEQWQIPNCCGALDGKHVVHQVAFANSGSSNF-NYKGSHSTVLPALCDANYYFTI 112
Query: 159 VNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA 218
V+ PG D + E G P+ I+ D A+PL E ++ P T A
Sbjct: 113 VDIGSPGCSSDGEPIEID------SENGKIPY---YIIADEAFPLMENIMRPYTGRGKAN 163
Query: 219 ADQR---FLRCHKRTRRLIENSFGILKEKFPCLNYMR---LSPQFAGMVVLASTTLHNIA 272
R F R RR+IEN+FGIL K C Y R + ++ A+ LHN
Sbjct: 164 LPIRESVFNYRLSRVRRVIENTFGILASK--CHIYRRSIIAGEKTINAIIKATVVLHNF- 220
Query: 273 LTIDRDNENQN 283
+ I D + N
Sbjct: 221 IKISEDEKGIN 231
>gi|119226353|gb|AAI28985.1| LOC100036828 protein [Xenopus laevis]
Length = 326
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/276 (21%), Positives = 113/276 (40%), Gaps = 20/276 (7%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + + +L + P L T +S +E++ L ++ G + + +++
Sbjct: 1 MSISSFDELLTVLKPGLCRAHTLMMDPISPEERLCLTLRFLATGQSFSSLYFRFPIGRTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN 129
I + + E +I + ++ P+EN I F + FPN G + G + I P
Sbjct: 61 IGKIVRETCLLIWSELQRLVMPTPDENAWIHIAEDFYKTTNFPNCLGAMGGKRIQIKMPF 120
Query: 130 EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR- 188
+ + + ++ ++A+ NY F ++ GS DA +S + + E
Sbjct: 121 KSGSEKYS-----SVVLLAVVDANYCFSIIDVGAYGSTGDASAFWSSAMGHQLSEGALHL 175
Query: 189 -----------PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
P V +GD A+ L E ++ P + + F R RR++E +
Sbjct: 176 PLPKPLPGTAAPSMPYVFVGDEAFGLTENIMRPYPGSQMDIQKRLFNYRLLRARRMVECA 235
Query: 238 FGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIA 272
FGI K+ N ++L P F +++ A LHN
Sbjct: 236 FGIFSNKWRIFHNAIQLEPDFVDIIIKACCVLHNFV 271
>gi|331252603|ref|XP_003338842.1| hypothetical protein PGTG_20379 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 382
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 93/213 (43%), Gaps = 34/213 (15%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMA 148
+V WP E+ +R G P G VDGT + L P + + +R ++++V
Sbjct: 162 LVTWPTESEQIESSQVMRDEGLPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTV 221
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVY----RKFDEEGWRPFPDAVILGDSAYPLK 204
IC N KF + A +PGS HD+ V + V + FD++ + +L DSAY
Sbjct: 222 ICDINKKFISYLAGFPGSSHDSYVFSHMQVAQYPEKYFDQKQF-------LLADSAYTNN 274
Query: 205 EWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL----SPQFAGM 260
+++P F H R I ++ ++ F L +R + + G
Sbjct: 275 CYVVPA------------FKGKHLLKRCNINFNYHLILSCFASLREIRTQICNAEEMKGA 322
Query: 261 V--VLASTTLHNIALTIDRDNENQNVEIHDDEA 291
V ++ LHN+ D ++Q ++++DEA
Sbjct: 323 VKWIVTCIVLHNLL----ADLKDQWNDLYEDEA 351
>gi|11994169|dbj|BAB01198.1| unnamed protein product [Arabidopsis thaliana]
Length = 329
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 64/324 (19%)
Query: 16 QVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSI 75
+V F L EI T + LS KE L+ G + I G S+ +ICR I
Sbjct: 37 KVFFDLCEIIEK-TYKIRSSQNLSVKESGAIFLYICGHNASQRSIMRMFGHSQETICRKI 95
Query: 76 HEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQF 135
+EV+ +I + G +DGT V ++
Sbjct: 96 YEVLNLI------------------------------GIVGAMDGTHVPAMVSGRDHQRY 125
Query: 136 VNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVI 195
NR ++N++A+C + F + GS HD +V+ +M EG FP I
Sbjct: 126 WNRKSICSMNILAVCNFDMLFTYIYVGVFGSAHDTKVLSLAM-------EGDPNFPHPHI 178
Query: 196 ----LGDSAYPLKEWLIPP-----------RTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
L DS Y L+ + P + + P ++F H R +IE +FG+
Sbjct: 179 GKYYLVDSGYALRRGYLGPFRQTWYHHNQFQNQAPPNNHKEKFNWRHSLLRCVIERTFGV 238
Query: 241 LKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGE 296
K K+ + Y ++ + +++A+ LHN + + + ++ D +D+
Sbjct: 239 WKGKWRIMQDRAWYNIVTTR---KIMVATMALHNFVWKLGIPDLDFDI----DWMQDNDH 291
Query: 297 EEAVDDPDEIDEEEQPRQDRNNPG 320
+DD DE E++ ++ G
Sbjct: 292 HPTLDDEDETVEQDMTGSRQHMEG 315
>gi|449687326|ref|XP_004211426.1| PREDICTED: uncharacterized protein LOC101237335 [Hydra
magnipapillata]
Length = 198
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 85/195 (43%), Gaps = 5/195 (2%)
Query: 12 LSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S T E +L +AP + T +S E++ L ++ G I A++ S S+
Sbjct: 1 MSPTTYEELLSFVAPIIVKQRTTMRDPVSPSERLAVTLRFLVTGDAQCTIAASYRISAST 60
Query: 71 ICRSIHEVVPVIVDTFFQR---MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDA 127
I R I E I + +R V ++ I F FP+ G +DG +++ A
Sbjct: 61 ISRIISETCAAIWTSLKERNFLHVPLEKQEWKTIAKEFENMWNFPHAIGAIDGKHIVMQA 120
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN-SMVYRKFDEEG 186
P+ ++ N H++ ++A+C Y+F V+ G D V N S+ Y + +
Sbjct: 121 PHNGGSEYFNYKKTHSIVLLAVCNAKYEFTMVDIGDSGRQSDCSVFNNCSLGYAIENNKL 180
Query: 187 WRPFPDAVILGDSAY 201
P P+ +++ Y
Sbjct: 181 NIPDPELLVIQKRYY 195
>gi|242069533|ref|XP_002450043.1| hypothetical protein SORBIDRAFT_05g027460 [Sorghum bicolor]
gi|241935886|gb|EES09031.1| hypothetical protein SORBIDRAFT_05g027460 [Sorghum bicolor]
Length = 390
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/261 (20%), Positives = 108/261 (41%), Gaps = 32/261 (12%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------- 90
L +++V L + G +G++ G + H V +I F + M
Sbjct: 94 LCLEDRVAIALRRLNSGGSLVTVGSSVGVN--------HSTVSLITWRFIEAMEERASHH 145
Query: 91 VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNVM 147
++WP+ + + I F + G PN G VD T + + + + ++++ ++++ +
Sbjct: 146 LRWPDSSEMEKIKSKFQKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDQEKNYSMVLQ 205
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWR------PFPDA-----VIL 196
A+ + +F + WPGS+ ++ ++ +S ++ K E+G R D ++
Sbjct: 206 AVVDLDTRFIDIVTGWPGSMKESSILHSSGLF-KLWEKGERLNGSKFKVSDGSEIGEYLI 264
Query: 197 GDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ 256
GDS YPL WL+ P + F H R + + K+ + L P
Sbjct: 265 GDSGYPLLPWLLTPYQEKELTESSTEFNSRHSAARIVAPRTLAKFKDTWKFLQGEMWRPD 324
Query: 257 FAGM--VVLASTTLHNIALTI 275
+ ++ LHNI + +
Sbjct: 325 KHKLPRIIHVCCLLHNIIIDL 345
>gi|242084962|ref|XP_002442906.1| hypothetical protein SORBIDRAFT_08g004725 [Sorghum bicolor]
gi|241943599|gb|EES16744.1| hypothetical protein SORBIDRAFT_08g004725 [Sorghum bicolor]
Length = 651
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 105/261 (40%), Gaps = 30/261 (11%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
++ +EK+ L+ + + Y + S +I R I +V VI + R VK P
Sbjct: 72 GITVEEKLGHFLYMISHNASYEDLQHEFHHSGETIHRHIKDVFNVIPSLTY-RFVKSPTG 130
Query: 97 NLNDIPMTFLRKG--GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
++ ++ F N G +DGT V + + + + NR + NVM +C +
Sbjct: 131 TETHWKISTDQRFFPYFQNCLGFIDGTHVPVTISQDLQAPYRNRKNTLSQNVMLVCDLDL 190
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---- 210
F +++ W GS DARV+ ++M+ G+ L D Y I P
Sbjct: 191 NFSFISSGWEGSATDARVLSSAML------GGFNVPHGKYYLVDGGYANTPSFIAPYRGV 244
Query: 211 ------------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QF 257
R RN + R H R IE + G+LK++FP L P +
Sbjct: 245 RYYLKEFGRGQQRPRNYKELFNHR----HAILRNHIERAIGVLKKRFPILKVGTHHPIEN 300
Query: 258 AGMVVLASTTLHNIALTIDRD 278
+ +A+ HN+ ++ D
Sbjct: 301 QVKIPVAAVVFHNLIWMLNGD 321
>gi|270015657|gb|EFA12105.1| hypothetical protein TcasGA2_TC002251 [Tribolium castaneum]
Length = 446
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 60/130 (46%), Gaps = 10/130 (7%)
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
F GF V G + V I +P+ E + F NR ++N IC K V A W
Sbjct: 325 FFEVDGFIRVVGAI----VKIQSPSGEDVEIFRNRKSYFSINTQMICTAELKITNVVARW 380
Query: 164 PGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
PGS D + NS + F+ +R + ++LGDS YP K +L P NP A+QR
Sbjct: 381 PGS-DDGTIFNNSQIQADFERRTYR---NCLLLGDSGYPKKTYLFTP-LLNPLTPAEQRH 435
Query: 224 LRCHKRTRRL 233
+ ++L
Sbjct: 436 NTSQYQKQKL 445
>gi|195386252|ref|XP_002051818.1| GJ10264 [Drosophila virilis]
gi|194148275|gb|EDW63973.1| GJ10264 [Drosophila virilis]
Length = 364
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 128/325 (39%), Gaps = 36/325 (11%)
Query: 6 FKEAFRLSATQVEFILEEIAPALT-----TPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
FKE FR+ + E +L+ + A+ P R +S EK+L L + + +
Sbjct: 48 FKEQFRMQRSSFEKLLQAVGKAIAGAEHHQPIGR---VSLPEKLLYTLTLLSGKKSFREV 104
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
G + SKSS I + V R VKWP +N P V G +D
Sbjct: 105 GDSFAISKSS-GHEIFKWVTAAFAALMPRYVKWPADNACGGSAI----SQLPGVVGVIDE 159
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + P E V +L + A+C +F V+ + S + ++++ + R
Sbjct: 160 CRIPLKLPVREE---VGHLQYASLALQAVCDERSRFLDVHIDV--SDNQCVLLKSELFER 214
Query: 181 KFD-EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI----- 234
D EE P P ++G+ YPL L+ P N + CH R R +
Sbjct: 215 LIDMEEPLMP-PHKHLVGEMMYPLLLNLMTPYADN-----NGELTPCHIRYNRAVHLWNA 268
Query: 235 --ENSFGILKEKFPCLNYMRLSPQFAGMVVLASTT-LHNIALTIDRDNENQNVEIH---D 288
E +F L +F L + + G +V+++T LHN L E+ ++ D
Sbjct: 269 PAERAFAALMSRFGRLKSLDVGTMEVGSIVVSATCMLHNFILDCGEPIEDATLDFEMLPD 328
Query: 289 DEAEDDGEEEAVDDPDEIDEEEQPR 313
++ EE + + D E+ R
Sbjct: 329 NQNSIVFEEHGLVNWDHTSAAEKKR 353
>gi|183985722|gb|AAI66249.1| Unknown (protein for MGC:185250) [Xenopus (Silurana) tropicalis]
Length = 473
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 120/295 (40%), Gaps = 29/295 (9%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPY---- 57
P F R+S + + +L+ + P L T A+S E+++ L ++ GS +
Sbjct: 136 PEKFFAYIRMSISTFDELLKLVHPHLHRMDTNMRQAISPAERLVVTLRFLATGSTFAALH 195
Query: 58 --HIIG-ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+IG AT G C++I V +V M + E I F K FPN
Sbjct: 196 YQFLIGRATIGMIVRETCKTIWNVTKDLV------MPEPNTEKWMKIAEGFYEKTDFPNC 249
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DG + + P + ++ L +A+ NY F ++ GS DA
Sbjct: 250 IGALDGKHIRVTRPPNTVSKSCSKVFFTVL--LALVDSNYCFTYIDVGAYGSDGDASGFF 307
Query: 175 NSMVYRKFDEEGW-----RPFPDA-------VILGDSAYPLKEWLIPPRTRNPDAAADQR 222
S + + +E +P P VI+ D A+ + ++ P +
Sbjct: 308 KSNLGKMVNEGKLNFPANKPLPGTKGPALPYVIVADEAFGISTNVMRPYPIRNLTGTKRA 367
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
F R RR++E +FGIL K+ + ++L+ F ++ + LHN+ L D
Sbjct: 368 FNYRLTRARRMVECAFGILANKWRVFHSAIQLNTAFVDDIIKCACVLHNLVLLRD 422
>gi|449666348|ref|XP_004206332.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 204
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 5/175 (2%)
Query: 12 LSATQVEFILEEIAPALTT-PTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + E +L +AP++T P + +S E++ L ++ G +++
Sbjct: 1 MSPERYEHLLSMVAPSITKKPCQSRQIISPSERLTVTLRYLATGDSQLTQAFYFRLGRTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEENLND---IPMTFLRKGGFPNVAGCVDGTMVLIDA 127
C +E I D +K P LN+ + F + FPN G +DG V I A
Sbjct: 61 ACNITNETTKAIWDVLKPCYLKAPS-TLNEWEELANQFENEWNFPNCIGAIDGKHVCIKA 119
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
P+ + N H++ ++AIC Y F V+ G +DA + S + + F
Sbjct: 120 PSLSGSAYCNYKNFHSMVLLAICDAKYCFTLVDIGSYGRDNDASIFNESKMGKAF 174
>gi|156375358|ref|XP_001630048.1| predicted protein [Nematostella vectensis]
gi|156217061|gb|EDO37985.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 114/291 (39%), Gaps = 35/291 (12%)
Query: 8 EAFRLSATQVEFILEEIAPAL---------TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
E F +S +IL L + P E+ VA++ L + GSPY+
Sbjct: 2 EKFHVSRNTFMYILNTFGSQLARKDTLMRESVPVEKRVAVA--------LFRLATGSPYN 53
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
G G + + E V+V + +K+P EE+ F K P +AG
Sbjct: 54 KTGEQFGIGRCTSLNIKDEFCEVLVKNA-KDFIKFPTTEEDTRAAIDGFEGKCQIPQLAG 112
Query: 117 CVDGTMVLIDAPN--EHEDQFVNRNGD-HALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
+ I AP+ H+ ++ N N H++ + A+ + KF V+ +PG + + +
Sbjct: 113 AIKAVRFKIKAPSLVTHKMKYYNDNLKCHSMTLQAVTDSSGKFLDVSTGYPGGITNTEAL 172
Query: 174 RNSMVYRKFDEEGWRPFPDAVILG---------DSAYPLKEWLIPPRTRNPDAAADQ-RF 223
S +Y+ + P A + G D++Y L++W I P N + Q RF
Sbjct: 173 AMSDIYKNVKQGVILNSPSATVEGLQVKPIIATDASYKLRKWCICPYPDNNMLSPSQERF 232
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIA 272
+ E++ L+ ++ L L + VL LHNI
Sbjct: 233 NSELGSAVHIAEDALARLRGRWNTLAGSCLDENVEKIPETVLICCVLHNIC 283
>gi|357168077|ref|XP_003581471.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 333
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 91/222 (40%), Gaps = 35/222 (15%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G VDGT + P + ++ + NR + N M C + KF
Sbjct: 84 PHSILRKPKFYPFFKNCIGDVDGTHTPMKLPLDQQEPYRNRKQTISQNAMVACDFDLKFV 143
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR-TR-- 213
+N W GS DARV+++++ G+ L D+ Y ++L P R TR
Sbjct: 144 HINPGWEGSASDARVLQDAL------NHGFEVPDGKFYLVDAGYANTPQFLAPYRGTRYH 197
Query: 214 -NPDAAADQR-------FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLA 264
N A Q+ F H + R IE GI K +FP L P+ + + +A
Sbjct: 198 LNEQGRALQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFPILKVASHFPKEKQIDISVA 257
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
LHN + IH+ + + DPD+I
Sbjct: 258 CAVLHNF------------IRIHNGDMTWPSDATMDIDPDQI 287
>gi|301117620|ref|XP_002906538.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107887|gb|EEY65939.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 231
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 68/148 (45%), Gaps = 7/148 (4%)
Query: 110 GFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
GF N G DGT++ L P + + + +R +A+N + C + V W GSVH
Sbjct: 6 GFVNCVGITDGTLLPLATKPRHNGEDYYSRKSSYAVNALVTCDDIARVRNVVIGWLGSVH 65
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA---AADQRFLR 225
D RV NS V + D + +LGDSA+ +IP + P A ++ F
Sbjct: 66 DNRVWSNSPVCQNPDLHFEH---NEYLLGDSAFQASRTMIPAYKKPPKADMHPSNSYFNN 122
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRL 253
+ R E+ G+LK +F L +R+
Sbjct: 123 QLAKARIKSEHCIGLLKMRFQYLREVRV 150
>gi|356546197|ref|XP_003541517.1| PREDICTED: uncharacterized protein LOC100780130 [Glycine max]
Length = 436
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 116/281 (41%), Gaps = 29/281 (10%)
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
Y +I T G S+ + + H+++ + MV+ I + F + G
Sbjct: 135 YCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFKDCIG 194
Query: 117 CVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+DGT + + +R+G+ + NV+A C + +F V + W GS HD++V+ ++
Sbjct: 195 AIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKVLSDA 254
Query: 177 MVYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPR------------TRNPDAAADQRF 223
+ + G + L D +P +++L P R +P+ +
Sbjct: 255 LARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPENEKELFN 310
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIALTIDRDN 279
LR H R +IE FGI K +F + +P F +VLA LHN R +
Sbjct: 311 LR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNFLRKECRSD 366
Query: 280 ENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
E VE D+ + +D D + +EQ R+D N
Sbjct: 367 EFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 406
>gi|198413941|ref|XP_002119367.1| PREDICTED: hypothetical protein [Ciona intestinalis]
Length = 99
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
Query: 95 EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
EE L I F +K FP V GC+DG+ + + AP + + + NR +L + A+ Y
Sbjct: 5 EEALK-IAARFEKKSHFPRVMGCIDGSHIAVTAPLQRKKDYTNRKDWDSLVLQAVVDDLY 63
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYR 180
F ++ PGS HDA V +NS +++
Sbjct: 64 CFRNISVKMPGSSHDATVFKNSGLFK 89
>gi|356530983|ref|XP_003534058.1| PREDICTED: uncharacterized protein LOC100811756 [Glycine max]
Length = 481
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 81/186 (43%), Gaps = 23/186 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG + P + + +F N+ G + NV+A C + +F V W GSV D+
Sbjct: 246 FKDCIGVIDGMNIPAHVPAKDQSRFRNKKGILSQNVLAACTFDLQFIFVYPGWEGSVTDS 305
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRT---------RNPDAA-- 218
RV+R + ++ + P L D Y E I P R +
Sbjct: 306 RVLRAVL---DNPDQNFPQIPQGKYYLVDKGYLNTEGFIAPYQGVRYQHYEFRGANQLPR 362
Query: 219 -ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHNIAL 273
A + F H R I SF +LK +FP L +L+PQ++ +V+A LHN
Sbjct: 363 NAKELFNHRHCFLRNAILRSFNVLKTRFPIL---KLAPQYSFQIQRDIVIAGCVLHNFIR 419
Query: 274 TIDRDN 279
+R++
Sbjct: 420 REERND 425
>gi|224056104|ref|XP_002298728.1| predicted protein [Populus trichocarpa]
gi|222845986|gb|EEE83533.1| predicted protein [Populus trichocarpa]
Length = 433
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/334 (20%), Positives = 143/334 (42%), Gaps = 33/334 (9%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHII 60
P +FK FR+ ++ E++ + P L P + LS++ ++ L + GS Y I
Sbjct: 99 PDSFKTTFRMRSSTFEWLSGLLEPLLECRDPIGTPINLSSELRLGIGLFRLATGSSYIEI 158
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVD 119
G ++S + R + + ++ T F+ + +P L + G PN G +D
Sbjct: 159 AGRFGVTES-VTRFCAKQLCRVLCTNFRFWIAFPTSTELQLVSKDIEGLTGLPNCCGVID 217
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
T + N+ + + D ++ V + + + ++ A + G +D+R+++++ +
Sbjct: 218 CTRFNVVKRNDCKLASDDEVQDDSIAVQIVVDSSSRILSIIAGFRGDKNDSRILKSTTLC 277
Query: 180 RKFDEEGWRPFP-----------DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
D EG R D ++GD YPL WL+ P +++++F +
Sbjct: 278 H--DIEGRRLLNATPVIVNGVAIDQYLIGDGGYPLLPWLMVPFVDVVPGSSEEKFNAANN 335
Query: 229 -------RTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNEN 281
RT ++N +G+L + P + + F G A + LHN+ L R++++
Sbjct: 336 LMHVFALRTIASLKN-WGVLNK--PVEEEFKTAVAFIG----ACSILHNVLLM--REDDS 386
Query: 282 QNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQD 315
+++ D D + D E + E+ D
Sbjct: 387 ALIDVEDYSLYDQDSQFYKDAMTEENLTEKKASD 420
>gi|115474393|ref|NP_001060793.1| Os08g0106900 [Oryza sativa Japonica Group]
gi|113622762|dbj|BAF22707.1| Os08g0106900 [Oryza sativa Japonica Group]
Length = 629
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 60/149 (40%), Gaps = 14/149 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V W G+ HDA
Sbjct: 149 FKDCIGAIDGTHIRASVRKNMESSFRGRKSHATQNVMAAVDFDLRFTYVLVGWEGTAHDA 208
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR----------TRNPDAAAD 220
V+R+++ E G R L D+ Y K+ +PP NP
Sbjct: 209 VVLRDALER----ENGLRIPQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQNEK 264
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLN 249
+ F H R +E +FG LK +F L+
Sbjct: 265 ELFNLRHSSLRITVERAFGSLKRRFKILD 293
>gi|390359618|ref|XP_003729523.1| PREDICTED: apical endosomal glycoprotein-like [Strongylocentrotus
purpuratus]
Length = 892
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 40/60 (66%)
Query: 214 NPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIAL 273
+P A+++R+ R HKR R +IE +FG +K +FPCLN +RL Q V++A L NI++
Sbjct: 6 HPGTASEERYNRAHKRGRNVIERTFGRMKRRFPCLNGLRLKLQTTLTVIVAVAVLWNISI 65
>gi|356561128|ref|XP_003548837.1| PREDICTED: uncharacterized protein LOC100801437 [Glycine max]
Length = 306
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQENGGLVRT---RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K ++ LN P T +G F G +D T + + +
Sbjct: 146 RAVM--------KVSKDYLNFQPCTL--EGAEANKWRWFERCIGALDETHIPVTVSPDER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|356566240|ref|XP_003551342.1| PREDICTED: uncharacterized protein LOC100791416 [Glycine max]
Length = 422
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 117/293 (39%), Gaps = 29/293 (9%)
Query: 45 LSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT 104
+ + +G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 123 IGTIAIVGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRES 182
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F + G +DGT + + +R+G+ + NV+A C + +F V + W
Sbjct: 183 TRFYPYFKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWE 242
Query: 165 GSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------R 211
GS HD++V+ ++ + G + L D +P + + P
Sbjct: 243 GSAHDSKVLSVALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGH 298
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTT 267
+P+ + LR H R +IE FGI K +F + +P F +VLA
Sbjct: 299 GNDPENEKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAA 354
Query: 268 LHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
LHN R +E VE D+ + +D D + +EQ R+D N
Sbjct: 355 LHNFLRKECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 406
>gi|403173656|ref|XP_003332716.2| hypothetical protein PGTG_14381 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170638|gb|EFP88297.2| hypothetical protein PGTG_14381 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 319
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 69/158 (43%), Gaps = 9/158 (5%)
Query: 90 MVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ-FVNRNGDHALNVMA 148
+++WP + GF G +DG++V++ E + Q + +R G + + +
Sbjct: 165 LLEWPTAAARQETKAHYGEVGFKGCVGLIDGSLVVLSTCPEKDGQDYYSRKGFYCIATLL 224
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWL 207
+C + V WPG HD R+M N + + ++ F D +L DSA+
Sbjct: 225 VCDQHKNITYVFTGWPGCSHDMRLMTNCALSKSPNQY----FSDGEYLLADSAFVPTLTT 280
Query: 208 IPP--RTRNPDAAADQR-FLRCHKRTRRLIENSFGILK 242
+P R RN +Q F R R IEN G+LK
Sbjct: 281 VPAYKRKRNKQLTDEQTDFNRHLSGVRVAIENCIGLLK 318
>gi|449664611|ref|XP_004205962.1| PREDICTED: uncharacterized protein LOC101240376 [Hydra
magnipapillata]
Length = 194
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 79/188 (42%), Gaps = 14/188 (7%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-- 94
++SA+E+ + + ++ G I ++ S S + R I+E I + F++ P
Sbjct: 4 SISARERFYATVRFLVTGDAQKTIAISYRMSPSVVGRIINETCRAIWVSLFKKKYIDPPS 63
Query: 95 -EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
E+ L I F + FP+ G +G V++ P F N H++ ++A+C
Sbjct: 64 SEKELKKIATNFENRWNFPHCLGANNGKHVVMQTPGCSGSSFYNYKKTHSIVLLAVCNVK 123
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEE-----------GWRPFPDAVILGDSAYP 202
Y+F + G D V NS + + + G V +GD+A+
Sbjct: 124 YQFSLSDIENSGRQSDGSVYANSQLGYAIENDLLDIPQACKVNGTETIFPYVFVGDNAFG 183
Query: 203 LKEWLIPP 210
LK +I P
Sbjct: 184 LKPQMIKP 191
>gi|449688564|ref|XP_004211776.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 133
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 10/128 (7%)
Query: 104 TFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANW 163
+F + GFP V VD + I AP + + ++NR H++ + + Y F + W
Sbjct: 3 SFEKLYGFPQVVRAVDKCQIRIKAPYNNSEDYINRKEYHSIILQCLADSRYFFRDIFVGW 62
Query: 164 PGSVHDARVMRNSMVYRKFDEEGWRP-----FPDAV-----ILGDSAYPLKEWLIPPRTR 213
D+RV++ S +Y++ + + P D + ILGDS Y L+ WL+ P +
Sbjct: 63 TSKSRDSRVLKISPLYKECLAKTFLPNNLNKIIDNIEIGPHILGDSTYSLENWLMKPYSD 122
Query: 214 NPDAAADQ 221
+ + ++
Sbjct: 123 RGNLSVEE 130
>gi|297604178|ref|NP_001055055.2| Os05g0269800 [Oryza sativa Japonica Group]
gi|255676197|dbj|BAF16969.2| Os05g0269800 [Oryza sativa Japonica Group]
Length = 529
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 22/231 (9%)
Query: 33 ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVK 92
E + + +E+V LH +G ++ + S ++ R ++V+ I + +++
Sbjct: 16 EDTIHMCIEEQVAMFLHTVGHNLRNRLVRTNYSRSGETVSRYFNKVLHAIGE-LRDELIR 74
Query: 93 WPEENLNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
P D P F + G +DGT + E F R NVMA
Sbjct: 75 PPSL---DTPTKIAGNPRWDPYFKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMA 131
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+ +F V A W G+ HDA V+R+++ E G L D+ Y K+ +
Sbjct: 132 AVDFDLRFTYVLAGWEGTAHDAVVLRDALER----ENGLHVPQGKFYLVDAGYGAKQGFL 187
Query: 209 PPR----------TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
PP NP + F H R +E +FG LK +F L+
Sbjct: 188 PPFRAVRYHLKEWGNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKVLD 238
>gi|242087131|ref|XP_002439398.1| hypothetical protein SORBIDRAFT_09g005770 [Sorghum bicolor]
gi|241944683|gb|EES17828.1| hypothetical protein SORBIDRAFT_09g005770 [Sorghum bicolor]
Length = 535
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/279 (20%), Positives = 108/279 (38%), Gaps = 17/279 (6%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
F+ AFR+S + + + A+ T A+ +++V CL + P I
Sbjct: 205 FRRAFRMSRATFGALCDALVGAVAKEDTALRTAIPVRQRVAVCLWRLATAEPLREISRRF 264
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTM 122
G S+ + +V + + WPE + + F G P + G V T
Sbjct: 265 GLGISTCHSIVLQVYHALATVLRPTAISWPEPDSAAATVAARFEAASGLPGIIGAVYTTR 324
Query: 123 VLIDAPNEHEDQFVNRN-------GDHALNVMAICGPNYKFYAVNANW--PGSVHDARVM 173
V + P + + +R +++ V A+ + F V W PGS+ DA ++
Sbjct: 325 VPVVTPKANVAAYYDRGLTDRRQRASYSVAVQAVADADGAFTDV---WIEPGSLSDAAIL 381
Query: 174 RNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRL 233
S + G + ++G ++YPL +W++ P + F R R +
Sbjct: 382 GRSALSGLLALVGGH-GQEQRLVGGTSYPLMDWMLVPYAHQNLTWTEHAFNERVARARGV 440
Query: 234 IENSFGILKEKFPCLN-YMRLSPQFAGMVVLASTTLHNI 271
++ LK ++ CL + Q ++ A LHN+
Sbjct: 441 ARDAVRRLKARWRCLQRRSEVKMQDLPSMIAACCVLHNV 479
>gi|328704310|ref|XP_001949214.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 257
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 83/213 (38%), Gaps = 28/213 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FPN G +DG + + N G ++ ++ +C Y+F V+ G + D
Sbjct: 4 FPNCLGAMDGKHITFRPRRADGAFYHNYKGTDSIVLLGVCDAKYRFICVDVGQTGRMSDG 63
Query: 171 RVMRNSMVYRKFDEEGWR-----PFPDA--------VILGDSAYPLKEWLIPPRTRNPDA 217
V NS + FP++ V + D A+ L E L+ P RN
Sbjct: 64 GVYNNSKLSSAIKTNALNFPDDITFPNSKSKIVVPHVFIADDAFALSERLMKP-FRNRLL 122
Query: 218 AADQRFLRCH-KRTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTI 275
++ R RR+IEN+FGIL +F LN M L + LHN L
Sbjct: 123 TKEELIYNYRVSRARRVIENAFGILANRFRILLNSMNLCVNKVEKITQCCVILHNFLLI- 181
Query: 276 DRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDE 308
EN + I++ E + +DD I E
Sbjct: 182 ----ENNLLYINN-------EVDKIDDIGNIIE 203
>gi|20270088|gb|AAM18176.1|AC092172_36 Hypothetical protein [Oryza sativa Japonica Group]
gi|31430919|gb|AAP52771.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 113/290 (38%), Gaps = 33/290 (11%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ FRL+A ++ + + + + V + EK+ L+ MG P
Sbjct: 62 RAMFRLTAEEIHNLHDLLVTSFGLVGTAWVC--SMEKLGIFLYTMGGNRPIRDANNRWVR 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFP---NVAGCVDGTMV 123
S S++ H V+ + D +++K + N D L++ F G VDGT +
Sbjct: 120 SNSTVSVHFHHVLQAMND-LAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHI 178
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ Q NR+ NV+ IC + + +A WPGSVHD R++ ++ +D
Sbjct: 179 PVLPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILNEAVQAYPYD 238
Query: 184 EEGWRPFPDA----VILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRC 226
FP +L DS +P + + P P ++ F
Sbjct: 239 ------FPRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREETFNHR 292
Query: 227 HKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIAL 273
H R ++E FGI K+ + L Y R ++ A+ LHN L
Sbjct: 293 HSTLRGIVERQFGIAKKMWKILKEIPYYR-DEDIPARIIHAAFALHNFRL 341
>gi|390354585|ref|XP_003728361.1| PREDICTED: uncharacterized protein LOC100890829 [Strongylocentrotus
purpuratus]
Length = 220
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/213 (22%), Positives = 88/213 (41%), Gaps = 18/213 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMVL 124
K +I + + +V IVD + ++ P N ++ F + P+ G +G +
Sbjct: 9 GKETIQKFVPDVARAIVDEYAAEVISLPTTNEGWLEVAGDFEARWNLPHCLGAYNGKHIH 68
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
+ PN+ + N ++ +MA+ Y+F ++ G DA++ NS + ++ E
Sbjct: 69 LQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFMWIDVGGVGHQSDAQIYNNSEL-KECIE 127
Query: 185 EGWRPFPD--------------AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
G PD +GD A+ ++ ++ P + F R
Sbjct: 128 AGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTDMMKPYGHRNMVQKQKIFNYRLSRA 187
Query: 231 RRLIENSFGILKEKFPC-LNYMRLSPQFAGMVV 262
R ++EN+FGIL F C L MR P ++V
Sbjct: 188 RHVVENAFGILALWFQCFLGQMRQEPDTVRLLV 220
>gi|242060480|ref|XP_002451529.1| hypothetical protein SORBIDRAFT_04g003338 [Sorghum bicolor]
gi|241931360|gb|EES04505.1| hypothetical protein SORBIDRAFT_04g003338 [Sorghum bicolor]
Length = 222
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 71/176 (40%), Gaps = 19/176 (10%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P + F R D NVM + + KF V A W GS HDA ++ +
Sbjct: 4 GAIDGTHVLARVPRNMQQAFRGRKKDPTQNVMVVVDFDLKFTYVLAGWEGSAHDALILSD 63
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR----------TRNPDAAADQRFLR 225
++ ++G+ L D+ Y K +PP N A + F
Sbjct: 64 AIE----RQDGFTMPQGKYYLVDAGYACKNGFLPPYRGVRYHLSEFGSNRPTNAQELFNL 119
Query: 226 CHKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIALTIDRD 278
H R +E + G LK +F L+ + + Q +V+ +HN L D
Sbjct: 120 RHSSLRVTVERAIGALKNRFRILDNKPFHKYKTQLK--LVIDCCIIHNWILGFGTD 173
>gi|390357388|ref|XP_792559.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 182
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 37/174 (21%)
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA---------VILGDSAYPLKE 205
+F + WPG VHDARV+ NS +Y+K P ++GD AYPL
Sbjct: 5 RFTDILVGWPGCVHDARVLSNSSLYQKAQAGTLLPLQTKEIEGVQVPLFVIGDPAYPLLP 64
Query: 206 WL---------IPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP- 255
W+ + PR +N F R R ++E +FG+LK ++ L + S
Sbjct: 65 WMMKGYADCGRLTPRQQN--------FNYRLSRARMVVERAFGLLKMRWRSLYKINESKV 116
Query: 256 QFAGMVVLASTTLHNIA------LTIDRDNEN----QNVEIHDDEAEDDGEEEA 299
+ +V A+ LHNI L +D EN QN H + A D+ E
Sbjct: 117 ENVIHMVTAACVLHNICEIARDTLDLDLYEENIRQQQNHHHHPERAADNNRAET 170
>gi|55168096|gb|AAV43964.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 561
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 22/231 (9%)
Query: 33 ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVK 92
E + + +E+V LH +G ++ + S ++ R ++V+ I + +++
Sbjct: 48 EDTIHMCIEEQVAMFLHTVGHNLRNRLVRTNYSRSGETVSRYFNKVLHAIGE-LRDELIR 106
Query: 93 WPEENLNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
P D P F + G +DGT + E F R NVMA
Sbjct: 107 PPSL---DTPTKIAGNPRWDPYFKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMA 163
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI 208
+ +F V A W G+ HDA V+R+++ E G L D+ Y K+ +
Sbjct: 164 AVDFDLRFTYVLAGWEGTAHDAVVLRDALER----ENGLHVPQGKFYLVDAGYGAKQGFL 219
Query: 209 PPR----------TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
PP NP + F H R +E +FG LK +F L+
Sbjct: 220 PPFRAVRYHLKEWGNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKVLD 270
>gi|390360557|ref|XP_003729720.1| PREDICTED: uncharacterized protein LOC752922 [Strongylocentrotus
purpuratus]
Length = 325
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/303 (21%), Positives = 118/303 (38%), Gaps = 41/303 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
FK R+S + +L + P L T A+ K+ L + G+ Y +
Sbjct: 28 FKNYTRVSPEMFDDLLVRLTPHLQKKDTHFRKAIPPGLKLSLFLRHLATGATYAELSYNF 87
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVD-GTMV 123
K +I + + +V IVD + ++ P G ++ G +
Sbjct: 88 RVGKETIQKFVPDVARAIVDEYAAEVISLPTTKY-----------------GWLEPGKHI 130
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ PN+ + N ++ +MA+ Y+F ++ G DA++ + + ++
Sbjct: 131 HLQKPNKSGSLYFNYKQFFSVVLMALVDSKYQFMWIDVGGVGHQSDAQIYNSEL--KECI 188
Query: 184 EEGW---------------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
E G P P +GD A+ ++ +++ P R +
Sbjct: 189 EAGTLGIPDPDPLPHDDEEHPMP-YFFVGDDAFAMRTYMMKPYGRRNMVQKQKIXHYRLS 247
Query: 229 RTRRLIENSFGILKEKFPC-LNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIH 287
R RR+ EN+FGIL +F C L MR P ++V A LHN+ I + + +V +
Sbjct: 248 RARRVFENAFGILALRFQCFLGQMRHEPDTVRLLVGAVVMLHNL---IRKRYQALDVRML 304
Query: 288 DDE 290
D E
Sbjct: 305 DQE 307
>gi|328707695|ref|XP_001951939.2| PREDICTED: hypothetical protein LOC100163229 [Acyrthosiphon pisum]
Length = 614
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 111/291 (38%), Gaps = 24/291 (8%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTP-TERNVALSAKEKVLSCLHWMGVGSPYHI 59
M P FK +R+S + + L T +++S +E++L L ++ +G
Sbjct: 146 MYPEKFKNFYRMSQESFNVLSHMVRTKLQKKDTNFRLSISVEERLLITLRFLAMGGNMKA 205
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EENLNDIPMTFLRKGGFPNVAGCV 118
+S+ I E VI D + P +E+ + F PN G +
Sbjct: 206 HSMYFLRGESTTRTIISETSQVIWDCLHDIYMPIPSKEHWKKVADRFYDLWNIPNCIGAI 265
Query: 119 DGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
DG I P+ + N ++ +MA + F ++ G D RV R S +
Sbjct: 266 DGKHFKIKCPSNTGSAYFNYKHYFSVVLMACVDADGLFLTIDVGDYGRNSDGRVFRRSSL 325
Query: 179 YRKFDEE-----------GWR---PFPDAVILGDSAYPLKEWLI---PPRTRNPDAAADQ 221
+ GW FP + D A+PLK ++ P R+ N +
Sbjct: 326 GITLENNALDIPEPKVLPGWENKDKFPHYFV-ADEAFPLKTNIMRPFPKRSLNKERRIYN 384
Query: 222 RFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI 271
RC R RR +E SFG+L KF + + A ++ A+ LHN
Sbjct: 385 --YRC-SRARRSVECSFGMLVSKFRLFEQPIGCKVETAEALIKAACVLHNF 432
>gi|7594564|emb|CAB88131.1| putative protein [Arabidopsis thaliana]
Length = 775
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 66/152 (43%), Gaps = 31/152 (20%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT V + E + + NR+ + +LN+MAIC N F + PGS HD
Sbjct: 633 FSGFVGAMDGTHVCVKVKPELQGMYWNRHDNASLNIMAICDLNMLFTYIWNGVPGSCHDT 692
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLI---------------------- 208
V++ + ++ D E P + L DS YP K+ +
Sbjct: 693 VVLQ---IAQQSDSEFPLPPSEKYYLVDSCYPNKQGFLALYRSSRNRVVRYHMSQFYPGP 749
Query: 209 PPRTRNPDAAADQRFLRCHKRTRRLIENSFGI 240
PPR ++ + F +CH R +IE + G+
Sbjct: 750 PPRNKH------ELFNQCHTSLRSVIERTVGV 775
>gi|40786599|gb|AAR89874.1| expressed protein [Oryza sativa Japonica Group]
Length = 608
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 70/177 (39%), Gaps = 20/177 (11%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT V P E F R NVMA + +F V A W G+ HD V+R+
Sbjct: 142 GAIDGTHVRASVPKNMESSFRGRKNHATQNVMAAVDFDLRFTYVLAGWEGAAHDVVVLRD 201
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--RTR--------NPDAAADQRFLR 225
++ E G L D Y K +PP TR NP + F
Sbjct: 202 ALER----ENGLHVPQGKFYLVDVGYGAKPGFLPPFRSTRYHLNEWGNNPVQNEKELFNL 257
Query: 226 CHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMVVLASTTLHNIALTIDRD 278
H R +E +FG LK +F L+ + Q +V+A +HN + D D
Sbjct: 258 RHSSLRITVERAFGSLKRRFKILDDATPFFPFQTQVN--IVVACCIIHNWVINDDID 312
>gi|58264374|ref|XP_569343.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57225575|gb|AAW42036.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 429
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 31/232 (13%)
Query: 91 VKWPEEN-LNDIPMTFLRKGGFPN-VAGCVDGTMV-LIDAPNEHED-QFVNRNGDHALNV 146
V WP+EN I F + P+ G +DG V P H+ ++ G + N+
Sbjct: 147 VYWPDENERKAIDRHFEEEEDIPDGCVGIIDGFHVPFAYKPARHDAVDSLSYKGRYGFNI 206
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW---------RPFPDAVI-- 195
+ IC + +P S HDAR+ +N ++ + + + R +I
Sbjct: 207 LGICDHLKRIRYFQYGYPASAHDARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQ 266
Query: 196 ----LGDSAYPLKEWLIP----PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
L DSA+P +W +P R +N A + +F + R IE+++GILK ++
Sbjct: 267 GEYLLADSAFPAGDWCVPLFKRRRGQNDLDAPEAKFNKKCSSARVKIEHAYGILKNRWQS 326
Query: 248 LNYMRLSPQ------FAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
L +R+ + A + A LHN L ID + ++ DE +D
Sbjct: 327 LRNLRVKIRNVRDEGVATCWIRACVVLHN--LLIDTGDWYNQLDGDADEPDD 376
>gi|134110141|ref|XP_776281.1| hypothetical protein CNBC6700 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258953|gb|EAL21634.1| hypothetical protein CNBC6700 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 429
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 31/232 (13%)
Query: 91 VKWPEEN-LNDIPMTFLRKGGFPN-VAGCVDGTMV-LIDAPNEHED-QFVNRNGDHALNV 146
V WP+EN I F + P+ G +DG V P H+ ++ G + N+
Sbjct: 147 VYWPDENERKAIDRHFEEEEDIPDGCVGIIDGFHVPFAYKPARHDAVDSLSYKGRYGFNI 206
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW---------RPFPDAVI-- 195
+ IC + +P S HDAR+ +N ++ + + + R +I
Sbjct: 207 LGICDHLKRIRYFQYGYPASAHDARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQ 266
Query: 196 ----LGDSAYPLKEWLIP----PRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
L DSA+P +W +P R +N A + +F + R IE+++GILK ++
Sbjct: 267 GEYLLADSAFPAGDWCVPLFKRRRGQNDLDAPEAKFNKKCSSARVKIEHAYGILKNRWQS 326
Query: 248 LNYMRLSPQ------FAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
L +R+ + A + A LHN L ID + ++ DE +D
Sbjct: 327 LRNLRVKIRNVRDEGVATCWIRACVVLHN--LLIDTGDWYNQLDGDADEPDD 376
>gi|15987055|gb|AAL11884.1|AF412282_1 putative transposase [Zea mays]
Length = 298
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+S +EKV L +G G+ +I +++G S I R +EV+ ++ + +K P+
Sbjct: 95 VSVEEKVAIFLLVVGHGTKMRMIRSSYGWSLEPISRYFNEVLRGVL-SLCHEFIKLPD-- 151
Query: 98 LNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
P+ + F + G +DGT + + P + ++ NR NV+ +C +
Sbjct: 152 ----PLAVQPEDSKWRWFEDCLGALDGTHIDVFVPLADQGRYRNRKQQITTNVLGVCDRH 207
Query: 154 YKFYAVNANWPGSVHDARVMRNSM 177
KF V A W GS D+RV+R++M
Sbjct: 208 MKFVYVLAGWEGSASDSRVLRDAM 231
>gi|357119672|ref|XP_003561559.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 332
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 72/186 (38%), Gaps = 22/186 (11%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G VDGT + + P + ++ + NR + N M C + KF
Sbjct: 84 PHSILRKPKFYPFFKNCIGAVDGTHIPMKLPLDQQEPYRNRKQTISXNAMVACDCDLKFV 143
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------- 210
+N W GS DA+V+++++ G+ L D+ Y + P
Sbjct: 144 HINLGWEGSASDAKVLQDAL------NHGFEVPDGKFYLVDAGYANTPQFLAPYRGTRYH 197
Query: 211 -----RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLAS 265
R R + F H + R IE GI K +FP + + +A
Sbjct: 198 LNEQGRARQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFPIQVASHFPKEKQIDISVAC 257
Query: 266 TTLHNI 271
LHN
Sbjct: 258 AVLHNF 263
>gi|331232148|ref|XP_003328736.1| hypothetical protein PGTG_10037 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 159
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 12/121 (9%)
Query: 86 FFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDG-TMVLIDAPNEHEDQFVNRNGDHAL 144
+F M+ WP ++ F+++ GF G VDG T+ L P + + + +++
Sbjct: 19 YFFPMITWPTKSEGIELSQFMQEEGFSGCVGFVDGKTIPLSQKPPKDGQHYFDHKKRYSI 78
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD----AVILGDSA 200
+VM IC N KF A +PGS HDA V + V W+P D ++L DSA
Sbjct: 79 SVMVICDINKKFIFYLAGFPGSSHDAYVFLHMQV-------AWQPEKDFNQKQLLLADSA 131
Query: 201 Y 201
Y
Sbjct: 132 Y 132
>gi|359483992|ref|XP_003633049.1| PREDICTED: uncharacterized protein LOC100853879 [Vitis vinifera]
Length = 469
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 90/209 (43%), Gaps = 23/209 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + +DG+ + N+ ++ R N++A C + KF V W GS ++
Sbjct: 108 FKDCIRAIDGSHFRVKVSNDVVQRYRGRKYYPTQNILAACSFDLKFTYVLPGWEGSASNS 167
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPP-----------RTRNPDAA 218
R++ N++ R FD+ P L D+ + LK + P P+ A
Sbjct: 168 RILDNALK-RDFDK---LIVPQGKYYLADAGFQLKTGFLTPYRSTCYHLKEYSVHQPENA 223
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIALT 274
+ LR H R IE +FG+LK++FP + P + ++LA LHN +
Sbjct: 224 KEVFNLR-HSSLRNAIERAFGVLKKRFPIIA-SGTKPHYPVDTQSDIILACCILHNYLMG 281
Query: 275 IDRDNENQNVEIHDDEAEDDGEEEAVDDP 303
+D NE VE+ + + E E++ P
Sbjct: 282 VD-PNEILIVEVDRELFSKETEFESMAHP 309
>gi|356565852|ref|XP_003551150.1| PREDICTED: uncharacterized protein LOC100782375 [Glycine max]
Length = 518
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 73/163 (44%), Gaps = 27/163 (16%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E V LH + Y ++ ++ SK +I R ++V+
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAVTMFLHILAHKLKYRVVQFSYCRSKETISRQFNDVL 145
Query: 80 -----PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQ 134
P ++ +W F G +DGT + + + +
Sbjct: 146 RANFHPCTLEGAEANKWRW-----------------FERCIGALDGTHIPVTVSLDERPR 188
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 189 YHNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 231
>gi|225428725|ref|XP_002285007.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 449
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/283 (19%), Positives = 113/283 (39%), Gaps = 25/283 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS +++++ P + N++L + V L + G + + +
Sbjct: 124 WRSLYGLSYPVFTTVVDKLKPYIAL---SNLSLPSDYAVAMVLSRLSHGFSAKTLSSRYS 180
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
I + + V ++ + +K P L++ F PN+ G +DG+ V
Sbjct: 181 LEPYLISKITNMVTRLLATKLYPEFIKIPVSRRRLHETTQAFEELTSLPNMCGAIDGSPV 240
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I N + R G ++ + + F+ V PG DA +R+S++Y +
Sbjct: 241 KIR--NNSASSYRCRYGFQSVLLQVVADHKKIFWDVCVKAPGGTDDATHLRDSLLYNRLT 298
Query: 184 EEG--W-----------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKR 229
W RP+ I+GD YPL +L+ P + + + Q F +
Sbjct: 299 SGDIVWDKVINVRNHHVRPY----IVGDWCYPLLSFLLTPFSPSGSGTSGQNLFDAALMK 354
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
R ++ + G+LK ++ L + + A ++A LHN+
Sbjct: 355 GRSVVVEAIGLLKGRWRILQDLNVGMNHAPQTIVACCVLHNLC 397
>gi|322795514|gb|EFZ18220.1| hypothetical protein SINV_08457 [Solenopsis invicta]
Length = 97
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 90 MVKWPEENLNDIPMT---FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNV 146
++KWP+E+ ++ +T F K G NV G +D T + I P + + NR ++N+
Sbjct: 4 VIKWPQEH--EVIITCEKFSLKRGIQNVLGAIDSTHIQIIKPTSNAQDYCNRKKFFSINL 61
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
A+ + +F + PGS+HDARV R S++Y
Sbjct: 62 QAVVDSDMRFINIYCGEPGSLHDARVFR-SLLY 93
>gi|322778990|gb|EFZ09397.1| hypothetical protein SINV_01358 [Solenopsis invicta]
Length = 185
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 68/142 (47%), Gaps = 13/142 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH- 64
F FRLS V+ +++ I + P R AL A++K+L L ++ ++ +G+
Sbjct: 42 FMRLFRLSKDMVQRVID-IVEEYSDPISRRSALDAEQKILYYLGFVFTALRFYAVGSYQL 100
Query: 65 --------GPSKSSICRSIHEVVPVIVD-TFFQRMVKWPE--ENLNDIPMTFLRKGGFPN 113
G S+SS+ R I +V V+ F V +P E L I F + GFP
Sbjct: 101 GIGCNGYIGVSQSSVSRCIRDVNIVLNHPNVFNEWVYFPRNLEELTTIRNAFYTEYGFPG 160
Query: 114 VAGCVDGTMVLIDAPNEHEDQF 135
V GC+D T V I P+++ + +
Sbjct: 161 VVGCIDCTHVAIFPPSKNNEMY 182
>gi|115454089|ref|NP_001050645.1| Os03g0608700 [Oryza sativa Japonica Group]
gi|113549116|dbj|BAF12559.1| Os03g0608700, partial [Oryza sativa Japonica Group]
Length = 249
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 69/175 (39%), Gaps = 16/175 (9%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT V P E F R NVMA + +F V A W G+ HD V+R+
Sbjct: 30 GAIDGTHVRASVPKNMESSFRGRKNHATQNVMAAVDFDLRFTYVLAGWEGAAHDVVVLRD 89
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--RTR--------NPDAAADQRFLR 225
++ E G L D Y K +PP TR NP + F
Sbjct: 90 ALER----ENGLHVPQGKFYLVDVGYGAKPGFLPPFRSTRYHLNEWGNNPVQNEKELFNL 145
Query: 226 CHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV--VLASTTLHNIALTIDRD 278
H R +E +FG LK +F L+ F V V+A +HN + D D
Sbjct: 146 RHSSLRITVERAFGSLKRRFKILDDATPFFPFQTQVNIVVACCIIHNWVINDDID 200
>gi|388518663|gb|AFK47393.1| unknown [Lotus japonicus]
Length = 155
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 68/159 (42%), Gaps = 19/159 (11%)
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
P +F R NV+A C + KF V A W G+ D+R++++++ E+
Sbjct: 2 VPRAQAPRFRGRKDYPTQNVLAACDFDMKFTYVLAGWEGTTSDSRILKDALT----REDP 57
Query: 187 WRPFPDAVILGDSAYPLKEWLIPP-----------RTRNPDAAADQRFLRCHKRTRRLIE 235
R +GD+ Y LK L+ P R P + LR H R LIE
Sbjct: 58 LRIHEGKFYIGDAEYMLKRGLLTPYRGVKYHLKEYSARGPQNYRELFNLR-HSSLRNLIE 116
Query: 236 NSFGILKEKFPCLNY---MRLSPQFAGMVVLASTTLHNI 271
+F +LK++FP ++ S ++LA LHN
Sbjct: 117 RTFRVLKKRFPIISSGTEAHYSVDIMTDIILACCILHNF 155
>gi|301624193|ref|XP_002941412.1| PREDICTED: hypothetical protein LOC100487167 [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 120/295 (40%), Gaps = 29/295 (9%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPAL-TTPTERNVALSAKEKVLSCLHWMGVGSPY---- 57
P F R+S + + +L+ + P L T A+S E+++ L ++ GS +
Sbjct: 133 PEKFFAYIRMSISTFDELLKLVHPHLHRMDTNMRQAISPAERLVVTLRFLATGSTFAALH 192
Query: 58 --HIIG-ATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNV 114
+IG AT G C++I V +V M + E I F K FPN
Sbjct: 193 YQFLIGRATIGMIVRETCKTIWNVTKDLV------MPEPNTEKWMKIAEGFYEKTDFPNC 246
Query: 115 AGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +DG + + P + ++ L +A+ +Y F ++ GS DA
Sbjct: 247 IGALDGKHIRVTRPPNTVSKSCSKVFFTVL--LALVDSSYCFTYIDVGAYGSDGDASGFF 304
Query: 175 NSMVYRKFDEEGW-----RPFPDA-------VILGDSAYPLKEWLIPPRTRNPDAAADQR 222
S + + +E +P P VI+ D A+ + ++ P +
Sbjct: 305 KSNLGKMVNEGKLNFPANKPLPGTKGPALPYVIVADEAFGISTNVMRPYPIRNLTGTKRA 364
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLASTTLHNIALTID 276
F R RR++E +FGIL K+ + ++L+ F ++ + LHN+ L D
Sbjct: 365 FNYRLTRARRMVECAFGILANKWRVFHSAIQLNTAFVDDIIKCACVLHNLVLLRD 419
>gi|345488406|ref|XP_001600321.2| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 340
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 80/194 (41%), Gaps = 10/194 (5%)
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F R P + G VDG MV P +E+ F N ++NV I +Y + P
Sbjct: 120 FQRTTRIPGIIGAVDGFMVTFYRPTINEEAFFNYRVGTSMNVQIIVDSDYNILNIRV-CP 178
Query: 165 GSVHDARVMRNSMVYRKF-DEEGWRPFPD-AVILGDSAYPLKEWLIPPRTRNPDAAADQR 222
GS +D V + S D FP+ I+GDS Y L+ + +
Sbjct: 179 GSNNDRFVWQFSEAKEYMEDLRADENFPNRYYIIGDSGYTPSRVLLTSDLNAAEGSPAHV 238
Query: 223 FLRCHKRTRRLIENSFGILKEKFPCLNYMR---LSPQFAGMVVLASTTLHNI----ALTI 275
+ H RTR ++E + GIL + +N R SP+ ++ A LHN +
Sbjct: 239 YTLEHVRTRCIVERTIGILTNVWLVINRSRKLHYSPEAVVNIIHACAVLHNFRRRQGILD 298
Query: 276 DRDNENQNVEIHDD 289
+ D+ Q E+ DD
Sbjct: 299 ENDHNFQPAELRDD 312
>gi|449692739|ref|XP_004213152.1| PREDICTED: uncharacterized protein LOC101239759, partial [Hydra
magnipapillata]
Length = 226
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 78/194 (40%), Gaps = 30/194 (15%)
Query: 29 TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQ 88
TT T R +S ++V LH + Y ++ + G KS+
Sbjct: 2 TTTTMRE-PISVVKRVAVALHNLASCEEYRVVSSLFGIGKST-----------------A 43
Query: 89 RMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMA 148
++ ENLN F GFP G VDG + I AP E + N G +++ + A
Sbjct: 44 NLI----ENLNKHSRDFEAILGFPQCVGTVDGCHIPILAPKEQATSYYNYKGWYSIVLFA 99
Query: 149 ICGPNYKFYAVNANWPGSVHDARVMRN--------SMVYRKFDEEGWRPFPDAVILGDSA 200
+ Y+F + PG +D ++ N S ++ K +E ++GDSA
Sbjct: 100 VVDCRYRFIYTSVGLPGRNNDIYILPNFPLEGILKSNLFDKCCKELGDSLVSLCLIGDSA 159
Query: 201 YPLKEWLIPPRTRN 214
+PL L+ P N
Sbjct: 160 FPLTRHLLKPHPEN 173
>gi|348686436|gb|EGZ26251.1| hypothetical protein PHYSODRAFT_327165 [Phytophthora sojae]
Length = 391
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 90/225 (40%), Gaps = 31/225 (13%)
Query: 6 FKEAFRLSATQVEFILEEIAPAL------TTPTERNVALSAKEKVLSCLHWMGVGSPYHI 59
F+ +R+SA E +L I PAL + P +S + + + W+ GS YH
Sbjct: 64 FQRCYRMSAESFERLLSFIRPALVRDELQSRPRTGTDPISPEMMLQMTISWLA-GSRYHT 122
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN------LNDIPMTFLRKGGFPN 113
G S ++I ++H V+ I D + +++ P E+ L D +
Sbjct: 123 TRNLGGTSVTAIYITMHAVMDAICDC-PELIIRAPTESAERMYELADGFKAISKDSVLTG 181
Query: 114 VAGCVDGTMVLIDAPNEHE-----DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
GC+DG + I AP+ E F + LNV IC +F N PG
Sbjct: 182 CVGCIDGWLCEIRAPSAREVPDVAAFFSGHYQKYGLNVQVICDSLSRFTGYCFNSPGKKW 241
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTR 213
K EE + I+GD+ YPL + L+ P ++
Sbjct: 242 ------------KLQEEIMQLPAGFYIIGDNVYPLSDSLMVPFSK 274
>gi|224123456|ref|XP_002330319.1| predicted protein [Populus trichocarpa]
gi|222871354|gb|EEF08485.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/279 (18%), Positives = 111/279 (39%), Gaps = 15/279 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS +++++ P +T N++L + V L + G + + +
Sbjct: 113 WRTLYGLSYPVFTTVVDKLKPHITA---SNLSLPSDYAVSMVLSRLAHGFSAKTLASRYS 169
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
I + + V ++ + +K P L + F PN+ G +DG+ +
Sbjct: 170 LEPYLISKITNMVTRLLATKLYPEFIKIPVSRRRLIETTKAFEELTSLPNMCGAIDGSPI 229
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF- 182
+ + + + R G ++ + + F+ V PG DA +R S++Y +
Sbjct: 230 KVKCGDNEGNVYKCRYGYSSVLLQVVADHKKMFWDVCVKAPGGSDDASHLRESLLYNRLV 289
Query: 183 -DEEGWRPFPDA-------VILGDSAYPLKEWLIPPRTRNPDAA-ADQRFLRCHKRTRRL 233
+ W D I+GD YPL +L+ P + N A F + R +
Sbjct: 290 SGDVVWDKVIDVRGHHVRPYIVGDWCYPLLSFLMTPFSPNGSGTPAQNLFDGMLMKGRSV 349
Query: 234 IENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
+ ++ +LK ++ L + + A ++A LHN+
Sbjct: 350 VVDAIALLKGRWKILQDLNVGLHHAPQTIVACCVLHNLC 388
>gi|397595562|gb|EJK56513.1| hypothetical protein THAOC_23584, partial [Thalassiosira oceanica]
Length = 242
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 88/214 (41%), Gaps = 28/214 (13%)
Query: 91 VKWPEENLN--DIPMTFLRKG--GFPNVAGCVDGTMVLIDAPNEHEDQ---------FVN 137
+K+PE +I F RK G G +DG ++ I P++ + + F
Sbjct: 26 IKFPESRFEQAEIARGFKRKSSIGIDCCVGAIDGILIWIHKPSDSDAESIGFGPIKFFCG 85
Query: 138 RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW------RPFP 191
R + LN+ +C N F V +PG+ D S + +K +E G+ F
Sbjct: 86 RKKKYGLNMQGVCDANGYFLDVEIRFPGAASDFYAFDESALKKKLEESGFLKSISGSSFA 145
Query: 192 DAVIL-GDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY 250
L GD+AY +++ P R A A + + R IE +FG+L ++ L
Sbjct: 146 TTYCLFGDNAYVQSPYMVVP-WRKVGAGAKDAYNFYQSQLRINIECAFGMLVHRWGMLRK 204
Query: 251 ---MRLSPQFAGMVVLASTTLHNIAL----TIDR 277
++ +VLA LHN + TIDR
Sbjct: 205 PVPNNITVLRTTRLVLALCKLHNYCITRRVTIDR 238
>gi|390343604|ref|XP_798400.2| PREDICTED: uncharacterized protein LOC593848 [Strongylocentrotus
purpuratus]
Length = 336
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 3/189 (1%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK R+SA + +L + P L T A+ K+ L + G+ Y +
Sbjct: 126 FKNYTRVSAEMFDDLLVRLTPHLQKDTHYRKAIPPGLKLSVFLRHLATGATYAELSYNFR 185
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLN--DIPMTFLRKGGFPNVAGCVDGTMV 123
K +I + + +V IVD + ++ P N ++ F + P+ G DG +
Sbjct: 186 VGKETIQKFVPDVARAIVDEYAAEVISLPTTNKGWLEVAGDFEARWNLPHCLGAYDGKHI 245
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I PN+ + N ++ +MA+ Y F ++ G DA++ NS + ++
Sbjct: 246 RIKKPNKSGSLYFNYKQFFSVVLMALVDSKYHFLLIDVGGVGHQSDAQIYNNSEL-KECI 304
Query: 184 EEGWRPFPD 192
E G PD
Sbjct: 305 EAGTLGIPD 313
>gi|356528952|ref|XP_003533061.1| PREDICTED: uncharacterized protein LOC100796294 [Glycine max]
Length = 394
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 94/227 (41%), Gaps = 29/227 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 161 FKDCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 220
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P +P+
Sbjct: 221 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 276
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIAL 273
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 277 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNFLR 332
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D + +EQ R+D N
Sbjct: 333 KECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 378
>gi|336385307|gb|EGO26454.1| hypothetical protein SERLADRAFT_360716 [Serpula lacrymans var.
lacrymans S7.9]
Length = 250
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 90/206 (43%), Gaps = 28/206 (13%)
Query: 118 VDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGP-NYKFYAVNANWPGSVHDARVMRN 175
VDGT L P H + F +R +++L+ P N K GSVHD +
Sbjct: 42 VDGTTFNLFQKPGHHGETFFDRKSNYSLSAQITTLPHNLKIVDYGLGHTGSVHDLSAFCD 101
Query: 176 SMVYR---KFDEEG-WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCH-K 228
+ +Y+ KF G W + GDSAYP+ + P R P+ A QR H
Sbjct: 102 TAIYQTPEKFFTAGEW-------LWGDSAYPVSPTCVLPFKRVPNQALTRQQRHFNYHLS 154
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLAST------TLHNIALTID------ 276
R + +++ G+LK +F L + + + + A + LHN + I+
Sbjct: 155 RVQIRSKHAIGLLKIRFQSLKELCIQINCSQQIDYAISWIRCCFILHNFIIRIEGQDHNS 214
Query: 277 RDNENQNVEIHDDEAEDDGEEEAVDD 302
R+N N++ ++ DE DG E+ DD
Sbjct: 215 RNNINRDWQLELDELLADGIEDVGDD 240
>gi|390595323|gb|EIN04729.1| hypothetical protein PUNSTDRAFT_24189, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 301
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 118 VDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
VDGT+V L D P+ + + + +R +++LN+ I PN + + GS HD+ + +
Sbjct: 208 VDGTLVPLYDRPHWYGESYYDRKCNYSLNIQVISLPNLQIIDFGYGFTGSTHDSTAWKET 267
Query: 177 MVYRKFDE--EGWRPFPDAVILGDSAYPLKEWLIPP 210
VYR+ + EG I DSAYP+K W+I P
Sbjct: 268 RVYREHEMILEG-----QEWIWADSAYPVKHWVIAP 298
>gi|156375356|ref|XP_001630047.1| predicted protein [Nematostella vectensis]
gi|156217060|gb|EDO37984.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 113/291 (38%), Gaps = 35/291 (12%)
Query: 8 EAFRLSATQVEFILEEIAPAL---------TTPTERNVALSAKEKVLSCLHWMGVGSPYH 58
E F +S +IL L + P E+ VA+ L + GSPY+
Sbjct: 2 EKFHVSRNTFMYILNTFGSQLARKDTLMRESVPVEKRVAVG--------LFRLATGSPYN 53
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAG 116
G G + + E V+V + +K+P EE+ F K P +AG
Sbjct: 54 KTGEQFGIGRRTSLYIKDEFCEVLVKNA-KYFIKFPTTEEDTRAAIDGFEGKCQIPQLAG 112
Query: 117 CVDGTMVLIDAPN--EHEDQFVNRNGD-HALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
+ I AP+ H+ ++ N N H++ + A+ + KF V+ +PG + + +
Sbjct: 113 AIKAVRFKIKAPSLVTHKMKYYNDNLKCHSMILQAVTDSSGKFLDVSTGYPGGITNTEAL 172
Query: 174 RNSMVYRKFDEEGWRPFPDAVILG---------DSAYPLKEWLIPPRTRNPDAAADQ-RF 223
S +Y+ + P A + G D++Y L++W I P N + Q RF
Sbjct: 173 AISDIYKNVKQGVILNSPSATVEGLQVKPIIATDASYKLRKWCICPYPDNNTLSPSQERF 232
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIA 272
+ E++ L+ ++ L L + VL LHNI
Sbjct: 233 NSELGSAVHIAEDALARLRGRWNTLARSCLDENVEKIPETVLICCVLHNIC 283
>gi|403173268|ref|XP_003332347.2| hypothetical protein PGTG_13732 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170202|gb|EFP87928.2| hypothetical protein PGTG_13732 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 108/265 (40%), Gaps = 30/265 (11%)
Query: 6 FKEAFRLSATQVEFILEEI---------APALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
FK+A R + ++L+ I +P P +AL+ L L G G+
Sbjct: 88 FKQAVRTTKDGFLWLLDRIRHHPIFYSNSPRPQLPIPHQLALT-----LERLGSNGNGAS 142
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAG 116
+ ++ + V+ I + +V WP + + + GF G
Sbjct: 143 VGRFSRNLSVGRGTVIKVSRRVIRAINEVSTNYIV-WPNKERRAEISEVMTEEGFEGCIG 201
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
VDGT + L P + + +R +++N IC + + WPG+ D+ V +N
Sbjct: 202 FVDGTTIPLHQRPGLDGEVYWDRKKQYSINCQVICDCDRFITSFMTGWPGTCGDSLVFKN 261
Query: 176 SMVY----RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQ--RFLRCHKR 229
V+ R FD P ++ DSAY L IP ++P + ++ F C +
Sbjct: 262 MKVHLEPDRFFD-------PGQYLIADSAYALSMTTIPA-YKSPLSKINRNTEFNYCLAK 313
Query: 230 TRRLIENSFGILKEKFPCLNYMRLS 254
+R E++ GILK ++ L + LS
Sbjct: 314 SRVRNEHTIGILKGRWASLQQLWLS 338
>gi|356546142|ref|XP_003541490.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 448
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 95/253 (37%), Gaps = 34/253 (13%)
Query: 54 GSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPN 113
+PY + S CR+ V + D + + F PN
Sbjct: 166 AAPYPAVSRRFAISPPDACRAFFAVCKALADNLGHLFELRTDSERVVVGFGF---SSLPN 222
Query: 114 VAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
G + T ID D + NG AL V A+ +F V+A WP ++ +
Sbjct: 223 CFGVLGFTRFKID------DSLLGENG--ALIVQALVDSEGRFLDVSAGWPSTMKPEIFL 274
Query: 174 RNSMVYRKFDE-------------EGWRPFPDAVILGDSAYPLKEWLIPPRTR----NPD 216
R S +YR+ ++ EG ILGDS +PL WL+ P R +
Sbjct: 275 RESKLYREVEQSKELLNGPSYNLSEG--CLIPQYILGDSCFPLLPWLLTPYNRVNEEDSF 332
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ----FAGMVVLASTTLHNIA 272
+A++ F H L+ ++FG L+ ++ L R Q VV+A LHN
Sbjct: 333 GSAERAFNCVHGNAMELVGDAFGRLRARWRLLAASRKWKQECVEHLPFVVVACCLLHNFV 392
Query: 273 LTIDRDNENQNVE 285
+ + ++ V
Sbjct: 393 VKFNEAMPDEGVS 405
>gi|449677532|ref|XP_004208870.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 266
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 101/230 (43%), Gaps = 22/230 (9%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
FR + ++ + L + ++ + +VL L + G P+ I+ G S +
Sbjct: 50 FRFRRASIFYLENVLKTDLQLKSNQSFCVPPIYQVLCALRFYATG-PFQIVI---GDSSA 105
Query: 70 SICR-SIHEVVPVIVDTFFQRM---VKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
++ + +I ++P + + +R+ +K+P LN+ + F FP + G T
Sbjct: 106 ALSQPTITRIIPRVSLSLAKRVNECIKYPTNPHVLNESKVNFYNVAEFPRITGLTAHTF- 164
Query: 124 LIDAPNEHEDQFVNRNGDHAL--NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
A H + DH + ++ C + Y V A WPGS HDAR++R S + K
Sbjct: 165 ---AYKNHASTNMFMLIDHQIIQSMSKRCVIITESYIV-AKWPGSTHDARILRESKLGIK 220
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+ + ++ GDS YP WL+ P +P AA +++ +R R
Sbjct: 221 MIDATLK----GLLFGDSGYPCFRWLLTPYL-SPTTAARRQYNLSLRRKR 265
>gi|356573553|ref|XP_003554922.1| PREDICTED: uncharacterized protein LOC100809505 [Glycine max]
Length = 531
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/287 (22%), Positives = 115/287 (40%), Gaps = 29/287 (10%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 238 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPY 297
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +D T + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 298 FKDCIGAIDSTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 357
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P +P+
Sbjct: 358 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 413
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNIAL 273
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 414 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNFLR 469
Query: 274 TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D + +EQ R+D N
Sbjct: 470 KECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 515
>gi|336372493|gb|EGO00832.1| hypothetical protein SERLA73DRAFT_51376 [Serpula lacrymans var.
lacrymans S7.3]
Length = 281
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 90/206 (43%), Gaps = 28/206 (13%)
Query: 118 VDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGP-NYKFYAVNANWPGSVHDARVMRN 175
VDGT L P H + F +R +++L+ P N K GSVHD +
Sbjct: 73 VDGTTFNLFQKPGHHGETFFDRKSNYSLSAQITTLPHNLKIVDYGLGHTGSVHDLSAFCD 132
Query: 176 SMVYR---KFDEEG-WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA--DQRFLRCH-K 228
+ +Y+ KF G W + GDSAYP+ + P R P+ A QR H
Sbjct: 133 TAIYQTPEKFFTAGEW-------LWGDSAYPVSPTCVLPFKRVPNQALTRQQRHFNYHLS 185
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLAST------TLHNIALTID------ 276
R + +++ G+LK +F L + + + + A + LHN + I+
Sbjct: 186 RVQIRSKHAIGLLKIRFQSLKELCIQINCSQQIDYAISWIRCCFILHNFIIRIEGQDHNS 245
Query: 277 RDNENQNVEIHDDEAEDDGEEEAVDD 302
R+N N++ ++ DE DG E+ DD
Sbjct: 246 RNNINRDWQLELDELLADGIEDVGDD 271
>gi|55167956|gb|AAV43825.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1067
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 60/149 (40%), Gaps = 14/149 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V A W G+ HDA
Sbjct: 78 FKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTTHDA 137
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----------RTRNPDAAAD 220
V+R+++ E R L D+ Y K+ +PP NP
Sbjct: 138 VVLRDALER----ENSLRVPQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQNEK 193
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLN 249
+ F H R +E +FG LK +F L+
Sbjct: 194 ELFNLRHSSLRVTVERAFGSLKRRFKVLD 222
>gi|326430471|gb|EGD76041.1| hypothetical protein PTSG_00750 [Salpingoeca sp. ATCC 50818]
Length = 1299
Score = 58.5 bits (140), Expect = 5e-06, Method: Composition-based stats.
Identities = 61/273 (22%), Positives = 112/273 (41%), Gaps = 37/273 (13%)
Query: 23 EIAPALTTPT----ERNVALSAKEKVLSCLHW----MGVGSPYHIIGATHGPSKSSICRS 74
++APA PT + + SA VLS + W V ++G+ P++ +
Sbjct: 308 QVAPATRLPTYIPPATSASPSAAGSVLSGVAWGRRCFAVKLCLRLLGSQLSPTQLAAEWH 367
Query: 75 IHEVVPVIVDTFFQRMVKWPEENLNDIPMTF--------LRKGG--FPNVAGCVDGTMVL 124
+ E ++ W + I +TF +R+ P+ G + +
Sbjct: 368 VKE----------SDLMLWTRQVTEAIVITFSPSLSVHNIREADPLLPSCLGVLHHDYIG 417
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I P + F N + ++ V A+ G + KFY+V A +PGSV +R + + R
Sbjct: 418 IVRPARDNEFFRNVDNRPSIGVQAVVGHDDKFYSVAAGFPGSVKHDLCLRQTPLCR---- 473
Query: 185 EGWRPFPDAVILGDSAYPLKEWLIP--PRTRNPDAAADQRFLRCHKRTRRLIENSFGILK 242
+G R ++ + +PL WL+ P + A R + C ++ + F ++
Sbjct: 474 QGARALYPGYLVAGATFPLLSWLVTPFPAPQTAKQMALNRIISCRQQRAGSV---FARVR 530
Query: 243 EKFPCLNYMRLSPQFAGMVVLASTTLHNIALTI 275
+FP L L +AG +A +HN+ L +
Sbjct: 531 NRFPRLEQFDLGLDWAGTAAVACCLIHNLCLDM 563
>gi|255583084|ref|XP_002532309.1| conserved hypothetical protein [Ricinus communis]
gi|223527978|gb|EEF30061.1| conserved hypothetical protein [Ricinus communis]
Length = 420
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 81/196 (41%), Gaps = 22/196 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FPN G + ID+ + +NG ++ V A+ +F ++A WP ++
Sbjct: 203 FPNCCGVLGFGKFAIDS------DVLGKNG--SMLVQALVDSEGRFLDISAGWPCTMKPN 254
Query: 171 RVMRNSMVYRKFDEEGWRPFPD--------AVILGDSAYPLKEWLIPPRTR----NPDAA 218
+ + +Y + E P ILGDS +PL WL+ P R + +
Sbjct: 255 SIFTQTKLYSRIQELLSGPCCQLNDGNSIPQYILGDSCFPLFPWLLTPYIRPNEEDSFGS 314
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIALTID 276
A++ F H R RL+ +FG +K ++ L + +F V++ LHN +
Sbjct: 315 AEREFNAAHNRAMRLVNTAFGRVKARWQLLARRWKEECVEFFPFVIVMGCLLHNFLIKCS 374
Query: 277 RDNENQNVEIHDDEAE 292
++V + E E
Sbjct: 375 EPVAEESVGCSEVEEE 390
>gi|20514796|gb|AAM23241.1|AC092553_7 Putative transposase [Oryza sativa Japonica Group]
gi|21326484|gb|AAM47612.1|AC122147_1 Putative transposase [Oryza sativa Japonica Group]
gi|110288571|gb|ABB46678.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 535
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 63/161 (39%), Gaps = 14/161 (8%)
Query: 99 NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYA 158
N + T + G + G +DGT + E F R NVMA + +F
Sbjct: 41 NRLVRTNYGRSGETDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTY 100
Query: 159 VNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------- 210
V A W G+ HDA V+R+++ E G L D Y K+ +PP
Sbjct: 101 VLAGWEGTAHDAVVLRDALER----ENGLHVPQGKFYLVDGGYGAKQGFLPPFRAVRYHL 156
Query: 211 --RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
NP + F H R +E +FG LK +F L+
Sbjct: 157 KEWGNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKVLD 197
>gi|390336769|ref|XP_001199715.2| PREDICTED: uncharacterized protein LOC763668 [Strongylocentrotus
purpuratus]
Length = 183
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 72/160 (45%), Gaps = 19/160 (11%)
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV------------ 194
MA+ Y+F ++ G DA++ NS + ++ E G PD
Sbjct: 1 MALVDSKYQFLWIDVGGVGHQSDAQIYNNSEL-KECIEAGTLGIPDPAPLPHDDEEHPMP 59
Query: 195 --ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYM 251
+GD A+ ++ +++ P R + F R RR++ENSFGIL +F C L M
Sbjct: 60 YFFVGDDAFAMRTYMMTPYGRRNMDQQQKIFNYRMSRARRVVENSFGILVLRFQCFLGQM 119
Query: 252 RLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
R P +++ A+ LHN+ I + + +V + D E
Sbjct: 120 RHEPDTVRLLIEAAVMLHNL---IRKHYQALDVRMLDQEG 156
>gi|307166611|gb|EFN60645.1| Putative nuclease HARBI1 [Camponotus floridanus]
Length = 184
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 74/149 (49%), Gaps = 7/149 (4%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHII 60
+T FK+ +RL+ V +++ + P L R LS +++VL+ L + G +
Sbjct: 38 LTNEEFKQLYRLTPDIVFNLIDVLEPQLQRT--RINGLSIEKQVLAALRFYATGCFQRPV 95
Query: 61 GATHG--PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLR-KGGFPNVA 115
G G S+SSI R +H+V I + F++ +++P E + F + F
Sbjct: 96 GEQWGISMSQSSISRCVHKVTDAINNLIFRQWIQFPITAEARYQAKLKFQNARHSFEGAI 155
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHAL 144
G +D + V I AP HE+ +VN +G H+L
Sbjct: 156 GAIDCSHVAIIAPRIHEEAYVNHHGYHSL 184
>gi|357139331|ref|XP_003571236.1| PREDICTED: uncharacterized protein LOC100835727 [Brachypodium
distachyon]
Length = 226
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 72/141 (51%), Gaps = 5/141 (3%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
+S +E + L +G G Y +I +T+ + +I R +E + + F +K P+
Sbjct: 72 VSVEEDLTIFLLVVGHGMKYRLIRSTYRWTLETISRHFNEGAILSLSREF---IKLPDPA 128
Query: 98 LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
++P + K FP+ G +DGT V + ++ ++ NR NV+ +C + KF
Sbjct: 129 -TELPQDNVWKW-FPDRLGALDGTHVRVRVAARNQGRYRNRKHQITTNVLGVCDRSMKFV 186
Query: 158 AVNANWPGSVHDARVMRNSMV 178
V A + GS D+RV+R++M+
Sbjct: 187 YVLAGYEGSASDSRVLRDAML 207
>gi|356519530|ref|XP_003528425.1| PREDICTED: uncharacterized protein LOC100787278 [Glycine max]
Length = 392
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 115/289 (39%), Gaps = 27/289 (9%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR 107
L +G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 96 LKIVGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRF 155
Query: 108 KGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSV 167
F + G +DGT + + +R+G+ + NV+A C + +F + + GS
Sbjct: 156 YPYFKDCIGAIDGTHIAASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYILSGLEGSA 215
Query: 168 HDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPDAAA---- 219
HD++V+ +++ + G + L D +P + + P R D A
Sbjct: 216 HDSKVLSDALARK----NGLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGND 271
Query: 220 ----DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ F H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 272 PENEKELFNLRHASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 328
Query: 272 ALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
R +E VE D+ + +D D + +EQ R+D N
Sbjct: 329 LRKECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 376
>gi|328712060|ref|XP_003244719.1| PREDICTED: hypothetical protein LOC100574275 [Acyrthosiphon pisum]
Length = 435
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 75/185 (40%), Gaps = 24/185 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
PN G +DG + I AP F N G ++ +MA + KF ++ G D
Sbjct: 56 LPNCVGSIDGKHIRIKAPANSGSAFYNYKGFFSIVLMATADADGKFITIDVGEYGRNSDG 115
Query: 171 RVMR-----NSMVYRKFD---------EEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPD 216
RV++ ++ K D EE P+ +GD A+PL L+ P R
Sbjct: 116 RVLKECAFGQQLLKNKLDLPEPSTLPGEENEPPYA-YYFVGDEAFPLMNNLLRPYPRRQL 174
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-----QFAGMVVLASTTLHNI 271
A + F R R+ IE +FG++ KF L SP + +V A LHN
Sbjct: 175 TNAKRIFNYRLSRGRKSIECAFGMMTSKFRILE----SPICRKIEKVDTLVKAICVLHNF 230
Query: 272 ALTID 276
T D
Sbjct: 231 IRTHD 235
>gi|331234515|ref|XP_003329917.1| hypothetical protein PGTG_11854 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 153
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 19/149 (12%)
Query: 83 VDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGD 141
V + V W ++ +++ GFP G VDGT + L P + + +R
Sbjct: 5 VSKYLVWYVTWSTKSEQIELSQVMQEEGFPGCVGFVDGTAISLSQKPPKDGQHYFDRKKR 64
Query: 142 HALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILG 197
++++V IC N KF + A +PGS HDA V + V ++ FD++ + +L
Sbjct: 65 YSISVTVICDINKKFISYLAGFPGSSHDAYVFLHMQVAQEPEKYFDQKQF-------LLA 117
Query: 198 DSAYPLKEWLIPPRTRNPDAAADQRFLRC 226
DSAY + +++P A + L+C
Sbjct: 118 DSAYTINCYVVP-------AFKGKNLLKC 139
>gi|55168239|gb|AAV44105.1| unknown protein [Oryza sativa Japonica Group]
Length = 1220
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 60/149 (40%), Gaps = 14/149 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V A W G+ HDA
Sbjct: 231 FKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTTHDA 290
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----------RTRNPDAAAD 220
V+R+++ E R L D+ Y K+ +PP NP
Sbjct: 291 VVLRDALER----ENSLRVPQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQNEK 346
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLN 249
+ F H R +E +FG LK +F L+
Sbjct: 347 ELFNLRHSSLRVTVERAFGSLKRRFKVLD 375
>gi|6967112|emb|CAB72466.1| putative protein [Arabidopsis thaliana]
Length = 377
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 92/224 (41%), Gaps = 21/224 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + P F NR G + NV+A C + +F + + W GS HD+
Sbjct: 143 FKDCIGAIDGTHIFAMVPTCDAASFRNRKGFISQNVLAACNFDLQFIYILSGWKGSAHDS 202
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPP-------------RTRNPD 216
+V+ +++ P+ L D Y + + P + R+P
Sbjct: 203 KVLNDALT----RNTNRLQVPEGKFYLVDCGYANRRKFLAPFRRTRYHLQDFRGQGRDPK 258
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIALTI 275
+ LR H R +IE FGI K +F P + +VLA LHN
Sbjct: 259 TQNELFNLR-HASLRNVIERIFGIFKSRFLIFKSAPPYPFKTQTELVLACAGLHNFLHQE 317
Query: 276 DRDNEN-QNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQDRNN 318
R +E I+++ ++D+ E ++ D E Q R+ NN
Sbjct: 318 CRSDEFPPETSINEENSDDNEENNYEENGDVGILESQQREYANN 361
>gi|356514841|ref|XP_003526111.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 329
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 103/236 (43%), Gaps = 24/236 (10%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T NV ++ E V LH + Y ++ ++ S I R V+
Sbjct: 107 ILQEKGQLIKTI---NVPIA--EVVAIFLHILAHNLKYRVVHFSYCRSMEIISRQFKNVL 161
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ + +K+ + NL ++ F N+ G +DG + + + ++ NR
Sbjct: 162 RAIMKVS-KEYLKFHDYNLEGSVEN--KRRWFKNLIGALDGIHIPVIVVAKDRLRYRNRK 218
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF----------DEEGWRP 189
D + NV+ +CG + +F V W GS D+RV+R+++ + + + G+
Sbjct: 219 WDISTNVLGVCGTDLRFIYVLHGWEGSAGDSRVLRDALRRQNYLHIPNGKYFLVDAGYTN 278
Query: 190 FPDAVI-LGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEK 244
P + + Y L EW+ P + LR H +IE SFG+LK++
Sbjct: 279 GPGFLAPYQGTRYHLNEWI----GNTPQNYKELFNLR-HAGAMNVIERSFGVLKKR 329
>gi|331251189|ref|XP_003338195.1| hypothetical protein PGTG_19803 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 485
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 69/155 (44%), Gaps = 13/155 (8%)
Query: 106 LRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
+ K GF G VDGT + L P + + +R +++N IC + + WP
Sbjct: 1 MEKEGFKGCIGFVDGTTIPLHQQPGIDGEVYWDRKKRYSINCQVICDCDKFITSFMTGWP 60
Query: 165 GSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAA 219
GS D+ V N V+++ FD + ++ D+AY L IP +
Sbjct: 61 GSCGDSLVFTNMKVHKEPENYFDSGQY-------LIADTAYGLSMTTIPAYKAPLSKVHR 113
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLS 254
+ F C ++R E++ GILK ++ L +RLS
Sbjct: 114 NTEFNYCLAKSRVRNEHTIGILKGRWASLQQLRLS 148
>gi|194765531|ref|XP_001964880.1| GF22751 [Drosophila ananassae]
gi|190617490|gb|EDV33014.1| GF22751 [Drosophila ananassae]
Length = 412
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/272 (22%), Positives = 111/272 (40%), Gaps = 25/272 (9%)
Query: 38 LSAKEKVLSCLHWMGVGSPY-HIIGATHGPS---KSSICRSIHEVVPVIVDTFFQRMVKW 93
+S + ++ L ++ G + + G H P+ ++ + ++ +V + TF Q +
Sbjct: 90 VSPEVRLQMTLRFLATGESFGTLTGVFHIPTADVRNIVAETLASIVKRLKRTFLQ--IPR 147
Query: 94 PEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
EE I F FP+ G VDG + E ++ + N +++ ++A+
Sbjct: 148 SEEAWLQIASDFQDLWNFPHCLGAVDGHHLAFRGGTEADESYSNYRNFNSIILLALVDAQ 207
Query: 154 YKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA------------VILGDSAY 201
++F ++A G DA S ++ E W P VIL D +
Sbjct: 208 HRFLHIDATSKGGATDA--FNQSSLFDAM-ESNWLNIPPECCLLGLDEELPHVILADQGF 264
Query: 202 PLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLS-PQFAG 259
L+ WL+ P P + F R R+ N+ GI+ KF L + L PQ
Sbjct: 265 TLQPWLMKP-YEAPANLTRKMFNYRLNRAHRVAINALGIMNSKFRALQTEINLEVPQMEK 323
Query: 260 MVVLASTTLHNIALTIDRDNENQNVEIHDDEA 291
+V A+ LHN + + ++ + +E D E
Sbjct: 324 LVT-ATGILHNFLIGEEGEDYLRGLEKEDTEC 354
>gi|301092120|ref|XP_002896232.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262094843|gb|EEY52895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 188
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 89/189 (47%), Gaps = 28/189 (14%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKE------KVLSCLHWMGVGSPYHI 59
F+ AFR++A ++L + P + +++ S +E ++++ L ++ GS Y
Sbjct: 8 FERAFRMTADSFSYLLGLLTPNIEANRQQSANSSGEEPISPPLRLMTTLRYLAGGS-YLD 66
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVD 119
I T G S SS R I+E + I+ ++V E+ ++ M +
Sbjct: 67 IRRTVGISGSSYYRVINETMFAILGLRELKIVFPYSESDREVVMNDFKA----------- 115
Query: 120 GTMVLIDAPNEHEDQFVNR--NGDHA---LNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
I + + D R +G +A +NV A+C + +F +V+A++PGS +DAR R
Sbjct: 116 -----ISSTEDAGDVGTGRYYSGHYACPGINVQAVCDAHCRFISVDASYPGSTNDARAFR 170
Query: 175 NSMVYRKFD 183
+ V R FD
Sbjct: 171 GTGVARTFD 179
>gi|242068835|ref|XP_002449694.1| hypothetical protein SORBIDRAFT_05g021710 [Sorghum bicolor]
gi|241935537|gb|EES08682.1| hypothetical protein SORBIDRAFT_05g021710 [Sorghum bicolor]
Length = 381
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/261 (19%), Positives = 105/261 (40%), Gaps = 32/261 (12%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM------- 90
L +++V L + G +G++ G + H V +I F + M
Sbjct: 83 LCLEDRVAIALRRLNSGGSLATVGSSVGVN--------HSTVSLITWRFIEAMEERASHH 134
Query: 91 VKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMV--LIDAPNEHEDQFVNRNGDHALNVM 147
++WP+ + I F + G N G VD T + + + + ++++ ++++ +
Sbjct: 135 LRWPDSGEMEKIKSKFEKIHGLTNCCGVVDTTHITMCLSSAEPNCKVWLDQEKNYSMVLQ 194
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-----------IL 196
A+ + +F + WPGS+ ++ ++ +S ++ K EG R V ++
Sbjct: 195 AVVDLDTRFTDIVTGWPGSMKESSILHSSGLF-KLCGEGERLNGSKVKVSDGSEIGEYLI 253
Query: 197 GDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQ 256
GDS YPL WL+ P + F H R + + K+ + L P
Sbjct: 254 GDSGYPLLPWLLTPYQEKDLTESSAEFNSRHSAARAVAPRTLAKFKDTWKFLQGEMWRPD 313
Query: 257 FAGM--VVLASTTLHNIALTI 275
+ ++ LHNI + +
Sbjct: 314 KHKLPRIIHVCCLLHNIIIDL 334
>gi|449683836|ref|XP_004210470.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 194
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 65/155 (41%), Gaps = 12/155 (7%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++++C I+E I D +K PE + I F + FPN G +DG V I
Sbjct: 8 RTTVCNIINETTKAIWDVLQLSYLKAPESSDEWKKIANQFENELNFPNCIGAIDGRQVCI 67
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
+AP + N H++ ++AIC Y F V+ G +D + S + F
Sbjct: 68 EAPVSSGSAYYNYKNYHSMVLLAICDAKYCFTLVDIGSYGRDNDVSIFNESKMGEAFKNN 127
Query: 186 GWRPFPD----------AVILGDSAYPLKEWLIPP 210
++ + V++ D + LK WL+ P
Sbjct: 128 LFKLTKNRQLSNGTQVPPVLIEDDIFMLKPWLMKP 162
>gi|15238244|ref|NP_199013.1| uncharacterized protein [Arabidopsis thaliana]
gi|9757943|dbj|BAB08431.1| unnamed protein product [Arabidopsis thaliana]
gi|28416671|gb|AAO42866.1| At5g41980 [Arabidopsis thaliana]
gi|110743283|dbj|BAE99532.1| hypothetical protein [Arabidopsis thaliana]
gi|332007366|gb|AED94749.1| uncharacterized protein [Arabidopsis thaliana]
Length = 374
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 94/224 (41%), Gaps = 24/224 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G VD + + + + F N NG NV+A + +F V A W GS D
Sbjct: 140 FKDCVGVVDSFHIPVMVGVDEQGPFRNGNGLLTQNVLAASSFDLRFNYVLAGWEGSASDQ 199
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR---TRNPDAAADQRFLRCH 227
+V+ ++ R + P I+ D+ YP I P + N A + F H
Sbjct: 200 QVLNAALTRRN---KLQVPQGKYYIV-DNKYPNLPGFIAPYHGVSTNSREEAKEMFNERH 255
Query: 228 KRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIAL------------- 273
K R I +FG LKE+FP L P Q +V+A+ LHN
Sbjct: 256 KLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVIAACALHNYVRLEKPDDLVFRMFE 315
Query: 274 --TIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDEEEQPRQD 315
T+ E++ V + +++ E G+E P+E+++ + R +
Sbjct: 316 EETLAEAGEDREVALEEEQVEIVGQEHGF-RPEEVEDSLRLRDE 358
>gi|357150977|ref|XP_003575641.1| PREDICTED: uncharacterized protein LOC100827806 [Brachypodium
distachyon]
Length = 705
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/299 (20%), Positives = 124/299 (41%), Gaps = 47/299 (15%)
Query: 6 FKEAFRLSATQVEFILEEI-APALTTPTERN------VALSAKEKVLSCLHWMGVGSPYH 58
F+ +++ T +++ + P L R+ L +++V L + G P
Sbjct: 7 FEHLYKMKITTFDYVCSLVRVPFLEDMMARDHRFADGRVLCLQDRVAVALRMLNSGEPPE 66
Query: 59 IIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVK---WPEEN-LNDIPMTFLRKGGFPNV 114
+G++ G +KS + + VD +++ + WP N + +I F + G PN
Sbjct: 67 TVGSSLGMNKSIVLL----ITKSFVDAMWEKAMHHLDWPGSNRIENIKYKFDKIHGLPNC 122
Query: 115 AGCVDGTMVLIDAPNEHEDQ--FVNRNGDHALN----VMAICGPNYKFYAVNANWPGSVH 168
G V H D F ++N DH +N + A+ + +F + +++
Sbjct: 123 CGVV------------HTDHITFESQNSDHEVNAGMLMQAVVDTDMRFTNIWLGSSSNMN 170
Query: 169 DARVMRNSMVYRKFDEEGWRP-----FPDA-----VILGDSAYPLKEWLIPPRTRNPDAA 218
+ ++ +S++++ +++ W D I+GD+ +PL WL+ N +
Sbjct: 171 QSSILHDSVLFKHCEKDTWLNGSKLNLSDGRQVGEYIIGDAGFPLLPWLLTSFQENDLSD 230
Query: 219 ADQRFLRCHKRTRRL-IENSFGILKEKFPCLNYMRLSP--QFAGM-VVLASTTLHNIAL 273
F R H + + + + LK+ + L+ P QF M V+ A LHNI +
Sbjct: 231 YQVEFNRRHSQAMTITLTKALAKLKDTWKFLHGGVWRPENQFEPMWVIYACCMLHNIVI 289
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 57/297 (19%), Positives = 119/297 (40%), Gaps = 50/297 (16%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMV------ 91
S +E+V L + G P +G+ G ++S+ VP++ +F M+
Sbjct: 385 FSLEERVAIALIMLNSGDPPATVGSFIGVNEST--------VPLVTKSFVDAMLERAQHH 436
Query: 92 -KWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDH----ALN 145
+WP+ + + + F G PN G + T V +R+ DH +
Sbjct: 437 LRWPQSDEMEKMKSMFDEIHGLPNCCGVLHTTHV----------TSASRSWDHFDKDSFV 486
Query: 146 VMAICGPNYKFYAV-NANWPGSVHDARVMRNSMVYRKFDEEGW------RPFPDAV---I 195
+ + P+ +F ++ A P + + + +S ++ ++ W + + V +
Sbjct: 487 LQGVIAPDMRFTSIWVAPRPANTSQSSFLHDSNLFEYCEKGAWLNGSKLKVASEEVGEYV 546
Query: 196 LGDSAYPLKEWLIPPRTRNPDAAAD----QRFLRCHKRTRRLIENSFGILKEKFPCLN-- 249
+GD YPL WL+ P + + +D F H + + L+ + ++
Sbjct: 547 IGDVGYPLLPWLLTPYYQLQNDLSDIPYQVEFNSRHSAVKNIALKVLARLEGTWKSMHGE 606
Query: 250 YMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
+ +P+ + A LHNI + I+ D + D E +D ++ ++D D +
Sbjct: 607 WRPDTPREMSRAIHACCILHNIVIDIEEDAGMPS----DQEEDDSKQKRQLEDEDAV 659
>gi|357139633|ref|XP_003571385.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 361
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 85/222 (38%), Gaps = 35/222 (15%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G VDGT + + + ++ + NR + N M + +F
Sbjct: 112 PHSILRKPKFYPFSKNCIGAVDGTHIPMKLSLDQQEPYRNRKQTISQNAMVASDFDLRFV 171
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------- 210
+N W GS DARV+++++ G+ L D+ Y + P
Sbjct: 172 HINPGWEGSASDARVLQDAL------NHGFEVLDGKFYLVDAGYANTPQFLAPYRGTRYH 225
Query: 211 -----RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-VVLA 264
R R+ + F H + R IE GI K +FP L P+ + + +A
Sbjct: 226 LNEQGRARHKPXNHKEXFNLRHAQPRNHIERIIGIWKMRFPILKVASHFPKEKQIDISVA 285
Query: 265 STTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEI 306
LHN + IH+ + + DPD+I
Sbjct: 286 CAVLHNF------------IRIHNGDMTWPSDATMDIDPDQI 315
>gi|218187156|gb|EEC69583.1| hypothetical protein OsI_38912 [Oryza sativa Indica Group]
Length = 269
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 71/173 (41%), Gaps = 17/173 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + P +D+F R + NV+A + +F V A W GS HD+
Sbjct: 99 FQGCIGALDGTHIPACVPMHMQDRFRGRKKFPSQNVLAAVDFDLRFLYVLAGWEGSAHDS 158
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPR------------TRNPDAA 218
V+++++ G + L D+ Y + L+PP R P+
Sbjct: 159 YVLQDALS----RTNGLKIPEGKYFLADAGYAARPGLLPPYQGTRYHLKEYKGAREPENP 214
Query: 219 ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNI 271
+ LR H R IE +FG LK +F P + ++ + +H I
Sbjct: 215 KELFNLR-HSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQVKIVMACCVHFI 266
>gi|356566327|ref|XP_003551384.1| PREDICTED: uncharacterized protein LOC100797028 [Glycine max]
Length = 321
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 16/136 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DG + + E ++ NR GD + NV+ +CGP+ +F V W GS D+
Sbjct: 119 FKNSIGALDGIHIPVTVSAEDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDS 178
Query: 171 RVMRNSMVYRK----------FDEEGWRPFPDAVI-LGDSAYPLKEWLIPPRTRNPDAAA 219
RV+R+++ + + G+ P + + Y L EW+ N
Sbjct: 179 RVLRDALRRQNCLHIPNGKYFLVDAGYTNGPGFLAPYRGTRYHLNEWI-----GNTPQNY 233
Query: 220 DQRFLRCHKRTRRLIE 235
+ F CH R +IE
Sbjct: 234 KELFNLCHASARNVIE 249
>gi|449676923|ref|XP_002160362.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 189
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 79/178 (44%), Gaps = 7/178 (3%)
Query: 12 LSATQVEFILEEIAPALT---TPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSK 68
+S ++E +L I +LT P+ ++ S ++++ + ++ G +
Sbjct: 1 MSPDRLEHLLSLIGSSLTKKYCPSRDSICPS--QRLIITIRYLATGESQQTQSFYFRVGR 58
Query: 69 SSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLID 126
+++C I E I + ++ P + +I F + FP G +DG V I+
Sbjct: 59 ATVCHIIEETCFAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIE 118
Query: 127 APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F++
Sbjct: 119 APAKSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFNK 176
>gi|356533383|ref|XP_003535244.1| PREDICTED: uncharacterized protein LOC100817811 [Glycine max]
Length = 422
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 53/244 (21%), Positives = 98/244 (40%), Gaps = 25/244 (10%)
Query: 45 LSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT 104
+ + +G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 123 IGTIAIVGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRES 182
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F + G +DGT + + +R+ + + NV+A C + +F V + W
Sbjct: 183 TRFYPYFKDCIGAIDGTHIPASVKGRDVSSYRDRHENISQNVLAACNFDLEFMYVLSGWE 242
Query: 165 GSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------R 211
GS HD++V+ +++ + G + L D +P + + P
Sbjct: 243 GSAHDSKVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGH 298
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTT 267
+P+ + LR H R +IE FGI K +F + +P F +VLA
Sbjct: 299 GNDPENEKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAA 354
Query: 268 LHNI 271
LHN
Sbjct: 355 LHNF 358
>gi|270015953|gb|EFA12401.1| hypothetical protein TcasGA2_TC016403 [Tribolium castaneum]
Length = 308
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 60/138 (43%), Gaps = 16/138 (11%)
Query: 191 PDAVILGDSAYPLKEWLIPPRT-RNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
P+ I+GD AYPL W IPP R F R H + R+++E SF +L +F L
Sbjct: 172 PNTFIIGDKAYPLLRWCIPPYIERRQHTPLQANFNRVHAKIRQVVERSFALLFGRFRRLK 231
Query: 250 YMRLSP-QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDEIDE 308
Y+ + +LA LHNI L H D ++ +E + ++
Sbjct: 232 YLDMQRLDLIPSTILACCVLHNICLQ------------HQDLFVENYIQEGLLFVQNVNI 279
Query: 309 EEQPRQDRNNPGTQRLAR 326
E + RQD NP R R
Sbjct: 280 EREDRQD--NPVEARNQR 295
>gi|357150951|ref|XP_003575633.1| PREDICTED: uncharacterized protein LOC100824418 [Brachypodium
distachyon]
Length = 2089
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 63/290 (21%), Positives = 125/290 (43%), Gaps = 41/290 (14%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKWP 94
S +E+V L + G P +G+ G ++S++ V VD ++R V WP
Sbjct: 1769 FSLEERVAIALIMLNSGDPPATVGSFLGVNESTVSL----VTKSFVDAMYERAKYHVYWP 1824
Query: 95 EENLND-IPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDH----ALNVMAI 149
+ + + + F + G PN G + T V +P +R+ DH ++ +
Sbjct: 1825 QSDEREKMKSKFAKIHGLPNCCGVLHTTHV---SP-------ASRSWDHQDKDSILFQGV 1874
Query: 150 CGPNYKFYAVNANW-PGSVHDARVMRNSMVYRKFDEEGW------RPFPDAV---ILGDS 199
GP+ +F ++ G+ + + +SM++++ ++ W + + V ++GD+
Sbjct: 1875 VGPDMRFTSILVGLQAGNTNQLSFLHDSMLFKECEKGAWLNGNKLKVSSEEVGEYVIGDA 1934
Query: 200 AYPLKEWLIPP---RTRNPDAAADQ-RFLRCHKRTRRLIENSFGILKEKFPCLN--YMRL 253
YPL WL+ P + PD ++ Q F R H R + + + + ++ +
Sbjct: 1935 GYPLLPWLLTPYELQNDLPDISSYQAEFNRRHSSARNITLKAMARFESTWKIMHSEWRPD 1994
Query: 254 SPQFAGMVVLASTTLHNIALTID---RDNENQNVEIHDDEAEDDGEEEAV 300
+P+ + LHNI + ++ R Q V DD+ +E+AV
Sbjct: 1995 NPREMTRAINVCCRLHNIVIGMEEGARMPSEQEVNYSDDQVRQLADEDAV 2044
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/252 (21%), Positives = 109/252 (43%), Gaps = 19/252 (7%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-E 95
+ S +E+V L + G P +G G +++++ V+++ + + WP
Sbjct: 1436 SFSLEERVAIALIMLNSGDPPATVGCYIGVNEATVSLVTRTFAGVMLERALEHLW-WPGS 1494
Query: 96 ENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
++ I F + G PN G V V I + + ++D VN + + + A+ + +
Sbjct: 1495 TEIDKIKYKFDKIHGLPNCCGVVH--TVHITSGSHNKDHEVNDD----MLMQAVVDTDMR 1548
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDE----EGWR-PFPDAV-----ILGDSAYPLKE 205
F P S+ + ++ +S ++ + ++ G + PD + I+GD+ YPL
Sbjct: 1549 FINTWLTSPDSMKQSSILHDSELFTECEKGTGLNGSKLNLPDGLQVGEYIIGDAGYPLLP 1608
Query: 206 WLIPPRTRNPDAAADQ-RFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLA 264
WL+ P + + Q F R H ++ + LK+ + L+ PQ + A
Sbjct: 1609 WLLTPYELENELSDSQVEFNRRHLEATTIVPRAVTRLKDTWKFLHGGAWRPQNHHWAIYA 1668
Query: 265 STTLHNIALTID 276
LHNI + ++
Sbjct: 1669 CFVLHNIVIHME 1680
>gi|348676154|gb|EGZ15972.1| hypothetical protein PHYSODRAFT_316091 [Phytophthora sojae]
Length = 306
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 6/104 (5%)
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAI 149
+KWP + +R+ GFP G +DGT + P F NR ++LN +
Sbjct: 113 IKWPSPDERSQTARRMRREGFPGCVGFIDGTTFPFVQKPGVDGQCFCNRKHRYSLNGQVV 172
Query: 150 CGPNYKFYAVNANWPGSVHDARVMR-----NSMVYRKFDEEGWR 188
C + + A + WPGS D+ V R N R+F EG+R
Sbjct: 173 CDVHRRIIAFYSGWPGSCADSTVYREMALPNYGYKRQFFSEGYR 216
>gi|341897450|gb|EGT53385.1| hypothetical protein CAEBREN_28482 [Caenorhabditis brenneri]
Length = 321
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 113/260 (43%), Gaps = 21/260 (8%)
Query: 33 ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVK 92
ER+ +S ++ L + G +++ G +S++ R IHE++ +V +K
Sbjct: 30 ERSPKISTAARLFIFLKFAREGRSENMLAKDIGICQSTVSRIIHEMIDDLV-AVAPHYIK 88
Query: 93 WP--EENLNDIPMTFLRKG---GFPNVAGC---VDGTMVLIDAPNEHEDQFVNRNGDHAL 144
+P +E + + FL K G P C +DG + P N +
Sbjct: 89 FPTTKEEIVALERGFLSKCDHWGRPRNVPCFGTIDGKHWATEHPPHSGSLNANYKKFFSY 148
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNS----MVYRKFDEEGWRPFPDAV------ 194
N + +C + + + + G +DA++ R+S M++ G+R D +
Sbjct: 149 NSLFVCDSDGRVMYLEVSELGVNNDAQLFRDSPLPLMLHDTVKAAGYRLLDDNITILPPF 208
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRL 253
++GD+ + + ++ P R + RF R R IEN FG + KF + ++L
Sbjct: 209 LVGDNGFGFTKHIMQP-YRGELSDEQMRFNRKLSAVRVSIENVFGRMNSKFQVFDRNLKL 267
Query: 254 SPQFAGMVVLASTTLHNIAL 273
+P ++ ++ A + +HNI L
Sbjct: 268 APHYSRCLIAALSVVHNIQL 287
>gi|357151003|ref|XP_003575650.1| PREDICTED: uncharacterized protein LOC100833301 [Brachypodium
distachyon]
Length = 698
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/292 (20%), Positives = 119/292 (40%), Gaps = 30/292 (10%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE 96
L +++V L + G +G++ G +KS++ V + + M KWP
Sbjct: 397 VLCLEDRVAVALIMLNAGQTLDDVGSSVGVNKSTVSLVTERFVDAMRERARHHM-KWPGS 455
Query: 97 N-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNY 154
+ ++ F + G PN G V + + + N +HE F + + A+ P+
Sbjct: 456 GEMENVKSKFDKILGLPNCCGVVHTSHIPFGSENCDHERNF-------CVLMQAVVDPDM 508
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA-----------VILGDSAYPL 203
+F + W + ++ NS ++++ + W ++GD+ YPL
Sbjct: 509 RFRNIWQGWSDRTNQLGLLHNSELFKECENGAWLNGSKLEVSSEGSEVGEYVIGDAGYPL 568
Query: 204 KEWLIPP-RTRNPDAAADQR--FLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF--- 257
WL+ P + ++ D D + F R H + L + + CL+ R SP+
Sbjct: 569 LPWLLTPYQQKDTDDQLDSQVEFNRRHSVAVSFALKALARLTDTWKCLH--RGSPKIPCE 626
Query: 258 AGMVVLASTTLHNIALTIDRDNENQNVEI-HDDEAEDDGEEEAVDDPDEIDE 308
+ A LHNI + ++ ++ ++ + ++ EE+A D + E
Sbjct: 627 MWKAIQACCMLHNIVIDMEEAPMARDYKVNYSEQVRGISEEDAARARDLLSE 678
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/263 (19%), Positives = 105/263 (39%), Gaps = 34/263 (12%)
Query: 37 ALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRM---VKW 93
L +++V L + G P + + G ++S+I V +D +++ + W
Sbjct: 55 VLCLEDRVAIALIMLNSGEPLEAVALSVGVNESTISL----VTERFIDATWEQADHHLNW 110
Query: 94 P-EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
P + + F + G PN G + T ++ + N N + + V + P
Sbjct: 111 PGSSEIEKVKSMFDKIHGLPNCCGIICTTHIIFGSQN------CNNETNDDIIVQVVVDP 164
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFD-------------EEGWRPFPDAVILGDS 199
+ +F + +++ ++ +S +++ D G + VI G +
Sbjct: 165 DMRFNNIWLGGSDAMNQMSLLHDSQFFKECDAGALVNGSKLELSSNGSEEVEEYVI-GAA 223
Query: 200 AYPLKEWLIPP-RTRNPDAAADQR--FLRCHKRTRRLIENSFGILKEKFPCLN---YMRL 253
YPL+ WL+ P + +N D D + F R H + LK+ + CL+ +
Sbjct: 224 GYPLRPWLLTPYKQQNMDELLDSQVEFNRRHSAALTFVLKVLARLKDTWKCLHGGMWHPE 283
Query: 254 SPQFAGMVVLASTTLHNIALTID 276
P+ V+ TLHNI + ++
Sbjct: 284 DPKELSRVIYVCITLHNIVIDME 306
>gi|449681715|ref|XP_004209905.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 242
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 67/154 (43%), Gaps = 12/154 (7%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++++C I E I + ++ P + +I F + F G +DG V I
Sbjct: 73 RATVCHIIEETCCAIWKVLKKVFLRAPNDVKEWQNIIKEFDQNWNFLQCIGAIDGKHVRI 132
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEE 185
AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F+ +
Sbjct: 133 KAPAKSGPSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRAFN-K 191
Query: 186 GWRPFPDA---------VILGDSAYPLKEWLIPP 210
G+ P V +GD + LK WL+ P
Sbjct: 192 GYFNLPKISEFDPKVPPVFVGDDIFALKPWLMKP 225
>gi|195018230|ref|XP_001984745.1| GH14854 [Drosophila grimshawi]
gi|193898227|gb|EDV97093.1| GH14854 [Drosophila grimshawi]
Length = 394
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 65/337 (19%), Positives = 135/337 (40%), Gaps = 29/337 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK+ FR+ + +++ + T A+ ++++ L+ +G + Y ++G G
Sbjct: 72 FKQHFRMEQEYFDILVQLLPELRKNDTNFRKAIPIEKRIAIALYTLGSTAEYRVVGKLFG 131
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPM----TFLRKGGFPNVAGCVDGT 121
S S + + +++ + Q + P+E L + T GFP G +
Sbjct: 132 ISMSMVGKILNDFCQAVKRVLSQEYL--PKEFLTQQKLEECVTGFEAKGFPQCLGAIGIC 189
Query: 122 MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRK 181
+ + A + + N ++ + A+ +++F VN + PGS A+V S +
Sbjct: 190 HLEVSAAISNSSLYYNNQDWYSKILFALV--DHRFLYVNYSCPGSFTPAQVYEESYLKEI 247
Query: 182 FD-----EEGWRPFPD----AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR 232
+ + +P ++LGD+A+ + P P +++F + R+
Sbjct: 248 LESSPQFKSNRQPIAGVEVPVLVLGDAAFEATTVMTP--YSQPHGETEKQFNVALQDARQ 305
Query: 233 LIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAE 292
+++ +F LK +F + +++ LHN + I DD AE
Sbjct: 306 VVDAAFQQLKSRFRRITKSADCHSRRNDIIICCCILHNFLI-------GNGSNIFDD-AE 357
Query: 293 DDGEEEAVDDPDEIDEEEQPRQDRNNPGTQRLARLLN 329
E+++V EEE P Q Q ++ L+N
Sbjct: 358 LQQEQQSVVTAAA--EEESPEQSATEDIRQAISTLIN 392
>gi|428186175|gb|EKX55026.1| hypothetical protein GUITHDRAFT_50618, partial [Guillardia theta
CCMP2712]
Length = 181
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 83/180 (46%), Gaps = 22/180 (12%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPT--ERNVALSAKEKVLSCLHWMGVGSPYHI-IGA 62
F+ +R+S + IL I+P + ++++ ++ L W+ GS H+ I +
Sbjct: 6 FRRYYRVSWNTFQVILTRISPMIDKADVFASRGCITSEIRLACVLRWLAGGS--HLDITS 63
Query: 63 THGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG--------FPNV 114
HG S ++ ++ ++ I +T F + +P + D+ M + G F +
Sbjct: 64 VHGISSTAFWDAVDLILDSI-NTGFD--ISFPFND--DLEMQKIAHGFANLHASPLFHHC 118
Query: 115 AGCVDGTMVLIDAPNEHEDQ----FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
G VDG ++ I +P + Q F R G +ALNV +C +F V N PGSVHD
Sbjct: 119 VGAVDGMLLQIQSPASSDTQTPSRFWCRKGFYALNVQGVCDSTMRFVYVGINAPGSVHDC 178
>gi|414589198|tpg|DAA39769.1| TPA: hypothetical protein ZEAMMB73_938202 [Zea mays]
Length = 207
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 93/209 (44%), Gaps = 17/209 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTE--RNV------ALSAKEKVLSCLHWMGVGSPY 57
+ +++ ++I + LTT T RN L +++V L + G
Sbjct: 1 MESVLKMTRRTFDYICSLVKKDLTTKTYGFRNFRFGDKKVLGVEDQVAVALMKLTTGESL 60
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
IG G + S+I + + + + KW E + I +F + G PN G
Sbjct: 61 QNIGMWFGMNHSAISNITWWFIESVEECAICHL-KWSSPEEMATIKTSFDKVYGLPNCCG 119
Query: 117 CVDGTMVLIDAPNE-HEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+D T +L+ + + + +++ + + + A+ P+ +F V + WPGS+ DA ++R
Sbjct: 120 AIDTTHILMCSSAQPNSKVWLDNENKNNMVLQAVVDPDLRFRDVVSGWPGSMDDACILRT 179
Query: 176 SMVYRKFDEEGWR-----PFPDAVILGDS 199
S +YR E+G R FP+ + LG +
Sbjct: 180 SGLYR-LCEKGLRLGGRWSFPEDLRLGST 207
>gi|108862658|gb|ABA98143.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 581
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 61/154 (39%), Gaps = 19/154 (12%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V A W G+ HDA
Sbjct: 78 FKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDA 137
Query: 171 RVMRNSMVYRKFDEEGWR-----PFPDAVILGDSAYPLKEWLIPP----------RTRNP 215
V+R+++ E G R L D+ Y K+ +PP NP
Sbjct: 138 VVLRDALER----ENGLRVPQGNRLQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNP 193
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
+ F H R +E +FG LK +F L+
Sbjct: 194 VQNEKELFNLRHSSLRVTVERAFGSLKRRFKVLD 227
>gi|326668615|ref|XP_003198838.1| PREDICTED: hypothetical protein LOC100536291 [Danio rerio]
Length = 252
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 68/149 (45%), Gaps = 4/149 (2%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTE-RNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ FR+S EF+L+ ++P+L T L + ++ L W S Y I G
Sbjct: 105 QHFRMSRNTFEFVLQLLSPSLKRKTTGWRKPLEPRLRLAVVLWWYATPSEYRTISCLFGL 164
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
S++C + +V + T +R + P+ E L F +G + AG +DG + I
Sbjct: 165 GISTVCMLVRQVTNAL-KTLCERFICLPKGERLQKTIDGFFARG-YKMCAGAIDGCHIPI 222
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNY 154
P+ + + NR G H++ + A+ N+
Sbjct: 223 LKPHVDQAAYCNRKGWHSIVLQAVVDHNF 251
>gi|331240515|ref|XP_003332908.1| hypothetical protein PGTG_14067 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309311898|gb|EFP88489.1| hypothetical protein PGTG_14067 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 271
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 76/181 (41%), Gaps = 16/181 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAP-ALTTPTERNVALSAKEKVLSCLHWMG-------VGSPY 57
FK+ FR++ + I ++ N KE+++ L +G VG
Sbjct: 78 FKQEFRMTQLAFTKLCARIRNNTVSQSNSHNPQHPIKEQLMVALERLGCFGNGASVGMLA 137
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPE-ENLNDIPMTFLRKGGFPNVAG 116
G G + R I ++ + ++++WP E+ +I + + GF G
Sbjct: 138 RFFGVGEGTFELCTNRFIMAILRIKT-----QIIQWPSPEDRKEIKANY-AEVGFDGCVG 191
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
+DG ++ L + P+++ F +R G + + + C N + WPG HD RVM N
Sbjct: 192 LIDGVLIPLTECPSKNGSDFYSRKGSYGITTLIACDSNRNINFLYTGWPGCSHDQRVMGN 251
Query: 176 S 176
S
Sbjct: 252 S 252
>gi|147779878|emb|CAN65842.1| hypothetical protein VITISV_027369 [Vitis vinifera]
Length = 579
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 75/177 (42%), Gaps = 24/177 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT + ++ + ++ R + A NV+ +C + +F V W GS D+
Sbjct: 168 FKNCLGALDGTYIKVNVREGDKPRYRTRKNEIATNVLGVCSQDMQFIYVLPGWEGSTSDS 227
Query: 171 RVMRNSMVYRKFDE--EGWRPFPDAVILGDSA---------YPLKEWLIPPRTRNPDAAA 219
RV+R+++ R G+ D Y L +W R +
Sbjct: 228 RVLRDAVSRRNGLTVPHGYYYLVDVGYTNGKGFLAPYRGQRYHLNDW----REGHMPTTH 283
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM-----VVLASTTLHNI 271
++ F H R +IE FG+LK ++ L SP F + ++LA +HN+
Sbjct: 284 EEFFNMKHSAARNVIERCFGLLKLRWAILR----SPCFYPIKTQCKIILACCLIHNL 336
>gi|356551004|ref|XP_003543869.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 308
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 76/165 (46%), Gaps = 22/165 (13%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T RNV + E + LH + Y ++ ++ SK +I R ++++
Sbjct: 91 ILQEKGGLVRT---RNVPTT--EAISMFLHILAHNLKYRVVQFSYCRSKETISRQFNDIL 145
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGG-------FPNVAGCVDGTMVLIDAPNEHE 132
++ K + LN P T +G F G +D T + +
Sbjct: 146 RAVM--------KVSKNYLNFQPCTL--EGAEANKWRWFERCIGALDETHIPVTVSPYER 195
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
++ NR GD + NV+A CGP+ +F V W GS D+RV+R+++
Sbjct: 196 PRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRDAL 240
>gi|397625395|gb|EJK67778.1| hypothetical protein THAOC_11144 [Thalassiosira oceanica]
Length = 347
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 98/250 (39%), Gaps = 26/250 (10%)
Query: 48 LHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLND--IPMTF 105
L + G P I HG + S+ +VV I + + +K+PE + + F
Sbjct: 3 LRYCAGGDPTDI-ADIHGVKDDEVLNSLWDVVDAIHQSP-ELDIKFPETEADQRRVAEGF 60
Query: 106 LRKG--GFPNVAGCVDGTMVLIDAPNEHEDQ---------FVNRNGDHALNVMAICGPNY 154
RK G VDG ++ I P+ +++ F R LN+ +C
Sbjct: 61 RRKSQIDIDCCVGAVDGILIWIHKPSSEDEKVIRIGPKKFFCGRKKKFGLNMQGVCDARG 120
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV-------ILGDSAYPLKEWL 207
F V +PGS D S + +K ++EG+ ++ + GD+AY ++
Sbjct: 121 YFLDVEIRFPGSSSDFYAFDESSLKKKLEQEGFLGTAVSMDFGGKLCLFGDNAYVQSSYM 180
Query: 208 IPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY---MRLSPQFAGMVVLA 264
P D F H + R IE +FGIL ++ L + +S +VLA
Sbjct: 181 CTPWKAVTSGPKDA-FNFYHSQVRINIECAFGILVHRWGMLRKAIPVGISVSRTTRLVLA 239
Query: 265 STTLHNIALT 274
LHN +
Sbjct: 240 LCKLHNFCIA 249
>gi|403175037|ref|XP_003333918.2| hypothetical protein PGTG_15341 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171427|gb|EFP89499.2| hypothetical protein PGTG_15341 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 332
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 81/184 (44%), Gaps = 16/184 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
+ ++ ++ V+ +V + ++ V WP ++ GF G VDGT + L
Sbjct: 158 GRGTVIKATRRVIEALV-SLGRQYVMWPNTTRRAEISEVMKNEGFVGCVGFVDGTTIPLF 216
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY----RK 181
P + F +R +++N IC + + WPG+ D+ V + ++ +
Sbjct: 217 QRPGYDGEVFYDRKRRYSINAQIICDCDKYITSFITGWPGTCGDSLVYKRMQLHLNPSQY 276
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAA--ADQRFLRCHKRTRRLIENSFG 239
FD+ + +L DSAY + + ++P +NP A + F C + R E++ G
Sbjct: 277 FDDGQY-------LLADSAYEVSKTVVPA-YKNPSAKIPINADFNYCLAKARVRNEHTIG 328
Query: 240 ILKE 243
+LK
Sbjct: 329 VLKS 332
>gi|37624213|gb|AAQ95643.1| transposase [Sorghum bicolor]
Length = 118
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P H F+ R NVMA + +F V A W GS HDAR++ +
Sbjct: 1 GALDGTHVLARVPERHRAAFMGRKHTTTQNVMAAVDFDLRFTYVLAGWEGSAHDARILAH 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ ++G + L D+ Y L+ +PP
Sbjct: 61 ALE----KDDGLKVPTGKFYLVDAGYALRPGFLPP 91
>gi|449682085|ref|XP_004209994.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 129
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 58/102 (56%), Gaps = 8/102 (7%)
Query: 193 AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YM 251
+++LGDS +P WL+ P +P AA +R+ +RTR LIE FG K +F L+ +
Sbjct: 8 SLLLGDSGHPCFRWLLTPYL-SPTTAAQRRYNISLRRTRVLIEQVFGRWKRRFHLLHGEV 66
Query: 252 RLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAED 293
R++P+ ++ A L N+A+ ++ NV+I D+ E+
Sbjct: 67 RMTPERTCTIISACAVLQNLAIELN------NVDIDDNPIEN 102
>gi|357120067|ref|XP_003561752.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 500
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/321 (21%), Positives = 120/321 (37%), Gaps = 42/321 (13%)
Query: 5 AFKEAFRLSATQVEFILEEIA--------PALTTPTERNVALSAKEKVLSCLHWMGVGSP 56
AF+ FR+S + +I + P L T R L +++V + + G
Sbjct: 161 AFRHFFRMSKSTFAYICSIVRKDLISLPPPGLITIEGR--LLRVEKQVAIAIWRLASGDS 218
Query: 57 YHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVA 115
+GA G +S++ + + + +V WP + +I G PN
Sbjct: 219 QMSVGAAFGVGQSTVSQVTWRFIQSMERNARHHLV-WPNRARMEEIKSNLEALSGLPNCC 277
Query: 116 GCVDGTMVLIDAPN-EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMR 174
G +D T +++ PN + + + + +++ + I +F + +PGS+ +++
Sbjct: 278 GAIDATHIVMTLPNAKSSEDWRDPAQKYSMLLQGIFDDEMRFINIVTGYPGSLPFEKMLE 337
Query: 175 NSMVYR-----------------KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA 217
S Y K D R F I+GD YPL WL+ P +
Sbjct: 338 FSQFYELCQAGKCLDGPVRGSSCKEDAAEIREF----IVGDKCYPLLPWLMTPYEEEDPS 393
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLASTTLHNIALTI 275
A F K R L + LK + L+ + P + ++L LHNI +
Sbjct: 394 APMVNFNDHQKAARMLGTRALAKLKGCWRILDKVIWRPDKNKLPSLILVCCLLHNILIDC 453
Query: 276 DRDNENQNVEIHDDEAEDDGE 296
D + DDE D E
Sbjct: 454 KDDF------LSDDEIPDRHE 468
>gi|449688402|ref|XP_004211736.1| PREDICTED: uncharacterized protein LOC101239839 [Hydra
magnipapillata]
Length = 235
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 69/147 (46%), Gaps = 13/147 (8%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP+ G +DG +++ AP+ ++ N ++ ++A+C Y+F V+ G D
Sbjct: 4 FPHAIGAIDGKHIVMQAPHNGGSEYFNYKKTRSIVLLAVCNAKYEFTMVDIGDSGRQSDG 63
Query: 171 RVMRN-SMVYRKFDEEGWRPFPD----------AVILGDSAYPLKEWLIPPRTRNPDAAA 219
V N S+ Y + + P P+ V++ D A+ LK ++ R N +
Sbjct: 64 SVFNNCSLGYAIENNKLNIPDPEYIGNSENVLPYVLVADDAFGLKIHMM-KRYPNQNIPL 122
Query: 220 DQRFLRCH-KRTRRLIENSFGILKEKF 245
DQ+ R RR+IEN+FGI +F
Sbjct: 123 DQKIFNYRLSRARRVIENTFGIATTRF 149
>gi|449682508|ref|XP_004210097.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 126
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 59/103 (57%), Gaps = 3/103 (2%)
Query: 193 AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-YM 251
+++GDS Y WL+ P NP A+ R+ ++TR +IE FG K +F L+ +
Sbjct: 7 GLLIGDSGYLCFRWLLTPYL-NPTTASQHRYNISLRKTRVIIEQVFGRWKRRFHLLHGEI 65
Query: 252 RLSPQFAGMVVLASTTLHNIALTIDRDNENQN-VEIHDDEAED 293
R+SP+ +V A LHN+A+ ++ + ++N +E +++ ED
Sbjct: 66 RMSPERTCTLVAACAVLHNLAIQLNDGDMDENPIENDENDNED 108
>gi|38345857|emb|CAD41055.2| OSJNBa0084K11.21 [Oryza sativa Japonica Group]
gi|38345926|emb|CAE01919.2| OSJNBb0078D11.1 [Oryza sativa Japonica Group]
Length = 377
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 71/179 (39%), Gaps = 25/179 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V A W G+ HDA
Sbjct: 149 FKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDA 208
Query: 171 RVMRNSMVYRKFDEEGWR-----PFPDAVILGDSAYPLKEWLIPP----------RTRNP 215
V+R+++ E G R L D+ Y K+ +PP NP
Sbjct: 209 VVLRDALE----RENGLRVPQGNRLQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNP 264
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM----VVLASTTLHN 270
+ F H R +E +FG LK +F L+ +P F +V+A +HN
Sbjct: 265 VQNEKELFNLRHSSLRVTVERAFGSLKRRFKVLD--DATPFFPFQTQVDIVVACCIIHN 321
>gi|297720307|ref|NP_001172515.1| Os01g0693600 [Oryza sativa Japonica Group]
gi|255673580|dbj|BAH91245.1| Os01g0693600 [Oryza sativa Japonica Group]
Length = 807
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 68/158 (43%), Gaps = 39/158 (24%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT V + P Q++NR + + NV+A+C + +F V + WPGS HD
Sbjct: 687 FNDCIGAIDGTHVKLVVPKTKRIQYLNRYNETSQNVLAVCDFDMRFTFVLSGWPGSAHDM 746
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRT 230
RV F DA +P PP+ +R
Sbjct: 747 RV-----------------FKDATTTHAHKFPH-----PPQV---------------ERA 769
Query: 231 RRLIENSFGILKEK--FPCLNYMRLSPQFAGMVVLAST 266
+ I N G+++ + F CL+++ L+ Q +++ +T
Sbjct: 770 KLGIGNLLGVMRMQIIFQCLHHLTLTGQKMNLLLRIAT 807
>gi|301630170|ref|XP_002944199.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 128
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 5/82 (6%)
Query: 195 ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----Y 250
++ D+ YP WLI P R P + AD+ F + H R R +IE +FG+LK F CL+
Sbjct: 4 LIRDAGYPCGRWLITPIHR-PRSLADRAFNQAHVRARSVIERTFGVLKSWFRCLDKSGGS 62
Query: 251 MRLSPQFAGMVVLASTTLHNIA 272
+ SP V A LHN+A
Sbjct: 63 LMYSPTKVANFVGACAVLHNLA 84
>gi|359480387|ref|XP_003632444.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 349
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 91/210 (43%), Gaps = 31/210 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + +D + + N+ ++ R NV+A C + KF V W GS D+
Sbjct: 141 FKDCIRAIDASHFRVKVSNDVVQRYRGRKYYPTQNVLAACSFDLKFTYVLPGWEGSALDS 200
Query: 171 RVMRNSMVYRKFDE----EGWRPFPDAVILGDSAYPLKEWLIPP-----------RTRNP 215
R++ N+++ R FD+ +G L D+ + LK + P P
Sbjct: 201 RILDNALM-RDFDKLIVPQG------KYYLADAGFQLKTGFLTPYRSTCYHLKEYSVHQP 253
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN-----YMRLSPQFAGMVVLASTTLHN 270
+ A + LR H R IE +FG+L+++FP + + + Q+ ++LA LHN
Sbjct: 254 ENAREVFNLR-HSSLRNAIERAFGVLRKRFPIIASGTKPHYPVDTQYD--IILACCILHN 310
Query: 271 IALTIDRDNENQNVEIHDDEAEDDGEEEAV 300
+ +D NE E+ + + E E++
Sbjct: 311 YLMGVD-PNERLIAEVDREFFSKEAEFESM 339
>gi|410895951|ref|XP_003961463.1| PREDICTED: putative nuclease HARBI1-like [Takifugu rubripes]
Length = 396
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/279 (20%), Positives = 119/279 (42%), Gaps = 18/279 (6%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
A +E F L+ + FI + + + T + ALS + L++ GS I G T
Sbjct: 36 ALQETFHLTKPCLHFIRDAVLVRMKKCTLKKFALSLDVMLPLALNYYAHGS---ISGDTL 92
Query: 65 GPSK--SSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDG 120
++ + C +I V ++ + + +P + + + G PNV G +
Sbjct: 93 QRTRRFETDCPAIISAVSGVIAGMSDQFISFPLLQGARASVASKTEKICGIPNVLGVLAP 152
Query: 121 TMVLIDAPNEHED---QFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ + +D FV+ +G ++ IC + +V GS + + +S
Sbjct: 153 AHFEVQVSLKEKDTFKSFVSASGFPSVVSQLICDLDGNLLSVEKCCVGSTSEQEMWESSF 212
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENS 237
R+ ++E P+ ++G + Y L + ++ P ++P D R+ H + R+++ +
Sbjct: 213 KGRELEKELHGPY---WVIGGNGYHLSKHVLTP-VQDPSNEKDIRYNEAHAKLYRVMQTT 268
Query: 238 FGILKEKFPCLNYMRLSPQFA----GMVVLASTTLHNIA 272
G +K +F CL + + + + ++ A LHNIA
Sbjct: 269 LGHIKTRFRCLVNLGFAQKGSLDKKSNIIKACCVLHNIA 307
>gi|224105513|ref|XP_002313838.1| predicted protein [Populus trichocarpa]
gi|222850246|gb|EEE87793.1| predicted protein [Populus trichocarpa]
Length = 442
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/283 (18%), Positives = 113/283 (39%), Gaps = 23/283 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS +++++ P +T N++L V L + G + + +
Sbjct: 115 WRSMYGLSYPVFTTVVDKLKPHITA---SNLSLPTDYAVSMVLSRLAHGFSAKTLASRYS 171
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ + + V ++ + +K P + L + F PN+ G +DG+ +
Sbjct: 172 LEPYLVSKITNMVTRLLATKLYPEFIKIPVSKRRLIETTQAFEELTSLPNMCGAIDGSPI 231
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF- 182
+ + + + R G ++ + + F+ V PG DA +R S++Y +
Sbjct: 232 KVKRADIGGNMYKCRYGYSSVLLQVVADHKKVFWDVCVKAPGGSDDASHLRGSVLYNRLV 291
Query: 183 -DEEGW-----------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR-FLRCHKR 229
+ W RP+ I+GD YPL +L+ P + N Q F +
Sbjct: 292 SGDVVWDKVINVRGHHVRPY----IVGDWCYPLLSFLMTPFSPNGSGTPSQNLFDGMLMK 347
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIA 272
R ++ ++ +LK ++ L + + A ++A LHN+
Sbjct: 348 GRSVVVDAIALLKGRWKILQDLNVGLDHAPQTIVACCVLHNLC 390
>gi|406697727|gb|EKD00981.1| hypothetical protein A1Q2_04722 [Trichosporon asahii var. asahii
CBS 8904]
Length = 441
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 145/340 (42%), Gaps = 47/340 (13%)
Query: 1 MTPFAFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKE----KVLSCLHWMGVGSP 56
M F++AFR+S ++ + + P N + ++ L+ MG G+
Sbjct: 75 MNASNFRQAFRMSRQELSNLYNYVG---NHPVYHNTRGRPQRNPVVQLAVFLYRMGSGNT 131
Query: 57 YHIIGATHG--PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLR-KGGFPN 113
+ T G P + + + VV + + +VKWP+ TFL K P
Sbjct: 132 LADVQRTLGRLPRGTIVKYCMRSVVAI--SSKLAEVVKWPDPQRRREIETFLHDKYMIPG 189
Query: 114 VAGCVDGTMVLI-DAPN---EHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
G +DG+ +L+ +P+ E F +R + L ++A+C +F + S +D
Sbjct: 190 CVGFIDGSHILLHKSPSFSVEKNATFWSRKKRYGLLILAVCDEKKRFTYLQTGHFASAND 249
Query: 170 ARVMRNSMVYRKFDEEGWRPFPDA-VILGDSAYPLKEWLIPPRTRNP---DAAADQRFLR 225
R +++ + R + + F D ILGDS + ++P R+ D ++
Sbjct: 250 FRAQQSTPMAR----DPAQLFSDGQYILGDSGFYCDSNVVPMYRRSSGQVDLPEERLEFN 305
Query: 226 CHKRTRRL-IENSFGILKEKFPCLNYMRLSPQ------FAGMVVLASTTLHNIALTIDRD 278
H T R+ IE++FGILK+++ LN + L + FA + A LHN L ID
Sbjct: 306 EHVATARVKIEHAFGILKQRWLILNDLHLLLKDQDNLTFAYAAIQAVVVLHN--LYIDTS 363
Query: 279 NENQNV--------------EIHDDEAEDDGEEEAVDDPD 304
+ + ++ E + E D G EA+++P+
Sbjct: 364 SYHWDLAEFLAAQQRALELQEKMEKEERDAGNLEALEEPN 403
>gi|47824925|gb|AAT38700.1| Putative transposase, related [Solanum demissum]
Length = 130
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 39/71 (54%)
Query: 107 RKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGS 166
R+ F G +DGT + I P + + +++ R G+ A NV+ +C N F V W GS
Sbjct: 59 RRRWFKGCLGALDGTYIQIRVPYKDKPRYMTRKGEIATNVLEVCDKNLNFTYVLPGWEGS 118
Query: 167 VHDARVMRNSM 177
D RV+RN++
Sbjct: 119 TTDGRVLRNAI 129
>gi|356562026|ref|XP_003549276.1| PREDICTED: uncharacterized protein LOC100782703 [Glycine max]
Length = 348
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 37/67 (55%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DG + + E ++ NR GD NV +CGP+ +F V W GSV D+
Sbjct: 146 FKNSIGALDGIHIPVTVSAEDRPRYSNRKGDIFTNVFGVCGPDLRFIYVLPGWEGSVGDS 205
Query: 171 RVMRNSM 177
RV+R+ +
Sbjct: 206 RVLRDVL 212
>gi|356497601|ref|XP_003517648.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 379
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/237 (22%), Positives = 95/237 (40%), Gaps = 23/237 (9%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 86 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPY 145
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + +R+ + + NV+A C + +F V + W GS +D+
Sbjct: 146 FKDCIGAIDGTHIPASVKGRDVSSYRDRHENISQNVLAACNFDLEFMYVLSGWEGSAYDS 205
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPDAAA------- 219
+V+ +++ + G + L D +P + + P R D A
Sbjct: 206 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 261
Query: 220 -DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ F H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 262 EKELFNLRHASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 315
>gi|108710333|gb|ABF98128.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 627
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 72/148 (48%), Gaps = 7/148 (4%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
V + E + L+ +G GS S +I R I EV+ +V+ + +V+
Sbjct: 392 TVHMHTMEALAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVE-LGRDIVRPK 450
Query: 95 EENLNDIPMTFLRKG-----GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAI 149
+ N +++ +RK F + G VDGT +L P++ + +++ R+ NVMAI
Sbjct: 451 DPNFSNV-HERIRKDRRMWPHFKDCIGAVDGTHILAVVPDDDKIRYIGRSKSTTQNVMAI 509
Query: 150 CGPNYKFYAVNANWPGSVHDARVMRNSM 177
C + +F + PGS+HD V+ N++
Sbjct: 510 CDHDMRFTYASIGQPGSMHDTTVLFNAL 537
>gi|29837768|gb|AAP05804.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108709497|gb|ABF97292.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/290 (22%), Positives = 112/290 (38%), Gaps = 33/290 (11%)
Query: 7 KEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGP 66
+ FRL+A ++ + + + + V + EK+ L+ MG P
Sbjct: 62 RAMFRLTAEEIHNLHDLLVTSFGLVGTAGVC--SMEKLGIFLYTMGGNRPIRDGNNRWVR 119
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFP---NVAGCVDGTMV 123
S S++ H V+ + D +++K + N D L++ F G VDGT +
Sbjct: 120 SNSTVSVHFHHVLQAMND-LAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHI 178
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ Q NR+ NV+ IC + + +A WPGSVHD R++ ++ +D
Sbjct: 179 PVLPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILNEAVQAYPYD 238
Query: 184 EEGWRPFPDA----VILGDSAYPLKEWLIPP-------------RTRNPDAAADQRFLRC 226
FP +L DS +P + + P P ++ F
Sbjct: 239 ------FPRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREETFNHR 292
Query: 227 HKRTRRLIENSFGILKEKFPCLN---YMRLSPQFAGMVVLASTTLHNIAL 273
H R ++E F I K+ + L Y R ++ A+ LHN L
Sbjct: 293 HSTLRGIVERQFVIAKKMWKILKEIPYYR-DEDIPARIIHAAFALHNFRL 341
>gi|198475885|ref|XP_002132210.1| GA25327 [Drosophila pseudoobscura pseudoobscura]
gi|198137450|gb|EDY69612.1| GA25327 [Drosophila pseudoobscura pseudoobscura]
Length = 388
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/281 (22%), Positives = 109/281 (38%), Gaps = 29/281 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERN--VALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR+ + E +L+ + A+ + ++S EK+L L +G G + G
Sbjct: 69 FKERFRMQRSTFETLLQVVGRAIAGAEQYQPIASVSLPEKLLYTLMLLGGGISFRDAGEI 128
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRM---VKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
+ SKS S HE+ + F M V+WP+ + + + K P V G ++
Sbjct: 129 YSISKS----SGHEIFKWVTSAFSALMPCYVQWPKYGAAGVGSSSISK--LPGVLGVIEE 182
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + P +D N AL + A+C +F V + P ++ + + +
Sbjct: 183 CRIPLKLPIRIDDD--NPKQYPALTLQAVCDAQSRFLDVYVDAPNDDQHCILLASPLFEK 240
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI------ 234
+ I+GD YPL L+ P CH + I
Sbjct: 241 LIHTQTPLMAEHRHIVGDKTYPLLLNLMTPYV--------GELTPCHAQYNLAIRLWNAP 292
Query: 235 -ENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIAL 273
E + L +F L+ + +S + V+ A+ LHN L
Sbjct: 293 AERALATLISRFKRLDSLDVSSLELGTTVISAACMLHNFIL 333
>gi|345560284|gb|EGX43409.1| hypothetical protein AOL_s00215g145 [Arthrobotrys oligospora ATCC
24927]
Length = 515
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 139/345 (40%), Gaps = 32/345 (9%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F++ FR++ FIL++I+ T P + SA+ ++ L G I G
Sbjct: 144 FRQNFRVNPDAFAFILQKIS---TNPVFGDRQGSAELQLKIALRRFGGLDDVSQIAQKFG 200
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFP---NVAGCVDGTM 122
+ ++ V +++ + V+WP LR+ F + G +DGTM
Sbjct: 201 IGEGTVVLYTQRVAGALME-LWSEYVRWPTVEEQAAMKARLRQKDFAVWEDCVGFIDGTM 259
Query: 123 VLID-----APNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
+ + + R + + +C + + +PGSV D R R +
Sbjct: 260 FPFATRPAFGKEDARNYYNMRKHAYGQHATVVCDDQNRITHFTSLFPGSVSDQRAFRVTD 319
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP--RTRNPDAAADQRFLRCHKRTRRL-I 234
+++K +E + +LGD Y L E LI P + R+ +QR + R+
Sbjct: 320 LFQKPEEFFKNQY--QYLLGDKGYALNERLIIPFKQPRSGKPPKEQRRFNWKLSSLRVKA 377
Query: 235 ENSFGILKEKFPCLNYM--RLSPQFAGMVVL----ASTTLHNIALTIDRDNENQNVEIHD 288
E++ GILK +F L + RL Q L A LHN+ + RD E+ +
Sbjct: 378 EHTIGILKLRFRSLQRLPVRLVSQEKLSEALRWIGACVVLHNMLVDF-RDEWEPTTEMLE 436
Query: 289 D--EAEDDGEEEAVDDPDEIDEEEQP------RQDRNNPGTQRLA 325
+ + E + +EE + D + E P +D T+RLA
Sbjct: 437 EVLKEERERDEEYLGAIDGSVQVENPGLLARMSKDERRGHTRRLA 481
>gi|390353259|ref|XP_003728073.1| PREDICTED: uncharacterized protein LOC100893149 [Strongylocentrotus
purpuratus]
Length = 159
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 64/140 (45%), Gaps = 16/140 (11%)
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV------------ 194
MA+ Y+F ++ G DA++ NS + ++ E G PD
Sbjct: 1 MALVDSKYQFLWIDVGGVGHQSDAQIYNNSEL-KECIEAGTLGIPDPAPLPHDDEEHAMP 59
Query: 195 --ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYM 251
+GD A+ ++ +++ P R + F R RR++EN+FGIL +F C L M
Sbjct: 60 YFFVGDYAFAMRTYMMKPYGRRNMDQQQKIFNYRLSRARRVVENAFGILALRFQCFLGQM 119
Query: 252 RLSPQFAGMVVLASTTLHNI 271
R P +++ A+ LHN+
Sbjct: 120 RQEPDTVRLLIEAAVMLHNL 139
>gi|222635649|gb|EEE65781.1| hypothetical protein OsJ_21474 [Oryza sativa Japonica Group]
Length = 821
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 71/169 (42%), Gaps = 24/169 (14%)
Query: 148 AICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWL 207
+ C + F V A WPGSVHD RV ++M ++ P P L DS YP +
Sbjct: 650 SFCDFDMIFTFVFAGWPGSVHDMRVFNDAMT--RYKNVFPHPPPGKFYLVDSWYPNRIGY 707
Query: 208 IPPRTRNP-----------DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSP 255
+ P ++ F H R ++E SFG+LK+K+ L Y+ P
Sbjct: 708 LAPYKGTKYHLQEFQNAVEPEGKEEVFNYAHSSLRNVMERSFGVLKQKWRILKYVPSYPP 767
Query: 256 QFAGMVVLASTTLHNI----ALT------IDRDNENQNVEIHDDEAEDD 294
+ +++A LHN LT DRD E ++D+ +D
Sbjct: 768 ETQTHIIVACIALHNYIRLSGLTDRHFGRCDRDENYVPPEAYEDQPPED 816
>gi|449673392|ref|XP_004207945.1| PREDICTED: protocadherin Fat 4-like [Hydra magnipapillata]
Length = 2166
Score = 56.2 bits (134), Expect = 2e-05, Method: Composition-based stats.
Identities = 39/153 (25%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPEE--NLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
++++C I E I + ++ P + +I F + F G +DG V I
Sbjct: 1997 RATVCHIIEETCCAIWKFLKKVFLRAPNDVKEWQNIIKEFDQNWNFLQCIGAIDGKHVRI 2056
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE- 184
+AP + F N G +++ ++AIC Y F V+ G +DA ++ S R F++
Sbjct: 2057 EAPAKSGPSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAILNASTFGRVFNKG 2116
Query: 185 -------EGWRPFPDAVILGDSAYPLKEWLIPP 210
+ P V++GD + LK WL+ P
Sbjct: 2117 YFNLPKISEFDPKVSPVLVGDDIFALKPWLMKP 2149
>gi|242052433|ref|XP_002455362.1| hypothetical protein SORBIDRAFT_03g009305 [Sorghum bicolor]
gi|241927337|gb|EES00482.1| hypothetical protein SORBIDRAFT_03g009305 [Sorghum bicolor]
Length = 341
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 22/193 (11%)
Query: 133 DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPD 192
Q + R G NVMA C + +F V + WPGS HD V ++++ +F + P
Sbjct: 68 QQHLCRKGMTTQNVMAACDFDMRFTFVMSGWPGSAHDMTVFKDAI--SRFGDLFPHPPTG 125
Query: 193 AVILGDSAYPLKEWLIPP-----------RTRNPDAAADQRFLRCHKRTRRLIENSFGIL 241
L DS Y + + P R ++ F H R +IE +FG+L
Sbjct: 126 KYYLVDSGYANRLGYLAPYKGTKYHLQEYREGPQPEGKEETFNYAHSSLRNVIERAFGVL 185
Query: 242 KEKFPCLNYM-RLSPQFAGMVVLASTTLHNIALT-------IDRDNENQNVEIHDDEAED 293
K K+ L + S + +++A LHN T R + ++N + ++ ED
Sbjct: 186 KMKWRMLREIPSYSTEKQSRIIVACCALHNFIRTSGIQDRHFARCDRDENF-VPEEAFED 244
Query: 294 DGEEEAVDDPDEI 306
E + V+D ++
Sbjct: 245 QPEPKVVEDDSQL 257
>gi|449669438|ref|XP_004207024.1| PREDICTED: uncharacterized protein LOC101240467 [Hydra
magnipapillata]
Length = 188
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 13/140 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FP+ G +DG +++ AP+ ++ N N H++ ++A+C Y+F V+ G D
Sbjct: 4 FPHAIGAIDGKHIVMQAPHNGGSEYFNYNKTHSIVLLAVCNAKYEFTMVDIGDSGRQSDG 63
Query: 171 RVMRN-SMVYRKFDEEGWRPFPD----------AVILGDSAYPLKEWLIPPRTRNPDAAA 219
V N S+ Y + + P P+ V++ D A+ LK ++ P N +
Sbjct: 64 SVFNNCSLGYAIENNKLNIPDPEYIGNSEKVLPYVLVDDDAFGLKRHMMKPYP-NQNIPL 122
Query: 220 DQRFLRCH-KRTRRLIENSF 238
DQ+ R RR+IEN+F
Sbjct: 123 DQKIFNYRLSRARRVIENTF 142
>gi|301616261|ref|XP_002937579.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 342
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 1/130 (0%)
Query: 10 FRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKS 69
+RL+ + + E + P+L T R+ A+ K+L LH++ GS + G G S+
Sbjct: 24 YRLNREAISSLYELLEPSLQPLTRRSRAVPGMVKLLCSLHFLATGSFQRVGGVYRGVSQP 83
Query: 70 SICRSIHEVVPVIVDTFFQRMVKWPEEN-LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+ R + +++ I ++ N N + F G PNV G +D T V +++P
Sbjct: 84 TFSRCLGQILDAICSVSGNFILLPQHRNEWNAVKRHFYGVSGIPNVLGAIDCTHVALNSP 143
Query: 129 NEHEDQFVNR 138
+ E F N+
Sbjct: 144 QDREHIFRNK 153
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 7/70 (10%)
Query: 202 PLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGMV 261
P WLI P R P + A+ F H R +IE +FG+LK F CL+ + G +
Sbjct: 154 PCSRWLITPIHR-PCSRAEYAFNEAHVRAWSVIERTFGVLKSWFRCLD------KSGGSL 206
Query: 262 VLASTTLHNI 271
+ + T NI
Sbjct: 207 MHSPTKFSNI 216
>gi|449439467|ref|XP_004137507.1| PREDICTED: uncharacterized protein LOC101221521 [Cucumis sativus]
Length = 392
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 75/172 (43%), Gaps = 16/172 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG + + + + F N+NG + V+A C + KF+ V A W GS D
Sbjct: 147 FKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL 206
Query: 171 RVMRNSMVYRK----------FDEEGWRPFPDAVI-LGDSAYPLKEWLIPPRTRNPDAAA 219
+V+ +++ R ++ + P V D Y KE+ P +P A
Sbjct: 207 QVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEY---PGGYHPQDAK 263
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
+ LR H R E +F LK +FP L P Q +V+A+ +HN
Sbjct: 264 ELFNLR-HSLLRNATERTFEALKARFPILLSAPPYPLQTQVKLVVATCAIHN 314
>gi|357475473|ref|XP_003608022.1| hypothetical protein MTR_4g086650 [Medicago truncatula]
gi|355509077|gb|AES90219.1| hypothetical protein MTR_4g086650 [Medicago truncatula]
Length = 517
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/318 (18%), Positives = 124/318 (38%), Gaps = 27/318 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS +++++ P + N++L + V L + G + +
Sbjct: 114 WRSLYGLSYPVFTTVVDKLKPHIAV---SNLSLPSDYAVAMVLSRLAHGLSAKTVANRYS 170
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ + + V ++ + +K P L +I +F PN+ G +D T V
Sbjct: 171 LEPYLVTKITNMVTRLLATKLYPEFIKIPVGRRRLIEITQSFEELTSLPNMCGAIDSTSV 230
Query: 124 -LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
L P+ + +++R G ++ + + F+ V PG DA R+S++Y++
Sbjct: 231 KLRSGPSSNPATYLSRYGYPSVLLQVVSDHKKIFWDVCVKAPGGTDDATHFRDSLLYQRL 290
Query: 183 DEEG--W-----------RPFPDAVILGDSAYPLKEWLIPPRT-RNPDAAADQRFLRCHK 228
W RP+ ++GD YPL +L+ P + A F
Sbjct: 291 TSGDVVWDKVINVRGHHVRPY----VVGDWCYPLLPFLLTPFSPSGMGTPAQNLFDGMLM 346
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHD 288
+ R ++ + +LK ++ L + + ++A LHN+ I R E + E+
Sbjct: 347 KGRSVVVEAIALLKGRWKILQELNVGLHHVPQTIVACCVLHNLC-QIAR--EPEPAELWK 403
Query: 289 DEAEDDGEEEAVDDPDEI 306
D E + +D +
Sbjct: 404 DPDESGAQPRVLDSEKSL 421
>gi|37624195|gb|AAQ95636.1| transposase [Gongora ilense]
Length = 122
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 12/120 (10%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT V P ++ R G NV+A+C N F V A W GS HD+RV+ +
Sbjct: 1 GALDGTHVDARVPFAEMVPYIGRGGSTTQNVLAVCDFNMCFTFVLAGWEGSAHDSRVLNH 60
Query: 176 SMVYRKFDEEGWRPFP----DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTR 231
+M +++F+ FP L D YP++ + P D F RC + R
Sbjct: 61 AMRHKEFN------FPYLPSGKYYLVDVGYPMQRGFLKPYPNTKYHLPD--FERCSQVVR 112
>gi|307212987|gb|EFN88560.1| Putative nuclease HARBI1 [Harpegnathos saltator]
Length = 142
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 7/126 (5%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
FK FRL+ V IL I L TERN A+SA ++L L G +G G
Sbjct: 20 FKMRFRLNKETVRMILHNINGELQFRTERNNAISAMYQLLLTLRLYATGFFLITMGDFAG 79
Query: 66 PSKSSICRSIHEVVPVI--VDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
S +S R +H V I + +F + +P E + + F FP V GC+D T
Sbjct: 80 VSTTSAHRIVHRVSKAIARLQLYF---IHFPTTREEIRKEQLKFFNIARFPKVIGCIDCT 136
Query: 122 MVLIDA 127
+ + +
Sbjct: 137 QMRVQS 142
>gi|301092211|ref|XP_002996965.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262112227|gb|EEY70279.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 207
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 87/189 (46%), Gaps = 20/189 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKV------LSCLHWMGVGSPYHI 59
F+ FR+SA +L ++ PALT +++ S + V ++ L ++ GS Y
Sbjct: 8 FERTFRMSAESFSHLLWKVTPALTVSEQQSSNSSGEAPVSPSIMLMTALRYLAEGS-YLN 66
Query: 60 IGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP----EENLNDIPMTFLRKGG-FPNV 114
I T G S S R I + I+ F + + +P E+ + + GG
Sbjct: 67 IRRTVGISAPSYYRVIDLTMGAIL-AFKELQITFPNSDSEKGVVMEAFKNISSGGVISGC 125
Query: 115 AGCVDGTMVLIDAPNEHEDQFVN----RNGDHA---LNVMAICGPNYKFYAVNANWPGSV 167
GCVDG + I P + + +G +A +NV A+C + +F A A++PGS+
Sbjct: 126 IGCVDGWLCRIKTPTLADAGEIGVARYYSGHYACPGINVQAVCDAHCRFIAFEASFPGSI 185
Query: 168 HDARVMRNS 176
+DA R++
Sbjct: 186 NDAHAFRDT 194
>gi|37624211|gb|AAQ95642.1| transposase [Sorghum halepense]
Length = 118
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P H F+ R NVMA + +F V A W GS HDAR++ +
Sbjct: 1 GAMDGTHVLARVPERHRAAFMGRKHTTTQNVMAAVDFDLRFTYVLAGWEGSAHDARILAH 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ ++G + L D+ Y ++ +PP
Sbjct: 61 ALE----KDDGLKVPTGKFYLVDAGYAVRPGFLPP 91
>gi|7523685|gb|AAF63124.1|AC009526_9 Hypothetical protein [Arabidopsis thaliana]
Length = 408
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 77/178 (43%), Gaps = 9/178 (5%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP-EE 96
+S +E V L G Y +G G ++ ++ R EV+ + ++ P +
Sbjct: 96 ISIEESVAMFLRICGHNEVYRDVGLRFGRNQETVQRKFREVLTA-TELLACDYIRTPTRQ 154
Query: 97 NLNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGP 152
L IP F G +DGT V + + + + NR+ + +LN+MAIC
Sbjct: 155 ELYRIPERLQVDQRYWPYFSGFVGAMDGTHVCVKVKPDLQGMYWNRHDNASLNIMAICDL 214
Query: 153 NYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
F + PGS +D V++ + ++ D E P + L DS YP K+ L+ P
Sbjct: 215 KMLFTYIWNGAPGSCYDTAVLQ---IAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAP 269
>gi|195146504|ref|XP_002014224.1| GL19055 [Drosophila persimilis]
gi|194106177|gb|EDW28220.1| GL19055 [Drosophila persimilis]
Length = 387
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/281 (22%), Positives = 109/281 (38%), Gaps = 29/281 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERN--VALSAKEKVLSCLHWMGVGSPYHIIGAT 63
FKE FR+ + E +L+ + A+ + ++S EK+L L +G G + G
Sbjct: 68 FKERFRMQRSTFETLLQVVGRAIAGAEQYQPIGSVSLPEKLLYTLMLLGGGISFRDAGEI 127
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRM---VKWPEENLNDIPMTFLRKGGFPNVAGCVDG 120
+ SKS S HE+ + F M V+WP+ + + + K P V G ++
Sbjct: 128 YSISKS----SGHEIFKWVTSAFSALMPCYVQWPKYGAAGVGSSSISK--LPGVLGVIEE 181
Query: 121 TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYR 180
+ + P +D N AL + A+C +F V + P ++ + + +
Sbjct: 182 CRIPLKLPIRIDDD--NPKQYPALTLQAVCDAQSRFLDVYVDAPNDDQHCILLASPLFEK 239
Query: 181 KFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLI------ 234
+ I+GD YPL L+ P CH + I
Sbjct: 240 LIHTQTPLMAEHRHIVGDKTYPLLLNLMTPYV--------GELTPCHAQYNLAIRLWNAP 291
Query: 235 -ENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHNIAL 273
E + L +F L+ + +S + V+ A+ LHN L
Sbjct: 292 AERALATLISRFKRLDSLDVSSLELGTTVISAACMLHNFIL 332
>gi|331214498|ref|XP_003319930.1| hypothetical protein PGTG_00842 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 489
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 91/187 (48%), Gaps = 23/187 (12%)
Query: 117 CVDGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
++GT + L P + + +R ++++V IC N KF + A +PGS HDA V +
Sbjct: 40 SMNGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISYLAGFPGSSHDAYVFSH 99
Query: 176 SMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPP-RTRNPDAAADQRFLRCHKRT 230
V ++ F+++ + +L DSAY +++P + + + F ++
Sbjct: 100 MQVAQQPEKYFNQKQF-------LLADSAYTNDRYVVPAFKGKQLLKRRNINFNYHLAQS 152
Query: 231 RRLIENSFGILKEKFPCLNYMRL----SPQFAGMV--VLASTTLHNIALTIDRDNENQNV 284
R IE++ GILK +F L +R + + G V ++ LHN+ D ++Q
Sbjct: 153 RVRIEHAIGILKGRFASLQEIRTQIRNAEEMKGAVKWIVTCIVLHNLLA----DLKDQWN 208
Query: 285 EIHDDEA 291
++++DEA
Sbjct: 209 DLYEDEA 215
>gi|328724322|ref|XP_003248105.1| PREDICTED: hypothetical protein LOC100569919 [Acyrthosiphon pisum]
Length = 404
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 89/207 (42%), Gaps = 22/207 (10%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
PN G +DG + I AP F N G ++ +MA + KF ++ G D+
Sbjct: 163 LPNCVGSIDGKHIRIKAPVNSGSSFFNYKGYFSIVLMATADADGKFITIDVGEYGRNSDS 222
Query: 171 RVMRNS----MVYRKF----------DEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPD 216
+V++ S ++++K EE FP + D A+PL + ++ P R
Sbjct: 223 KVLKESAFGQLLFKKKLNLPENACLPHEENDPTFP-YYFVADEAFPLLDNVMRPYPRRSL 281
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNI---- 271
+ F R R+ +E +FG++ KF L + + +V+ A LHN+
Sbjct: 282 TNTKRIFNYRLSRGRKSVECAFGMMASKFRILERPINFKTEKIEIVIKAICVLHNLIRVH 341
Query: 272 --ALTIDRDNENQNVEIHDDEAEDDGE 296
+I + + NV++ ++ E + E
Sbjct: 342 DGTYSIPQGIQQDNVDLLYEDTEGNIE 368
>gi|297608688|ref|NP_001061974.2| Os08g0459400 [Oryza sativa Japonica Group]
gi|255678502|dbj|BAF23888.2| Os08g0459400 [Oryza sativa Japonica Group]
Length = 705
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 109/262 (41%), Gaps = 25/262 (9%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV----DTFFQRM 90
+ +S +E++ LH +G +IG S ++ R +EV+ + D R
Sbjct: 127 TIHVSVEEQLAMFLHTVGHNLRNRVIGFYVIRSSETVSRYFNEVLKALCCLAKDMIQIRS 186
Query: 91 VKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAIC 150
++ + +++ P F F + G +DGT + P ++F R NV+A
Sbjct: 187 IETHSKIVSN-PGRFYPY--FEDCIGALDGTHIPAFVPENIVNRFRGRKSYPTQNVLAAV 243
Query: 151 GPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
+ +F V W GS HD+ V++ ++ R+ G + L D+ Y + ++PP
Sbjct: 244 DFDLRFTYVLVGWEGSAHDSVVLKAAL--RR--SSGIQIPEGKYYLADAGYAARPGILPP 299
Query: 211 ------------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QF 257
++P+ + LR H R +E +FG LK +F L P +
Sbjct: 300 FRGVRYHLKEYEGGKSPETPQELFNLR-HSSLRTSVERAFGTLKNRFKVLASKPFFPYKV 358
Query: 258 AGMVVLASTTLHNIALTIDRDN 279
+V+A LHN L DN
Sbjct: 359 QVKIVIACCVLHNWILDNGPDN 380
>gi|449443271|ref|XP_004139403.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449494034|ref|XP_004159428.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 447
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 104/251 (41%), Gaps = 33/251 (13%)
Query: 54 GSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPN 113
G+ Y +G G + CRS + V I + + +++ I + F G+ +
Sbjct: 169 GASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLE--LRSDIDRIVVGF----GWIS 222
Query: 114 VAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
+ C G + L E E +NG +L V A+ +F V+A WP S+ A ++
Sbjct: 223 LPNCC-GVLGLRRFGFEGE----LKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPATIL 275
Query: 174 RNSMVYRKFDE------------EGWRPFPDAVILGDSAYPLKEWLIPPRT----RNPDA 217
R S +Y + ++ + +P P +I GDS +PL WL+ P +
Sbjct: 276 RQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLI-GDSCFPLLPWLLTPYMELNEEDSSG 334
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLN--YMRLSPQFAGMVVLASTTLHNIALTI 275
+ F H R L+ +F L+ ++ L+ + F ++L L N +
Sbjct: 335 FCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKC 394
Query: 276 -DRDNENQNVE 285
++ +E Q+ E
Sbjct: 395 SEKLDEEQDQE 405
>gi|357117845|ref|XP_003560672.1| PREDICTED: uncharacterized protein LOC100840686 [Brachypodium
distachyon]
Length = 221
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 35/62 (56%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + + P E + NR G + NVMA+C + +F V WPGSVHD
Sbjct: 160 FKDAIGAIDGTHIPVIVPAELKVIHTNRKGYTSQNVMAMCDFDMRFIFVVPGWPGSVHDT 219
Query: 171 RV 172
RV
Sbjct: 220 RV 221
>gi|347972217|ref|XP_003436859.1| AGAP013326-PA [Anopheles gambiae str. PEST]
gi|333469354|gb|EGK97264.1| AGAP013326-PA [Anopheles gambiae str. PEST]
Length = 329
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 111/279 (39%), Gaps = 28/279 (10%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNV-------ALSAKEKVLSCLHWMGV-GSPY 57
F FR+ V+ I + TT + + A+S + VLS L ++G + Y
Sbjct: 19 FLRQFRIGRKAVQMICNHLE---TTSAYKKLRGHGGYEAISPQTHVLSFLWFLGHDKTSY 75
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGC 117
+ S S + I V I+ M+ E + F +K F V GC
Sbjct: 76 RDVATQFNLSVSCLHSVICRVADAILSMKHILMIPLSEARKTASDIAFSKKCNFSGVIGC 135
Query: 118 VDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
V GT + ID P E+ ++++ G +++ + AI N +F V +P ++
Sbjct: 136 VGGTQIKIDKPRENPERYLLTTGHYSIQLQAIIDENLRFVDVFVEYPDEPELLETVKRIC 195
Query: 178 VYRKFDEEGWRPFPDAVILGDSAYP-LKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIEN 236
R +LG+++YP + + L+P Q+ H +T N
Sbjct: 196 GNR------------YCLLGNASYPCMNQLLVPYPVDEVLLTQAQKTYNDHLQTVTSQCN 243
Query: 237 S-FGILKEKFPCLNYMRLSPQFAGMVVL--ASTTLHNIA 272
F LK +F L++++ + MV L LHN+A
Sbjct: 244 QIFAHLKTRFRRLHHLK-GRHLSRMVDLIKVCCILHNLA 281
>gi|449693121|ref|XP_004213309.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 184
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/174 (22%), Positives = 71/174 (40%), Gaps = 3/174 (1%)
Query: 12 LSATQVEFILEEIAPALTTPT-ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSS 70
+S + + +L AP++T + + +S E++ L ++ G ++
Sbjct: 1 MSPERYKHLLSMAAPSITKKSCQSRQTISPSERLTVTLRYLATGDSQQTQSFYFRLDTTT 60
Query: 71 ICRSIHEVVPVIVDTFFQRMVKWPEEN--LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP 128
+C I+E I D +K PE + I F + FPN G + G V I+A
Sbjct: 61 VCSIINETTKAIWDVLQPSYLKAPESSDEWEKIANEFENEWNFPNCIGAIVGKHVCIEAS 120
Query: 129 NEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKF 182
+ N H++ + AIC Y F V+ G +DA + S + + F
Sbjct: 121 VSSGSAYYNNKNYHSMVLQAICDAKYCFTLVDIGSYGRDNDASIFNESKMGKAF 174
>gi|403418083|emb|CCM04783.1| predicted protein [Fibroporia radiculosa]
Length = 192
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 16/144 (11%)
Query: 119 DGTMV-LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKF-----YAVNANWPGSVHDARV 172
DGT + L + P ++ F ++ +A+N+ A+ N K YAV PGSVHDA
Sbjct: 33 DGTAINLAEKPGLFDELFYGKDKTYAINLQAVV--NVKTLQIVDYAVGP--PGSVHDATA 88
Query: 173 MRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAA---DQRFLRCHKR 229
+ + + D PD I DSAY L W++ P R P + +RF +
Sbjct: 89 FKETWLS---DSHNSLLSPDEWIWADSAYTLAHWVLTPFKRPPGGSLTPEQKRFNLWLSK 145
Query: 230 TRRLIENSFGILKEKFPCLNYMRL 253
R +E++ G+LK L +R+
Sbjct: 146 IRIAVEHAIGLLKGCLESLKELRI 169
>gi|255579925|ref|XP_002530798.1| conserved hypothetical protein [Ricinus communis]
gi|223529653|gb|EEF31599.1| conserved hypothetical protein [Ricinus communis]
Length = 390
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 75/174 (43%), Gaps = 20/174 (11%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G VDG + + + + F ++NG + NV+A C + KF+ V A W GS D
Sbjct: 145 FKDCVGAVDGIHIPVMVGVDEQGPFRDKNGLLSQNVLAACSFDLKFHYVLAGWEGSASDL 204
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD---------- 220
RV+ +++ R + P D + DS Y I P P + +
Sbjct: 205 RVLNSALKRRN---KLMVP-EDKYYVVDSKYANLPGFIAPYNGIPHRSDEYSSAYHPQDP 260
Query: 221 -QRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP---QFAGMVVLASTTLHN 270
+ F + H R + FG LK +FP L M P Q +V+A+ +HN
Sbjct: 261 RELFNQRHSLLRNATDRIFGALKARFPIL--MSAPPYPLQTQVKLVVAACAIHN 312
>gi|345484344|ref|XP_003425010.1| PREDICTED: hypothetical protein LOC100679846 [Nasonia vitripennis]
Length = 207
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 72/180 (40%), Gaps = 1/180 (0%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F + R+ ++L + P L ++ A+SA++++ L ++ G I ++
Sbjct: 8 FFQYVRMDVDTFNYLLSMVEPYLRNRKKKRDAISAEQRLTITLRYLASGDDILSIAISYR 67
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPEENL-NDIPMTFLRKGGFPNVAGCVDGTMVL 124
K ++ I VI V+ P + +I F + P G VDG +
Sbjct: 68 LGKPTVYSIIKRTCSVIHQVLSPIYVRPPCTPVWRNIIKGFEQDWQLPGCIGAVDGKHIC 127
Query: 125 IDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
I +P + F N H++ ++A C Y+F V GS D + S + DE
Sbjct: 128 IQSPPDSGALFYNYKKFHSIALLAACNHRYEFTVVEVGAYGSESDGGIFLRSEFGKSLDE 187
>gi|270010059|gb|EFA06507.1| hypothetical protein TcasGA2_TC009406 [Tribolium castaneum]
Length = 505
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 3/83 (3%)
Query: 193 AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL---N 249
A GDS YPL+ WL+ P + ++RF H R LIE G+LK +F CL
Sbjct: 366 AYYKGDSGYPLRPWLLTPFLNVNPGSPEERFNSRHNSVRSLIERCNGVLKMRFRCLFKHR 425
Query: 250 YMRLSPQFAGMVVLASTTLHNIA 272
+ +P A +V A LHNI
Sbjct: 426 ALHYAPLKATKIVNACVVLHNIC 448
>gi|356532808|ref|XP_003534962.1| PREDICTED: uncharacterized protein LOC100781654 [Glycine max]
Length = 355
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 66/158 (41%), Gaps = 33/158 (20%)
Query: 20 ILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVV 79
IL+E + T +NV + E V LH + Y ++ ++ S +I R V+
Sbjct: 95 ILQEKGQLVKT---KNVPID--EAVAMFLHILAHNLKYRVVHFSYCRSMETISRQFKNVL 149
Query: 80 PVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRN 139
I+ N G +DG + + E ++ NR
Sbjct: 150 RAIM----------------------------KNSIGELDGIHIPVTVSAEDRPRYRNRK 181
Query: 140 GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSM 177
GD + NV+ +CGP+ +F V W GS D+RV+R+++
Sbjct: 182 GDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLRDAL 219
>gi|357140827|ref|XP_003571964.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 272
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 68/173 (39%), Gaps = 39/173 (22%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F N G +DGT V + P + V R G NVM +C + +F +V A WPG +
Sbjct: 69 FNNCIGAIDGTHVPVIVPAKDTVNHVGRAGYATHNVMGVCDFDMRFISVVAGWPGKFY-- 126
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP----RTRNPD----AAADQR 222
L DS YP + + P R P+ AAA R
Sbjct: 127 -------------------------LVDSGYPNQTGYLEPFRGTRYHLPEYQHGAAASGR 161
Query: 223 ---FLRCHKRTRRLIENSFGILKEKFPCLNYM-RLSPQFAGMVVLASTTLHNI 271
F H R +IE F +LK+++ L + P+ +++A LHN
Sbjct: 162 KEIFNYLHSSLRNVIERCFAVLKQRWRMLKAVPSYPPRKQARIIIACMALHNF 214
>gi|432902557|ref|XP_004077014.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 242
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 192 DAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYM 251
+ +LGD YP +L+ P + +F H RTR +E + GILK +F CL +
Sbjct: 121 NGYLLGDRGYPCLPFLMTPYPET-QPGPEAQFNLAHSRTRARVEMTTGILKARFQCLRGL 179
Query: 252 RLSPQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEA 299
R++P+ ++ A LHNIA + E E+DG E+A
Sbjct: 180 RVTPERPCDIIAACAVLHNIA--------TMSGERAPPPLEEDGPEDA 219
>gi|340387209|ref|XP_003392100.1| PREDICTED: hypothetical protein LOC100642039, partial [Amphimedon
queenslandica]
Length = 209
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 67/146 (45%), Gaps = 9/146 (6%)
Query: 10 FRLSATQVEFILEEIAPAL-----TTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATH 64
R+S + +F+L ++AP+L T+ N+ S EK+ L ++ G + +
Sbjct: 66 LRMSREKFDFLLSQVAPSLKRRHYTSSIRPNI--SPAEKLTLTLRFLAAGVSQASLSFSF 123
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTM 122
S +S+C ++E I V P +E+ I F + FP+ G +DG
Sbjct: 124 RISSASVCNIVYETCEAIWSALQPMYVTAPTSKEDWLKISKEFEQLWNFPHCIGAIDGKH 183
Query: 123 VLIDAPNEHEDQFVNRNGDHALNVMA 148
V+I AP ++ N G H++ ++A
Sbjct: 184 VVIQAPPNAGSEYYNYKGTHSIVLLA 209
>gi|46485870|gb|AAS98495.1| unknown protein [Oryza sativa Japonica Group]
Length = 1005
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G VDGT +L P+E + +++ R+ NVMAIC + +F + PGS+HD
Sbjct: 100 FKDCIGTVDGTHILAVVPDEEKIRYIGRSKSTTQNVMAICDHDMRFIYASIGQPGSMHDT 159
Query: 171 RVMRNSM 177
V+ N++
Sbjct: 160 TVLFNAL 166
>gi|356495041|ref|XP_003516389.1| PREDICTED: uncharacterized protein LOC100780713 [Glycine max]
Length = 450
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 62/144 (43%), Gaps = 33/144 (22%)
Query: 34 RNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKW 93
RNV + E V LH + Y ++ ++ SK +I R ++V+ V
Sbjct: 102 RNVPTT--EAVAMFLHILAHNLKYRVVQFSYCRSKETISRQFNDVLRV------------ 147
Query: 94 PEENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPN 153
G +DGT + + + ++ NR GD + NV+A CGP+
Sbjct: 148 -------------------RCIGALDGTHIPVTISPDERPRYRNRKGDVSTNVLAACGPD 188
Query: 154 YKFYAVNANWPGSVHDARVMRNSM 177
+F V W GS D+RV+R+++
Sbjct: 189 LRFIYVLPGWEGSAGDSRVLRDAL 212
>gi|112382585|gb|ABI17502.1| transposase [Chimonobambusa tumidissinoda]
Length = 119
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P + F R G NVMA+ + KF V A W GS HDA ++ +
Sbjct: 1 GAMDGTHVLARVPTKVSAVFRGRKGTTTQNVMAVVDFDLKFTYVLAGWEGSAHDALILAD 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ ++G R L D+ Y + L+PP
Sbjct: 61 ALE----RDDGLRVPQGKYYLVDAGYAARPGLLPP 91
>gi|225456199|ref|XP_002282832.1| PREDICTED: uncharacterized protein LOC100262931 [Vitis vinifera]
gi|147776987|emb|CAN67849.1| hypothetical protein VITISV_033748 [Vitis vinifera]
Length = 390
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 82/189 (43%), Gaps = 24/189 (12%)
Query: 100 DIPMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYK 155
DIP L G F + G VDG + + + + F N++G + NV+A C + K
Sbjct: 131 DIPPEILEDGRFYPYFQDCVGAVDGIHIPVMVGVDEQGPFRNKSGFLSQNVLAACSLDLK 190
Query: 156 FYAVNANWPGSVHDARVMRNSMVYRKFDE--EGWRPFPDAVILGDSAYP-LKEWLIP--- 209
F+ + A W GS D RV+ +++ R + EG L D+ Y + ++ P
Sbjct: 191 FHYILAGWEGSAADLRVLNSALTRRNKLQIPEG------KYYLVDTKYANMPGFIAPYPG 244
Query: 210 -PRTRNPDAA------ADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMV 261
P N +A A + F H R + FG LK +FP L P Q +
Sbjct: 245 VPYHLNESSAGFHPQDAKELFNYRHFLLRSATDRIFGALKARFPILMSAPPYPLQTQVKL 304
Query: 262 VLASTTLHN 270
V+A+ +HN
Sbjct: 305 VVAACAIHN 313
>gi|449526788|ref|XP_004170395.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101221521
[Cucumis sativus]
Length = 392
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 74/172 (43%), Gaps = 16/172 (9%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DG + + + + F N+NG + V+A C + KF+ V A W GS D
Sbjct: 147 FKDCXGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL 206
Query: 171 RVMRNSMVYRK----------FDEEGWRPFPDAVI-LGDSAYPLKEWLIPPRTRNPDAAA 219
+V+ +++ R + + P V D Y KE+ P +P A
Sbjct: 207 QVLNSALTRRNKLHVPEGKYYLVDXKYMNMPGFVAPYHDITYQSKEY---PGGYHPQDAK 263
Query: 220 DQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSP-QFAGMVVLASTTLHN 270
+ LR H R E +F LK +FP L P Q +V+A+ +HN
Sbjct: 264 ELFNLR-HSLLRNATERTFEALKARFPILLSAPPYPLQTQVKLVVATCAIHN 314
>gi|348525833|ref|XP_003450426.1| PREDICTED: hypothetical protein LOC100702467 [Oreochromis
niloticus]
Length = 873
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 9/110 (8%)
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEW---LIPPRTRNPDAAADQRFLRCH 227
RV+ NS YR + P P IL D YP + LI P R QRF H
Sbjct: 219 RVLHNSPQYR----QAIYPPPVHFILTDGGYPCLQHPLPLITPYKRPVQDVGAQRFNSHH 274
Query: 228 KRTRRLIENSFGILKEKFPC--LNYMRLSPQFAGMVVLASTTLHNIALTI 275
R R +IE++FG++K +F L + + F V+ A LHNI L +
Sbjct: 275 SRARSIIEHAFGMMKTRFHAIFLQALEVHHTFVPYVIAACAVLHNICLGV 324
>gi|331244786|ref|XP_003335032.1| hypothetical protein PGTG_16639 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309314022|gb|EFP90613.1| hypothetical protein PGTG_16639 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 332
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 72/156 (46%), Gaps = 14/156 (8%)
Query: 67 SKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGCVDGTMV-LI 125
S+S++ ++ +V+ +V + ++ V WP ++ GF G VDGT + L
Sbjct: 171 SRSTVIKATQQVIEALV-SLGRQYVMWPNTTRWAEISEVMKNEGFVGCVGFVDGTTIPLF 229
Query: 126 DAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY----RK 181
P + F +R +++N IC + + WPG+ D+ V + ++ +
Sbjct: 230 QRPGYDGEVFYDRKRRYSINAQIICDCDKYITSFITGWPGTCGDSLVYKRMQLHLNPSQY 289
Query: 182 FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDA 217
FD+ + +L DSAY + + ++P +NP A
Sbjct: 290 FDDGQY-------LLADSAYEVSKTVVPA-YKNPSA 317
>gi|357167736|ref|XP_003581308.1| PREDICTED: uncharacterized protein LOC100844952 [Brachypodium
distachyon]
Length = 550
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 86/225 (38%), Gaps = 14/225 (6%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
++ +S +E+V L+ +G +IGA S ++ R V+ I + + +
Sbjct: 25 SIHVSVEEQVAMFLNTIGHNLRNRVIGANFDKSNETVSRYFRLVLHAIGELRAEYISPPS 84
Query: 95 EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
E I F + G +DGT V + + F R NVMA +
Sbjct: 85 LETPAKIAGNPRFDPYFKDCIGAIDGTHVRASVSKDMKPSFRGRKSFPTQNVMAAVDFDL 144
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---- 210
+F V A W G HDA ++ +++ E G L D+ Y K +PP
Sbjct: 145 RFTYVLAGWEGIAHDATILADAIE----RENGLYVPQGKFYLVDAGYGAKPGFLPPFRAV 200
Query: 211 ------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN 249
NP + F H R +E +FG LK +F L+
Sbjct: 201 RYQLNEWGSNPVQNEKELFNLRHSSLRVTVERAFGTLKRRFKILD 245
>gi|297795315|ref|XP_002865542.1| hypothetical protein ARALYDRAFT_917556 [Arabidopsis lyrata subsp.
lyrata]
gi|297311377|gb|EFH41801.1| hypothetical protein ARALYDRAFT_917556 [Arabidopsis lyrata subsp.
lyrata]
Length = 593
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 79/206 (38%), Gaps = 22/206 (10%)
Query: 24 IAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV 83
+ P L E++VA+ L G +G G ++ ++ R EV+
Sbjct: 355 LQPTLNVSIEKSVAM--------FLRICGHNEVQRDVGLRFGRTQETVNRKFFEVLRA-T 405
Query: 84 DTFFQRMVKWP-EENLNDIPMTFLRKGG----FPNVAGCVDGTMVLIDAPNEHEDQFVNR 138
+ +K P + L IP R F G +DG V + E + + NR
Sbjct: 406 ELLACDYIKTPTRQELRRIPKKLQRDRRYWPYFSGFVGAIDGVHVCVKVKPELQGMYWNR 465
Query: 139 NGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGD 198
+ + N+MAIC N F V PGS HD V+ + + P D + D
Sbjct: 466 HDRTSFNIMAICDINMLFTYVWNGAPGSCHDTAVLTMA--------QDLLPPRDKYYVVD 517
Query: 199 SAYPLKEWLIPPRTRNPDAAADQRFL 224
S YP K+ + P + D+ F+
Sbjct: 518 SGYPNKQGFLAPYRSSISNYFDEDFV 543
>gi|77556456|gb|ABA99252.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 524
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 61/149 (40%), Gaps = 20/149 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +DGT + E F R NVMA + +F V A W G+ HDA
Sbjct: 78 FKDCIGAIDGTHIRASVRKNVESSFRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDA 137
Query: 171 RVMRNSMVY----------RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAAD 220
V+R+++ R ++G+ P AV Y L EW NP
Sbjct: 138 VVLRDALERENGLRVPQGNRLQAKQGFLPPFRAV-----RYHLNEW-----GNNPVQNEK 187
Query: 221 QRFLRCHKRTRRLIENSFGILKEKFPCLN 249
+ F H R +E +FG LK +F L+
Sbjct: 188 ELFNLRHSSLRVTVERAFGSLKRRFKVLD 216
>gi|328726543|ref|XP_003248942.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 127
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 88 QRMVKWPEEN-----LNDIPMTFLRKGGFPNVAGCVDGTMVLIDAP------NEHEDQ-- 134
+ +KWP N L D LR FP +DG + I P H DQ
Sbjct: 6 HKFIKWPRGNAIHQTLQD--FKHLRPKAFPGAFMALDGCHIKIPTPWNKRTRMRHVDQRC 63
Query: 135 FVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMV 178
++NR ++ + IC N KF + A WPG+ HDAR+ R S +
Sbjct: 64 YINRKHVASVVLQGICDSNQKFTNIFAGWPGASHDARIFRKSNI 107
>gi|308814097|ref|XP_003084354.1| unnamed protein product [Ostreococcus tauri]
gi|116056238|emb|CAL58419.1| unnamed protein product [Ostreococcus tauri]
Length = 481
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 51/236 (21%), Positives = 96/236 (40%), Gaps = 21/236 (8%)
Query: 33 ERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSI-HEVVPVIVDTFFQRMV 91
ER+ A K K++ L ++ G P+ + K ++ ++ +P ++D + V
Sbjct: 142 ERSTACPLKIKLIGALLYLAYGCPWDAVENQVNVDKRTLRDWFWNKFIPWMLDREYPNHV 201
Query: 92 KWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNE-HEDQFVNRNGDHALNVMA 148
++P EE L + + GFP GC DG V A + + +G +L
Sbjct: 202 RYPQSEEELRQLLWPY-EHAGFPGAIGCTDGVHVPYGAYRSGRRYMYADHHGGFSLGFNV 260
Query: 149 ICGPNYKFYAVNANWPGSVHD----------ARVMRNSMVYR------KFDEEGWRPFPD 192
C + + V+ G+ +D + +MR +Y + ++ +
Sbjct: 261 TCDYHGRIIYVSEWSAGNTNDKTKINDDHFQSVIMREDPLYTQQRSIVRVSDDAFETIQG 320
Query: 193 AVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
+L D Y + R ++ D ++ R H+ R+ IE FGI+K+KF L
Sbjct: 321 VYVLSDDGYHRWPTTVSSIKRPIPSSWDAKWTRWHESLRKEIECVFGIMKKKFAIL 376
>gi|343429532|emb|CBQ73105.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 324
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 112/274 (40%), Gaps = 32/274 (11%)
Query: 65 GPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKG--GFPNVAGCVDG 120
G S S+ + V + D + W EE L + G + + VDG
Sbjct: 9 GLSIGSVVNYTNNVSSALFD-LRDSAIAWASEEEKLRSKAWVLSKSGVTAWQHGFSMVDG 67
Query: 121 TMV-LIDAPNEHEDQFVNRNGDHALNVMAICGP-NYKFYAVNANWPGSVHDARVMRNSMV 178
T+V L AP D+ +R +++LN+ + P N + A PGS HD+ S +
Sbjct: 68 TLVPLQFAPGM--DRHFDRKKNYSLNIQLVILPHNLRVIEYVAGLPGSTHDSSAFAKSDI 125
Query: 179 Y---RKFDEEG-WRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQR--FLRCHKRTRR 232
+ R++ G W I D Y L E + P + A+++ F R
Sbjct: 126 FKNPRRYLTRGEW-------IWADLGYSLSEHFVSPYGHSASLASEEFRPFNTAVSNVRI 178
Query: 233 LIENSFGILKEKFPCLNYMR------LSPQFAGMVVLASTTLHNIALTIDRDNENQNVEI 286
E++ G LK +F L +R S A +V+A+ HN+A+ D D + I
Sbjct: 179 RSEHAIGYLKGRFQALKGLRNMIRDKESQAHASKLVVAAIAAHNLAIPFD-DVDEYMFYI 237
Query: 287 HDDEAEDDGEEEAV---DDPDEIDEEEQPRQDRN 317
D + + EA +D +++ EEQ Q ++
Sbjct: 238 DDGQGSEQIAREAQQQREDANQLYLEEQEFQQQH 271
>gi|357116843|ref|XP_003560186.1| PREDICTED: uncharacterized protein LOC100823720 [Brachypodium
distachyon]
Length = 259
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 38/67 (56%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F G +DGT + + P + + R+G + NV+A+C + +F A WPGSVHD
Sbjct: 168 FRGAIGAIDGTHIPVIVPTDETVNYTCRHGYTSQNVLAMCDFDMRFIFAVAGWPGSVHDT 227
Query: 171 RVMRNSM 177
RV+ +++
Sbjct: 228 RVLNHAL 234
>gi|356567246|ref|XP_003551832.1| PREDICTED: uncharacterized protein LOC100807146 [Glycine max]
Length = 470
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 74/180 (41%), Gaps = 25/180 (13%)
Query: 109 GGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVH 168
G + G +DGT + + +R+G+ + NV+A C + +F V + W GS H
Sbjct: 233 GTIADCIGAIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAH 292
Query: 169 DARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNP 215
D++V+ +++ + + L D +P + + P +P
Sbjct: 293 DSKVLSDALARKN----ELKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDP 348
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ + LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 349 ENEKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 404
>gi|348665556|gb|EGZ05385.1| hypothetical protein PHYSODRAFT_376748 [Phytophthora sojae]
Length = 209
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 88/201 (43%), Gaps = 8/201 (3%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
F++ R + + + P N S + +V + L+++ + T G
Sbjct: 8 FQDNMRFTQASFHKLCALLRPLRRKQPATNTKHSFETRVATMLYYVASMGGFRETAQTFG 67
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
SKS +++E++ I D + +R + P E+ + + F + G P V VDGT++
Sbjct: 68 VSKSWAITAVNELLTAIADDY-RRYIYSPSSLEDWSALQEAFEIRAGVPLVCAAVDGTLI 126
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
+ A E+ + + R G A+N+ + P+ + + + PGS D ++ S + D
Sbjct: 127 EL-ARFENYEGWYCRKGYPAVNLQVVVDPHQRIMSFDMR-PGSWSDKKIWAASHFGQHID 184
Query: 184 EEGWRPFPDAVILGDSAYPLK 204
G P P ILGD Y L
Sbjct: 185 --GILP-PGGFILGDKGYTLS 202
>gi|147767136|emb|CAN67325.1| hypothetical protein VITISV_012830 [Vitis vinifera]
Length = 942
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 106/258 (41%), Gaps = 28/258 (10%)
Query: 42 EKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLN 99
++V L + G IG G + S++ + V ++ + + +WP E +
Sbjct: 595 DQVAVALRRLSSGDSLLTIGDAFGLNHSTVSQVTWRFVEIMEERALHHL-QWPSTEPEIT 653
Query: 100 DIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
+I F + G PN G +D T +++ P+ + +++ + CGP + +
Sbjct: 654 EITSKFEKIRGLPNCCGAIDTTHIMMCLPSA--------DSANSVWLDXXCGPRDEV-SR 704
Query: 160 NANWPGSVHDARVMRNSMVY-RKFDEEGWRPFPDAV-----------ILGDSAYPLKEWL 207
++ W ++ + K E+G R + I+GDS YPL WL
Sbjct: 705 HSYWMARQNEGLFSAPEFKFPFKLCEKGQRLNGKKIELAEGSEIXEYIVGDSGYPLLPWL 764
Query: 208 IPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQFAGM--VVLAS 265
+ P + + F R H TR + + + LKE + + + P + ++L
Sbjct: 765 VTPYQGKELSESKAEFNRRHFATRMVAQRALARLKEMWKVIQGVMWRPDKNRLPRIILVC 824
Query: 266 TTLHNIALTIDRDNENQN 283
LHNI ID ++E Q+
Sbjct: 825 CLLHNI--VIDLEDEVQD 840
>gi|37624203|gb|AAQ95638.1| transposase [Coix aquatica]
Length = 119
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P + F R D NVMA+ + KF V A W GS HDA ++ +
Sbjct: 1 GAMDGTHVLARVPRRMQQAFRGRKKDPTQNVMAVVDFDLKFTYVLAGWEGSAHDAHILAD 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ E+G+ L D+ Y + +PP
Sbjct: 61 AIE----REDGFTMPEGKCYLVDAVYACRSGFLPP 91
>gi|331227223|ref|XP_003326280.1| hypothetical protein PGTG_08110 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 335
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 86/202 (42%), Gaps = 22/202 (10%)
Query: 101 IPMTFLRKGGFPNVAGCVDG-TMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAV 159
I + +R+ GF G VD T+ L P + + +R ++++V IC N KF +
Sbjct: 114 IAVAVMREEGFLGCVGFVDERTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISY 173
Query: 160 NANWPGSVHDARVMRNSMVYRK----FDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNP 215
A +PGS HD V V ++ FD++ + +L DSAY +++P
Sbjct: 174 LAGFPGSSHDDYVFLRMQVAQQPEKYFDQKQF-------LLADSAYTNDRYVVPAFKGKQ 226
Query: 216 DAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGMV--VLASTTLH 269
H R+ GILK +F L +R + + G V ++ LH
Sbjct: 227 LLKRRNINFNYHLAQSRVRIEPIGILKGRFASLQEIQTQIRNAEEMKGAVKWIVTCIVLH 286
Query: 270 NIALTIDRDNENQNVEIHDDEA 291
N+ D ++Q +++DE
Sbjct: 287 NLL----ADLKDQWNNLYEDEG 304
>gi|255555409|ref|XP_002518741.1| conserved hypothetical protein [Ricinus communis]
gi|223542122|gb|EEF43666.1| conserved hypothetical protein [Ricinus communis]
Length = 445
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/328 (20%), Positives = 138/328 (42%), Gaps = 32/328 (9%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAPALTT--PTERNVALSAKEKVLSCLHWMGVGSPYHII 60
P +F+ F++ A+ E++ + P L P ++LSA+ ++ L + GS Y I
Sbjct: 107 PDSFRTFFKMKASTFEWLSGLLEPLLDCRDPIGSPLSLSAELRLGVGLFRLATGSNYSEI 166
Query: 61 GATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEE-NLNDIPMTFLRKGGFPNVAGCVD 119
G ++S+ R + + ++ T F+ V +P L + F + G PN G +D
Sbjct: 167 ADRFGVTESA-ARFCAKQLCRVLCTNFRFWVSFPSPVELQSVSNAFEKLIGLPNCCGVID 225
Query: 120 GTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
+ + + ++ D + + + + ++ A + G ++R+++++ +Y
Sbjct: 226 SARFNLVKKADDKLASNGKDQDDMIAAQIVVDSSSRILSIVAGFRGEKGNSRMLKSTTLY 285
Query: 180 RKFDEEGWRPF---PDAV--------ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHK 228
+ D EG R P+ V ++G YPL WL+ P + +++F + +
Sbjct: 286 K--DIEGGRVLNSSPEIVNGVAINRYLIGGGRYPLLPWLMVPFLDALPGSCEEKFNKAND 343
Query: 229 RTRRLIENSFGILKEKFPCLNYMRLS----PQFAGMVVL--ASTTLHNIALTIDRDN--- 279
R + LK N+ LS +F V L A + LHN L + D+
Sbjct: 344 LMRVSSLRAIASLK------NWGVLSRPIQEEFKTAVALIGACSILHNALLMREDDSALL 397
Query: 280 ENQNVEIHDDEAEDDGEEEAVDDPDEID 307
+ +++ + + V+D ID
Sbjct: 398 DMGGYSLYNQQCSQHFMDAEVEDISRID 425
>gi|322784762|gb|EFZ11579.1| hypothetical protein SINV_08910 [Solenopsis invicta]
Length = 106
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 144 LNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPL 203
LN +C + + + W SVHDAR+ ++S + ++F EEG F ++LGD+ YP+
Sbjct: 14 LNTQVVCDHKMRIIDIVSRWSSSVHDARIFKHSRLRQRF-EEG--EFHSEILLGDNGYPI 70
Query: 204 KEWLIPP 210
K++L+ P
Sbjct: 71 KKYLLTP 77
>gi|405950053|gb|EKC18060.1| Protein FAM92B [Crassostrea gigas]
Length = 880
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 48/85 (56%), Gaps = 4/85 (4%)
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC 247
RP+ ++GD A+PL+ W++ P + ++ F R RR++E +FGIL ++ C
Sbjct: 713 RPY---FLIGDDAFPLRSWMMKPYSNRNMNNKERIFNYRLSRARRVVEKTFGILAYRWRC 769
Query: 248 -LNYMRLSPQFAGMVVLASTTLHNI 271
L M+ P ++V+A+ LHN+
Sbjct: 770 MLGTMQQDPYRVNLIVMAAICLHNL 794
>gi|357622695|gb|EHJ74117.1| hypothetical protein KGM_16059 [Danaus plexippus]
Length = 209
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 69/152 (45%), Gaps = 15/152 (9%)
Query: 136 VNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDA-- 193
+N ++ ++A+ YKF V+ G D +++S K G+ P
Sbjct: 3 INYKKFFSIVLLAVVDAKYKFVIVDVGAYGRNSDGGTLQSSKFGSKL-RNGFLCIPPEKA 61
Query: 194 ----------VILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCH-KRTRRLIENSFGILK 242
V + D A+PL E ++ P + +R R RR++EN+FG+L+
Sbjct: 62 LPHSTQKLPNVFVADEAFPLTENIMRPYPSHLLNDEKKRIFNYRLSRERRIVENAFGMLQ 121
Query: 243 EKFPCLNY-MRLSPQFAGMVVLASTTLHNIAL 273
E+F +++ P++ ++LAST LHN +
Sbjct: 122 ERFELFQKGIKVQPKYINNIILASTCLHNFII 153
>gi|356518973|ref|XP_003528149.1| PREDICTED: uncharacterized protein LOC100782123 [Glycine max]
Length = 467
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/293 (21%), Positives = 116/293 (39%), Gaps = 29/293 (9%)
Query: 45 LSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMT 104
+ + +G + Y +I T G S+ + + ++++ + MV+ I +
Sbjct: 168 IGTIAIVGQNTRYCVIRNTFGQSQFATSENFYKILKALNSLAPDLMVRPGSTVPAKIRES 227
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F + G +DGT + + + +G+ + NV+A C + +F V + W
Sbjct: 228 TRFYPYFKDCIGAIDGTHIPASVKGRDVSSYRDHHGNISQNVLAACNFDLEFMYVLSGWE 287
Query: 165 GSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------R 211
GS HD++V+ +++ + G + L D +P + +
Sbjct: 288 GSAHDSKVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLASYRGVRYHLQDFAGH 343
Query: 212 TRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTT 267
+P+ + LR H R +IE FGI K +F + +P F +VLA
Sbjct: 344 GNDPENEKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAA 399
Query: 268 LHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE---IDEEEQPRQDRN 317
LHN R +E VE D+ + +D D + +EQ R+D N
Sbjct: 400 LHNFLRKECRSDEFP-VEPTDESSSSSSVLPNYEDNDHEPIVQTQEQEREDAN 451
>gi|390363979|ref|XP_003730494.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 141
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 48/98 (48%), Gaps = 7/98 (7%)
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIE 235
S + R F +R ++LGDS YPL WL+ P NPD + + H RTR IE
Sbjct: 2 SRLGRAFANRRFR----GILLGDSGYPLLPWLMTP-VINPDTQEHRAYNSAHVRTRVKIE 56
Query: 236 NSFGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNI 271
G LK KF CL + L P A V++A L N+
Sbjct: 57 MVNGQLKNKFQCLIGRGLNLIPSRACDVIVACCVLFNL 94
>gi|357129889|ref|XP_003566592.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 333
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 64/163 (39%), Gaps = 22/163 (13%)
Query: 102 PMTFLRKGGF----PNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFY 157
P + LRK F N G V GT + + P + ++ + NR + N M C + KF
Sbjct: 167 PHSILRKPKFYPFFKNCIGAVHGTHIPMKLPLDQQEPYRNRKQTISQNAMVACNFDLKFV 226
Query: 158 AVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP------- 210
+N W GS DARV+++++ G+ L D+ Y + P
Sbjct: 227 HINPGWEGSASDARVLQDAL------NHGFEVPDGKFYLVDAGYANTPQFLAPYHGTRYH 280
Query: 211 -----RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL 248
R R + F H + R IE GI K +F L
Sbjct: 281 LNEQGRARQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFSIL 323
>gi|357115582|ref|XP_003559567.1| PREDICTED: uncharacterized protein LOC100824438 [Brachypodium
distachyon]
Length = 426
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/290 (19%), Positives = 110/290 (37%), Gaps = 22/290 (7%)
Query: 3 PFAFKEAFRLSATQVEFILEEIAP-----ALTTPTERNVALSAKEKVLSCLHWMGVGSPY 57
P F+ FR++ I+ I L + S+ +K + + +G G+P
Sbjct: 78 PAMFRRRFRMAKHVFMRIVNGIRNYDDYFILKRDCTGTLGFSSLQKCTAAIRMVGYGAPA 137
Query: 58 HIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGGFPNVAGC 117
+ I S+S ++ +V F ++ P GFP + GC
Sbjct: 138 NAIDEYLRMSESVCVEAMLRFCTAVVKVFGSEYLREPNAEDTTRLKAIGEARGFPGMLGC 197
Query: 118 VDGT-MVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNS 176
+D + P + Q+ G+ + + A+ + + PGS +D V++ S
Sbjct: 198 IDCMHWEWKNCPYAWQGQYQGHVGEPTIILEAVASYDLWIWHSFFGMPGSHNDINVLQRS 257
Query: 177 MVYRK----------FDEEGWRPFPDAVILGDSAYPLKEWLIPPRT-RNPDAAADQRFLR 225
++ + FD G P+ L D YP +W +T NP F +
Sbjct: 258 PLFARLAEGNAPECNFDVNG-HPYTMGYYLADGIYP--QWATFVKTISNPQGNKRALFAQ 314
Query: 226 CHKRTRRLIENSFGILKEKFPCLN--YMRLSPQFAGMVVLASTTLHNIAL 273
+ R+ +E +FG+L+ ++ + P+ V+ A +HN+ +
Sbjct: 315 QQESARKDVERAFGVLQSRWAIVRGPARMWQPETLWEVMTACVIMHNMIV 364
>gi|308458847|ref|XP_003091754.1| hypothetical protein CRE_10682 [Caenorhabditis remanei]
gi|308255101|gb|EFO99053.1| hypothetical protein CRE_10682 [Caenorhabditis remanei]
Length = 486
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 89/197 (45%), Gaps = 18/197 (9%)
Query: 110 GFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHD 169
G P A VDG D P N + N + +C + + + G D
Sbjct: 276 GVPCFA-VVDGKHWRCDHPPNTGSLNRNYKSFFSFNSLFVCDNDTRIIYAQISELGVNSD 334
Query: 170 ARVMR----NSMVYRKFDEEGWRPFPDA------VILGDSAYPLKEWLIPPRTRNPDAAA 219
A++ R + ++ R G++ D+ ILGD+ + L + +I P +N A++
Sbjct: 335 AQLFRCGILDELMERAVRTVGYQKLGDSDVIMPPFILGDNGFGLSKHVITPYRQNQLASS 394
Query: 220 DQ-RFLRCHKRTRRLIENSFGILKEKFPCLNY-MRLSPQFAGMVVLASTTLHNIALT-ID 276
+F +R IEN FGIL KF + +RL P+ + ++++++ +HNI+L ++
Sbjct: 395 GHIKFNEIISGSRVKIENCFGILTSKFQIFSRNLRLDPKTSRALIVSASVIHNISLGPLE 454
Query: 277 RDNENQNVEIHDDEAED 293
EN + DDE D
Sbjct: 455 VSPENDD----DDECTD 467
>gi|356532794|ref|XP_003534955.1| PREDICTED: uncharacterized protein LOC100776307 [Glycine max]
Length = 491
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/178 (24%), Positives = 75/178 (42%), Gaps = 26/178 (14%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
PN+ +DGT + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 259 IPNL-DVIDGTHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 317
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P +P+
Sbjct: 318 KVLSDALARKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 373
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRLSPQF----AGMVVLASTTLHNI 271
+ LR H R +IE FGI K +F + +P F +VLA LHN
Sbjct: 374 EKELFNLR-HASLRNVIERIFGIFKSRFTIF---KSAPPFLFKTQAELVLACAALHNF 427
>gi|397624813|gb|EJK67522.1| hypothetical protein THAOC_11427, partial [Thalassiosira oceanica]
Length = 285
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 83/207 (40%), Gaps = 24/207 (11%)
Query: 91 VKWPEENLND--IPMTFLRKG--GFPNVAGCVDGTMVLIDAPNEHEDQ---------FVN 137
+K+PE + F RK G G VDG +V I P+ +++ F
Sbjct: 4 IKFPETEAEQRAVAEGFRRKSEIGIDCCCGAVDGMLVWIHKPSTRDERVIGIGPKKFFCG 63
Query: 138 RNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW-RPFPDAV-- 194
R LN+ +C F V +PG+ D S + +K ++EG+ P ++
Sbjct: 64 RKKKFGLNMQGVCDSRGFFLDVEVRFPGAASDFFAFDESGLKKKVEKEGFLGSAPSSLFG 123
Query: 195 ----ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNY 250
+ GD+AY ++ P + D F H + R IE +FG+L ++ L
Sbjct: 124 GKLCLFGDNAYVQTPYMCTPWKAVSGGSKDA-FNFFHSQVRINIECAFGMLVHRWGILRR 182
Query: 251 ---MRLSPQFAGMVVLASTTLHNIALT 274
M + +VLA LHN +
Sbjct: 183 AIPMGIDVNKTTRLVLALCKLHNFCIA 209
>gi|218197272|gb|EEC79699.1| hypothetical protein OsI_20988 [Oryza sativa Indica Group]
Length = 295
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G VDGT +L P+E + +++ R+ NVMAIC + +F + PGS+HD
Sbjct: 205 FKDCIGTVDGTHILAVVPDEEKIRYIGRSKSTTQNVMAICDHDMRFIYASIGQPGSMHDT 264
Query: 171 RVMRNSM 177
V+ N++
Sbjct: 265 TVLFNAL 271
>gi|357116984|ref|XP_003560256.1| PREDICTED: uncharacterized protein LOC100824549 [Brachypodium
distachyon]
Length = 764
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 98/254 (38%), Gaps = 18/254 (7%)
Query: 35 NVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWP 94
++ +S +E+V + L+ +G +IGA S + R V+ I + + +
Sbjct: 286 SIHVSVEEQVATFLNTIGHNLRNRVIGANFDRSNERVSRYFRLVLHAIGELRAEYIRPPS 345
Query: 95 EENLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNY 154
E I F + G +DGT V + E F R NVMA +
Sbjct: 346 LETPAKIAGKPRFDPYFKDCIGAIDGTHVRASVSKDMEPSFRRRKSFPTQNVMAAVDFDL 405
Query: 155 KFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP---- 210
+F V A W G+ HDA ++ +++ E G L D+ Y K +PP
Sbjct: 406 RFTYVLAGWEGTAHDATILADAIER----ENGMYVPQGKFYLVDAGYGAKPEFLPPFRAM 461
Query: 211 ------RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLN----YMRLSPQFAGM 260
NP ++ F H R +E +F LK +F L+ + G
Sbjct: 462 RYHLNEWGSNPVQNENELFNLRHSSLRVTVERAFRTLKRRFKILDDATPFFSFPAMEGGG 521
Query: 261 VVLASTTLHNIALT 274
V++ + ++ + L+
Sbjct: 522 VIVWTNSMSAMMLS 535
>gi|449669996|ref|XP_004207170.1| PREDICTED: uncharacterized protein LOC101241006 [Hydra
magnipapillata]
Length = 180
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 58/128 (45%), Gaps = 20/128 (15%)
Query: 160 NANWPGSVHDARVMRNS------------MVYRKFDEEGWRPFPDAVILGDSAYPLKEWL 207
NA WPGS+HDA+V NS + YR+ G+R P VI GD AYPL +
Sbjct: 3 NAKWPGSLHDAKVFSNSTFNLKMISNSIPITYREL-LPGFRKVPCYVI-GDPAYPLSTFC 60
Query: 208 IPP--RTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCL-NYMRLSPQFAGMVVLA 264
+ +N D LR R IE +FG LK ++ L + L + V+ A
Sbjct: 61 LKEYLHCQNNDQVVFNTLLR---TARNQIECAFGHLKARWAILKKKIDLKLENIPTVIYA 117
Query: 265 STTLHNIA 272
+ LHN
Sbjct: 118 CSALHNFC 125
>gi|47207146|emb|CAF94629.1| unnamed protein product [Tetraodon nigroviridis]
Length = 388
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/277 (20%), Positives = 114/277 (41%), Gaps = 15/277 (5%)
Query: 5 AFKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGS-PYHIIGAT 63
A +E F LS ++FI + + T + +ALS + L++ G I+ T
Sbjct: 36 ALEETFHLSRPCLDFIRNAVLVRMKKCTLKKLALSVDVMLPVALNYYAHGDISADILQRT 95
Query: 64 HGPSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGT 121
C +I V ++ + + +P + + + G PNV G +
Sbjct: 96 MRVEGD--CPAIISAVSGVIAGMSDQFISFPLLQGARASVASKTEKMCGIPNVLGVLAPA 153
Query: 122 MVLIDAPNEHEDQFVNRN--GDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVY 179
+ + + FV+ + G ++ IC + +V GS + V +S
Sbjct: 154 HFEVQVCLKEKAAFVSVSAPGHASVVSQVICDLDGNILSVEKCCVGSTPEQEVWESSFKG 213
Query: 180 RKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFG 239
R+ ++E P+ +G S Y L + ++ P R+P D R+ H + +++ + G
Sbjct: 214 REVEKELHGPY---WFIGGSGYRLSKHVLTP-VRDPSKDKDMRYNEAHAKLYGVMQATLG 269
Query: 240 ILKEKFPCLNYMRLSPQFA----GMVVLASTTLHNIA 272
+K +F CL+ + + + + ++ A LHNIA
Sbjct: 270 HMKRRFRCLSDLGFTQKGSLDKKSNIIKACCVLHNIA 306
>gi|356554082|ref|XP_003545378.1| PREDICTED: uncharacterized protein LOC100781945 [Glycine max]
Length = 531
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/208 (21%), Positives = 85/208 (40%), Gaps = 18/208 (8%)
Query: 51 MGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEENLNDIPMTFLRKGG 110
+G + Y +I T G S+ + + H+++ + MV+ I +
Sbjct: 238 VGQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPY 297
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
F + G +D T + + +R+G+ + NV+A C + +F V + W GS HD+
Sbjct: 298 FKDCIGAIDATHIPASVKGRDVSSYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDS 357
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP-------------RTRNPDA 217
+V+ +++ + G + L D +P + + P +P+
Sbjct: 358 KVLSDALTRKN----GLKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPEN 413
Query: 218 AADQRFLRCHKRTRRLIENSFGILKEKF 245
+ LR H R +IE FGI K +F
Sbjct: 414 EKELFNLR-HASLRNVIERIFGIFKSRF 440
>gi|391326930|ref|XP_003737962.1| PREDICTED: uncharacterized protein LOC100902815 [Metaseiulus
occidentalis]
Length = 679
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 45/103 (43%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
FPN+ G +D V I P + +F N +L ++A+ NY F + G V+DA
Sbjct: 505 FPNMCGAIDSKNVNIRCPPKSGSEFFNYKKTFSLILLAVVDANYNFLYIGVGTDGRVNDA 564
Query: 171 RVMRNSMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPPRTR 213
V NS E V + D A+PL ++ P +R
Sbjct: 565 AVFANSSFNSALQAETLNLPSPGVFVADDAFPLNTSILKPFSR 607
>gi|322788481|gb|EFZ14142.1| hypothetical protein SINV_11342 [Solenopsis invicta]
Length = 122
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 3/95 (3%)
Query: 198 DSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPCLNYMRL---S 254
DS Y L+ WL+ P A ++R+ +C KRT +IE G+LK +F CL R+ +
Sbjct: 1 DSGYALRPWLLTPLNDVQPATPEERYNQCFKRTCSIIERCNGLLKMRFRCLLKHRVLHYT 60
Query: 255 PQFAGMVVLASTTLHNIALTIDRDNENQNVEIHDD 289
P A ++ + LHN+ + + + + ++H D
Sbjct: 61 PNVASKIINSCVVLHNMCVENNLPLPDDDYDLHMD 95
>gi|112382422|gb|ABI17439.1| transposase [Brachystachyum densiflorum]
Length = 119
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P + F R G NVMA + KF V A W GSVHDA ++ +
Sbjct: 1 GAMDGTHVLARVPTKVSAAFRGRKGTTTQNVMAAVDFDLKFTYVLAGWEGSVHDALILAD 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ ++G R L D+ Y + +PP
Sbjct: 61 ALE----RDDGLRVPQGKYYLVDAGYAARPGFLPP 91
>gi|224144073|ref|XP_002325176.1| predicted protein [Populus trichocarpa]
gi|222866610|gb|EEF03741.1| predicted protein [Populus trichocarpa]
Length = 421
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 75/179 (41%), Gaps = 24/179 (13%)
Query: 111 FPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDA 170
PN G + +D + RNG ++ V A+ +F ++A WP S+
Sbjct: 202 LPNCCGALGFGRFGVDG------EVFGRNG--SILVQALVDSEGRFLDISAGWPSSMEPV 253
Query: 171 RVMRNSMVYRKFDEEGWRPFPDA-------------VILGDSAYPLKEWLIPPRT-RNPD 216
+ R + +Y ++ ++GDS +PL WL+ P + ++
Sbjct: 254 AIFRQTRLYLGVEDSRESELLKGPTYMLSDGCSIPQYVMGDSCFPLLPWLLTPYSEQDSF 313
Query: 217 AAADQRFLRCHKRTRRLIENSFGILKEKFPCL--NYMRLSPQFAGMVVLASTTLHNIAL 273
+A++ F H R +L+ +FG +K ++ L + S +F V++ LHN +
Sbjct: 314 GSAEREFNVAHSRAMKLVSTAFGRVKARWQLLARRWKEDSLEFLPFVIVMGCVLHNFLI 372
>gi|112382615|gb|ABI17516.1| transposase [Yushania uniramosa]
Length = 119
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 4/95 (4%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT VL P + F R G NVMA+ + KF V W GS HDA ++ +
Sbjct: 1 GALDGTHVLARVPTKVSAAFRGRKGTTTQNVMAVVDFDLKFTYVLVGWEGSAHDALILAD 60
Query: 176 SMVYRKFDEEGWRPFPDAVILGDSAYPLKEWLIPP 210
++ ++G R L DS Y + +PP
Sbjct: 61 ALE----RDDGLRVPQGKYYLVDSGYAARPGFLPP 91
>gi|53749283|gb|AAU90142.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 256
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 82/194 (42%), Gaps = 26/194 (13%)
Query: 121 TMVLIDAPNEHEDQFV-------NRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVM 173
T V I P EH ++ N +++ + A+ F V PGS+ DA V+
Sbjct: 34 TSVRIVVPREHAGEYYDHRLTDRNNKATYSVAMQAVVDDEGAFTDVCIGHPGSLSDAAVL 93
Query: 174 RNSMVYRK--------FDEEGWR-PFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFL 224
S +Y + D+ GW+ P ++G ++YPL W++ P T+ A R
Sbjct: 94 AKSALYARCEAGLLLGHDKLGWQQPL---WLVGGASYPLTSWMLVPYTQPNQTWAQDRLN 150
Query: 225 RCHKRTRRLIENSFGILKEKFPCLNYMRLS-PQFAGMVVLASTTLHNIALTIDRDNENQN 283
R +F L+ ++ CL + P+ A M+ + LHN+ +R E +
Sbjct: 151 AHVADARAAAVGAFRRLRARWQCLRRAEVKLPELANMLAVC-CVLHNLC---ERIGEELD 206
Query: 284 VEIHDDEAE--DDG 295
+ DE E DDG
Sbjct: 207 ANLLHDELEVVDDG 220
>gi|321470242|gb|EFX81219.1| hypothetical protein DAPPUDRAFT_317897 [Daphnia pulex]
Length = 206
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 78/190 (41%), Gaps = 43/190 (22%)
Query: 128 PNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGW 187
P +H + NR ++ + +C + KF + WPGS HD+R++ S + R
Sbjct: 44 PIKHLAAYTNRKAVTSVKLQGVCDSSLKFMDIRCGWPGSKHDSRILEMSSLSRA------ 97
Query: 188 RPFPDAVILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRR-LIENSFGILKEKFP 246
QR+ +R LIE SF +LK K
Sbjct: 98 ---------------------------------QRYFNSRDSGQRCLIERSFRLLKGKRR 124
Query: 247 CLNYMRLSP-QFAGMVVLASTTLHNIALTIDRDNENQNVEIHDDEAEDDGEEEAVDDPDE 305
L Y+ ++ + +++ + LHN ID + E+ ++ + + E D + EA DD +E
Sbjct: 125 KLKYLDITNLDYIPFIIITACLLHN--FIIDIEGEDTDINGDESDDEGDSDNEADDDQNE 182
Query: 306 IDEEEQPRQD 315
DE E +D
Sbjct: 183 EDEAEVENRD 192
>gi|390367980|ref|XP_003731366.1| PREDICTED: uncharacterized protein LOC100889920 [Strongylocentrotus
purpuratus]
Length = 161
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 16/140 (11%)
Query: 147 MAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEGWRPFPDAV------------ 194
MA+ Y+F ++ + G DA++ NS + ++ + G PD
Sbjct: 1 MALVDSKYQFLWIDVDGVGHQSDAQIYNNSEL-KECIKAGTLGIPDPAPLPHDDEEHPMP 59
Query: 195 --ILGDSAYPLKEWLIPPRTRNPDAAADQRFLRCHKRTRRLIENSFGILKEKFPC-LNYM 251
+GD A+ ++ +++ P R + F R RR++EN+FGIL +F C L M
Sbjct: 60 YFFVGDYAFAMRTYMMKPYGRRNMDQQQKIFNYRLSRARRVVENAFGILALRFQCFLGQM 119
Query: 252 RLSPQFAGMVVLASTTLHNI 271
R P +++ + LHN+
Sbjct: 120 RQEPDTVRLLIEVAVMLHNL 139
>gi|147783763|emb|CAN70254.1| hypothetical protein VITISV_024382 [Vitis vinifera]
Length = 380
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 73/160 (45%), Gaps = 17/160 (10%)
Query: 34 RNVALS-AKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIV---DTFFQR 89
RN A S +E+V LH +G + S ++ R H+V+ I+ D F ++
Sbjct: 114 RNSAHSNVEEQVAKFLHIVGHNLRNRTMKFYFKRSSETVSRHFHQVLRAIISLDDVFLKQ 173
Query: 90 --MVKWPEE---NLNDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHAL 144
+K P+E N +P F + G +DG+ + N+ ++ R
Sbjct: 174 PDGLKCPQEIKDNTKFLPY-------FKDCIGAIDGSYFRVKVSNDVVQRYRGRKYYPTQ 226
Query: 145 NVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDE 184
NV+A C + KF V W GS D+R++ N+++ R FD+
Sbjct: 227 NVLAACSFDLKFTYVLLGWEGSASDSRILDNALM-RDFDK 265
>gi|328696821|ref|XP_003240140.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 132
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%)
Query: 38 LSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRSIHEVVPVIVDTFFQRMVKWPEEN 97
L +K S LH+ G+ +IG +KS++ R+I V I + Q + P
Sbjct: 2 LHCLQKYKSTLHYYATGTFQTVIGDHIRVNKSTVFRTIKRVSTAIARLYPQS-INMPRTA 60
Query: 98 L--NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEH-EDQFVNRNGDHALNVMAICGPN 153
L + + F + FP V G +D T + I +PN ++F NR G + NV IC N
Sbjct: 61 LERSIVQTGFYKIRNFPRVIGAIDCTHIRIQSPNSDIGERFRNRKGYFSFNVQVICNSN 119
>gi|449671595|ref|XP_004207527.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 190
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 3/124 (2%)
Query: 8 EAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPS 67
+ FRL + I+ + ++ T R+ ++S++ KVL L + GS +IG T S
Sbjct: 34 QKFRLPKKNIRDIVNTLHNDMSPITSRSNSISSEAKVLISLRLLSSGSFQGVIGDTSHVS 93
Query: 68 KSSICRSIHEVVPVIVDTFFQRMVKWPE--ENLNDIPMTFLRKGGFPNVAGCVDGTMVLI 125
+ S R +H+ + ++ +VKW E E + +I + + G + G +DGTM+ I
Sbjct: 94 QPSCSRILHQFCKYFIH-HYRYLVKWYESPEEMMEIKRKYFQSYGVIGLLGVIDGTMIQI 152
Query: 126 DAPN 129
Sbjct: 153 KGAT 156
>gi|443731694|gb|ELU16728.1| hypothetical protein CAPTEDRAFT_145380 [Capitella teleta]
Length = 499
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/254 (21%), Positives = 105/254 (41%), Gaps = 36/254 (14%)
Query: 15 TQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHGPSKSSICRS 74
T +E ++E + P L + LS ++++ L + + + +I G S+I +
Sbjct: 223 TILEAVIEFLEPLLPDLGK----LSKFQQIIVTLIKLRLNVGHQLIAYQFGVHASTISKH 278
Query: 75 IHEVVPVIVDTFFQRMVKWPE-ENL-NDIPMTFLRKGGFPNVAGCVDGTMVLIDAPNEHE 132
+H VV ++ +++WPE E L N +PM F + FP A +D V ++ ++
Sbjct: 279 LHNVVSLMYTVLVPEVIRWPECEQLRNTLPMCF--RATFPKCAVIIDCFEVQVEKASDQT 336
Query: 133 ------DQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFDEEG 186
Q+ +RN L + P ++ W G D + NS R
Sbjct: 337 ANAQTYSQYKSRN---TLKYLIGISPLGAVTFISKGWGGRTSDKYITENSGFLRHL---- 389
Query: 187 WRPFPDAVILGDSAYPLKEWL--------IPPRTRNPDAAADQRFLRCHKR---TRRLIE 235
P V++ D + ++E + IP TR + D+R + ++ R +E
Sbjct: 390 ---VPGDVVMADRGFFIEESVAMQGAKLEIPAFTRG-KSQLDKREVESTRKLASVRIHVE 445
Query: 236 NSFGILKEKFPCLN 249
G +++K+ L
Sbjct: 446 RVIGTIRQKYQVLQ 459
>gi|255561699|ref|XP_002521859.1| conserved hypothetical protein [Ricinus communis]
gi|223538897|gb|EEF40495.1| conserved hypothetical protein [Ricinus communis]
Length = 459
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/282 (19%), Positives = 110/282 (39%), Gaps = 24/282 (8%)
Query: 6 FKEAFRLSATQVEFILEEIAPALTTPTERNVALSAKEKVLSCLHWMGVGSPYHIIGATHG 65
++ + LS ++E++ P +T N++L + V L + G + + +
Sbjct: 133 WRSLYGLSYPVFTTVVEKLKPHITA---SNLSLPSDYAVAMVLSRLAFGYSAKALASRYS 189
Query: 66 PSKSSICRSIHEVVPVIVDTFFQRMVKWP--EENLNDIPMTFLRKGGFPNVAGCVDGTMV 123
+ + + V ++ + +K P L + F PN+ G +DG+ +
Sbjct: 190 LEPYLVSKITNMVTRLLATKLYPEFIKIPVSRRRLIETTQAFEELTSLPNICGAIDGSPI 249
Query: 124 LIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRNSMVYRKFD 183
I N + + + G ++ + + F+ V G DA R+S++Y +
Sbjct: 250 KIRRENG-ANLYQCKYGFPSVLLQVVADHKKVFWDVCVKASGGTDDATHFRDSLLYNRLV 308
Query: 184 EEG--W-----------RPFPDAVILGDSAYPLKEWLIPPRTRNPDAA-ADQRFLRCHKR 229
W RP+ ++GD YPL +L+ P + N A F +
Sbjct: 309 SADLVWEKVINVRGHHVRPY----VVGDWCYPLLSFLMTPFSPNGSGTPAQNLFDGMLMK 364
Query: 230 TRRLIENSFGILKEKFPCLNYMRLSPQFAGMVVLASTTLHNI 271
R + + G+LK ++ L + + A ++A LHN+
Sbjct: 365 GRSAVVEAIGLLKGRWKILQDLDVGLNHAPQTIVACCVLHNL 406
>gi|301105293|ref|XP_002901730.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099068|gb|EEY57120.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 256
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 20/146 (13%)
Query: 105 FLRKGGFPNVAGCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWP 164
F KGG P V G DG++V I+ P++ E + R AL V A+ +F + + P
Sbjct: 93 FEAKGGIPGVVGAFDGSLVEINRPDDFEGYYC-RKMHPALIVQAVANDKMQFMSADIR-P 150
Query: 165 GSVHDARVMRNSMVYRKFDEEGWRPFPDAV-ILGDSAYPLKEWLIPPRTRNPDAAADQRF 223
GS D + R S + + D P LGD+ L TR R+
Sbjct: 151 GSWSDQKTWRASALSSRIDSY----LPTGTHFLGDAGECL--------TRK-----QLRY 193
Query: 224 LRCHKRTRRLIENSFGILKEKFPCLN 249
H R+R +E SFG+ K +F +N
Sbjct: 194 SYLHSRSRMAVECSFGLWKGRFRIVN 219
>gi|356524053|ref|XP_003530647.1| PREDICTED: uncharacterized protein LOC100811542 [Glycine max]
Length = 280
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 37/62 (59%)
Query: 116 GCVDGTMVLIDAPNEHEDQFVNRNGDHALNVMAICGPNYKFYAVNANWPGSVHDARVMRN 175
G +DGT + + ++ NR GD + NV+A CGP+ +F V W GS D+RV+R+
Sbjct: 151 GALDGTHIPVTVSPYERPRYRNRKGDVSTNVLAACGPDLRFIYVLPGWEGSAGDSRVLRD 210
Query: 176 SM 177
++
Sbjct: 211 AL 212
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,706,741,909
Number of Sequences: 23463169
Number of extensions: 253474906
Number of successful extensions: 1487432
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1177
Number of HSP's successfully gapped in prelim test: 1039
Number of HSP's that attempted gapping in prelim test: 1464601
Number of HSP's gapped (non-prelim): 17906
length of query: 332
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 189
effective length of database: 9,003,962,200
effective search space: 1701748855800
effective search space used: 1701748855800
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)